BLASTX nr result

ID: Stemona21_contig00001321 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Stemona21_contig00001321
         (3393 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...   419   e-178
ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein A...   319   e-121
gb|ABN09154.1| RNA-directed DNA polymerase (Reverse transcriptas...   411   e-111
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   317   e-108
gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptas...   397   e-107
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   311   e-105
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   311   e-101
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   321   7e-91
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   313   2e-90
gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptas...   331   2e-87
ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258...   303   3e-86
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   324   2e-85
ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268...   307   5e-84
ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein A...   295   8e-84
gb|AAD24831.1| putative non-LTR retroelement reverse transcripta...   317   2e-83
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta...   317   2e-83
gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam...   309   5e-81
emb|CAB75484.1| putative protein [Arabidopsis thaliana]               309   6e-81
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   307   2e-80
emb|CAB40051.1| putative protein [Arabidopsis thaliana] gi|72677...   305   9e-80

>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 872

 Score =  419 bits (1076), Expect(3) = e-178
 Identities = 221/489 (45%), Positives = 302/489 (61%), Gaps = 1/489 (0%)
 Frame = -2

Query: 2876 IKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDFIATAS 2697
            +  A +IEQFRPI L N +FKII KILA RL+SI SR++SP Q  F+ GR+I D I   S
Sbjct: 5    VDHADSIEQFRPITLTNLVFKIILKILALRLSSIASRIVSPQQHAFVVGRNISDCILVTS 64

Query: 2696 DCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFNSARIS 2517
            +CFN LD KC+GGN+A+K DI KAFDT+SW FLL VL+ FGF+ +F+  + V+  SAR+S
Sbjct: 65   ECFNLLDSKCYGGNVAIKTDITKAFDTLSWDFLLHVLQAFGFHESFVQ-VRVLLLSARLS 123

Query: 2516 ILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPRGGRAP 2337
            +LING   GYF C +GVRQGDPLSPLLFC AE+ LSR +   V    ++ + SPRG  +P
Sbjct: 124  LLINGRTYGYFSCGQGVRQGDPLSPLLFCLAEEVLSRGISMLVSSGQVKRIHSPRGTLSP 183

Query: 2336 THLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXXXXSLL 2157
            +++L+A DV++FC+G ++N+L +   F  YG +SGQ++N DKS VF G         S+ 
Sbjct: 184  SYVLFAGDVIVFCRGNRQNLLRVMSFFYEYGSVSGQIINKDKSQVFIG--KHNRRRHSIS 241

Query: 2156 DTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLSLINSV 1977
            D  G+  G+    YLG P+F G P+    Q I D              SMAGRL LI SV
Sbjct: 242  DCLGIPLGTAPFMYLGAPIFHGKPRVAHFQAIVDKVRLKLSSWVGSFLSMAGRLQLIKSV 301

Query: 1976 ITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHRCYSK-QEGGLGLK 1800
            I S F+++F +Y WP SLL+++    RNF W+G ID+R    V+W  C +   EGGLGLK
Sbjct: 302  IYSMFVYTFQVYEWPVSLLRKVERWCRNFLWSGDIDKRGIPLVSWTSCCAPIDEGGLGLK 361

Query: 1799 DLATMNRALLRKLTWKFMTADNFAYSFLRARYLKSFSDPRRKYLTSSIWPALSEHYLALL 1620
             L  +N +LL K  W+  T+      F+R R+ K     RR Y  SSIWP + + +  + 
Sbjct: 362  KLDVLNSSLLLKRCWEIFTSSFEGCCFIRNRFSK-----RRSYAPSSIWPGVRKFWGLVQ 416

Query: 1619 SETQWLIGKHSKVRFWHDNWLGSPLTELLQIPEHISAKLTAKVSNFYCNGQWLLTELFQK 1440
            + T+WL+G   K+ FW DN+LG PL E       ++   ++ VS++  NG W+L  L Q 
Sbjct: 417  NNTRWLVGTGDKISFWRDNFLGRPLIEFFGNHGALNDN-SSLVSDYIDNGSWVLPPLLQL 475

Query: 1439 EFPEVCSLI 1413
                VC+LI
Sbjct: 476  NLSAVCNLI 484



 Score =  132 bits (333), Expect(3) = e-178
 Identities = 68/160 (42%), Positives = 93/160 (58%), Gaps = 1/160 (0%)
 Frame = -2

Query: 731  GMDKVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFAW 552
            G  K+N+DGA     G+ G G +FR   G   G FA  +    +  A++M VI AI  AW
Sbjct: 714  GWIKINSDGAWKHEEGIGGFGAVFRYYKGQFVGAFASHIDIPSSIAAKVMVVITAIELAW 773

Query: 551  KHGWRQLWLESDSTHMVTILTTRSPK-VPWRWRAKWLKCLHFISHMDFRVSHIYREGNRV 375
               W+ +WLE D + ++  +  RSP  VPW+ R +WL CL+ IS M F+ SHI+REGNRV
Sbjct: 774  VRDWKHVWLEVDFSTVLDYI--RSPSLVPWQLRVRWLNCLYRISTMTFKSSHIFREGNRV 831

Query: 374  ADSLSSRAPSLCAPTWWWNAPIFCSALVQEDLTGRPNFRF 255
            AD+L++   S+    WW   P F  +  + DL G PNFRF
Sbjct: 832  ADALANHGTSMSEEVWWDVPPSFILSYYERDLLGMPNFRF 871



 Score =  126 bits (317), Expect(3) = e-178
 Identities = 69/220 (31%), Positives = 108/220 (49%)
 Frame = -3

Query: 1378 LVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWRMLQNRLPTKDGL 1199
            L+W  S+ GE+  K A+ F +       WGK +             W++++  + +   L
Sbjct: 499  LIWQASSTGELTAKQAFLFLQQASPVVPWGKPLWSKFILPRMSLHAWKVMRGTVISYHLL 558

Query: 1198 QSAGIQLASCCHLCFQAAESPTHIFLQCTYARSLWTAISSTFKHPIQLNGSIAELWKAAM 1019
            Q  G+ L S C  C  + ES  HIFL C++A S+W      F+  +  N +IAE++   +
Sbjct: 559  QRRGVALVSRCEFCGNSTESLDHIFLHCSFAASVWNHFIYIFEIGLVPN-TIAEVFSLGL 617

Query: 1018 EITCSTQISALWRSAIVSTFWAIWYARNQVIFENSWITFAESISFVWRAIKETGMIDSGT 839
             +  S Q+  LW     S  W IW+ARNQ+ F++   + A     V R I+ +  + +G 
Sbjct: 618  AMDRSPQLKELWLICFTSILWYIWHARNQIRFDSRTFSVAGVCRLVSRHIQASSRLATGH 677

Query: 838  MRNSTQDLCILSQFCIAGRPAKAPKIIPVTWFTPLPGWIK 719
            M N+  DLCIL  F    R  + P+++ V W  P  GWIK
Sbjct: 678  MHNTIHDLCILKSFGACCRSRRIPRMVEVIWHPPSIGWIK 717


>ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 751

 Score =  319 bits (817), Expect(3) = e-121
 Identities = 173/442 (39%), Positives = 257/442 (58%), Gaps = 1/442 (0%)
 Frame = -2

Query: 2699 SDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFNSARI 2520
            S+ FN LD+K   GN+ +KVDI KAFDT++W FL+EVL  FGF S F + + ++ NSA +
Sbjct: 3    SEGFNLLDRKIVDGNVGIKVDIAKAFDTLNWQFLIEVLHRFGFGSRFTDLMLILLNSAHL 62

Query: 2519 SILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPRGGRA 2340
            SILING+P G+F C++GVRQGDPLSP+LFC AE+ LSR L        ++++S PR G +
Sbjct: 63   SILINGSPHGFFSCTKGVRQGDPLSPILFCIAEEALSRGLTALFSSKKVRSISLPR-GCS 121

Query: 2339 PTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXXXXSL 2160
             TH+LYADD+ IFC+G  +++  +     +YG  SGQLVN DKS  + G          +
Sbjct: 122  LTHVLYADDLFIFCRGDTKSLRQLQSFLDNYGAASGQLVNKDKSTFYLG-ASHFHRRHQV 180

Query: 2159 LDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLSLINS 1980
                G + G++  +YLGVP+FKG P ++ LQ + D              SMAGR+ L++ 
Sbjct: 181  KKILGFKLGTSPFSYLGVPIFKGKPCRKHLQALVDKAKARLAGWKGKLLSMAGRVQLVHD 240

Query: 1979 VITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYSKQEGGLGL 1803
            V  S  +HSF IY W  SLL  L+A  RNF W+G +  RK +T++W + C  + E GL L
Sbjct: 241  VFQSMLLHSFSIYLWATSLLSHLSACARNFIWSGDLAIRKLVTISWQQVCTPRNEAGLDL 300

Query: 1802 KDLATMNRALLRKLTWKFMTADNFAYSFLRARYLKSFSDPRRKYLTSSIWPALSEHYLAL 1623
            ++L  +  A L  L W+ +   +   SF   R+   F   + +Y TSS+W  L      L
Sbjct: 301  RNLKALYTAGLISLAWQTLLQSSSWGSFACRRF-TIFRHMKFQYFTSSVWHGLKRVLPLL 359

Query: 1622 LSETQWLIGKHSKVRFWHDNWLGSPLTELLQIPEHISAKLTAKVSNFYCNGQWLLTELFQ 1443
               ++W+IG  + + FW D WL S + + L +   +S  L ++V++F  + QW L   F 
Sbjct: 360  FEHSRWIIGDGNSILFWSDKWLHSSIIQQLNMGS-LSHLLNSRVADFIWDQQWALPSHFS 418

Query: 1442 KEFPEVCSLIEDTAVCPDSPDT 1377
              FP+    I +  + P++P++
Sbjct: 419  NLFPDCAKQILEIPL-PNTPES 439



 Score = 84.0 bits (206), Expect(3) = e-121
 Identities = 54/221 (24%), Positives = 90/221 (40%), Gaps = 1/221 (0%)
 Frame = -3

Query: 1378 LVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWRMLQNRLPTKDGL 1199
            L+W  S+ G       Y+  R       W   +             WR+   +LPT D L
Sbjct: 442  LIWEHSSSGIFSFSDGYELVRPYFEKLDWASSVWHSFIPPRYSVLAWRIFHLKLPTDDQL 501

Query: 1198 QSAGIQLASCCHLC-FQAAESPTHIFLQCTYARSLWTAISSTFKHPIQLNGSIAELWKAA 1022
            Q  GI   S C LC F   E   H+F+ C++A+ +W  ++  F   +  +GS+ +LW + 
Sbjct: 502  QRRGIPFVSVCQLCSFSHTEDIPHLFVNCSFAQHIWQWLAYYFGTSLPSSGSLNDLWSSV 561

Query: 1021 MEITCSTQISALWRSAIVSTFWAIWYARNQVIFENSWITFAESISFVWRAIKETGMIDSG 842
                 S Q+  +W ++ +    AIW + N++ F+N   +       V   ++       G
Sbjct: 562  TGKAFSPQLKNIWFASCLFALMAIWKSHNKLRFDNKQPSLMRVFRSVKAWVRYIAPYTPG 621

Query: 841  TMRNSTQDLCILSQFCIAGRPAKAPKIIPVTWFTPLPGWIK 719
             +R       + S   I     ++   I V W  PL  W+K
Sbjct: 622  CVRGVLDSKVLSSMGVILVLKCQSALRI-VLWHPPLIPWLK 661



 Score = 84.0 bits (206), Expect(3) = e-121
 Identities = 40/89 (44%), Positives = 57/89 (64%)
 Frame = -2

Query: 722 KVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFAWKHG 543
           K+NT+G + G+PGLAGCGG+FR + G + G +   LG+   F  ELM VI  + FA+  G
Sbjct: 661 KLNTNGFSKGNPGLAGCGGVFRDSFGRLIGGYCQGLGTQTTFFVELMTVILGVEFAFHFG 720

Query: 542 WRQLWLESDSTHMVTILTTRSPKVPWRWR 456
           W  +WLESDST ++  +++ S   PW  R
Sbjct: 721 WHHIWLESDSTTILQCISSSSFAPPWSQR 749


>gb|ABN09154.1| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
            truncatula]
          Length = 528

 Score =  411 bits (1056), Expect = e-111
 Identities = 216/442 (48%), Positives = 277/442 (62%), Gaps = 1/442 (0%)
 Frame = -2

Query: 3068 EEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLMV 2889
            +EI+ AVFSL+  SAPGPDGF   FY   W+I+ +DVIKAV  FF T +I P  NAN ++
Sbjct: 119  DEIKQAVFSLNNDSAPGPDGFGSCFYQIYWDIVKEDVIKAVLQFFNTGWILPNFNANTLI 178

Query: 2888 LIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDFI 2709
            LIPK + A +++QFRPI + NF FKII+KILADRLA I   ++S  Q GFI+GR+I+D +
Sbjct: 179  LIPKTQNADSMDQFRPIAMANFKFKIISKILADRLAQIMPNIVSQEQRGFIQGRNIKDCV 238

Query: 2708 ATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFNS 2529
              AS+  N LD+K FGGN+A KVDI KAFDT++W FLL+VL+ FGF+ TF NWI  I  S
Sbjct: 239  CLASEAINMLDQKSFGGNLAFKVDISKAFDTLNWKFLLKVLKQFGFSETFCNWIDAILQS 298

Query: 2528 ARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPRG 2349
            A++SI ING+ +GYF CSRGVRQGDPLSPLLFC AED LSR L + V +  ++ M   R 
Sbjct: 299  AKLSICINGSQQGYFSCSRGVRQGDPLSPLLFCLAEDVLSRSLTKLVEQGKLKQMRGTRN 358

Query: 2348 GRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXXX 2169
               P+H+LYADD++IFC G       I+ A                              
Sbjct: 359  CLVPSHILYADDIMIFCNG------GISDA----------------------------RL 384

Query: 2168 XSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLSL 1989
              L++  G  +GS   NYLGVP+FKG PK R+LQPI D              S+AGR+ L
Sbjct: 385  QQLINVIGFNKGSFPFNYLGVPIFKGKPKARFLQPIVDKIKTKLSNWKASILSIAGRVQL 444

Query: 1988 INSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYSKQEGG 1812
            I SV  S  IH+  IY WP  LLKEL    RNF W+G I +RK +TVAW + C  + +GG
Sbjct: 445  IKSVAQSMLIHTITIYDWPSFLLKELETCFRNFIWSGDITKRKLVTVAWKKLCKPQSQGG 504

Query: 1811 LGLKDLATMNRALLRKLTWKFM 1746
            LG++ L+ +N A   KL W  +
Sbjct: 505  LGIRSLSQLNAAGNLKLCWDML 526


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  317 bits (811), Expect(3) = e-108
 Identities = 192/563 (34%), Positives = 292/563 (51%), Gaps = 8/563 (1%)
 Frame = -2

Query: 3080 ERYLEEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNA 2901
            E  L+E++DAVF ++  SA GPDGFS  FY  CW II +D++ AV+ FF  + IP G+ +
Sbjct: 1311 EPSLQEVKDAVFGINSESAAGPDGFSSYFYQQCWNIIAQDLLDAVRDFFHGANIPRGVTS 1370

Query: 2900 NLMVLIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHI 2721
              ++L+PK   A     FRPI L   + KIITK+L++RLA +   +I+ NQ GF+ GR I
Sbjct: 1371 TTLILLPKKSSASKWSDFRPISLCTVMNKIITKLLSNRLAKVLPSIITENQSGFVGGRLI 1430

Query: 2720 EDFIATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISV 2541
             D I  A +    L+ K  GGN+ALK+D+ KA+D + W FL +VL+ FGFN  +I  I  
Sbjct: 1431 SDNILLAQELIGKLNTKSRGGNLALKLDMMKAYDKLDWSFLFKVLQHFGFNGQWIKMIQK 1490

Query: 2540 IFNSARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMS 2361
              ++   S+L+NG  EGYF+  RG+RQGD +SP LF  A ++LSR L+   L +   ++ 
Sbjct: 1491 CISNCWFSLLLNGRTEGYFKSERGLRQGDSISPQLFIIAAEYLSRGLN--ALYDQYPSLH 1548

Query: 2360 SPRG-GRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXX 2184
               G   + +HL +ADDVLIF  G+K  +  I      Y ++SGQ +N  KS        
Sbjct: 1549 YSSGVSISVSHLAFADDVLIFTNGSKSALQRILAFLQEYQEISGQRINVQKSCFVTHTNV 1608

Query: 2183 XXXXXXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMA 2004
                   +  T+G       I YLG PL+KG  K      +                S  
Sbjct: 1609 SSSRRQIIAQTTGFSHQLLLITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPG 1668

Query: 2003 GRLSLINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYS 1827
            GR++L+ SV+ S  I+   + + P  +L+ +N    +F W G+   +K    +W +    
Sbjct: 1669 GRITLLRSVLASLPIYLLQVLKPPICVLERVNRIFNSFLWGGSAASKKIHWASWAKISLP 1728

Query: 1826 KQEGGLGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARYLKSF--SDPRRKYLTSSIW 1653
             +EGGL +++LA +  A   KL W+F T D+    F+R +Y +       + K   S  W
Sbjct: 1729 IKEGGLDIRNLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCRGQLPMHTQPKLHDSQTW 1788

Query: 1652 PALSEHYLALLSETQWLIGKHSKVRFWHDNWLGSPLTELLQIPEHISAKLTAKVSNFYCN 1473
              +  +        +W +G+  K+ FWHD W+G   T L    + +S  +  +V +F+ N
Sbjct: 1789 KRMVANSAITEQNMRWRVGQ-GKLFFWHDCWMGE--TPLTSSNQELSLSM-VQVCDFFMN 1844

Query: 1472 GQW----LLTELFQKEFPEVCSL 1416
              W    L T L Q+   E+  +
Sbjct: 1845 NSWDIEKLKTVLQQEVVDEIAKI 1867



 Score = 70.1 bits (170), Expect(3) = e-108
 Identities = 43/126 (34%), Positives = 69/126 (54%)
 Frame = -2

Query: 731  GMDKVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFAW 552
            G  K+N DG+A  S   AG GG+ R   G++   F+  LG   + +AEL+A+   +    
Sbjct: 2092 GEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMVFGFSENLGIQNSLQAELLALYRGLILCR 2150

Query: 551  KHGWRQLWLESDSTHMVTILTTRSPKVPWRWRAKWLKCLHFISHMDFRVSHIYREGNRVA 372
             +  R+LW+E D+  ++ +L     + P   R   +     +SH  FR+SHI+REGN+ A
Sbjct: 2151 DYNIRRLWIEMDAASVIRLLQGNQ-RGPHAIRYLLVSIRQLLSHFSFRLSHIFREGNQAA 2209

Query: 371  DSLSSR 354
            D L++R
Sbjct: 2210 DFLANR 2215



 Score = 56.2 bits (134), Expect(3) = e-108
 Identities = 61/240 (25%), Positives = 91/240 (37%), Gaps = 9/240 (3%)
 Frame = -3

Query: 1411 KIPLYALIHQTLVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWRM 1232
            KIP+ A+      W P+ +GE   K+A+   R R         I             WR+
Sbjct: 1866 KIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKTVPLTISFFLWRL 1925

Query: 1231 LQNRLPTKDGLQSAGIQLASCCHLCFQAAESPTHIFLQCTYARSLWTAISSTFK----HP 1064
            L + +P +  ++S G QLAS C  C ++ ES  H+      A  +W   S  F+    +P
Sbjct: 1926 LHDWIPVELKMKSKGFQLASRCRCC-KSEESIMHVMWDNPVATQVWNYFSKFFQILVINP 1984

Query: 1063 IQLNGSIAELWKAAMEITCSTQISALWRSAIVSTFWAIWYARNQVIFENSWITFAESISF 884
              +N  I   W  + +      I  L     + T W +W  RN     N  + +   I  
Sbjct: 1985 CTIN-QILGAWFYSGDYCKPGHIRTL---VPIFTLWFLWVERNDAKHRNLGM-YPNRI-- 2037

Query: 883  VWRAIKETGMIDSGTMRNSTQ---DLCILSQFCIA--GRPAKAPKIIPVTWFTPLPGWIK 719
            VWR +K    +  G      Q   D  I  ++ I         PK+ P  W  P  G  K
Sbjct: 2038 VWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESLPPPKVFP--WHKPSIGEFK 2095


>gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
            truncatula]
          Length = 642

 Score =  397 bits (1020), Expect = e-107
 Identities = 198/445 (44%), Positives = 273/445 (61%), Gaps = 1/445 (0%)
 Frame = -2

Query: 3068 EEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLMV 2889
            EE+++AVF L+   APGPD F   F+   W I+ KDV +AV  FF   ++P   NAN ++
Sbjct: 180  EEVKNAVFDLNSDDAPGPDVFGACFFQIYWNIVKKDVYEAVLDFFKNGWLPNNFNANSII 239

Query: 2888 LIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDFI 2709
            LIPK   A +++Q+R I L NF FKII K+LADRLA I   +IS  Q GF++GR+I D I
Sbjct: 240  LIPKTPNADSVDQYRTIALVNFKFKIINKVLADRLAKILPSIISKEQRGFVQGRNIRDCI 299

Query: 2708 ATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFNS 2529
            A  S+  N LD K FGGN+ALK+D+ KAFDT++W FLL VL+ FGFN  F NWI  I +S
Sbjct: 300  ALTSEAINVLDNKSFGGNLALKIDVTKAFDTLNWDFLLLVLKTFGFNELFCNWIKTILHS 359

Query: 2528 ARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPRG 2349
            +++ I +NG   G+F C+RGVRQGDPLSPLLFC  E+ LSR +     +  I  +++ R 
Sbjct: 360  SKMFISMNGAQHGFFNCNRGVRQGDPLSPLLFCIVEEVLSRSISILADKGLIDLIAASRN 419

Query: 2348 GRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXXX 2169
               P H  Y DD+++FCK    +++ +   F+ Y   SGQ++N  KSF+F G        
Sbjct: 420  NCLPFHCFYVDDLMVFCKAKMSSLIVLKSLFTRYADCSGQIMNIRKSFIFAG-GITDTRM 478

Query: 2168 XSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLSL 1989
             ++++  G   GS    YLG P+FKG PK    QPIAD              S+AGR+ L
Sbjct: 479  NNIVNILGFNVGSLPFTYLGAPIFKGKPKGIHFQPIADKVKAKLAKWKASLLSIAGRIQL 538

Query: 1988 INSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYSKQEGG 1812
            + SV+ S  +H+  IY WP  +LKE+   I+NF W+G + +RK +TVAW + C   +EGG
Sbjct: 539  VKSVVQSMLVHTMSIYSWPIKILKEMEKWIKNFIWSGDVTKRKMVTVAWRKICADYEEGG 598

Query: 1811 LGLKDLATMNRALLRKLTWKFMTAD 1737
            LG+K L  +N A   K+ W  M +D
Sbjct: 599  LGVKSLICLNEATNLKICWNLMQSD 623


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  311 bits (797), Expect(3) = e-105
 Identities = 186/548 (33%), Positives = 283/548 (51%), Gaps = 4/548 (0%)
 Frame = -2

Query: 3080 ERYLEEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNA 2901
            E  L+E++DAVF++D+ S  GPDGFS  FY  CW II +D++ AV+ FF  +  P G+ +
Sbjct: 397  EPQLQEVKDAVFAIDKDSVVGPDGFSSFFYQQCWPIIAEDLLAAVRDFFKGAVFPRGVTS 456

Query: 2900 NLMVLIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHI 2721
              +VL+ K  +A T   FRPI L   L KI+TK+LA+RL+ +   +IS NQ GF+ GR I
Sbjct: 457  TTLVLLAKKPDAATWSDFRPISLCTILNKIVTKLLANRLSKVLPSLISENQSGFVSGRLI 516

Query: 2720 EDFIATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISV 2541
             D I  A +    +D K  GGN+ LK+D+ KA+D ++W FL+ VL  FGFN  +I+ I  
Sbjct: 517  NDNILLAQELIGKIDYKARGGNVVLKLDMMKAYDRLNWDFLILVLERFGFNDMWIDMIRR 576

Query: 2540 IFNSARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMS 2361
               +   S+LING   GYF+  RG+RQGD +SP+LF  A ++LSR ++ ++    I    
Sbjct: 577  CITNCWFSVLINGHSAGYFKSERGLRQGDSISPMLFILAAEYLSRGIN-ELFSRYISLHY 635

Query: 2360 SPRGGRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXX 2181
                    +HL +ADD++IF  G+K  +  I +    Y Q+SGQ VN  KS         
Sbjct: 636  HSGCSLNISHLAFADDIMIFTNGSKSVLEKILEFLQEYEQISGQRVNHQKSCFVTANNMP 695

Query: 2180 XXXXXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAG 2001
                  +  T G    +  I YLG PLFKG  K      + +              S  G
Sbjct: 696  SSRRQIISQTIGFLHKTLPITYLGAPLFKGPKKVMLFDSLINKIRERITGWENKILSPGG 755

Query: 2000 RLSLINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYSK 1824
            R++L+ SV++S  I+   + + P  +++++     +F W  ++D  +    AWH   +  
Sbjct: 756  RITLLRSVLSSMPIYLLQVLKPPACVIQKIERLFNSFLWGSSMDSTRIHWTAWHNITFPS 815

Query: 1823 QEGGLGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARYL--KSFSDPRRKYLTSSIWP 1650
             EGGLG++ L     A   KL W+F T  +    ++R +Y   +   +   K   S+ W 
Sbjct: 816  SEGGLGIRSLKDSFDAFSAKLWWRFDTCQSLWVRYMRLKYCTGQIHHNIAPKPHDSATWK 875

Query: 1649 ALSEHYLALLSETQWLIGKHSKVRFWHDNWLG-SPLTELLQIPEHISAKLTAKVSNFYCN 1473
             L         + +W IGK   + FWHD W+G  PL      P    + +  KV+ F+ +
Sbjct: 876  PLLAGRATASQQIRWRIGK-GDIFFWHDAWMGDEPLVN--SFPSFSQSMM--KVNYFFND 930

Query: 1472 GQWLLTEL 1449
              W + +L
Sbjct: 931  DAWDVDKL 938



 Score = 69.3 bits (168), Expect(3) = e-105
 Identities = 65/240 (27%), Positives = 99/240 (41%), Gaps = 8/240 (3%)
 Frame = -3

Query: 1414 LKIPLYALIHQTLVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWR 1235
            LKIP+         W  + +G+   K+A++  R R      G+ I             WR
Sbjct: 951  LKIPISREKEDIAYWALTANGDFSIKSAWELLRQRKQVNLVGQLIWHKSIPLTVSFFLWR 1010

Query: 1234 MLQNRLPTKDGLQSAGIQLASCCHLCFQAAESPTHIFLQCTYARSLWTAISSTFK---HP 1064
             L N LP +  +++ GIQLAS C LC ++ ES  H+  +   A+ +W   S  F+   H 
Sbjct: 1011 TLHNWLPVEVRMKAKGIQLASKC-LCCKSEESLLHVLWESPVAQQVWNYFSKFFQIYVHN 1069

Query: 1063 IQLNGSIAELWKAAMEITCSTQISALWRSAIVSTFWAIWYARNQVIFENSWITFAESISF 884
             Q    I   W  + + T    I  L    ++  FW +W  RN     +  + + + I  
Sbjct: 1070 PQNILQILNSWYYSGDFTKPGHIRTL---ILLFIFWFVWVERNDAKHRDLGM-YPDRI-- 1123

Query: 883  VWRAIKETGMIDSGTMRNSTQ-----DLCILSQFCIAGRPAKAPKIIPVTWFTPLPGWIK 719
            +WR +K    +  G +    Q     D+ I   F  A      PKII   W  PL G +K
Sbjct: 1124 IWRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQARPKII--NWIKPLIGELK 1181



 Score = 52.0 bits (123), Expect(3) = e-105
 Identities = 37/124 (29%), Positives = 64/124 (51%), Gaps = 3/124 (2%)
 Frame = -2

Query: 722  KVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFAWKHG 543
            K+N DG++      A  GG+ R   G +   F+   G   + +AEL+A+   +    ++ 
Sbjct: 1181 KLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLALHRGLCLCMEYN 1240

Query: 542  WRQLWLESDSTHMVTILTTR---SPKVPWRWRAKWLKCLHFISHMDFRVSHIYREGNRVA 372
              ++W+E D+  ++ ++      S K+ +   +   KCL  IS    R+SHI+REGN+ A
Sbjct: 1241 VSRVWIEVDAQVVIQMIQNHHKGSYKIQYLLES-IRKCLQVIS---VRISHIHREGNQAA 1296

Query: 371  DSLS 360
            D LS
Sbjct: 1297 DFLS 1300


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  311 bits (796), Expect(3) = e-101
 Identities = 188/548 (34%), Positives = 291/548 (53%), Gaps = 7/548 (1%)
 Frame = -2

Query: 3071 LEEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLM 2892
            L+EI++ VF++D+ S  GPDGFS  FY HCW+II +D+++AV  FF  + +P G+ +  +
Sbjct: 1019 LKEIKEVVFNIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVLDFFNGTPMPQGVTSTTL 1078

Query: 2891 VLIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDF 2712
            VL+PK   +     FRPI L   L KI+TK LA+RL+ I   +IS NQ GF+ GR I D 
Sbjct: 1079 VLLPKKPNSCQWSDFRPISLCTVLNKIVTKTLANRLSKILPSIISENQSGFVNGRLISDN 1138

Query: 2711 IATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFN 2532
            I  A +    LD K  GGN+ LK+D+ KA+D ++W FL  +++ FGFN  +I+ I    +
Sbjct: 1139 ILLAQELVGKLDAKARGGNVVLKLDMAKAYDRLNWDFLYLMMKQFGFNDRWISMIKACIS 1198

Query: 2531 SARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPR 2352
            +   S+LING+  GYF+  RG+RQGD +SPLLF  A D+LSR +++  L N  +++    
Sbjct: 1199 NCWFSLLINGSLVGYFKSERGLRQGDSISPLLFVLAADYLSRGINQ--LFNRHKSLLYLS 1256

Query: 2351 GGRAP-THLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXX 2175
            G   P +HL +ADD++IF  G +  +  I      Y ++SGQ VN  KS           
Sbjct: 1257 GCFMPISHLAFADDIVIFTNGCRPALQKILVFLQEYEEVSGQQVNHQKSCFITANGCPMT 1316

Query: 2174 XXXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRL 1995
                +  T+G Q  +  + YLG PL KG  K      +                S  GR+
Sbjct: 1317 RRQIIAHTTGFQHKTLPVIYLGAPLHKGPKKVTLFDSLITKIRDRISGWENKTLSPGGRI 1376

Query: 1994 SLINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYSKQE 1818
            +L+ SV++S  ++   + + P  +++++     +F W  + ++++    AWH+  +   E
Sbjct: 1377 TLLRSVLSSLPLYLLQVLKPPVVVIEKIERLFNSFLWGDSTNDKRIHWAAWHKLTFPCSE 1436

Query: 1817 GGLGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARY----LKSFSDPRRKYLTSSIWP 1650
            GGL ++ L  M  A   KL W+F T +     FL+ +Y    +  +  P  K   S +W 
Sbjct: 1437 GGLDIRRLTDMFDAFSLKLWWRFSTCEGLWTKFLKTKYCMGQIPHYVHP--KLHDSQVWK 1494

Query: 1649 ALSEHYLALLSETQWLIGKHSKVRFWHDNWLG-SPLTELLQIPEHISAKLTAKVSNFYCN 1473
             +       +  T+W IGK S + FWHD W+G  PL  +   P   +   T  V NF+  
Sbjct: 1495 RMVRGREVAIQNTRWRIGKGS-LFFWHDCWMGDQPL--VTSFPHFRNDMST--VHNFFNG 1549

Query: 1472 GQWLLTEL 1449
              W + +L
Sbjct: 1550 HNWDVDKL 1557



 Score = 60.5 bits (145), Expect(3) = e-101
 Identities = 37/127 (29%), Positives = 66/127 (51%)
 Frame = -2

Query: 734  PGMDKVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFA 555
            PG  K+N DG++  +   A  GG+ R   G +   F+  +G   + +AEL A++  +   
Sbjct: 1796 PGEHKLNVDGSSRQNQ-TAAIGGVLRDHTGTLVFDFSENIGPSNSLQAELRALLRGLLLC 1854

Query: 554  WKHGWRQLWLESDSTHMVTILTTRSPKVPWRWRAKWLKCLHFISHMDFRVSHIYREGNRV 375
             +    +LW+E D+   + ++  +S K     R        +++   FR+SHI+REGN+ 
Sbjct: 1855 KERNIEKLWVEMDALVAIQMIQ-QSQKGSHDIRYLLASIRKYLNFFSFRISHIFREGNQA 1913

Query: 374  ADSLSSR 354
            AD LS++
Sbjct: 1914 ADFLSNK 1920



 Score = 48.9 bits (115), Expect(3) = e-101
 Identities = 50/237 (21%), Positives = 92/237 (38%), Gaps = 8/237 (3%)
 Frame = -3

Query: 1414 LKIPLYALIHQTLVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWR 1235
            L+IP+         W  +++GE   ++A++  R R +       +             WR
Sbjct: 1570 LQIPIDRSQDDVAYWSLTSNGEFSTRSAWEAIRLRKSPNVLCSLLWHKSIPLSISFFLWR 1629

Query: 1234 MLQNRLPTKDGLQSAGIQLASCCHLCFQAAESPTHIFLQCTYARSLWTAISSTFKHPIQL 1055
            +  N +P    L+  G  LAS C +C  + ES  H+      A+ +W   +++F+  I  
Sbjct: 1630 VFHNWIPVDIRLKEKGFHLASKC-ICCNSEESLIHVLWDNPIAKQVWNFFANSFQIYISK 1688

Query: 1054 NGSIAEL---WKAAMEITCSTQISALWRSAIVSTFWAIWYARNQVIFENSWITFAESISF 884
              +++++   W  + +      I  L    I    W +W  RN     +  +    S   
Sbjct: 1689 PQNVSQILWTWYLSGDYVRKGHIRILIPLFIC---WFLWLERNDAKHRHLGM---YSDRV 1742

Query: 883  VWRAIKETGMIDSGTMRNSTQ-----DLCILSQFCIAGRPAKAPKIIPVTWFTPLPG 728
            VW+ +K    +  G +  S Q     D   +       +   AP+I+   W  P+PG
Sbjct: 1743 VWKIMKLLRQLQDGYLLKSWQWKGDKDFATMWGLFSPPKTRAAPQIL--HWVKPVPG 1797


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  321 bits (823), Expect(2) = 7e-91
 Identities = 196/563 (34%), Positives = 289/563 (51%), Gaps = 8/563 (1%)
 Frame = -2

Query: 3080 ERYLEEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNA 2901
            E  L+E++DAVF +D  SA GPDGFS  FY  CW II  D++ AV+ FF  + IP G+ +
Sbjct: 1483 EPNLQEVKDAVFGIDPESAAGPDGFSSYFYQQCWNIIAHDLLDAVRDFFHGANIPRGVTS 1542

Query: 2900 NLMVLIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHI 2721
              ++L+PK   A     FRPI L   + KIITK+L++RLA I   +I+ NQ GF+ GR I
Sbjct: 1543 TTLILLPKKPSASKWSDFRPISLCTVMNKIITKLLSNRLAKILPSIITENQSGFVGGRLI 1602

Query: 2720 EDFIATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISV 2541
             D I  A +    L+ K  GGN+ALK+D+ KA+D + W FL++VL+ FGFN  +I  I  
Sbjct: 1603 SDNILLAQELIGKLNTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHFGFNDQWIGMIQK 1662

Query: 2540 IFNSARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMS 2361
              ++   S+L+NG  EGYF+  RG+RQGDP+SP LF  A ++LSR L+   L     ++ 
Sbjct: 1663 CISNCWFSLLLNGRTEGYFKFERGLRQGDPISPQLFLIAAEYLSRGLN--ALYEQYPSLH 1720

Query: 2360 SPRGGRAP-THLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXX 2184
               G   P +HL +ADDVLIF  G+K  +  I      Y ++S Q +N  KS        
Sbjct: 1721 YSTGVSIPVSHLAFADDVLIFTNGSKSALQRILAFLQEYEEISRQRINAQKSCFVTHTNV 1780

Query: 2183 XXXXXXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMA 2004
                   +  T+G       I YLG PL+KG  K      +                S  
Sbjct: 1781 SSSRRQIIAQTTGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPG 1840

Query: 2003 GRLSLINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYS 1827
            GR++L+ SV+TS  I+ F + + P  +L+ +N    +F W G+   +K    +W +    
Sbjct: 1841 GRITLLKSVLTSLPIYLFQVLKPPVCVLERINRIFNSFLWGGSAASKKIHWTSWAKISLP 1900

Query: 1826 KQEGGLGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARYLKSF--SDPRRKYLTSSIW 1653
             +EGGL ++ LA +  A   KL W+F T D+    F+R +Y +       + K   S  W
Sbjct: 1901 VKEGGLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQLPMHTQPKLHDSQTW 1960

Query: 1652 PALSEHYLALLSETQWLIGKHSKVRFWHDNWLGSPLTELLQIPEHISAKLTAKVSNFYCN 1473
              +           +W +G+   + FWHD W+G   T L+      S  +  +V +F+ N
Sbjct: 1961 KRMVASSAITEQNMRWRVGQ-GNLFFWHDCWMGE--TPLISSNHEFSLSM-VQVCDFFMN 2016

Query: 1472 GQW----LLTELFQKEFPEVCSL 1416
              W    L T L Q+   E+  +
Sbjct: 2017 NSWDIEKLKTVLQQEVVDEIAKI 2039



 Score = 43.1 bits (100), Expect(2) = 7e-91
 Identities = 28/95 (29%), Positives = 43/95 (45%)
 Frame = -3

Query: 1411 KIPLYALIHQTLVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWRM 1232
            KIP+ A+      W P+ +GE   K+A+   R R         I             WR+
Sbjct: 2038 KIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKAIPLTTSFFLWRL 2097

Query: 1231 LQNRLPTKDGLQSAGIQLASCCHLCFQAAESPTHI 1127
            L + +P +  ++S G QLAS C  C ++ ES  H+
Sbjct: 2098 LHDWIPVELRMKSKGFQLASRCRCC-RSEESIIHV 2131



 Score = 67.8 bits (164), Expect = 3e-08
 Identities = 43/126 (34%), Positives = 69/126 (54%)
 Frame = -2

Query: 731  GMDKVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFAW 552
            G  K+N DG+A  S   AG GG+ R   G++   F+  LG   + +AEL+A+   +    
Sbjct: 2210 GEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMIFGFSENLGIQNSLKAELLALYRGLILCR 2268

Query: 551  KHGWRQLWLESDSTHMVTILTTRSPKVPWRWRAKWLKCLHFISHMDFRVSHIYREGNRVA 372
             +  R+LW+E D+T ++ +L     + P   R         +SH  FR++HI+REGN+ A
Sbjct: 2269 DYNIRRLWIEMDATSVIRLLQGNH-RGPHAIRYLLGSIRQLLSHFSFRLTHIFREGNQAA 2327

Query: 371  DSLSSR 354
            D L++R
Sbjct: 2328 DFLANR 2333


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  313 bits (803), Expect(2) = 2e-90
 Identities = 193/567 (34%), Positives = 289/567 (50%), Gaps = 8/567 (1%)
 Frame = -2

Query: 3080 ERYLEEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNA 2901
            E  L+E++DAVF +D  SA GPDGFS  FY  CW  I  D++ AV+ FF  + IP G+ +
Sbjct: 1313 EPNLQEVKDAVFDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDAVRDFFHGANIPRGVTS 1372

Query: 2900 NLMVLIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHI 2721
              +VL+PK   A    +FRPI L   + KIITK+L++RLA I   +I+ NQ GF+ GR I
Sbjct: 1373 TTLVLLPKKSSASKWSEFRPISLCTVMNKIITKLLSNRLAKILPSIITENQSGFVGGRLI 1432

Query: 2720 EDFIATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISV 2541
             D I  A +    LD K  GGN+ALK+D+ KA+D + W FL++VL+ FGFN  +I  I  
Sbjct: 1433 SDNILLAQELIRKLDTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHFGFNEQWIGMIQK 1492

Query: 2540 IFNSARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMS 2361
              ++   S+L+NG  EGYF+  RG+RQGD +SP LF  A ++LSR L+   L +   ++ 
Sbjct: 1493 CISNCWFSLLLNGRIEGYFKSERGLRQGDSISPQLFILAAEYLSRGLN--ALYDQYPSLH 1550

Query: 2360 SPRG-GRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXX 2184
               G   + +HL +ADDVLIF  G+K  +  I      Y ++SGQ +N  KS        
Sbjct: 1551 YSSGVPLSVSHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQRINAQKSCFVTHTNI 1610

Query: 2183 XXXXXXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMA 2004
                   +   +G       I YLG PL+KG  K      +                S  
Sbjct: 1611 PNSRRQIIAQATGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPG 1670

Query: 2003 GRLSLINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYS 1827
            GR++L+ SV+ S  I+   + + P  +L+ +N    +F W G+   ++    +W +    
Sbjct: 1671 GRITLLRSVLASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSAASKRIHWASWAKIALP 1730

Query: 1826 KQEGGLGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARYLKSF--SDPRRKYLTSSIW 1653
              EGGL ++ LA +  A   KL W+F T D+    F+R +Y +       + K   S  W
Sbjct: 1731 VTEGGLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQLPMQTQPKLHDSQTW 1790

Query: 1652 PALSEHYLALLSETQWLIGKHSKVRFWHDNWLGSPLTELLQIPEHISAKLTAKVSNFYCN 1473
              +           +W +G+   V FWHD W+G     L+   +  ++ +  +V +F+ N
Sbjct: 1791 KRMLTSSTITEQHMRWRVGQ-GNVFFWHDCWMGE--APLISSNQEFTSSM-VQVCDFFTN 1846

Query: 1472 GQW----LLTELFQKEFPEVCSLIEDT 1404
              W    L T L Q+   E+  +  DT
Sbjct: 1847 NSWNIEKLKTVLQQEVVDEIAKIPIDT 1873



 Score = 49.3 bits (116), Expect(2) = 2e-90
 Identities = 45/186 (24%), Positives = 75/186 (40%), Gaps = 4/186 (2%)
 Frame = -3

Query: 1411 KIPLYALIHQTLVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWRM 1232
            KIP+  +      W P+ +G+   K+A+   R R         I             WR+
Sbjct: 1868 KIPIDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLWRL 1927

Query: 1231 LQNRLPTKDGLQSAGIQLASCCHLCFQAAESPTHIFLQCTYARSLWTAISSTFK----HP 1064
            L + +P +  ++S G+QLAS C  C ++ ES  H+      A  +W   +  F+    +P
Sbjct: 1928 LHDWIPVELKMKSKGLQLASRCRCC-KSEESIMHVMWDNPVAMQVWNYFAKLFQILIINP 1986

Query: 1063 IQLNGSIAELWKAAMEITCSTQISALWRSAIVSTFWAIWYARNQVIFENSWITFAESISF 884
              +N  I   W  + +      I  L    I+   W +W  RN     N  + +   +  
Sbjct: 1987 CTIN-QIIGAWFYSGDYCKPGHIRTLVPLFIL---WFLWVERNDAKHRNLGM-YPNRV-- 2039

Query: 883  VWRAIK 866
            VWR +K
Sbjct: 2040 VWRVLK 2045



 Score = 69.7 bits (169), Expect = 8e-09
 Identities = 44/126 (34%), Positives = 68/126 (53%)
 Frame = -2

Query: 731  GMDKVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFAW 552
            G  K+N DG+A  S   AG GGI R   G +   F+  LG+  + +AEL+A+   +    
Sbjct: 2094 GEFKLNVDGSAKQSHNAAG-GGILRDHAGEMVFGFSENLGTQNSLQAELLALYRGLILCR 2152

Query: 551  KHGWRQLWLESDSTHMVTILTTRSPKVPWRWRAKWLKCLHFISHMDFRVSHIYREGNRVA 372
             +  R+LW+E D+  ++ +L     + P   R   +     +SH  FR SHI+REGN+ A
Sbjct: 2153 DYNIRRLWIEMDAISVIRLLQGNH-RGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAA 2211

Query: 371  DSLSSR 354
            D L++R
Sbjct: 2212 DFLANR 2217


>gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H;
            Endonuclease/exonuclease/phosphatase [Medicago
            truncatula]
          Length = 1246

 Score =  331 bits (848), Expect = 2e-87
 Identities = 171/328 (52%), Positives = 217/328 (66%)
 Frame = -2

Query: 3065 EIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLMVL 2886
            E+++AVF+L+   APGP+GF G FY   W+I+G DVI++VQ FF +  +   +N+NL+VL
Sbjct: 446  EVKNAVFTLNGDGAPGPNGFGGHFYQTYWDIVGADVIQSVQDFFISGQLAQNINSNLIVL 505

Query: 2885 IPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDFIA 2706
            IPK+  A  +  +RPI L NF FKII+KILADRLA I  R+IS  Q GFIR R I   + 
Sbjct: 506  IPKVPGARVMGDYRPIALANFQFKIISKILADRLADITMRIISVEQRGFIRDRDISKCVI 565

Query: 2705 TASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFNSA 2526
             AS+  N L+K+ +GGN+ALKVDI KAFDT+ W FLL VL+ FGF+  F++WI VI  SA
Sbjct: 566  LASEAINLLEKRQYGGNVALKVDIAKAFDTLDWNFLLAVLQRFGFDEKFVHWILVILQSA 625

Query: 2525 RISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPRGG 2346
            R+S+L+NG   G+F CS GVRQGDPLSPLLFC  E+ LSR L        +  MS  RG 
Sbjct: 626  RLSVLVNGKAVGFFTCSHGVRQGDPLSPLLFCLVEEVLSRALSMAATDGQLIPMSYCRGV 685

Query: 2345 RAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXXXX 2166
              PTH+LYADDVLIFC GTKRN+  + K FS Y ++SGQL+N  KS  FF          
Sbjct: 686  SFPTHILYADDVLIFCTGTKRNIRRLIKIFSQYSEVSGQLINNAKS-RFFTSAMTGSRVQ 744

Query: 2165 SLLDTSGMQRGSTCINYLGVPLFKGAPK 2082
             +    G   GS    YLG P+F+G PK
Sbjct: 745  MISSLLGFNVGSLPFTYLGCPIFRGKPK 772



 Score =  100 bits (250), Expect(2) = 2e-21
 Identities = 59/143 (41%), Positives = 77/143 (53%), Gaps = 2/143 (1%)
 Frame = -2

Query: 734  PGMDKVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFA 555
            P + KVNTDG+  G  GLA CGG+FR ++G   G F+  +G    F AE +A I A+  A
Sbjct: 1020 PPLLKVNTDGSVVG--GLAACGGLFRDSSGSFLGAFSCNIGLASVFHAETLAFILALEHA 1077

Query: 554  WKHGWRQLWLESDSTHMVTILTTRSPKVPWRWRAKWLKCLHFISHMDFRV--SHIYREGN 381
              HGWR LWLESDST  + I  + S  V W  R +W    H    +  +V  SHI  EGN
Sbjct: 1078 AHHGWRNLWLESDSTSALMIF-SNSSLVQWLLRNRW----HNAQRLGIQVISSHILHEGN 1132

Query: 380  RVADSLSSRAPSLCAPTWWWNAP 312
            R AD+L++    +    W    P
Sbjct: 1133 RCADNLANMGHGIQGSIWLETLP 1155



 Score = 31.6 bits (70), Expect(2) = 2e-21
 Identities = 21/79 (26%), Positives = 37/79 (46%)
 Frame = -3

Query: 955  AIWYARNQVIFENSWITFAESISFVWRAIKETGMIDSGTMRNSTQDLCILSQFCIAGRPA 776
            A+ + RN   F++   +   + + +   I  +G + +G   +S  D  IL +F ++ R  
Sbjct: 948  AVSFLRNAFRFQSQLQSIQSAKARIHSLIAMSGNVSTGKCLHS--DSAILEEFSVSPRHR 1005

Query: 775  KAPKIIPVTWFTPLPGWIK 719
            K   II V W  P P  +K
Sbjct: 1006 KYKDIILVLWKNPSPPLLK 1024



 Score = 88.2 bits (217), Expect(2) = 7e-16
 Identities = 45/134 (33%), Positives = 63/134 (47%), Gaps = 1/134 (0%)
 Frame = -2

Query: 1862 KAITVAWH-RCYSKQEGGLGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARYLKSFSD 1686
            K  TV+W   C    EGGL +K    +N A + KL W  +++ N  ++ L  R   S   
Sbjct: 772  KVCTVSWKILCRPWSEGGLDIKSTRLINNAAMLKLAWNLLSS-NSQWAVLLKRRFFSQGQ 830

Query: 1685 PRRKYLTSSIWPALSEHYLALLSETQWLIGKHSKVRFWHDNWLGSPLTELLQIPEHISAK 1506
            P R ++ SS+W  +  H   L     W++G   ++  W +NWLG PL  L  I     A 
Sbjct: 831  PIRYFVKSSVWHGVKNHMSILRQNKLWIVGTGDRINLWTNNWLGEPLVTLFNIDPFFHAS 890

Query: 1505 LTAKVSNFYCNGQW 1464
             T KVS    NG W
Sbjct: 891  FTGKVSEVIVNGNW 904



 Score = 25.4 bits (54), Expect(2) = 7e-16
 Identities = 14/35 (40%), Positives = 19/35 (54%)
 Frame = -3

Query: 1420 ASLKIPLYALIHQTLVWCPSTDGEVKCKTAYDFFR 1316
            AS+ +P   L   +LVW  S DG++  K A  F R
Sbjct: 920  ASITLPRTEL-PDSLVWTHSADGQLTSKHAVSFLR 953


>ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum
            lycopersicum]
          Length = 1454

 Score =  303 bits (776), Expect(2) = 3e-86
 Identities = 181/560 (32%), Positives = 288/560 (51%), Gaps = 3/560 (0%)
 Frame = -2

Query: 3068 EEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLMV 2889
            +E+R  + S++ +SAPGPDGF G+FY  C++II KD++ AV +F+  + +P  +    ++
Sbjct: 505  DELRRIIMSMNPNSAPGPDGFGGKFYQTCFDIIKKDLLAAVNYFYIGNSMPKYMTHACLI 564

Query: 2888 LIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDFI 2709
            L+PK++    +++FRPI L NF  KII+KI++ RLASI   V+S NQ GF++GR I + I
Sbjct: 565  LLPKVEHPCKLKEFRPISLSNFSNKIISKIMSTRLASILPCVVSENQSGFVKGRSISENI 624

Query: 2708 ATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFNS 2529
              A +  + + K   G N+ +K+ + KA+D +SW +   VLR  GF+  FI+ I  I ++
Sbjct: 625  LLAHEIIHGIKKPRDGSNVVIKLGMVKAYDRVSWTYTCIVLRRMGFSEIFIDRIWRIMSN 684

Query: 2528 ARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPRG 2349
               SI+ING   G+F   RG++QGDPLSP LF    +  SR L         +       
Sbjct: 685  NWYSIVINGKRHGFFHSKRGLKQGDPLSPALFVLGAEVFSRQLSLLYQNQLYKGFHMESN 744

Query: 2348 GRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXXX 2169
            G    HL +ADD++IF      ++  I K    Y ++S Q VN DKSF            
Sbjct: 745  GPKINHLSFADDIIIFSSTDNNSLNLIMKTIDQYEEVSDQKVNKDKSFFMVTSNTSHDII 804

Query: 2168 XSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLSL 1989
              +   +G  R ++ INYLG PL+ G  +  +   I +              +  G+++L
Sbjct: 805  EEISRITGFSRKNSPINYLGCPLYVGGQRIIYYSEIVEKVIKKIAGWHLKILNFGGKVTL 864

Query: 1988 INSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYSKQEGG 1812
            +  V+ S  IH+      PK++L  +   I +FFW    D +K    +W+   +   EGG
Sbjct: 865  VKHVLQSMPIHTLSAISPPKTILNSIKKVIADFFWGIEKDGKKYHWSSWNNMAFPTNEGG 924

Query: 1811 LGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARYLKSFSDPRRKYLT--SSIWPALSE 1638
            +G++ +  M  A   K  W F T ++    FL+A+Y +  +   +KY T  S +W  L+ 
Sbjct: 925  IGVRLIEDMCTAFQYKQWWAFRTNNSLWSKFLKAKYNQRANPVAKKYNTGDSIVWRYLTR 984

Query: 1637 HYLALLSETQWLIGKHSKVRFWHDNWLGSPLTELLQIPEHISAKLTAKVSNFYCNGQWLL 1458
            +   + S  +W I +     FW D WL  PL       +H+S+   + V++F  NG W  
Sbjct: 985  NRQKVESLIKWHI-QSGTCSFWWDCWLDKPLAMQC---DHVSSLNNSVVADFLINGNWNE 1040

Query: 1457 TELFQKEFPEVCSLIEDTAV 1398
              L Q   P++   I  T +
Sbjct: 1041 RLLRQHVPPQLVPYILQTKI 1060



 Score = 45.8 bits (107), Expect(2) = 3e-86
 Identities = 26/101 (25%), Positives = 42/101 (41%)
 Frame = -3

Query: 1381 TLVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWRMLQNRLPTKDG 1202
            T +W P+  G+    +A+D  R + N       I             WR L+ +LPT + 
Sbjct: 1069 TSIWTPTESGQFTISSAWDSIRKKRNKDPINNIIWHKQIPFKVSFFIWRALRGKLPTNEN 1128

Query: 1201 LQSAGIQLASCCHLCFQAAESPTHIFLQCTYARSLWTAISS 1079
            LQ  G  L+ C     +  +   HI +   +A+ +W   SS
Sbjct: 1129 LQRIGKNLSDCYCCYNKGKDDINHILINGNFAKYIWKIYSS 1169



 Score = 74.7 bits (182), Expect = 3e-10
 Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 1/131 (0%)
 Frame = -2

Query: 731  GMDKVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFAW 552
            G  K+NTDG+A  + G  G GGI R   G +   F++P G      AE+ A +H + +  
Sbjct: 1283 GKYKLNTDGSALQNSGKIGGGGILRDNQGKIIYAFSLPFGFGTNNFAEIKAALHGLDWCE 1342

Query: 551  KHGWRQLWLESDSTHMVTILTTRSPKVPWRWRAKWLKCLHFISHMD-FRVSHIYREGNRV 375
            +HG++++ LE DS  +   + + +  +PWR+     +    I  MD F+  HIYRE N  
Sbjct: 1343 QHGYKKIELEVDSKLLCNWINS-NINIPWRYEELIQQIHQIIRKMDQFQCHHIYREANCT 1401

Query: 374  ADSLSSRAPSL 342
            AD LS  + +L
Sbjct: 1402 ADLLSKWSHNL 1412


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  324 bits (830), Expect = 2e-85
 Identities = 199/568 (35%), Positives = 297/568 (52%), Gaps = 8/568 (1%)
 Frame = -2

Query: 3080 ERYLEEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNA 2901
            E  L+E+++AVF +D  SA GPDGFS  FY  CW+II  D+ +AV+ FF  + IP G+ +
Sbjct: 1276 EPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEFFHGADIPQGMTS 1335

Query: 2900 NLMVLIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHI 2721
              +VLIPK   A    +FRPI L   + KIITKILA+RLA I   +I+ NQ GF+ GR I
Sbjct: 1336 TTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKILPSIITENQSGFVGGRLI 1395

Query: 2720 EDFIATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISV 2541
             D I  A +    LD+K  GGN+ALK+D+ KA+D + W FL +VL+  GFN+ +I  I  
Sbjct: 1396 SDNILLAQELIGKLDQKNRGGNVALKLDMMKAYDRLDWSFLFKVLQHLGFNAQWIGMIQK 1455

Query: 2540 IFNSARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMS 2361
              ++   S+L+NG   GYF+  RG+RQGD +SP LF  A ++L+R L+   L +   ++ 
Sbjct: 1456 CISNCWFSLLLNGRTVGYFKSERGLRQGDSISPQLFILAAEYLARGLN--ALYDQYPSLH 1513

Query: 2360 SPRG-GRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXX 2184
               G   + +HL +ADDV+IF  G+K  +  I      Y +LSGQ +N  KS V      
Sbjct: 1514 YSSGCSLSVSHLAFADDVIIFANGSKSALQKIMAFLQEYEKLSGQRINPQKSCVVTHTNM 1573

Query: 2183 XXXXXXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMA 2004
                   +L  +G       I YLG PL+KG  K      +                S  
Sbjct: 1574 ASSRRQIILQATGFSHRPLPITYLGAPLYKGHKKVMLFNDLVAKIEERITGWENKTLSPG 1633

Query: 2003 GRLSLINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYS 1827
            GR++L+ S ++S  I+   + + P  +L+ +N  + NF W G+   ++    +W +    
Sbjct: 1634 GRITLLRSTLSSLPIYLLQVLKPPVIVLERINRLLNNFLWGGSTASKRIHWASWGKIALP 1693

Query: 1826 KQEGGLGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARYL--KSFSDPRRKYLTSSIW 1653
              EGGL ++++  +  A   KL W+F T ++    F+RA+Y   +  +D + K   S  W
Sbjct: 1694 IAEGGLDIRNVEDVCEAFSMKLWWRFRTTNSLWTQFMRAKYCGGQLPTDVQPKLHDSQTW 1753

Query: 1652 PALSEHYLALLSETQWLIGKHSKVRFWHDNWLGSPLTELLQIPEHISAKLTAKVSNFYCN 1473
              +           +W IG H ++ FWHD W+G    E L       A   A+VS+F+ N
Sbjct: 1754 KRMVTISSITEQNIRWRIG-HGELFFWHDCWMGE---EPLVNRNQAFASSMAQVSDFFLN 1809

Query: 1472 GQW----LLTELFQKEFPEVCSLIEDTA 1401
              W    L T L Q+   E+  +  DT+
Sbjct: 1810 NSWNVEKLKTVLQQEVVEEIVKIPIDTS 1837



 Score = 61.6 bits (148), Expect = 2e-06
 Identities = 39/123 (31%), Positives = 63/123 (51%)
 Frame = -2

Query: 722  KVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFAWKHG 543
            K+N DG+   +P  A  GG+ R   G +   F+   G   + +AELMA+   +    +H 
Sbjct: 2060 KLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQAELMALHRGLLLCIEHN 2119

Query: 542  WRQLWLESDSTHMVTILTTRSPKVPWRWRAKWLKCLHFISHMDFRVSHIYREGNRVADSL 363
              +LW+E D+   V ++     +   R R         +S + FR+SHI+REGN+ AD L
Sbjct: 2120 ISRLWIEMDAKVAVQMI-KEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHL 2178

Query: 362  SSR 354
            S++
Sbjct: 2179 SNQ 2181


>ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268853 [Solanum
            lycopersicum]
          Length = 1333

 Score =  307 bits (786), Expect(2) = 5e-84
 Identities = 187/561 (33%), Positives = 290/561 (51%), Gaps = 15/561 (2%)
 Frame = -2

Query: 3071 LEEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLM 2892
            ++E+R  + S++ HSAPGPDGF G+FY  C++II +D++ AV+ F+  + +P  L    +
Sbjct: 382  MDELRRTIMSMNPHSAPGPDGFGGKFYQVCFDIIKEDLLAAVKHFYVGNIMPRYLTHACL 441

Query: 2891 VLIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDF 2712
             LIPKI     ++ FRPI L NF  KII+KIL+ RLA I   ++S NQ GF++GR I + 
Sbjct: 442  TLIPKIDHPCRLKDFRPISLSNFTNKIISKILSTRLALILPSIVSANQSGFVKGRSIAEN 501

Query: 2711 IATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFN 2532
            I  A + F+ + K   G N+ +K+D+ KA+D +SW +   VLR  GF+  FI+ +  I +
Sbjct: 502  ILLAQEIFHGIKKPKDGSNVVIKLDMVKAYDRVSWNYTCLVLRKMGFSEVFIDRVWRIMS 561

Query: 2531 SARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPR 2352
            +   SI+ING   G+FQ  RG++QGDPLSP LF    + LSR L+     +  +     R
Sbjct: 562  NNWYSIVINGKRHGFFQSKRGLKQGDPLSPALFVLGAEILSRQLNLLYQNHQYKGFHMER 621

Query: 2351 GGRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXX 2172
             G    HL +ADD++IF      ++  I K    Y  +S Q VN +KSF           
Sbjct: 622  KGPKINHLSFADDIIIFTSTDTNSIHIIMKTIELYEAVSDQQVNKEKSFFMVTANTGYDI 681

Query: 2171 XXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLS 1992
               +   +G  R ++ INYLG PL+ G  +  +   + +              +  G++ 
Sbjct: 682  IEEIKTATGFNRKNSPINYLGCPLYSGGQRIIYYSELVEKVIKKISGWHSKLLNFGGKII 741

Query: 1991 LINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAW-HRCYSKQEG 1815
            L+  V+ S  IH+      PK+ L  +   I +FFW    D +     +W +  Y   EG
Sbjct: 742  LVKHVLQSIPIHTLAAISPPKTTLNCIKKLIADFFWGIDKDGKTYHWSSWENMAYPTSEG 801

Query: 1814 GLGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARYLKSFSDPRRKYLT--SSIWPALS 1641
            G+G++ L  +  A   K  W F T ++    FL+A+Y +  +   +KY T  S IW  L+
Sbjct: 802  GIGVRLLEDVCTAFQYKQWWDFRTKNSLWSQFLQAKYCQRANPVAKKYDTGDSLIWRYLT 861

Query: 1640 EHYLALLSETQWLIGKHSKVRFWHDNWLGSPLTELLQIPEHISAKLTAKVSNFYCNGQW- 1464
             + L + S  +W I       FW DNWL   +  L    EHIS+   + V++F  +G+W 
Sbjct: 862  RNRLKVESFIKWNI-TSGTCSFWWDNWL--DIENLASQNEHISSLNNSVVADFLKDGKWN 918

Query: 1463 -----------LLTELFQKEF 1434
                       L+ ++ QK+F
Sbjct: 919  ESLIRQQVTPLLVPKILQKQF 939



 Score = 34.7 bits (78), Expect(2) = 5e-84
 Identities = 24/104 (23%), Positives = 44/104 (42%), Gaps = 2/104 (1%)
 Frame = -3

Query: 1381 TLVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWRMLQNRLPTKDG 1202
            T  W P+  G     +A++  R +    +    I             WR L+ +LPT + 
Sbjct: 948  TATWMPTETGIFSIASAWECIRKKRIIDNISTIIWHKHLPFKIAFFIWRALKGKLPTNEF 1007

Query: 1201 LQSAGIQLA--SCCHLCFQAAESPTHIFLQCTYARSLWTAISST 1076
            LQ  G  ++  SCC+   +  +   HI +   +A+ +W   ++T
Sbjct: 1008 LQRIGSDISDYSCCYR--KGKDDINHILINGNFAKYIWKIHAAT 1049



 Score = 66.6 bits (161), Expect = 7e-08
 Identities = 41/125 (32%), Positives = 64/125 (51%), Gaps = 1/125 (0%)
 Frame = -2

Query: 731  GMDKVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFAW 552
            G  K+NTDG+A  + G  G GG  R   G +   F+IP G      AE+ A ++ + +  
Sbjct: 1162 GTYKLNTDGSAIQNSGKIGGGGNLRDFQGKIVYAFSIPFGVGTNNFAEIKAALYGMEWCE 1221

Query: 551  KHGWRQLWLESDSTHMVTILTTRSPKVPWRWRAKWLKCLHFISHMD-FRVSHIYREGNRV 375
            +HG++++ LE +S  +   +   + K+PWR+     +       M+ F   HIYRE N  
Sbjct: 1222 QHGYKKVELEVNSELLYNWI-KNTTKIPWRYEDLVQQIQQISMKMEQFHCHHIYREANNT 1280

Query: 374  ADSLS 360
            AD LS
Sbjct: 1281 ADLLS 1285


>ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            lycopersicum]
          Length = 955

 Score =  295 bits (754), Expect(2) = 8e-84
 Identities = 185/573 (32%), Positives = 291/573 (50%), Gaps = 15/573 (2%)
 Frame = -2

Query: 3071 LEEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLM 2892
            ++E+R+ V +++ HSAPGPDG  G+FY  C++I   D++ AVQ FF    +P  +    +
Sbjct: 4    IDELRNVVMNMNPHSAPGPDGIGGKFYQTCFDIRKDDLLAAVQAFFNGYDMPKHMTHACL 63

Query: 2891 VLIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDF 2712
            +L+PK+     +++FRPI L NF  KII+KI++ RLA I   +IS NQ GF++GR I + 
Sbjct: 64   ILLPKVDNPNKMKEFRPISLSNFTNKIISKIMSTRLAPILPLLISENQSGFVKGRSISEN 123

Query: 2711 IATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFN 2532
            +  A +  + +     G N+ LK+D+ KA+D +SW +   V+R  GF   FI+ +  I N
Sbjct: 124  VMLAQEIIHGIKLPKEGKNVVLKLDMVKAYDRVSWSYTCLVVRKMGFGELFIDRVWRIMN 183

Query: 2531 SARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPR 2352
            +   S++ING   G+F  +RG++QGDPLSP LF    +  SR L+      N        
Sbjct: 184  NNWYSVVINGRRHGFFHSTRGLKQGDPLSPALFILGAELFSRQLNLLYHNQNYIGFQMDS 243

Query: 2351 GGRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXX 2172
             G    HL +A+D++IF    ++++  I K    Y  +S Q VN DKSF           
Sbjct: 244  NGPQINHLSFANDIIIFTSTDRQSLQLIVKTIEEYELISDQQVNKDKSFFMVTTKTNQAI 303

Query: 2171 XXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLS 1992
              S+   +G    ++ I YLG PL+ G  +  +   I +              +  G+++
Sbjct: 304  INSIKIETGFGIQNSPITYLGCPLYVGGQRIIYFSGIVEKIIRKISGWHAKILNFGGKIT 363

Query: 1991 LINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYSKQEG 1815
            L+  V+ S  IH       PK+ LK +   I +FFW    D +K    +W    Y   EG
Sbjct: 364  LVKHVLQSIPIHLLAAVSPPKTTLKYIKNVIADFFWGMDKDGKKYHWASWETLAYPTNEG 423

Query: 1814 GLGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARYLKSFSDPRRKYLT--SSIWPALS 1641
            G+G+++L  +  A   K  W+F T ++    FL+A+Y K  +   +KY T  S +W   +
Sbjct: 424  GIGVRNLEDVCIAFQYKQWWEFRTKNSLWSKFLKAKYCKRANPVAKKYDTGNSLVWRYFT 483

Query: 1640 EHYLALLSETQWLIGKHSKVRFWHDNWLGSPLTELLQIPEHISAKLTAKVSNFYCNGQW- 1464
             +  A+ S  +W I   S   FW DNWLG+    L     +IS+     VS+F  NG W 
Sbjct: 484  RNRQAVESYIKWNIHSGSS-SFWWDNWLGN--EALANQVINISSLNNIHVSDFLTNGIWN 540

Query: 1463 -----------LLTELFQKEFPEVCSLIEDTAV 1398
                       ++ ++ Q +F    + IEDTA+
Sbjct: 541  ERYVRQHVPPTMVPDIMQTQFKYNIN-IEDTAI 572



 Score = 46.2 bits (108), Expect(2) = 8e-84
 Identities = 34/149 (22%), Positives = 58/149 (38%), Gaps = 1/149 (0%)
 Frame = -3

Query: 1390 IHQTLVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWRMLQNRLPT 1211
            I  T +W P  +G+    +A++  R + +T      +             WR L+ +LPT
Sbjct: 567  IEDTAIWTPEENGKFTIASAWEVIRKKKSTDIINNSVWHKHIPFKISFFIWRALRGKLPT 626

Query: 1210 KDGLQSAGIQLASCCHLCFQAAESPTHIFLQCTYARSLWTAISSTFKHPIQLNGSIAELW 1031
             D LQ  G     C     +  +   HI +   +A  +W   + TF    Q+N  +  L 
Sbjct: 627  YDYLQKFGSNATDCYCCNRKGIDDINHILITGNFANYIWKYYAPTF-GITQINIDLRSLL 685

Query: 1030 KAAMEITCSTQISALWRSAIVSTF-WAIW 947
                 +  S Q+  L  S + +   W +W
Sbjct: 686  LQWTNLPSSNQVYKLLISILPNFICWHLW 714



 Score = 65.5 bits (158), Expect = 2e-07
 Identities = 42/125 (33%), Positives = 63/125 (50%), Gaps = 1/125 (0%)
 Frame = -2

Query: 731  GMDKVNTDGAAFGSPGLAGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFAW 552
            G+ K+NTDG+A    G  G GGI R   G +   F+IP G      AE+ A  + + +  
Sbjct: 784  GIYKLNTDGSALPESGKIGGGGILRDYTGKLHYAFSIPFGLGTNNIAEMEAARYGLDWCE 843

Query: 551  KHGWRQLWLESDSTHMVTILTTRSPKVPWRWRAKWLKCLHFISHMD-FRVSHIYREGNRV 375
            +HG++ + LE DS  ++    + +  +PWR++            MD F   H+YRE N  
Sbjct: 844  QHGYKSILLEVDS-EILQKWISNTIAIPWRYQQTIEHIQDIGRKMDHFECQHVYREVNGT 902

Query: 374  ADSLS 360
            AD LS
Sbjct: 903  ADLLS 907


>gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1524

 Score =  317 bits (813), Expect = 2e-83
 Identities = 185/511 (36%), Positives = 268/511 (52%), Gaps = 6/511 (1%)
 Frame = -2

Query: 3065 EIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLMVL 2886
            EI DA+  +    APGPDG + RFY +CW+I+G DVI  V+ FF TSF+ P +N   + +
Sbjct: 588  EIYDAICQIGDDKAPGPDGLTARFYKNCWDIVGYDVILEVKKFFETSFMKPSINHTNICM 647

Query: 2885 IPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDFIA 2706
            IPKI    T+  +RPI L N L+K+I+K L +RL S  + ++S +Q  FI GR I D + 
Sbjct: 648  IPKITNPTTLSDYRPIALCNVLYKVISKCLVNRLKSHLNSIVSDSQAAFIPGRIINDNVM 707

Query: 2705 TASDCFNCLD--KKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFN 2532
             A +  + L   K+     MA+K D+ KA+D + W FL   +R FGF + +I WI     
Sbjct: 708  IAHEVMHSLKVRKRVSKTYMAVKTDVSKAYDRVEWDFLETTMRLFGFCNKWIGWIMAAVK 767

Query: 2531 SARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPR 2352
            S   S+LING+P GY   +RG+RQGDPLSP LF    D LS  ++ +    +++ +    
Sbjct: 768  SVHYSVLINGSPHGYITPTRGIRQGDPLSPYLFILCGDILSHLINGRASSGDLRGVRIGN 827

Query: 2351 GGRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXX 2172
            G  A THL +ADD L FC+   RN  A+   F  Y   SGQ +N  KS + FG       
Sbjct: 828  GAPAITHLQFADDSLFFCQANVRNCQALKDVFDVYEYYSGQKINVQKSMITFGSRVYGST 887

Query: 2171 XXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLS 1992
               L     +        YLG+P   G  KK   + I D              S AG+  
Sbjct: 888  QSKLKQILEIPNQGGGGKYLGLPEQFGRKKKEMFEYIIDRVKKRTSTWSARFLSPAGKEI 947

Query: 1991 LINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHRC-YSKQEG 1815
            ++ SV  +  +++   ++ PK ++ E+ + + NF+W  A ++R    VAW R  YSK+EG
Sbjct: 948  MLKSVALAMPVYAMSCFKLPKGIVSEIESLLMNFWWEKASNQRGIPWVAWKRLQYSKKEG 1007

Query: 1814 GLGLKDLATMNRALLRKLTWKFMTADNFAYS-FLRARYLKSFS--DPRRKYLTSSIWPAL 1644
            GLG +DLA  N ALL K  W+ +   N  ++  ++ARY K  S  D + +   S  W +L
Sbjct: 1008 GLGFRDLAKFNDALLAKQAWRLIQYPNSLFARVMKARYFKDVSILDAKVRKQQSYGWASL 1067

Query: 1643 SEHYLALLSETQWLIGKHSKVRFWHDNWLGS 1551
             +    L   T+ LIG    +R   DN + S
Sbjct: 1068 LDGIALLKKGTRHLIGDGQNIRIGLDNIVDS 1098


>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1750

 Score =  317 bits (813), Expect = 2e-83
 Identities = 185/511 (36%), Positives = 268/511 (52%), Gaps = 6/511 (1%)
 Frame = -2

Query: 3065 EIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLMVL 2886
            EI DA+  +    APGPDG + RFY +CW+I+G DVI  V+ FF TSF+ P +N   + +
Sbjct: 814  EIYDAICQIGDDKAPGPDGLTARFYKNCWDIVGYDVILEVKKFFETSFMKPSINHTNICM 873

Query: 2885 IPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDFIA 2706
            IPKI    T+  +RPI L N L+K+I+K L +RL S  + ++S +Q  FI GR I D + 
Sbjct: 874  IPKITNPTTLSDYRPIALCNVLYKVISKCLVNRLKSHLNSIVSDSQAAFIPGRIINDNVM 933

Query: 2705 TASDCFNCLD--KKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFN 2532
             A +  + L   K+     MA+K D+ KA+D + W FL   +R FGF + +I WI     
Sbjct: 934  IAHEVMHSLKVRKRVSKTYMAVKTDVSKAYDRVEWDFLETTMRLFGFCNKWIGWIMAAVK 993

Query: 2531 SARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPR 2352
            S   S+LING+P GY   +RG+RQGDPLSP LF    D LS  ++ +    +++ +    
Sbjct: 994  SVHYSVLINGSPHGYITPTRGIRQGDPLSPYLFILCGDILSHLINGRASSGDLRGVRIGN 1053

Query: 2351 GGRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXX 2172
            G  A THL +ADD L FC+   RN  A+   F  Y   SGQ +N  KS + FG       
Sbjct: 1054 GAPAITHLQFADDSLFFCQANVRNCQALKDVFDVYEYYSGQKINVQKSMITFGSRVYGST 1113

Query: 2171 XXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLS 1992
               L     +        YLG+P   G  KK   + I D              S AG+  
Sbjct: 1114 QSRLKQILEIPNQGGGGKYLGLPEQFGRKKKEMFEYIIDRVKKRTSTWSARFLSPAGKEI 1173

Query: 1991 LINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHRC-YSKQEG 1815
            ++ SV  +  +++   ++ PK ++ E+ + + NF+W  A ++R    VAW R  YSK+EG
Sbjct: 1174 MLKSVALAMPVYAMSCFKLPKGIVSEIESLLMNFWWEKASNQRGIPWVAWKRLQYSKKEG 1233

Query: 1814 GLGLKDLATMNRALLRKLTWKFMTADNFAYS-FLRARYLKSFS--DPRRKYLTSSIWPAL 1644
            GLG +DLA  N ALL K  W+ +   N  ++  ++ARY K  S  D + +   S  W +L
Sbjct: 1234 GLGFRDLAKFNDALLAKQAWRLIQYPNSLFARVMKARYFKDVSILDAKVRKQQSYGWASL 1293

Query: 1643 SEHYLALLSETQWLIGKHSKVRFWHDNWLGS 1551
             +    L   T+ LIG    +R   DN + S
Sbjct: 1294 LDGIALLKKGTRHLIGDGQNIRIGLDNIVDS 1324


>gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam: rvt.hmm, score:
            42.57) [Arabidopsis thaliana]
          Length = 1662

 Score =  309 bits (792), Expect = 5e-81
 Identities = 185/552 (33%), Positives = 281/552 (50%), Gaps = 13/552 (2%)
 Frame = -2

Query: 3065 EIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLMVL 2886
            EI +A+  +    APGPDG + RFY  CWEI+G DVIK V+ FF TS++   +N   + +
Sbjct: 814  EIYNAICHIGDDKAPGPDGLTARFYKSCWEIVGPDVIKEVKIFFRTSYMKQSINHTNICM 873

Query: 2885 IPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDFIA 2706
            IPKI    T+  +RPI L N L+KII+K L +RL      ++S +Q  FI GR + D + 
Sbjct: 874  IPKITNPETLSDYRPIALCNVLYKIISKCLVERLKGHLDAIVSDSQAAFIPGRLVNDNVM 933

Query: 2705 TASDCFNCLD--KKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFN 2532
             A +  + L   K+     MA+K D+ KA+D + W FL   +R FGF+ T+I WI     
Sbjct: 934  IAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDRVEWNFLETTMRLFGFSETWIKWIMGAVK 993

Query: 2531 SARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPR 2352
            S   S+L+NG P G  Q  RG+RQGDPLSP LF    D L+  +  +V   +I+ +    
Sbjct: 994  SVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYLFILCADILNHLIKNRVAEGDIRGIRIGN 1053

Query: 2351 GGRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXX 2172
            G    THL +ADD L FC+   RN  A+   F  Y   SGQ +N  KS + FG       
Sbjct: 1054 GVPGVTHLQFADDSLFFCQSNVRNCQALKDVFDVYEYYSGQKINMSKSMITFGSRVHGTT 1113

Query: 2171 XXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLS 1992
               L +  G+Q       YLG+P   G  K+     I +              S AG+  
Sbjct: 1114 QNRLKNILGIQSHGGGGKYLGLPEQFGRKKRDMFNYIIERVKKRTSSWSAKYLSPAGKEI 1173

Query: 1991 LINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHRC-YSKQEG 1815
            ++ SV  S  +++   ++ P +++ E+ A + NF+W     +R+   +AW R  YSK+EG
Sbjct: 1174 MLKSVAMSMPVYAMSCFKLPLNIVSEIEALLMNFWWEKNAKKREIPWIAWKRLQYSKKEG 1233

Query: 1814 GLGLKDLATMNRALLRKLTWKFMTADNFAYS-FLRARYLK--SFSDPRRKYLTSSIWPAL 1644
            GLG +DLA  N ALL K  W+ +   N  ++  ++ARY +  S  D +R+   S  W ++
Sbjct: 1234 GLGFRDLAKFNDALLAKQVWRMINNPNSLFARIMKARYFREDSILDAKRQRYQSYGWTSM 1293

Query: 1643 SEHYLALLSETQWLI--GKHSKVRFWHDNWLG---SPLTELLQIPEHIS--AKLTAKVSN 1485
                  +   +++++  GK    R+W+ + +    SP      +  H+S        V N
Sbjct: 1294 LAGLDVIKKGSRFIVGDGKTGSYRYWNAHLISQLVSPDDHRFVMNHHLSRIVHQDKLVWN 1353

Query: 1484 FYCNGQWLLTEL 1449
            +  +G + L +L
Sbjct: 1354 YSSSGDYTLWKL 1365


>emb|CAB75484.1| putative protein [Arabidopsis thaliana]
          Length = 851

 Score =  309 bits (791), Expect = 6e-81
 Identities = 184/523 (35%), Positives = 269/523 (51%), Gaps = 7/523 (1%)
 Frame = -2

Query: 3065 EIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLMVL 2886
            EI +A+  +    APGPDG + RFY  CW+I+G DVIK V+ FF +S +   +N   + +
Sbjct: 53   EIFEAICQIGDDKAPGPDGLTARFYKQCWDIVGNDVIKEVKLFFESSHMKTSVNHTNICM 112

Query: 2885 IPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDFIA 2706
            IPKI+   T+  +RPI L N L+K+I+K + +RL +  + ++S +Q  FI GR I D + 
Sbjct: 113  IPKIQNPQTLSDYRPIALCNVLYKVISKCMVNRLKAHLNSIVSDSQAAFIPGRIINDNVM 172

Query: 2705 TASDCFNCLD--KKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFN 2532
             A +  + L   K+     MA+K D+ KA+D + W FL   +R FGF   +I WI     
Sbjct: 173  IAHEIMHSLKVRKRVSKTYMAVKTDVSKAYDRVEWDFLETTMRLFGFCDKWIGWIMAAVK 232

Query: 2531 SARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPR 2352
            S   S+LING+P GY   +RG+RQGDPLSP LF    D LS  +  +    +I+ +    
Sbjct: 233  SVHYSVLINGSPHGYISPTRGIRQGDPLSPYLFILCGDILSHLIKVKASSGDIRGVRIGN 292

Query: 2351 GGRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXX 2172
            G  A THL +ADD L FC+   RN  A+   F  Y   SGQ +N  KS + FG       
Sbjct: 293  GAPAITHLQFADDSLFFCQANVRNCQALKDVFDVYEYYSGQKINVQKSLITFGSRVYGST 352

Query: 2171 XXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLS 1992
               L     +        YLG+P   G  KK     I D              S AG+  
Sbjct: 353  QTRLKTLLNIPNQGGGGKYLGLPEQFGRKKKEMFNYIIDRVKERTASWSAKFLSPAGKEI 412

Query: 1991 LINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHRC-YSKQEG 1815
            L+ SV  +  +++   ++ P+ ++ E+ + + NF+W  A ++R    VAW R  YSK+EG
Sbjct: 413  LLKSVALAMPVYAMSCFKLPQGIVSEIESLLMNFWWEKASNKRGIPWVAWKRLQYSKKEG 472

Query: 1814 GLGLKDLATMNRALLRKLTWKFMTADNFAYS-FLRARYLK--SFSDPRRKYLTSSIWPAL 1644
            GLG +DLA  N ALL K  W+ +   N  ++  ++ARY K  S  D + +   S  W +L
Sbjct: 473  GLGFRDLAKFNDALLAKQAWRIIQYPNSLFARVMKARYFKDNSIIDAKTRSQQSYGWSSL 532

Query: 1643 SEHYLALLSETQWLIGKHSKVRFWHDNWLGS-PLTELLQIPEH 1518
                  L   T+++IG    +R   DN + S P   LL   +H
Sbjct: 533  LSGIALLRKGTRYVIGDGKTIRLGIDNVVDSHPPRPLLTDEQH 575


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  307 bits (786), Expect = 2e-80
 Identities = 180/511 (35%), Positives = 276/511 (54%), Gaps = 5/511 (0%)
 Frame = -2

Query: 3071 LEEIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLM 2892
            L+EI++AVF++++ S  GPDGFS  FY HCW+II  D++ AV  FF  S +P G+ +  +
Sbjct: 1193 LQEIKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKNDLLDAVLDFFRGSPLPRGVTSTTL 1252

Query: 2891 VLIPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDF 2712
            VL+PK   A    ++RPI L   L KI+TK+LA+RL+ I   +IS NQ GF+ GR I D 
Sbjct: 1253 VLLPKKPNACHWSEYRPISLCTVLNKIVTKLLANRLSKILPSIISENQSGFVNGRLISDN 1312

Query: 2711 IATASDCFNCLDKKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFN 2532
            I  A +    +D K  GGN+ LK+D+ KA+D ++W FL  ++  FGFN+ +IN I    +
Sbjct: 1313 ILLAQELIGKIDAKSRGGNVVLKLDMAKAYDRLNWDFLYLMMEHFGFNAHWINMIKSCIS 1372

Query: 2531 SARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYL-HRQVLRNNIQAMSSP 2355
            +   S+LING+  GYF+  RG+RQGD +SP+LF  A D+LSR L H     +++Q +S  
Sbjct: 1373 NCWFSLLINGSLAGYFKSERGLRQGDSISPMLFILAADYLSRGLNHLFSCYSSLQYLS-- 1430

Query: 2354 RGGRAP-THLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXX 2178
             G + P +HL +ADD++IF  G +  +  I      Y Q+SGQ VN  KS          
Sbjct: 1431 -GCQMPISHLSFADDIVIFTNGGRSALQKILSFLQEYEQVSGQKVNHQKSCFITANGCSL 1489

Query: 2177 XXXXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGR 1998
                 +  T+G Q  +  + YLG PL KG  K      +                S  GR
Sbjct: 1490 SRRQIISHTTGFQHKTLPVTYLGAPLHKGPKKVLLFDSLISKIRDRISGWENKILSPGGR 1549

Query: 1997 LSLINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHR-CYSKQ 1821
            ++L+ SV++S  ++   + + P ++++ ++    +F W  + + +K     W +  +   
Sbjct: 1550 ITLLRSVLSSLPMYLLQVLKPPVTVIERIDRLFNSFLWGDSTECKKMHWAEWAKISFPCA 1609

Query: 1820 EGGLGLKDLATMNRALLRKLTWKFMTADNFAYSFLRARYL--KSFSDPRRKYLTSSIWPA 1647
            EGGLG++ L  +  A   KL W+F T ++    FLR +Y   +     + K   S +W  
Sbjct: 1610 EGGLGIRKLEDVCAAFTLKLWWRFQTGNSLWTQFLRTKYCLGRIPHHIQPKLHDSHVWKR 1669

Query: 1646 LSEHYLALLSETQWLIGKHSKVRFWHDNWLG 1554
            +       L   +W IGK   + FWHD W+G
Sbjct: 1670 MISGREMALQNIRWKIGK-GDLFFWHDCWMG 1699



 Score = 59.3 bits (142), Expect(2) = 7e-15
 Identities = 40/126 (31%), Positives = 65/126 (51%), Gaps = 1/126 (0%)
 Frame = -2

Query: 731  GMDKVNTDGAAFGSPGL-AGCGGIFRTANGMVKGCFAIPLGSCFAFEAELMAVIHAISFA 555
            G  K+N DG++    GL A  GG+ R   G +   F+  +G C + +AEL A++  +   
Sbjct: 1971 GEYKLNVDGSSRN--GLHAATGGVLRDHTGKLIFGFSENIGPCNSLQAELRALLRGLLLC 2028

Query: 554  WKHGWRQLWLESDSTHMVTILTTRSPKVPWRWRAKWLKCLHFISHMDFRVSHIYREGNRV 375
             +    +LW+E D+   + ++   S K P+  R         +S   +R+SHI REGN+ 
Sbjct: 2029 KERHIEKLWIEMDALVAIQLIQP-SKKGPYNLRYLLESIRMCLSSFSYRLSHILREGNQA 2087

Query: 374  ADSLSS 357
            AD LS+
Sbjct: 2088 ADYLSN 2093



 Score = 50.8 bits (120), Expect(2) = 7e-15
 Identities = 49/240 (20%), Positives = 96/240 (40%), Gaps = 8/240 (3%)
 Frame = -3

Query: 1414 LKIPLYALIHQTLVWCPSTDGEVKCKTAYDFFRARGNTTSWGKQIXXXXXXXXXXXXXWR 1235
            L++P          W  +++G+   ++A++  R R  + +    I             W+
Sbjct: 1744 LQVPFDKSREDVAYWTLTSNGDFSTRSAWEMIRQRQTSNALCSFIWHRSIPLSISFFLWK 1803

Query: 1234 MLQNRLPTKDGLQSAGIQLASCCHLCFQAAESPTHIFLQCTYARSLWTAISSTFKHPIQL 1055
             L N +P +  ++  GIQLAS C +C  + ES  H+  +   A+ +W   +  F+  I  
Sbjct: 1804 TLHNWIPVELRMKEKGIQLASKC-VCCNSEESLIHVLWENPVAKQVWNFFAQLFQIYIWN 1862

Query: 1054 NGSIAEL---WKAAMEITCSTQISALWRSAIVSTFWAIWYARNQVIFENSWITFAESISF 884
               ++++   W  + +         L    I    W +W  RN     ++ + +A+ +  
Sbjct: 1863 PRHVSQIIWAWYVSGDYVRKGHFRVLLPLFIC---WFLWLERNDAKHRHTGL-YADRV-- 1916

Query: 883  VWRAIKETGMIDSGTMRNSTQ-----DLCILSQFCIAGRPAKAPKIIPVTWFTPLPGWIK 719
            +WR +K    +  G++    Q     D+  +  F    +    P+II   W  P  G  K
Sbjct: 1917 IWRTMKHCRQLYDGSLLQQWQWKGDTDIATMLGFSFTHKQHAPPQII--YWKKPSIGEYK 1974


>emb|CAB40051.1| putative protein [Arabidopsis thaliana] gi|7267781|emb|CAB81184.1|
            putative protein [Arabidopsis thaliana]
          Length = 1294

 Score =  305 bits (781), Expect = 9e-80
 Identities = 173/496 (34%), Positives = 259/496 (52%), Gaps = 6/496 (1%)
 Frame = -2

Query: 3065 EIRDAVFSLDQHSAPGPDGFSGRFYCHCWEIIGKDVIKAVQFFFATSFIPPGLNANLMVL 2886
            EI +A+  +    APGPDG + RFY  CWEI+G DVIK V+ FF TS++   +N   + +
Sbjct: 794  EIYNAICHIGDDKAPGPDGLTARFYKSCWEIVGPDVIKEVKIFFRTSYMKQSINHTNICM 853

Query: 2885 IPKIKEAITIEQFRPIVLGNFLFKIITKILADRLASICSRVISPNQFGFIRGRHIEDFIA 2706
            IPKI    T+  +RPI L N L+KII+K L +RL      ++S +Q  FI GR + D + 
Sbjct: 854  IPKITNPETLSDYRPIALCNVLYKIISKCLVERLKGHLDAIVSDSQAAFIPGRLVNDNVM 913

Query: 2705 TASDCFNCLD--KKCFGGNMALKVDIRKAFDTISWPFLLEVLRCFGFNSTFINWISVIFN 2532
             A +  + L   K+     MA+K D+ KA+D + W FL   +R FGF+ T+I WI     
Sbjct: 914  IAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDRVEWNFLETTMRLFGFSETWIKWIMGAVK 973

Query: 2531 SARISILINGTPEGYFQCSRGVRQGDPLSPLLFCFAEDFLSRYLHRQVLRNNIQAMSSPR 2352
            S   S+L+NG P G  Q  RG+RQGDPLSP LF    D L+  +  +V   +I+ +    
Sbjct: 974  SVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYLFILCADILNHLIKNRVAEGDIRGIRIGN 1033

Query: 2351 GGRAPTHLLYADDVLIFCKGTKRNMLAITKAFSHYGQLSGQLVNWDKSFVFFGXXXXXXX 2172
            G    THL +ADD L FC+   RN  A+   F  Y   SGQ +N  KS + FG       
Sbjct: 1034 GVPGVTHLQFADDSLFFCQSNVRNCQALKDVFDVYEYYSGQKINMSKSMITFGSRVHGTT 1093

Query: 2171 XXSLLDTSGMQRGSTCINYLGVPLFKGAPKKRWLQPIADXXXXXXXXXXXXXXSMAGRLS 1992
               L +  G+Q       YLG+P   G  K+     I +              S AG+  
Sbjct: 1094 QNRLKNILGIQSHGGGGKYLGLPEQFGRKKRDMFNYIIERVKKRTSSWSAKYLSPAGKEI 1153

Query: 1991 LINSVITSSFIHSFMIYRWPKSLLKELNAAIRNFFWTGAIDERKAITVAWHRC-YSKQEG 1815
            ++ SV  S  +++   ++ P +++ E+ A + NF+W     +R+   +AW R  YSK+EG
Sbjct: 1154 MLKSVAMSMPVYAMSCFKLPLNIVSEIEALLMNFWWEKNAKKREIPWIAWKRLQYSKKEG 1213

Query: 1814 GLGLKDLATMNRALLRKLTWKFMTADNFAYS-FLRARYLK--SFSDPRRKYLTSSIWPAL 1644
            GLG +DLA  N ALL K  W+ +   N  ++  ++ARY +  S  D +R+   S  W ++
Sbjct: 1214 GLGFRDLAKFNDALLAKQVWRMINNPNSLFARIMKARYFREDSILDAKRQRYQSYGWTSM 1273

Query: 1643 SEHYLALLSETQWLIG 1596
                  +   +++++G
Sbjct: 1274 LAGLDVIKKGSRFIVG 1289

