BLASTX nr result

ID: Magnolia22_contig00021826

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Magnolia22_contig00021826
         (1823 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KZV58610.1 hypothetical protein F511_11258 [Dorcoceras hygrometr...   452   e-146
KZV36392.1 hypothetical protein F511_03833 [Dorcoceras hygrometr...   455   e-146
EOY02236.1 Uncharacterized protein TCM_011923 [Theobroma cacao]       473   e-145
EOY02238.1 Uncharacterized protein TCM_016762 [Theobroma cacao]       463   e-141
EOY17513.1 Uncharacterized protein TCM_036737 [Theobroma cacao]       454   e-138
EOY14356.1 Uncharacterized protein TCM_033752 [Theobroma cacao]       453   e-138
EOY17514.1 Uncharacterized protein TCM_042330 [Theobroma cacao]       449   e-136
EOY25454.1 Uncharacterized protein TCM_026877 [Theobroma cacao]       449   e-136
XP_011085143.1 PREDICTED: uncharacterized protein LOC105167219 [...   433   e-135
XP_019166530.1 PREDICTED: uncharacterized protein LOC109162265 [...   421   e-133
XP_016512671.1 PREDICTED: uncharacterized protein LOC107829719 [...   434   e-132
XP_019260139.1 PREDICTED: uncharacterized protein LOC109238160 [...   424   e-129
XP_018822696.1 PREDICTED: uncharacterized protein LOC108992559 [...   418   e-129
EOY02239.1 Uncharacterized protein TCM_016763 [Theobroma cacao]       427   e-129
XP_019235551.1 PREDICTED: uncharacterized protein LOC109215887 [...   423   e-128
XP_019248607.1 PREDICTED: uncharacterized protein LOC109227871 [...   419   e-127
XP_019267209.1 PREDICTED: uncharacterized protein LOC109244554 [...   418   e-126
EOY19200.1 Retrotransposon, unclassified-like protein [Theobroma...   412   e-126
EOY02242.1 Uncharacterized protein TCM_016767 [Theobroma cacao]       414   e-125
XP_019263798.1 PREDICTED: uncharacterized protein LOC109241514 [...   411   e-125

>KZV58610.1 hypothetical protein F511_11258 [Dorcoceras hygrometricum]
          Length = 782

 Score =  452 bits (1162), Expect = e-146
 Identities = 252/602 (41%), Positives = 334/602 (55%)
 Frame = +1

Query: 7    GKIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHF 186
            G IWV++   + A    +  Q L + IS       +  S VYAKC Y+ RR LWL L   
Sbjct: 61   GHIWVFFSEEVQAESVLDHPQFLHVKISAPFLPVEIYCSFVYAKCDYIERRDLWLSLLEV 120

Query: 187  SRNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSN 366
             +  SGPW VGGDFN V +ASE  G+    R  M +F S IL + L+DAGF G+S+TWSN
Sbjct: 121  -KPPSGPWLVGGDFNVVRSASECLGSAGGRRTPMEEFNSFILESALMDAGFEGSSYTWSN 179

Query: 367  NRTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRF 546
                   IW RLD V ++  W + F S RV+HLPR  SDH PLL+S  P        FRF
Sbjct: 180  RH-----IWKRLDRVFVSVNWTDHFDSIRVQHLPRTVSDHCPLLVS-APVFARGPTSFRF 233

Query: 547  LRMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAK 726
              MW  H  F Q V   W   C    M  +             WN++VFGNIF  I+EA+
Sbjct: 234  QSMWLHHPDFLQTVRLNWNLPCHIQGMAGLFAKLKRLKNHLKWWNRDVFGNIFDNIREAE 293

Query: 727  ADLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFH 906
              +   E   +   +  +          LA +  ME  FWKQ+A  +WL++G+RNTK FH
Sbjct: 294  KGVALAEAECERDPSGFNWDRLANCNDDLARITAMESDFWKQEAACNWLEDGERNTKLFH 353

Query: 907  SSARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSL 1086
            +  R++   N I  I DD G   TS   I+ +    FE L   EP+   +  L+     +
Sbjct: 354  NLVRKKHVANKIFRIWDD-GNCLTSPTLIQQSGACFFESLLTGEPSALAAPDLSYFFHEI 412

Query: 1087 TLEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFA 1266
            +  +N  +   PS +E ++  F IP+D   GPDGFSS FF  CW IV +DV +A    F 
Sbjct: 413  SDLENISIAATPSLEEVKAVVFSIPRDSVAGPDGFSSAFFQHCWQIVHQDVFRAVLDFFQ 472

Query: 1267 GEQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLE 1446
            G   P++FT++ ISLIPK      ++D+RPISLCNV  KI SK+L +R+  ++ +LIS  
Sbjct: 473  GTPFPQSFTSTTISLIPKCEGPRAWSDFRPISLCNVTNKIISKLLYSRLRNVVGRLISPN 532

Query: 1447 QGAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLKFGF 1626
            Q  FV GR I++NI +A E    L+ KT GGN+ILKLD+ KAYDR+ WSFL  ++   GF
Sbjct: 533  QSGFVPGRLISDNILLAQELTHRLNCKTHGGNVILKLDMAKAYDRVQWSFLLNIMRHLGF 592

Query: 1627 SDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
            SD  V ++  C     FS+ ING +A FFKS RGLRQGDPLSP LFI+  E  SR   +L
Sbjct: 593  SDTVVGMVSRCISACHFSININGTSAGFFKSTRGLRQGDPLSPLLFILGTEYLSRGLDRL 652

Query: 1807 II 1812
             +
Sbjct: 653  FL 654


>KZV36392.1 hypothetical protein F511_03833 [Dorcoceras hygrometricum]
          Length = 884

 Score =  455 bits (1170), Expect = e-146
 Identities = 255/602 (42%), Positives = 335/602 (55%)
 Frame = +1

Query: 7    GKIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHF 186
            G IWV++   + A    +  Q L + IS       V  S VYAKC Y+ RR LWL L   
Sbjct: 61   GHIWVFFSEEVQAESILDHPQFLHVKISAPFLPVEVYCSFVYAKCGYIERRDLWLSLLEV 120

Query: 187  SRNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSN 366
             +  SGPW VGGDFN V +ASE  G+    R  M +F S IL + L+DAGF G+SFTWSN
Sbjct: 121  -KPPSGPWLVGGDFNVVRSASECLGSVGGRRTPMEEFNSFILESALMDAGFEGSSFTWSN 179

Query: 367  NRTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRF 546
                   IW RLD V ++  W + F S RV+HLPR  SDH PLL+S  P        FRF
Sbjct: 180  RH-----IWKRLDRVFVSVNWTDHFDSIRVQHLPRTVSDHCPLLVS-APVFARGPTSFRF 233

Query: 547  LRMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAK 726
              MW  H  F Q V   W   C    M  +             WN++VFGNIF  I+EA+
Sbjct: 234  QSMWLHHPDFLQTVRLNWNLPCHIQGMAGLFAKLKRLKFHLKWWNRDVFGNIFDNIREAE 293

Query: 727  ADLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFH 906
              +   E   +   + ++          LA +  ME  FWKQKA   WL++G+RNTK FH
Sbjct: 294  KGVALAEAECERDPSRSNWDRLANCNDDLARITAMESDFWKQKAACKWLEDGERNTKLFH 353

Query: 907  SSARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSL 1086
            +  R++   N I  I DD G   TS   I+ +    FE L   EP+   +  L+     +
Sbjct: 354  NLVRKKHVANKIFRIWDD-GNCLTSPTLIQQSGAFFFESLLTGEPSALAAPDLSYFSHEI 412

Query: 1087 TLEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFA 1266
            +  +N  +   PS +E ++  F I +D   GPDGFSS FF  CW IV +DV +A    F 
Sbjct: 413  SDLENISIAATPSLEEVKAVVFSIHRDSVAGPDGFSSAFFQHCWQIVHQDVFRAVLDFFQ 472

Query: 1267 GEQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLE 1446
            G  +P++FT++ ISLIPK      ++D+RPISLCNV  KI SK+L +R+  ++ +LIS  
Sbjct: 473  GTPLPQSFTSTTISLIPKCEGPRAWSDFRPISLCNVTNKIISKLLYSRLRNVVGRLISPN 532

Query: 1447 QGAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLKFGF 1626
            Q  FV GR I++NI +A E    L+ KT GGN+ILKLD+ KAYDR+ WSFL  ++   GF
Sbjct: 533  QSGFVPGRLISDNILLAQELTHRLNCKTHGGNVILKLDMAKAYDRVQWSFLLNIMRHLGF 592

Query: 1627 SDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
            SD  V ++  C     FS+ ING +A FFKS RGLRQGDPLSP LFI+ AE  SR   +L
Sbjct: 593  SDTVVGMVSRCISACHFSIKINGTSAGFFKSTRGLRQGDPLSPLLFILGAEYLSRGLDRL 652

Query: 1807 II 1812
             +
Sbjct: 653  FL 654


>EOY02236.1 Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  473 bits (1217), Expect = e-145
 Identities = 247/606 (40%), Positives = 353/606 (58%), Gaps = 4/606 (0%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            KIW++    +N  V  +  Q L + +S      P+  + VYAKC+   R  LW  L   S
Sbjct: 652  KIWIFSSMEVNCEVLMDHIQCLHVRLSLPWLPHPISATFVYAKCTRQERLELWNCLRSLS 711

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
             ++ GPW VGGDFN + + +ER         SM DF + +   GL+DAGF GNSFTW+NN
Sbjct: 712  SDMQGPWMVGGDFNTIVSCAERLNGAPPHGGSMEDFVATLFDCGLIDAGFEGNSFTWTNN 771

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
                  ++ RLD V+ N  WA  F S RV+HL R  SDH PLL+S           FRFL
Sbjct: 772  H-----MFQRLDRVVYNPEWAHCFSSTRVQHLNRDGSDHCPLLISCATASQKGPSTFRFL 826

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
              WT H  F   V ++W    +++ +                WNK++FG+IF ++K A+ 
Sbjct: 827  HAWTKHHDFLPFVERSWQVPLNSSGLTAFWIKQQRLKRDLKWWNKQIFGDIFEKLKRAEI 886

Query: 730  DLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHS 909
            + ++ E   Q   +  +      A  KL     +EE+FW+QK+   WL EG+RNTK FH 
Sbjct: 887  EAEKREKEFQQDPSSINRNLMNKAYAKLNRQLSIEELFWQQKSGVKWLVEGERNTKFFHL 946

Query: 910  SARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPT----VDPSSFLAAIP 1077
              R++  RN I  I+D  G  +     I+ +A+Q+F+ L  AE       DPS     IP
Sbjct: 947  RMRKKRVRNNIFRIQDSEGNIYEDPQYIQNSAVQYFQNLLTAEQCDFSRFDPS----LIP 1002

Query: 1078 RSLTLEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACS 1257
            R++++ DN  L  APS +E +   F+I KD   GPDGFSS F+  CW I+ +D+ +A   
Sbjct: 1003 RTISITDNEFLCAAPSLKEIKEVVFNIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVLD 1062

Query: 1258 LFAGEQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLI 1437
             F G  +P+  T++ + L+PK   + +++D+RPISLC V+ KI +K LA R+SKILP +I
Sbjct: 1063 FFNGTPMPQGVTSTTLVLLPKKPNSCQWSDFRPISLCTVLNKIVTKTLANRLSKILPSII 1122

Query: 1438 SLEQGAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLK 1617
            S  Q  FV GR I++NI +A E +  LD K RGGN++LKLD+ KAYDR++W FL +++ +
Sbjct: 1123 SENQSGFVNGRLISDNILLAQELVGKLDAKARGGNVVLKLDMAKAYDRLNWDFLYLMMKQ 1182

Query: 1618 FGFSDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSF 1797
            FGF+DRW+++++AC  N WFS+LING    +FKS RGLRQGD +SP LF++AA+  SR  
Sbjct: 1183 FGFNDRWISMIKACISNCWFSLLINGSLVGYFKSERGLRQGDSISPLLFVLAADYLSRGI 1242

Query: 1798 RQLIIR 1815
             QL  R
Sbjct: 1243 NQLFNR 1248


>EOY02238.1 Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  463 bits (1191), Expect = e-141
 Identities = 244/584 (41%), Positives = 342/584 (58%), Gaps = 1/584 (0%)
 Frame = +1

Query: 67   QMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFSRNISGPWAVGGDFNAVSTA 246
            Q L + +S      PV  S VYAKC+ + RR LW  L   S  +  PW VGGDFN++ + 
Sbjct: 932  QCLHVKLSLPWLPHPVFTSFVYAKCTRIERRELWTSLRIISDGMQAPWLVGGDFNSIVSC 991

Query: 247  SERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNNRTGEDRIWARLD*VLMNER 426
             ER         SM D +S +   GLLDAGF GNSFTW+NNR     ++ RLD V+ N+ 
Sbjct: 992  DERLNGAIPHDGSMEDLSSTLFDCGLLDAGFEGNSFTWTNNR-----MFQRLDRVVYNQE 1046

Query: 427  WAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFLRMWTLHDSFHQVVHKAWGG 606
            WAE F S RV+HL R  SDH PLL+S    +      FRFL  WT H  F   V K+W  
Sbjct: 1047 WAEFFSSTRVQHLNRDGSDHCPLLISCSNTNQRGPATFRFLHAWTKHHDFISFVEKSWNT 1106

Query: 607  MCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKADLDQLEGLLQDSGTDTSLQ 786
                  +                WNK +FG+IF  ++ A+ + +Q E   Q + +  + +
Sbjct: 1107 PIHAEGLNAFWTKQQRLKRDLKWWNKHIFGDIFKILRLAEVEAEQRELNFQQNPSAANRE 1166

Query: 787  DSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHSSARERVRRNMITSIKDDSG 966
                A  KL     +EE+FW+QK+   WL EG+RNTK FH   R++  RN I  I+D  G
Sbjct: 1167 LMHKAYAKLNRQLSIEELFWQQKSGVKWLVEGERNTKFFHMRMRKKRMRNHIFRIQDQEG 1226

Query: 967  ITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAI-PRSLTLEDNNELLVAPSFQETQS 1143
                    I+ + ++ F+ L +AE   D S F  +I PR ++  DN  L   PS QE + 
Sbjct: 1227 NVLEEPHLIQNSGVEFFQNLLKAEQC-DISRFDPSITPRIISTTDNEFLCATPSLQEVKE 1285

Query: 1144 AAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFAGEQIPRAFTTSMISLIPKS 1323
            A F+I KD   GPDGFSS F+  CW I+ +D+ +A    F G  +PR  T++ + L+PK+
Sbjct: 1286 AVFNINKDSVAGPDGFSSLFYQHCWDIIKQDLFEAVLDFFKGSPLPRGITSTTLVLLPKT 1345

Query: 1324 SEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLEQGAFVKGRAITENIAMALE 1503
               ++++++RPISLC V+ KI +K+LA R+SKILP +IS  Q  FV GR I++NI +A E
Sbjct: 1346 QNVSQWSEFRPISLCTVLNKIVTKLLANRLSKILPSIISENQSGFVNGRLISDNILLAQE 1405

Query: 1504 AMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLKFGFSDRWVALMEAC*GNSWFSV 1683
             +  ++ ++RGGN++LKLD+ KAYDR++W FL +++ +FGF+  W+ +++AC  N WFS+
Sbjct: 1406 LVDKINARSRGGNVVLKLDMAKAYDRLNWEFLYLMMEQFGFNALWINMIKACISNCWFSL 1465

Query: 1684 LINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQLIIR 1815
            LING    +FKS RGLRQGD +SP LFI+AAE  SR   QL  R
Sbjct: 1466 LINGSLVGYFKSERGLRQGDSISPSLFILAAEYLSRGLNQLFSR 1509


>EOY17513.1 Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  454 bits (1169), Expect = e-138
 Identities = 243/599 (40%), Positives = 342/599 (57%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            KIW+++       V  +  Q L + ++      P+  + VYAKC+   R  LW  L + +
Sbjct: 912  KIWLFHSVEFICEVLLDHPQCLHVRVTIPWLDLPIFTTFVYAKCTRSERTPLWNCLRNLA 971

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
             ++ GPW VGGDFN +    ER         S+ DFAS +L  GLLD GF GN FTW+NN
Sbjct: 972  ADMEGPWIVGGDFNIILKREERLYGADPHEGSIEDFASVLLDCGLLDGGFEGNPFTWTNN 1031

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
            R     ++ RLD ++ N++W   FP  R++HL R  SDH PLLLS       A   FRFL
Sbjct: 1032 R-----MFQRLDRMVYNQQWINKFPITRIQHLNRDGSDHCPLLLSCSNSSEKAPSSFRFL 1086

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
              W LH +F+  V   W    + + ++               WNK VFG+IF  IKEA+ 
Sbjct: 1087 HAWALHHNFNASVEGNWNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFGDIFSNIKEAEK 1146

Query: 730  DLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHS 909
             +++ E L Q   T  S      +  +L     MEEIFWKQK+   W+ EG+RNTK FH 
Sbjct: 1147 RVEECEILHQQEQTIGSRIQLNKSYAQLNKQLSMEEIFWKQKSGVKWVVEGERNTKFFHM 1206

Query: 910  SARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSLT 1089
              +++  R+ I  I++  G       Q++ +AI  F  L +AE   D     +  P  ++
Sbjct: 1207 RMQKKRIRSHIFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAESCDDTRFQSSLCPSIIS 1266

Query: 1090 LEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFAG 1269
              DN  L   P+ QE + A F I  + A GPDGFSS F+  CW I+  D+ +A    F G
Sbjct: 1267 DTDNGFLCAEPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEFFHG 1326

Query: 1270 EQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLEQ 1449
              IP+  T++ + LIPK++ A+K++++RPISLC V+ KI +KILA R++KILP +I+  Q
Sbjct: 1327 ADIPQGMTSTTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKILPSIITENQ 1386

Query: 1450 GAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLKFGFS 1629
              FV GR I++NI +A E +  LD+K RGGN+ LKLD+ KAYDR+DWSFL  VL   GF+
Sbjct: 1387 SGFVGGRLISDNILLAQELIGKLDQKNRGGNVALKLDMMKAYDRLDWSFLFKVLQHLGFN 1446

Query: 1630 DRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
             +W+ +++ C  N WFS+L+NG    +FKS RGLRQGD +SP LFI+AAE  +R    L
Sbjct: 1447 AQWIGMIQKCISNCWFSLLLNGRTVGYFKSERGLRQGDSISPQLFILAAEYLARGLNAL 1505


>EOY14356.1 Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  453 bits (1165), Expect = e-138
 Identities = 248/603 (41%), Positives = 349/603 (57%), Gaps = 4/603 (0%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            KIW+++   +++ +  +  Q L + ++      P   + VYAKC+   R  LW  L   +
Sbjct: 949  KIWLFHSLELHSDIILDHPQCLHVRLTSPWLEKPFFATFVYAKCTRSERTLLWDCLRRLA 1008

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
             +   PW VGGDFN +    ER         SM DFAS +L  GLLD GF GN FTW+NN
Sbjct: 1009 ADNEEPWLVGGDFNIILKREERLYGSAPHEGSMEDFASVLLDCGLLDGGFEGNPFTWTNN 1068

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
            R     ++ RLD V+ N +W   FP  R++HL R  SDH PLL+S       +   FRF 
Sbjct: 1069 R-----MFQRLDRVVYNHQWINMFPITRIQHLNRDGSDHCPLLISCFISSEKSPSSFRFQ 1123

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
              W LH  F   V   W    + + +                WNK VFG+IF ++KEA+ 
Sbjct: 1124 HAWVLHHDFKTSVEGNWNLPINGSGLQAFWIKQHRLKQHLKWWNKAVFGDIFSKLKEAEK 1183

Query: 730  DLDQLEGLLQDS---GTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKH 900
             +++ E L Q     G+  +L  S A   K  ++E   EIFWKQK+   W+ EG+RNTK 
Sbjct: 1184 RVEECEILHQQEQTVGSRINLNKSYAQLNKQLNVE---EIFWKQKSGVKWVVEGERNTKF 1240

Query: 901  FHSSARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPR 1080
            FH   +++  R+ I  +++  G     Q Q+K +AI++F  L +AEP  D S F  ++  
Sbjct: 1241 FHMRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIEYFSSLLKAEPC-DISRFQNSLIP 1299

Query: 1081 SLTLEDNNELLVA-PSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACS 1257
            S+     NELL A P+ QE + A FDI  + A GPDGFSS F+  CW  +  D+  A   
Sbjct: 1300 SIISNSENELLCAEPNLQEVKDAVFDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDAVRD 1359

Query: 1258 LFAGEQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLI 1437
             F G  IPR  T++ + L+PK S A+K++++RPISLC V+ KI +K+L+ R++KILP +I
Sbjct: 1360 FFHGANIPRGVTSTTLVLLPKKSSASKWSEFRPISLCTVMNKIITKLLSNRLAKILPSII 1419

Query: 1438 SLEQGAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLK 1617
            +  Q  FV GR I++NI +A E +R LD K+RGGN+ LKLD+ KAYDR+DWSFL  VL  
Sbjct: 1420 TENQSGFVGGRLISDNILLAQELIRKLDTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQH 1479

Query: 1618 FGFSDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSF 1797
            FGF+++W+ +++ C  N WFS+L+NG    +FKS RGLRQGD +SP LFI+AAE  SR  
Sbjct: 1480 FGFNEQWIGMIQKCISNCWFSLLLNGRIEGYFKSERGLRQGDSISPQLFILAAEYLSRGL 1539

Query: 1798 RQL 1806
              L
Sbjct: 1540 NAL 1542


>EOY17514.1 Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  449 bits (1155), Expect = e-136
 Identities = 244/570 (42%), Positives = 334/570 (58%), Gaps = 4/570 (0%)
 Frame = +1

Query: 109  PVLLSIVYAKCSYLSRRSLWLELEHFSRNISGPWAVGGDFNAVSTASERQGTRTASRLSM 288
            P  ++IVYAKC+   R  LW  L   + +I  PW VGGDFN +    ER         +M
Sbjct: 980  PFFVTIVYAKCTRSERTLLWDCLRRLADDIEVPWLVGGDFNVILKREERLYGSAPHEGAM 1039

Query: 289  SDFASAILHAGLLDAGFVGNSFTWSNNRTGEDRIWARLD*VLMNERWAEAFPSFRVEHLP 468
             DFAS +L  GLLD GF GNSFTW+NNR     ++ RLD ++ N  W   FP  R++HL 
Sbjct: 1040 EDFASTLLDCGLLDGGFEGNSFTWTNNR-----MFQRLDRIVYNHHWINKFPVTRIQHLN 1094

Query: 469  RANSDHAPLLLSFPPQHTPAIRPFRFLRMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXX 648
            R  SDH PLL+S       A   FRF   W LH  F   V   W    + + +       
Sbjct: 1095 RDGSDHCPLLISCFNSSEKAPSSFRFQHAWVLHHDFKTSVESNWNLPINGSGLQAFWSKQ 1154

Query: 649  XXXXXXXXVWNKEVFGNIFCQIKEAKADLDQLEGLLQDSGTDTS---LQDSLAARRKLAH 819
                     WNK VFG+IF ++KEA+  +++ E L Q   T  S   L  S A   K  +
Sbjct: 1155 HRLKQHLKWWNKAVFGDIFSKLKEAEKRVEECEILHQQEQTFESRIKLNKSYAQLNKQLN 1214

Query: 820  LELMEEIFWKQKARNSWLDEGDRNTKHFHSSARERVRRNMITSIKDDSGITFTSQAQIKA 999
            +E   E+FWKQK+   W+ EG+RNTK FH   +++  R+ I  ++D  G     Q Q+K 
Sbjct: 1215 IE---ELFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSHIFKVQDPEGRWIEDQEQLKH 1271

Query: 1000 AAIQHFERLFQAEPTVDPSSFLAAIPRSLTLEDNNELLVA-PSFQETQSAAFDIPKDGAP 1176
            +AI++F  L + EP  D S F +++  S+     NELL A PS QE + A F I  + A 
Sbjct: 1272 SAIEYFSSLLKVEPCYD-SRFQSSLIPSIISNSENELLCAEPSLQEVKDAVFGINSESAA 1330

Query: 1177 GPDGFSSTFFIDCWPIVGEDVHKAACSLFAGEQIPRAFTTSMISLIPKSSEATKFADYRP 1356
            GPDGFSS F+  CW I+ +D+  A    F G  IPR  T++ + L+PK S A+K++D+RP
Sbjct: 1331 GPDGFSSYFYQQCWNIIAQDLLDAVRDFFHGANIPRGVTSTTLILLPKKSSASKWSDFRP 1390

Query: 1357 ISLCNVVYKIFSKILATRMSKILPKLISLEQGAFVKGRAITENIAMALEAMRHLDRKTRG 1536
            ISLC V+ KI +K+L+ R++K+LP +I+  Q  FV GR I++NI +A E +  L+ K+RG
Sbjct: 1391 ISLCTVMNKIITKLLSNRLAKVLPSIITENQSGFVGGRLISDNILLAQELIGKLNTKSRG 1450

Query: 1537 GNIILKLDLEKAYDRIDWSFLKMVLLKFGFSDRWVALMEAC*GNSWFSVLINGEAASFFK 1716
            GN+ LKLD+ KAYD++DWSFL  VL  FGF+ +W+ +++ C  N WFS+L+NG    +FK
Sbjct: 1451 GNLALKLDMMKAYDKLDWSFLFKVLQHFGFNGQWIKMIQKCISNCWFSLLLNGRTEGYFK 1510

Query: 1717 SFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
            S RGLRQGD +SP LFIIAAE  SR    L
Sbjct: 1511 SERGLRQGDSISPQLFIIAAEYLSRGLNAL 1540


>EOY25454.1 Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  449 bits (1155), Expect = e-136
 Identities = 242/591 (40%), Positives = 343/591 (58%), Gaps = 1/591 (0%)
 Frame = +1

Query: 37   INATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFSRNISGPWAV 216
            +++ V  +  Q L + ++     FP+ ++ VYAKC+   R  LW  L   + +I  PW V
Sbjct: 1128 LHSDVIFDHPQCLHVRLTSPWLEFPIFVTFVYAKCTRSERTLLWDCLRRLAADIEVPWLV 1187

Query: 217  GGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNNRTGEDRIWA 396
            GGDFN +    ER         +M DFAS +L  GLLD GF GN FTW+NNR     ++ 
Sbjct: 1188 GGDFNIILKREERLYGSAPHEGAMEDFASTLLDCGLLDGGFEGNPFTWTNNR-----MFQ 1242

Query: 397  RLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFLRMWTLHDSF 576
            RLD ++ N  W   FP  R++HL R  SDH PLL+S       A   FRF   W LH  F
Sbjct: 1243 RLDRIVYNHHWINKFPITRIQHLNRDGSDHCPLLISCFNSSEKAPSSFRFQHAWVLHHDF 1302

Query: 577  HQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKADLDQLEGLL 756
               V   W    + + +                WNK +FG+IF ++KEA+  +++ E L 
Sbjct: 1303 KTSVESNWNLPINGSGLQAFWSKQHRLKQHLKWWNKVMFGDIFSKLKEAEKRVEECEILH 1362

Query: 757  QDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHSSARERVRRN 936
            Q+  T  S+     +  +L     +EEIFWKQK+   W+ EG+RNTK FH+  +++  R+
Sbjct: 1363 QNEQTVESIIKLNKSYAQLNKQLNIEEIFWKQKSGVKWVVEGERNTKFFHTRMQKKRIRS 1422

Query: 937  MITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSLTLEDNNELLV 1116
             I  +++  G     Q Q+K +AI++F  L + EP  D S F  ++  S+     NELL 
Sbjct: 1423 HIFKVQEPDGRWIEDQEQLKQSAIKYFSSLLKFEPC-DDSRFQRSLIPSIISNSENELLC 1481

Query: 1117 A-PSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFAGEQIPRAFT 1293
            A P+ QE + A F I  + A GPDGFSS F+  CW I+  D+  A    F G  IPR  T
Sbjct: 1482 AEPNLQEVKDAVFGIDPESAAGPDGFSSYFYQQCWNIIAHDLLDAVRDFFHGANIPRGVT 1541

Query: 1294 TSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLEQGAFVKGRA 1473
            ++ + L+PK   A+K++D+RPISLC V+ KI +K+L+ R++KILP +I+  Q  FV GR 
Sbjct: 1542 STTLILLPKKPSASKWSDFRPISLCTVMNKIITKLLSNRLAKILPSIITENQSGFVGGRL 1601

Query: 1474 ITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLKFGFSDRWVALME 1653
            I++NI +A E +  L+ K+RGGN+ LKLD+ KAYDR+DWSFL  VL  FGF+D+W+ +++
Sbjct: 1602 ISDNILLAQELIGKLNTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHFGFNDQWIGMIQ 1661

Query: 1654 AC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
             C  N WFS+L+NG    +FK  RGLRQGDP+SP LF+IAAE  SR    L
Sbjct: 1662 KCISNCWFSLLLNGRTEGYFKFERGLRQGDPISPQLFLIAAEYLSRGLNAL 1712


>XP_011085143.1 PREDICTED: uncharacterized protein LOC105167219 [Sesamum indicum]
          Length = 1203

 Score =  433 bits (1113), Expect = e-135
 Identities = 234/594 (39%), Positives = 331/594 (55%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            KIW +    ++  +  +  Q L L I        +L + VYAK +   RR LW  L +  
Sbjct: 32   KIWCFMKEDLDCEILISQEQFLHLRIFSDFWPNGILCTWVYAKHTRAERRELWDALRNID 91

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
                 PW +GGDFN V   SER+G       +M DF   ++  GL DAGF G+ FTWS +
Sbjct: 92   DG-EEPWLLGGDFNTVLYCSERKGGAAPKIRTMEDFGDMMMDCGLQDAGFEGSKFTWSRS 150

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
            R     +W RLD  L +  W +AFP  R++HL R  SDH PLLLS   +      PFRF 
Sbjct: 151  R-----LWQRLDRFLFSHTWTQAFPLSRIQHLTRNVSDHCPLLLSVKQEKKTGPTPFRFQ 205

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
             MWT H  F   V  +W      + M               +WN EVFGNIF  I +A+ 
Sbjct: 206  NMWTKHHDFKHCVTTSWQHPIHGHGMFAFQQKLHRIKAALKLWNTEVFGNIFQNITDAEQ 265

Query: 730  DLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHS 909
             +   E       +D +L     A  +L     +EE +WKQKA   WL+EG++NTK+FHS
Sbjct: 266  RVKIAEQAYDGDPSDENLIAMNKATAELTFALSVEESYWKQKAACKWLEEGEKNTKYFHS 325

Query: 910  SARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSLT 1089
              +++ +++ I  I+ + G T T    IK + + +F + F  + TV     L  +P  L+
Sbjct: 326  LTKKKRKQSRIYKIQHN-GATLTKAEDIKVSVVDYFTQAFTRDDTVSVDD-LHWVPNILS 383

Query: 1090 LEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFAG 1269
             ED ++L   P+ ++ ++  FD+      GPDGFS+ FF  CW I+G+D++ A     +G
Sbjct: 384  EEDRHQLNATPTIEDVKTIIFDMCPHSTAGPDGFSAHFFQCCWEIIGQDLYGAVLDFLSG 443

Query: 1270 EQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLEQ 1449
               P+ FTT+ I LIPK    + + D+RPISLCNV  KI SK++  +M+K+LPK+IS  Q
Sbjct: 444  STPPKNFTTTTIVLIPKIEAPSTWKDFRPISLCNVTGKILSKVINNQMAKLLPKIISPSQ 503

Query: 1450 GAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLKFGFS 1629
             +FV+GR I++NI +A E    L +     N I K+D+EKAYDR++W+FL  +L++ GF 
Sbjct: 504  SSFVQGRMISDNILLAQELSHCLGKNGSLSNTIFKIDMEKAYDRVNWTFLYHMLMRVGFP 563

Query: 1630 DRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSR 1791
              W+ +++    N WFS+LINGE   FFKS RGLRQGDPLSP LF+IAAE  SR
Sbjct: 564  THWINMIKKLIENCWFSILINGEGVGFFKSTRGLRQGDPLSPTLFVIAAECLSR 617


>XP_019166530.1 PREDICTED: uncharacterized protein LOC109162265 [Ipomoea nil]
          Length = 906

 Score =  421 bits (1083), Expect = e-133
 Identities = 232/604 (38%), Positives = 338/604 (55%), Gaps = 10/604 (1%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVL----LSIVYAKCSYLSRRSLWLEL 177
            K WV++   +         Q+++L       T P +    +S +YAKC    R +LW ++
Sbjct: 70   KTWVFWDRGVTLISIEIGDQLINLE-----ATVPEIGNFNISFIYAKCDRSIRLNLWEQI 124

Query: 178  EHFSRNISGP-----WAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFV 342
            E  +    GP     W++ GDFN +  + E++G +  S     DF + +  A L +  F 
Sbjct: 125  ESMAE---GPMKDQKWSLIGDFNCILKSEEKRGGQPYSMWKSRDFQNCVDSARLREVVFY 181

Query: 343  GNSFTWSNNRTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHT 522
            GN FTW N R GE  +W RLD   +NE W  +  +  + H  +  SDH+PL++S  PQ  
Sbjct: 182  GNPFTWWNGRRGEQAVWERLDRGFVNENWEGSLKT-HIHHYAKLTSDHSPLIMSVEPQVR 240

Query: 523  PAIRPFRFLRMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNI 702
             + RPF FL  W  H+ F  +V +AW    + N M                WN E FGNI
Sbjct: 241  MSRRPFSFLNSWGEHEQFLGIVREAWQERVNGNEMYTFMTKLKRVKEVLKKWNWETFGNI 300

Query: 703  FCQIKEAKADLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEG 882
            F +++E +  + +LE  LQ+  TD +L +    +  L     MEE +W+QKA + W+ EG
Sbjct: 301  FKKVEELEGKMRELEEKLQERPTDETLLEYKRVQALLQRQVRMEESYWQQKAHSQWVVEG 360

Query: 883  DRNTKHFHSSARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPT-VDPSS 1059
            +RN+K+F    RER  R +I  I  D G+    Q +I   A++ F++LF AE T +DPS 
Sbjct: 361  ERNSKYFQGLVRERRSRQIIHKIMSDEGVWVEDQGKIAGMAVEFFQQLFTAEATNLDPSI 420

Query: 1060 FLAAIPRSLTLEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDV 1239
            F   +   +T ++N  L+  PS +E +S+ F +  + A GPDGF   F+  CW I+ ED+
Sbjct: 421  F-DCLQVVVTEQENQALVAEPSMEEVRSSVFSMNANSAAGPDGFGGGFYQLCWEIIKEDL 479

Query: 1240 HKAACSLFAGEQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSK 1419
             K   S FAG+ + +A T++ I LIPK S    F +YRPISL N   KI +KI+  R++ 
Sbjct: 480  LKMVRSFFAGKSLTKAITSTSIVLIPKVSNPANFGEYRPISLSNFCSKIITKIMVLRLAG 539

Query: 1420 ILPKLISLEQGAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFL 1599
            +L ++IS  Q  FVKGR+IT+NI +A E    +  K    +I+LKLD+ KAYDR+DW  +
Sbjct: 540  VLNRVISPVQAGFVKGRSITDNILLAQEICHGM--KASNEDIVLKLDMAKAYDRMDWGCI 597

Query: 1600 KMVLLKFGFSDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAE 1779
             +VL + GF  +W+ L+     N W+SV++NGEA  FF S RGLRQGDPLSP LFI+AAE
Sbjct: 598  ALVLTRLGFCKKWIDLVLNSINNIWYSVIVNGEARGFFHSSRGLRQGDPLSPSLFILAAE 657

Query: 1780 AFSR 1791
             FSR
Sbjct: 658  LFSR 661


>XP_016512671.1 PREDICTED: uncharacterized protein LOC107829719 [Nicotiana tabacum]
          Length = 1679

 Score =  434 bits (1117), Expect = e-132
 Identities = 226/600 (37%), Positives = 329/600 (54%), Gaps = 1/600 (0%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            KIWV+     + TV  N  Q L+L +         +L++VYAKC  + R  LW  L   +
Sbjct: 63   KIWVFLDEVFDVTVMYNMVQQLTLRLHHSETHVEFVLTLVYAKCDAIERIELWDSLYSMA 122

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
             ++  PW VGGDFN +    E+ G        + DF   I    L+D GF G+ +TW N 
Sbjct: 123  ADMDVPWLVGGDFNVILDEEEKFGGLPVHINEIDDFRHCINTCNLVDLGFKGSIYTWWNG 182

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
            R  ED I+ R+D  + N  + + FP   V+HLP+  SDH+P+ +    +  P I+PFRFL
Sbjct: 183  RAEEDCIFERIDRCVANSEFQDTFPGIEVQHLPKIGSDHSPMQIKCDIEAPPVIKPFRFL 242

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
              W  H SF ++V + W    S NP                VW+K+ FG+IF +I   + 
Sbjct: 243  NFWVEHASFKEIVKEHWTADFSANPYTIFNHKLKKLKKALSVWSKQTFGDIFQKIASLEE 302

Query: 730  DLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHS 909
             +   E   + + T  ++        +L  +  +EE +WKQKA  SW  +GDRNTK FH+
Sbjct: 303  VVLVHEAEFEANPTKLNMHRLQKVNAELIKVLALEEKYWKQKAEMSWFKDGDRNTKFFHA 362

Query: 910  SARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSLT 1089
              R R ++  +  I++  G+      +I A AIQ F   F+          +  +P  L 
Sbjct: 363  QVRCRRKKLQLNRIQNKQGVWIEEDEEIAAEAIQFFTDQFRESAASTSFQIIDNVPVLLD 422

Query: 1090 LEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFAG 1269
            ++ N EL+  P+ +E ++A F +  D A  PDG++  F+  CW I+G DV     + F G
Sbjct: 423  MDQNEELIKQPTIEEVKAAVFGLNGDSAGSPDGYTGKFYQSCWEIIGGDVFDMVRAFFNG 482

Query: 1270 EQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLEQ 1449
              +P+  T + + L+PK  E   F+D RPISL N   K+ S+++  R+  +LP LIS EQ
Sbjct: 483  HDLPKCVTHTNLVLLPKKKEVCTFSDLRPISLSNFSNKVISRVIHERLVDLLPNLISEEQ 542

Query: 1450 GAFVKGRAITENIAMALEAMRHLDRKTRGG-NIILKLDLEKAYDRIDWSFLKMVLLKFGF 1626
              FVKGR+I ENI +  E +  +  +T+ G N+ILKLD+ KAYDR+ W FL  VL K GF
Sbjct: 543  SGFVKGRSIVENILLTQEIVTDMRLRTKAGPNVILKLDMTKAYDRLSWLFLTKVLRKMGF 602

Query: 1627 SDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
            S+R++ ++     N+W+SVLING+A  FFKS RG++QGDPLSP LFI+AAEA SRS   L
Sbjct: 603  SERFIGMVYGIVSNNWYSVLINGQAHGFFKSSRGVKQGDPLSPTLFILAAEALSRSLNAL 662


>XP_019260139.1 PREDICTED: uncharacterized protein LOC109238160 [Nicotiana attenuata]
          Length = 1534

 Score =  424 bits (1091), Expect = e-129
 Identities = 226/602 (37%), Positives = 330/602 (54%), Gaps = 3/602 (0%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            KIW +       T+  N +Q L+L +        ++L++VYAKC  + R  LW  L   +
Sbjct: 63   KIWAFIDEVYEVTILYNMTQQLTLKLFHTETHIELILTLVYAKCDAIERIELWDSLYAMA 122

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
             +++ PW VGGDFN +    E+ G        + DF   I    L D GF G+ +TW N 
Sbjct: 123  TDMTSPWLVGGDFNVIWDEEEKFGGLPVHLNEIDDFRHCINTCNLTDLGFKGSIYTWWNG 182

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
            R+ ED I+ RLD  L N    + FP   + HL +  SDH PL L    +  P  + FRFL
Sbjct: 183  RSEEDCIFERLDRCLGNLELQQTFPGLEITHLSKIGSDHCPLYLKCDIEAAPIRKSFRFL 242

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
              WT HD+F  VV + W    + NP I               W++  +G+IF +I   + 
Sbjct: 243  NFWTKHDTFKDVVRENWNADFAANPFILFNYKLKKLKKALSSWSRATYGDIFQKIASLEE 302

Query: 730  DLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHS 909
             +   E   +   T  + +     + ++     +EE FWKQKA   W  +GDRNTK FH+
Sbjct: 303  VVLVHERQFEAFPTQMNRERLHKVKAEMIRYLAVEEEFWKQKAGMLWFKDGDRNTKFFHA 362

Query: 910  SARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSF--LAAIPRS 1083
              R R ++  +  I+++ GI    + QI   A+  ++  F    +V PS+F  +  +P  
Sbjct: 363  QVRGRRKKLQLRRIQNNMGIWIEEEEQIAEEAVSFYKDQF--TESVVPSTFHIIDHVPTL 420

Query: 1084 LTLEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLF 1263
            +  E N  L   P+ +E + A + +  D A GPDGF+  F+  CW IVGED++      F
Sbjct: 421  VEEEQNARLTELPTKEEVRKAVYGLNGDSAGGPDGFTGAFYHTCWDIVGEDIYAMVLQFF 480

Query: 1264 AGEQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISL 1443
             G+Q+P+  T + + L+PK  E T F+D RPISL N + KIFS+++  R+ +ILP LIS 
Sbjct: 481  CGQQLPKCVTHTNLVLLPKKKEVTTFSDLRPISLSNFINKIFSRVIHDRLVEILPNLISE 540

Query: 1444 EQGAFVKGRAITENIAMALEAMRHLDRKTRGG-NIILKLDLEKAYDRIDWSFLKMVLLKF 1620
            EQ  FVKGR+I ENI +  E +  +  +T+ G N+++KLD+ KAYDR+ W FL  VL K 
Sbjct: 541  EQAGFVKGRSIVENILLTQEIITDIRLRTKAGPNVVMKLDMTKAYDRLSWLFLTKVLRKM 600

Query: 1621 GFSDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFR 1800
            GF +R++ ++    GN+W+SVLING+   FFKS RG++QGDPLSP LFI+AAEA SR   
Sbjct: 601  GFCERFIGMIFDLVGNNWYSVLINGQPRGFFKSTRGVKQGDPLSPTLFILAAEALSRGLN 660

Query: 1801 QL 1806
             L
Sbjct: 661  SL 662


>XP_018822696.1 PREDICTED: uncharacterized protein LOC108992559 [Juglans regia]
          Length = 1206

 Score =  418 bits (1075), Expect = e-129
 Identities = 232/601 (38%), Positives = 332/601 (55%), Gaps = 1/601 (0%)
 Frame = +1

Query: 1    EGGKIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELE 180
            +GGK+WV+++      +   ++Q +S    +      VL++ VYAKCSY+ RR LW +LE
Sbjct: 363  QGGKLWVFWNIPNIFEIVLCTTQSVSGWFKWD--AHRVLVTFVYAKCSYVDRRELWHDLE 420

Query: 181  HFSRNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTW 360
             ++ ++  PW V GDFN +   SER G      +SM +F   I   GLL+    G   +W
Sbjct: 421  EWT-DLDQPWLVLGDFNVIRRDSERVGGNPRPFISMLEFNDCIDRCGLLEVSSSGQCMSW 479

Query: 361  SNNRTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLL-SFPPQHTPAIRP 537
             N   G  R WA+LD VLMN  +A  FPS    +L R +SDH P+++ S  P  +    P
Sbjct: 480  CNGHGGVSRSWAKLDRVLMNNFFATLFPSVHFNYLSRKSSDHCPMVVYSDHPAMSYGPSP 539

Query: 538  FRFLRMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIK 717
            F FL MW  HD F   V +AW    + + ++ +             WNK VFG +   I 
Sbjct: 540  FHFLNMWCSHDGFLMCVKEAWNQQDTASGLLKLSIRLKRTKIALRAWNKNVFGRVDVNIH 599

Query: 718  EAKADLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTK 897
              +  LD L+  LQ   ++    D +A + ++   E ME     Q A+  WL EGD+N+K
Sbjct: 600  ALEEKLDFLDSQLQSGFSEEIEDDFVATKTEIEIWEKMEASRLGQIAKKKWLTEGDQNSK 659

Query: 898  HFHSSARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIP 1077
             FHS   +R  +  I+ +    G    +  ++   A+ +F        TV+       I 
Sbjct: 660  FFHSVINQRRNKGHISKMVLVDGRVLCTAEEVHEEAVAYFRNFLSDVSTVEHCDLSRLIE 719

Query: 1078 RSLTLEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACS 1257
            + ++ E+N  L  APS  E + A F IPK+ +PGPDGF S F++ CW IV EDV  AA  
Sbjct: 720  KKISNEENRWLCAAPSKLEVKQAVFSIPKNSSPGPDGFGSGFYMSCWDIVKEDVVAAAGD 779

Query: 1258 LFAGEQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLI 1437
             F G  + R +++S I LIPK  E + F  +RPISLC+V YK FSKI   R++ +L  L+
Sbjct: 780  FFRGVPLSRFYSSSFIVLIPKVPEPSGFDKFRPISLCSVAYKNFSKIFVNRLNSVLDVLV 839

Query: 1438 SLEQGAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLK 1617
            S EQGAF+  R+I ENI +A E ++ L +K+ GGN+++KLD+ KAYDR++W FL  V+  
Sbjct: 840  SHEQGAFIPRRSIFENITLAQEMVQSLHKKSVGGNVLIKLDMAKAYDRVNWDFLLHVIRA 899

Query: 1618 FGFSDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSF 1797
            FGFSD    ++  C  + WFSV++NG    FF+S RGLRQGD LSP LFI+  E  SR  
Sbjct: 900  FGFSDIVCQIIANCVQSQWFSVMMNGTFKGFFQSARGLRQGDSLSPYLFILMEEVLSRLL 959

Query: 1798 R 1800
            R
Sbjct: 960  R 960


>EOY02239.1 Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  427 bits (1098), Expect = e-129
 Identities = 232/548 (42%), Positives = 319/548 (58%), Gaps = 5/548 (0%)
 Frame = +1

Query: 178  EHFSRNI----SGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVG 345
            E+F R +    +GPW VGGDFN++ +  ER         SM DFAS +   GLLDAGF G
Sbjct: 878  EYFRRKLGFHKAGPWMVGGDFNSIVSTVERLNGAAPHVGSMEDFASTLFDCGLLDAGFEG 937

Query: 346  NSFTWSNNRTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTP 525
            NSFTW+NN      ++ RLD V+ N  WA+ F S RV+HL R  SDH PLL+S       
Sbjct: 938  NSFTWTNNH-----MFQRLDRVVYNPEWAQCFSSTRVQHLNRDGSDHCPLLISCNTASQK 992

Query: 526  AIRPFRFLRMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIF 705
                FRFL  WT H  F   V ++W      + +                WNK +FG+IF
Sbjct: 993  GASTFRFLHAWTKHHDFLPFVTRSWQTPIQGSGLSAFWFKQQRLKRDLKWWNKHIFGDIF 1052

Query: 706  CQIKEAKADLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGD 885
             +++ A+ + ++ E   Q + + T+      A  KL     +EE+FW+QK+   WL EG+
Sbjct: 1053 EKLRLAEEEAEKKEIEFQHNPSLTNRNLMHKAYAKLNRQLSIEELFWQQKSGVKWLVEGE 1112

Query: 886  RNTKHFHSSARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSF- 1062
             NTK FH   R++  R+ I  I+D  G  F     I+ +A   F  L QAE   D S F 
Sbjct: 1113 NNTKFFHMRMRKKRVRSHIFQIQDSEGNVFDDIHSIQKSATDFFRDLMQAE-NCDLSRFD 1171

Query: 1063 LAAIPRSLTLEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVH 1242
             + IPR ++  DN  L  AP  QE + A F+I KD   GPDGFSS F+  CW I+  D+ 
Sbjct: 1172 PSLIPRIISSADNEFLCAAPPLQEIKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKNDLL 1231

Query: 1243 KAACSLFAGEQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKI 1422
             A    F G  +PR  T++ + L+PK   A  +++YRPISLC V+ KI +K+LA R+SKI
Sbjct: 1232 DAVLDFFRGSPLPRGVTSTTLVLLPKKPNACHWSEYRPISLCTVLNKIVTKLLANRLSKI 1291

Query: 1423 LPKLISLEQGAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLK 1602
            LP +IS  Q  FV GR I++NI +A E +  +D K+RGGN++LKLD+ KAYDR++W FL 
Sbjct: 1292 LPSIISENQSGFVNGRLISDNILLAQELIGKIDAKSRGGNVVLKLDMAKAYDRLNWDFLY 1351

Query: 1603 MVLLKFGFSDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEA 1782
            +++  FGF+  W+ ++++C  N WFS+LING  A +FKS RGLRQGD +SP LFI+AA+ 
Sbjct: 1352 LMMEHFGFNAHWINMIKSCISNCWFSLLINGSLAGYFKSERGLRQGDSISPMLFILAADY 1411

Query: 1783 FSRSFRQL 1806
             SR    L
Sbjct: 1412 LSRGLNHL 1419


>XP_019235551.1 PREDICTED: uncharacterized protein LOC109215887 [Nicotiana attenuata]
          Length = 1724

 Score =  423 bits (1087), Expect = e-128
 Identities = 225/600 (37%), Positives = 327/600 (54%), Gaps = 1/600 (0%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            KIW +       TV  N  Q L+L +      F  +L++VYAKC  + R  LW  L + +
Sbjct: 63   KIWAFIDEVYEVTVVYNLVQQLTLRLLHTESHFEFVLTLVYAKCDVIERIELWDSLYYMA 122

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
            ++++ PW VGGDFN +    E+      S   + DF   I    L+D GF G+ FTW N 
Sbjct: 123  QDMTVPWLVGGDFNVIWDEEEKFRGLPVSLNEIDDFRHCINTCNLMDLGFKGSIFTWWNG 182

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
            R  ED I+ RLD  + N  + +AFP   V+HL +  SDH PL L    +  P  +PFRFL
Sbjct: 183  RAEEDCIFKRLDRCVANTEFQQAFPGIEVQHLSKIGSDHCPLQLKCDLETPPVKKPFRFL 242

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
              W  H SF +VV + W    S +P +               W+K+ FG+IF +I   + 
Sbjct: 243  NFWVDHASFKEVVRQNWTADFSASPYVIFNHKLKKLKKVLSEWSKKTFGDIFQKIASLEE 302

Query: 730  DLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHS 909
             +   E   + + T  + Q     + +L     +EE FWKQKA  SW  +GDRNTK FH+
Sbjct: 303  VVLVHEAEFEANPTRLNRQRLQKVQAELIKFLAVEEKFWKQKAGMSWFKDGDRNTKFFHA 362

Query: 910  SARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSLT 1089
              R R ++  ++ I++  G+       I   AI+ FE  F           +  +P  + 
Sbjct: 363  QVRGRRKKLQLSRIQNSQGVWIEEDEDIAKEAIKFFENQFTENAVPTSFEIIEHVPSLID 422

Query: 1090 LEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFAG 1269
            +  N+EL+  P+ +E ++A F +  D A GPDGF+  F+  CW I+G D+     + F G
Sbjct: 423  MGQNSELIKQPTLEEVKTAVFGLNGDSAGGPDGFTGKFYQSCWDIIGGDLFDMVRAFFNG 482

Query: 1270 EQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLEQ 1449
             ++ +  T S + L+PK  E   F+D RPISL N   K+ S+++  R+  +LP LIS EQ
Sbjct: 483  HELLKCVTHSNLVLLPKKKEVCTFSDLRPISLSNFTNKVISRVIHERLVGMLPGLISDEQ 542

Query: 1450 GAFVKGRAITENIAMALEAMRHLDRKTRGG-NIILKLDLEKAYDRIDWSFLKMVLLKFGF 1626
              FVKGR+I ENI +  E +  +  +T+ G N+I+KLD+ KAYDR+ W FL  VL + GF
Sbjct: 543  SGFVKGRSIVENILLTQEIVTDMRLRTKAGPNVIMKLDMTKAYDRLSWLFLTKVLRRMGF 602

Query: 1627 SDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
            S+R++ ++     N+W+SVLING+A  FF S RG++QGDPLSP LFI+AAEA SR    L
Sbjct: 603  SERFIGMVFGIVSNNWYSVLINGQAHGFFTSSRGVKQGDPLSPTLFILAAEALSRGLNAL 662


>XP_019248607.1 PREDICTED: uncharacterized protein LOC109227871 [Nicotiana attenuata]
          Length = 1592

 Score =  419 bits (1076), Expect = e-127
 Identities = 218/600 (36%), Positives = 326/600 (54%), Gaps = 1/600 (0%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            KIW +        +  N  Q L+L +        ++L++VYAKC ++ R  LW  L   +
Sbjct: 32   KIWAFIDE---VDIMYNMVQQLTLRLFHTETHVELILNLVYAKCDHIERIELWDSLYTMA 88

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
             +++ PW VGGDFN +    E+ G    S   + DF   +    L D GF G+ FTW N 
Sbjct: 89   SDMTSPWLVGGDFNVIWDEEEKYGGLPVSLNEIDDFRHCVNTCNLSDLGFKGSIFTWWNG 148

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
            R+ +D I+ RLD  L N    E FP   V HL +  SDH P++L    +  P  +PFRFL
Sbjct: 149  RSEDDCIFKRLDRCLGNVELQEIFPGLEVTHLSKTGSDHRPMMLKCDIETPPIKKPFRFL 208

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
              WT HDSF  VV + W      NP +               W++  +G+IF +I   + 
Sbjct: 209  NFWTKHDSFKAVVKENWTTDFCANPFVLFNHKLKKLKKVLSTWSRTTYGDIFQKIASLEE 268

Query: 730  DLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHS 909
             +   E   +   T  + +     + ++     +EE FWKQKA  SW  +GDRNTK FHS
Sbjct: 269  VVLVHERHFEAYPTQLNRERLKKVQAEMMKYLALEEEFWKQKAGMSWFKDGDRNTKFFHS 328

Query: 910  SARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSLT 1089
              R R +R  +  IKD +G       +I   A+  +   F           +  +P  + 
Sbjct: 329  QVRGRRKRLQLKRIKDCNGCWLEEDEKIAEEAVNFYREQFTESVVPTVFHIMEHVPTLIE 388

Query: 1090 LEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFAG 1269
             E N++L+  P+  E ++A + +  +   GPDG++  F+  CW IVGED++    + F G
Sbjct: 389  NEQNDKLIAIPTRDEVKNAVYGLNGESTSGPDGYTGAFYQTCWDIVGEDIYAMVVAFFNG 448

Query: 1270 EQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLEQ 1449
            +Q+P+  T + + L+PK  E T F+D RPISL N + KIFS+++  R+ ++LP +IS EQ
Sbjct: 449  QQLPKCVTHTNLVLLPKKKEITTFSDLRPISLSNFINKIFSRVIHERLVELLPGIISEEQ 508

Query: 1450 GAFVKGRAITENIAMALEAMRHLDRKTRGG-NIILKLDLEKAYDRIDWSFLKMVLLKFGF 1626
              FVK R+I EN+ +  E +  +  +T+ G N+++KLD+ KAYDR+ W FL  +L K GF
Sbjct: 509  AGFVKRRSIVENVLLTQEIITDIRLRTKAGPNVVIKLDMTKAYDRVSWLFLTKMLRKLGF 568

Query: 1627 SDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
             +R++ ++    GN+W+SVL+NG+A  FFKS RG++QGDPLSP LFI+AAEA SR    L
Sbjct: 569  CERFIGMIFDLVGNNWYSVLVNGQAHGFFKSSRGVKQGDPLSPTLFILAAEALSRGLNSL 628


>XP_019267209.1 PREDICTED: uncharacterized protein LOC109244554 [Nicotiana attenuata]
          Length = 1737

 Score =  418 bits (1074), Expect = e-126
 Identities = 222/600 (37%), Positives = 325/600 (54%), Gaps = 1/600 (0%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            KIW +       T+  N  Q L+L +        ++L++VYAKC  + R  LW  L   +
Sbjct: 63   KIWAFIDEVFEVTILYNMVQQLTLKLFHTETHVELILTLVYAKCDPIERIELWDSLYAMA 122

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
             +++ PW VGGDFN +    E+ G    S   + DF   I    L D GF G+ +TW N 
Sbjct: 123  TDMTSPWLVGGDFNVIWDEEEKFGGLPVSLNEVDDFRHCINTCNLADLGFKGSIYTWWNG 182

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
            R+ ED I+ RLD  L N    + FP   + HL +  SDH P+LL    +  P  +PFRFL
Sbjct: 183  RSEEDCIFKRLDRCLGNMELQQTFPGLEITHLSKTGSDHCPMLLKCDIEVEPIKKPFRFL 242

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
              WT H++F  VV + W    + NP I                    +G+IF +I   + 
Sbjct: 243  NFWTKHENFKAVVKENWSADFAANPFII-----------------STYGDIFQKIASLEE 285

Query: 730  DLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHS 909
             +   E   +   T  + +     + +L     +EE FWKQKA  +W  +GDRNT+ FH+
Sbjct: 286  VVLVHEKQFEAVPTQMNRERLHKVKAELIRYLALEEEFWKQKAGMAWFKDGDRNTRFFHA 345

Query: 910  SARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSLT 1089
              R R ++  +  IKD +G       QI   A+  +   F  +        +  +P  + 
Sbjct: 346  QVRGRRKKLQLKRIKDSNGNWIEEDVQIAEEAVNFYRDQFTEDVVPSVFHIMDHVPTLID 405

Query: 1090 LEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFAG 1269
             E N  L+  P+ +E + A + +  D A GPDGF+  FF  CW IVG+DV+    + F G
Sbjct: 406  EEQNEMLIANPTKEEVKKAVYGLNGDSAGGPDGFTGAFFHTCWDIVGDDVYAMVLAFFNG 465

Query: 1270 EQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLEQ 1449
            +Q+P++ T + + L+PK  E   F+D RPISL N V KIFS+++  R+ ++LP +IS EQ
Sbjct: 466  QQLPKSITHTNLVLLPKKKEVNTFSDLRPISLSNFVNKIFSRVIHERLVELLPNIISEEQ 525

Query: 1450 GAFVKGRAITENIAMALEAMRHLDRKTRGG-NIILKLDLEKAYDRIDWSFLKMVLLKFGF 1626
              FVKGR+I EN+ +  E +  +  +T+ G N+++KLD+ KAYDR+ W FL  +L K GF
Sbjct: 526  AGFVKGRSIVENVLLTQEIITDIRLRTKAGPNVVMKLDMTKAYDRLSWLFLTKMLRKLGF 585

Query: 1627 SDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
             +R++ L+    GN+W+SVLING++  FFKS RG++QGDPLSP LFIIAAEA SR    L
Sbjct: 586  CERFIGLIFDLVGNNWYSVLINGQSHGFFKSTRGVKQGDPLSPTLFIIAAEALSRGLNSL 645


>EOY19200.1 Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  412 bits (1058), Expect = e-126
 Identities = 234/603 (38%), Positives = 325/603 (53%), Gaps = 1/603 (0%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            KIW+++   +  TV  +  Q L + I+F    F    S +YAKC+   RR LW  L + +
Sbjct: 63   KIWMFWAEEVGCTVQRDHHQCLHVRIAFPWLPFSFQTSFIYAKCTKTERRHLWDCLRNVA 122

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
             ++  PW VGGDFN + +  ER      +  SM +FA+A+   GL+DAGF GN FTW+N 
Sbjct: 123  TDMQEPWLVGGDFNTILSREERLFGAEPNAGSMEEFATALFDCGLMDAGFEGNKFTWTNT 182

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
                  ++ RLD V+ N  WA +F   R+ HL R   DH PLL+S           FRFL
Sbjct: 183  -----HMFQRLDRVVYNMEWASSFSHTRIHHLNRDGFDHCPLLISCCNFSLQRPSSFRFL 237

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
              W  H  F   V   W     +  ++               WNK+VFG+IF  ++ A+ 
Sbjct: 238  HAWVKHHGFLNFVANNWRQTIYSTGLMAFWNKQQRLKKSLKGWNKDVFGDIFSNLRAAEK 297

Query: 730  DLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHS 909
              ++ E   Q    D+S+ +    +   A L                           + 
Sbjct: 298  TAEEKELTYQH---DSSVFNRTQLQYAYAKLN--------------------------NQ 328

Query: 910  SARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAA-IPRSL 1086
              ++RV RN I  I+D  G        I+++A++ FE L +AE   D S F A  IP+ L
Sbjct: 329  MQKKRV-RNSIFKIQDSEGTLMEEPGLIESSAVEFFENLLKAE-NYDLSRFKAEFIPQML 386

Query: 1087 TLEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFA 1266
            +  DNN L   P  QE + A F I KD   GPDGFSS F+  CWPI+ ED+  A    F 
Sbjct: 387  SDADNNLLCAEPQLQEVKDAVFAIDKDSVVGPDGFSSFFYQQCWPIIAEDLLAAVRDFFK 446

Query: 1267 GEQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLE 1446
            G   PR  T++ + L+ K  +A  ++D+RPISLC ++ KI +K+LA R+SK+LP LIS  
Sbjct: 447  GAVFPRGVTSTTLVLLAKKPDAATWSDFRPISLCTILNKIVTKLLANRLSKVLPSLISEN 506

Query: 1447 QGAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLKFGF 1626
            Q  FV GR I +NI +A E +  +D K RGGN++LKLD+ KAYDR++W FL +VL +FGF
Sbjct: 507  QSGFVSGRLINDNILLAQELIGKIDYKARGGNVVLKLDMMKAYDRLNWDFLILVLERFGF 566

Query: 1627 SDRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
            +D W+ ++  C  N WFSVLING +A +FKS RGLRQGD +SP LFI+AAE  SR   +L
Sbjct: 567  NDMWIDMIRRCITNCWFSVLINGHSAGYFKSERGLRQGDSISPMLFILAAEYLSRGINEL 626

Query: 1807 IIR 1815
              R
Sbjct: 627  FSR 629


>EOY02242.1 Uncharacterized protein TCM_016767 [Theobroma cacao]
          Length = 1707

 Score =  414 bits (1065), Expect = e-125
 Identities = 221/537 (41%), Positives = 305/537 (56%)
 Frame = +1

Query: 196  ISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNNRT 375
            I GPW VGGDFN++ +  ER         SM DFAS +   GL+DAGF GNSFTW+NN  
Sbjct: 845  IEGPWMVGGDFNSIVSTVERLNGAAPHVGSMEDFASTLFDCGLVDAGFEGNSFTWTNNH- 903

Query: 376  GEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFLRM 555
                ++ RLD V+ N  WA+ F S RV+HL    SDH PLL+S           FRFL  
Sbjct: 904  ----MFQRLDRVVYNPEWAQCFSSTRVQHLNLDGSDHCPLLISCNTASQKGPSTFRFLHA 959

Query: 556  WTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKADL 735
            WT H  F   + K+W      + +                WNK +FG+IF +++ A+ + 
Sbjct: 960  WTKHHDFLPFITKSWQTPLQGSGLSTFWFKQQRLKRDLKWWNKHIFGDIFEKLRLAEEEA 1019

Query: 736  DQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHSSA 915
             + E   Q + + T+      A  KL     +EE+FW+QK    WL EG+ NTK FH   
Sbjct: 1020 KKREIEFQHNPSLTNRNLMHKAYTKLNRQLSIEELFWQQKFSVKWLVEGESNTKFFHMRM 1079

Query: 916  RERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSLTLE 1095
            R++  R+ +  I+D  G  F     I+ +A   F  L QAE   +     + IPR ++  
Sbjct: 1080 RKKRVRSHVFQIQDSEGNVFDDTHSIQKSATDFFRNLMQAENCDNSRFDPSLIPRIISSA 1139

Query: 1096 DNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFAGEQ 1275
            DN  L  APS QE +   F+I KD   G DGFSS F+  CW I+  D+  A    F G  
Sbjct: 1140 DNEFLCAAPSLQEVKETVFNINKDSVAGSDGFSSLFYQHCWDIIKHDLLDAVLDFFRGSP 1199

Query: 1276 IPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLEQGA 1455
            +PR  T++ + L+PK   A  ++DY PISLC V+ KI +K+LA R+SKILP +IS  Q  
Sbjct: 1200 LPRGVTSTTLVLLPKKPNACHWSDYSPISLCTVLNKIVTKLLANRLSKILPLIISENQSG 1259

Query: 1456 FVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLKFGFSDR 1635
            FV GR I++NI +A E +  +D K+RGGN++LKLD+ KAYDR++W FL +++  FGF+  
Sbjct: 1260 FVNGRLISDNILLAHELIGKIDAKSRGGNVVLKLDMAKAYDRLNWDFLYLMMEHFGFNAH 1319

Query: 1636 WVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQL 1806
            W+ ++++C  N W S+LING    +FKS RGLRQGD +SP LFI+AA+  SR    L
Sbjct: 1320 WINMIKSCISNYWLSLLINGSLVGYFKSERGLRQGDSISPMLFILAADYLSRGLNHL 1376


>XP_019263798.1 PREDICTED: uncharacterized protein LOC109241514 [Nicotiana attenuata]
          Length = 1511

 Score =  411 bits (1056), Expect = e-125
 Identities = 216/604 (35%), Positives = 316/604 (52%)
 Frame = +1

Query: 10   KIWVYYHNYINATVHSNSSQMLSLHISFQNCTFPVLLSIVYAKCSYLSRRSLWLELEHFS 189
            +IW+++   +   V   S Q ++  I +   T  +++S VYAKC  + R  LW  L   +
Sbjct: 189  QIWIFWDELLECRVIEESEQQVTCEIKWNGDT--IIISAVYAKCDAVLREDLWDSLRDIA 246

Query: 190  RNISGPWAVGGDFNAVSTASERQGTRTASRLSMSDFASAILHAGLLDAGFVGNSFTWSNN 369
                 PW + GDFN +    E++G +         F   I+   L+D G+ G+ FTW N 
Sbjct: 247  DRYKLPWLIAGDFNCIVDPGEKKGGKPHGMSKSLPFIQCIMDCELIDPGYSGSIFTWCNG 306

Query: 370  RTGEDRIWARLD*VLMNERWAEAFPSFRVEHLPRANSDHAPLLLSFPPQHTPAIRPFRFL 549
               E RIW RLD VL+N+ W   F S  V HL R  SDH+PL +     H   I+ FRFL
Sbjct: 307  WCPEKRIWKRLDRVLINQEWLNLFDSTSVNHLIRTGSDHSPLFVIAKTTHREPIKYFRFL 366

Query: 550  RMWTLHDSFHQVVHKAWGGMCSTNPMINVXXXXXXXXXXXXVWNKEVFGNIFCQIKEAKA 729
              WT    F +VV +AW      +PM                W++   GNIF +I+E + 
Sbjct: 367  DFWTKEADFSRVVEQAWNMEVQGSPMWKFHMKLKNTCKKLSEWSRNTLGNIFDKIEELEH 426

Query: 730  DLDQLEGLLQDSGTDTSLQDSLAARRKLAHLELMEEIFWKQKARNSWLDEGDRNTKHFHS 909
             ++++E  +    ++ +      A   L      EE FWKQK+   W  EG+ N+K FHS
Sbjct: 427  KVEEMETNIIADNSEVNRAGLNQANALLVRAYKKEESFWKQKSGVKWFVEGEVNSKFFHS 486

Query: 910  SARERVRRNMITSIKDDSGITFTSQAQIKAAAIQHFERLFQAEPTVDPSSFLAAIPRSLT 1089
              + R +R  +  ++ + G       +I   AI  F+  F  E   +  S L  IP  + 
Sbjct: 487  VVKGRKKRLTLKKMRKEDGTWVEGDEEIAHEAISFFQNQFTRENFDNDFSVLGCIPTIID 546

Query: 1090 LEDNNELLVAPSFQETQSAAFDIPKDGAPGPDGFSSTFFIDCWPIVGEDVHKAACSLFAG 1269
              DN +L+  P+ +E +   F +    APGPDG S  F+  CW I+ ED+       FAG
Sbjct: 547  DADNEKLIAVPTMEELKDVVFSMSSQSAPGPDGVSGKFYHSCWEIIKEDLLLMVLDFFAG 606

Query: 1270 EQIPRAFTTSMISLIPKSSEATKFADYRPISLCNVVYKIFSKILATRMSKILPKLISLEQ 1449
             QIP+AFT + + LIPK      F + RPISL N   KI SK++  R+S ++ KL+S  Q
Sbjct: 607  NQIPKAFTHTCLVLIPKVDYPQSFTELRPISLSNFSCKILSKLVNQRLSPLMQKLVSPNQ 666

Query: 1450 GAFVKGRAITENIAMALEAMRHLDRKTRGGNIILKLDLEKAYDRIDWSFLKMVLLKFGFS 1629
              F+KGR+ITENI +  + + ++ + +  GN++LKLD+ KAYDR+ W +L  VL + GFS
Sbjct: 667  TGFIKGRSITENIMLTQDMVHNIVKPSASGNVVLKLDMAKAYDRVSWEYLCQVLRQMGFS 726

Query: 1630 DRWVALMEAC*GNSWFSVLINGEAASFFKSFRGLRQGDPLSPGLFIIAAEAFSRSFRQLI 1809
            + W+ ++     N W+S+ ING    FFKS RG++QGDPLSP LF+I AE  SR    LI
Sbjct: 727  EIWIDIVWRLMTNVWYSININGVRHGFFKSSRGIKQGDPLSPSLFVIGAELLSRLMDNLI 786

Query: 1810 IRGW 1821
              G+
Sbjct: 787  DSGF 790

