BLASTX nr result

ID: Phellodendron21_contig00019512 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Phellodendron21_contig00019512
         (1878 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_006484256.1 PREDICTED: uncharacterized protein LOC102625369 i...   604   0.0  
XP_006484258.1 PREDICTED: uncharacterized protein LOC102625369 i...   583   0.0  
KDO70270.1 hypothetical protein CISIN_1g0116142mg [Citrus sinensis]   572   0.0  
XP_006437853.1 hypothetical protein CICLE_v10031644mg, partial [...   525   e-180
KDO70271.1 hypothetical protein CISIN_1g0116142mg [Citrus sinensis]   495   e-168
XP_006437852.1 hypothetical protein CICLE_v10031644mg, partial [...   449   e-151
XP_006437854.1 hypothetical protein CICLE_v10031644mg [Citrus cl...   325   e-103
XP_017971961.1 PREDICTED: uncharacterized protein LOC18610175 is...   330   e-102
XP_007045751.2 PREDICTED: uncharacterized protein LOC18610175 is...   330   e-102
EOY01581.1 18S pre-ribosomal assembly protein gar2-related, puta...   328   e-101
XP_017971960.1 PREDICTED: uncharacterized protein LOC18610175 is...   327   e-101
GAV79538.1 hypothetical protein CFOL_v3_23003 [Cephalotus follic...   320   3e-99
XP_018860380.1 PREDICTED: uncharacterized protein LOC109022047 i...   321   5e-99
XP_018860379.1 PREDICTED: uncharacterized protein LOC109022047 i...   322   6e-99
XP_018860377.1 PREDICTED: uncharacterized protein LOC109022047 i...   322   7e-99
XP_012080467.1 PREDICTED: uncharacterized protein LOC105640684 i...   314   3e-96
XP_012080460.1 PREDICTED: uncharacterized protein LOC105640684 i...   313   5e-96
XP_012080459.1 PREDICTED: uncharacterized protein LOC105640684 i...   314   6e-96
EOY01582.1 18S pre-ribosomal assembly protein gar2-related, puta...   303   1e-92
XP_007045750.2 PREDICTED: uncharacterized protein LOC18610175 is...   302   2e-92
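
Note on reading the summary table above: each row is an accession, a description truncated with "...", a bit score, and an E-value. Legacy BLAST abbreviates an E-value such as 1e-180 to "e-180", which most numeric parsers reject. The following is a minimal sketch, not part of the BLAST output, of how one row could be parsed in Python. The names HIT_LINE, parse_evalue and parse_hit_line are hypothetical helpers introduced only for illustration, and the regular expression assumes the bit score and E-value are the last two whitespace-separated fields of the row.

```python
import re

# One row of the "Sequences producing significant alignments" table:
# accession, free-text description, bit score, E-value (last two fields).
HIT_LINE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_evalue(text):
    """Convert a legacy BLAST E-value string to float.

    Legacy reports drop the mantissa when it is 1 (e.g. "e-180"),
    so a leading "1" is restored before conversion.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hit_line(line):
    """Return a dict for one summary row, or None if the line is not a hit row."""
    match = HIT_LINE.match(line)
    if match is None:
        return None
    return {
        "accession": match.group("acc"),
        "description": match.group("desc"),
        "bits": float(match.group("bits")),
        "evalue": parse_evalue(match.group("evalue")),
    }


if __name__ == "__main__":
    row = "XP_006437853.1 hypothetical protein CICLE_v10031644mg, partial [...   525   e-180"
    print(parse_hit_line(row))
```

Header and blank lines return None, so the function can be applied to every line between the table header and the first ">" record. For anything beyond a quick look, a maintained BLAST parser is likely a safer choice than a hand-written pattern.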

>XP_006484256.1 PREDICTED: uncharacterized protein LOC102625369 isoform X1 [Citrus
            sinensis] XP_006484257.1 PREDICTED: uncharacterized
            protein LOC102625369 isoform X1 [Citrus sinensis]
          Length = 496

 Score =  604 bits (1558), Expect = 0.0
 Identities = 332/529 (62%), Positives = 373/529 (70%), Gaps = 16/529 (3%)
 Frame = +2

Query: 98   MKFVSDGEQVFPHSTLGR--KPDSKHFDYHENGLDYTGLKSENSIVKEYHIGTLCASKAN 271
            MKFVSD EQ+FPH TLG   KPDSKH                         G + AS +N
Sbjct: 1    MKFVSDSEQLFPHLTLGHSHKPDSKHS------------------------GAISASNSN 36

Query: 272  EGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFEDSV 451
            EGVAD LPH TN  DG T RKL  STSLN+L KDNEK ++D E+P+SHS  EMESF + V
Sbjct: 37   EGVADRLPHVTNDVDGCTVRKLERSTSLNDLAKDNEKNVQDLESPNSHSCGEMESFREPV 96

Query: 452  FYMDKIVTECELPELIVCYEENTYHIKDICIDEGVRFHDRILFESNVNKKGVRAFFSPEE 631
            FYMDK VTECELPELIVCY+ENTYH+KDICIDEGV  HDRILFES+V K  VR+F  P+E
Sbjct: 97   FYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVGKS-VRSFLPPKE 155

Query: 632  DRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDLCDSKDLLP 811
            DRN+EL+EE+K+S +P+PDVLKSSAEN S E IVNRC           I D+CDSKDL P
Sbjct: 156  DRNSELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRP 215

Query: 812  AGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEAEKESFQFQGSS 991
            AG++ DD T+E  N  S+KLF LGDLLSMHNVGT+NS SKS+  NE++AEKESFQ  GSS
Sbjct: 216  AGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQ--GSS 273

Query: 992  GKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSGEAILANPSLVSAREKAH 1171
             KA+LANP      EE+NGGT E +  G+D VSASEES NG GE I  NP+LVSA EKAH
Sbjct: 274  AKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGNPTLVSASEKAH 327

Query: 1172 DNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSITFDFDASAPGARGKEEHLQI 1351
            D S EASLAS D VSA +ESTKISTA+  SYNSMVETGSITFDFDASAPGA GKEE LQI
Sbjct: 328  DKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGASGKEEPLQI 387

Query: 1352 GDSQRIESLGMSKLEDAPRQSVSSQVHSG--------------LXXXXXXXAYXXXXXXX 1489
            GDSQRIE+ GMS+LEDAPRQSVSSQ HSG              L       AY       
Sbjct: 388  GDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSGPVAYSGSISLR 447

Query: 1490 XXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHKWRQGLLCCRF 1636
                      FAFPILQ EW  SPVRMAKA+RRHY +HKW+QGLLCCRF
Sbjct: 448  SDSSTTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHKWKQGLLCCRF 496


>XP_006484258.1 PREDICTED: uncharacterized protein LOC102625369 isoform X2 [Citrus
            sinensis] XP_015387480.1 PREDICTED: uncharacterized
            protein LOC102625369 isoform X2 [Citrus sinensis]
          Length = 483

 Score =  583 bits (1504), Expect = 0.0
 Identities = 313/480 (65%), Positives = 353/480 (73%), Gaps = 14/480 (2%)
 Frame = +2

Query: 239  HIGTLCASKANEGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHS 418
            H G + AS +NEGVAD LPH TN  DG T RKL  STSLN+L KDNEK ++D E+P+SHS
Sbjct: 13   HSGAISASNSNEGVADRLPHVTNDVDGCTVRKLERSTSLNDLAKDNEKNVQDLESPNSHS 72

Query: 419  FVEMESFEDSVFYMDKIVTECELPELIVCYEENTYHIKDICIDEGVRFHDRILFESNVNK 598
              EMESF + VFYMDK VTECELPELIVCY+ENTYH+KDICIDEGV  HDRILFES+V K
Sbjct: 73   CGEMESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVGK 132

Query: 599  KGVRAFFSPEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXI 778
              VR+F  P+EDRN+EL+EE+K+S +P+PDVLKSSAEN S E IVNRC           I
Sbjct: 133  S-VRSFLPPKEDRNSELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDI 191

Query: 779  VDLCDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEA 958
             D+CDSKDL PAG++ DD T+E  N  S+KLF LGDLLSMHNVGT+NS SKS+  NE++A
Sbjct: 192  DDICDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDA 251

Query: 959  EKESFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSGEAILAN 1138
            EKESFQ  GSS KA+LANP      EE+NGGT E +  G+D VSASEES NG GE I  N
Sbjct: 252  EKESFQ--GSSAKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGN 303

Query: 1139 PSLVSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSITFDFDASAP 1318
            P+LVSA EKAHD S EASLAS D VSA +ESTKISTA+  SYNSMVETGSITFDFDASAP
Sbjct: 304  PTLVSASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAP 363

Query: 1319 GARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVHSG--------------LXXXXX 1456
            GA GKEE LQIGDSQRIE+ GMS+LEDAPRQSVSSQ HSG              L     
Sbjct: 364  GASGKEEPLQIGDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSG 423

Query: 1457 XXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHKWRQGLLCCRF 1636
              AY                 FAFPILQ EW  SPVRMAKA+RRHY +HKW+QGLLCCRF
Sbjct: 424  PVAYSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHKWKQGLLCCRF 483


>KDO70270.1 hypothetical protein CISIN_1g0116142mg [Citrus sinensis]
          Length = 481

 Score =  572 bits (1474), Expect = 0.0
 Identities = 309/480 (64%), Positives = 351/480 (73%), Gaps = 14/480 (2%)
 Frame = +2

Query: 239  HIGTLCASKANEGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHS 418
            H G + AS +NEGVAD LPH TN  DG T RKL  STSLN+L KDNEK ++D E+P+SHS
Sbjct: 13   HSGAISASNSNEGVADRLPHVTNDVDGCTVRKLERSTSLNDLAKDNEKNVQDLESPNSHS 72

Query: 419  FVEMESFEDSVFYMDKIVTECELPELIVCYEENTYHIKDICIDEGVRFHDRILFESNVNK 598
              EMESF + VFYMDK VTECELPELIVCY+ENTYH+KDICIDEGV  HDRILFES+V  
Sbjct: 73   CGEMESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVG- 131

Query: 599  KGVRAFFSPEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXI 778
            K VR+F  P+EDRN+E++EE+K+S +P+PDVLKSSAEN S + IVNRC           I
Sbjct: 132  KSVRSFLPPKEDRNSEVLEESKNSVIPIPDVLKSSAENYSDKRIVNRCGSSQESDSDEDI 191

Query: 779  VDLCDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEA 958
             D+CDSKDL PAG++ DD T+E  N  S+KLF LGDLLSMHNVGT+NS SKS+  NE++A
Sbjct: 192  DDICDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDA 251

Query: 959  EKESFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSGEAILAN 1138
            EKES  FQGSS KA+LANP      EE+NGGT E +  G+D VSASEES NG GE I  N
Sbjct: 252  EKES--FQGSSAKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGN 303

Query: 1139 PSLVSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSITFDFDASAP 1318
            P+LVSA EKAHD S EASLAS D VSA +ESTKISTA+  SYNSMVETGSITFDFDASAP
Sbjct: 304  PTLVSASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAP 363

Query: 1319 GARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVHSG--------------LXXXXX 1456
            GA GKEE L  GDSQRIE+ GMS+LEDAPRQSVSSQ HSG              L     
Sbjct: 364  GASGKEEPL--GDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSG 421

Query: 1457 XXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHKWRQGLLCCRF 1636
              AY                 FAFPILQ EW  SPVRMAKA+RRHY +HKW+QGLLCCRF
Sbjct: 422  PVAYSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHKWKQGLLCCRF 481


>XP_006437853.1 hypothetical protein CICLE_v10031644mg, partial [Citrus clementina]
            ESR51093.1 hypothetical protein CICLE_v10031644mg,
            partial [Citrus clementina]
          Length = 410

 Score =  525 bits (1353), Expect = e-180
 Identities = 279/401 (69%), Positives = 316/401 (78%)
 Frame = +2

Query: 239  HIGTLCASKANEGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHS 418
            H G + AS +NEGVAD LPH TN  DG T RKL  STSLN+L KDNEK ++D E+P+SHS
Sbjct: 13   HSGAISASNSNEGVADRLPHVTNDVDGCTVRKLERSTSLNDLAKDNEKNVQDLESPNSHS 72

Query: 419  FVEMESFEDSVFYMDKIVTECELPELIVCYEENTYHIKDICIDEGVRFHDRILFESNVNK 598
              EMESF + VFYMDK VTECELPELIVCY+ENTYH+KDICIDEGV  HDRILFES+V K
Sbjct: 73   CGEMESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVGK 132

Query: 599  KGVRAFFSPEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXI 778
              VR+F  P+EDRN+EL+EE+K+S +P+PDVLKSSAEN S E IVNRC           I
Sbjct: 133  S-VRSFLPPKEDRNSELLEESKNSVIPIPDVLKSSAENYSDERIVNRCGSSQESDSDEDI 191

Query: 779  VDLCDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEA 958
             D+CDSKDL PAG++ DD T+E  N  S+KLF LGDLLSMHNVGT+NS SKS+  NE++A
Sbjct: 192  DDICDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDA 251

Query: 959  EKESFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSGEAILAN 1138
            EKESFQ  GSS KA+LANP      EE+NGGT E +  G+D VSASEES NG GE I  N
Sbjct: 252  EKESFQ--GSSAKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGN 303

Query: 1139 PSLVSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSITFDFDASAP 1318
            P+LVSA EKAHD S EASLAS D VSA +ESTKISTA+  SYNSMVETGSITFDFDASAP
Sbjct: 304  PTLVSASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAP 363

Query: 1319 GARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVHSGL 1441
            GA GKEE LQIGDSQRIE+ GMS+LEDAPRQSVSSQ HSGL
Sbjct: 364  GASGKEEPLQIGDSQRIETPGMSRLEDAPRQSVSSQFHSGL 404


>KDO70271.1 hypothetical protein CISIN_1g0116142mg [Citrus sinensis]
          Length = 406

 Score =  495 bits (1275), Expect = e-168
 Identities = 270/417 (64%), Positives = 305/417 (73%), Gaps = 14/417 (3%)
 Frame = +2

Query: 428  MESFEDSVFYMDKIVTECELPELIVCYEENTYHIKDICIDEGVRFHDRILFESNVNKKGV 607
            MESF + VFYMDK VTECELPELIVCY+ENTYH+KDICIDEGV  HDRILFES+V K  V
Sbjct: 1    MESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVGKS-V 59

Query: 608  RAFFSPEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDL 787
            R+F  P+EDRN+E++EE+K+S +P+PDVLKSSAEN S + IVNRC           I D+
Sbjct: 60   RSFLPPKEDRNSEVLEESKNSVIPIPDVLKSSAENYSDKRIVNRCGSSQESDSDEDIDDI 119

Query: 788  CDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEAEKE 967
            CDSKDL PAG++ DD T+E  N  S+KLF LGDLLSMHNVGT+NS SKS+  NE++AEKE
Sbjct: 120  CDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKE 179

Query: 968  SFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSGEAILANPSL 1147
            S  FQGSS KA+LANP      EE+NGGT E +  G+D VSASEES NG GE I  NP+L
Sbjct: 180  S--FQGSSAKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGNPTL 231

Query: 1148 VSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSITFDFDASAPGAR 1327
            VSA EKAHD S EASLAS D VSA +ESTKISTA+  SYNSMVETGSITFDFDASAPGA 
Sbjct: 232  VSASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGAS 291

Query: 1328 GKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVHSG--------------LXXXXXXXA 1465
            GKEE L  GDSQRIE+ GMS+LEDAPRQSVSSQ HSG              L       A
Sbjct: 292  GKEEPL--GDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSGPVA 349

Query: 1466 YXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHKWRQGLLCCRF 1636
            Y                 FAFPILQ EW  SPVRMAKA+RRHY +HKW+QGLLCCRF
Sbjct: 350  YSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHKWKQGLLCCRF 406


>XP_006437852.1 hypothetical protein CICLE_v10031644mg, partial [Citrus clementina]
            ESR51092.1 hypothetical protein CICLE_v10031644mg,
            partial [Citrus clementina]
          Length = 335

 Score =  449 bits (1154), Expect = e-151
 Identities = 240/338 (71%), Positives = 270/338 (79%)
 Frame = +2

Query: 428  MESFEDSVFYMDKIVTECELPELIVCYEENTYHIKDICIDEGVRFHDRILFESNVNKKGV 607
            MESF + VFYMDK VTECELPELIVCY+ENTYH+KDICIDEGV  HDRILFES+V K  V
Sbjct: 1    MESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVGKS-V 59

Query: 608  RAFFSPEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDL 787
            R+F  P+EDRN+EL+EE+K+S +P+PDVLKSSAEN S E IVNRC           I D+
Sbjct: 60   RSFLPPKEDRNSELLEESKNSVIPIPDVLKSSAENYSDERIVNRCGSSQESDSDEDIDDI 119

Query: 788  CDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEAEKE 967
            CDSKDL PAG++ DD T+E  N  S+KLF LGDLLSMHNVGT+NS SKS+  NE++AEKE
Sbjct: 120  CDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKE 179

Query: 968  SFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSGEAILANPSL 1147
            SFQ  GSS KA+LANP      EE+NGGT E +  G+D VSASEES NG GE I  NP+L
Sbjct: 180  SFQ--GSSAKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGNPTL 231

Query: 1148 VSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSITFDFDASAPGAR 1327
            VSA EKAHD S EASLAS D VSA +ESTKISTA+  SYNSMVETGSITFDFDASAPGA 
Sbjct: 232  VSASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGAS 291

Query: 1328 GKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVHSGL 1441
            GKEE LQIGDSQRIE+ GMS+LEDAPRQSVSSQ HSGL
Sbjct: 292  GKEEPLQIGDSQRIETPGMSRLEDAPRQSVSSQFHSGL 329


>XP_006437854.1 hypothetical protein CICLE_v10031644mg [Citrus clementina]
           ESR51094.1 hypothetical protein CICLE_v10031644mg
           [Citrus clementina]
          Length = 297

 Score =  325 bits (832), Expect = e-103
 Identities = 164/246 (66%), Positives = 192/246 (78%)
 Frame = +2

Query: 239 HIGTLCASKANEGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHS 418
           H G + AS +NEGVAD LPH TN  DG T RKL  STSLN+L KDNEK ++D E+P+SHS
Sbjct: 13  HSGAISASNSNEGVADRLPHVTNDVDGCTVRKLERSTSLNDLAKDNEKNVQDLESPNSHS 72

Query: 419 FVEMESFEDSVFYMDKIVTECELPELIVCYEENTYHIKDICIDEGVRFHDRILFESNVNK 598
             EMESF + VFYMDK VTECELPELIVCY+ENTYH+KDICIDEGV  HDRILFES+V  
Sbjct: 73  CGEMESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVG- 131

Query: 599 KGVRAFFSPEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXI 778
           K VR+F  P+EDRN+EL+EE+K+S +P+PDVLKSSAEN S E IVNRC           I
Sbjct: 132 KSVRSFLPPKEDRNSELLEESKNSVIPIPDVLKSSAENYSDERIVNRCGSSQESDSDEDI 191

Query: 779 VDLCDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEA 958
            D+CDSKDL PAG++ DD T+E  N  S+KLF LGDLLSMHNVGT+NS SKS+  NE++A
Sbjct: 192 DDICDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDA 251

Query: 959 EKESFQ 976
           EKESFQ
Sbjct: 252 EKESFQ 257


>XP_017971961.1 PREDICTED: uncharacterized protein LOC18610175 isoform X3 [Theobroma
            cacao]
          Length = 527

 Score =  330 bits (845), Expect = e-102
 Identities = 215/549 (39%), Positives = 290/549 (52%), Gaps = 41/549 (7%)
 Frame = +2

Query: 113  DGEQVFPHSTLGRKPDSKHFDY---------HENGLDYTGLKSENSIVKEYHIGTLCASK 265
            D EQV  HST G K DSK + +          +  LD TGL +E  +VKE   G +   K
Sbjct: 4    DNEQVLCHSTTGHKSDSKPYSFLADTKPFENKDKPLDSTGLNAEG-VVKENQNGVMHDIK 62

Query: 266  ANEGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFED 445
             N+G +D   +  N   GW A KL  S S+N+    NEK +RD    +S S   M+SF++
Sbjct: 63   GNDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQN 122

Query: 446  SVFYMDKIVTECELPELIVCYEENTYH-IKDICIDEGVRFHDRILFESNVNKKGVRAFFS 622
            SVFY+DK V ECELPEL+VCY+E+TYH +KDICIDEGV   D+ LFE+ +++K    F  
Sbjct: 123  SVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLP 182

Query: 623  PEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDL----- 787
             E++++++LM E  ++DM + DV  S  EN S +DI N C           + D+     
Sbjct: 183  SEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLE 242

Query: 788  -----------CDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKS 934
                       CDSKDL+    +  D    + +  SK+LF+LG+LLSM  +   NS + S
Sbjct: 243  KNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMS 302

Query: 935  SKDNELEAEKESFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNG 1114
            S       E++S  FQ SS K  +  P LV   EES    EEA+     LVSA+EE  +G
Sbjct: 303  SDCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSG 360

Query: 1115 SGEAILANPSLVSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSIT 1294
             GEAIL +P+ VS  E                     EST  S  + +SY++ +ETGSIT
Sbjct: 361  KGEAILISPAQVSTPE---------------------ESTSSSLVNEVSYDNKLETGSIT 399

Query: 1295 FDFDASAPGARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVH-------------- 1432
            F+ D+SAP +   E H  + DS+ + +    KLE A  QS+S+ +               
Sbjct: 400  FNLDSSAPTSSKDECHHNL-DSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLV 458

Query: 1433 SGLXXXXXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHK-W 1609
            +GL       AY                 FAFPILQ+EW+ SPVRMAKA+RRHY +HK W
Sbjct: 459  TGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGW 518

Query: 1610 RQGLLCCRF 1636
            R GLLCCRF
Sbjct: 519  RHGLLCCRF 527


>XP_007045751.2 PREDICTED: uncharacterized protein LOC18610175 isoform X1 [Theobroma
            cacao]
          Length = 543

 Score =  330 bits (845), Expect = e-102
 Identities = 215/549 (39%), Positives = 290/549 (52%), Gaps = 41/549 (7%)
 Frame = +2

Query: 113  DGEQVFPHSTLGRKPDSKHFDY---------HENGLDYTGLKSENSIVKEYHIGTLCASK 265
            D EQV  HST G K DSK + +          +  LD TGL +E  +VKE   G +   K
Sbjct: 20   DNEQVLCHSTTGHKSDSKPYSFLADTKPFENKDKPLDSTGLNAEG-VVKENQNGVMHDIK 78

Query: 266  ANEGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFED 445
             N+G +D   +  N   GW A KL  S S+N+    NEK +RD    +S S   M+SF++
Sbjct: 79   GNDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQN 138

Query: 446  SVFYMDKIVTECELPELIVCYEENTYH-IKDICIDEGVRFHDRILFESNVNKKGVRAFFS 622
            SVFY+DK V ECELPEL+VCY+E+TYH +KDICIDEGV   D+ LFE+ +++K    F  
Sbjct: 139  SVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLP 198

Query: 623  PEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDL----- 787
             E++++++LM E  ++DM + DV  S  EN S +DI N C           + D+     
Sbjct: 199  SEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLE 258

Query: 788  -----------CDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKS 934
                       CDSKDL+    +  D    + +  SK+LF+LG+LLSM  +   NS + S
Sbjct: 259  KNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMS 318

Query: 935  SKDNELEAEKESFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNG 1114
            S       E++S  FQ SS K  +  P LV   EES    EEA+     LVSA+EE  +G
Sbjct: 319  SDCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSG 376

Query: 1115 SGEAILANPSLVSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSIT 1294
             GEAIL +P+ VS  E                     EST  S  + +SY++ +ETGSIT
Sbjct: 377  KGEAILISPAQVSTPE---------------------ESTSSSLVNEVSYDNKLETGSIT 415

Query: 1295 FDFDASAPGARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVH-------------- 1432
            F+ D+SAP +   E H  + DS+ + +    KLE A  QS+S+ +               
Sbjct: 416  FNLDSSAPTSSKDECHHNL-DSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLV 474

Query: 1433 SGLXXXXXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHK-W 1609
            +GL       AY                 FAFPILQ+EW+ SPVRMAKA+RRHY +HK W
Sbjct: 475  TGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGW 534

Query: 1610 RQGLLCCRF 1636
            R GLLCCRF
Sbjct: 535  RHGLLCCRF 543


>EOY01581.1 18S pre-ribosomal assembly protein gar2-related, putative isoform 1
            [Theobroma cacao]
          Length = 527

 Score =  328 bits (840), Expect = e-101
 Identities = 214/549 (38%), Positives = 289/549 (52%), Gaps = 41/549 (7%)
 Frame = +2

Query: 113  DGEQVFPHSTLGRKPDSKHFDY---------HENGLDYTGLKSENSIVKEYHIGTLCASK 265
            D EQV  HS  G K DSK + +          +  LD TGL +E  +VKE   G +   K
Sbjct: 4    DNEQVLCHSITGHKSDSKPYSFLADTKPFENKDKPLDSTGLNAEG-VVKENQNGVMHDIK 62

Query: 266  ANEGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFED 445
             N+G +D   +  N   GW A KL  S S+N+    NEK +RD    +S S   M+SF++
Sbjct: 63   GNDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQN 122

Query: 446  SVFYMDKIVTECELPELIVCYEENTYH-IKDICIDEGVRFHDRILFESNVNKKGVRAFFS 622
            SVFY+DK V ECELPEL+VCY+E+TYH +KDICIDEGV   D+ LFE+ +++K    F  
Sbjct: 123  SVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLP 182

Query: 623  PEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDL----- 787
             E++++++LM E  ++DM + DV  S  EN S +DI N C           + D+     
Sbjct: 183  SEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLE 242

Query: 788  -----------CDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKS 934
                       CDSKDL+    +  D    + +  SK+LF+LG+LLSM  +   NS + S
Sbjct: 243  KNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMS 302

Query: 935  SKDNELEAEKESFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNG 1114
            S       E++S  FQ SS K  +  P LV   EES    EEA+     LVSA+EE  +G
Sbjct: 303  SDCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSG 360

Query: 1115 SGEAILANPSLVSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSIT 1294
             GEAIL +P+                      VS S EST  S  + +SY++ +ETGSIT
Sbjct: 361  KGEAILISPA---------------------QVSTSEESTSSSLVNEVSYDNKLETGSIT 399

Query: 1295 FDFDASAPGARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVH-------------- 1432
            F+ D+SAP +   E H  + DS+ + +    KLE A  QS+S+ +               
Sbjct: 400  FNLDSSAPTSSKDECHHNL-DSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLV 458

Query: 1433 SGLXXXXXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHK-W 1609
            +GL       AY                 FAFPILQ+EW+ SPVRMAKA+RRHY +HK W
Sbjct: 459  TGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGW 518

Query: 1610 RQGLLCCRF 1636
            R GLLCCRF
Sbjct: 519  RHGLLCCRF 527


>XP_017971960.1 PREDICTED: uncharacterized protein LOC18610175 isoform X2 [Theobroma
            cacao]
          Length = 538

 Score =  327 bits (839), Expect = e-101
 Identities = 214/547 (39%), Positives = 289/547 (52%), Gaps = 41/547 (7%)
 Frame = +2

Query: 119  EQVFPHSTLGRKPDSKHFDY---------HENGLDYTGLKSENSIVKEYHIGTLCASKAN 271
            EQV  HST G K DSK + +          +  LD TGL +E  +VKE   G +   K N
Sbjct: 17   EQVLCHSTTGHKSDSKPYSFLADTKPFENKDKPLDSTGLNAEG-VVKENQNGVMHDIKGN 75

Query: 272  EGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFEDSV 451
            +G +D   +  N   GW A KL  S S+N+    NEK +RD    +S S   M+SF++SV
Sbjct: 76   DGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSV 135

Query: 452  FYMDKIVTECELPELIVCYEENTYH-IKDICIDEGVRFHDRILFESNVNKKGVRAFFSPE 628
            FY+DK V ECELPEL+VCY+E+TYH +KDICIDEGV   D+ LFE+ +++K    F   E
Sbjct: 136  FYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSE 195

Query: 629  EDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDL------- 787
            ++++++LM E  ++DM + DV  S  EN S +DI N C           + D+       
Sbjct: 196  KEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKN 255

Query: 788  ---------CDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSK 940
                     CDSKDL+    +  D    + +  SK+LF+LG+LLSM  +   NS + SS 
Sbjct: 256  ESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSD 315

Query: 941  DNELEAEKESFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSG 1120
                  E++S  FQ SS K  +  P LV   EES    EEA+     LVSA+EE  +G G
Sbjct: 316  CKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKG 373

Query: 1121 EAILANPSLVSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSITFD 1300
            EAIL +P+ VS  E                     EST  S  + +SY++ +ETGSITF+
Sbjct: 374  EAILISPAQVSTPE---------------------ESTSSSLVNEVSYDNKLETGSITFN 412

Query: 1301 FDASAPGARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVH--------------SG 1438
             D+SAP +   E H  + DS+ + +    KLE A  QS+S+ +               +G
Sbjct: 413  LDSSAPTSSKDECHHNL-DSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLVTG 471

Query: 1439 LXXXXXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHK-WRQ 1615
            L       AY                 FAFPILQ+EW+ SPVRMAKA+RRHY +HK WR 
Sbjct: 472  LISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRH 531

Query: 1616 GLLCCRF 1636
            GLLCCRF
Sbjct: 532  GLLCCRF 538


>GAV79538.1 hypothetical protein CFOL_v3_23003 [Cephalotus follicularis]
          Length = 475

 Score =  320 bits (820), Expect = 3e-99
 Identities = 218/532 (40%), Positives = 288/532 (54%), Gaps = 19/532 (3%)
 Frame = +2

Query: 98   MKFVSDGEQVFPHSTLGRKPDSKHFDYHENGLDYTGLKSENSIVKEYHIGTLCASKANEG 277
            MKF  D EQV  HSTL R+PDSK F+YH   +D TGLKSEN ++K+     L   K  EG
Sbjct: 1    MKF--DNEQVLCHSTLARRPDSKPFEYHGKAMDSTGLKSENGVMKDNQKRVLSFLKGKEG 58

Query: 278  VADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFED-SVF 454
             A+ LP   N +      KL      N    DNE                  SFE  SVF
Sbjct: 59   NAECLPCERNES------KLDCPVVANYSTNDNE------------------SFEKHSVF 94

Query: 455  YMDKIVTECELPELIVCYEENTYHI-KDICIDEGVRFHDR-ILFESNVNKKGVRAFFSPE 628
            Y ++ V +CELPELI+CY+E+ YH+ KDICI+E V   D+ + FES V++K V   F P+
Sbjct: 95   YFNRSVMKCELPELILCYKESPYHVVKDICINEDVPSKDKNLFFESGVDEKSV-CTFPPD 153

Query: 629  EDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDLCDSKDLL 808
             D+N E   E K  DMP+P  +K+SAENDS +DI                 D  D  DL+
Sbjct: 154  MDQNIE-STEGKPFDMPIPVAMKASAENDSDKDIN----------------DKYDIPDLM 196

Query: 809  PAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEAEKESFQFQGS 988
            P G + DD TD+ AN   K+  SLGD+LSM  + +EN+ SKS   + +    E    Q S
Sbjct: 197  PIGEVQDDATDKNANDIPKQKISLGDMLSMEKLHSENTFSKSC--DVVSKNAEQLSVQSS 254

Query: 989  SGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSGEAILANPSLVSAREKA 1168
            S K   ++ A +  ++ESN                +EES+N S +  LA+P+LVSA +++
Sbjct: 255  SEKTVASSLASLSTSDESNNSGNR-----------TEESNNDSEDLTLASPTLVSATKES 303

Query: 1169 HDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSITFDFDASAPGARGKEEHLQ 1348
                 E    S  +VSAS ES   S +++LSYNS VETGSITFDF++ AP A  ++E  Q
Sbjct: 304  DSGRDEMVFVSPAIVSASEESANSSFSNDLSYNSKVETGSITFDFNSGAPAASDRKECPQ 363

Query: 1349 IGDSQRI-ESLGMSKLEDAPRQSVSSQVH--------------SGLXXXXXXXAYXXXXX 1483
            I +S+ + ++   S+LEDA  Q V+SQ                SG        AY     
Sbjct: 364  ITESECLDDTQSSSRLEDADIQLVTSQTQHSHGESSFSTAGPISGSIIYSGPIAYSGSVS 423

Query: 1484 XXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHK-WRQGLLCCRF 1636
                        FAFP+LQ+EW+SSPVRMAKA+RRHY +H+ WRQGLLCCRF
Sbjct: 424  LRSDSSTTSTRSFAFPVLQSEWNSSPVRMAKADRRHYRKHRGWRQGLLCCRF 475


>XP_018860380.1 PREDICTED: uncharacterized protein LOC109022047 isoform X3 [Juglans
            regia] XP_018860382.1 PREDICTED: uncharacterized protein
            LOC109022047 isoform X3 [Juglans regia] XP_018860383.1
            PREDICTED: uncharacterized protein LOC109022047 isoform
            X3 [Juglans regia] XP_018860384.1 PREDICTED:
            uncharacterized protein LOC109022047 isoform X3 [Juglans
            regia] XP_018860385.1 PREDICTED: uncharacterized protein
            LOC109022047 isoform X3 [Juglans regia] XP_018860386.1
            PREDICTED: uncharacterized protein LOC109022047 isoform
            X3 [Juglans regia] XP_018860387.1 PREDICTED:
            uncharacterized protein LOC109022047 isoform X3 [Juglans
            regia]
          Length = 517

 Score =  321 bits (822), Expect = 5e-99
 Identities = 213/537 (39%), Positives = 293/537 (54%), Gaps = 29/537 (5%)
 Frame = +2

Query: 113  DGEQVFPHSTLGRKPDSKHFDYHENGLDYTGLKSENSIVKEYHIGTLCASKANEGVADHL 292
            D E VF HSTLG KPDSK FDY++  LD + +KS+N I+ E     LC  K +E  A   
Sbjct: 4    DSEPVFCHSTLGHKPDSKPFDYNDIALD-SAMKSQNLIMTENQ-SLLCDLKGDEKDAVPF 61

Query: 293  PHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFE-DSVFYMDKI 469
             + TN  DGWTA K   S S+ ++  +N+  ++D  A  + S  + ESF+ D  F MDK 
Sbjct: 62   SNATNDGDGWTAIKFDCSMSMVDINNENKDEVKDFVALHNQSSQKTESFDKDLHFVMDKG 121

Query: 470  VTECELPELIVCYEENTYHI-KDICIDEGVRFHDRILFESNVNKKGVRAFFSPEEDRNNE 646
            V ECELPEL VCY+ +TYH+ KDIC+DEGV   ++ILFES  +KK V     P++D+N E
Sbjct: 122  VMECELPELTVCYKGSTYHVVKDICVDEGVHSREKILFESGRDKKTVCIVLPPDKDQNKE 181

Query: 647  LMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDLCDSKDLLPAGNIN 826
            L +E +D D+  PD L  SAEN S +D  N+                 DSKD +  G   
Sbjct: 182  LAKEKEDIDISGPDGLNFSAENYSDKDSTNQY----------------DSKDSMQTG--- 222

Query: 827  DDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEAEKESFQFQGSSGKASL 1006
            +D T  I   ASKK+F  G++L M   G   S      ++  + E++ FQ  G   +  L
Sbjct: 223  EDATGSILTDASKKMFFPGNMLPMVASGVCASQFDCLSNDSNKVEQQPFQVYGE--RPIL 280

Query: 1007 ANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSGEAILANPSLVSAREKAHDNSGE 1186
            A PALV   EESN  +   +   S LV A +ES+  S +  +A+P  VS+ E++++++G+
Sbjct: 281  AGPALVSAVEESNNSSGNKVLASSTLVYAVKESNIRSVDPKIASPDFVSSAEESNNSTGD 340

Query: 1187 ASLASTDLVSASNESTKISTADNLSYNSMVETGSITFDFDASAPGARGKEEHLQIGDSQR 1366
              LAS  LV A+  S   ST + L YNS VE+GSITFDFD+  P   G+ E L+ GDS+ 
Sbjct: 341  QMLASPTLVPATELSNSSSTVNELFYNSKVESGSITFDFDSLEPSDSGRLEGLENGDSEC 400

Query: 1367 IESLGMSKLED--APRQSVSSQ------------------------VHSGLXXXXXXXAY 1468
             E+   SK+E+  +   +VS +                          S L        Y
Sbjct: 401  HETQKTSKVENGLSDAHTVSRRHQHALGETSFSAVGRGEESCSAAGTLSSLINYSGPMGY 460

Query: 1469 XXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHK-WRQGLLCCRF 1636
                             FAFPILQ+EW+SSPVRMAKA++RH+ +H+ WRQGLLCC+F
Sbjct: 461  SGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADQRHFRKHRCWRQGLLCCKF 517


>XP_018860379.1 PREDICTED: uncharacterized protein LOC109022047 isoform X2 [Juglans
            regia]
          Length = 559

 Score =  322 bits (825), Expect = 6e-99
 Identities = 216/543 (39%), Positives = 296/543 (54%), Gaps = 29/543 (5%)
 Frame = +2

Query: 95   NMKFVSDGEQVFPHSTLGRKPDSKHFDYHENGLDYTGLKSENSIVKEYHIGTLCASKANE 274
            NMK   D E VF HSTLG KPDSK FDY++  LD + +KS+N I+ E     LC  K +E
Sbjct: 42   NMKL--DSEPVFCHSTLGHKPDSKPFDYNDIALD-SAMKSQNLIMTENQ-SLLCDLKGDE 97

Query: 275  GVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFE-DSV 451
              A    + TN  DGWTA K   S S+ ++  +N+  ++D  A  + S  + ESF+ D  
Sbjct: 98   KDAVPFSNATNDGDGWTAIKFDCSMSMVDINNENKDEVKDFVALHNQSSQKTESFDKDLH 157

Query: 452  FYMDKIVTECELPELIVCYEENTYHI-KDICIDEGVRFHDRILFESNVNKKGVRAFFSPE 628
            F MDK V ECELPEL VCY+ +TYH+ KDIC+DEGV   ++ILFES  +KK V     P+
Sbjct: 158  FVMDKGVMECELPELTVCYKGSTYHVVKDICVDEGVHSREKILFESGRDKKTVCIVLPPD 217

Query: 629  EDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDLCDSKDLL 808
            +D+N EL +E +D D+  PD L  SAEN S +D  N+                 DSKD +
Sbjct: 218  KDQNKELAKEKEDIDISGPDGLNFSAENYSDKDSTNQY----------------DSKDSM 261

Query: 809  PAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEAEKESFQFQGS 988
              G   +D T  I   ASKK+F  G++L M   G   S      ++  + E++ FQ  G 
Sbjct: 262  QTG---EDATGSILTDASKKMFFPGNMLPMVASGVCASQFDCLSNDSNKVEQQPFQVYGE 318

Query: 989  SGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSGEAILANPSLVSAREKA 1168
              +  LA PALV   EESN  +   +   S LV A +ES+  S +  +A+P  VS+ E++
Sbjct: 319  --RPILAGPALVSAVEESNNSSGNKVLASSTLVYAVKESNIRSVDPKIASPDFVSSAEES 376

Query: 1169 HDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSITFDFDASAPGARGKEEHLQ 1348
            ++++G+  LAS  LV A+  S   ST + L YNS VE+GSITFDFD+  P   G+ E L+
Sbjct: 377  NNSTGDQMLASPTLVPATELSNSSSTVNELFYNSKVESGSITFDFDSLEPSDSGRLEGLE 436

Query: 1349 IGDSQRIESLGMSKLED--APRQSVSSQ------------------------VHSGLXXX 1450
             GDS+  E+   SK+E+  +   +VS +                          S L   
Sbjct: 437  NGDSECHETQKTSKVENGLSDAHTVSRRHQHALGETSFSAVGRGEESCSAAGTLSSLINY 496

Query: 1451 XXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHK-WRQGLLC 1627
                 Y                 FAFPILQ+EW+SSPVRMAKA++RH+ +H+ WRQGLLC
Sbjct: 497  SGPMGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADQRHFRKHRCWRQGLLC 556

Query: 1628 CRF 1636
            C+F
Sbjct: 557  CKF 559


>XP_018860377.1 PREDICTED: uncharacterized protein LOC109022047 isoform X1 [Juglans
            regia] XP_018860378.1 PREDICTED: uncharacterized protein
            LOC109022047 isoform X1 [Juglans regia]
          Length = 567

 Score =  322 bits (825), Expect = 7e-99
 Identities = 216/543 (39%), Positives = 296/543 (54%), Gaps = 29/543 (5%)
 Frame = +2

Query: 95   NMKFVSDGEQVFPHSTLGRKPDSKHFDYHENGLDYTGLKSENSIVKEYHIGTLCASKANE 274
            NMK   D E VF HSTLG KPDSK FDY++  LD + +KS+N I+ E     LC  K +E
Sbjct: 50   NMKL--DSEPVFCHSTLGHKPDSKPFDYNDIALD-SAMKSQNLIMTENQ-SLLCDLKGDE 105

Query: 275  GVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFE-DSV 451
              A    + TN  DGWTA K   S S+ ++  +N+  ++D  A  + S  + ESF+ D  
Sbjct: 106  KDAVPFSNATNDGDGWTAIKFDCSMSMVDINNENKDEVKDFVALHNQSSQKTESFDKDLH 165

Query: 452  FYMDKIVTECELPELIVCYEENTYHI-KDICIDEGVRFHDRILFESNVNKKGVRAFFSPE 628
            F MDK V ECELPEL VCY+ +TYH+ KDIC+DEGV   ++ILFES  +KK V     P+
Sbjct: 166  FVMDKGVMECELPELTVCYKGSTYHVVKDICVDEGVHSREKILFESGRDKKTVCIVLPPD 225

Query: 629  EDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDLCDSKDLL 808
            +D+N EL +E +D D+  PD L  SAEN S +D  N+                 DSKD +
Sbjct: 226  KDQNKELAKEKEDIDISGPDGLNFSAENYSDKDSTNQY----------------DSKDSM 269

Query: 809  PAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKDNELEAEKESFQFQGS 988
              G   +D T  I   ASKK+F  G++L M   G   S      ++  + E++ FQ  G 
Sbjct: 270  QTG---EDATGSILTDASKKMFFPGNMLPMVASGVCASQFDCLSNDSNKVEQQPFQVYGE 326

Query: 989  SGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHNGSGEAILANPSLVSAREKA 1168
              +  LA PALV   EESN  +   +   S LV A +ES+  S +  +A+P  VS+ E++
Sbjct: 327  --RPILAGPALVSAVEESNNSSGNKVLASSTLVYAVKESNIRSVDPKIASPDFVSSAEES 384

Query: 1169 HDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSITFDFDASAPGARGKEEHLQ 1348
            ++++G+  LAS  LV A+  S   ST + L YNS VE+GSITFDFD+  P   G+ E L+
Sbjct: 385  NNSTGDQMLASPTLVPATELSNSSSTVNELFYNSKVESGSITFDFDSLEPSDSGRLEGLE 444

Query: 1349 IGDSQRIESLGMSKLED--APRQSVSSQ------------------------VHSGLXXX 1450
             GDS+  E+   SK+E+  +   +VS +                          S L   
Sbjct: 445  NGDSECHETQKTSKVENGLSDAHTVSRRHQHALGETSFSAVGRGEESCSAAGTLSSLINY 504

Query: 1451 XXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHK-WRQGLLC 1627
                 Y                 FAFPILQ+EW+SSPVRMAKA++RH+ +H+ WRQGLLC
Sbjct: 505  SGPMGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADQRHFRKHRCWRQGLLC 564

Query: 1628 CRF 1636
            C+F
Sbjct: 565  CKF 567


>XP_012080467.1 PREDICTED: uncharacterized protein LOC105640684 isoform X3 [Jatropha
            curcas] XP_012080468.1 PREDICTED: uncharacterized protein
            LOC105640684 isoform X3 [Jatropha curcas] XP_012080469.1
            PREDICTED: uncharacterized protein LOC105640684 isoform
            X3 [Jatropha curcas]
          Length = 531

 Score =  314 bits (804), Expect = 3e-96
 Identities = 226/563 (40%), Positives = 297/563 (52%), Gaps = 53/563 (9%)
 Frame = +2

Query: 107  VSDGEQVFPHSTLGRKPDSKHFDYHENGLDYTGLKSENSIVKEYHIGTLCASKANEGVAD 286
            + DGEQV  H T+  KP SKHF      LD TGLKS N IV E   G  C  K  E  +D
Sbjct: 2    LEDGEQVLCHGTIDHKPGSKHFGCSNIALDSTGLKSGNGIVNEEQNGAFCDLKGRESNSD 61

Query: 287  HLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPD--SHSFVEMESFE-DSVFY 457
            HL +T N  + WTA KL  S   + L  DNEK +RD  AP   S S +++ESFE DSVFY
Sbjct: 62   HLQYTVNDENNWTASKLDSSMRADALTDDNEKEVRDFVAPIPLSLSSLKVESFEGDSVFY 121

Query: 458  MDKIVTECELPELIVCYEENTYH-IKDICIDEGVRFHDRILFESNVNKKGVRAFFSPEED 634
            +DK V E ELPEL+VCY+ENTYH IKDICIDEGV   D+ LF++ +++K +R     E+ 
Sbjct: 122  VDKNVMEPELPELVVCYKENTYHVIKDICIDEGVPSKDKFLFDT-IDEKNLRTLLFHEKH 180

Query: 635  RNNELMEETKDS-------------------DMPVPDVLKSSAENDSVEDIVNRCXXXXX 757
            RN+E+ +ET D                    D+P+PDV  SSAEN S  +I         
Sbjct: 181  RNSEVRKETADQDIFIPESLKSLPEDEKSALDLPIPDVFISSAENGSKNEI--------- 231

Query: 758  XXXXXXIVDLCDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSS 937
                     L DS++ +  G I DD  +EIAN  SK++FSLG+LLSM  VGTE S  K S
Sbjct: 232  --------SLHDSEEFMTTGKIEDDTMEEIANGKSKEIFSLGELLSMPEVGTELSQPKFS 283

Query: 938  KDNELEAEKESFQFQGSSGKASLANPA---------------LVCPAEESNGGTEEALSR 1072
             D+  EA+++  Q    +   + A+                 +V  AE S+   +E +SR
Sbjct: 284  HDSMHEAKQQPIQRPSENTILATASSCDEAKNGNELTSFVRPMVPAAEVSDCHHDEEISR 343

Query: 1073 GSDLVSASEESHNGSGEAILANPSLVSAREKAHDNSGEASLASTDLVSASNESTKISTAD 1252
                  A + S++   EA+LA+P+L SA +++        LAS +L S+  EST IS   
Sbjct: 344  ----TKALDHSYD---EAVLASPALNSATQESEKVCEGEKLASHNL-SSERESTNISGC- 394

Query: 1253 NLSYNSMVETGSITFDFDASAPGARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVH 1432
             L+ NS V++ SI F   ASA      EE  Q G S+ + S   S+LE+   +  +SQ+ 
Sbjct: 395  GLANNSNVKSESINFYTPASA-----GEEDSQNGGSENLNSRS-SRLEETNTEPCTSQLQ 448

Query: 1433 --------------SGLXXXXXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRM 1570
                          SGL       AY                 FAFPILQ+EW+SSPVRM
Sbjct: 449  HGIGESSFSAAGPLSGLISYSGPIAYSGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRM 508

Query: 1571 AKAERRHYW-QHKWRQGLLCCRF 1636
            AKA+RR +  Q  W+QGLLCCRF
Sbjct: 509  AKADRRRFQKQRSWKQGLLCCRF 531


>XP_012080460.1 PREDICTED: uncharacterized protein LOC105640684 isoform X2 [Jatropha
            curcas] XP_012080461.1 PREDICTED: uncharacterized protein
            LOC105640684 isoform X2 [Jatropha curcas] XP_012080462.1
            PREDICTED: uncharacterized protein LOC105640684 isoform
            X2 [Jatropha curcas] XP_012080463.1 PREDICTED:
            uncharacterized protein LOC105640684 isoform X2 [Jatropha
            curcas] XP_012080464.1 PREDICTED: uncharacterized protein
            LOC105640684 isoform X2 [Jatropha curcas] XP_012080465.1
            PREDICTED: uncharacterized protein LOC105640684 isoform
            X2 [Jatropha curcas] XP_012080466.1 PREDICTED:
            uncharacterized protein LOC105640684 isoform X2 [Jatropha
            curcas] KDP31404.1 hypothetical protein JCGZ_11780
            [Jatropha curcas]
          Length = 531

 Score =  313 bits (803), Expect = 5e-96
 Identities = 226/561 (40%), Positives = 296/561 (52%), Gaps = 53/561 (9%)
 Frame = +2

Query: 113  DGEQVFPHSTLGRKPDSKHFDYHENGLDYTGLKSENSIVKEYHIGTLCASKANEGVADHL 292
            DGEQV  H T+  KP SKHF      LD TGLKS N IV E   G  C  K  E  +DHL
Sbjct: 4    DGEQVLCHGTIDHKPGSKHFGCSNIALDSTGLKSGNGIVNEEQNGAFCDLKGRESNSDHL 63

Query: 293  PHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPD--SHSFVEMESFE-DSVFYMD 463
             +T N  + WTA KL  S   + L  DNEK +RD  AP   S S +++ESFE DSVFY+D
Sbjct: 64   QYTVNDENNWTASKLDSSMRADALTDDNEKEVRDFVAPIPLSLSSLKVESFEGDSVFYVD 123

Query: 464  KIVTECELPELIVCYEENTYH-IKDICIDEGVRFHDRILFESNVNKKGVRAFFSPEEDRN 640
            K V E ELPEL+VCY+ENTYH IKDICIDEGV   D+ LF++ +++K +R     E+ RN
Sbjct: 124  KNVMEPELPELVVCYKENTYHVIKDICIDEGVPSKDKFLFDT-IDEKNLRTLLFHEKHRN 182

Query: 641  NELMEETKDS-------------------DMPVPDVLKSSAENDSVEDIVNRCXXXXXXX 763
            +E+ +ET D                    D+P+PDV  SSAEN S  +I           
Sbjct: 183  SEVRKETADQDIFIPESLKSLPEDEKSALDLPIPDVFISSAENGSKNEI----------- 231

Query: 764  XXXXIVDLCDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSSKD 943
                   L DS++ +  G I DD  +EIAN  SK++FSLG+LLSM  VGTE S  K S D
Sbjct: 232  ------SLHDSEEFMTTGKIEDDTMEEIANGKSKEIFSLGELLSMPEVGTELSQPKFSHD 285

Query: 944  NELEAEKESFQFQGSSGKASLANPA---------------LVCPAEESNGGTEEALSRGS 1078
            +  EA+++  Q    +   + A+                 +V  AE S+   +E +SR  
Sbjct: 286  SMHEAKQQPIQRPSENTILATASSCDEAKNGNELTSFVRPMVPAAEVSDCHHDEEISR-- 343

Query: 1079 DLVSASEESHNGSGEAILANPSLVSAREKAHDNSGEASLASTDLVSASNESTKISTADNL 1258
                A + S++   EA+LA+P+L SA +++        LAS +L S+  EST IS    L
Sbjct: 344  --TKALDHSYD---EAVLASPALNSATQESEKVCEGEKLASHNL-SSERESTNISGC-GL 396

Query: 1259 SYNSMVETGSITFDFDASAPGARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVH-- 1432
            + NS V++ SI F   ASA      EE  Q G S+ + S   S+LE+   +  +SQ+   
Sbjct: 397  ANNSNVKSESINFYTPASA-----GEEDSQNGGSENLNSRS-SRLEETNTEPCTSQLQHG 450

Query: 1433 ------------SGLXXXXXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAK 1576
                        SGL       AY                 FAFPILQ+EW+SSPVRMAK
Sbjct: 451  IGESSFSAAGPLSGLISYSGPIAYSGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAK 510

Query: 1577 AERRHYW-QHKWRQGLLCCRF 1636
            A+RR +  Q  W+QGLLCCRF
Sbjct: 511  ADRRRFQKQRSWKQGLLCCRF 531


>XP_012080459.1 PREDICTED: uncharacterized protein LOC105640684 isoform X1 [Jatropha
            curcas]
          Length = 555

 Score =  314 bits (804), Expect = 6e-96
 Identities = 226/563 (40%), Positives = 297/563 (52%), Gaps = 53/563 (9%)
 Frame = +2

Query: 107  VSDGEQVFPHSTLGRKPDSKHFDYHENGLDYTGLKSENSIVKEYHIGTLCASKANEGVAD 286
            + DGEQV  H T+  KP SKHF      LD TGLKS N IV E   G  C  K  E  +D
Sbjct: 26   LEDGEQVLCHGTIDHKPGSKHFGCSNIALDSTGLKSGNGIVNEEQNGAFCDLKGRESNSD 85

Query: 287  HLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPD--SHSFVEMESFE-DSVFY 457
            HL +T N  + WTA KL  S   + L  DNEK +RD  AP   S S +++ESFE DSVFY
Sbjct: 86   HLQYTVNDENNWTASKLDSSMRADALTDDNEKEVRDFVAPIPLSLSSLKVESFEGDSVFY 145

Query: 458  MDKIVTECELPELIVCYEENTYH-IKDICIDEGVRFHDRILFESNVNKKGVRAFFSPEED 634
            +DK V E ELPEL+VCY+ENTYH IKDICIDEGV   D+ LF++ +++K +R     E+ 
Sbjct: 146  VDKNVMEPELPELVVCYKENTYHVIKDICIDEGVPSKDKFLFDT-IDEKNLRTLLFHEKH 204

Query: 635  RNNELMEETKDS-------------------DMPVPDVLKSSAENDSVEDIVNRCXXXXX 757
            RN+E+ +ET D                    D+P+PDV  SSAEN S  +I         
Sbjct: 205  RNSEVRKETADQDIFIPESLKSLPEDEKSALDLPIPDVFISSAENGSKNEI--------- 255

Query: 758  XXXXXXIVDLCDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSKSS 937
                     L DS++ +  G I DD  +EIAN  SK++FSLG+LLSM  VGTE S  K S
Sbjct: 256  --------SLHDSEEFMTTGKIEDDTMEEIANGKSKEIFSLGELLSMPEVGTELSQPKFS 307

Query: 938  KDNELEAEKESFQFQGSSGKASLANPA---------------LVCPAEESNGGTEEALSR 1072
             D+  EA+++  Q    +   + A+                 +V  AE S+   +E +SR
Sbjct: 308  HDSMHEAKQQPIQRPSENTILATASSCDEAKNGNELTSFVRPMVPAAEVSDCHHDEEISR 367

Query: 1073 GSDLVSASEESHNGSGEAILANPSLVSAREKAHDNSGEASLASTDLVSASNESTKISTAD 1252
                  A + S++   EA+LA+P+L SA +++        LAS +L S+  EST IS   
Sbjct: 368  ----TKALDHSYD---EAVLASPALNSATQESEKVCEGEKLASHNL-SSERESTNISGC- 418

Query: 1253 NLSYNSMVETGSITFDFDASAPGARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVH 1432
             L+ NS V++ SI F   ASA      EE  Q G S+ + S   S+LE+   +  +SQ+ 
Sbjct: 419  GLANNSNVKSESINFYTPASA-----GEEDSQNGGSENLNSRS-SRLEETNTEPCTSQLQ 472

Query: 1433 --------------SGLXXXXXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRM 1570
                          SGL       AY                 FAFPILQ+EW+SSPVRM
Sbjct: 473  HGIGESSFSAAGPLSGLISYSGPIAYSGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRM 532

Query: 1571 AKAERRHYW-QHKWRQGLLCCRF 1636
            AKA+RR +  Q  W+QGLLCCRF
Sbjct: 533  AKADRRRFQKQRSWKQGLLCCRF 555


>EOY01582.1 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] EOY01583.1 18S pre-ribosomal assembly
            protein gar2-related, putative isoform 2 [Theobroma
            cacao] EOY01584.1 18S pre-ribosomal assembly protein
            gar2-related, putative isoform 2 [Theobroma cacao]
          Length = 470

 Score =  303 bits (775), Expect = 1e-92
 Identities = 193/490 (39%), Positives = 262/490 (53%), Gaps = 32/490 (6%)
 Frame = +2

Query: 263  KANEGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFE 442
            K N+G +D   +  N   GW A KL  S S+N+    NEK +RD    +S S   M+SF+
Sbjct: 5    KGNDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQ 64

Query: 443  DSVFYMDKIVTECELPELIVCYEENTYH-IKDICIDEGVRFHDRILFESNVNKKGVRAFF 619
            +SVFY+DK V ECELPEL+VCY+E+TYH +KDICIDEGV   D+ LFE+ +++K    F 
Sbjct: 65   NSVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFL 124

Query: 620  SPEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDL---- 787
              E++++++LM E  ++DM + DV  S  EN S +DI N C           + D+    
Sbjct: 125  PSEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSL 184

Query: 788  ------------CDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSK 931
                        CDSKDL+    +  D    + +  SK+LF+LG+LLSM  +   NS + 
Sbjct: 185  EKNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAM 244

Query: 932  SSKDNELEAEKESFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHN 1111
            SS       E++S  FQ SS K  +  P LV   EES    EEA+     LVSA+EE  +
Sbjct: 245  SSDCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDS 302

Query: 1112 GSGEAILANPSLVSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSI 1291
            G GEAIL +P+                      VS S EST  S  + +SY++ +ETGSI
Sbjct: 303  GKGEAILISPA---------------------QVSTSEESTSSSLVNEVSYDNKLETGSI 341

Query: 1292 TFDFDASAPGARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVH------------- 1432
            TF+ D+SAP +   E H  + DS+ + +    KLE A  QS+S+ +              
Sbjct: 342  TFNLDSSAPTSSKDECHHNL-DSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGL 400

Query: 1433 -SGLXXXXXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHK- 1606
             +GL       AY                 FAFPILQ+EW+ SPVRMAKA+RRHY +HK 
Sbjct: 401  VTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKG 460

Query: 1607 WRQGLLCCRF 1636
            WR GLLCCRF
Sbjct: 461  WRHGLLCCRF 470


>XP_007045750.2 PREDICTED: uncharacterized protein LOC18610175 isoform X4 [Theobroma
            cacao] XP_007045752.2 PREDICTED: uncharacterized protein
            LOC18610175 isoform X4 [Theobroma cacao]
          Length = 470

 Score =  302 bits (774), Expect = 2e-92
 Identities = 193/490 (39%), Positives = 262/490 (53%), Gaps = 32/490 (6%)
 Frame = +2

Query: 263  KANEGVADHLPHTTNVADGWTARKLHHSTSLNELEKDNEKVIRDQEAPDSHSFVEMESFE 442
            K N+G +D   +  N   GW A KL  S S+N+    NEK +RD    +S S   M+SF+
Sbjct: 5    KGNDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQ 64

Query: 443  DSVFYMDKIVTECELPELIVCYEENTYH-IKDICIDEGVRFHDRILFESNVNKKGVRAFF 619
            +SVFY+DK V ECELPEL+VCY+E+TYH +KDICIDEGV   D+ LFE+ +++K    F 
Sbjct: 65   NSVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFL 124

Query: 620  SPEEDRNNELMEETKDSDMPVPDVLKSSAENDSVEDIVNRCXXXXXXXXXXXIVDL---- 787
              E++++++LM E  ++DM + DV  S  EN S +DI N C           + D+    
Sbjct: 125  PSEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSL 184

Query: 788  ------------CDSKDLLPAGNINDDVTDEIANYASKKLFSLGDLLSMHNVGTENSHSK 931
                        CDSKDL+    +  D    + +  SK+LF+LG+LLSM  +   NS + 
Sbjct: 185  EKNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAM 244

Query: 932  SSKDNELEAEKESFQFQGSSGKASLANPALVCPAEESNGGTEEALSRGSDLVSASEESHN 1111
            SS       E++S  FQ SS K  +  P LV   EES    EEA+     LVSA+EE  +
Sbjct: 245  SSDCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDS 302

Query: 1112 GSGEAILANPSLVSAREKAHDNSGEASLASTDLVSASNESTKISTADNLSYNSMVETGSI 1291
            G GEAIL +P+ VS  E                     EST  S  + +SY++ +ETGSI
Sbjct: 303  GKGEAILISPAQVSTPE---------------------ESTSSSLVNEVSYDNKLETGSI 341

Query: 1292 TFDFDASAPGARGKEEHLQIGDSQRIESLGMSKLEDAPRQSVSSQVH------------- 1432
            TF+ D+SAP +   E H  + DS+ + +    KLE A  QS+S+ +              
Sbjct: 342  TFNLDSSAPTSSKDECHHNL-DSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGL 400

Query: 1433 -SGLXXXXXXXAYXXXXXXXXXXXXXXXXXFAFPILQAEWSSSPVRMAKAERRHYWQHK- 1606
             +GL       AY                 FAFPILQ+EW+ SPVRMAKA+RRHY +HK 
Sbjct: 401  VTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKG 460

Query: 1607 WRQGLLCCRF 1636
            WR GLLCCRF
Sbjct: 461  WRHGLLCCRF 470

