BLASTX nr result

ID: Phellodendron21_contig00024770 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Phellodendron21_contig00024770
         (1538 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_006484256.1 PREDICTED: uncharacterized protein LOC102625369 i...   585   0.0  
XP_006484258.1 PREDICTED: uncharacterized protein LOC102625369 i...   571   0.0  
KDO70270.1 hypothetical protein CISIN_1g0116142mg [Citrus sinensis]   565   0.0  
XP_006437853.1 hypothetical protein CICLE_v10031644mg, partial [...   501   e-172
KDO70271.1 hypothetical protein CISIN_1g0116142mg [Citrus sinensis]   466   e-159
XP_006437852.1 hypothetical protein CICLE_v10031644mg, partial [...   402   e-135
XP_017971961.1 PREDICTED: uncharacterized protein LOC18610175 is...   357   e-114
EOY01581.1 18S pre-ribosomal assembly protein gar2-related, puta...   356   e-114
XP_007045751.2 PREDICTED: uncharacterized protein LOC18610175 is...   352   e-112
XP_017971960.1 PREDICTED: uncharacterized protein LOC18610175 is...   350   e-111
XP_006437854.1 hypothetical protein CICLE_v10031644mg [Citrus cl...   341   e-111
EOY01582.1 18S pre-ribosomal assembly protein gar2-related, puta...   327   e-103
XP_007045750.2 PREDICTED: uncharacterized protein LOC18610175 is...   326   e-103
XP_012080460.1 PREDICTED: uncharacterized protein LOC105640684 i...   322   e-101
XP_012080467.1 PREDICTED: uncharacterized protein LOC105640684 i...   317   8e-99
OMO72168.1 hypothetical protein COLO4_27800 [Corchorus olitorius]     315   1e-98
XP_012080459.1 PREDICTED: uncharacterized protein LOC105640684 i...   317   2e-98
GAV79538.1 hypothetical protein CFOL_v3_23003 [Cephalotus follic...   313   5e-98
XP_012464097.1 PREDICTED: uncharacterized protein LOC105783281 [...   306   4e-95
KHG21027.1 Formate--tetrahydrofolate ligase [Gossypium arboreum]      305   2e-94

>XP_006484256.1 PREDICTED: uncharacterized protein LOC102625369 isoform X1 [Citrus
            sinensis] XP_006484257.1 PREDICTED: uncharacterized
            protein LOC102625369 isoform X1 [Citrus sinensis]
          Length = 496

 Score =  585 bits (1509), Expect = 0.0
 Identities = 332/519 (63%), Positives = 370/519 (71%), Gaps = 14/519 (2%)
 Frame = -2

Query: 1516 DSEQVFPHSTLGH--KPDSEHFDYHENVLDSAGRKSENSIVKEYHTDALCASKSNEGVAD 1343
            DSEQ+FPH TLGH  KPDS+H                        + A+ AS SNEGVAD
Sbjct: 6    DSEQLFPHLTLGHSHKPDSKH------------------------SGAISASNSNEGVAD 41

Query: 1342 HLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMESFEDSVFYEDK 1163
             LP VTND DG T RKL+RSTSLNDLAKDNEK+VQDLE+P+SHSCGEMESF + VFY DK
Sbjct: 42   RLPHVTNDVDGCTVRKLERSTSLNDLAKDNEKNVQDLESPNSHSCGEMESFREPVFYMDK 101

Query: 1162 SVTECELPELIVCYKENTYHVKDICIDEGVRFNDRILFESNVDEKGARAFFSLEEDRNSE 983
            SVTECELPELIVCYKENTYHVKDICIDEGV  +DRILFES+V  K  R+F   +EDRNSE
Sbjct: 102  SVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVG-KSVRSFLPPKEDRNSE 160

Query: 982  LMEETKNSVMPIPNVLKSSAENDSNEDIANRCXXXXXXXXXXDIVNLCDSKDLTPAEDVK 803
            L+EE+KNSV+PIP+VLKSSAEN S+E I NRC          DI ++CDSKDL PA DVK
Sbjct: 161  LLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPAGDVK 220

Query: 802  DDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKDNELEAEKESLQFQGSSGKATL 623
            DD+T+E  +D S KLF +GDLLSMHNVGT+NSL K +  NE++AEKES  FQGSS KA L
Sbjct: 221  DDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKES--FQGSSAKAAL 278

Query: 622  VNPALACPAEESNGGTEVVLSRGSDLVSTSEESHNGSGESILANPALVSATEK------- 464
             NP      EE+NGGT   +  G+D VS SEES NG GE I  NP LVSA+EK       
Sbjct: 279  ANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGNPTLVSASEKAHDKSEE 332

Query: 463  AXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGASASGTRGKEEH-QIGDFLC 287
            A LAS D V A  ESTKIS A+  SYNSM+ETG ITFDF ASA G  GKEE  QIGD   
Sbjct: 333  ASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGASGKEEPLQIGDSQR 392

Query: 286  IETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITYSGPVAYXXXXXXXXXXXX 107
            IET GM RLEDAPRQSVSSQ HSGLGESSFSA  SLP LI+YSGPVAY            
Sbjct: 393  IETPGMSRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSGPVAYSGSISLRSDSST 452

Query: 106  XXXXSFAFPILQSEWNSSPVRMAKAD----RKHKWRQGL 2
                SFAFPILQ+EW+ SPVRMAKAD    RKHKW+QGL
Sbjct: 453  TSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHKWKQGL 491


>XP_006484258.1 PREDICTED: uncharacterized protein LOC102625369 isoform X2 [Citrus
            sinensis] XP_015387480.1 PREDICTED: uncharacterized
            protein LOC102625369 isoform X2 [Citrus sinensis]
          Length = 483

 Score =  571 bits (1471), Expect = 0.0
 Identities = 317/475 (66%), Positives = 353/475 (74%), Gaps = 12/475 (2%)
 Frame = -2

Query: 1390 HTDALCASKSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHS 1211
            H+ A+ AS SNEGVAD LP VTND DG T RKL+RSTSLNDLAKDNEK+VQDLE+P+SHS
Sbjct: 13   HSGAISASNSNEGVADRLPHVTNDVDGCTVRKLERSTSLNDLAKDNEKNVQDLESPNSHS 72

Query: 1210 CGEMESFEDSVFYEDKSVTECELPELIVCYKENTYHVKDICIDEGVRFNDRILFESNVDE 1031
            CGEMESF + VFY DKSVTECELPELIVCYKENTYHVKDICIDEGV  +DRILFES+V  
Sbjct: 73   CGEMESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVG- 131

Query: 1030 KGARAFFSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRCXXXXXXXXXXDI 851
            K  R+F   +EDRNSEL+EE+KNSV+PIP+VLKSSAEN S+E I NRC          DI
Sbjct: 132  KSVRSFLPPKEDRNSELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDI 191

Query: 850  VNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKDNELEA 671
             ++CDSKDL PA DVKDD+T+E  +D S KLF +GDLLSMHNVGT+NSL K +  NE++A
Sbjct: 192  DDICDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDA 251

Query: 670  EKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHNGSGESILAN 491
            EKES  FQGSS KA L NP      EE+NGGT   +  G+D VS SEES NG GE I  N
Sbjct: 252  EKES--FQGSSAKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGN 303

Query: 490  PALVSATEK-------AXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGASAS 332
            P LVSA+EK       A LAS D V A  ESTKIS A+  SYNSM+ETG ITFDF ASA 
Sbjct: 304  PTLVSASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAP 363

Query: 331  GTRGKEEH-QIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITYSG 155
            G  GKEE  QIGD   IET GM RLEDAPRQSVSSQ HSGLGESSFSA  SLP LI+YSG
Sbjct: 364  GASGKEEPLQIGDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSG 423

Query: 154  PVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKAD----RKHKWRQGL 2
            PVAY                SFAFPILQ+EW+ SPVRMAKAD    RKHKW+QGL
Sbjct: 424  PVAYSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHKWKQGL 478


>KDO70270.1 hypothetical protein CISIN_1g0116142mg [Citrus sinensis]
          Length = 481

 Score =  565 bits (1457), Expect = 0.0
 Identities = 313/474 (66%), Positives = 352/474 (74%), Gaps = 11/474 (2%)
 Frame = -2

Query: 1390 HTDALCASKSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHS 1211
            H+ A+ AS SNEGVAD LP VTND DG T RKL+RSTSLNDLAKDNEK+VQDLE+P+SHS
Sbjct: 13   HSGAISASNSNEGVADRLPHVTNDVDGCTVRKLERSTSLNDLAKDNEKNVQDLESPNSHS 72

Query: 1210 CGEMESFEDSVFYEDKSVTECELPELIVCYKENTYHVKDICIDEGVRFNDRILFESNVDE 1031
            CGEMESF + VFY DKSVTECELPELIVCYKENTYHVKDICIDEGV  +DRILFES+V  
Sbjct: 73   CGEMESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVG- 131

Query: 1030 KGARAFFSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRCXXXXXXXXXXDI 851
            K  R+F   +EDRNSE++EE+KNSV+PIP+VLKSSAEN S++ I NRC          DI
Sbjct: 132  KSVRSFLPPKEDRNSEVLEESKNSVIPIPDVLKSSAENYSDKRIVNRCGSSQESDSDEDI 191

Query: 850  VNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKDNELEA 671
             ++CDSKDL PA DVKDD+T+E  +D S KLF +GDLLSMHNVGT+NSL K +  NE++A
Sbjct: 192  DDICDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDA 251

Query: 670  EKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHNGSGESILAN 491
            EKES  FQGSS KA L NP      EE+NGGT   +  G+D VS SEES NG GE I  N
Sbjct: 252  EKES--FQGSSAKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGN 303

Query: 490  PALVSATEK-------AXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGASAS 332
            P LVSA+EK       A LAS D V A  ESTKIS A+  SYNSM+ETG ITFDF ASA 
Sbjct: 304  PTLVSASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAP 363

Query: 331  GTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITYSGP 152
            G  GKEE  +GD   IET GM RLEDAPRQSVSSQ HSGLGESSFSA  SLP LI+YSGP
Sbjct: 364  GASGKEE-PLGDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSGP 422

Query: 151  VAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKAD----RKHKWRQGL 2
            VAY                SFAFPILQ+EW+ SPVRMAKAD    RKHKW+QGL
Sbjct: 423  VAYSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHKWKQGL 476


>XP_006437853.1 hypothetical protein CICLE_v10031644mg, partial [Citrus clementina]
            ESR51093.1 hypothetical protein CICLE_v10031644mg,
            partial [Citrus clementina]
          Length = 410

 Score =  501 bits (1291), Expect = e-172
 Identities = 277/407 (68%), Positives = 309/407 (75%), Gaps = 8/407 (1%)
 Frame = -2

Query: 1390 HTDALCASKSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHS 1211
            H+ A+ AS SNEGVAD LP VTND DG T RKL+RSTSLNDLAKDNEK+VQDLE+P+SHS
Sbjct: 13   HSGAISASNSNEGVADRLPHVTNDVDGCTVRKLERSTSLNDLAKDNEKNVQDLESPNSHS 72

Query: 1210 CGEMESFEDSVFYEDKSVTECELPELIVCYKENTYHVKDICIDEGVRFNDRILFESNVDE 1031
            CGEMESF + VFY DKSVTECELPELIVCYKENTYHVKDICIDEGV  +DRILFES+V  
Sbjct: 73   CGEMESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVG- 131

Query: 1030 KGARAFFSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRCXXXXXXXXXXDI 851
            K  R+F   +EDRNSEL+EE+KNSV+PIP+VLKSSAEN S+E I NRC          DI
Sbjct: 132  KSVRSFLPPKEDRNSELLEESKNSVIPIPDVLKSSAENYSDERIVNRCGSSQESDSDEDI 191

Query: 850  VNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKDNELEA 671
             ++CDSKDL PA DVKDD+T+E  +D S KLF +GDLLSMHNVGT+NSL K +  NE++A
Sbjct: 192  DDICDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDA 251

Query: 670  EKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHNGSGESILAN 491
            EKES  FQGSS KA L NP      EE+NGGT   +  G+D VS SEES NG GE I  N
Sbjct: 252  EKES--FQGSSAKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGN 303

Query: 490  PALVSATEK-------AXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGASAS 332
            P LVSA+EK       A LAS D V A  ESTKIS A+  SYNSM+ETG ITFDF ASA 
Sbjct: 304  PTLVSASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAP 363

Query: 331  GTRGKEEH-QIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFS 194
            G  GKEE  QIGD   IET GM RLEDAPRQSVSSQ HSGLGESSFS
Sbjct: 364  GASGKEEPLQIGDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFS 410


>KDO70271.1 hypothetical protein CISIN_1g0116142mg [Citrus sinensis]
          Length = 406

 Score =  466 bits (1199), Expect = e-159
 Identities = 264/411 (64%), Positives = 297/411 (72%), Gaps = 11/411 (2%)
 Frame = -2

Query: 1201 MESFEDSVFYEDKSVTECELPELIVCYKENTYHVKDICIDEGVRFNDRILFESNVDEKGA 1022
            MESF + VFY DKSVTECELPELIVCYKENTYHVKDICIDEGV  +DRILFES+V  K  
Sbjct: 1    MESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVG-KSV 59

Query: 1021 RAFFSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRCXXXXXXXXXXDIVNL 842
            R+F   +EDRNSE++EE+KNSV+PIP+VLKSSAEN S++ I NRC          DI ++
Sbjct: 60   RSFLPPKEDRNSEVLEESKNSVIPIPDVLKSSAENYSDKRIVNRCGSSQESDSDEDIDDI 119

Query: 841  CDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKDNELEAEKE 662
            CDSKDL PA DVKDD+T+E  +D S KLF +GDLLSMHNVGT+NSL K +  NE++AEKE
Sbjct: 120  CDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKE 179

Query: 661  SLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHNGSGESILANPAL 482
            S  FQGSS KA L NP      EE+NGGT   +  G+D VS SEES NG GE I  NP L
Sbjct: 180  S--FQGSSAKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGNPTL 231

Query: 481  VSATEK-------AXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGASASGTR 323
            VSA+EK       A LAS D V A  ESTKIS A+  SYNSM+ETG ITFDF ASA G  
Sbjct: 232  VSASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGAS 291

Query: 322  GKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITYSGPVAY 143
            GKEE  +GD   IET GM RLEDAPRQSVSSQ HSGLGESSFSA  SLP LI+YSGPVAY
Sbjct: 292  GKEE-PLGDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSGPVAY 350

Query: 142  XXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKAD----RKHKWRQGL 2
                            SFAFPILQ+EW+ SPVRMAKAD    RKHKW+QGL
Sbjct: 351  SGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHKWKQGL 401


>XP_006437852.1 hypothetical protein CICLE_v10031644mg, partial [Citrus clementina]
            ESR51092.1 hypothetical protein CICLE_v10031644mg,
            partial [Citrus clementina]
          Length = 335

 Score =  402 bits (1033), Expect = e-135
 Identities = 228/344 (66%), Positives = 254/344 (73%), Gaps = 8/344 (2%)
 Frame = -2

Query: 1201 MESFEDSVFYEDKSVTECELPELIVCYKENTYHVKDICIDEGVRFNDRILFESNVDEKGA 1022
            MESF + VFY DKSVTECELPELIVCYKENTYHVKDICIDEGV  +DRILFES+V  K  
Sbjct: 1    MESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVG-KSV 59

Query: 1021 RAFFSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRCXXXXXXXXXXDIVNL 842
            R+F   +EDRNSEL+EE+KNSV+PIP+VLKSSAEN S+E I NRC          DI ++
Sbjct: 60   RSFLPPKEDRNSELLEESKNSVIPIPDVLKSSAENYSDERIVNRCGSSQESDSDEDIDDI 119

Query: 841  CDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKDNELEAEKE 662
            CDSKDL PA DVKDD+T+E  +D S KLF +GDLLSMHNVGT+NSL K +  NE++AEKE
Sbjct: 120  CDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKE 179

Query: 661  SLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHNGSGESILANPAL 482
            S  FQGSS KA L NP      EE+NGGT   +  G+D VS SEES NG GE I  NP L
Sbjct: 180  S--FQGSSAKAALANP------EEANGGTAEEILTGADFVSASEESQNGCGEGISGNPTL 231

Query: 481  VSATEK-------AXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGASASGTR 323
            VSA+EK       A LAS D V A  ESTKIS A+  SYNSM+ETG ITFDF ASA G  
Sbjct: 232  VSASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGAS 291

Query: 322  GKEEH-QIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFS 194
            GKEE  QIGD   IET GM RLEDAPRQSVSSQ HSGLGESSFS
Sbjct: 292  GKEEPLQIGDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFS 335


>XP_017971961.1 PREDICTED: uncharacterized protein LOC18610175 isoform X3 [Theobroma
            cacao]
          Length = 527

 Score =  357 bits (917), Expect = e-114
 Identities = 228/539 (42%), Positives = 294/539 (54%), Gaps = 31/539 (5%)
 Frame = -2

Query: 1525 MKLDSEQVFPHSTLGHKPDSEHFDY---------HENVLDSAGRKSENSIVKEYHTDALC 1373
            MKLD+EQV  HST GHK DS+ + +          +  LDS G  +E  +VKE     + 
Sbjct: 1    MKLDNEQVLCHSTTGHKSDSKPYSFLADTKPFENKDKPLDSTGLNAEG-VVKENQNGVMH 59

Query: 1372 ASKSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMES 1193
              K N+G +D    + N   GW A KLD S S+ND A  NEK+V+D    +S S   M+S
Sbjct: 60   DIKGNDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDS 119

Query: 1192 FEDSVFYEDKSVTECELPELIVCYKENTYH-VKDICIDEGVRFNDRILFESNVDEKGARA 1016
            F++SVFY DKSV ECELPEL+VCYKE+TYH VKDICIDEGV   D+ LFE+ +DEK    
Sbjct: 120  FQNSVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCN 179

Query: 1015 FFSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRC----------------X 884
            F   E++++S+LM E   + M + +V  S  EN S +DI N C                 
Sbjct: 180  FLPSEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSL 239

Query: 883  XXXXXXXXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSL 704
                      I N CDSKDL     VK D+   + DD S++LF++G+LLSM  +   NS 
Sbjct: 240  SLEKNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSE 299

Query: 703  CKPSKDNELEAEKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEES 524
               S       E++S  FQ SS K  +V P L    EES    E  +     LVS +EE 
Sbjct: 300  AMSSDCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEEL 357

Query: 523  HNGSGESILANPALVSATEKAXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFG 344
             +G GE+IL +PA VS  E              EST  S+ + +SY++ +ETG ITF+  
Sbjct: 358  DSGKGEAILISPAQVSTPE--------------ESTSSSLVNEVSYDNKLETGSITFNLD 403

Query: 343  ASASGTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLIT 164
            +SA  +   E H   D   + T   P+LE A  QS+S+ +  G+GESSFSA   + GLI+
Sbjct: 404  SSAPTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLVTGLIS 463

Query: 163  YSGPVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKADRKH-----KWRQGL 2
            YSGPVAY                SFAFPILQSEWN SPVRMAKADR+H      WR GL
Sbjct: 464  YSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGL 522


>EOY01581.1 18S pre-ribosomal assembly protein gar2-related, putative isoform 1
            [Theobroma cacao]
          Length = 527

 Score =  356 bits (913), Expect = e-114
 Identities = 227/539 (42%), Positives = 294/539 (54%), Gaps = 31/539 (5%)
 Frame = -2

Query: 1525 MKLDSEQVFPHSTLGHKPDSEHFDY---------HENVLDSAGRKSENSIVKEYHTDALC 1373
            MKLD+EQV  HS  GHK DS+ + +          +  LDS G  +E  +VKE     + 
Sbjct: 1    MKLDNEQVLCHSITGHKSDSKPYSFLADTKPFENKDKPLDSTGLNAEG-VVKENQNGVMH 59

Query: 1372 ASKSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMES 1193
              K N+G +D    + N   GW A KLD S S+ND A  NEK+V+D    +S S   M+S
Sbjct: 60   DIKGNDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDS 119

Query: 1192 FEDSVFYEDKSVTECELPELIVCYKENTYH-VKDICIDEGVRFNDRILFESNVDEKGARA 1016
            F++SVFY DKSV ECELPEL+VCYKE+TYH VKDICIDEGV   D+ LFE+ +DEK    
Sbjct: 120  FQNSVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCN 179

Query: 1015 FFSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRC----------------X 884
            F   E++++S+LM E   + M + +V  S  EN S +DI N C                 
Sbjct: 180  FLPSEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSL 239

Query: 883  XXXXXXXXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSL 704
                      I N CDSKDL     VK D+   + DD S++LF++G+LLSM  +   NS 
Sbjct: 240  SLEKNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSE 299

Query: 703  CKPSKDNELEAEKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEES 524
               S       E++S  FQ SS K  +V P L    EES    E  +     LVS +EE 
Sbjct: 300  AMSSDCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEEL 357

Query: 523  HNGSGESILANPALVSATEKAXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFG 344
             +G GE+IL +PA VS               S+EST  S+ + +SY++ +ETG ITF+  
Sbjct: 358  DSGKGEAILISPAQVS--------------TSEESTSSSLVNEVSYDNKLETGSITFNLD 403

Query: 343  ASASGTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLIT 164
            +SA  +   E H   D   + T   P+LE A  QS+S+ +  G+GESSFSA   + GLI+
Sbjct: 404  SSAPTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLVTGLIS 463

Query: 163  YSGPVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKADRKH-----KWRQGL 2
            YSGPVAY                SFAFPILQSEWN SPVRMAKADR+H      WR GL
Sbjct: 464  YSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGL 522


>XP_007045751.2 PREDICTED: uncharacterized protein LOC18610175 isoform X1 [Theobroma
            cacao]
          Length = 543

 Score =  352 bits (903), Expect = e-112
 Identities = 225/536 (41%), Positives = 291/536 (54%), Gaps = 31/536 (5%)
 Frame = -2

Query: 1516 DSEQVFPHSTLGHKPDSEHFDY---------HENVLDSAGRKSENSIVKEYHTDALCASK 1364
            D+EQV  HST GHK DS+ + +          +  LDS G  +E  +VKE     +   K
Sbjct: 20   DNEQVLCHSTTGHKSDSKPYSFLADTKPFENKDKPLDSTGLNAEG-VVKENQNGVMHDIK 78

Query: 1363 SNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMESFED 1184
             N+G +D    + N   GW A KLD S S+ND A  NEK+V+D    +S S   M+SF++
Sbjct: 79   GNDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQN 138

Query: 1183 SVFYEDKSVTECELPELIVCYKENTYH-VKDICIDEGVRFNDRILFESNVDEKGARAFFS 1007
            SVFY DKSV ECELPEL+VCYKE+TYH VKDICIDEGV   D+ LFE+ +DEK    F  
Sbjct: 139  SVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLP 198

Query: 1006 LEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRC----------------XXXX 875
             E++++S+LM E   + M + +V  S  EN S +DI N C                    
Sbjct: 199  SEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLE 258

Query: 874  XXXXXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKP 695
                   I N CDSKDL     VK D+   + DD S++LF++G+LLSM  +   NS    
Sbjct: 259  KNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMS 318

Query: 694  SKDNELEAEKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHNG 515
            S       E++S  FQ SS K  +V P L    EES    E  +     LVS +EE  +G
Sbjct: 319  SDCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSG 376

Query: 514  SGESILANPALVSATEKAXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGASA 335
             GE+IL +PA VS  E              EST  S+ + +SY++ +ETG ITF+  +SA
Sbjct: 377  KGEAILISPAQVSTPE--------------ESTSSSLVNEVSYDNKLETGSITFNLDSSA 422

Query: 334  SGTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITYSG 155
              +   E H   D   + T   P+LE A  QS+S+ +  G+GESSFSA   + GLI+YSG
Sbjct: 423  PTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLVTGLISYSG 482

Query: 154  PVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKADRKH-----KWRQGL 2
            PVAY                SFAFPILQSEWN SPVRMAKADR+H      WR GL
Sbjct: 483  PVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGL 538


>XP_017971960.1 PREDICTED: uncharacterized protein LOC18610175 isoform X2 [Theobroma
            cacao]
          Length = 538

 Score =  350 bits (897), Expect = e-111
 Identities = 224/535 (41%), Positives = 290/535 (54%), Gaps = 31/535 (5%)
 Frame = -2

Query: 1513 SEQVFPHSTLGHKPDSEHFDY---------HENVLDSAGRKSENSIVKEYHTDALCASKS 1361
            +EQV  HST GHK DS+ + +          +  LDS G  +E  +VKE     +   K 
Sbjct: 16   NEQVLCHSTTGHKSDSKPYSFLADTKPFENKDKPLDSTGLNAEG-VVKENQNGVMHDIKG 74

Query: 1360 NEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMESFEDS 1181
            N+G +D    + N   GW A KLD S S+ND A  NEK+V+D    +S S   M+SF++S
Sbjct: 75   NDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNS 134

Query: 1180 VFYEDKSVTECELPELIVCYKENTYH-VKDICIDEGVRFNDRILFESNVDEKGARAFFSL 1004
            VFY DKSV ECELPEL+VCYKE+TYH VKDICIDEGV   D+ LFE+ +DEK    F   
Sbjct: 135  VFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPS 194

Query: 1003 EEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRC----------------XXXXX 872
            E++++S+LM E   + M + +V  S  EN S +DI N C                     
Sbjct: 195  EKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEK 254

Query: 871  XXXXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPS 692
                  I N CDSKDL     VK D+   + DD S++LF++G+LLSM  +   NS    S
Sbjct: 255  NESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSS 314

Query: 691  KDNELEAEKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHNGS 512
                   E++S  FQ SS K  +V P L    EES    E  +     LVS +EE  +G 
Sbjct: 315  DCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGK 372

Query: 511  GESILANPALVSATEKAXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGASAS 332
            GE+IL +PA VS  E              EST  S+ + +SY++ +ETG ITF+  +SA 
Sbjct: 373  GEAILISPAQVSTPE--------------ESTSSSLVNEVSYDNKLETGSITFNLDSSAP 418

Query: 331  GTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITYSGP 152
             +   E H   D   + T   P+LE A  QS+S+ +  G+GESSFSA   + GLI+YSGP
Sbjct: 419  TSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLVTGLISYSGP 478

Query: 151  VAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKADRKH-----KWRQGL 2
            VAY                SFAFPILQSEWN SPVRMAKADR+H      WR GL
Sbjct: 479  VAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGL 533


>XP_006437854.1 hypothetical protein CICLE_v10031644mg [Citrus clementina] ESR51094.1
            hypothetical protein CICLE_v10031644mg [Citrus
            clementina]
          Length = 297

 Score =  341 bits (874), Expect = e-111
 Identities = 174/246 (70%), Positives = 200/246 (81%)
 Frame = -2

Query: 1390 HTDALCASKSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHS 1211
            H+ A+ AS SNEGVAD LP VTND DG T RKL+RSTSLNDLAKDNEK+VQDLE+P+SHS
Sbjct: 13   HSGAISASNSNEGVADRLPHVTNDVDGCTVRKLERSTSLNDLAKDNEKNVQDLESPNSHS 72

Query: 1210 CGEMESFEDSVFYEDKSVTECELPELIVCYKENTYHVKDICIDEGVRFNDRILFESNVDE 1031
            CGEMESF + VFY DKSVTECELPELIVCYKENTYHVKDICIDEGV  +DRILFES+V  
Sbjct: 73   CGEMESFREPVFYMDKSVTECELPELIVCYKENTYHVKDICIDEGVHSHDRILFESDVG- 131

Query: 1030 KGARAFFSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRCXXXXXXXXXXDI 851
            K  R+F   +EDRNSEL+EE+KNSV+PIP+VLKSSAEN S+E I NRC          DI
Sbjct: 132  KSVRSFLPPKEDRNSELLEESKNSVIPIPDVLKSSAENYSDERIVNRCGSSQESDSDEDI 191

Query: 850  VNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKDNELEA 671
             ++CDSKDL PA DVKDD+T+E  +D S KLF +GDLLSMHNVGT+NSL K +  NE++A
Sbjct: 192  DDICDSKDLRPAGDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDA 251

Query: 670  EKESLQ 653
            EKES Q
Sbjct: 252  EKESFQ 257


>EOY01582.1 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] EOY01583.1 18S pre-ribosomal assembly
            protein gar2-related, putative isoform 2 [Theobroma
            cacao] EOY01584.1 18S pre-ribosomal assembly protein
            gar2-related, putative isoform 2 [Theobroma cacao]
          Length = 470

 Score =  327 bits (838), Expect = e-103
 Identities = 205/477 (42%), Positives = 264/477 (55%), Gaps = 22/477 (4%)
 Frame = -2

Query: 1366 KSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMESFE 1187
            K N+G +D    + N   GW A KLD S S+ND A  NEK+V+D    +S S   M+SF+
Sbjct: 5    KGNDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQ 64

Query: 1186 DSVFYEDKSVTECELPELIVCYKENTYH-VKDICIDEGVRFNDRILFESNVDEKGARAFF 1010
            +SVFY DKSV ECELPEL+VCYKE+TYH VKDICIDEGV   D+ LFE+ +DEK    F 
Sbjct: 65   NSVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFL 124

Query: 1009 SLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRC----------------XXX 878
              E++++S+LM E   + M + +V  S  EN S +DI N C                   
Sbjct: 125  PSEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSL 184

Query: 877  XXXXXXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCK 698
                    I N CDSKDL     VK D+   + DD S++LF++G+LLSM  +   NS   
Sbjct: 185  EKNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAM 244

Query: 697  PSKDNELEAEKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHN 518
             S       E++S  FQ SS K  +V P L    EES    E  +     LVS +EE  +
Sbjct: 245  SSDCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDS 302

Query: 517  GSGESILANPALVSATEKAXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGAS 338
            G GE+IL +PA VS               S+EST  S+ + +SY++ +ETG ITF+  +S
Sbjct: 303  GKGEAILISPAQVS--------------TSEESTSSSLVNEVSYDNKLETGSITFNLDSS 348

Query: 337  ASGTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITYS 158
            A  +   E H   D   + T   P+LE A  QS+S+ +  G+GESSFSA   + GLI+YS
Sbjct: 349  APTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLVTGLISYS 408

Query: 157  GPVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKADRKH-----KWRQGL 2
            GPVAY                SFAFPILQSEWN SPVRMAKADR+H      WR GL
Sbjct: 409  GPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGL 465


>XP_007045750.2 PREDICTED: uncharacterized protein LOC18610175 isoform X4 [Theobroma
            cacao] XP_007045752.2 PREDICTED: uncharacterized protein
            LOC18610175 isoform X4 [Theobroma cacao]
          Length = 470

 Score =  326 bits (836), Expect = e-103
 Identities = 205/477 (42%), Positives = 263/477 (55%), Gaps = 22/477 (4%)
 Frame = -2

Query: 1366 KSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMESFE 1187
            K N+G +D    + N   GW A KLD S S+ND A  NEK+V+D    +S S   M+SF+
Sbjct: 5    KGNDGDSDPSLYLDNTRGGWPALKLDCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQ 64

Query: 1186 DSVFYEDKSVTECELPELIVCYKENTYH-VKDICIDEGVRFNDRILFESNVDEKGARAFF 1010
            +SVFY DKSV ECELPEL+VCYKE+TYH VKDICIDEGV   D+ LFE+ +DEK    F 
Sbjct: 65   NSVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFL 124

Query: 1009 SLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRC----------------XXX 878
              E++++S+LM E   + M + +V  S  EN S +DI N C                   
Sbjct: 125  PSEKEQDSQLMTEKLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSL 184

Query: 877  XXXXXXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCK 698
                    I N CDSKDL     VK D+   + DD S++LF++G+LLSM  +   NS   
Sbjct: 185  EKNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAM 244

Query: 697  PSKDNELEAEKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHN 518
             S       E++S  FQ SS K  +V P L    EES    E  +     LVS +EE  +
Sbjct: 245  SSDCKSDGIEQQS--FQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDS 302

Query: 517  GSGESILANPALVSATEKAXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGAS 338
            G GE+IL +PA VS  E              EST  S+ + +SY++ +ETG ITF+  +S
Sbjct: 303  GKGEAILISPAQVSTPE--------------ESTSSSLVNEVSYDNKLETGSITFNLDSS 348

Query: 337  ASGTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITYS 158
            A  +   E H   D   + T   P+LE A  QS+S+ +  G+GESSFSA   + GLI+YS
Sbjct: 349  APTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLVTGLISYS 408

Query: 157  GPVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKADRKH-----KWRQGL 2
            GPVAY                SFAFPILQSEWN SPVRMAKADR+H      WR GL
Sbjct: 409  GPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGL 465


>XP_012080460.1 PREDICTED: uncharacterized protein LOC105640684 isoform X2 [Jatropha
            curcas] XP_012080461.1 PREDICTED: uncharacterized protein
            LOC105640684 isoform X2 [Jatropha curcas] XP_012080462.1
            PREDICTED: uncharacterized protein LOC105640684 isoform
            X2 [Jatropha curcas] XP_012080463.1 PREDICTED:
            uncharacterized protein LOC105640684 isoform X2 [Jatropha
            curcas] XP_012080464.1 PREDICTED: uncharacterized protein
            LOC105640684 isoform X2 [Jatropha curcas] XP_012080465.1
            PREDICTED: uncharacterized protein LOC105640684 isoform
            X2 [Jatropha curcas] XP_012080466.1 PREDICTED:
            uncharacterized protein LOC105640684 isoform X2 [Jatropha
            curcas] KDP31404.1 hypothetical protein JCGZ_11780
            [Jatropha curcas]
          Length = 531

 Score =  322 bits (826), Expect = e-101
 Identities = 228/555 (41%), Positives = 303/555 (54%), Gaps = 47/555 (8%)
 Frame = -2

Query: 1525 MKLDSEQVFPHSTLGHKPDSEHFDYHENVLDSAGRKSENSIVKEYHTDALCASKSNEGVA 1346
            MKLD EQV  H T+ HKP S+HF      LDS G KS N IV E    A C  K  E  +
Sbjct: 1    MKLDGEQVLCHGTIDHKPGSKHFGCSNIALDSTGLKSGNGIVNEEQNGAFCDLKGRESNS 60

Query: 1345 DHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPD--SHSCGEMESFE-DSVF 1175
            DHL    ND + WTA KLD S   + L  DNEK+V+D  AP   S S  ++ESFE DSVF
Sbjct: 61   DHLQYTVNDENNWTASKLDSSMRADALTDDNEKEVRDFVAPIPLSLSSLKVESFEGDSVF 120

Query: 1174 YEDKSVTECELPELIVCYKENTYHV-KDICIDEGVRFNDRILFESNVDEKGARAFFSLEE 998
            Y DK+V E ELPEL+VCYKENTYHV KDICIDEGV   D+ LF++ +DEK  R     E+
Sbjct: 121  YVDKNVMEPELPELVVCYKENTYHVIKDICIDEGVPSKDKFLFDT-IDEKNLRTLLFHEK 179

Query: 997  DRNSELMEETKNS-------------------VMPIPNVLKSSAENDSNEDIANRCXXXX 875
             RNSE+ +ET +                     +PIP+V  SSAEN S  +I        
Sbjct: 180  HRNSEVRKETADQDIFIPESLKSLPEDEKSALDLPIPDVFISSAENGSKNEI-------- 231

Query: 874  XXXXXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKP 695
                     +L DS++      ++DD+ +EI +  S+++FS+G+LLSM  VGTE S  K 
Sbjct: 232  ---------SLHDSEEFMTTGKIEDDTMEEIANGKSKEIFSLGELLSMPEVGTELSQPKF 282

Query: 694  SKDNELEAEKESLQFQGSSGKATLVNPALACPAEESNG------------GTEVVLSRGS 551
            S D+  EA+++ +Q      + T++  A +C  E  NG              EV      
Sbjct: 283  SHDSMHEAKQQPIQ---RPSENTILATASSCD-EAKNGNELTSFVRPMVPAAEVSDCHHD 338

Query: 550  DLVSTSEESHNGSGESILANPALVSATEKA-------XLASTDLVFASDESTKISIADNI 392
            + +S ++   +   E++LA+PAL SAT+++        LAS +L  +  EST IS    +
Sbjct: 339  EEISRTKALDHSYDEAVLASPALNSATQESEKVCEGEKLASHNL-SSERESTNIS-GCGL 396

Query: 391  SYNSMMETGIITFDFGASASGTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGL 212
            + NS +++  I F   ASA    G+E+ Q G    + +    RLE+   +  +SQ+  G+
Sbjct: 397  ANNSNVKSESINFYTPASA----GEEDSQNGGSENLNSRS-SRLEETNTEPCTSQLQHGI 451

Query: 211  GESSFSAPVSLPGLITYSGPVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKA 32
            GESSFSA   L GLI+YSGP+AY                SFAFPILQSEWNSSPVRMAKA
Sbjct: 452  GESSFSAAGPLSGLISYSGPIAYSGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKA 511

Query: 31   DRK-----HKWRQGL 2
            DR+       W+QGL
Sbjct: 512  DRRRFQKQRSWKQGL 526


>XP_012080467.1 PREDICTED: uncharacterized protein LOC105640684 isoform X3 [Jatropha
            curcas] XP_012080468.1 PREDICTED: uncharacterized protein
            LOC105640684 isoform X3 [Jatropha curcas] XP_012080469.1
            PREDICTED: uncharacterized protein LOC105640684 isoform
            X3 [Jatropha curcas]
          Length = 531

 Score =  317 bits (812), Expect = 8e-99
 Identities = 225/552 (40%), Positives = 300/552 (54%), Gaps = 47/552 (8%)
 Frame = -2

Query: 1516 DSEQVFPHSTLGHKPDSEHFDYHENVLDSAGRKSENSIVKEYHTDALCASKSNEGVADHL 1337
            D EQV  H T+ HKP S+HF      LDS G KS N IV E    A C  K  E  +DHL
Sbjct: 4    DGEQVLCHGTIDHKPGSKHFGCSNIALDSTGLKSGNGIVNEEQNGAFCDLKGRESNSDHL 63

Query: 1336 PQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPD--SHSCGEMESFE-DSVFYED 1166
                ND + WTA KLD S   + L  DNEK+V+D  AP   S S  ++ESFE DSVFY D
Sbjct: 64   QYTVNDENNWTASKLDSSMRADALTDDNEKEVRDFVAPIPLSLSSLKVESFEGDSVFYVD 123

Query: 1165 KSVTECELPELIVCYKENTYHV-KDICIDEGVRFNDRILFESNVDEKGARAFFSLEEDRN 989
            K+V E ELPEL+VCYKENTYHV KDICIDEGV   D+ LF++ +DEK  R     E+ RN
Sbjct: 124  KNVMEPELPELVVCYKENTYHVIKDICIDEGVPSKDKFLFDT-IDEKNLRTLLFHEKHRN 182

Query: 988  SELMEETKNS-------------------VMPIPNVLKSSAENDSNEDIANRCXXXXXXX 866
            SE+ +ET +                     +PIP+V  SSAEN S  +I           
Sbjct: 183  SEVRKETADQDIFIPESLKSLPEDEKSALDLPIPDVFISSAENGSKNEI----------- 231

Query: 865  XXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKD 686
                  +L DS++      ++DD+ +EI +  S+++FS+G+LLSM  VGTE S  K S D
Sbjct: 232  ------SLHDSEEFMTTGKIEDDTMEEIANGKSKEIFSLGELLSMPEVGTELSQPKFSHD 285

Query: 685  NELEAEKESLQFQGSSGKATLVNPALACPAEESNG------------GTEVVLSRGSDLV 542
            +  EA+++ +Q      + T++  A +C  E  NG              EV      + +
Sbjct: 286  SMHEAKQQPIQ---RPSENTILATASSCD-EAKNGNELTSFVRPMVPAAEVSDCHHDEEI 341

Query: 541  STSEESHNGSGESILANPALVSATEKA-------XLASTDLVFASDESTKISIADNISYN 383
            S ++   +   E++LA+PAL SAT+++        LAS +L  +  EST IS    ++ N
Sbjct: 342  SRTKALDHSYDEAVLASPALNSATQESEKVCEGEKLASHNL-SSERESTNIS-GCGLANN 399

Query: 382  SMMETGIITFDFGASASGTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGES 203
            S +++  I F   ASA    G+E+ Q G    + +    RLE+   +  +SQ+  G+GES
Sbjct: 400  SNVKSESINFYTPASA----GEEDSQNGGSENLNSRS-SRLEETNTEPCTSQLQHGIGES 454

Query: 202  SFSAPVSLPGLITYSGPVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKADRK 23
            SFSA   L GLI+YSGP+AY                SFAFPILQSEWNSSPVRMAKADR+
Sbjct: 455  SFSAAGPLSGLISYSGPIAYSGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRR 514

Query: 22   -----HKWRQGL 2
                   W+QGL
Sbjct: 515  RFQKQRSWKQGL 526


>OMO72168.1 hypothetical protein COLO4_27800 [Corchorus olitorius]
          Length = 503

 Score =  315 bits (808), Expect = 1e-98
 Identities = 216/530 (40%), Positives = 289/530 (54%), Gaps = 22/530 (4%)
 Frame = -2

Query: 1525 MKLDSEQVFPHSTLGHKPDSEH---------FDYHENVLDSAGRKSENSIVKEYHTDALC 1373
            MKLD+EQV  HS +G+  DS+          ++  E  LDSA   ++  IVKE     + 
Sbjct: 1    MKLDNEQVLCHSNIGYNSDSKPVSFIADTKTYENKEKPLDSAALNADG-IVKEKQNGVMR 59

Query: 1372 ASKSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMES 1193
              K N+G +D L  + N  DGW A KLD S  +N+    NEK+ +D    DSHS  +M+S
Sbjct: 60   DIKGNDGDSDSLC-LENTRDGWPASKLDSSMHVNEFGNGNEKEFRDFVTSDSHSSKKMDS 118

Query: 1192 FEDSVFYEDKSVTECELPELIVCYKENTYH-VKDICIDEGVRFNDRILFESNVDEKGARA 1016
             + SVFY DKSV EC+LPEL+VCYKENTYH VKDICIDEGV   D+ LFES+++EK    
Sbjct: 119  LQGSVFYLDKSVMECDLPELVVCYKENTYHVVKDICIDEGVPTQDKFLFESDMNEKNNCN 178

Query: 1015 FFSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRCXXXXXXXXXXDIV---- 848
            F       + +L+EE ++  +PI     SS E+ S ++I N C                 
Sbjct: 179  FLP-----SCKLVEEKQD--IPI-----SSPEDQSGKNIDNGCDFNEKLDADACRQDESN 226

Query: 847  --NLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKDNELE 674
              N CD +D      VKD+    I DD S++LF++G+LLSM  + T  S    S+     
Sbjct: 227  KGNQCDFEDFMMKRKVKDEEMKTIPDDLSKELFTLGELLSMTELSTVTSKAMSSECKSDG 286

Query: 673  AEKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHNGSGESILA 494
             E++S+  Q SS K   VNP     AEESN  TE +L     L+S + ES NG       
Sbjct: 287  IEQQSI--QSSSEKEVNVNPPSVFVAEESNNNTEAMLD-APGLISAAGESDNGK------ 337

Query: 493  NPALVSATEKAXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGASASGTRGKE 314
                    E A   ST  V  S+EST  ++++ +S ++ +ET  ITF+FG+SA  T  K+
Sbjct: 338  --------EDAIPISTSQVSVSEESTNNTLSNEVSDDNRLETESITFNFGSSAP-TNSKD 388

Query: 313  EHQIG-DFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITYSGPVAYXX 137
            E +   +    ET   P+LED   Q +S+ +  G GE+SFSA   + GLI+YSGP+AY  
Sbjct: 389  ECRPNLNCELPETGTTPKLEDTADQPISNILQRGTGETSFSASGPVTGLISYSGPIAYSG 448

Query: 136  XXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKADRKH-----KWRQGL 2
                          SFAFP+LQSEWNSSPVRMAKADR+H      WR GL
Sbjct: 449  SLSLRSDSSTTSTRSFAFPVLQSEWNSSPVRMAKADRRHYRKHRGWRHGL 498


>XP_012080459.1 PREDICTED: uncharacterized protein LOC105640684 isoform X1 [Jatropha
            curcas]
          Length = 555

 Score =  317 bits (812), Expect = 2e-98
 Identities = 225/552 (40%), Positives = 300/552 (54%), Gaps = 47/552 (8%)
 Frame = -2

Query: 1516 DSEQVFPHSTLGHKPDSEHFDYHENVLDSAGRKSENSIVKEYHTDALCASKSNEGVADHL 1337
            D EQV  H T+ HKP S+HF      LDS G KS N IV E    A C  K  E  +DHL
Sbjct: 28   DGEQVLCHGTIDHKPGSKHFGCSNIALDSTGLKSGNGIVNEEQNGAFCDLKGRESNSDHL 87

Query: 1336 PQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPD--SHSCGEMESFE-DSVFYED 1166
                ND + WTA KLD S   + L  DNEK+V+D  AP   S S  ++ESFE DSVFY D
Sbjct: 88   QYTVNDENNWTASKLDSSMRADALTDDNEKEVRDFVAPIPLSLSSLKVESFEGDSVFYVD 147

Query: 1165 KSVTECELPELIVCYKENTYHV-KDICIDEGVRFNDRILFESNVDEKGARAFFSLEEDRN 989
            K+V E ELPEL+VCYKENTYHV KDICIDEGV   D+ LF++ +DEK  R     E+ RN
Sbjct: 148  KNVMEPELPELVVCYKENTYHVIKDICIDEGVPSKDKFLFDT-IDEKNLRTLLFHEKHRN 206

Query: 988  SELMEETKNS-------------------VMPIPNVLKSSAENDSNEDIANRCXXXXXXX 866
            SE+ +ET +                     +PIP+V  SSAEN S  +I           
Sbjct: 207  SEVRKETADQDIFIPESLKSLPEDEKSALDLPIPDVFISSAENGSKNEI----------- 255

Query: 865  XXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKD 686
                  +L DS++      ++DD+ +EI +  S+++FS+G+LLSM  VGTE S  K S D
Sbjct: 256  ------SLHDSEEFMTTGKIEDDTMEEIANGKSKEIFSLGELLSMPEVGTELSQPKFSHD 309

Query: 685  NELEAEKESLQFQGSSGKATLVNPALACPAEESNG------------GTEVVLSRGSDLV 542
            +  EA+++ +Q      + T++  A +C  E  NG              EV      + +
Sbjct: 310  SMHEAKQQPIQ---RPSENTILATASSCD-EAKNGNELTSFVRPMVPAAEVSDCHHDEEI 365

Query: 541  STSEESHNGSGESILANPALVSATEKA-------XLASTDLVFASDESTKISIADNISYN 383
            S ++   +   E++LA+PAL SAT+++        LAS +L  +  EST IS    ++ N
Sbjct: 366  SRTKALDHSYDEAVLASPALNSATQESEKVCEGEKLASHNL-SSERESTNIS-GCGLANN 423

Query: 382  SMMETGIITFDFGASASGTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGES 203
            S +++  I F   ASA    G+E+ Q G    + +    RLE+   +  +SQ+  G+GES
Sbjct: 424  SNVKSESINFYTPASA----GEEDSQNGGSENLNSRS-SRLEETNTEPCTSQLQHGIGES 478

Query: 202  SFSAPVSLPGLITYSGPVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKADRK 23
            SFSA   L GLI+YSGP+AY                SFAFPILQSEWNSSPVRMAKADR+
Sbjct: 479  SFSAAGPLSGLISYSGPIAYSGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRR 538

Query: 22   -----HKWRQGL 2
                   W+QGL
Sbjct: 539  RFQKQRSWKQGL 550


>GAV79538.1 hypothetical protein CFOL_v3_23003 [Cephalotus follicularis]
          Length = 475

 Score =  313 bits (802), Expect = 5e-98
 Identities = 218/526 (41%), Positives = 287/526 (54%), Gaps = 18/526 (3%)
 Frame = -2

Query: 1525 MKLDSEQVFPHSTLGHKPDSEHFDYHENVLDSAGRKSENSIVKEYHTDALCASKSNEGVA 1346
            MK D+EQV  HSTL  +PDS+ F+YH   +DS G KSEN ++K+     L   K  EG A
Sbjct: 1    MKFDNEQVLCHSTLARRPDSKPFEYHGKAMDSTGLKSENGVMKDNQKRVLSFLKGKEGNA 60

Query: 1345 DHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMESFE-DSVFYE 1169
            + LP   N++      KLD     N    DN                  ESFE  SVFY 
Sbjct: 61   ECLPCERNES------KLDCPVVANYSTNDN------------------ESFEKHSVFYF 96

Query: 1168 DKSVTECELPELIVCYKENTYH-VKDICIDEGVRFNDR-ILFESNVDEKGARAFFSLEED 995
            ++SV +CELPELI+CYKE+ YH VKDICI+E V   D+ + FES VDEK     F  + D
Sbjct: 97   NRSVMKCELPELILCYKESPYHVVKDICINEDVPSKDKNLFFESGVDEKSV-CTFPPDMD 155

Query: 994  RNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRCXXXXXXXXXXDIVNLCDSKDLTPA 815
            +N E   E K   MPIP  +K+SAENDS++DI ++                 D  DL P 
Sbjct: 156  QNIE-STEGKPFDMPIPVAMKASAENDSDKDINDK----------------YDIPDLMPI 198

Query: 814  EDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLCKPSKDNELEAEKESLQFQGSSG 635
             +V+DD+TD+  +D  ++  S+GD+LSM  + +EN+  K    + +    E L  Q SS 
Sbjct: 199  GEVQDDATDKNANDIPKQKISLGDMLSMEKLHSENTFSKSC--DVVSKNAEQLSVQSSSE 256

Query: 634  KATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESHNGSGESILANPALVSATEKAXL 455
            K    + A    ++ESN              + +EES+N S +  LA+P LVSAT+++  
Sbjct: 257  KTVASSLASLSTSDESNNSG-----------NRTEESNNDSEDLTLASPTLVSATKESDS 305

Query: 454  ASTDLVF-------ASDESTKISIADNISYNSMMETGIITFDF--GASASGTRGKEEHQI 302
               ++VF       AS+ES   S ++++SYNS +ETG ITFDF  GA A+  R KE  QI
Sbjct: 306  GRDEMVFVSPAIVSASEESANSSFSNDLSYNSKVETGSITFDFNSGAPAASDR-KECPQI 364

Query: 301  GDFLCI-ETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITYSGPVAYXXXXXX 125
             +  C+ +T    RLEDA  Q V+SQ     GESSFS    + G I YSGP+AY      
Sbjct: 365  TESECLDDTQSSSRLEDADIQLVTSQTQHSHGESSFSTAGPISGSIIYSGPIAYSGSVSL 424

Query: 124  XXXXXXXXXXSFAFPILQSEWNSSPVRMAKADRKH-----KWRQGL 2
                      SFAFP+LQSEWNSSPVRMAKADR+H      WRQGL
Sbjct: 425  RSDSSTTSTRSFAFPVLQSEWNSSPVRMAKADRRHYRKHRGWRQGL 470


>XP_012464097.1 PREDICTED: uncharacterized protein LOC105783281 [Gossypium raimondii]
            XP_012464099.1 PREDICTED: uncharacterized protein
            LOC105783281 [Gossypium raimondii] KJB80435.1
            hypothetical protein B456_013G097400 [Gossypium
            raimondii] KJB80436.1 hypothetical protein
            B456_013G097400 [Gossypium raimondii]
          Length = 505

 Score =  306 bits (785), Expect = 4e-95
 Identities = 217/539 (40%), Positives = 291/539 (53%), Gaps = 32/539 (5%)
 Frame = -2

Query: 1525 MKLDSEQVFPHSTLGHKPDSEHFDYHENVL--------DSAGRKSENSIVKEYHTDALCA 1370
            MKLD+EQV  HST+G+K DS+ + +  ++           A   S +  VKE     +  
Sbjct: 1    MKLDTEQVICHSTIGYKNDSKPYSFLADIKPFENKEKSSDATELSMDDTVKENQNGVVHD 60

Query: 1369 SKSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMESF 1190
             KS+E  +D      N  D WTA +LD S S++D +  NEK+V+D    +SHS   M+SF
Sbjct: 61   IKSDELDSDFSIYSENTRDEWTASELDCSNSVHDFSNGNEKEVRDFVTFNSHSSKNMDSF 120

Query: 1189 EDSVFYEDKSVTECELPELIVCYKENTYH-VKDICIDEGVRFNDRILFESNVDEKGARAF 1013
            +DSVFY DKSV +CELPEL+VCYKE+TYH VKDICIDEGV   D  LFES+VDEK    F
Sbjct: 121  QDSVFYLDKSVMDCELPELVVCYKESTYHVVKDICIDEGVPTQDMFLFESSVDEKSECNF 180

Query: 1012 FSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRC----------------XX 881
               ++D+++ELM+E   + MP+ ++  S  EN S +DI N C                  
Sbjct: 181  SYPKKDQDNELMKEMSETDMPMQDISFSPEENQSGKDIDNECGSNKKLDADTYMQDIALS 240

Query: 880  XXXXXXXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGT--ENS 707
                     I N  D +DL    D+KDD+ + + +D S++LF++GD+LS+  + T    +
Sbjct: 241  LEENKSNKGIPNEWDPRDLLVTRDMKDDAMEMMSNDGSKELFTLGDILSLPELATLKSEA 300

Query: 706  LCKPSKDNELEAEKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEE 527
            +    K + +E +     F+ SS K  +V    A   EESN    ++LS  + LVST+E 
Sbjct: 301  MSPDCKSDRIEQQ----SFENSSKKEVIV----ASAVEESN---NLILSAPA-LVSTAEG 348

Query: 526  SHNGSGESILANPALVSATEKAXLASTDLVFASDESTKISIADNISYNSMMETGIITFDF 347
            S  G GE+   +PA  SA+ +A   S+ LV                     ETG ITFD 
Sbjct: 349  SDIGKGEATPISPAPASASLEA--TSSGLV--------------------NETGSITFDS 386

Query: 346  GASASGTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLI 167
             +SA  T GK     G    +E     +LE+   Q  SS + SG GESSFSA   L GLI
Sbjct: 387  RSSAP-TSGK-----GSNKPLEAGRTSKLEETADQPFSSNLQSGNGESSFSAAGPLTGLI 440

Query: 166  TYSGPVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKAD----RKHK-WRQG 5
            +YSGP+AY                SFAFPILQSEWNSSPVRMAKAD    R+H+ WRQG
Sbjct: 441  SYSGPIAYSGNLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRQYRRHRGWRQG 499


>KHG21027.1 Formate--tetrahydrofolate ligase [Gossypium arboreum]
          Length = 505

 Score =  305 bits (780), Expect = 2e-94
 Identities = 216/537 (40%), Positives = 289/537 (53%), Gaps = 30/537 (5%)
 Frame = -2

Query: 1525 MKLDSEQVFPHSTLGHKPDSEHFDY--------HENVLDSAGRKSENSIVKEYHTDALCA 1370
            MKLD+EQV  HST+G+K DS+ + +        ++     A   S +  VKE     +  
Sbjct: 1    MKLDNEQVICHSTIGYKNDSKPYSFLVDTKPFENKEKSSDATELSTDDTVKENQNGVMHD 60

Query: 1369 SKSNEGVADHLPQVTNDADGWTARKLDRSTSLNDLAKDNEKDVQDLEAPDSHSCGEMESF 1190
             KS+E  +D      N  D WTA +LD S S++D +  NEK+V+D+   +SHS   M+SF
Sbjct: 61   IKSDELDSDFSIYSENTRDEWTASELDCSNSVHDFSNGNEKEVRDIVTFNSHSSKNMDSF 120

Query: 1189 EDSVFYEDKSVTECELPELIVCYKENTYH-VKDICIDEGVRFNDRILFESNVDEKGARAF 1013
            +DSVFY DKSV +CELPEL+VCYKE+TYH VKDICIDEGV   D  LFES+VDEK    F
Sbjct: 121  QDSVFYLDKSVMDCELPELVVCYKESTYHVVKDICIDEGVPTQDMFLFESSVDEKSECNF 180

Query: 1012 FSLEEDRNSELMEETKNSVMPIPNVLKSSAENDSNEDIANRC----------------XX 881
               ++D+++ELM+E   + +P+ N+  S  EN S +DI N C                  
Sbjct: 181  SYPKKDQDNELMKEMSETDIPMQNISFSPEENQSGKDIDNDCGSNKKLNADTYMQDIALS 240

Query: 880  XXXXXXXXDIVNLCDSKDLTPAEDVKDDSTDEIKDDTSEKLFSIGDLLSMHNVGTENSLC 701
                     I N  D +DL    D+KDD+T+ + ++ S++LF +GD+LS   + T  S  
Sbjct: 241  LEENKSNKGIPNEWDPRDLLVTRDMKDDATEMMSNEGSKELFILGDILSFPELTTLKSEA 300

Query: 700  KPSKDNELEAEKESLQFQGSSGKATLVNPALACPAEESNGGTEVVLSRGSDLVSTSEESH 521
                      E++S  F+ SS K  +V    A   E+SN    ++LS  + L ST+E S 
Sbjct: 301  MSPDFKSDRNEQQS--FENSSKKEVIV----ASEVEDSN---NLILSAPA-LASTAEGSD 350

Query: 520  NGSGESILANPALVSATEKAXLASTDLVFASDESTKISIADNISYNSMMETGIITFDFGA 341
            +G GE+   +PA  SA+ +A   S+ LV                     ETG ITFD  +
Sbjct: 351  SGKGEATPISPAPASASLEA--TSSGLV--------------------NETGSITFDSRS 388

Query: 340  SASGTRGKEEHQIGDFLCIETMGMPRLEDAPRQSVSSQVHSGLGESSFSAPVSLPGLITY 161
            SA  T GK     G    +ET    +LE+   Q  SS + SG GESSFSA   L GLI+Y
Sbjct: 389  SAP-TSGK-----GSSEPLETGRTSKLEETADQPFSSNLQSGNGESSFSAAGPLTGLISY 442

Query: 160  SGPVAYXXXXXXXXXXXXXXXXSFAFPILQSEWNSSPVRMAKAD----RKHK-WRQG 5
            SGP+ Y                SFAFPILQSEWNSSPVRMAKAD    R+H+ WRQG
Sbjct: 443  SGPITYSGNLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADQRQYRRHRGWRQG 499

