BLASTX nr result

ID: Phellodendron21_contig00042726

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Phellodendron21_contig00042726
         (1306 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_006440743.1 hypothetical protein CICLE_v10021474mg [Citrus cl...   286   4e-91
KDO65745.1 hypothetical protein CISIN_1g039495mg, partial [Citru...   286   1e-90
GAV86447.1 RVT_3 domain-containing protein, partial [Cephalotus ...   261   1e-76
EOY22061.1 Uncharacterized protein TCM_014255 [Theobroma cacao]       224   3e-61
XP_017972749.1 PREDICTED: uncharacterized protein LOC108661242 [...   207   3e-59
KDP31341.1 hypothetical protein JCGZ_11717 [Jatropha curcas]          184   1e-48
XP_012080373.1 PREDICTED: uncharacterized protein LOC105640619 [...   184   1e-47
OMP08004.1 Zinc finger, FYVE/PHD-type [Corchorus olitorius]           172   2e-43
KCW83711.1 hypothetical protein EUGRSUZ_B00585 [Eucalyptus grandis]   147   5e-37
XP_010522729.1 PREDICTED: uncharacterized protein LOC104801226 i...   149   6e-36
XP_010522710.1 PREDICTED: uncharacterized protein LOC104801226 i...   149   6e-36
OMO53161.1 Histone deacetylase superfamily [Corchorus capsularis]     148   6e-35
EOY16798.1 Uncharacterized protein TCM_035679 [Theobroma cacao]       137   7e-35
EOY04279.1 Non-LTR retroelement reverse transcriptase-like [Theo...   143   1e-34
XP_018723236.1 PREDICTED: uncharacterized protein LOC104434358 i...   147   1e-34
XP_018723235.1 PREDICTED: uncharacterized protein LOC104434358 i...   147   1e-34
XP_018723234.1 PREDICTED: uncharacterized protein LOC104434358 i...   147   1e-34
OAY52390.1 hypothetical protein MANES_04G079500 [Manihot esculenta]   134   2e-33
EOY19161.1 Polynucleotidyl transferase, putative [Theobroma cacao]    139   3e-33
OAY52615.1 hypothetical protein MANES_04G097300 [Manihot esculenta]   132   1e-32

>XP_006440743.1 hypothetical protein CICLE_v10021474mg [Citrus clementina] ESR53983.1
            hypothetical protein CICLE_v10021474mg [Citrus
            clementina]
          Length = 290

 Score =  286 bits (732), Expect = 4e-91
 Identities = 136/173 (78%), Positives = 144/173 (83%)
 Frame = +2

Query: 653  NMSICWRPPPKNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASEL 832
            N  ICW PPP NWVKLNIEGSSSR  G AGAGG +RDESGKW+LGYSKN+GTS + ASEL
Sbjct: 118  NSRICWMPPPTNWVKLNIEGSSSRAQGSAGAGGIVRDESGKWILGYSKNLGTSNSLASEL 177

Query: 833  WALYLGLKLVWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKL 1012
            WALY GL LVWERGFRKVLVE  SH+AVKCLE P +FLDPNR LI SCR+ L  NWDCKL
Sbjct: 178  WALYHGLNLVWERGFRKVLVECNSHEAVKCLELPASFLDPNRALILSCREYLCRNWDCKL 237

Query: 1013 QRIHREANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGIVQTHSH 1171
            Q I REAN CANWLA HCENQPLG+LAVFDTPP AL PIFQKDS GIVQT SH
Sbjct: 238  QLILREANSCANWLAAHCENQPLGSLAVFDTPPYALAPIFQKDSTGIVQTRSH 290



 Score =  124 bits (310), Expect = 1e-28
 Identities = 65/106 (61%), Positives = 75/106 (70%), Gaps = 1/106 (0%)
 Frame = +2

Query: 2   EATDMDIDMEXXXXXXXXXXXXSKELSRRACGSHNKGDVAPVTEMVLFEPKH-KRCASSD 178
           EATDM +DME            SKEL  +ACGSH KGDV P  E +L EPK  ++  SSD
Sbjct: 15  EATDMVVDMEGGKIVGPVDVVVSKELYGKACGSHEKGDVNPANETMLSEPKQGQKSNSSD 74

Query: 179 LFTKSSERIKFKISSEHPLSKGKLVNAKVDADCPPGFKKIPRLNSR 316
           LFT+ SE+IKFKISS  PLSKGKLVN+ +DADCPPGF+KI R NSR
Sbjct: 75  LFTRRSEKIKFKISSARPLSKGKLVNSMIDADCPPGFEKIHRPNSR 120


>KDO65745.1 hypothetical protein CISIN_1g039495mg, partial [Citrus sinensis]
          Length = 327

 Score =  286 bits (732), Expect = 1e-90
 Identities = 136/173 (78%), Positives = 144/173 (83%)
 Frame = +2

Query: 653  NMSICWRPPPKNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASEL 832
            N  ICW PPP NWVKLNIEGSSSR  G AGAGG +RDESGKW+LGYSKN+GTS + ASEL
Sbjct: 155  NSRICWMPPPTNWVKLNIEGSSSRAQGSAGAGGIVRDESGKWILGYSKNLGTSNSLASEL 214

Query: 833  WALYLGLKLVWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKL 1012
            WALY GL LVWERGFRKVLVE  SH+AVKCLE P +FLDPNR LI SCR+ L  NWDCKL
Sbjct: 215  WALYHGLNLVWERGFRKVLVECNSHEAVKCLELPASFLDPNRALILSCREYLCRNWDCKL 274

Query: 1013 QRIHREANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGIVQTHSH 1171
            Q I REAN CANWLA HCENQPLG+LAVFDTPP AL PIFQKDS GIVQT SH
Sbjct: 275  QLILREANSCANWLAAHCENQPLGSLAVFDTPPYALAPIFQKDSTGIVQTRSH 327



 Score =  122 bits (307), Expect = 5e-28
 Identities = 65/106 (61%), Positives = 74/106 (69%), Gaps = 1/106 (0%)
 Frame = +2

Query: 2   EATDMDIDMEXXXXXXXXXXXXSKELSRRACGSHNKGDVAPVTEMVLFEPKH-KRCASSD 178
           E TDM +DME            SKEL  +ACGSH KGDV P TE +L EPK  ++  SSD
Sbjct: 52  EVTDMVVDMEGGKIVGPVDVVVSKELYGKACGSHEKGDVNPATETMLSEPKQGQKSNSSD 111

Query: 179 LFTKSSERIKFKISSEHPLSKGKLVNAKVDADCPPGFKKIPRLNSR 316
           LFT  SE+IKFKISS  PLSKGKLVN+ +DADCPPGF+KI R NSR
Sbjct: 112 LFTGRSEKIKFKISSARPLSKGKLVNSMIDADCPPGFEKIHRPNSR 157


>GAV86447.1 RVT_3 domain-containing protein, partial [Cephalotus follicularis]
          Length = 677

 Score =  261 bits (666), Expect = 1e-76
 Identities = 170/462 (36%), Positives = 232/462 (50%), Gaps = 81/462 (17%)
 Frame = +2

Query: 11   DMDIDMEXXXXXXXXXXXXSKELSRRACGSHNKGDVAPVT-------------------E 133
            DM++DM             SK+ SR   G+  K  V  +T                   +
Sbjct: 211  DMEVDMVGGRFVGVVDEAISKDSSRGVWGTSEKEIVEEITPQKSVGKILAVKEDINSTLK 270

Query: 134  MVLFEPKHKRCASSDLFTKSSERIKFKISSEHPL--SKGKLVNAKVDADCPPGFKKIPRL 307
             +L E  H+ C  SD   KS ++IK K+S+E  L  S GKL+NA+ +   PPGF+K+   
Sbjct: 271  QLLLERIHEDCNDSDSVAKSIQKIKSKLSAERILQRSPGKLLNAREEMGIPPGFEKLS-- 328

Query: 308  NSRTI-------EEKAGVVDGGVINHVKKENIGLQGLSGAAFGASRPQIQRSSEDSILTK 466
            NS  +       E +  + DG  I+HV +EN+ LQ + G  FGA  P+ Q   E S L  
Sbjct: 329  NSEILPCKPTYTEGEDCLNDGRDISHVMRENMALQVVGGRDFGARSPRPQSFMEASNLKG 388

Query: 467  SQKHRQEPGGGEAYDPFAAFPSSLSEKGNQRGLKVKAEVNDDESLHRTWKKK-------- 622
            S KH  EP  G A D  AA+PS L +K     LK+K  VN  E      ++K        
Sbjct: 389  SVKHL-EPARGFANDHLAAYPSPLLDKDGISELKIKGGVNGVEKPVEVEEQKVLSGTPIC 447

Query: 623  --------------------------------------ELEVEAERTMNMSICWRPPPKN 688
                                                  EL  ++   + +SICW PPPKN
Sbjct: 448  NEGTSQSTAGSTNGGISERKQNGYLQSVKMTTEAHKFNELHTKSPNMVEVSICWTPPPKN 507

Query: 689  WVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYLGLKLVWE 868
            WVKLN +G S    G A A G IRDESGKWL+GY+ ++   + SA++ WALY GL+LVW+
Sbjct: 508  WVKLNTQGFSQGHPGSARAAGVIRDESGKWLMGYNLDVQCYENSAADFWALYQGLQLVWD 567

Query: 869  RGFRKVLVESVSHKAVKCLEQP-------TAFLDPNRDLIESCRDLLHLNWDCKLQRIHR 1027
            RG++KV+VE VS   V+ L++        +A  DP+R ++E C+DLL+  W CK+  IHR
Sbjct: 568  RGYKKVMVECVSSLVVEYLKEAHTPFDPNSAIDDPDRAIVECCKDLLNREWTCKVNHIHR 627

Query: 1028 EANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGI 1153
            +AN CA+WLAT CE+     L +F  PP  L+PI Q D AGI
Sbjct: 628  KANSCAHWLATQCED--CKGLTLFKNPPPELVPIMQNDCAGI 667


>EOY22061.1 Uncharacterized protein TCM_014255 [Theobroma cacao]
          Length = 956

 Score =  224 bits (570), Expect = 3e-61
 Identities = 162/459 (35%), Positives = 221/459 (48%), Gaps = 71/459 (15%)
 Frame = +2

Query: 2    EATDMDIDMEXXXXXXXXXXXXSKELSR---------------RACGSHNKGDVAPVTEM 136
            EA DMDIDM             SK  +R                   S    D+  V + 
Sbjct: 486  EAVDMDIDMVGGKIVGISDTAVSKASTRDFNEWAVKETLDAITNRKFSTVLEDLDSVIQP 545

Query: 137  VLFEPKHKRCASSDLFTKSSERIKFKISSE---HPLSKG--KLVNAKVDADCPPGFKKIP 301
            +  E + + C  SDL +K++E+IK  ++ E   HPL     +L ++  +   PPGF+ + 
Sbjct: 546  ISLEFEQECCNDSDLPSKNTEKIKSTLTLEPLEHPLESSLCRLPSSMGETRIPPGFEGLM 605

Query: 302  RLNSR-------TIEEKAGVVDGGVINHVKKENIGLQGLSGAAFGASRPQIQRSSEDSIL 460
            +LNS        + EEK  + +  +++ VK E IG QG+   AFG S PQ Q S ED   
Sbjct: 606  KLNSSNRAPKAPSAEEKNFLNERRIVSQVKDE-IGCQGILQIAFGVSSPQRQGSIEDYSS 664

Query: 461  TKSQKHRQEPGGGEAYDPFAAFPSS-LSEKGNQRGLKVKAEVNDDESLHRTWKKKEL--- 628
              SQK  Q      A DP AA P   LS KG + G  +K E ND+E      +KK L   
Sbjct: 665  RGSQKQHQVLACHLANDPSAACPPPCLSVKGTKVGSVIKIEGNDNEKPFEMREKKGLQST 724

Query: 629  ---------------------EVEAER-------------------TMNMSICWRPPPKN 688
                                 EV+AE                     M  S C RP    
Sbjct: 725  LFLSKDKSKMVAGSFNGPCAVEVKAEMQAVDDRVNNSHGLHARLPSNMETSQCLRPVAST 784

Query: 689  WVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYLGLKLVWE 868
             V L++ G S    G A AGG IRD+ GKW++GY+ N+G   + +++LWAL+ G+K  W+
Sbjct: 785  RVNLSVAGRSRGNTGAASAGGLIRDKLGKWIVGYNLNLGRCSSLSTDLWALFQGIKFTWD 844

Query: 869  RGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRIHREANLCAN 1048
            RG+RKVLVES    AV+CL +  + L+ N  LI+SCRDLL+  WDCK+  I RE NLCAN
Sbjct: 845  RGYRKVLVESDCVAAVECLRKAPSLLNSNIALIKSCRDLLNRKWDCKVHPIRREENLCAN 904

Query: 1049 WLATHCENQPLGALAVFDTPPCALIPIFQKDSAGIVQTH 1165
            WLATH E    G L++   PP  L P+ + D   + + H
Sbjct: 905  WLATHVEGCSPG-LSIIKEPPLELNPLLENDCVRVARAH 942


>XP_017972749.1 PREDICTED: uncharacterized protein LOC108661242 [Theobroma cacao]
          Length = 365

 Score =  207 bits (526), Expect = 3e-59
 Identities = 134/347 (38%), Positives = 178/347 (51%), Gaps = 51/347 (14%)
 Frame = +2

Query: 278  PPGFKKIPRLNSR-------TIEEKAGVVDGGVINHVKKENIGLQGLSGAAFGASRPQIQ 436
            PPGF+ + +LNS        + EEK  + +  +++ VK+E IG QG+   AFG S PQ Q
Sbjct: 7    PPGFEGLMKLNSSNRAPKAPSAEEKNFLNERRIVSQVKEE-IGCQGILQIAFGVSSPQRQ 65

Query: 437  RSSEDSILTKSQKHRQEPGGGEAYDPFAAFPSS-LSEKGNQRGLKVKAEVNDDESLHRTW 613
             S ED     SQK  Q      A DP AA P   LS KG + G  +K E ND+E      
Sbjct: 66   GSIEDYSSRGSQKQHQVLACHLANDPSAACPPPCLSVKGTKVGSVIKIEGNDNEKPFEMR 125

Query: 614  KKKEL------------------------EVEAER-------------------TMNMSI 664
            +KK L                        EV+AE                     M  S 
Sbjct: 126  EKKGLQSTLFLSKDKSKMVAGSFNGPCAVEVKAEMQAVDDRVNNSHGLHARLPSNMETSQ 185

Query: 665  CWRPPPKNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALY 844
            C RP     V L++ G S    G A AGG IRD+ GKW++GY+ N+G   + +++LWAL+
Sbjct: 186  CLRPVASTRVNLSVAGRSRGNTGAASAGGLIRDKLGKWIVGYNLNLGRCSSLSTDLWALF 245

Query: 845  LGLKLVWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRIH 1024
             G+K  W+RG+RKVLVES    AV+CL +  + L+ N  LI+SCRDLL+  WDCK+  I 
Sbjct: 246  QGIKFTWDRGYRKVLVESDCVAAVECLRKAPSLLNSNIALIKSCRDLLNRKWDCKVHPIR 305

Query: 1025 REANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGIVQTH 1165
            RE NLCANWLATH E    G L++   PP  L P+ + D   + + H
Sbjct: 306  REENLCANWLATHVEGCSPG-LSIIKEPPLELNPLLENDCVRVARAH 351


>KDP31341.1 hypothetical protein JCGZ_11717 [Jatropha curcas]
          Length = 581

 Score =  184 bits (466), Expect = 1e-48
 Identities = 119/328 (36%), Positives = 166/328 (50%), Gaps = 8/328 (2%)
 Frame = +2

Query: 212  KISSEHPLSKGKLVNA-KVDADCPPGFKKIPRLNSRTI-------EEKAGVVDGGVINHV 367
            ++ ++  +S  K++NA K +A+ PPGF+++ R   R +       EE+    DG V+   
Sbjct: 303  RVDADVNVSNSKILNAFKSEANIPPGFEELSRQKLRNMQCKAKFREEETHFNDGEVVRRA 362

Query: 368  KKENIGLQGLSGAAFGASRPQIQRSSEDSILTKSQKHRQEPGGGEAYDPFAAFPSSLSEK 547
            KK NI L                             HR              +P S    
Sbjct: 363  KKGNIDL-----------------------------HR--------------YPQSSP-- 377

Query: 548  GNQRGLKVKAEVNDDESLHRTWKKKELEVEAERTMNMSICWRPPPKNWVKLNIEGSSSRV 727
                 +++K EV + E   +    K L   +        CW PP    VKLNI  SS   
Sbjct: 378  -----MELKIEVQNFECDKQDHSHK-LSARSLGKTQRPACWIPPRTGSVKLNIAESSKTK 431

Query: 728  MGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYLGLKLVWERGFRKVLVESVSH 907
               AGA G IRDESGKW++GY+ N+G+     +ELWA+Y GL L W+RGF+KVLVES S 
Sbjct: 432  TEAAGASGVIRDESGKWVIGYTLNLGSYSCLGAELWAVYHGLMLAWDRGFKKVLVESQSL 491

Query: 908  KAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRIHREANLCANWLATHCENQPLGA 1087
            ++++ L++    LDPNR L   C +L+  +WDC+   IHREAN CANWLAT+ ++ PLG 
Sbjct: 492  ESLEYLKRVPDELDPNRALATCCMNLIKRDWDCQFLHIHREANGCANWLATNFQDHPLG- 550

Query: 1088 LAVFDTPPCALIPIFQKDSAGIVQTHSH 1171
            L VF  PP +LIPI Q DS     + +H
Sbjct: 551  LTVFYKPPVSLIPILQNDSMMAAPSSTH 578


>XP_012080373.1 PREDICTED: uncharacterized protein LOC105640619 [Jatropha curcas]
          Length = 778

 Score =  184 bits (466), Expect = 1e-47
 Identities = 119/328 (36%), Positives = 166/328 (50%), Gaps = 8/328 (2%)
 Frame = +2

Query: 212  KISSEHPLSKGKLVNA-KVDADCPPGFKKIPRLNSRTI-------EEKAGVVDGGVINHV 367
            ++ ++  +S  K++NA K +A+ PPGF+++ R   R +       EE+    DG V+   
Sbjct: 500  RVDADVNVSNSKILNAFKSEANIPPGFEELSRQKLRNMQCKAKFREEETHFNDGEVVRRA 559

Query: 368  KKENIGLQGLSGAAFGASRPQIQRSSEDSILTKSQKHRQEPGGGEAYDPFAAFPSSLSEK 547
            KK NI L                             HR              +P S    
Sbjct: 560  KKGNIDL-----------------------------HR--------------YPQSSP-- 574

Query: 548  GNQRGLKVKAEVNDDESLHRTWKKKELEVEAERTMNMSICWRPPPKNWVKLNIEGSSSRV 727
                 +++K EV + E   +    K L   +        CW PP    VKLNI  SS   
Sbjct: 575  -----MELKIEVQNFECDKQDHSHK-LSARSLGKTQRPACWIPPRTGSVKLNIAESSKTK 628

Query: 728  MGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYLGLKLVWERGFRKVLVESVSH 907
               AGA G IRDESGKW++GY+ N+G+     +ELWA+Y GL L W+RGF+KVLVES S 
Sbjct: 629  TEAAGASGVIRDESGKWVIGYTLNLGSYSCLGAELWAVYHGLMLAWDRGFKKVLVESQSL 688

Query: 908  KAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRIHREANLCANWLATHCENQPLGA 1087
            ++++ L++    LDPNR L   C +L+  +WDC+   IHREAN CANWLAT+ ++ PLG 
Sbjct: 689  ESLEYLKRVPDELDPNRALATCCMNLIKRDWDCQFLHIHREANGCANWLATNFQDHPLG- 747

Query: 1088 LAVFDTPPCALIPIFQKDSAGIVQTHSH 1171
            L VF  PP +LIPI Q DS     + +H
Sbjct: 748  LTVFYKPPVSLIPILQNDSMMAAPSSTH 775


>OMP08004.1 Zinc finger, FYVE/PHD-type [Corchorus olitorius]
          Length = 899

 Score =  172 bits (437), Expect = 2e-43
 Identities = 99/232 (42%), Positives = 137/232 (59%), Gaps = 20/232 (8%)
 Frame = +2

Query: 506  YDPFAAFPSSLSEKGNQRGLKVKAEVNDDESLHRTWKKKELEVEA----ERTMNMSICWR 673
            Y+P AA  S L +KG Q GL+V+   +DDE      +KK L+  +    +++  +S  + 
Sbjct: 661  YNPSAACRSPLVDKGTQIGLEVQG--HDDEKAIAAGEKKGLQGSSIPSEDKSKRVSGSFN 718

Query: 674  PP---------------PKNWVKLNIEGSS-SRVMGCAGAGGTIRDESGKWLLGYSKNIG 805
             P                K  VKL+  G S  R+ G    GG IRDESGKW++GY+  IG
Sbjct: 719  APCAGVFKAKVNVENDIVKTRVKLSFAGRSRGRIPGATTGGGLIRDESGKWIIGYNLKIG 778

Query: 806  TSQTSASELWALYLGLKLVWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDL 985
               +  ++LWALY GLKL W++G RKV+VES S   V+CL++P   LD NR LI+SCRDL
Sbjct: 779  VCSSLTTDLWALYEGLKLSWDKGCRKVVVESDSVAVVECLKKPLCLLDSNRALIQSCRDL 838

Query: 986  LHLNWDCKLQRIHREANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKD 1141
            L+ NWDCKLQ + R+ N CA+WLA + E    G L++   PP  LIP+ ++D
Sbjct: 839  LNNNWDCKLQIVRRDENSCADWLAANVEVNQQG-LSIIKVPPFKLIPLLERD 889


>KCW83711.1 hypothetical protein EUGRSUZ_B00585 [Eucalyptus grandis]
          Length = 322

 Score =  147 bits (370), Expect = 5e-37
 Identities = 72/157 (45%), Positives = 96/157 (61%)
 Frame = +2

Query: 683  KNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYLGLKLV 862
            +NWVKLN  G +    G AG GGT+   SGKW+LGYS  +G   + A++LWA++ GL LV
Sbjct: 163  RNWVKLNTSGITRGSTGSAGVGGTVHTSSGKWVLGYSFQLGKWSSLAADLWAIFQGLNLV 222

Query: 863  WERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRIHREANLC 1042
            W+R +RKVLV S S+ A+  L+      DPN+DLI+ CR+L+  NW+C +  I RE N  
Sbjct: 223  WDRNYRKVLVLSDSYPALNSLKTAPQASDPNKDLIQCCRELILRNWECNVSHISREKNAA 282

Query: 1043 ANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGI 1153
            A WLA H +  P+G L   D PP  L  I +    GI
Sbjct: 283  AAWLADHSQLYPMG-LTELDEPPQELAHILKNQDNGI 318


>XP_010522729.1 PREDICTED: uncharacterized protein LOC104801226 isoform X2 [Tarenaya
            hassleriana]
          Length = 609

 Score =  149 bits (377), Expect = 6e-36
 Identities = 69/165 (41%), Positives = 99/165 (60%)
 Frame = +2

Query: 662  ICWRPPPKNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWAL 841
            + W PPP  WVKLN++ +        G+ G  RDESG+W+ GY  N+      +++LWA+
Sbjct: 443  LSWMPPPSKWVKLNVQRTLKPESDLTGSAGIARDESGRWVCGYVVNLKNLSELSADLWAV 502

Query: 842  YLGLKLVWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRI 1021
            Y GLKL+W+RGFRKV+VE+ S  A++ L   +     +  +++ C+D+L  NW+C++  I
Sbjct: 503  YQGLKLLWDRGFRKVIVETTSLNALEALNANSLPFQQSNSILQRCKDMLLKNWECRICAI 562

Query: 1022 HREANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGIV 1156
              E N CA WLA   E QP G L VFD+PP  LI +  KD    V
Sbjct: 563  SEEQNSCAVWLANKAEEQPTG-LVVFDSPPGRLITLLGKDCTAAV 606


>XP_010522710.1 PREDICTED: uncharacterized protein LOC104801226 isoform X1 [Tarenaya
            hassleriana] XP_010522718.1 PREDICTED: uncharacterized
            protein LOC104801226 isoform X1 [Tarenaya hassleriana]
          Length = 614

 Score =  149 bits (377), Expect = 6e-36
 Identities = 69/165 (41%), Positives = 99/165 (60%)
 Frame = +2

Query: 662  ICWRPPPKNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWAL 841
            + W PPP  WVKLN++ +        G+ G  RDESG+W+ GY  N+      +++LWA+
Sbjct: 448  LSWMPPPSKWVKLNVQRTLKPESDLTGSAGIARDESGRWVCGYVVNLKNLSELSADLWAV 507

Query: 842  YLGLKLVWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRI 1021
            Y GLKL+W+RGFRKV+VE+ S  A++ L   +     +  +++ C+D+L  NW+C++  I
Sbjct: 508  YQGLKLLWDRGFRKVIVETTSLNALEALNANSLPFQQSNSILQRCKDMLLKNWECRICAI 567

Query: 1022 HREANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGIV 1156
              E N CA WLA   E QP G L VFD+PP  LI +  KD    V
Sbjct: 568  SEEQNSCAVWLANKAEEQPTG-LVVFDSPPGRLITLLGKDCTAAV 611


>OMO53161.1 Histone deacetylase superfamily [Corchorus capsularis]
          Length = 1662

 Score =  148 bits (374), Expect = 6e-35
 Identities = 73/130 (56%), Positives = 91/130 (70%), Gaps = 1/130 (0%)
 Frame = +2

Query: 683  KNWVKLNIEG-SSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYLGLKL 859
            K  VKL+  G S  R+ G    GG IRDESGKW++GY+  IG   +  + LWALY GLKL
Sbjct: 850  KTRVKLSFAGRSQGRIPGATTGGGLIRDESGKWIIGYNLKIGVCSSLTTYLWALYEGLKL 909

Query: 860  VWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRIHREANL 1039
             W++G RKV+VES S  AV+CL++P   LD NR LI+SCRDLL+ NWDCKLQ I R+ N 
Sbjct: 910  SWDKGCRKVVVESDSVAAVECLKKPLCLLDSNRALIQSCRDLLNNNWDCKLQIIRRDENS 969

Query: 1040 CANWLATHCE 1069
            CA+WLA + E
Sbjct: 970  CADWLAANVE 979


>EOY16798.1 Uncharacterized protein TCM_035679 [Theobroma cacao]
          Length = 203

 Score =  137 bits (346), Expect = 7e-35
 Identities = 70/141 (49%), Positives = 87/141 (61%)
 Frame = +2

Query: 662  ICWRPPPKNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWAL 841
            I W  P   +VKLN++GS+      A  GG I DE G WLLG++  IG S +   ELWAL
Sbjct: 44   ISWELPKHLYVKLNVDGSARGQPEMATVGGVITDEVGNWLLGFNYKIGISCSLQVELWAL 103

Query: 842  YLGLKLVWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRI 1021
            Y GL L W++GFRKV VES S  AV+ +   +     N  L++  R+L   +WDC L  I
Sbjct: 104  YWGLTLCWDKGFRKVQVESDSLLAVQKISNQSLQPKQNAGLLKCIRELFQRSWDCTLTHI 163

Query: 1022 HREANLCANWLATHCENQPLG 1084
            HREAN CANW+ATH EN PLG
Sbjct: 164  HREANQCANWMATHHENLPLG 184


>EOY04279.1 Non-LTR retroelement reverse transcriptase-like [Theobroma cacao]
          Length = 438

 Score =  143 bits (361), Expect = 1e-34
 Identities = 75/164 (45%), Positives = 101/164 (61%)
 Frame = +2

Query: 662  ICWRPPPKNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWAL 841
            I W  P  ++VKLN++GS+    G A AGG IR E G WLLG++  IG S +  +ELWAL
Sbjct: 270  ISWELPKHSYVKLNVDGSAKGQPGMAAAGGVIRYEVGNWLLGFNYKIGISCSLQAELWAL 329

Query: 842  YLGLKLVWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRI 1021
            Y GL L W++GFRKV VES S  AV+ +   +   + N  L++  ++L    W+C L  I
Sbjct: 330  YWGLTLCWDKGFRKVQVESDSLLAVQKISNQSLQPEQNAGLLKCIKELFQRFWNCTLTHI 389

Query: 1022 HREANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGI 1153
            HREAN CA+W+ATH EN  L  L + D+PP ++  I   DS  I
Sbjct: 390  HREANQCADWMATHHENLLL-KLHIMDSPPSSISAILLADSISI 432


>XP_018723236.1 PREDICTED: uncharacterized protein LOC104434358 isoform X3
            [Eucalyptus grandis]
          Length = 837

 Score =  147 bits (370), Expect = 1e-34
 Identities = 72/157 (45%), Positives = 96/157 (61%)
 Frame = +2

Query: 683  KNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYLGLKLV 862
            +NWVKLN  G +    G AG GGT+   SGKW+LGYS  +G   + A++LWA++ GL LV
Sbjct: 678  RNWVKLNTSGITRGSTGSAGVGGTVHTSSGKWVLGYSFQLGKWSSLAADLWAIFQGLNLV 737

Query: 863  WERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRIHREANLC 1042
            W+R +RKVLV S S+ A+  L+      DPN+DLI+ CR+L+  NW+C +  I RE N  
Sbjct: 738  WDRNYRKVLVLSDSYPALNSLKTAPQASDPNKDLIQCCRELILRNWECNVSHISREKNAA 797

Query: 1043 ANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGI 1153
            A WLA H +  P+G L   D PP  L  I +    GI
Sbjct: 798  AAWLADHSQLYPMG-LTELDEPPQELAHILKNQDNGI 833


>XP_018723235.1 PREDICTED: uncharacterized protein LOC104434358 isoform X2
            [Eucalyptus grandis]
          Length = 837

 Score =  147 bits (370), Expect = 1e-34
 Identities = 72/157 (45%), Positives = 96/157 (61%)
 Frame = +2

Query: 683  KNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYLGLKLV 862
            +NWVKLN  G +    G AG GGT+   SGKW+LGYS  +G   + A++LWA++ GL LV
Sbjct: 678  RNWVKLNTSGITRGSTGSAGVGGTVHTSSGKWVLGYSFQLGKWSSLAADLWAIFQGLNLV 737

Query: 863  WERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRIHREANLC 1042
            W+R +RKVLV S S+ A+  L+      DPN+DLI+ CR+L+  NW+C +  I RE N  
Sbjct: 738  WDRNYRKVLVLSDSYPALNSLKTAPQASDPNKDLIQCCRELILRNWECNVSHISREKNAA 797

Query: 1043 ANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGI 1153
            A WLA H +  P+G L   D PP  L  I +    GI
Sbjct: 798  AAWLADHSQLYPMG-LTELDEPPQELAHILKNQDNGI 833


>XP_018723234.1 PREDICTED: uncharacterized protein LOC104434358 isoform X1
            [Eucalyptus grandis]
          Length = 844

 Score =  147 bits (370), Expect = 1e-34
 Identities = 72/157 (45%), Positives = 96/157 (61%)
 Frame = +2

Query: 683  KNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYLGLKLV 862
            +NWVKLN  G +    G AG GGT+   SGKW+LGYS  +G   + A++LWA++ GL LV
Sbjct: 685  RNWVKLNTSGITRGSTGSAGVGGTVHTSSGKWVLGYSFQLGKWSSLAADLWAIFQGLNLV 744

Query: 863  WERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRIHREANLC 1042
            W+R +RKVLV S S+ A+  L+      DPN+DLI+ CR+L+  NW+C +  I RE N  
Sbjct: 745  WDRNYRKVLVLSDSYPALNSLKTAPQASDPNKDLIQCCRELILRNWECNVSHISREKNAA 804

Query: 1043 ANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGI 1153
            A WLA H +  P+G L   D PP  L  I +    GI
Sbjct: 805  AAWLADHSQLYPMG-LTELDEPPQELAHILKNQDNGI 840


>OAY52390.1 hypothetical protein MANES_04G079500 [Manihot esculenta]
          Length = 209

 Score =  134 bits (336), Expect = 2e-33
 Identities = 64/162 (39%), Positives = 96/162 (59%)
 Frame = +2

Query: 668  WRPPPKNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYL 847
            W+PPP+ W+KLN++GS     G A AGG +RD S  W++G+  NIG +    +E+  + +
Sbjct: 43   WQPPPQGWMKLNVDGSCLGNPGPASAGGLLRDSSSNWVIGFGLNIGETSILNAEIIGILV 102

Query: 848  GLKLVWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRIHR 1027
            GL+LVW  GFR+V+VES S KAV+ + +      P  D I+ CR LL+L+W C L  ++R
Sbjct: 103  GLQLVWSMGFRRVIVESDSLKAVRLITEEDISFHPLGDYIQDCRTLLNLDWQCTLSHVYR 162

Query: 1028 EANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGI 1153
            + N  A+ LA    +     L ++ +PP  +I    KD  GI
Sbjct: 163  QRNYSADSLAKQSHDLKPEELKIWYSPPWKVIHFLNKDRLGI 204


>EOY19161.1 Polynucleotidyl transferase, putative [Theobroma cacao]
          Length = 419

 Score =  139 bits (350), Expect = 3e-33
 Identities = 66/164 (40%), Positives = 96/164 (58%)
 Frame = +2

Query: 662  ICWRPPPKNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWAL 841
            I W  P   +VKLN++GS+    G A +GG IRDE G W+ G+ + IG + +  +E W +
Sbjct: 251  IAWEKPKNGYVKLNVDGSAKGQPGLAASGGVIRDEYGNWIAGFCQKIGITFSLTAEPWGI 310

Query: 842  YLGLKLVWERGFRKVLVESVSHKAVKCLEQPTAFLDPNRDLIESCRDLLHLNWDCKLQRI 1021
            Y GL L W RG RK  VE  S  A++ +   ++ LDPN  L+   ++LL  +WD  +  +
Sbjct: 311  YQGLTLCWNRGLRKFCVEIDSMLALQKIYSQSSMLDPNAQLLRRIKELLQQSWDVTISHV 370

Query: 1022 HREANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGI 1153
            HREA+ C +W+ TH EN  LG L +F+ PP  ++     DS GI
Sbjct: 371  HREADQCTDWMTTHIENLKLG-LHIFEYPPHDIVYYLFTDSLGI 413


>OAY52615.1 hypothetical protein MANES_04G097300 [Manihot esculenta]
          Length = 223

 Score =  132 bits (332), Expect = 1e-32
 Identities = 69/164 (42%), Positives = 92/164 (56%), Gaps = 2/164 (1%)
 Frame = +2

Query: 668  WRPPPKNWVKLNIEGSSSRVMGCAGAGGTIRDESGKWLLGYSKNIGTSQTSASELWALYL 847
            W+PPP+ WVKLN++GS     G A AGG +RD S  WL G+  NIG S    +E+  +  
Sbjct: 56   WQPPPQGWVKLNVDGSYLGNSGLASAGGLLRDASSNWLCGFGFNIGESSILHAEIIGISY 115

Query: 848  GLKLVWERGFRKVLVESVSHKAVKCL--EQPTAFLDPNRDLIESCRDLLHLNWDCKLQRI 1021
            GL+  W  GFR+V+ ES S  A+K +  EQ  +F  P   LIE CR LL L+W C L  I
Sbjct: 116  GLQFAWNMGFRRVIAESDSLTAIKLITQEQNISF-HPLAHLIEDCRKLLSLDWYCSLFHI 174

Query: 1022 HREANLCANWLATHCENQPLGALAVFDTPPCALIPIFQKDSAGI 1153
            +RE N  A+ LA            ++D+PP  ++P   KD  GI
Sbjct: 175  YRERNFSADALAKKSHGLEFEEFTIWDSPPWKVVPFLNKDKLGI 218

