BLASTX nr result

ID: Rehmannia31_contig00012245 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00012245
         (1604 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_020552583.1| uncharacterized protein LOC105173753 [Sesamu...   610   0.0  
ref|XP_012843862.1| PREDICTED: uncharacterized protein LOC105963...   485   e-163
gb|EYU32258.1| hypothetical protein MIMGU_mgv1a024121mg, partial...   478   e-161
emb|CDP08668.1| unnamed protein product [Coffea canephora]            377   e-121
ref|NP_001311632.1| uncharacterized LOC107760831 [Nicotiana taba...   370   e-118
ref|XP_009757840.1| PREDICTED: uncharacterized protein LOC104210...   370   e-118
ref|XP_009621902.1| PREDICTED: uncharacterized protein LOC104113...   372   e-118
ref|XP_019158388.1| PREDICTED: uncharacterized protein LOC109155...   363   e-116
ref|XP_016461388.1| PREDICTED: uncharacterized protein LOC107784...   368   e-115
ref|XP_016461387.1| PREDICTED: uncharacterized protein LOC107784...   368   e-115
ref|XP_019265620.1| PREDICTED: uncharacterized protein LOC109242...   357   e-112
ref|XP_004306659.1| PREDICTED: uncharacterized protein LOC101309...   310   2e-95
ref|XP_021613680.1| uncharacterized protein LOC110615833 [Maniho...   304   3e-93
ref|XP_017232029.1| PREDICTED: uncharacterized protein LOC108206...   297   8e-91
ref|XP_012083199.1| uncharacterized protein LOC105642839 [Jatrop...   251   8e-73
ref|XP_021280140.1| uncharacterized protein LOC110413599 [Herran...   237   3e-68
gb|EOX95687.1| Zinc knuckle family protein, putative isoform 2 [...   235   1e-67
ref|XP_007051530.2| PREDICTED: uncharacterized protein LOC186139...   233   6e-67
gb|KDP28479.1| hypothetical protein JCGZ_14250 [Jatropha curcas]      233   2e-66
gb|EOX95686.1| Zinc knuckle family protein, putative isoform 1 [...   235   3e-66

>ref|XP_020552583.1| uncharacterized protein LOC105173753 [Sesamum indicum]
          Length = 714

 Score =  610 bits (1574), Expect = 0.0
 Identities = 337/619 (54%), Positives = 411/619 (66%), Gaps = 91/619 (14%)
 Frame = +2

Query: 20   DEDLSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRS 199
            +E L+ASKRRKIEDTVLQILK +DL+T TEF VRAAA +RLG GLS L HRRLV+ L+ S
Sbjct: 3    EECLNASKRRKIEDTVLQILKTSDLETTTEFDVRAAAAERLGFGLSGLRHRRLVRQLVDS 62

Query: 200  FLLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNGKIICKLS 379
            FLLS+A +ILGTTS H                ++ +Q+QL  V +GL+ +YNGK+ICKL+
Sbjct: 63   FLLSTAGSILGTTSLHSNIGSNNDDERAE---RREKQQQLRGVGVGLQGHYNGKVICKLT 119

Query: 380  DKRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIE 559
            DKRMVTIHD  G TMV+IRDF +KDGNMLP++GVN+G+SL+ TQWSSF+ SFPSIQEAI 
Sbjct: 120  DKRMVTIHDLNGTTMVSIRDFYVKDGNMLPRKGVNSGVSLSPTQWSSFKTSFPSIQEAIV 179

Query: 560  NMESRLGRKHAVHQLNNLNRKPEAFAKQSDADMTN------------------------- 664
             +ESRL RK A H LNNLN   EA AKQ+++DMTN                         
Sbjct: 180  KLESRLRRKDAAHSLNNLNSTLEAVAKQNESDMTNLLTESAADKSQTEAGICNSSFVSEA 239

Query: 665  ----SVIDSAVEKSQTEADISNLTSILHHPTKSEQTESEAE--ITNPATD---------- 796
                +V DSA  KSQT+A ISNLT+I+H P + +QTE++    +T P+            
Sbjct: 240  EMAVAVADSAAVKSQTQAGISNLTTIVHSPVE-KQTEADTSWSVTAPSLQENILAERKQG 298

Query: 797  -------SVAEKNHSQAVISNLTSTSDSREQIPSERNQTDAVISDLVPPFVA-------F 934
                   +++     +A  S   + S S+EQIP+ERNQT+A +S  V  F          
Sbjct: 299  ADTSGSIAISTSQEQRADTSGSIAISTSQEQIPAERNQTEADVSTSVQAFPTGGRSHDRV 358

Query: 935  NAVGPERSIPDERKQT------------------------------------EAGTSTSA 1006
            +AV PE+ +P ERKQ                                     EA  STS 
Sbjct: 359  SAVCPEQLVPAERKQAGAHISTTTPIIPTEGQLYDTVSAVHPDRLIAAERKQEADVSTSL 418

Query: 1007 PAFPTQEQLHHTVNTVRPKQLVPIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPS 1186
            PAFP Q   HHTVN V  +++ PIQ  RLDGRNY+ W+HQMEFFL+ L+I Y L++ CPS
Sbjct: 419  PAFPNQGHSHHTVNAVHFERVNPIQTTRLDGRNYNLWRHQMEFFLDLLDIGYVLAKPCPS 478

Query: 1187 LSSNPEASFDEQVKVKAAIQRWIDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELK 1366
            +S + E S DE+VK KAA+QRWIDDDY+CRHNILNSLCDNLFQ YS K+ SARELWEELK
Sbjct: 479  ISLDQETSLDEKVKEKAAVQRWIDDDYICRHNILNSLCDNLFQLYSQKSCSARELWEELK 538

Query: 1367 LAYDEDFGTKRSQINKYIHFQMVDGVSILEQIQELHKIADSIIASGTWIDQNFHVSVIVS 1546
            L YDED GT RSQINKYIHFQMVDGVSI+EQ+QELH+IA+SI+ASGTWID+NFHVS IVS
Sbjct: 539  LVYDEDLGTTRSQINKYIHFQMVDGVSIIEQVQELHRIANSIMASGTWIDENFHVSTIVS 598

Query: 1547 KLPPSWKEFRARLMREEFL 1603
            KLPPSWKEFR RLM EEF+
Sbjct: 599  KLPPSWKEFRVRLMHEEFI 617


>ref|XP_012843862.1| PREDICTED: uncharacterized protein LOC105963917 [Erythranthe guttata]
          Length = 546

 Score =  485 bits (1248), Expect = e-163
 Identities = 286/532 (53%), Positives = 347/532 (65%), Gaps = 3/532 (0%)
 Frame = +2

Query: 17   MDED-LSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLI 193
            MDE+ L+ SKRRKIEDTV QIL+++DL+T TE SVRAAA +RLG GLS  T RRLV+ L+
Sbjct: 1    MDEECLNESKRRKIEDTVFQILRSSDLETTTELSVRAAAAERLGFGLSHSTQRRLVRQLV 60

Query: 194  RSFLLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNGKIICK 373
             SFLLS+AAAIL  +S H               + + +Q Q +   +  E NY+GK ICK
Sbjct: 61   DSFLLSTAAAILCPSSLHTNSAVTNNNDGNA--LNRGKQHQRSGSGVDSEGNYDGKAICK 118

Query: 374  LSDKRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEA 553
            LSDKR VT+ D  G TMV+IRDF +KDGNM+P++G    + LTA QWS+FRN+FPSI+EA
Sbjct: 119  LSDKRRVTVRDVNGTTMVSIRDFIIKDGNMVPQKG----MCLTAEQWSTFRNNFPSIEEA 174

Query: 554  IENMESRLGRKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQTEADISNLTSIL 733
            I  MES+L RK+AVH  +NLNR  EA A QS                             
Sbjct: 175  IVKMESQLRRKNAVHPSDNLNRLSEAVALQS----------------------------- 205

Query: 734  HHPTKSEQTESEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSERNQTDAVISDL 913
                       EAE  N A DS  +++ ++  ISN   T  S    P ERNQ+++     
Sbjct: 206  -----------EAERINSAGDSALDRSQTRDGISNSKDTFHS----PIERNQSESEA--- 247

Query: 914  VPPFVAFNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRPKQLVPIQIARL 1093
                              E+KQT+AG ST       Q Q H +VN +   QLVPIQ ARL
Sbjct: 248  ------------------EKKQTQAGIST-------QGQSHCSVNAIHSGQLVPIQTARL 282

Query: 1094 DGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRWIDDDYMC 1273
            DGRNYHSW+HQMEFFL+QL IAY LSE CPS        FDE+VKVK A  +W DDDY+C
Sbjct: 283  DGRNYHSWRHQMEFFLHQLKIAYVLSEPCPS--------FDEKVKVKDAHSKWKDDDYLC 334

Query: 1274 RHNILNSLCDNLFQFYSPKNYSARELWEELKLAYDEDFG-TKRSQINKYIHFQMVDGVSI 1450
            RH+IL+SLCDNLFQ +S K+ SARELWEELKL Y EDFG TKRSQINKYIHF+M DGVSI
Sbjct: 335  RHSILSSLCDNLFQLHSQKSCSARELWEELKLFY-EDFGTTKRSQINKYIHFEMADGVSI 393

Query: 1451 LEQIQELHKIADSIIASG-TWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            L+Q++ELHK+ADSIIASG +WID++FHVSVIVSKLPPSWKE R RLM+EE+L
Sbjct: 394  LQQVEELHKMADSIIASGNSWIDEDFHVSVIVSKLPPSWKELRVRLMQEEYL 445


>gb|EYU32258.1| hypothetical protein MIMGU_mgv1a024121mg, partial [Erythranthe
            guttata]
          Length = 548

 Score =  478 bits (1230), Expect = e-161
 Identities = 286/541 (52%), Positives = 347/541 (64%), Gaps = 12/541 (2%)
 Frame = +2

Query: 17   MDED-LSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLI 193
            MDE+ L+ SKRRKIEDTV QIL+++DL+T TE SVRAAA +RLG GLS  T RRLV+ L+
Sbjct: 1    MDEECLNESKRRKIEDTVFQILRSSDLETTTELSVRAAAAERLGFGLSHSTQRRLVRQLV 60

Query: 194  RSFLLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNGKIICK 373
             SFLLS+AAAIL  +S H               + + +Q Q +   +  E NY+GK ICK
Sbjct: 61   DSFLLSTAAAILCPSSLHTNSAVTNNNDGNA--LNRGKQHQRSGSGVDSEGNYDGKAICK 118

Query: 374  LSDKRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEA 553
            LSDKR VT+ D  G TMV+IRDF +KDGNM+P++G    + LTA QWS+FRN+FPSI+EA
Sbjct: 119  LSDKRRVTVRDVNGTTMVSIRDFIIKDGNMVPQKG----MCLTAEQWSTFRNNFPSIEEA 174

Query: 554  IENMESRLG---------RKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQTEA 706
            I  MES+L          RK+AVH  +NLNR  EA A QS                    
Sbjct: 175  IVKMESQLSSSLFYSYVRRKNAVHPSDNLNRLSEAVALQS-------------------- 214

Query: 707  DISNLTSILHHPTKSEQTESEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSERN 886
                                EAE  N A DS  +++ ++  ISN   T  S    P ERN
Sbjct: 215  --------------------EAERINSAGDSALDRSQTRDGISNSKDTFHS----PIERN 250

Query: 887  QTDAVISDLVPPFVAFNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRPKQ 1066
            Q+++                       E+KQT+AG ST       Q Q H +VN +   Q
Sbjct: 251  QSESEA---------------------EKKQTQAGIST-------QGQSHCSVNAIHSGQ 282

Query: 1067 LVPIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQ 1246
            LVPIQ ARLDGRNYHSW+HQMEFFL+QL IAY LSE CPS        FDE+VKVK A  
Sbjct: 283  LVPIQTARLDGRNYHSWRHQMEFFLHQLKIAYVLSEPCPS--------FDEKVKVKDAHS 334

Query: 1247 RWIDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAYDEDFG-TKRSQINKYIH 1423
            +W DDDY+CRH+IL+SLCDNLFQ +S K+ SARELWEELKL Y EDFG TKRSQINKYIH
Sbjct: 335  KWKDDDYLCRHSILSSLCDNLFQLHSQKSCSARELWEELKLFY-EDFGTTKRSQINKYIH 393

Query: 1424 FQMVDGVSILEQIQELHKIADSIIASG-TWIDQNFHVSVIVSKLPPSWKEFRARLMREEF 1600
            F+M DGVSIL+Q++ELHK+ADSIIASG +WID++FHVSVIVSKLPPSWKE R RLM+EE+
Sbjct: 394  FEMADGVSILQQVEELHKMADSIIASGNSWIDEDFHVSVIVSKLPPSWKELRVRLMQEEY 453

Query: 1601 L 1603
            L
Sbjct: 454  L 454


>emb|CDP08668.1| unnamed protein product [Coffea canephora]
          Length = 593

 Score =  377 bits (969), Expect = e-121
 Identities = 233/533 (43%), Positives = 313/533 (58%), Gaps = 6/533 (1%)
 Frame = +2

Query: 23   EDLSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRSF 202
            E +  SKRRKIE+TV+ ILK  DL+TATE+SVR+AA  +L   LSDL H+ LV+  + SF
Sbjct: 4    EIIPTSKRRKIEETVVNILKNADLETATEYSVRSAAAHQLSTDLSDLAHKCLVRQALESF 63

Query: 203  LLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNGKIICKLSD 382
            LLS+A  +L   +S+               +  PQ+ +       L +   G++ICKLSD
Sbjct: 64   LLSTATTMLDDVNSN-----------DVRKVSVPQKNKDDQE---LPSCSTGRVICKLSD 109

Query: 383  KRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIEN 562
            KR V +HD  G  +V+IRD+  KDG  L       G+SLT  QWS FR+SFP+I+EAI  
Sbjct: 110  KRSVAVHDFRGKCLVSIRDYLEKDGKQLFS---GKGISLTGRQWSLFRSSFPAIEEAIAK 166

Query: 563  MESRLGRKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAV----EKSQTEADISNLTSI 730
            M S+   + AV +            KQS  D+    I S      +K++ E DISN    
Sbjct: 167  MTSQT--RLAVGE------------KQSAVDLLVGDITSQDIFPDDKNKMETDISNCADA 212

Query: 731  LHHPTKSEQTESEAEITNP--ATDSVAEKNHSQAVISNLTSTSDSREQIPSERNQTDAVI 904
            +    +  +  + A  TN   A  +  +   ++ V  N     D + Q   E       +
Sbjct: 213  VDPQREVGERSTVALGTNNWMAIPNGRQSLQTELVQVNSFGVMDHQSQGDGEWKHDGLDV 272

Query: 905  SDLVPPFVAFNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRPKQLVPIQI 1084
            +  V    +      +R  P   +   A TS  AP     +   H+V +  P+ LVPI  
Sbjct: 273  NHSVATPSSQGQTLNQRYHP---RVDSAATSAFAPGGHMPQ---HSVASF-PQSLVPIMT 325

Query: 1085 ARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRWIDDD 1264
             RLDG+NYH W HQMEFFL QL +A+ L + CPS+S+    SF+E+ + KAA+Q+W+DD+
Sbjct: 326  TRLDGKNYHCWAHQMEFFLKQLKVAHVLKDPCPSISAE-SMSFEEKYQAKAAVQKWVDDE 384

Query: 1265 YMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAYDEDFGTKRSQINKYIHFQMVDGV 1444
            Y+CRH ILNSL DNLF  YS K  SA+ELWEEL+  Y+EDFGT RSQ+NKYI FQMVDGV
Sbjct: 385  YICRHYILNSLSDNLFNQYSKKRCSAKELWEELESVYNEDFGTIRSQVNKYIQFQMVDGV 444

Query: 1445 SILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            S+LEQ  EL +I  +I+ASG W+D+NFHVSVI+SKLPPSWKE RA+ M+EEFL
Sbjct: 445  SVLEQTHELQRILATIMASGIWMDENFHVSVIISKLPPSWKECRAKWMQEEFL 497


>ref|NP_001311632.1| uncharacterized LOC107760831 [Nicotiana tabacum]
 emb|CAD10638.1| PBF68 protein [Nicotiana tabacum]
          Length = 594

 Score =  370 bits (950), Expect = e-118
 Identities = 224/539 (41%), Positives = 300/539 (55%), Gaps = 13/539 (2%)
 Frame = +2

Query: 17   MDEDLSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIR 196
            M+E L   KRRKI + VL ILK  D++TATE+SVR    Q+LG  + ++  ++ ++H+I 
Sbjct: 1    MEEQLPEHKRRKIREVVLDILKTADIETATEYSVRTTVAQQLGTEILNIQEKQFIRHVIE 60

Query: 197  SFLLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQ-------PQQRQLTDVRLG----LE 343
            SFLLS+      T  ++R               ++       P Q Q  D  L     ++
Sbjct: 61   SFLLSTVEN--PTLDNNRRISTAEKGVNTDFVAEEQLSADHPPTQHQEADGSLPNGNLVD 118

Query: 344  ANYNG-KIICKLSDKRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSS 520
            +N N  + ICKLSDKR V I D +G   VAIRDF  KDG ++P    + G++L+  QWSS
Sbjct: 119  SNENNCRTICKLSDKRSVGILDIHGKPFVAIRDFYEKDGKLVPS---SRGINLSVQQWSS 175

Query: 521  FRNSFPSIQEAIENMESRLGRKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQT 700
            FR+SFP+I EAI  ME ++              +      Q+ AD+      +A  + Q 
Sbjct: 176  FRSSFPAIVEAIATMELKI--------------RSTTCENQTAADV------AAQGREQI 215

Query: 701  EADISNLTSILHHPTKSEQTESEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSE 880
            + +IS   S+ H   K                  A++N +   +SN    ++S+ Q+P E
Sbjct: 216  QTNISQ--SVNHQEGKLS----------------ADRNENGDDVSNSAIITNSQVQMPIE 257

Query: 881  RNQTDAVISDLVPPFVAFNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVR- 1057
            R                              +QTEAG S SAP F  Q Q+  +  T   
Sbjct: 258  R------------------------------QQTEAGISNSAPCFAPQGQIQQSSRTTSL 287

Query: 1058 PKQLVPIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKA 1237
               LVP++  RLDG+NY+ WKHQ EFFL QLNIAY LSE CP+   N             
Sbjct: 288  AHSLVPVKTIRLDGKNYYCWKHQAEFFLKQLNIAYVLSEPCPNTLENR------------ 335

Query: 1238 AIQRWIDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAYDEDFGTKRSQINKY 1417
              Q+W+DDDY+C HNILNSL D LF+ YS KNYSA+ELWEEL+  YDEDFGTK S++NKY
Sbjct: 336  --QKWVDDDYLCCHNILNSLSDKLFEEYSKKNYSAKELWEELRSTYDEDFGTKSSEVNKY 393

Query: 1418 IHFQMVDGVSILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMRE 1594
            + F MVDG+SILEQ+QELHKIADS++ASG WID+NFH+S I++KLPPSWK+ R RLM E
Sbjct: 394  LQFLMVDGISILEQVQELHKIADSLMASGIWIDENFHISAIIAKLPPSWKDCRTRLMHE 452


>ref|XP_009757840.1| PREDICTED: uncharacterized protein LOC104210598 [Nicotiana
            sylvestris]
          Length = 594

 Score =  370 bits (950), Expect = e-118
 Identities = 224/539 (41%), Positives = 300/539 (55%), Gaps = 13/539 (2%)
 Frame = +2

Query: 17   MDEDLSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIR 196
            M+E L   KRRKI + VL ILK  D++TATE+SVR    Q+LG  + ++  ++ ++H+I 
Sbjct: 1    MEEQLPEHKRRKIREVVLDILKTADIETATEYSVRTTVAQQLGTEILNIQEKQFIRHVIE 60

Query: 197  SFLLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQ-------PQQRQLTDVRLG----LE 343
            SFLLS+      T  ++R               ++       P Q Q  D  L     ++
Sbjct: 61   SFLLSTVEN--PTLDNNRRISTAEKGVNTDFVAEEQLAADHPPTQHQEADGSLPNGNLVD 118

Query: 344  ANYNG-KIICKLSDKRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSS 520
            +N N  + ICKLSDKR V I D +G   VAIRDF  KDG ++P    + G++L+  QWSS
Sbjct: 119  SNENNCRTICKLSDKRSVGILDIHGKPFVAIRDFYEKDGKLVPS---SRGINLSVQQWSS 175

Query: 521  FRNSFPSIQEAIENMESRLGRKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQT 700
            FR+SFP+I EAI  ME ++              +      Q+ AD+      +A  + Q 
Sbjct: 176  FRSSFPAIVEAIATMELKI--------------RSTTCENQTAADV------AAQGREQI 215

Query: 701  EADISNLTSILHHPTKSEQTESEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSE 880
            + +IS   S+ H   K                  A++N +   +SN    ++S+ Q+P E
Sbjct: 216  QTNISQ--SVNHQEGKLS----------------ADRNENGDDVSNSAIITNSQVQMPIE 257

Query: 881  RNQTDAVISDLVPPFVAFNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVR- 1057
            R                              +QTEAG S SAP F  Q Q+  +  T   
Sbjct: 258  R------------------------------QQTEAGISNSAPCFAPQGQIQQSSRTTSL 287

Query: 1058 PKQLVPIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKA 1237
               LVP++  RLDG+NY+ WKHQ EFFL QLNIAY LSE CP+   N             
Sbjct: 288  AHSLVPVKTIRLDGKNYYCWKHQAEFFLKQLNIAYVLSEPCPNTLENR------------ 335

Query: 1238 AIQRWIDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAYDEDFGTKRSQINKY 1417
              Q+W+DDDY+C HNILNSL D LF+ YS KNYSA+ELWEEL+  YDEDFGTK S++NKY
Sbjct: 336  --QKWVDDDYLCCHNILNSLSDKLFEEYSKKNYSAKELWEELRSTYDEDFGTKSSEVNKY 393

Query: 1418 IHFQMVDGVSILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMRE 1594
            + F MVDG+SILEQ+QELHKIADS++ASG WID+NFH+S I++KLPPSWK+ R RLM E
Sbjct: 394  LQFLMVDGISILEQVQELHKIADSLMASGIWIDENFHISAIIAKLPPSWKDCRTRLMHE 452


>ref|XP_009621902.1| PREDICTED: uncharacterized protein LOC104113443 [Nicotiana
            tomentosiformis]
          Length = 690

 Score =  372 bits (955), Expect = e-118
 Identities = 222/542 (40%), Positives = 303/542 (55%), Gaps = 13/542 (2%)
 Frame = +2

Query: 17   MDEDLSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIR 196
            M+E L   KRRKI+  VL ILK  D++TATE+SVR  A Q+LG  + ++  +  ++H++ 
Sbjct: 1    MEEQLPDPKRRKIQGIVLDILKTADIETATEYSVRTTAAQQLGTEILNIQEKNYIRHVVE 60

Query: 197  SFLLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQ-------PQQRQLTDVRLG----LE 343
            SFLLS+      T  ++R               ++       P Q+Q  D  L     ++
Sbjct: 61   SFLLSTVEK--PTLDNNRRISTAEKETNKDFVAEEQLSADHPPTQQQEADGSLPNPHFVD 118

Query: 344  ANYNG-KIICKLSDKRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSS 520
            +N N  + ICKLS KR V I D +G   VAIRDF  KDG ++P    + G++L+A QWSS
Sbjct: 119  SNENNCRTICKLSGKRSVGILDIHGKPFVAIRDFYEKDGKLVPS---SRGINLSAQQWSS 175

Query: 521  FRNSFPSIQEAIENMESRLGRKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQT 700
            FR+SFP+I EAI  MES++                          +T S       ++QT
Sbjct: 176  FRSSFPAIVEAIVTMESKIR-------------------------LTTS-------ENQT 203

Query: 701  EADISNLTSILHHPTKSEQTESEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSE 880
             A+++       H  +   T     + +      A++  +   I N    ++SR Q+P E
Sbjct: 204  AAEVAA------HGREQIHTNISQSVNHQEGKITADRKENGDDICNSAIITNSRVQMPLE 257

Query: 881  RNQTDAVISDLVPPFVAFNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVR- 1057
            R+Q                              TEAG S SAP F  Q Q+  +  T   
Sbjct: 258  RSQ------------------------------TEAGISNSAPCFAPQGQIQPSSRTTSL 287

Query: 1058 PKQLVPIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKA 1237
             + LVP++  RLDG NY+ WKHQ+EFF+ QLNIAY +SE CP++  N             
Sbjct: 288  ARSLVPVKTIRLDGTNYYCWKHQIEFFIKQLNIAYVISEPCPNILENR------------ 335

Query: 1238 AIQRWIDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAYDEDFGTKRSQINKY 1417
              Q+W+D+DY+C HNILNSL D LF+ YS KNYSA+ELWEEL+  YDEDFGTK S++NKY
Sbjct: 336  --QKWVDNDYLCSHNILNSLSDKLFEEYSKKNYSAKELWEELRSTYDEDFGTKSSEVNKY 393

Query: 1418 IHFQMVDGVSILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREE 1597
            + FQMVDG+SILEQ+QELHKIADS++ASG WID+NFH+S I++KLPPSWK+ RARLM E 
Sbjct: 394  LQFQMVDGISILEQVQELHKIADSLMASGIWIDENFHISAIIAKLPPSWKDCRARLMHEN 453

Query: 1598 FL 1603
             L
Sbjct: 454  VL 455


>ref|XP_019158388.1| PREDICTED: uncharacterized protein LOC109155106 [Ipomoea nil]
 ref|XP_019158389.1| PREDICTED: uncharacterized protein LOC109155106 [Ipomoea nil]
          Length = 506

 Score =  363 bits (932), Expect = e-116
 Identities = 216/528 (40%), Positives = 303/528 (57%)
 Frame = +2

Query: 20   DEDLSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRS 199
            +E L   KRRKIEDTV+ IL   DL+TATE SVR  A  RLG  LS L H+RL++  I S
Sbjct: 3    EEQLPEPKRRKIEDTVIDILTTADLETATELSVRTDAAVRLGCDLSTLPHKRLIRDTIES 62

Query: 200  FLLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNGKIICKLS 379
            FLLSSA     T+++H                 Q +Q  +        A+  G++ICKL+
Sbjct: 63   FLLSSA-----TSAAHPKELLRNNNETDNQENDQGRQVDIDAAPDPDTADGIGQVICKLT 117

Query: 380  DKRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIE 559
             +RMVT+    G T+V+I ++  KDG  +     N G+++T  QWSSFR+SFP+I+EAI 
Sbjct: 118  WRRMVTVRSLGGETLVSIWEYYRKDGKQI---ATNKGVNMTVKQWSSFRSSFPAIEEAII 174

Query: 560  NMESRLGRKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQTEADISNLTSILHH 739
             MES++  + A       ++K +A    +    T   +D+     +TE  I +     +H
Sbjct: 175  KMESKIRCERA-------SKKTKADKAVTSRSFT---VDAPEVSGKTETYIPSSNDSFNH 224

Query: 740  PTKSEQTESEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSERNQTDAVISDLVP 919
                   + +A+      D   +  +++ +IS              +RNQ     S  +P
Sbjct: 225  QENGSVEKKQAD----NLDDTVKSTNTEGLIS-------------IQRNQKLLATSSPMP 267

Query: 920  PFVAFNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRPKQLVPIQIARLDG 1099
                        S P+ER Q                  H+++       LVP  I RLDG
Sbjct: 268  -----------NSAPEERMQ------------------HNSLTNFPSVGLVP--ITRLDG 296

Query: 1100 RNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRWIDDDYMCRH 1279
            +NY+ WKH MEFFL QLNIAY L+E CP +   PE S +E ++ KAA+++WI DD++C  
Sbjct: 297  KNYYCWKHLMEFFLKQLNIAYVLTEPCPKVPITPEVSSEETLQAKAAVKKWIHDDHVCCR 356

Query: 1280 NILNSLCDNLFQFYSPKNYSARELWEELKLAYDEDFGTKRSQINKYIHFQMVDGVSILEQ 1459
            +ILNSL D LF+ YS K Y+++ELWE+LKL YDEDFGT RSQ+NKYI FQ++DG+SIL+Q
Sbjct: 357  SILNSLSDKLFEEYSNKTYTSKELWEKLKLIYDEDFGTMRSQVNKYIQFQILDGISILDQ 416

Query: 1460 IQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            + EL+ IADSI+ASG  +D+NFHVS I+SKLPPSWK++R +LM E+FL
Sbjct: 417  VIELNNIADSIMASGVLVDENFHVSAIISKLPPSWKDYRVKLMNEKFL 464


>ref|XP_016461388.1| PREDICTED: uncharacterized protein LOC107784729 isoform X2 [Nicotiana
            tabacum]
          Length = 784

 Score =  368 bits (945), Expect = e-115
 Identities = 227/542 (41%), Positives = 300/542 (55%), Gaps = 13/542 (2%)
 Frame = +2

Query: 17   MDEDLSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIR 196
            M+E L   KRRKI+  VL ILK  D++TATE+SVR  A Q+LG  + ++  +  ++H++ 
Sbjct: 1    MEEQLPDPKRRKIQGIVLDILKTADIETATEYSVRTTAAQQLGTEILNIQEKNYIRHVVE 60

Query: 197  SFLLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQ-------PQQRQLTDVRLG----LE 343
            SFLLS+      T  ++R               ++       P Q+Q  D  L     ++
Sbjct: 61   SFLLSTVEK--PTLDNNRRISTAEKETNKDFVAEEQLSADHPPTQQQEADGSLPNPHFVD 118

Query: 344  ANYNG-KIICKLSDKRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSS 520
            +N N  + ICKLS KR V I D +G   VAIRDF  KDG ++P    + G++L+A QWSS
Sbjct: 119  SNENNCRTICKLSGKRSVGILDIHGKPFVAIRDFYEKDGKLVPS---SRGINLSAQQWSS 175

Query: 521  FRNSFPSIQEAIENMESRLGRKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQT 700
            FR+SFP+I EAI  ME                                S I     ++QT
Sbjct: 176  FRSSFPAIVEAIATME--------------------------------SKIRLTTSENQT 203

Query: 701  EADISNLTSILHHPTKSEQTESEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSE 880
             A+++                   +I    + SV   NH +  I     T+D +E     
Sbjct: 204  AAEVA--------------ANGREQIQTNISQSV---NHQEGKI-----TADRKE----- 236

Query: 881  RNQTDAVISDLVPPFVAFNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRP 1060
             N  D   S ++           +  +P ER+QTEAG S SAP F  Q Q+  +  T   
Sbjct: 237  -NGDDVCNSAII--------TNSQVQMPLERQQTEAGISNSAPCFAPQGQIQQSSRTTSL 287

Query: 1061 KQ-LVPIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKA 1237
             Q LVP++  RLDG NY+ WKHQ+EFFL QLNIAY LSE CP+   N             
Sbjct: 288  AQSLVPVKTIRLDGTNYYCWKHQIEFFLKQLNIAYVLSEPCPNTLENR------------ 335

Query: 1238 AIQRWIDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAYDEDFGTKRSQINKY 1417
              Q+W+DDDY+C  NI NSL D LF+ YS KNYSA+ELWEEL+  YDEDFGTK S++NKY
Sbjct: 336  --QKWVDDDYLCCRNISNSLSDKLFEEYSKKNYSAKELWEELRSTYDEDFGTKSSEVNKY 393

Query: 1418 IHFQMVDGVSILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREE 1597
            + FQMVDG+SILEQ+QELHKIADS++ASG WID+NFH+S I++KLPPSWK+ RARLM E 
Sbjct: 394  LQFQMVDGISILEQVQELHKIADSLMASGIWIDENFHISAIIAKLPPSWKDCRARLMHEN 453

Query: 1598 FL 1603
             L
Sbjct: 454  VL 455


>ref|XP_016461387.1| PREDICTED: uncharacterized protein LOC107784729 isoform X1 [Nicotiana
            tabacum]
          Length = 828

 Score =  368 bits (945), Expect = e-115
 Identities = 227/542 (41%), Positives = 300/542 (55%), Gaps = 13/542 (2%)
 Frame = +2

Query: 17   MDEDLSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIR 196
            M+E L   KRRKI+  VL ILK  D++TATE+SVR  A Q+LG  + ++  +  ++H++ 
Sbjct: 1    MEEQLPDPKRRKIQGIVLDILKTADIETATEYSVRTTAAQQLGTEILNIQEKNYIRHVVE 60

Query: 197  SFLLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQ-------PQQRQLTDVRLG----LE 343
            SFLLS+      T  ++R               ++       P Q+Q  D  L     ++
Sbjct: 61   SFLLSTVEK--PTLDNNRRISTAEKETNKDFVAEEQLSADHPPTQQQEADGSLPNPHFVD 118

Query: 344  ANYNG-KIICKLSDKRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSS 520
            +N N  + ICKLS KR V I D +G   VAIRDF  KDG ++P    + G++L+A QWSS
Sbjct: 119  SNENNCRTICKLSGKRSVGILDIHGKPFVAIRDFYEKDGKLVPS---SRGINLSAQQWSS 175

Query: 521  FRNSFPSIQEAIENMESRLGRKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQT 700
            FR+SFP+I EAI  ME                                S I     ++QT
Sbjct: 176  FRSSFPAIVEAIATME--------------------------------SKIRLTTSENQT 203

Query: 701  EADISNLTSILHHPTKSEQTESEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSE 880
             A+++                   +I    + SV   NH +  I     T+D +E     
Sbjct: 204  AAEVA--------------ANGREQIQTNISQSV---NHQEGKI-----TADRKE----- 236

Query: 881  RNQTDAVISDLVPPFVAFNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRP 1060
             N  D   S ++           +  +P ER+QTEAG S SAP F  Q Q+  +  T   
Sbjct: 237  -NGDDVCNSAII--------TNSQVQMPLERQQTEAGISNSAPCFAPQGQIQQSSRTTSL 287

Query: 1061 KQ-LVPIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKA 1237
             Q LVP++  RLDG NY+ WKHQ+EFFL QLNIAY LSE CP+   N             
Sbjct: 288  AQSLVPVKTIRLDGTNYYCWKHQIEFFLKQLNIAYVLSEPCPNTLENR------------ 335

Query: 1238 AIQRWIDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAYDEDFGTKRSQINKY 1417
              Q+W+DDDY+C  NI NSL D LF+ YS KNYSA+ELWEEL+  YDEDFGTK S++NKY
Sbjct: 336  --QKWVDDDYLCCRNISNSLSDKLFEEYSKKNYSAKELWEELRSTYDEDFGTKSSEVNKY 393

Query: 1418 IHFQMVDGVSILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREE 1597
            + FQMVDG+SILEQ+QELHKIADS++ASG WID+NFH+S I++KLPPSWK+ RARLM E 
Sbjct: 394  LQFQMVDGISILEQVQELHKIADSLMASGIWIDENFHISAIIAKLPPSWKDCRARLMHEN 453

Query: 1598 FL 1603
             L
Sbjct: 454  VL 455


>ref|XP_019265620.1| PREDICTED: uncharacterized protein LOC109242899 [Nicotiana attenuata]
          Length = 628

 Score =  357 bits (916), Expect = e-112
 Identities = 221/534 (41%), Positives = 295/534 (55%), Gaps = 5/534 (0%)
 Frame = +2

Query: 17   MDEDLSASKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIR 196
            M+E L   KRRKI + VL ILK  D++TATE+SVR    Q+LG  + ++  ++ ++H+I 
Sbjct: 1    MEEQLPEPKRRKIREVVLDILKTADIETATEYSVRTTVAQQLGTEILNIQEKQFIRHVIE 60

Query: 197  SFLLSSAAAILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLG----LEANYNG-K 361
            SFLLS+                             P Q+Q  D  L     L++N N  +
Sbjct: 61   SFLLSTV----------EQPTLDFVAEVQLSADNVPTQKQEADGSLPNGNFLDSNENNCR 110

Query: 362  IICKLSDKRMVTIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPS 541
             ICKLSDKR + I D +G   VAIRDF  KDG ++P    + G++L+A QWSSFR+SFP+
Sbjct: 111  TICKLSDKRSIGILDIHGKPFVAIRDFYEKDGKLVPS---SRGINLSAQQWSSFRSSFPA 167

Query: 542  IQEAIENMESRLGRKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQTEADISNL 721
            I EAI  MES++              +      Q+ AD+      +A  + Q + +IS  
Sbjct: 168  IVEAIATMESKI--------------RLTTCENQTAADV------AAQGREQIQTNISQ- 206

Query: 722  TSILHHPTKSEQTESEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSERNQTDAV 901
             S+ H   K       + + N   D V     + A+I+N      S+ Q+P ER QT+A 
Sbjct: 207  -SVNHQEGKL------SAVRNENGDDVC----NSAIITN------SQVQMPIERQQTEAD 249

Query: 902  ISDLVPPFVAFNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRPKQLVPIQ 1081
            IS+                              S P F  Q  +  +  T    Q+ PI+
Sbjct: 250  ISN------------------------------SLPCFSPQGHIQQSSRTTSLAQM-PIK 278

Query: 1082 IARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRWIDD 1261
              RLDG+NY+ WKHQ EFFL QLNIAY LSE CP+   N               Q+W+DD
Sbjct: 279  TIRLDGKNYYCWKHQTEFFLKQLNIAYVLSEPCPNTLENR--------------QKWVDD 324

Query: 1262 DYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAYDEDFGTKRSQINKYIHFQMVDG 1441
            DY+   NILNSL D LF+ YS KNYSA+ELWEEL+  +DEDFGTK S++NKY+ FQMVDG
Sbjct: 325  DYLSCRNILNSLSDKLFEEYSKKNYSAKELWEELRSTFDEDFGTKSSEVNKYLQFQMVDG 384

Query: 1442 VSILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            +SILEQ+QELHKI DS++ASG WID+NFH+S I++KLPPSWK+ R RLM E  L
Sbjct: 385  ISILEQVQELHKIVDSLMASGIWIDENFHISAIIAKLPPSWKDCRTRLMHENVL 438


>ref|XP_004306659.1| PREDICTED: uncharacterized protein LOC101309666 [Fragaria vesca
            subsp. vesca]
          Length = 564

 Score =  310 bits (794), Expect = 2e-95
 Identities = 193/521 (37%), Positives = 284/521 (54%), Gaps = 2/521 (0%)
 Frame = +2

Query: 47   RKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRSFLLSSAAAI 226
            RKIE+TV+ IL++T LD  +EF VRAAA +RLG+ LSD+  +  V+ ++ SFL+S+A A 
Sbjct: 7    RKIEETVVDILRSTSLDEMSEFKVRAAASERLGIDLSDVERKSFVRGVVESFLISTAEAA 66

Query: 227  LGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNG-KIICKLSDKRMVTIH 403
               +                   + P   +  + RL  EAN +G + ICKLS+KR V IH
Sbjct: 67   APES-------------------EPPGVGEEKEARLKKEANEDGERFICKLSNKRNVVIH 107

Query: 404  DAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIENMESRLGR 583
            D  G T+V+IR+F  K G  LP      G+SL A QW++F+NS P+I+EAI+ MES+L  
Sbjct: 108  DFRGKTLVSIREFYKKGGKELPSA---RGISLPAEQWTTFKNSVPAIEEAIKKMESKLR- 163

Query: 584  KHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQTEADISNLTSILHHPTKSEQTE 763
                   + +N K     K+++            +  Q E                +QTE
Sbjct: 164  -------SEINSKRTEDGKEAE------------DFKQAE--------------DGKQTE 190

Query: 764  SEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSERNQTDAVISDLVPPFVAFNAV 943
            S  +I N       ++      I N     D ++    ++++    I D           
Sbjct: 191  SSKQIENGKQAEDGKRTEGSKQIENGKRNEDGKQAEGGKQSEISKRIED----------- 239

Query: 944  GPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRPKQLVPIQIARLDGRNYHSWKH 1123
              E++  ++ KQTE    +        E +  ++N V P +   I+ +R +G+NY  W  
Sbjct: 240  -SEQN--EDGKQTEDARQS--------EDISASLNGVAPHEFFSIETSRFNGKNYPIWAQ 288

Query: 1124 QMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRWIDDDYMCRHNILNSLCD 1303
            QMEF L QL I Y L   CP ++  PEAS DE  + KAA Q+W++DD++CR +ILN+L D
Sbjct: 289  QMEFLLKQLKIGYVLFVSCPVITLGPEASTDEIAQAKAAEQKWMNDDFVCRRSILNALSD 348

Query: 1304 NLFQFYSPKNYSARELWEELKLAY-DEDFGTKRSQINKYIHFQMVDGVSILEQIQELHKI 1480
            +L   Y+ K  +ARELWE+LKL +  E FGTKRS + KY+ FQM++G  +L+QIQE + I
Sbjct: 349  DLLNLYARKTTTARELWEDLKLLHLYEKFGTKRSLVKKYMEFQMLEGRLVLDQIQEFNDI 408

Query: 1481 ADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            ADSI+ASG  +++ FHV  ++SKLP SWK+   +LM EE L
Sbjct: 409  ADSIVASGMVVEEKFHVGAVISKLPSSWKDVSIKLMSEEHL 449


>ref|XP_021613680.1| uncharacterized protein LOC110615833 [Manihot esculenta]
          Length = 556

 Score =  304 bits (779), Expect = 3e-93
 Identities = 198/520 (38%), Positives = 274/520 (52%), Gaps = 2/520 (0%)
 Frame = +2

Query: 50   KIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRSFLLSSAAAIL 229
            KI++TVL IL+  D++  TEF VRAAA +RLG+ LSD+  +R ++ L+ SFLLS+    +
Sbjct: 8    KIQETVLHILRKADMNEMTEFKVRAAASERLGIDLSDIHCKRFIRGLVESFLLSTLE--V 65

Query: 230  GTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNGKIICKLSDKRMVTIHDA 409
            G                     ++ QQ ++       E N   ++ICKLS+KR V I   
Sbjct: 66   GAEEGKEAGSSNGGREDTPEMAREGQQ-EIPRKEFDSEGN---RVICKLSNKRNVVIQKF 121

Query: 410  YGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIENMESRLGRKH 589
             G + V+I +F  KDG  +     N G+SLT  QW +FR S P I+E I  ++S+   + 
Sbjct: 122  KGKSFVSIWEFYHKDGRQIRS---NKGISLTGEQWLAFRKSVPLIEEGIIKLKSK--SRS 176

Query: 590  AVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQT-EADISNLTSILHHPTKSEQTES 766
             +H  +N      A A  +  ++   V +     S     +  NL       T S   E 
Sbjct: 177  NLHDDSNEQISNLATAS-TPCELNRQVFNMVTASSHVLSGEAPNLV------TASSPCEL 229

Query: 767  EAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSERNQTDAVISDLVPPFVAFNAVG 946
              +I+N  T S    N      SNL + S   EQ+ +  N                    
Sbjct: 230  NRQISNMTTSSTHRLNGEA---SNLVTASRLHEQVSTSVND------------------- 267

Query: 947  PERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRPKQLVPIQIARLDGRNYHSWKHQ 1126
               S P+E                   Q+   VNT    + VP +I R DG+NY  W   
Sbjct: 268  ---STPNEHTS----------------QVSQLVNTPSFHEFVPFEINRFDGKNYQLWAPF 308

Query: 1127 MEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRWIDDDYMCRHNILNSLCDN 1306
            ME FL++L IAY L++ CPS+   PEAS +E  + KAA Q+W +DD++CRHNIL SL D 
Sbjct: 309  MESFLDKLKIAYVLTDPCPSVDIRPEASAEEIAQAKAAEQKWYNDDHLCRHNILTSLSDA 368

Query: 1307 LFQFYSPKNYSARELWEELKLAY-DEDFGTKRSQINKYIHFQMVDGVSILEQIQELHKIA 1483
            L+  YS K  SARELWEELKL Y  E+FG KRSQ+  YI FQ+VD   +L+Q++EL+ IA
Sbjct: 369  LYYQYSKKTKSARELWEELKLVYLYEEFGKKRSQVRNYIEFQIVDERPVLDQVKELNNIA 428

Query: 1484 DSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            DSI+ASG + D+ FHVS I+SKLPPSWK+F  +LM EE+L
Sbjct: 429  DSIVASGMFFDEKFHVSAIISKLPPSWKDFCIKLMCEEYL 468


>ref|XP_017232029.1| PREDICTED: uncharacterized protein LOC108206293 [Daucus carota subsp.
            sativus]
          Length = 521

 Score =  297 bits (760), Expect = 8e-91
 Identities = 187/524 (35%), Positives = 277/524 (52%), Gaps = 2/524 (0%)
 Frame = +2

Query: 38   SKRRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRSFLLSSA 217
            ++R KIE+ V++I++  DL+T +EFS+R    +RL +  S L  + LV+ ++ S+LLS  
Sbjct: 5    AERTKIEEAVVEIIRNGDLETLSEFSIRVMLAERLNIDFSGLESKLLVRRIVESYLLSLP 64

Query: 218  AAILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNGKIICKLSDKRMVT 397
                      R               K P              +YN   IC+LS++R V+
Sbjct: 65   DETENAVEVVREQSN-----------KAPHC-----------ISYN---ICELSNRRSVS 99

Query: 398  IHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIENMESRL 577
            +    G T+V   DF  KDGN       + G+SLT +QWS+F+    +I+EAI  + S+ 
Sbjct: 100  VKKFRGDTLVWFSDFYEKDGNQF-----DGGISLTESQWSAFKQGISAIEEAILKINSQK 154

Query: 578  GRKHAVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQTEADISNLTSILHHPTKSEQ 757
             +     +   ++ +  A A Q +  +              EA+++N  SI         
Sbjct: 155  RKCEVKKKCEAISNEVSAVAPQGEISIEGK-----------EANVNNKVSIF-------- 195

Query: 758  TESEAEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSERNQ--TDAVISDLVPPFVA 931
                     P  +   ++ H++A  SN ++ S   E IPS R+Q  TD       PP   
Sbjct: 196  --------TPGGEISTKREHAEAGESNASTASGLEEHIPSMRHQKHTD-------PPDSV 240

Query: 932  FNAVGPERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRPKQLVPIQIARLDGRNYH 1111
             N + P             G   S+PAF              P++++PI   RL GRNY 
Sbjct: 241  AN-ISPNGQ----------GKYNSSPAF-------------TPQRIIPIPNTRLSGRNYS 276

Query: 1112 SWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRWIDDDYMCRHNILN 1291
             W  Q+ F LNQL IAY L++ CP  + + EA  ++  + KAA ++W+DDDY+CR  ILN
Sbjct: 277  CWMRQISFVLNQLKIAYVLTQPCPDTTPHDEAYSEKAAQAKAAARKWVDDDYLCRLTILN 336

Query: 1292 SLCDNLFQFYSPKNYSARELWEELKLAYDEDFGTKRSQINKYIHFQMVDGVSILEQIQEL 1471
            SL D+L+  YS +  S++ELWEELK +YDEDF TK S +++Y+ +Q+VDG SILEQ+QE 
Sbjct: 337  SLSDHLYDQYSKRMLSSKELWEELKSSYDEDFRTKISHVSRYMQYQIVDGASILEQVQEF 396

Query: 1472 HKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            H+IAD+IIA G  ID+NFHV  IVSKLPPSWKE R +L++E++L
Sbjct: 397  HEIADAIIACGMRIDENFHVGAIVSKLPPSWKECRMKLLKEDWL 440


>ref|XP_012083199.1| uncharacterized protein LOC105642839 [Jatropha curcas]
          Length = 544

 Score =  251 bits (640), Expect = 8e-73
 Identities = 182/519 (35%), Positives = 254/519 (48%), Gaps = 1/519 (0%)
 Frame = +2

Query: 50   KIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRSFLLSSAAAIL 229
            KIE+TVL ILK  D+D  TEF VR    +RLG+ LSD+  +R ++ ++ SFLLS+     
Sbjct: 8    KIEETVLDILKNADMDDMTEFKVRVTTSERLGIDLSDIQRKRFIRGVVESFLLSTMEV-- 65

Query: 230  GTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNGKIICKLSDKRMVTIHDA 409
               S                 + + +   +       + N   +IICKLS++R V I   
Sbjct: 66   ---SGEEGKEADTNFREENQGMARKEHETIPKKEFDSDGN---RIICKLSNRRNVVI--- 116

Query: 410  YGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIENMESRLGRKH 589
                    +DF  K                          SF SI+E       + GR+H
Sbjct: 117  --------QDFKGK--------------------------SFISIREFYH----KDGRQH 138

Query: 590  AVHQLNNLNRKPEAFAKQSDADMTNSVIDSAVEKSQTEADISNLTSILHHPTKSEQTESE 769
              ++   L  +  +  ++S       +I+ A+ K Q     S L S  H        E  
Sbjct: 139  PSNKGICLTAEQWSVFRKSVP-----LIEDAIVKMQ-----SKLRSESHD-------EKN 181

Query: 770  AEITNPATDSVAEKNHSQAVISNLTSTSDSREQIPSERNQTDAVISDLVPPFVAFNAVGP 949
             +I+N  T   +E N     +S++ + S +     +    T +   +L   F        
Sbjct: 182  DQISNVVTACTSEINGR---VSDVVTVSTNELNGQASNFATASAHHELNGQF-------- 230

Query: 950  ERSIPDERKQTEAGTSTSAPAFPTQEQLHHTVNTVRPKQLVPIQIARLDGRNYHSWKHQM 1129
             +S+ +   +     S S  A    E             L PI+I R DG+NY  W  QM
Sbjct: 231  SKSVTNSTHELNGQVSDSGIASSVHE-------------LFPIEINRFDGKNYQCWAPQM 277

Query: 1130 EFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRWIDDDYMCRHNILNSLCDNL 1309
            E FL QLNIAY L+  CPS +  PEAS +   + KA  Q+W++DDYMCR NIL SL D L
Sbjct: 278  ELFLKQLNIAYVLTNPCPSSAMKPEASAEGIAQAKAVEQKWLNDDYMCRRNILASLSDAL 337

Query: 1310 FQFYSPKNYSARELWEELKLAY-DEDFGTKRSQINKYIHFQMVDGVSILEQIQELHKIAD 1486
            +  YS    SA+ELWEELKL Y  E+FG KRS + KYI FQMV+   IL+Q+QEL+ IAD
Sbjct: 338  YYQYSKNAKSAKELWEELKLVYLYEEFGKKRSHVKKYIEFQMVEEKPILDQVQELNSIAD 397

Query: 1487 SIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            SI+A+G +ID+ FHVS I+SKLPPSWK+F  +LM EE+L
Sbjct: 398  SIVATGIFIDEKFHVSAIISKLPPSWKDFCMKLMCEEYL 436


>ref|XP_021280140.1| uncharacterized protein LOC110413599 [Herrania umbratica]
          Length = 474

 Score =  237 bits (604), Expect = 3e-68
 Identities = 112/178 (62%), Positives = 140/178 (78%), Gaps = 1/178 (0%)
 Frame = +2

Query: 1073 PIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRW 1252
            PI+  R DG+NYH W  QME FL QL IAY L++ CPSLS +PEAS +E  + KA  ++W
Sbjct: 187  PIETTRFDGKNYHCWAEQMELFLKQLQIAYVLTDPCPSLSLSPEASSEESAQAKATEKKW 246

Query: 1253 IDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAY-DEDFGTKRSQINKYIHFQ 1429
            ++DDY+CRH+IL+SL DNL+  +S K  SA+ELWEELKL Y  E+FGTKRSQ+ KYI FQ
Sbjct: 247  MNDDYLCRHSILSSLSDNLYYQFSKKTKSAKELWEELKLVYLYEEFGTKRSQVRKYIEFQ 306

Query: 1430 MVDGVSILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            +VDG  ILEQ+QEL+ IADSI+A+G  ID+NFHVS I+SKLPPSWK+F   LMREE+L
Sbjct: 307  IVDGRPILEQMQELNSIADSIVAAGMMIDENFHVSTIISKLPPSWKDFCVELMREEYL 364



 Score =  119 bits (297), Expect = 5e-25
 Identities = 70/203 (34%), Positives = 111/203 (54%), Gaps = 3/203 (1%)
 Frame = +2

Query: 44  RRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRSFLLSSAAA 223
           R KIE+TV +IL   D++  TEF VR AA +RLG+ LSD  H++ V+ +I SFLLS+   
Sbjct: 6   RPKIEETVKEILSKADMEEMTEFKVRVAASERLGIDLSDFNHKKFVREVIESFLLSTVEE 65

Query: 224 ILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNG---KIICKLSDKRMV 394
                                    +    +L +    ++  ++G   ++ICKL+DKR V
Sbjct: 66  ---------------------NGDGKELNSKLREEEAKIKKEFDGDGDRLICKLADKRNV 104

Query: 395 TIHDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIENMESR 574
            +H+  G T V+IR+F +KDG  LP      G+SLT+  WS+ +NSFP++  A++ M+S+
Sbjct: 105 VVHEFRGKTYVSIREFYVKDGKELPSA---RGVSLTSEIWSALKNSFPAVDAAVKKMQSK 161

Query: 575 LGRKHAVHQLNNLNRKPEAFAKQ 643
           L  K    Q  +++    AF+ +
Sbjct: 162 LSTKLDGEQNGDVSNSVTAFSHE 184


>gb|EOX95687.1| Zinc knuckle family protein, putative isoform 2 [Theobroma cacao]
          Length = 476

 Score =  235 bits (599), Expect = 1e-67
 Identities = 110/178 (61%), Positives = 141/178 (79%), Gaps = 1/178 (0%)
 Frame = +2

Query: 1073 PIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRW 1252
            PI+  R DG+NYH W  QME FL QL IAY L++ CPSL+ +PEAS +E  + KA  ++W
Sbjct: 189  PIETTRFDGKNYHCWAEQMELFLKQLQIAYVLTDPCPSLTLSPEASSEESAQAKATEKKW 248

Query: 1253 IDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAY-DEDFGTKRSQINKYIHFQ 1429
            ++DDY+CRH+IL+SL DNL+  +S K  SA+ELWEELKL Y  E+FGTKRSQ+ KYI FQ
Sbjct: 249  MNDDYLCRHSILSSLSDNLYYQFSKKTKSAKELWEELKLVYLYEEFGTKRSQVRKYIEFQ 308

Query: 1430 MVDGVSILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            +VDG  IL+Q+QEL+ IADSI+A+G  ID+NFHVS I+SKLPPSWK+F  +LMREE+L
Sbjct: 309  IVDGRPILKQMQELNSIADSIVAAGMMIDENFHVSTIISKLPPSWKDFCVKLMREEYL 366



 Score =  123 bits (308), Expect = 2e-26
 Identities = 71/201 (35%), Positives = 113/201 (56%), Gaps = 1/201 (0%)
 Frame = +2

Query: 44  RRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRSFLLSSAAA 223
           R+KIE+TV +IL   D++  TEF VR AA +RLG+ LSD  H++ V+ +I SFLLS+   
Sbjct: 6   RQKIEETVREILSKADMEEMTEFKVRVAASERLGIDLSDFNHKKFVREVIESFLLSTV-- 63

Query: 224 ILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNG-KIICKLSDKRMVTI 400
                                  +    + +   +++  E + +G ++ICKL+DKR V +
Sbjct: 64  ---------------EENGDVEELNSKLREEEAKIKIKKEIDGDGDRLICKLADKRNVVV 108

Query: 401 HDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIENMESRLG 580
           H+  G T V+IR+F +KDG  LP      G+SLT+  WS+ +NSFP+I  A++ M+S+L 
Sbjct: 109 HEFRGKTYVSIREFYVKDGKELPSA---RGVSLTSEIWSALKNSFPAIDAAVKKMQSKLS 165

Query: 581 RKHAVHQLNNLNRKPEAFAKQ 643
            K    Q  +++    AF+ +
Sbjct: 166 TKLDGEQNGDVSNSVTAFSHE 186


>ref|XP_007051530.2| PREDICTED: uncharacterized protein LOC18613965 isoform X1 [Theobroma
            cacao]
          Length = 476

 Score =  233 bits (595), Expect = 6e-67
 Identities = 109/178 (61%), Positives = 140/178 (78%), Gaps = 1/178 (0%)
 Frame = +2

Query: 1073 PIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRW 1252
            PI+  R DG+NYH W  QME FL QL  AY L++ CPSL+ +PEAS +E  + KA  ++W
Sbjct: 189  PIETTRFDGKNYHCWAEQMELFLKQLQFAYVLTDPCPSLTLSPEASSEESAQAKATEKKW 248

Query: 1253 IDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAY-DEDFGTKRSQINKYIHFQ 1429
            ++DDY+CRH+IL+SL DNL+  +S K  SA+ELWEELKL Y  E+FGTKRSQ+ KYI FQ
Sbjct: 249  MNDDYLCRHSILSSLSDNLYYQFSKKTKSAKELWEELKLVYLYEEFGTKRSQVRKYIEFQ 308

Query: 1430 MVDGVSILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            +VDG  IL+Q+QEL+ IADSI+A+G  ID+NFHVS I+SKLPPSWK+F  +LMREE+L
Sbjct: 309  IVDGRPILKQMQELNSIADSIVAAGMMIDENFHVSTIISKLPPSWKDFCVKLMREEYL 366



 Score =  123 bits (308), Expect = 2e-26
 Identities = 71/201 (35%), Positives = 113/201 (56%), Gaps = 1/201 (0%)
 Frame = +2

Query: 44  RRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRSFLLSSAAA 223
           R+KIE+TV +IL   D++  TEF VR AA +RLG+ LSD  H++ V+ +I SFLLS+   
Sbjct: 6   RQKIEETVREILSKADMEEMTEFKVRVAASERLGIDLSDFNHKKFVREVIESFLLSTV-- 63

Query: 224 ILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNG-KIICKLSDKRMVTI 400
                                  +    + +   +++  E + +G ++ICKL+DKR V +
Sbjct: 64  ---------------EENGDVEELNSKLREEEAKIKIKKEIDGDGDRLICKLADKRNVVV 108

Query: 401 HDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIENMESRLG 580
           H+  G T V+IR+F +KDG  LP      G+SLT+  WS+ +NSFP+I  A++ M+S+L 
Sbjct: 109 HEFRGKTYVSIREFYVKDGKELPSA---RGVSLTSEIWSALKNSFPAIDAAVKKMQSKLS 165

Query: 581 RKHAVHQLNNLNRKPEAFAKQ 643
            K    Q  +++    AF+ +
Sbjct: 166 TKLDGEQNGDVSNSVTAFSHE 186


>gb|KDP28479.1| hypothetical protein JCGZ_14250 [Jatropha curcas]
          Length = 523

 Score =  233 bits (594), Expect = 2e-66
 Identities = 172/505 (34%), Positives = 243/505 (48%), Gaps = 1/505 (0%)
 Frame = +2

Query: 92   LDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRSFLLSSAAAILGTTSSHRXXXXXXX 271
            +D  TEF VR    +RLG+ LSD+  +R ++ ++ SFLLS+        S          
Sbjct: 1    MDDMTEFKVRVTTSERLGIDLSDIQRKRFIRGVVESFLLSTMEV-----SGEEGKEADTN 55

Query: 272  XXXXXXXIKQPQQRQLTDVRLGLEANYNGKIICKLSDKRMVTIHDAYGATMVAIRDFDMK 451
                   + + +   +       + N   +IICKLS++R V I           +DF  K
Sbjct: 56   FREENQGMARKEHETIPKKEFDSDGN---RIICKLSNRRNVVI-----------QDFKGK 101

Query: 452  DGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIENMESRLGRKHAVHQLNNLNRKPEA 631
                                      SF SI+E       + GR+H  ++   L  +  +
Sbjct: 102  --------------------------SFISIREFYH----KDGRQHPSNKGICLTAEQWS 131

Query: 632  FAKQSDADMTNSVIDSAVEKSQTEADISNLTSILHHPTKSEQTESEAEITNPATDSVAEK 811
              ++S       +I+ A+ K Q     S L S  H        E   +I+N  T   +E 
Sbjct: 132  VFRKSVP-----LIEDAIVKMQ-----SKLRSESHD-------EKNDQISNVVTACTSEI 174

Query: 812  NHSQAVISNLTSTSDSREQIPSERNQTDAVISDLVPPFVAFNAVGPERSIPDERKQTEAG 991
            N     +S++ + S +     +    T +   +L   F         +S+ +   +    
Sbjct: 175  NGR---VSDVVTVSTNELNGQASNFATASAHHELNGQF--------SKSVTNSTHELNGQ 223

Query: 992  TSTSAPAFPTQEQLHHTVNTVRPKQLVPIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLS 1171
             S S  A    E             L PI+I R DG+NY  W  QME FL QLNIAY L+
Sbjct: 224  VSDSGIASSVHE-------------LFPIEINRFDGKNYQCWAPQMELFLKQLNIAYVLT 270

Query: 1172 EQCPSLSSNPEASFDEQVKVKAAIQRWIDDDYMCRHNILNSLCDNLFQFYSPKNYSAREL 1351
              CPS +  PEAS +   + KA  Q+W++DDYMCR NIL SL D L+  YS    SA+EL
Sbjct: 271  NPCPSSAMKPEASAEGIAQAKAVEQKWLNDDYMCRRNILASLSDALYYQYSKNAKSAKEL 330

Query: 1352 WEELKLAY-DEDFGTKRSQINKYIHFQMVDGVSILEQIQELHKIADSIIASGTWIDQNFH 1528
            WEELKL Y  E+FG KRS + KYI FQMV+   IL+Q+QEL+ IADSI+A+G +ID+ FH
Sbjct: 331  WEELKLVYLYEEFGKKRSHVKKYIEFQMVEEKPILDQVQELNSIADSIVATGIFIDEKFH 390

Query: 1529 VSVIVSKLPPSWKEFRARLMREEFL 1603
            VS I+SKLPPSWK+F  +LM EE+L
Sbjct: 391  VSAIISKLPPSWKDFCMKLMCEEYL 415


>gb|EOX95686.1| Zinc knuckle family protein, putative isoform 1 [Theobroma cacao]
          Length = 612

 Score =  235 bits (599), Expect = 3e-66
 Identities = 110/178 (61%), Positives = 141/178 (79%), Gaps = 1/178 (0%)
 Frame = +2

Query: 1073 PIQIARLDGRNYHSWKHQMEFFLNQLNIAYTLSEQCPSLSSNPEASFDEQVKVKAAIQRW 1252
            PI+  R DG+NYH W  QME FL QL IAY L++ CPSL+ +PEAS +E  + KA  ++W
Sbjct: 189  PIETTRFDGKNYHCWAEQMELFLKQLQIAYVLTDPCPSLTLSPEASSEESAQAKATEKKW 248

Query: 1253 IDDDYMCRHNILNSLCDNLFQFYSPKNYSARELWEELKLAY-DEDFGTKRSQINKYIHFQ 1429
            ++DDY+CRH+IL+SL DNL+  +S K  SA+ELWEELKL Y  E+FGTKRSQ+ KYI FQ
Sbjct: 249  MNDDYLCRHSILSSLSDNLYYQFSKKTKSAKELWEELKLVYLYEEFGTKRSQVRKYIEFQ 308

Query: 1430 MVDGVSILEQIQELHKIADSIIASGTWIDQNFHVSVIVSKLPPSWKEFRARLMREEFL 1603
            +VDG  IL+Q+QEL+ IADSI+A+G  ID+NFHVS I+SKLPPSWK+F  +LMREE+L
Sbjct: 309  IVDGRPILKQMQELNSIADSIVAAGMMIDENFHVSTIISKLPPSWKDFCVKLMREEYL 366



 Score =  123 bits (308), Expect = 5e-26
 Identities = 71/201 (35%), Positives = 113/201 (56%), Gaps = 1/201 (0%)
 Frame = +2

Query: 44  RRKIEDTVLQILKATDLDTATEFSVRAAAEQRLGLGLSDLTHRRLVQHLIRSFLLSSAAA 223
           R+KIE+TV +IL   D++  TEF VR AA +RLG+ LSD  H++ V+ +I SFLLS+   
Sbjct: 6   RQKIEETVREILSKADMEEMTEFKVRVAASERLGIDLSDFNHKKFVREVIESFLLSTV-- 63

Query: 224 ILGTTSSHRXXXXXXXXXXXXXXIKQPQQRQLTDVRLGLEANYNG-KIICKLSDKRMVTI 400
                                  +    + +   +++  E + +G ++ICKL+DKR V +
Sbjct: 64  ---------------EENGDVEELNSKLREEEAKIKIKKEIDGDGDRLICKLADKRNVVV 108

Query: 401 HDAYGATMVAIRDFDMKDGNMLPKRGVNTGLSLTATQWSSFRNSFPSIQEAIENMESRLG 580
           H+  G T V+IR+F +KDG  LP      G+SLT+  WS+ +NSFP+I  A++ M+S+L 
Sbjct: 109 HEFRGKTYVSIREFYVKDGKELPSA---RGVSLTSEIWSALKNSFPAIDAAVKKMQSKLS 165

Query: 581 RKHAVHQLNNLNRKPEAFAKQ 643
            K    Q  +++    AF+ +
Sbjct: 166 TKLDGEQNGDVSNSVTAFSHE 186

