BLASTX nr result

ID: Chrysanthemum22_contig00031642 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00031642
         (792 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_023755781.1| protein CHUP1, chloroplastic [Lactuca sativa...   298   1e-96
gb|KVI10844.1| hypothetical protein Ccrd_010754 [Cynara carduncu...   288   3e-92
ref|XP_022036438.1| protein CHUP1, chloroplastic-like [Helianthu...   253   3e-79
ref|XP_022016261.1| protein CHUP1, chloroplastic-like [Helianthu...   245   4e-76
gb|KZN06523.1| hypothetical protein DCAR_007360 [Daucus carota s...   184   5e-53
ref|XP_017232931.1| PREDICTED: protein CHUP1, chloroplastic isof...   184   3e-52
ref|XP_017232930.1| PREDICTED: protein CHUP1, chloroplastic isof...   184   5e-52
ref|XP_020536551.1| protein CHUP1, chloroplastic isoform X2 [Jat...   171   3e-47
ref|XP_012079077.1| protein CHUP1, chloroplastic isoform X1 [Jat...   171   4e-47
gb|KDP45786.1| hypothetical protein JCGZ_17393 [Jatropha curcas]      169   1e-46
gb|PNT20110.1| hypothetical protein POPTR_009G073600v3 [Populus ...   168   2e-46
gb|EOY34534.1| F10K1.18 protein, putative isoform 4 [Theobroma c...   163   7e-45
ref|XP_007016912.2| PREDICTED: protein CHUP1, chloroplastic isof...   164   2e-44
ref|XP_022753610.1| protein CHUP1, chloroplastic-like [Durio zib...   166   2e-44
ref|XP_019178642.1| PREDICTED: protein CHUP1, chloroplastic [Ipo...   163   3e-44
gb|EOY34536.1| F10K1.18 protein, putative isoform 6 [Theobroma c...   163   6e-44
gb|EOY34531.1| F10K1.18 protein, putative isoform 1 [Theobroma c...   163   7e-44
ref|XP_014503946.1| protein CHUP1, chloroplastic isoform X3 [Vig...   160   4e-43
gb|KOM47416.1| hypothetical protein LR48_Vigan07g112000 [Vigna a...   159   4e-43
ref|XP_014503945.1| protein CHUP1, chloroplastic isoform X2 [Vig...   160   5e-43

>ref|XP_023755781.1| protein CHUP1, chloroplastic [Lactuca sativa]
 gb|PLY91519.1| hypothetical protein LSAT_7X85001 [Lactuca sativa]
          Length = 403

 Score =  298 bits (763), Expect = 1e-96
 Identities = 164/223 (73%), Positives = 177/223 (79%)
 Frame = +2

Query: 122 MPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALW 301
           MPSG DG+ +TFLKRELEASFVKIDS+E ENHELKQE+ RLKAQI+TLKAHDLERKS LW
Sbjct: 1   MPSGEDGSIITFLKRELEASFVKIDSMEKENHELKQEMGRLKAQINTLKAHDLERKSMLW 60

Query: 302 KKLQSSMDSSKDFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQWLQK 481
           KKLQSSMD  K  ++P QKTKLQV V    AP KK ++  TEA A+L        Q LQK
Sbjct: 61  KKLQSSMDCGKVIDEPTQKTKLQVAV--PEAPLKKPTSNHTEANAILPKPPPSPPQSLQK 118

Query: 482 RVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKENKNGVIGVVSVTYSRDM 661
           RV+A            VGS+AVRRVPAVMEFYRSLMKRDTQKENKNG  G + VT SRDM
Sbjct: 119 RVIAPPPPPPPLPSSPVGSRAVRRVPAVMEFYRSLMKRDTQKENKNGATGFLPVT-SRDM 177

Query: 662 IGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
           IGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV
Sbjct: 178 IGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 220


>gb|KVI10844.1| hypothetical protein Ccrd_010754 [Cynara cardunculus var. scolymus]
          Length = 427

 Score =  288 bits (736), Expect = 3e-92
 Identities = 157/226 (69%), Positives = 171/226 (75%), Gaps = 3/226 (1%)
 Frame = +2

Query: 122 MPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALW 301
           MPSG+DG+T+TFLKRELEAS VKIDSLE ENHELKQE++ LKAQ++TLKAHDLERKS LW
Sbjct: 1   MPSGDDGSTLTFLKRELEASLVKIDSLEKENHELKQEMASLKAQVNTLKAHDLERKSVLW 60

Query: 302 KKLQSSMDSSKDFNDPPQKTKLQVDVVLDHAPFKKSS---TIDTEAKAMLXXXXXXXXQW 472
           +KLQ+SMD  K  N+ PQK KL V+V     P KKSS   T +  A   L        QW
Sbjct: 61  RKLQNSMDCGKVANESPQKPKLHVEVPEAVLPSKKSSSNHTTEDNAMMNLPKPPPSPPQW 120

Query: 473 LQKRVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKENKNGVIGVVSVTYS 652
           LQKRVMA            VGS+AVRRVPAVMEFYRSLMKRDTQKENKNG  G + V   
Sbjct: 121 LQKRVMAPPPPPPPLPSTPVGSRAVRRVPAVMEFYRSLMKRDTQKENKNGATGFLPVMNC 180

Query: 653 RDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
           RDMIGEIENRSTYLTSIKSDVEKYGQ LNFLIREVQ AAFTEISDV
Sbjct: 181 RDMIGEIENRSTYLTSIKSDVEKYGQRLNFLIREVQGAAFTEISDV 226


>ref|XP_022036438.1| protein CHUP1, chloroplastic-like [Helianthus annuus]
 gb|OTG30032.1| hypothetical protein HannXRQ_Chr04g0128661 [Helianthus annuus]
          Length = 395

 Score =  253 bits (647), Expect = 3e-79
 Identities = 148/224 (66%), Positives = 160/224 (71%), Gaps = 1/224 (0%)
 Frame = +2

Query: 122 MPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALW 301
           MPSG+D    T L+ ELEAS +KI SLE EN ELKQE SRLKAQI+TLKAHDLERKS LW
Sbjct: 1   MPSGDDDDGAT-LRTELEASLLKIHSLEKENQELKQETSRLKAQINTLKAHDLERKSILW 59

Query: 302 KKLQSSMDSSKDFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQWLQK 481
           KKL  SMD  K  +DPPQK KLQVDV     P KKS++  T     +        Q LQ+
Sbjct: 60  KKLHHSMDCGKVIDDPPQKQKLQVDV-----PIKKSTSDQTN----VINPPPSPPQLLQR 110

Query: 482 RVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQ-KENKNGVIGVVSVTYSRD 658
           RV A            VGSKAVRRVPAVMEFYRSLMKRDTQ KE KNG  G+    YSRD
Sbjct: 111 RVTAPPPPPPPILPTPVGSKAVRRVPAVMEFYRSLMKRDTQLKETKNGASGLAPAAYSRD 170

Query: 659 MIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
           MIGEIENRSTYLTSIKSDVEKYG LLN+LIREVQSAAF EIS+V
Sbjct: 171 MIGEIENRSTYLTSIKSDVEKYGPLLNYLIREVQSAAFREISEV 214


>ref|XP_022016261.1| protein CHUP1, chloroplastic-like [Helianthus annuus]
 gb|OTF91150.1| hypothetical protein HannXRQ_Chr16g0507591 [Helianthus annuus]
          Length = 394

 Score =  245 bits (626), Expect = 4e-76
 Identities = 142/226 (62%), Positives = 163/226 (72%), Gaps = 3/226 (1%)
 Frame = +2

Query: 122 MPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALW 301
           MP G DGAT++FL+RELEAS +KIDSLE ENHELKQE SRLKAQI+TLKAHDLERKS LW
Sbjct: 1   MPVGYDGATLSFLRRELEASLMKIDSLEKENHELKQETSRLKAQINTLKAHDLERKSILW 60

Query: 302 KKLQSSMDSSKDFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQWLQK 481
           KKLQ++M+ +   ++PP K KLQ+DV     P KKS+    E   +           LQ+
Sbjct: 61  KKLQNTMNCT---DEPPHKPKLQIDV-----PLKKST---IEGNTISLPKPPPSPSPLQR 109

Query: 482 RVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRD---TQKENKNGVIGVVSVTYS 652
           +V              VGS+ VRRVPAVMEFYRSLMKRD    QKENK   +G+V VTYS
Sbjct: 110 KVAGPPPPPPPLPSSPVGSRTVRRVPAVMEFYRSLMKRDMTTLQKENKG--VGLVPVTYS 167

Query: 653 RDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
           RDMIGEIENRSTYLTSIKSDVEKYG+ LN LIREV+SA F EISDV
Sbjct: 168 RDMIGEIENRSTYLTSIKSDVEKYGERLNDLIREVESAGFREISDV 213


>gb|KZN06523.1| hypothetical protein DCAR_007360 [Daucus carota subsp. sativus]
          Length = 330

 Score =  184 bits (467), Expect = 5e-53
 Identities = 113/243 (46%), Positives = 146/243 (60%), Gaps = 16/243 (6%)
 Frame = +2

Query: 110 LELKMPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERK 289
           + L   SG +   + FLK+E EAS ++I SLE+EN ELKQE++RLKAQ+ TLKAHD ERK
Sbjct: 1   MPLNRSSGEESMRIAFLKKEFEASLLRIHSLENENQELKQEVARLKAQVHTLKAHDSERK 60

Query: 290 SALWKKLQSSMDSSKDFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXX- 466
           S LWKKLQ+S+D  K      QK    V+V      F+K S  D  A A +         
Sbjct: 61  SVLWKKLQNSLDV-KVPEKSQQKPSFSVEVPERSQVFQKFSPKDDLADAAVKKEIPAIRV 119

Query: 467 -------------QWLQKRV--MAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDT 601
                        + L   +  ++            VGSKAVRRVP VMEFYRSLM+RD+
Sbjct: 120 AIPPPPRPVTSSLKQLHGNIGQLSPPPPPPPPSKALVGSKAVRRVPEVMEFYRSLMRRDS 179

Query: 602 QKENKNGVIGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEI 781
             +NK+  IG + V  SR+MIGEIEN+STYL++IKSDVE  G L+N L  EV++A+F +I
Sbjct: 180 HVDNKSSPIGFLQVVNSRNMIGEIENKSTYLSAIKSDVEMQGGLINNLTGEVETASFNKI 239

Query: 782 SDV 790
           SDV
Sbjct: 240 SDV 242


>ref|XP_017232931.1| PREDICTED: protein CHUP1, chloroplastic isoform X2 [Daucus carota
           subsp. sativus]
          Length = 407

 Score =  184 bits (467), Expect = 3e-52
 Identities = 113/243 (46%), Positives = 146/243 (60%), Gaps = 16/243 (6%)
 Frame = +2

Query: 110 LELKMPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERK 289
           + L   SG +   + FLK+E EAS ++I SLE+EN ELKQE++RLKAQ+ TLKAHD ERK
Sbjct: 1   MPLNRSSGEESMRIAFLKKEFEASLLRIHSLENENQELKQEVARLKAQVHTLKAHDSERK 60

Query: 290 SALWKKLQSSMDSSKDFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXX- 466
           S LWKKLQ+S+D  K      QK    V+V      F+K S  D  A A +         
Sbjct: 61  SVLWKKLQNSLDV-KVPEKSQQKPSFSVEVPERSQVFQKFSPKDDLADAAVKKEIPAIRV 119

Query: 467 -------------QWLQKRV--MAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDT 601
                        + L   +  ++            VGSKAVRRVP VMEFYRSLM+RD+
Sbjct: 120 AIPPPPRPVTSSLKQLHGNIGQLSPPPPPPPPSKALVGSKAVRRVPEVMEFYRSLMRRDS 179

Query: 602 QKENKNGVIGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEI 781
             +NK+  IG + V  SR+MIGEIEN+STYL++IKSDVE  G L+N L  EV++A+F +I
Sbjct: 180 HVDNKSSPIGFLQVVNSRNMIGEIENKSTYLSAIKSDVEMQGGLINNLTGEVETASFNKI 239

Query: 782 SDV 790
           SDV
Sbjct: 240 SDV 242


>ref|XP_017232930.1| PREDICTED: protein CHUP1, chloroplastic isoform X1 [Daucus carota
           subsp. sativus]
          Length = 426

 Score =  184 bits (467), Expect = 5e-52
 Identities = 113/243 (46%), Positives = 146/243 (60%), Gaps = 16/243 (6%)
 Frame = +2

Query: 110 LELKMPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERK 289
           + L   SG +   + FLK+E EAS ++I SLE+EN ELKQE++RLKAQ+ TLKAHD ERK
Sbjct: 1   MPLNRSSGEESMRIAFLKKEFEASLLRIHSLENENQELKQEVARLKAQVHTLKAHDSERK 60

Query: 290 SALWKKLQSSMDSSKDFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXX- 466
           S LWKKLQ+S+D  K      QK    V+V      F+K S  D  A A +         
Sbjct: 61  SVLWKKLQNSLDV-KVPEKSQQKPSFSVEVPERSQVFQKFSPKDDLADAAVKKEIPAIRV 119

Query: 467 -------------QWLQKRV--MAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDT 601
                        + L   +  ++            VGSKAVRRVP VMEFYRSLM+RD+
Sbjct: 120 AIPPPPRPVTSSLKQLHGNIGQLSPPPPPPPPSKALVGSKAVRRVPEVMEFYRSLMRRDS 179

Query: 602 QKENKNGVIGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEI 781
             +NK+  IG + V  SR+MIGEIEN+STYL++IKSDVE  G L+N L  EV++A+F +I
Sbjct: 180 HVDNKSSPIGFLQVVNSRNMIGEIENKSTYLSAIKSDVEMQGGLINNLTGEVETASFNKI 239

Query: 782 SDV 790
           SDV
Sbjct: 240 SDV 242


>ref|XP_020536551.1| protein CHUP1, chloroplastic isoform X2 [Jatropha curcas]
          Length = 389

 Score =  171 bits (432), Expect = 3e-47
 Identities = 108/229 (47%), Positives = 143/229 (62%), Gaps = 5/229 (2%)
 Frame = +2

Query: 119 KMPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSAL 298
           KMP   D + + +LK+ELE+S ++ DSLE EN EL+QEI RLKAQI++LKAHD ERKS L
Sbjct: 9   KMPQEEDESLIIYLKKELESSLIRNDSLEKENRELRQEIIRLKAQITSLKAHDNERKSLL 68

Query: 299 WKKLQSSMDSSKDF--NDPPQKT-KLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQ 469
           WKKLQ+S DSS+    ++PP    KLQ+       P  K + I  + K            
Sbjct: 69  WKKLQNSNDSSQQIRPSEPPDNNPKLQLP-----NPPPKLTPIFNQNK------------ 111

Query: 470 WLQKRVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKE-NKNGVIGVVSVT 646
            L   + A            +GSK+VRRVP V+EFYR L +++   E NKN       VT
Sbjct: 112 -LPPPISAAPPPPPPPSKMFMGSKSVRRVPEVVEFYRLLTRKNVSSENNKNHSTAATPVT 170

Query: 647 -YSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
            +S +MIGEIENRST+L++IKSDVEK  + +N+LI+EV+SAAF  IS+V
Sbjct: 171 AFSPNMIGEIENRSTHLSAIKSDVEKRKEFINYLIKEVESAAFKGISEV 219


>ref|XP_012079077.1| protein CHUP1, chloroplastic isoform X1 [Jatropha curcas]
          Length = 401

 Score =  171 bits (432), Expect = 4e-47
 Identities = 108/229 (47%), Positives = 143/229 (62%), Gaps = 5/229 (2%)
 Frame = +2

Query: 119 KMPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSAL 298
           KMP   D + + +LK+ELE+S ++ DSLE EN EL+QEI RLKAQI++LKAHD ERKS L
Sbjct: 9   KMPQEEDESLIIYLKKELESSLIRNDSLEKENRELRQEIIRLKAQITSLKAHDNERKSLL 68

Query: 299 WKKLQSSMDSSKDF--NDPPQKT-KLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQ 469
           WKKLQ+S DSS+    ++PP    KLQ+       P  K + I  + K            
Sbjct: 69  WKKLQNSNDSSQQIRPSEPPDNNPKLQLP-----NPPPKLTPIFNQNK------------ 111

Query: 470 WLQKRVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKE-NKNGVIGVVSVT 646
            L   + A            +GSK+VRRVP V+EFYR L +++   E NKN       VT
Sbjct: 112 -LPPPISAAPPPPPPPSKMFMGSKSVRRVPEVVEFYRLLTRKNVSSENNKNHSTAATPVT 170

Query: 647 -YSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
            +S +MIGEIENRST+L++IKSDVEK  + +N+LI+EV+SAAF  IS+V
Sbjct: 171 AFSPNMIGEIENRSTHLSAIKSDVEKRKEFINYLIKEVESAAFKGISEV 219


>gb|KDP45786.1| hypothetical protein JCGZ_17393 [Jatropha curcas]
          Length = 381

 Score =  169 bits (427), Expect = 1e-46
 Identities = 107/228 (46%), Positives = 142/228 (62%), Gaps = 5/228 (2%)
 Frame = +2

Query: 122 MPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALW 301
           MP   D + + +LK+ELE+S ++ DSLE EN EL+QEI RLKAQI++LKAHD ERKS LW
Sbjct: 1   MPQEEDESLIIYLKKELESSLIRNDSLEKENRELRQEIIRLKAQITSLKAHDNERKSLLW 60

Query: 302 KKLQSSMDSSKDF--NDPPQKT-KLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQW 472
           KKLQ+S DSS+    ++PP    KLQ+       P  K + I  + K             
Sbjct: 61  KKLQNSNDSSQQIRPSEPPDNNPKLQLP-----NPPPKLTPIFNQNK------------- 102

Query: 473 LQKRVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKE-NKNGVIGVVSVT- 646
           L   + A            +GSK+VRRVP V+EFYR L +++   E NKN       VT 
Sbjct: 103 LPPPISAAPPPPPPPSKMFMGSKSVRRVPEVVEFYRLLTRKNVSSENNKNHSTAATPVTA 162

Query: 647 YSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
           +S +MIGEIENRST+L++IKSDVEK  + +N+LI+EV+SAAF  IS+V
Sbjct: 163 FSPNMIGEIENRSTHLSAIKSDVEKRKEFINYLIKEVESAAFKGISEV 210


>gb|PNT20110.1| hypothetical protein POPTR_009G073600v3 [Populus trichocarpa]
          Length = 386

 Score =  168 bits (426), Expect = 2e-46
 Identities = 101/223 (45%), Positives = 130/223 (58%)
 Frame = +2

Query: 122 MPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALW 301
           M    D + + +LK+E+EA+ ++ DSLE EN EL+QE+ RLKAQIS+LKAHD ERKS LW
Sbjct: 1   MRKEEDESLIIYLKKEVEAALLRTDSLEKENQELQQEVVRLKAQISSLKAHDNERKSMLW 60

Query: 302 KKLQSSMDSSKDFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQWLQK 481
           KKLQ+ +DSSK        T + +    D      SS  +  +              L  
Sbjct: 61  KKLQNPIDSSK--------TDVFLQKQSDFVKVTPSSPKEVNSNK------------LSP 100

Query: 482 RVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKENKNGVIGVVSVTYSRDM 661
                           VGSK VRRVP V EFYR + +RD   EN+     +  V ++  M
Sbjct: 101 APAPAPPPPPPPPKMSVGSKTVRRVPEVAEFYRLVTRRDVHMENRINSAAIPVVAFTPSM 160

Query: 662 IGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
           IGEIENRSTYL++IKSDVEK  + +NFLI+EV+SAAF EISDV
Sbjct: 161 IGEIENRSTYLSAIKSDVEKQKEFINFLIKEVESAAFKEISDV 203


>gb|EOY34534.1| F10K1.18 protein, putative isoform 4 [Theobroma cacao]
 gb|EOY34535.1| F10K1.18 protein, putative isoform 4 [Theobroma cacao]
          Length = 328

 Score =  163 bits (412), Expect = 7e-45
 Identities = 102/235 (43%), Positives = 141/235 (60%), Gaps = 21/235 (8%)
 Frame = +2

Query: 149 VTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALWKKLQSSMDS 328
           +T LK+ELEA+  +  SLE EN ELKQE++RLKAQIS+LKAHD ERKS LWKKL +S+D+
Sbjct: 13  ITRLKKELEAALGRNGSLEKENQELKQEVARLKAQISSLKAHDNERKSMLWKKLHNSIDN 72

Query: 329 SK---------DFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQWLQ- 478
           S          DF    ++ +L+ + V     F++ + +  E ++ +         ++  
Sbjct: 73  SNADASLQKSSDFLKVSEQ-RLEAENVYPRPSFQELA-VRKERQSKVPKPPPRSNSFISP 130

Query: 479 --KRV---------MAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKENKNGV 625
             K V         +              GS++VRRVP V+E YRSL ++DT  ENK   
Sbjct: 131 SPKEVSENKVTTPSVPPPPPPPLPSKLLAGSRSVRRVPEVVELYRSLTRKDTNMENKTNA 190

Query: 626 IGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
                + +SR+MIGEIENRSTY+++IKSDVEK  + +NFLI EVQSAAF +ISDV
Sbjct: 191 AATPVLAFSRNMIGEIENRSTYVSAIKSDVEKQKEFINFLISEVQSAAFKDISDV 245


>ref|XP_007016912.2| PREDICTED: protein CHUP1, chloroplastic isoform X3 [Theobroma
           cacao]
          Length = 430

 Score =  164 bits (416), Expect = 2e-44
 Identities = 103/235 (43%), Positives = 141/235 (60%), Gaps = 21/235 (8%)
 Frame = +2

Query: 149 VTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALWKKLQSSMDS 328
           +T LK+ELEA+  +  SLE EN ELKQE++RLKAQIS+LKAHD ERKS LWKKL +S+D+
Sbjct: 13  ITRLKKELEAALGRNGSLEKENQELKQEVARLKAQISSLKAHDNERKSMLWKKLHNSIDN 72

Query: 329 SK---------DFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQWLQ- 478
           S          DF   P++  L+ + V     F++ + +  E ++ +         ++  
Sbjct: 73  SNADASLQKSSDFLKVPEQG-LEAENVYPRPSFQELA-VRKERQSKVPKPPPRSNSFISP 130

Query: 479 --KRV---------MAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKENKNGV 625
             K V         +              GS++VRRVP V+E YRSL ++DT  ENK   
Sbjct: 131 SPKEVSENKVTTPSVPAPPPPPLPSKLLAGSRSVRRVPEVVELYRSLTRKDTNMENKTNA 190

Query: 626 IGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
                + +SR+MIGEIENRSTY+++IKSDVEK  + +NFLI EVQSAAF +ISDV
Sbjct: 191 AATPVLAFSRNMIGEIENRSTYVSAIKSDVEKQKEFINFLISEVQSAAFKDISDV 245


>ref|XP_022753610.1| protein CHUP1, chloroplastic-like [Durio zibethinus]
          Length = 532

 Score =  166 bits (421), Expect = 2e-44
 Identities = 102/240 (42%), Positives = 138/240 (57%), Gaps = 4/240 (1%)
 Frame = +2

Query: 83  PIFTFTTSSLELKMPSGNDGAT---VTFLKRELEASFVKIDSLESENHELKQEISRLKAQ 253
           P+        E KMP  +D +    +T LK ELE S  + DS+E EN ELKQE++RLKA 
Sbjct: 14  PVTLLKKVEKERKMPLEDDESEFCEITRLKMELETSLARNDSMEKENQELKQEVARLKAH 73

Query: 254 ISTLKAHDLERKSALWKKLQSSMDSSKDFNDP-PQKTKLQVDVVLDHAPFKKSSTIDTEA 430
           IS+LKAHD ERKS LWKKL +S D+S    DP PQK+          + F K     +EA
Sbjct: 74  ISSLKAHDNERKSMLWKKLHNSSDNSN--TDPSPQKS----------SDFMKVLEQSSEA 121

Query: 431 KAMLXXXXXXXXQWLQKRVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKE 610
           + +           +    +              G ++VRRVP V++ YRSL ++D   E
Sbjct: 122 ENVYPRPSFQELNKVNAPSVPVPPPPPLPSKLLAGFRSVRRVPEVVDLYRSLTRKDANME 181

Query: 611 NKNGVIGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
           NK        + ++R+MIGEIENRSTY+++IKSDV+K  + +NFLI EVQSA+F +ISDV
Sbjct: 182 NKTNATTTPELAFTRNMIGEIENRSTYVSAIKSDVQKQKEFINFLISEVQSASFKDISDV 241


>ref|XP_019178642.1| PREDICTED: protein CHUP1, chloroplastic [Ipomoea nil]
          Length = 409

 Score =  163 bits (413), Expect = 3e-44
 Identities = 97/224 (43%), Positives = 132/224 (58%), Gaps = 10/224 (4%)
 Frame = +2

Query: 149 VTFLKRELEASFVKIDSLESENHELKQEISRLKAQIST-LKAHDLERKSALWKKLQSSMD 325
           ++ LKRE EAS  +++ LE EN EL+QEI RL+AQ++   K+HDLERKS LWKK+QSS +
Sbjct: 12  ISLLKREFEASLARVNFLEKENQELRQEIVRLRAQVNNAFKSHDLERKSMLWKKVQSSPE 71

Query: 326 SSKDFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQWLQKRVM----- 490
           S        QK            P + +  ++   K  +        Q+ QK V      
Sbjct: 72  SKIADKSQVQKPSF---------PAETTEAVEPANKKEMVTKPAYYYQFPQKPVCKLPSL 122

Query: 491 ----AXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKENKNGVIGVVSVTYSRD 658
               A             GSKA+RRVPAVME YRSL+KRD QKE +   I  ++    ++
Sbjct: 123 PAGPAPPPPPPPPPSKTGGSKALRRVPAVMELYRSLVKRDAQKERRTNAIASMAALNPKN 182

Query: 659 MIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
           MIGEIENRST+L ++KSDVE +G+ +  L+RE++SAAFTEIS+V
Sbjct: 183 MIGEIENRSTHLLAVKSDVETHGEAIEALLREIESAAFTEISEV 226


>gb|EOY34536.1| F10K1.18 protein, putative isoform 6 [Theobroma cacao]
          Length = 420

 Score =  163 bits (412), Expect = 6e-44
 Identities = 102/235 (43%), Positives = 141/235 (60%), Gaps = 21/235 (8%)
 Frame = +2

Query: 149 VTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALWKKLQSSMDS 328
           +T LK+ELEA+  +  SLE EN ELKQE++RLKAQIS+LKAHD ERKS LWKKL +S+D+
Sbjct: 13  ITRLKKELEAALGRNGSLEKENQELKQEVARLKAQISSLKAHDNERKSMLWKKLHNSIDN 72

Query: 329 SK---------DFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQWLQ- 478
           S          DF    ++ +L+ + V     F++ + +  E ++ +         ++  
Sbjct: 73  SNADASLQKSSDFLKVSEQ-RLEAENVYPRPSFQELA-VRKERQSKVPKPPPRSNSFISP 130

Query: 479 --KRV---------MAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKENKNGV 625
             K V         +              GS++VRRVP V+E YRSL ++DT  ENK   
Sbjct: 131 SPKEVSENKVTTPSVPPPPPPPLPSKLLAGSRSVRRVPEVVELYRSLTRKDTNMENKTNA 190

Query: 626 IGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
                + +SR+MIGEIENRSTY+++IKSDVEK  + +NFLI EVQSAAF +ISDV
Sbjct: 191 AATPVLAFSRNMIGEIENRSTYVSAIKSDVEKQKEFINFLISEVQSAAFKDISDV 245


>gb|EOY34531.1| F10K1.18 protein, putative isoform 1 [Theobroma cacao]
          Length = 430

 Score =  163 bits (412), Expect = 7e-44
 Identities = 102/235 (43%), Positives = 141/235 (60%), Gaps = 21/235 (8%)
 Frame = +2

Query: 149 VTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALWKKLQSSMDS 328
           +T LK+ELEA+  +  SLE EN ELKQE++RLKAQIS+LKAHD ERKS LWKKL +S+D+
Sbjct: 13  ITRLKKELEAALGRNGSLEKENQELKQEVARLKAQISSLKAHDNERKSMLWKKLHNSIDN 72

Query: 329 SK---------DFNDPPQKTKLQVDVVLDHAPFKKSSTIDTEAKAMLXXXXXXXXQWLQ- 478
           S          DF    ++ +L+ + V     F++ + +  E ++ +         ++  
Sbjct: 73  SNADASLQKSSDFLKVSEQ-RLEAENVYPRPSFQELA-VRKERQSKVPKPPPRSNSFISP 130

Query: 479 --KRV---------MAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRDTQKENKNGV 625
             K V         +              GS++VRRVP V+E YRSL ++DT  ENK   
Sbjct: 131 SPKEVSENKVTTPSVPPPPPPPLPSKLLAGSRSVRRVPEVVELYRSLTRKDTNMENKTNA 190

Query: 626 IGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTEISDV 790
                + +SR+MIGEIENRSTY+++IKSDVEK  + +NFLI EVQSAAF +ISDV
Sbjct: 191 AATPVLAFSRNMIGEIENRSTYVSAIKSDVEKQKEFINFLISEVQSAAFKDISDV 245


>ref|XP_014503946.1| protein CHUP1, chloroplastic isoform X3 [Vigna radiata var.
           radiata]
          Length = 419

 Score =  160 bits (406), Expect = 4e-43
 Identities = 95/244 (38%), Positives = 143/244 (58%), Gaps = 21/244 (8%)
 Frame = +2

Query: 122 MPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALW 301
           M  G + + +T LK++LE    + + L+ EN EL++E+ RLK+Q+ +LKAH+LERKS LW
Sbjct: 7   MLQGENESEITSLKKKLEVHMARDELLQKENQELREEVGRLKSQVISLKAHNLERKSFLW 66

Query: 302 KKLQSSMDSSKDFNDPPQKTKLQVDVVL-----------------DHAPFKKSSTI---- 418
           KK+Q S+D + + ++P Q     V V+                  D AP K+   I    
Sbjct: 67  KKIQKSIDGNNN-SEPIQLKASPVQVITCEKSSENANIHTNPDFQDSAPRKEKPAIVPAP 125

Query: 419 DTEAKAMLXXXXXXXXQWLQKRVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRD 598
                A L        + L+ + +A            VG KAVRRVP V+E YRSL ++D
Sbjct: 126 PPRPSAALLLPLHRKEKALKAQPIAPPPPPTPPKLSLVGLKAVRRVPEVIELYRSLTRKD 185

Query: 599 TQKENKNGVIGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTE 778
              ENK    G+ +V +SR+MI EIENRSTYL++IKS+V++ G+ ++FLI+EV+SA+F +
Sbjct: 186 ANMENKIHSNGIPTVAFSRNMIEEIENRSTYLSAIKSEVQRQGEFISFLIKEVESASFAD 245

Query: 779 ISDV 790
           +S+V
Sbjct: 246 VSEV 249


>gb|KOM47416.1| hypothetical protein LR48_Vigan07g112000 [Vigna angularis]
          Length = 374

 Score =  159 bits (403), Expect = 4e-43
 Identities = 92/244 (37%), Positives = 140/244 (57%), Gaps = 21/244 (8%)
 Frame = +2

Query: 122 MPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALW 301
           M  G + + +T LK++LE    + + L+ EN EL++E+ RLK+Q+ +LKAH+LERKS LW
Sbjct: 1   MLQGENESEITSLKKKLEVHMARDELLQKENQELREEVGRLKSQVISLKAHNLERKSVLW 60

Query: 302 KKLQSSMDSSKDFNDPPQKTKLQVDVVL-----------------DHAPFKKSSTI---- 418
           KK+Q S+D + +      K    V V+                  + AP K+   I    
Sbjct: 61  KKIQKSIDGNNNSEPIQLKASSPVQVITCEKSSENANIHTNPDFQESAPRKEKPAIVPAP 120

Query: 419 DTEAKAMLXXXXXXXXQWLQKRVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRD 598
                A L        + L+ + +A            VG KAVRRVP V+E YRSL ++D
Sbjct: 121 PPRPSAALLLPLHKKEKVLKMQPIAPPPPPTPPKLSLVGLKAVRRVPEVIELYRSLTRKD 180

Query: 599 TQKENKNGVIGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTE 778
              ENK    G+ ++ +SR+MI EIENRSTYL++IKS+V++ G+ ++FLI+EV+SA+F +
Sbjct: 181 ANMENKIHSNGIPTIAFSRNMIEEIENRSTYLSAIKSEVQRQGEFISFLIKEVESASFAD 240

Query: 779 ISDV 790
           +S+V
Sbjct: 241 VSEV 244


>ref|XP_014503945.1| protein CHUP1, chloroplastic isoform X2 [Vigna radiata var.
           radiata]
          Length = 424

 Score =  160 bits (406), Expect = 5e-43
 Identities = 95/244 (38%), Positives = 143/244 (58%), Gaps = 21/244 (8%)
 Frame = +2

Query: 122 MPSGNDGATVTFLKRELEASFVKIDSLESENHELKQEISRLKAQISTLKAHDLERKSALW 301
           M  G + + +T LK++LE    + + L+ EN EL++E+ RLK+Q+ +LKAH+LERKS LW
Sbjct: 7   MLQGENESEITSLKKKLEVHMARDELLQKENQELREEVGRLKSQVISLKAHNLERKSFLW 66

Query: 302 KKLQSSMDSSKDFNDPPQKTKLQVDVVL-----------------DHAPFKKSSTI---- 418
           KK+Q S+D + + ++P Q     V V+                  D AP K+   I    
Sbjct: 67  KKIQKSIDGNNN-SEPIQLKASPVQVITCEKSSENANIHTNPDFQDSAPRKEKPAIVPAP 125

Query: 419 DTEAKAMLXXXXXXXXQWLQKRVMAXXXXXXXXXXXXVGSKAVRRVPAVMEFYRSLMKRD 598
                A L        + L+ + +A            VG KAVRRVP V+E YRSL ++D
Sbjct: 126 PPRPSAALLLPLHRKEKALKAQPIAPPPPPTPPKLSLVGLKAVRRVPEVIELYRSLTRKD 185

Query: 599 TQKENKNGVIGVVSVTYSRDMIGEIENRSTYLTSIKSDVEKYGQLLNFLIREVQSAAFTE 778
              ENK    G+ +V +SR+MI EIENRSTYL++IKS+V++ G+ ++FLI+EV+SA+F +
Sbjct: 186 ANMENKIHSNGIPTVAFSRNMIEEIENRSTYLSAIKSEVQRQGEFISFLIKEVESASFAD 245

Query: 779 ISDV 790
           +S+V
Sbjct: 246 VSEV 249

