BLASTX nr result

ID: Mentha24_contig00008183 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00008183
         (1570 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007024349.1| Actin binding family protein, putative isofo...   353   1e-94
ref|XP_007024348.1| Actin binding family protein, putative isofo...   353   1e-94
ref|XP_004155990.1| PREDICTED: protein CHUP1, chloroplastic-like...   344   6e-92
ref|XP_004141788.1| PREDICTED: protein CHUP1, chloroplastic-like...   344   6e-92
gb|EXB74603.1| hypothetical protein L484_026300 [Morus notabilis]     338   4e-90
ref|XP_004235634.1| PREDICTED: protein CHUP1, chloroplastic-like...   325   3e-86
ref|XP_003627081.1| Protein CHUP1 [Medicago truncatula] gi|35552...   323   1e-85
ref|XP_006585558.1| PREDICTED: protein CHUP1, chloroplastic-like...   319   2e-84
ref|XP_006343028.1| PREDICTED: protein CHUP1, chloroplastic-like...   318   4e-84
ref|XP_002515939.1| conserved hypothetical protein [Ricinus comm...   316   2e-83
ref|XP_007214940.1| hypothetical protein PRUPE_ppa002785mg [Prun...   316   2e-83
ref|XP_006465715.1| PREDICTED: protein CHUP1, chloroplastic-like...   314   8e-83
ref|XP_007135615.1| hypothetical protein PHAVU_010G143700g [Phas...   313   1e-82
ref|XP_007135614.1| hypothetical protein PHAVU_010G143700g [Phas...   313   1e-82
ref|XP_002298248.2| hypothetical protein POPTR_0001s19210g [Popu...   313   2e-82
ref|XP_004302842.1| PREDICTED: protein CHUP1, chloroplastic-like...   312   3e-82
ref|XP_006426846.1| hypothetical protein CICLE_v10025160mg [Citr...   311   5e-82
ref|XP_004510323.1| PREDICTED: protein CHUP1, chloroplastic-like...   310   1e-81
ref|XP_004510324.1| PREDICTED: protein CHUP1, chloroplastic-like...   305   4e-80
ref|XP_003546609.1| PREDICTED: protein CHUP1, chloroplastic-like...   298   3e-78

>ref|XP_007024349.1| Actin binding family protein, putative isoform 2 [Theobroma cacao]
            gi|508779715|gb|EOY26971.1| Actin binding family protein,
            putative isoform 2 [Theobroma cacao]
          Length = 630

 Score =  353 bits (906), Expect = 1e-94
 Identities = 211/435 (48%), Positives = 267/435 (61%), Gaps = 13/435 (2%)
 Frame = +2

Query: 74   YYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQLDNKRLQAKVADYEKVASELESAE 253
            YYGLKEQE    ELQNRL+I+NMEAKL+  KIESLQ +N+RL+++VAD+ KV +ELE+A 
Sbjct: 184  YYGLKEQETAALELQNRLKINNMEAKLFTLKIESLQSENRRLESQVADHAKVVAELETAR 243

Query: 254  AKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEKNPVEINQDADTQSQXXXXXXXXXX 433
            ++IKLL+KKLR EAEQNREQIL LQ+RV ++Q+QE   +  NQD +++ Q          
Sbjct: 244  SRIKLLKKKLRHEAEQNREQILNLQKRVARLQEQELKALADNQDIESKLQRLKVLEGEAD 303

Query: 434  XXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEVQALKEESQLLRQENEIFRKEIDQL 613
                 N SL+ EN ELAQK++S Q LA S L+D E +AL E S  LRQENE   K+I+QL
Sbjct: 304  ELRKSNRSLQTENSELAQKLESTQILANSVLEDPETEALNEMSNCLRQENEDLTKQIEQL 363

Query: 614  QADRCTDVEELVYLRWINACLRYELRNYQPGPDETVARDLSKTLSPKSEEKAKKLILAYA 793
            QADRC DVEELVYLRWINACLRYELRNYQP P +TVARDLSK+LSPKSEEKAKKLIL YA
Sbjct: 364  QADRCADVEELVYLRWINACLRYELRNYQPPPGKTVARDLSKSLSPKSEEKAKKLILEYA 423

Query: 794  NREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDDLPTDKYQDSKTSHRNKKRVFAKLM 973
            + EG   +  +  DF  D+WS SQ+SY TD+GE DD   +    +KT++  K + F  L 
Sbjct: 424  HTEGMGDRGMNSMDFDCDQWSSSQASYGTDTGELDDSSFENSSATKTTNSGKIKFFKNLR 483

Query: 974  KLLRGKDNDHHLPTTPSTPRERAASVDDVLIRYSVSCADGGGDGPTKXXXXXXXXXXRQS 1153
            +LLRGKD+ HH     ST   +   ++DV    S + + G G+                S
Sbjct: 484  RLLRGKDSHHHHSQVSST--SKTDHLEDV---DSPTWSSGRGNDSITMLQSHSDRVTTPS 538

Query: 1154 FDQRNTTAE-----XXXXXXXXXXXXIFRSFDS--------IGGYDDDSTSGFRPGKETQ 1294
                  + +                   RS D         I G DD S S      +  
Sbjct: 539  LSSCRPSLDIPRWRSLNVDHIKDVENFRRSSDGSSYGYKRFILGRDDASESPLEHLLDQD 598

Query: 1295 SAAKNDLVKYAEALK 1339
            S +K+DLVK+AE LK
Sbjct: 599  SDSKSDLVKFAEVLK 613


>ref|XP_007024348.1| Actin binding family protein, putative isoform 1 [Theobroma cacao]
            gi|508779714|gb|EOY26970.1| Actin binding family protein,
            putative isoform 1 [Theobroma cacao]
          Length = 629

 Score =  353 bits (906), Expect = 1e-94
 Identities = 211/435 (48%), Positives = 267/435 (61%), Gaps = 13/435 (2%)
 Frame = +2

Query: 74   YYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQLDNKRLQAKVADYEKVASELESAE 253
            YYGLKEQE    ELQNRL+I+NMEAKL+  KIESLQ +N+RL+++VAD+ KV +ELE+A 
Sbjct: 183  YYGLKEQETAALELQNRLKINNMEAKLFTLKIESLQSENRRLESQVADHAKVVAELETAR 242

Query: 254  AKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEKNPVEINQDADTQSQXXXXXXXXXX 433
            ++IKLL+KKLR EAEQNREQIL LQ+RV ++Q+QE   +  NQD +++ Q          
Sbjct: 243  SRIKLLKKKLRHEAEQNREQILNLQKRVARLQEQELKALADNQDIESKLQRLKVLEGEAD 302

Query: 434  XXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEVQALKEESQLLRQENEIFRKEIDQL 613
                 N SL+ EN ELAQK++S Q LA S L+D E +AL E S  LRQENE   K+I+QL
Sbjct: 303  ELRKSNRSLQTENSELAQKLESTQILANSVLEDPETEALNEMSNCLRQENEDLTKQIEQL 362

Query: 614  QADRCTDVEELVYLRWINACLRYELRNYQPGPDETVARDLSKTLSPKSEEKAKKLILAYA 793
            QADRC DVEELVYLRWINACLRYELRNYQP P +TVARDLSK+LSPKSEEKAKKLIL YA
Sbjct: 363  QADRCADVEELVYLRWINACLRYELRNYQPPPGKTVARDLSKSLSPKSEEKAKKLILEYA 422

Query: 794  NREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDDLPTDKYQDSKTSHRNKKRVFAKLM 973
            + EG   +  +  DF  D+WS SQ+SY TD+GE DD   +    +KT++  K + F  L 
Sbjct: 423  HTEGMGDRGMNSMDFDCDQWSSSQASYGTDTGELDDSSFENSSATKTTNSGKIKFFKNLR 482

Query: 974  KLLRGKDNDHHLPTTPSTPRERAASVDDVLIRYSVSCADGGGDGPTKXXXXXXXXXXRQS 1153
            +LLRGKD+ HH     ST   +   ++DV    S + + G G+                S
Sbjct: 483  RLLRGKDSHHHHSQVSST--SKTDHLEDV---DSPTWSSGRGNDSITMLQSHSDRVTTPS 537

Query: 1154 FDQRNTTAE-----XXXXXXXXXXXXIFRSFDS--------IGGYDDDSTSGFRPGKETQ 1294
                  + +                   RS D         I G DD S S      +  
Sbjct: 538  LSSCRPSLDIPRWRSLNVDHIKDVENFRRSSDGSSYGYKRFILGRDDASESPLEHLLDQD 597

Query: 1295 SAAKNDLVKYAEALK 1339
            S +K+DLVK+AE LK
Sbjct: 598  SDSKSDLVKFAEVLK 612


>ref|XP_004155990.1| PREDICTED: protein CHUP1, chloroplastic-like [Cucumis sativus]
          Length = 624

 Score =  344 bits (883), Expect = 6e-92
 Identities = 212/460 (46%), Positives = 279/460 (60%), Gaps = 13/460 (2%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +R L+SKV                YYGLKEQE  + ELQNRL+I+NMEAKL+  KIESL+
Sbjct: 153  IRYLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTFKIESLE 212

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             DN+RL+++V D+ K  S+LE+A AKIK L+KKLR+EAEQNR QIL LQ+RV+K+QDQE 
Sbjct: 213  ADNRRLESQVCDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQKRVLKLQDQEH 272

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
               + N+DA  + Q               N  L++EN +L +++D+ Q LA S L+DQE 
Sbjct: 273  KTNQSNKDAQIKLQKIEDLEKEIEELRKSNLRLEIENSDLGRRLDATQFLANSLLEDQEK 332

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
            ++LKEE++ L +ENE   KEI+QLQA R  DVEELVYLRWINACLRYELRN+QP   +T 
Sbjct: 333  ESLKEETERLTRENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNFQPPAGKTA 392

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDD 901
            ARDLSKTLSPKSEEKAKKLIL YAN EG  GK  +++DF  D+WS SQ+S  TD G+PDD
Sbjct: 393  ARDLSKTLSPKSEEKAKKLILDYANTEGNEGKSMNVTDFDSDQWSSSQASSHTDPGDPDD 452

Query: 902  LPTDKYQDSKTSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDV-LIRYSV 1078
              TD    +KT   NK +  +KL KLL+GK +  ++        + AASV+D     YS 
Sbjct: 453  STTDFPSTAKTG-SNKIKFISKLRKLLKGKGSQQNMTL---LAEKSAASVEDSDSPCYST 508

Query: 1079 SCADG----GGDGPTKXXXXXXXXXXRQSFDQRNTTAEXXXXXXXXXXXXIFRSFD---- 1234
            S + G      +G               S D     ++            I R+ D    
Sbjct: 509  SNSTGTNATRAEGQAIGYATPLLNSSGHSMDFHRLQSQ--KEDDVKIEDSIRRNSDVGCV 566

Query: 1235 ---SIGGYDDDSTSGFR-PGKETQSAAKNDLVKYAEALKN 1342
                + G D  S S +R   ++T+S  K++L+KYAE LK+
Sbjct: 567  NKRFVVGSDQLSDSSYRSQNQDTESTEKSELMKYAEVLKD 606


>ref|XP_004141788.1| PREDICTED: protein CHUP1, chloroplastic-like [Cucumis sativus]
          Length = 635

 Score =  344 bits (883), Expect = 6e-92
 Identities = 212/460 (46%), Positives = 279/460 (60%), Gaps = 13/460 (2%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +R L+SKV                YYGLKEQE  + ELQNRL+I+NMEAKL+  KIESL+
Sbjct: 164  IRYLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTFKIESLE 223

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             DN+RL+++V D+ K  S+LE+A AKIK L+KKLR+EAEQNR QIL LQ+RV+K+QDQE 
Sbjct: 224  ADNRRLESQVCDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQKRVLKLQDQEH 283

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
               + N+DA  + Q               N  L++EN +L +++D+ Q LA S L+DQE 
Sbjct: 284  KTNQSNKDAQIKLQKIEDLEKEIEELRKSNLRLEIENSDLGRRLDATQFLANSLLEDQEK 343

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
            ++LKEE++ L +ENE   KEI+QLQA R  DVEELVYLRWINACLRYELRN+QP   +T 
Sbjct: 344  ESLKEETERLTRENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNFQPPAGKTA 403

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDD 901
            ARDLSKTLSPKSEEKAKKLIL YAN EG  GK  +++DF  D+WS SQ+S  TD G+PDD
Sbjct: 404  ARDLSKTLSPKSEEKAKKLILDYANTEGNEGKSMNVTDFDSDQWSSSQASSHTDPGDPDD 463

Query: 902  LPTDKYQDSKTSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDV-LIRYSV 1078
              TD    +KT   NK +  +KL KLL+GK +  ++        + AASV+D     YS 
Sbjct: 464  STTDFPSTAKTG-SNKIKFISKLRKLLKGKGSQQNMTL---LAEKSAASVEDSDSPCYST 519

Query: 1079 SCADG----GGDGPTKXXXXXXXXXXRQSFDQRNTTAEXXXXXXXXXXXXIFRSFD---- 1234
            S + G      +G               S D     ++            I R+ D    
Sbjct: 520  SNSTGTNATRAEGQAIGYATPLLNSSGHSMDFHRLQSQ--KEDDVKIEDSIRRNSDVGCV 577

Query: 1235 ---SIGGYDDDSTSGFR-PGKETQSAAKNDLVKYAEALKN 1342
                + G D  S S +R   ++T+S  K++L+KYAE LK+
Sbjct: 578  NKRFVVGSDQLSDSSYRSQNQDTESTEKSELMKYAEVLKD 617


>gb|EXB74603.1| hypothetical protein L484_026300 [Morus notabilis]
          Length = 644

 Score =  338 bits (867), Expect = 4e-90
 Identities = 203/437 (46%), Positives = 263/437 (60%), Gaps = 15/437 (3%)
 Frame = +2

Query: 74   YYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQLDNKRLQAKVADYEKVASELESAE 253
            YYG+KEQE T+ ELQNRL+++NMEAKL++ KIESL  +N+RL+A+VA +    +ELE+A 
Sbjct: 191  YYGVKEQETTVMELQNRLKLNNMEAKLFSLKIESLHAENQRLEAQVAGHANAVTELEAAR 250

Query: 254  AKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEKNPVEINQDADTQSQXXXXXXXXXX 433
            AKIKLL+KKLRFEAEQN+EQIL LQ+RV KMQD+E   +  N D   + +          
Sbjct: 251  AKIKLLKKKLRFEAEQNKEQILNLQQRVAKMQDEEYKSLASNSDVQLKLKRIKDLEGEIE 310

Query: 434  XXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEVQALKEESQLLRQENEIFRKEIDQL 613
                 N  L+LEN ELAQ+++S + LA   L+D E  ALKEES  LRQ NE  R+EI+QL
Sbjct: 311  ELRKSNLMLQLENSELAQRLESTKILANYVLEDPETDALKEESVRLRQANEDLRQEIEQL 370

Query: 614  QADRCTDVEELVYLRWINACLRYELRNYQPGPDETVARDLSKTLSPKSEEKAKKLILAYA 793
            +ADRC D+EELVYLRWINACLRYELR+YQP   + VARDLSKTLSPKSEEKAK+LIL YA
Sbjct: 371  KADRCADIEELVYLRWINACLRYELRDYQPATGKMVARDLSKTLSPKSEEKAKQLILEYA 430

Query: 794  NREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDDLPTDKYQDSKTSHRNKKRVFAKLM 973
            N EG   K  SI DF  D WS SQ+S+ TDS + D+   D    +KT+  +KK+ F KL 
Sbjct: 431  NTEGIGEKGISIMDFDSDRWSSSQASF-TDSVDLDESSLDNSSAAKTNTSSKKKFFNKLR 489

Query: 974  KLLRGKDNDHHLPTTPSTPRERAASVDDVLIRYSVSCADGG-GDGPTKXXXXXXXXXXRQ 1150
            KL+RG+D  H         +  +   D    RY  S   G                  R 
Sbjct: 490  KLVRGRDGHHSSQVLSGDHKPESVEQDGDSPRYIPSTLTGDYAVAEDNRFRTSSQNLSRP 549

Query: 1151 SFD---------QRNTTAEXXXXXXXXXXXXIFRSFDSIGGY-----DDDSTSGFRPGKE 1288
            S D         +     +            +++SF ++GG       +DST+     K 
Sbjct: 550  SLDLSRLRSLKEREVVDVQSVQRNSDVGSSYVYKSF-ALGGEIANDPTNDSTAKDEIEKH 608

Query: 1289 TQSAAKNDLVKYAEALK 1339
            + S  K++L+KYAEAL+
Sbjct: 609  SDSTDKSELLKYAEALR 625


>ref|XP_004235634.1| PREDICTED: protein CHUP1, chloroplastic-like [Solanum lycopersicum]
          Length = 626

 Score =  325 bits (834), Expect = 3e-86
 Identities = 195/453 (43%), Positives = 271/453 (59%), Gaps = 7/453 (1%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            ++SL++ V                YYGLKEQE  + ELQN+L+I+N+EAK++  KIESL 
Sbjct: 161  IKSLKNTVKTLQERERILEIQLLEYYGLKEQETAIMELQNQLKINNVEAKIFGLKIESLT 220

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             D  RL+AKV+DY K   ELE+A+ KIK L+KK+R EA+ ++E IL LQE+VMK+ DQEK
Sbjct: 221  EDKMRLEAKVSDYGKAVCELEAAKVKIKQLKKKVRSEADHSKEHILALQEKVMKLHDQEK 280

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
              VE   D   + +               N SL+ EN +LA +++S+Q +A S L+++E 
Sbjct: 281  KNVEAESDVQLKLRRLEDLEIQTVELNKSNQSLRKENSDLAHRLESVQIIASSVLENEET 340

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
            +ALK+E+  L+++NE   K++++LQADRCTD EELVYLRWINACLR+ELRNYQP   +T+
Sbjct: 341  EALKKETLQLKKQNEDLAKDVERLQADRCTDAEELVYLRWINACLRHELRNYQPDTGKTI 400

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNG-KDPSISDFYFDEWSVSQSSYLTDSGEPD 898
            ARDLSKTLSPKSEEKAK+LIL YAN+E   G ++ ++SD    EWS SQ+S+LTDS E D
Sbjct: 401  ARDLSKTLSPKSEEKAKQLILEYANKEESQGEREVNVSDL-DSEWSSSQTSFLTDSVEFD 459

Query: 899  DLPTDKYQDSKTSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDVLI---R 1069
            +  TD     KT   +KK+VF+KLM+LLRGK      P + S+  +   +++D +     
Sbjct: 460  ETSTDNSSPCKTQSSSKKKVFSKLMRLLRGKGR----PLSRSSSMDTVHTLEDNVAGHSS 515

Query: 1070 YSVSCADGGGDGPTKXXXXXXXXXXRQSFDQRNTTAEXXXXXXXXXXXXIFRS-FDSIGG 1246
            YS    D G +G             +Q  D  + +                 S  +S GG
Sbjct: 516  YSPGYIDSGANGLNIRSRTSSQGSSKQFLDLHSVSQGSRSGKLGENNNYQMNSRQNSDGG 575

Query: 1247 YDDDSTSGFRPGKET--QSAAKNDLVKYAEALK 1339
                S     P + T      K +L+KYA+ALK
Sbjct: 576  SSSGSRRLDSPQENTSKNEPEKAELLKYAKALK 608


>ref|XP_003627081.1| Protein CHUP1 [Medicago truncatula] gi|355521103|gb|AET01557.1|
            Protein CHUP1 [Medicago truncatula]
          Length = 594

 Score =  323 bits (829), Expect = 1e-85
 Identities = 194/455 (42%), Positives = 275/455 (60%), Gaps = 8/455 (1%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +R L++ V                Y GL+EQE  + ELQNRL+I N+EAK++N K+E+LQ
Sbjct: 141  IRKLKNMVIMLQERERSLEVQLLEYCGLREQETVVMELQNRLKISNIEAKMFNLKVETLQ 200

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             +N+RL+A+VA + KV +ELE+++ K+KLL+KK+++EAEQN+E I+ L+++V K+QD E 
Sbjct: 201  SENRRLEAQVAGHAKVLAELEASKTKVKLLKKKIKYEAEQNKEHIINLKQKVSKLQDLEC 260

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
              V  +Q+   + +               N  L+++N +LA ++DS Q LA S L+D E 
Sbjct: 261  KAVAKDQEIQMKLKRLSDLEAEAEQCRKSNLRLQMDNSDLATRLDSTQILANSVLEDPEA 320

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
             AL+EES  LRQ NE   KEI+QL+ADRCTDVEELVYL+W+NAC R+ELRNYQP P +TV
Sbjct: 321  DALREESDRLRQANEDLTKEIEQLKADRCTDVEELVYLKWLNACFRHELRNYQPAPGKTV 380

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKDPSISDFYFDEWSVSQ-SSYLTDSGEPD 898
            ARDLSK LSP SE+KAK+LIL YAN EG      SISDF  D+WS S+ SSY+TD G+ D
Sbjct: 381  ARDLSKNLSPTSEKKAKQLILEYANAEG----RTSISDFDSDQWSSSRASSYVTDPGDSD 436

Query: 899  DL-PTDKYQDSKTSH-RNKKRVFAKLMKLLRGKDNDHHLP---TTPSTPRERAASVDDVL 1063
            D  P +   D++ ++ +NK ++F KLMKL+RGKD+ +HL    T+    R R  S++D L
Sbjct: 437  DYSPLENPSDARVNNAKNKSKIFGKLMKLIRGKDSSNHLSGSVTSVEKSRSREDSINDGL 496

Query: 1064 IRYSVSCADGGGDGPTKXXXXXXXXXXRQSFDQRNTTA--EXXXXXXXXXXXXIFRSFDS 1237
                            K          + S D  +T +  E             F    S
Sbjct: 497  ----------------KSEYETLTDMSQNSIDLNSTLSLKEETRRNSDVGSLKNFGRRKS 540

Query: 1238 IGGYDDDSTSGFRPGKETQSAAKNDLVKYAEALKN 1342
            + G     T  F    ++ ++ K++L+KYAEALK+
Sbjct: 541  VAGDLKFITQSF---SDSYASEKSNLIKYAEALKD 572


>ref|XP_006585558.1| PREDICTED: protein CHUP1, chloroplastic-like isoform X1 [Glycine max]
            gi|571472287|ref|XP_006585559.1| PREDICTED: protein
            CHUP1, chloroplastic-like isoform X2 [Glycine max]
            gi|571472289|ref|XP_006585560.1| PREDICTED: protein
            CHUP1, chloroplastic-like isoform X3 [Glycine max]
          Length = 640

 Score =  319 bits (817), Expect = 2e-84
 Identities = 185/452 (40%), Positives = 272/452 (60%), Gaps = 5/452 (1%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +R LRS +                Y G+KEQE  + ELQNRL+I NME K++N K+E+LQ
Sbjct: 176  IRKLRSMIIMLQERETNLEVQLLEYCGIKEQEAAVMELQNRLKISNMETKMFNLKVETLQ 235

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             +N+RL+A+V D+ K+ +ELE+ + K+K L+KKL++EAEQNRE I+ L+++V K+QD E 
Sbjct: 236  SENRRLEAQVVDHAKLMTELETTKTKVKFLKKKLKYEAEQNREHIMNLKQKVAKLQDNEY 295

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
            N    +Q+   + +               N  L+L+N +L +++DS Q LA + L+D E 
Sbjct: 296  NASANDQEIQIKLKRLKDLECEAEQLRKSNLRLQLDNSDLVRRLDSTQILANAVLEDPEA 355

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
             ALKEE + LR+ENE   KE++QL ADRC D+EELVYLRWINACLR+ELR+YQP P +TV
Sbjct: 356  HALKEEGERLRRENEGLTKELEQLHADRCLDLEELVYLRWINACLRHELRSYQPPPGKTV 415

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDD 901
            ARDLSK+LSP SE+KAK+LIL YA+ EG      S+SD   D+WS SQ+S+LTD GE +D
Sbjct: 416  ARDLSKSLSPTSEKKAKQLILEYASNEGRG----SVSDMDSDQWSSSQASFLTDPGERED 471

Query: 902  -LPTDKYQDSK-TSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDVLI--- 1066
              P D   + K T++ +K R+F KLM+L+RGK++ +      +T +E++ S +D      
Sbjct: 472  YFPLDNSSELKATNNTSKSRIFGKLMRLIRGKESQNQ--RDRATSKEKSMSREDSNTNSP 529

Query: 1067 RYSVSCADGGGDGPTKXXXXXXXXXXRQSFDQRNTTAEXXXXXXXXXXXXIFRSFDSIGG 1246
             +S+S + G     ++             F+Q  +  E              ++      
Sbjct: 530  HFSLSISTGTEGLRSENATPSATSRTSFDFNQTMSMKEESSRNSDSHTPGSSKNLSPRRT 589

Query: 1247 YDDDSTSGFRPGKETQSAAKNDLVKYAEALKN 1342
               D  +  R   E+  + K++LVKYAEA+K+
Sbjct: 590  RSVDFKNHLRSFSESSGSEKSNLVKYAEAIKD 621


>ref|XP_006343028.1| PREDICTED: protein CHUP1, chloroplastic-like [Solanum tuberosum]
          Length = 487

 Score =  318 bits (815), Expect = 4e-84
 Identities = 186/429 (43%), Positives = 263/429 (61%), Gaps = 7/429 (1%)
 Frame = +2

Query: 74   YYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQLDNKRLQAKVADYEKVASELESAE 253
            YYGLKEQE  + ELQN+L+I+++EAK++  KIESL  D  RL+AKV+DY KV  ELE+A+
Sbjct: 46   YYGLKEQETAIMELQNQLKINSVEAKIFGLKIESLTADKMRLEAKVSDYGKVVCELEAAK 105

Query: 254  AKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEKNPVEINQDADTQSQXXXXXXXXXX 433
             KIK L+KK+R EA+Q++E IL LQE+VMK+ DQEK  VE   +   + +          
Sbjct: 106  VKIKQLKKKVRSEADQSKEHILALQEKVMKLHDQEKKIVEAESNVQLKLRRLKDLENQSD 165

Query: 434  XXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEVQALKEESQLLRQENEIFRKEIDQL 613
                 N SL+ EN +LA +++S+Q +A S L+++E +ALKEE+  L+++NE   K++++L
Sbjct: 166  ELNKSNQSLRKENSDLAHRLESVQIIASSVLENEETEALKEETLQLKKQNEDLAKDVERL 225

Query: 614  QADRCTDVEELVYLRWINACLRYELRNYQPGPDETVARDLSKTLSPKSEEKAKKLILAYA 793
            QADRCTD EELVYLRWINACLR+ELRNYQP   +T+ARDLSKTLSPKSEEKAK+LIL YA
Sbjct: 226  QADRCTDAEELVYLRWINACLRHELRNYQPVTGKTIARDLSKTLSPKSEEKAKQLILEYA 285

Query: 794  NREGCNG-KDPSISDFYFDEWSVSQSSYLTDSGEPDDLPTDKYQDSKTSHRNKKRVFAKL 970
            N+E   G ++ ++SD    EWS S++S+LTDS E D+  TD     KT   +K +VF+KL
Sbjct: 286  NKEESQGEREVNVSDL-DSEWSSSRTSFLTDSVEFDETSTDNSSPRKTQSSSKNKVFSKL 344

Query: 971  MKLLRGKDNDHHLPTTPSTPRERAASVDDVLI---RYSVSCADGGGDGPTKXXXXXXXXX 1141
            M+L+RGK      P + S+  +   +++D +     YS    D G +G            
Sbjct: 345  MRLVRGKGR----PLSRSSSMDMVHTLEDNVAGHSSYSPGYIDSGVNGLNIRSRTSSQGS 400

Query: 1142 XRQSFDQRNTTAEXXXXXXXXXXXXIFRSFDSIGGYDDDSTSGFRPGKETQS---AAKND 1312
             +Q  D  + +                 S  +  G     +      +E +S     K +
Sbjct: 401  SKQFLDLHSVSQGSRSCKLGENNNYPMNSRQNSDGGSSSGSRRLDSPQENRSKDEPEKAE 460

Query: 1313 LVKYAEALK 1339
            L+KYA+ LK
Sbjct: 461  LLKYAKVLK 469


>ref|XP_002515939.1| conserved hypothetical protein [Ricinus communis]
            gi|223544844|gb|EEF46359.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 640

 Score =  316 bits (810), Expect = 2e-83
 Identities = 168/311 (54%), Positives = 217/311 (69%)
 Frame = +2

Query: 74   YYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQLDNKRLQAKVADYEKVASELESAE 253
            +YGLKEQE  M ELQNRL+I NME KL+N KIESLQ DN+RLQA+ AD+ K+ +EL++A 
Sbjct: 181  FYGLKEQETAMMELQNRLKISNMETKLFNLKIESLQADNQRLQAQFADHAKIVAELDAAR 240

Query: 254  AKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEKNPVEINQDADTQSQXXXXXXXXXX 433
            +KIKLLRK+L+ EA QN+E IL LQ+RV ++Q++E      + D   + Q          
Sbjct: 241  SKIKLLRKRLKSEAGQNKEHILVLQKRVSRLQEEELKAAANDSDIKVKLQRLKDLEVEAE 300

Query: 434  XXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEVQALKEESQLLRQENEIFRKEIDQL 613
                 NH L LEN ELA++++S + LA S L+D E +AL+E S  L+QEN+   KE++QL
Sbjct: 301  DLRNSNHRLTLENSELARQLESAKILANSVLEDPETEALRELSDKLKQENDHLVKEVEQL 360

Query: 614  QADRCTDVEELVYLRWINACLRYELRNYQPGPDETVARDLSKTLSPKSEEKAKKLILAYA 793
             ADRC D EELVYLRW+NACLRYELRN+QP   +TVARDLSK+LSPKSEEKAK+LIL YA
Sbjct: 361  HADRCKDCEELVYLRWVNACLRYELRNFQPAHGKTVARDLSKSLSPKSEEKAKQLILEYA 420

Query: 794  NREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDDLPTDKYQDSKTSHRNKKRVFAKLM 973
            N E    K  +I DF  D+WS S +SY+ DSG+ D    D     KTS+ +K + F KL 
Sbjct: 421  NSEEMGEKGINIMDFESDQWSSSHTSYVIDSGDFD----DSVVSPKTSNSSKIKFFNKLR 476

Query: 974  KLLRGKDNDHH 1006
            +L+RGK+  HH
Sbjct: 477  RLIRGKEIQHH 487


>ref|XP_007214940.1| hypothetical protein PRUPE_ppa002785mg [Prunus persica]
            gi|462411090|gb|EMJ16139.1| hypothetical protein
            PRUPE_ppa002785mg [Prunus persica]
          Length = 633

 Score =  316 bits (809), Expect = 2e-83
 Identities = 199/458 (43%), Positives = 259/458 (56%), Gaps = 13/458 (2%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +R LRS V                YYGLKEQE  + ELQN+L+I+ MEAKL+  KIESL+
Sbjct: 169  IRHLRSTVRMLRERERSLEVQLLEYYGLKEQETAVMELQNQLKINTMEAKLFTLKIESLE 228

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             +N+R++A+VAD+ KV  ELE+  AKIK+L+KKLRFEAEQN+EQIL L++RV K  D E 
Sbjct: 229  AENRRVEAQVADHAKVVGELEATRAKIKILKKKLRFEAEQNKEQILNLKKRVEKFHDSEA 288

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
                 N +     +               N  L++EN ELA+ ++S Q LA S L+D E 
Sbjct: 289  AD---NSEIQLNLRRLKDLEGEAEELRKSNFQLQIENSELARSLESTQILANSILEDPEA 345

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
            +ALKE S  LRQENE   KEI QLQ DRC+DVEELVYLRWINACLRYELRN+QP   +T 
Sbjct: 346  EALKEASARLRQENEDLTKEIQQLQVDRCSDVEELVYLRWINACLRYELRNFQPPTGKTA 405

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDD 901
            ARDLSK+LSP+SEEKAK+LI+ YAN EG  G+   + DF  D+WS S +S+ TDS E DD
Sbjct: 406  ARDLSKSLSPRSEEKAKQLIVEYANTEGM-GEKGMMVDFDSDQWSSSHASFFTDSPEFDD 464

Query: 902  LPTDKYQDSKTSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDVLIRYSVS 1081
               D    +KT+   K ++F KL +L+ GKD  H+     ST  +R    +D    Y  S
Sbjct: 465  FSVDNSSATKTNTTTKSKLFNKLRRLVLGKD-IHYENRVLST--DRTGYAEDNESPYCSS 521

Query: 1082 ----CADGGGDGPTKXXXXXXXXXXRQSFD---------QRNTTAEXXXXXXXXXXXXIF 1222
                 A  G +G +           R S D         Q     +             +
Sbjct: 522  SKSTAAYTGPEGQSNVFATSSRSSSRASLDLPRWRSPKQQDTKDVQSVQRHSDVGSSPAY 581

Query: 1223 RSFDSIGGYDDDSTSGFRPGKETQSAAKNDLVKYAEAL 1336
            ++F   G  D       +  +++ S  K +LVKYAEAL
Sbjct: 582  KTFSREGSAD----LPLKSDQDSDSTEKAELVKYAEAL 615


>ref|XP_006465715.1| PREDICTED: protein CHUP1, chloroplastic-like isoform X1 [Citrus
            sinensis] gi|568822595|ref|XP_006465716.1| PREDICTED:
            protein CHUP1, chloroplastic-like isoform X2 [Citrus
            sinensis]
          Length = 624

 Score =  314 bits (804), Expect = 8e-83
 Identities = 179/354 (50%), Positives = 230/354 (64%), Gaps = 1/354 (0%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +++L+S V                YYGLKEQE  + ELQNRL+++NME +L N KIESLQ
Sbjct: 170  VKNLKSMVQMLQDREKNLEVELLEYYGLKEQETIVMELQNRLKLNNMEGRLLNLKIESLQ 229

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             DN+RL+A+VAD+ K  SELE+A+ KIKLL+KKLR EAEQNREQIL +QERV K+Q+Q  
Sbjct: 230  ADNRRLEAQVADHAKTVSELEAAKTKIKLLKKKLRTEAEQNREQILAVQERVTKLQEQAH 289

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
                I+ D  ++ Q               N  L+LEN +LA++++S Q L  S L+D E 
Sbjct: 290  KAAAIDPDTQSRLQRLKVLEAEAEDLRKSNMKLQLENSQLARRLESTQMLEISVLEDGER 349

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
            +AL E SQ LR+EN    KE+++L AD+C  VEELVYL+WINACLRYELRNYQP   +TV
Sbjct: 350  EALNEMSQRLREENTSLSKEVEKLHADKCAGVEELVYLKWINACLRYELRNYQPPAGKTV 409

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKDPSISDFYFDEWSVSQSSYLTDS-GEPD 898
            ARDLSKTLSP SEEKAK+LIL YA+ EG      +I +   D WS SQ+S +TDS    D
Sbjct: 410  ARDLSKTLSPNSEEKAKQLILEYAHAEG----HGNIMNIDSDHWSTSQASCITDSENHHD 465

Query: 899  DLPTDKYQDSKTSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDV 1060
            D   DK   +K S  NK + F KL KL+RGKD          +P +R++SVD +
Sbjct: 466  DSSADKSFSTKISSSNKTKFFHKLRKLVRGKD---------VSPLKRSSSVDKI 510


>ref|XP_007135615.1| hypothetical protein PHAVU_010G143700g [Phaseolus vulgaris]
            gi|561008660|gb|ESW07609.1| hypothetical protein
            PHAVU_010G143700g [Phaseolus vulgaris]
          Length = 552

 Score =  313 bits (802), Expect = 1e-82
 Identities = 183/449 (40%), Positives = 267/449 (59%), Gaps = 2/449 (0%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +R LRS +                Y G++EQE  + ELQNRL+I NMEAK++N K+ +LQ
Sbjct: 91   MRKLRSMIRMLQERETNLQVQLLEYCGIREQEAAVMELQNRLKISNMEAKMFNLKVVTLQ 150

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             +N+RL+A+VAD+ K+ SELE+A+ K+K L+KK+++EAEQNRE I+ L+++V K+QD E 
Sbjct: 151  SENRRLEAQVADHAKLTSELETAKTKVKFLKKKIKYEAEQNREHIMNLKQKVGKLQDHEF 210

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
                 +Q+   + +               N  L++EN +L++++DS Q LA + L+D E 
Sbjct: 211  KVAANDQEIQIKLKRLKDLDCETEQLRKSNLRLQMENSDLSRRLDSTQLLANAVLEDPEA 270

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
            QALKEE + LRQENE   KE++QL ADRC+D+EELVYLRWINACLR+ELR+YQ    +T 
Sbjct: 271  QALKEEGERLRQENEGLAKELEQLHADRCSDLEELVYLRWINACLRHELRSYQLPSGKTA 330

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDD 901
            ARDLSK+LSP SE+KAK+LIL YA+ E       SISD   D+WS SQ+S+ TD GE +D
Sbjct: 331  ARDLSKSLSPTSEKKAKQLILEYASNE----VRASISDMDSDQWSSSQTSFFTDPGEHED 386

Query: 902  LPT-DKYQDSKTSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDV-LIRYS 1075
                D   ++K ++  K R+F KLM+L+RGKD+ H      S  +E++ S +D     +S
Sbjct: 387  YSLHDASSEAKLNNSTKSRIFGKLMRLIRGKDSHHQRGQIMS--KEKSISREDSNSSHFS 444

Query: 1076 VSCADGGGDGPTKXXXXXXXXXXRQSFDQRNTTAEXXXXXXXXXXXXIFRSFDSIGGYDD 1255
            +S + G     ++             ++Q  +  +              ++F        
Sbjct: 445  LSMSTGNECLRSEYTTPSATSRTSFDYNQSQSLKDDSGRNSDSHTPGSSKNFSPNRRSSA 504

Query: 1256 DSTSGFRPGKETQSAAKNDLVKYAEALKN 1342
            DS +      E+ +  K +L KYAEALKN
Sbjct: 505  DSKNRLDSFSESSAMEKTNLAKYAEALKN 533


>ref|XP_007135614.1| hypothetical protein PHAVU_010G143700g [Phaseolus vulgaris]
            gi|561008659|gb|ESW07608.1| hypothetical protein
            PHAVU_010G143700g [Phaseolus vulgaris]
          Length = 635

 Score =  313 bits (802), Expect = 1e-82
 Identities = 183/449 (40%), Positives = 267/449 (59%), Gaps = 2/449 (0%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +R LRS +                Y G++EQE  + ELQNRL+I NMEAK++N K+ +LQ
Sbjct: 174  MRKLRSMIRMLQERETNLQVQLLEYCGIREQEAAVMELQNRLKISNMEAKMFNLKVVTLQ 233

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             +N+RL+A+VAD+ K+ SELE+A+ K+K L+KK+++EAEQNRE I+ L+++V K+QD E 
Sbjct: 234  SENRRLEAQVADHAKLTSELETAKTKVKFLKKKIKYEAEQNREHIMNLKQKVGKLQDHEF 293

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
                 +Q+   + +               N  L++EN +L++++DS Q LA + L+D E 
Sbjct: 294  KVAANDQEIQIKLKRLKDLDCETEQLRKSNLRLQMENSDLSRRLDSTQLLANAVLEDPEA 353

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
            QALKEE + LRQENE   KE++QL ADRC+D+EELVYLRWINACLR+ELR+YQ    +T 
Sbjct: 354  QALKEEGERLRQENEGLAKELEQLHADRCSDLEELVYLRWINACLRHELRSYQLPSGKTA 413

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDD 901
            ARDLSK+LSP SE+KAK+LIL YA+ E       SISD   D+WS SQ+S+ TD GE +D
Sbjct: 414  ARDLSKSLSPTSEKKAKQLILEYASNE----VRASISDMDSDQWSSSQTSFFTDPGEHED 469

Query: 902  LPT-DKYQDSKTSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDV-LIRYS 1075
                D   ++K ++  K R+F KLM+L+RGKD+ H      S  +E++ S +D     +S
Sbjct: 470  YSLHDASSEAKLNNSTKSRIFGKLMRLIRGKDSHHQRGQIMS--KEKSISREDSNSSHFS 527

Query: 1076 VSCADGGGDGPTKXXXXXXXXXXRQSFDQRNTTAEXXXXXXXXXXXXIFRSFDSIGGYDD 1255
            +S + G     ++             ++Q  +  +              ++F        
Sbjct: 528  LSMSTGNECLRSEYTTPSATSRTSFDYNQSQSLKDDSGRNSDSHTPGSSKNFSPNRRSSA 587

Query: 1256 DSTSGFRPGKETQSAAKNDLVKYAEALKN 1342
            DS +      E+ +  K +L KYAEALKN
Sbjct: 588  DSKNRLDSFSESSAMEKTNLAKYAEALKN 616


>ref|XP_002298248.2| hypothetical protein POPTR_0001s19210g [Populus trichocarpa]
            gi|550347663|gb|EEE83053.2| hypothetical protein
            POPTR_0001s19210g [Populus trichocarpa]
          Length = 655

 Score =  313 bits (801), Expect = 2e-82
 Identities = 168/313 (53%), Positives = 222/313 (70%), Gaps = 1/313 (0%)
 Frame = +2

Query: 74   YYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQLDNKRLQAKVADYEKVASELESAE 253
            +YGLKEQE  + ELQNRL+I+NMEAKL+  KIESL+ DN+RLQA+V D+ KV +EL++A 
Sbjct: 199  FYGLKEQEAAVMELQNRLKINNMEAKLFALKIESLRADNRRLQAQVVDHAKVVAELDAAR 258

Query: 254  AKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEKNPVEINQDADTQSQXXXXXXXXXX 433
            +K++L++KKLR EAEQN+EQIL L++RV ++Q+QE    E + D   + Q          
Sbjct: 259  SKLELVKKKLRSEAEQNKEQILSLKKRVSRLQEQELMSAETDSDIKMKLQRLKDLEIEAE 318

Query: 434  XXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV-QALKEESQLLRQENEIFRKEIDQ 610
                 N  L LEN EL  +++S Q LA S L+D EV + L+++   LRQENE   KE++Q
Sbjct: 319  ELRKSNSRLHLENSELFSQLESTQILANSILEDPEVIKTLRKQGNRLRQENEDLAKEVEQ 378

Query: 611  LQADRCTDVEELVYLRWINACLRYELRNYQPGPDETVARDLSKTLSPKSEEKAKKLILAY 790
            LQADRC+DVEELVYLRW+NACLRYE+RN+QP   +TVARDLSK+LSP+SE KAK+LIL +
Sbjct: 379  LQADRCSDVEELVYLRWVNACLRYEMRNFQPPHGKTVARDLSKSLSPRSEMKAKQLILEF 438

Query: 791  ANREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDDLPTDKYQDSKTSHRNKKRVFAKL 970
            AN EG   K  +I +F  D WS SQ+SY+TD+GE DD         KTSH  K ++F KL
Sbjct: 439  ANTEGMAEKGINIMEFEPDHWSSSQASYITDAGELDD-----PLSPKTSHSGKTKMFHKL 493

Query: 971  MKLLRGKDNDHHL 1009
             KLL GK+  +H+
Sbjct: 494  RKLLLGKETHNHI 506


>ref|XP_004302842.1| PREDICTED: protein CHUP1, chloroplastic-like [Fragaria vesca subsp.
            vesca]
          Length = 626

 Score =  312 bits (799), Expect = 3e-82
 Identities = 191/425 (44%), Positives = 254/425 (59%), Gaps = 3/425 (0%)
 Frame = +2

Query: 74   YYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQLDNKRLQAKVADYEKVASELESAE 253
            YYGLKEQE  + EL+NRL+I +MEAKL++ KIESLQ +N+RL+ + +D+ KV +ELE+A+
Sbjct: 192  YYGLKEQETAVMELENRLKISSMEAKLFSLKIESLQAENRRLEGQASDHAKVVAELEAAK 251

Query: 254  AKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEKNPVEINQDADTQSQXXXXXXXXXX 433
            AK++ L+KKLR EAEQNREQIL L+ RV  +QD E      N +   + +          
Sbjct: 252  AKVRTLKKKLRSEAEQNREQILSLKRRVENLQDNEA--AAFNSEIQLKLRRLKVLEGETE 309

Query: 434  XXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEVQALKEESQLLRQENEIFRKEIDQL 613
                 N  L+L+N +LA++++S Q LA S L+D   +ALKEE + LRQENE  RKEI+QL
Sbjct: 310  ELTASNLKLQLQNSDLARRLESAQVLANSILEDPGAEALKEERERLRQENEELRKEIEQL 369

Query: 614  QADRCTDVEELVYLRWINACLRYELRNYQPGPDETVARDLSKTLSPKSEEKAKKLILAYA 793
              DR +DVEELVYLRWINACLRYELRN+QP   +TVARDLSK+LS +SEEKAK+LIL YA
Sbjct: 370  CVDRSSDVEELVYLRWINACLRYELRNFQPPNGKTVARDLSKSLSHESEEKAKQLILEYA 429

Query: 794  NREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDDLPTDKYQDSKTSHRNKKRVFAKLM 973
            N EG   K   I DF  D W+ S +S LTDSGE DD   D    +KT   +K ++F+KL 
Sbjct: 430  NTEGIGDKGSHI-DFESDRWT-SPTSLLTDSGEYDDFSADHSSATKTHTSSKHKLFSKLR 487

Query: 974  KLLRGKDNDHHLPTTPSTPRERAASVDDVLI---RYSVSCADGGGDGPTKXXXXXXXXXX 1144
            +++RGKD  H    +       A+S   V       S S +    D PT           
Sbjct: 488  RIIRGKDTHHDHNLSEDNCSGYASSSKSVAAYGGHESHSSSRASLDLPT-------VPRW 540

Query: 1145 RQSFDQRNTTAEXXXXXXXXXXXXIFRSFDSIGGYDDDSTSGFRPGKETQSAAKNDLVKY 1324
            R   +  +  +             +++ F   G    DS    R   ++ SA K++L KY
Sbjct: 541  RSPKEHDSKDSHSVQRHSDVGVFPVYKRFILGGEGSSDSPPKDRSDHDSDSAEKSELAKY 600

Query: 1325 AEALK 1339
            AEALK
Sbjct: 601  AEALK 605


>ref|XP_006426846.1| hypothetical protein CICLE_v10025160mg [Citrus clementina]
            gi|557528836|gb|ESR40086.1| hypothetical protein
            CICLE_v10025160mg [Citrus clementina]
          Length = 624

 Score =  311 bits (797), Expect = 5e-82
 Identities = 175/330 (53%), Positives = 222/330 (67%), Gaps = 1/330 (0%)
 Frame = +2

Query: 74   YYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQLDNKRLQAKVADYEKVASELESAE 253
            YYGLKEQE  + ELQNRL+++NME +L N KIESLQ DN+RL+A+VAD+ K  SELE+A+
Sbjct: 194  YYGLKEQETIVMELQNRLKLNNMEGRLLNLKIESLQADNRRLEAQVADHAKTVSELEAAK 253

Query: 254  AKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEKNPVEINQDADTQSQXXXXXXXXXX 433
             KIKLL+KKLR EAEQNREQIL +QERV K+Q+Q      I+ D  ++ Q          
Sbjct: 254  TKIKLLKKKLRTEAEQNREQILAVQERVTKLQEQAHKAAAIDPDTQSRLQRLKVLEAEAE 313

Query: 434  XXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEVQALKEESQLLRQENEIFRKEIDQL 613
                 N  L+LEN +LA++++S Q L  S L+D E +AL E SQ LR+EN    KE+++L
Sbjct: 314  DLRKSNMKLQLENSQLARRLESTQMLEISVLEDGEREALNEMSQRLREENTSLSKEVEKL 373

Query: 614  QADRCTDVEELVYLRWINACLRYELRNYQPGPDETVARDLSKTLSPKSEEKAKKLILAYA 793
             AD+C  VEELVYL+WINACLRYELRNYQP   +TVARDLSKTLSP SEEKAK+LIL YA
Sbjct: 374  HADKCAGVEELVYLKWINACLRYELRNYQPPAGKTVARDLSKTLSPNSEEKAKQLILEYA 433

Query: 794  NREGCNGKDPSISDFYFDEWSVSQSSYLTDS-GEPDDLPTDKYQDSKTSHRNKKRVFAKL 970
            + EG      +I +   D W  SQ+S +TDS    DD   DK   +K S  NK + F KL
Sbjct: 434  HTEG----HGNIMNIDSDHWLTSQASCITDSKNHHDDSSADKSFSTKISSSNKTKFFHKL 489

Query: 971  MKLLRGKDNDHHLPTTPSTPRERAASVDDV 1060
             KL+RGKD          +P +R++SVD +
Sbjct: 490  RKLVRGKD---------VSPLKRSSSVDKI 510


>ref|XP_004510323.1| PREDICTED: protein CHUP1, chloroplastic-like isoform X1 [Cicer
            arietinum]
          Length = 636

 Score =  310 bits (794), Expect = 1e-81
 Identities = 189/457 (41%), Positives = 270/457 (59%), Gaps = 11/457 (2%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +R LR+ +                Y GL+EQE  + ELQNRL+I +MEAK++N K+ +LQ
Sbjct: 166  VRQLRNMIRMLQERERSLEVQLLEYCGLREQETVVMELQNRLKISSMEAKMFNLKVATLQ 225

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             DN+RL+A+V+ + KV +ELE+A+ K+K L++K+R+EAEQNRE I+ L+++V K+Q+ E 
Sbjct: 226  SDNRRLEAQVSGHAKVLAELEAAKTKVKFLKRKIRYEAEQNREHIMNLKQKVSKLQEIES 285

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
                 +++   + +               N  L+ +N +LA+++DS Q LA + L+D E 
Sbjct: 286  KSAACDEEIQMKLKRLNDLEAEVEQWRKSNLRLQKDNSDLARRLDSTQILANAVLEDPEA 345

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
             AL+EES  LR+ENE   KEI+QLQADRCTD+EELVYLRWINACLR+ELR+YQP   +TV
Sbjct: 346  DALREESNSLRRENEGLMKEIEQLQADRCTDLEELVYLRWINACLRHELRHYQPPTGKTV 405

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKD-PSISDFYFDEWSVSQSSYLTDSGEPD 898
            ARDLSK+LSP SE+KAK+LIL YAN    NG+   SISDF  D+WS SQ+SY+TD  E  
Sbjct: 406  ARDLSKSLSPSSEKKAKQLILEYAN----NGEGRTSISDFDSDQWSSSQASYITDCDEYS 461

Query: 899  DL----PTDKYQDSKTSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDVL- 1063
             L     T   +D + +  NK ++F KLMKL+RGKD+  +   + +   E+  S +D + 
Sbjct: 462  PLGNPSNTRDARDVRVNTTNKSKIFGKLMKLMRGKDSSSNQQNSRARSLEKFGSREDSIS 521

Query: 1064 --IRYSVSCA---DGGGDGPTKXXXXXXXXXXRQSFDQRNTTAEXXXXXXXXXXXXIFRS 1228
                +S+S +   D G +G  +          R S D  N T                ++
Sbjct: 522  NSSHFSLSMSARHDSGAEG-LRSEYETPTDASRTSLD-FNGTLSLKEESRRNSDVGSSKN 579

Query: 1229 FDSIGGYDDDSTSGFRPGKETQSAAKNDLVKYAEALK 1339
            F        D         ++ SA K +L+KYAEALK
Sbjct: 580  FSPSKSGSGDLKITAHSFSDSYSAEKANLIKYAEALK 616


>ref|XP_004510324.1| PREDICTED: protein CHUP1, chloroplastic-like isoform X2 [Cicer
            arietinum]
          Length = 597

 Score =  305 bits (781), Expect = 4e-80
 Identities = 186/451 (41%), Positives = 263/451 (58%), Gaps = 5/451 (1%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +R LR+ +                Y GL+EQE  + ELQNRL+I +MEAK++N K+ +LQ
Sbjct: 144  VRQLRNMIRMLQERERSLEVQLLEYCGLREQETVVMELQNRLKISSMEAKMFNLKVATLQ 203

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             DN+RL+A+V+ + KV +ELE+A+ K+K L++K+R+EAEQNRE I+ L+++V K+Q+ E 
Sbjct: 204  SDNRRLEAQVSGHAKVLAELEAAKTKVKFLKRKIRYEAEQNREHIMNLKQKVSKLQEIES 263

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
                 +++   + +               N  L+ +N +LA+++DS Q LA + L+D E 
Sbjct: 264  KSAACDEEIQMKLKRLNDLEAEVEQWRKSNLRLQKDNSDLARRLDSTQILANAVLEDPEA 323

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
             AL+EES  LR+ENE   KEI+QLQADRCTD+EELVYLRWINACLR+ELR+YQP   +TV
Sbjct: 324  DALREESNSLRRENEGLMKEIEQLQADRCTDLEELVYLRWINACLRHELRHYQPPTGKTV 383

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKD-PSISDFYFDEWSVSQSSYLTDSGEPD 898
            ARDLSK+LSP SE+KAK+LIL YAN    NG+   SISDF  D+WS SQ+SY+TD  E  
Sbjct: 384  ARDLSKSLSPSSEKKAKQLILEYAN----NGEGRTSISDFDSDQWSSSQASYITDCDEYS 439

Query: 899  DL----PTDKYQDSKTSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDVLI 1066
             L     T   +D + +  NK ++F KLMKL+RGKD+  +   + +   E+  S +D   
Sbjct: 440  PLGNPSNTRDARDVRVNTTNKSKIFGKLMKLMRGKDSSSNQQNSRARSLEKFGSRED--- 496

Query: 1067 RYSVSCADGGGDGPTKXXXXXXXXXXRQSFDQRNTTAEXXXXXXXXXXXXIFRSFDSIGG 1246
                S  +G      +          R S D  N T                ++F     
Sbjct: 497  ----SITEG-----LRSEYETPTDASRTSLD-FNGTLSLKEESRRNSDVGSSKNFSPSKS 546

Query: 1247 YDDDSTSGFRPGKETQSAAKNDLVKYAEALK 1339
               D         ++ SA K +L+KYAEALK
Sbjct: 547  GSGDLKITAHSFSDSYSAEKANLIKYAEALK 577


>ref|XP_003546609.1| PREDICTED: protein CHUP1, chloroplastic-like isoform X1 [Glycine max]
          Length = 595

 Score =  298 bits (764), Expect = 3e-78
 Identities = 179/449 (39%), Positives = 264/449 (58%), Gaps = 2/449 (0%)
 Frame = +2

Query: 2    LRSLRSKVXXXXXXXXXXXXXXXXYYGLKEQEKTMSELQNRLRIHNMEAKLYNNKIESLQ 181
            +R LR+ +                + GL+EQE  + ELQNRL+   ME K++N K+++LQ
Sbjct: 149  VRQLRNMIRMLQDREQSLEVQLLEFCGLREQETAVMELQNRLKASTMEVKIFNLKVKTLQ 208

Query: 182  LDNKRLQAKVADYEKVASELESAEAKIKLLRKKLRFEAEQNREQILRLQERVMKMQDQEK 361
             +N RL+ +VAD+EKV +ELE+A+A+++LL KK+R E EQNRE+I+ L+++V ++QDQE 
Sbjct: 209  SENWRLKEQVADHEKVLTELENAKAQVELLNKKIRHETEQNREKIITLKQKVSRLQDQEC 268

Query: 362  NPVEINQDADTQSQXXXXXXXXXXXXXXYNHSLKLENLELAQKVDSLQKLAKSALDDQEV 541
                 +QD   + Q               N  L++EN +LA+++DS Q LA + L+D E 
Sbjct: 269  KDAAYDQDIQIKMQKLKYLESEAEELRKSNLRLQIENSDLARRLDSTQILANAFLEDPEA 328

Query: 542  QALKEESQLLRQENEIFRKEIDQLQADRCTDVEELVYLRWINACLRYELRNYQPGPDETV 721
             A+K+ES+ L+QEN    KEI+Q Q+DRC+D+EELVYLRWINACLRYELRNYQ  P +TV
Sbjct: 329  GAVKQESECLKQENVRLMKEIEQFQSDRCSDLEELVYLRWINACLRYELRNYQAPPGKTV 388

Query: 722  ARDLSKTLSPKSEEKAKKLILAYANREGCNGKDPSISDFYFDEWSVSQSSYLTDSGEPDD 901
            A+DLS++LSP SE+KAK+LIL YAN  G      +I DF  D+WS SQ+S +TD GE DD
Sbjct: 389  AKDLSRSLSPMSEKKAKQLILEYANANG----PGNIVDFDIDQWSSSQASSITDFGECDD 444

Query: 902  LPTDKYQDSKTSHRNKKRVFAKLMKLLRGKDNDHHLPTTPSTPRERAASVDDVLIRYSVS 1081
              +     +  ++ N  ++F KL +L++GK + HH   + ++ +E++   D   +  S S
Sbjct: 445  FSSADNSSAARTNTNPTKLFGKLRQLIQGKGSSHH--HSHASSQEKSGYQDSNPLCLSTS 502

Query: 1082 CADGGGDGPTKXXXXXXXXXXRQSFDQRNTTAEXXXXXXXXXXXXIFRSFDS--IGGYDD 1255
                G     +          R S D  + T+               R+ DS  +G  + 
Sbjct: 503  TRSEG----LRSEFATPIATSRTSLDFSSLTSVKEGDR---------RNSDSCVMGSSNK 549

Query: 1256 DSTSGFRPGKETQSAAKNDLVKYAEALKN 1342
             ST       ++    KN+L KYAEALK+
Sbjct: 550  FSTRKKGSFSDSLGLEKNNLEKYAEALKD 578

