BLASTX nr result

ID: Glycyrrhiza30_contig00014464 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza30_contig00014464
         (1831 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_012573389.1 PREDICTED: uncharacterized protein LOC101511271 i...   677   0.0  
XP_012573390.1 PREDICTED: uncharacterized protein LOC101511271 i...   674   0.0  
XP_012573388.1 PREDICTED: uncharacterized protein LOC101511271 i...   674   0.0  
GAU16748.1 hypothetical protein TSUD_199910 [Trifolium subterran...   658   0.0  
KHN45011.1 Protein CHUP1, chloroplastic [Glycine soja]                653   0.0  
XP_006594000.1 PREDICTED: protein CHUP1, chloroplastic-like isof...   653   0.0  
KRH19467.1 hypothetical protein GLYMA_13G118400 [Glycine max] KR...   652   0.0  
XP_003609889.1 hydroxyproline-rich glycoprotein family protein [...   649   0.0  
XP_013458360.1 hydroxyproline-rich glycoprotein family protein [...   649   0.0  
XP_006593995.1 PREDICTED: protein CHUP1, chloroplastic-like isof...   646   0.0  
XP_006593999.1 PREDICTED: protein CHUP1, chloroplastic-like isof...   644   0.0  
XP_006600413.1 PREDICTED: protein CHUP1, chloroplastic-like isof...   642   0.0  
XP_006600414.1 PREDICTED: protein CHUP1, chloroplastic-like isof...   641   0.0  
XP_007154485.1 hypothetical protein PHAVU_003G122900g [Phaseolus...   634   0.0  
XP_019419024.1 PREDICTED: protein CHUP1, chloroplastic [Lupinus ...   627   0.0  
XP_016194601.1 PREDICTED: protein CHUP1, chloroplastic isoform X...   626   0.0  
XP_015945214.1 PREDICTED: protein CHUP1, chloroplastic isoform X...   625   0.0  
XP_016194585.1 PREDICTED: protein CHUP1, chloroplastic isoform X...   626   0.0  
XP_015945204.1 PREDICTED: protein CHUP1, chloroplastic isoform X...   625   0.0  
XP_017411993.1 PREDICTED: protein CHUP1, chloroplastic isoform X...   619   0.0  

>XP_012573389.1 PREDICTED: uncharacterized protein LOC101511271 isoform X2 [Cicer
            arietinum]
          Length = 609

 Score =  677 bits (1748), Expect = 0.0
 Identities = 373/479 (77%), Positives = 404/479 (84%), Gaps = 1/479 (0%)
 Frame = +3

Query: 21   SQKGSREAEDASKVVVVTASRPRRRVGSEEDDTDGXXXXXXXXXXXXXVSENLIKDLQSE 200
            SQKG++EAE+A  VVV    RPRRR  +++ D                +S+NLIK+L+SE
Sbjct: 143  SQKGTKEAEEAKIVVV----RPRRRRTNDDPDEK----EKKEMVEKLEMSDNLIKNLESE 194

Query: 201  VLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSSSGKKEPIGEHQSPKFKDI 380
            V ALKAELDKVK+LNVELESQN KLTQ+LAAAEAKIAAVGS++ +KE IGEHQSPKFKDI
Sbjct: 195  VKALKAELDKVKNLNVELESQNVKLTQNLAAAEAKIAAVGSNNSRKELIGEHQSPKFKDI 254

Query: 381  QKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPETT-SIGRKSXXXXXXXXXXXX 557
            QKLIADKLE SKVK+EA  EV+FVKASIPAPT + AIPETT S+GRK             
Sbjct: 255  QKLIADKLEMSKVKKEANHEVIFVKASIPAPTQNHAIPETTTSLGRK-FPPNLCVMPPPP 313

Query: 558  XXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRKDSKGSVNHQKPAAFSAHSSIV 737
                      AKLAN TQKAP +V+LF  LKNQ+G  +KDSKGS+NH KP A SAHSSIV
Sbjct: 314  PPPPIPSRPLAKLAN-TQKAPAVVQLFHSLKNQDG--KKDSKGSINHHKPIAISAHSSIV 370

Query: 738  GEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDIEDVLKFVDWLDGELSTLADER 917
            GEIQNRSAHLLAIRADI+TKGEFINDLIKKVVDAAYV+IEDVLKFVDWLDGELSTLADER
Sbjct: 371  GEIQNRSAHLLAIRADIQTKGEFINDLIKKVVDAAYVEIEDVLKFVDWLDGELSTLADER 430

Query: 918  AVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCGASLKKMASLLDKSER 1097
            AVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPC ASLKKMASLLDKSER
Sbjct: 431  AVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCAASLKKMASLLDKSER 490

Query: 1098 SIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQASMTLVKMYMKRLTMELESIRNS 1277
            SIQKLI LRNSV RSYQMY+IPTAWMLDSG+ SKIK+ASMTLVKMYMKRLTMELESIRNS
Sbjct: 491  SIQKLITLRNSVTRSYQMYNIPTAWMLDSGITSKIKKASMTLVKMYMKRLTMELESIRNS 550

Query: 1278 DRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIRQRVPRHMAGSRELLAGIPSS 1454
            DRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA E IRQRVP ++AGSRELLAGI SS
Sbjct: 551  DRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAFEGIRQRVPGNLAGSRELLAGIQSS 609


>XP_012573390.1 PREDICTED: uncharacterized protein LOC101511271 isoform X3 [Cicer
            arietinum] XP_012573391.1 PREDICTED: uncharacterized
            protein LOC101511271 isoform X3 [Cicer arietinum]
            XP_012573392.1 PREDICTED: uncharacterized protein
            LOC101511271 isoform X3 [Cicer arietinum]
          Length = 577

 Score =  674 bits (1740), Expect = 0.0
 Identities = 375/480 (78%), Positives = 404/480 (84%), Gaps = 2/480 (0%)
 Frame = +3

Query: 21   SQKGSREAEDASKVVVVTASRPRRRVGSEEDDTDGXXXXXXXXXXXXXVSENLIKDLQSE 200
            SQKG++EAE+A  VVV    RPRRR  +++ D                +S+NLIK+L+SE
Sbjct: 110  SQKGTKEAEEAKIVVV----RPRRRRTNDDPDEK----EKKEMVEKLEMSDNLIKNLESE 161

Query: 201  VLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSS-SGKKEPIGEHQSPKFKD 377
            V ALKAELDKVK+LNVELESQN KLTQ+LAAAEAKIAAVGS+ S KKE IGEHQSPKFKD
Sbjct: 162  VKALKAELDKVKNLNVELESQNVKLTQNLAAAEAKIAAVGSNNSRKKELIGEHQSPKFKD 221

Query: 378  IQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPETT-SIGRKSXXXXXXXXXXX 554
            IQKLIADKLE SKVK+EA  EV+FVKASIPAPT + AIPETT S+GRK            
Sbjct: 222  IQKLIADKLEMSKVKKEANHEVIFVKASIPAPTQNHAIPETTTSLGRK-FPPNLCVMPPP 280

Query: 555  XXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRKDSKGSVNHQKPAAFSAHSSI 734
                       AKLAN TQKAP +V+LF  LKNQ+G  +KDSKGS+NH KP A SAHSSI
Sbjct: 281  PPPPPIPSRPLAKLAN-TQKAPAVVQLFHSLKNQDG--KKDSKGSINHHKPIAISAHSSI 337

Query: 735  VGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDIEDVLKFVDWLDGELSTLADE 914
            VGEIQNRSAHLLAIRADI+TKGEFINDLIKKVVDAAYV+IEDVLKFVDWLDGELSTLADE
Sbjct: 338  VGEIQNRSAHLLAIRADIQTKGEFINDLIKKVVDAAYVEIEDVLKFVDWLDGELSTLADE 397

Query: 915  RAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCGASLKKMASLLDKSE 1094
            RAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPC ASLKKMASLLDKSE
Sbjct: 398  RAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCAASLKKMASLLDKSE 457

Query: 1095 RSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQASMTLVKMYMKRLTMELESIRN 1274
            RSIQKLI LRNSV RSYQMY+IPTAWMLDSG+ SKIK+ASMTLVKMYMKRLTMELESIRN
Sbjct: 458  RSIQKLITLRNSVTRSYQMYNIPTAWMLDSGITSKIKKASMTLVKMYMKRLTMELESIRN 517

Query: 1275 SDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIRQRVPRHMAGSRELLAGIPSS 1454
            SDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA E IRQRVP ++AGSRELLAGI SS
Sbjct: 518  SDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAFEGIRQRVPGNLAGSRELLAGIQSS 577


>XP_012573388.1 PREDICTED: uncharacterized protein LOC101511271 isoform X1 [Cicer
            arietinum]
          Length = 610

 Score =  674 bits (1740), Expect = 0.0
 Identities = 375/480 (78%), Positives = 404/480 (84%), Gaps = 2/480 (0%)
 Frame = +3

Query: 21   SQKGSREAEDASKVVVVTASRPRRRVGSEEDDTDGXXXXXXXXXXXXXVSENLIKDLQSE 200
            SQKG++EAE+A  VVV    RPRRR  +++ D                +S+NLIK+L+SE
Sbjct: 143  SQKGTKEAEEAKIVVV----RPRRRRTNDDPDEK----EKKEMVEKLEMSDNLIKNLESE 194

Query: 201  VLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSS-SGKKEPIGEHQSPKFKD 377
            V ALKAELDKVK+LNVELESQN KLTQ+LAAAEAKIAAVGS+ S KKE IGEHQSPKFKD
Sbjct: 195  VKALKAELDKVKNLNVELESQNVKLTQNLAAAEAKIAAVGSNNSRKKELIGEHQSPKFKD 254

Query: 378  IQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPETT-SIGRKSXXXXXXXXXXX 554
            IQKLIADKLE SKVK+EA  EV+FVKASIPAPT + AIPETT S+GRK            
Sbjct: 255  IQKLIADKLEMSKVKKEANHEVIFVKASIPAPTQNHAIPETTTSLGRK-FPPNLCVMPPP 313

Query: 555  XXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRKDSKGSVNHQKPAAFSAHSSI 734
                       AKLAN TQKAP +V+LF  LKNQ+G  +KDSKGS+NH KP A SAHSSI
Sbjct: 314  PPPPPIPSRPLAKLAN-TQKAPAVVQLFHSLKNQDG--KKDSKGSINHHKPIAISAHSSI 370

Query: 735  VGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDIEDVLKFVDWLDGELSTLADE 914
            VGEIQNRSAHLLAIRADI+TKGEFINDLIKKVVDAAYV+IEDVLKFVDWLDGELSTLADE
Sbjct: 371  VGEIQNRSAHLLAIRADIQTKGEFINDLIKKVVDAAYVEIEDVLKFVDWLDGELSTLADE 430

Query: 915  RAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCGASLKKMASLLDKSE 1094
            RAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPC ASLKKMASLLDKSE
Sbjct: 431  RAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCAASLKKMASLLDKSE 490

Query: 1095 RSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQASMTLVKMYMKRLTMELESIRN 1274
            RSIQKLI LRNSV RSYQMY+IPTAWMLDSG+ SKIK+ASMTLVKMYMKRLTMELESIRN
Sbjct: 491  RSIQKLITLRNSVTRSYQMYNIPTAWMLDSGITSKIKKASMTLVKMYMKRLTMELESIRN 550

Query: 1275 SDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIRQRVPRHMAGSRELLAGIPSS 1454
            SDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA E IRQRVP ++AGSRELLAGI SS
Sbjct: 551  SDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAFEGIRQRVPGNLAGSRELLAGIQSS 610


>GAU16748.1 hypothetical protein TSUD_199910 [Trifolium subterraneum]
          Length = 577

 Score =  658 bits (1697), Expect = 0.0
 Identities = 366/480 (76%), Positives = 391/480 (81%), Gaps = 1/480 (0%)
 Frame = +3

Query: 18   GSQKGSREAEDASKVVVVTASRPRRRVGSEEDDTDGXXXXXXXXXXXXXVSENLIKDLQS 197
            G   G +E E+A KV+VVT  RPRRR    EDD D              VSENLIK LQS
Sbjct: 108  GGGGGGKEVEEA-KVIVVT--RPRRR--RIEDDPD--VKEKKELMEKLEVSENLIKSLQS 160

Query: 198  EVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSSSGKKEPIGEHQSPKFKD 377
            EV ALK ELDKVKSLN++LESQN KL Q+LA+AEAKIAA G+S+ KKEPIGEHQSPKFKD
Sbjct: 161  EVKALKDELDKVKSLNIDLESQNMKLNQNLASAEAKIAASGTSNRKKEPIGEHQSPKFKD 220

Query: 378  IQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPETTSIGRKSXXXXXXXXXXXX 557
            IQKLIADKLERSK+K+EA PEV+FVKASI AP  S+AIPE T +GRKS            
Sbjct: 221  IQKLIADKLERSKIKKEANPEVIFVKASIQAPKPSQAIPEITGLGRKSPPNQCLFPPPPP 280

Query: 558  XXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRKDSKGSVN-HQKPAAFSAHSSI 734
                      AKL+N TQK PPIV LF  +KNQ+G  +KD KGS+N H KP   SAH+SI
Sbjct: 281  PPPPIPSRPLAKLSN-TQKLPPIVPLFHSIKNQDG--KKDLKGSMNQHHKPITNSAHNSI 337

Query: 735  VGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDIEDVLKFVDWLDGELSTLADE 914
            VGEIQNRSAHLLAIR DI+TKGEFIN LIKKVVDAAYVDIEDVL FVDWLDGELSTLADE
Sbjct: 338  VGEIQNRSAHLLAIREDIQTKGEFINGLIKKVVDAAYVDIEDVLNFVDWLDGELSTLADE 397

Query: 915  RAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCGASLKKMASLLDKSE 1094
            RAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPC  SLKKMASLLDKSE
Sbjct: 398  RAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCVTSLKKMASLLDKSE 457

Query: 1095 RSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQASMTLVKMYMKRLTMELESIRN 1274
            RSIQKLI LRNSVMRSYQ Y+IPTAWMLDSG+ SKIKQASMTLVKMYMKRLTMELES R+
Sbjct: 458  RSIQKLIMLRNSVMRSYQTYNIPTAWMLDSGVTSKIKQASMTLVKMYMKRLTMELESNRH 517

Query: 1275 SDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIRQRVPRHMAGSRELLAGIPSS 1454
            SDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA EEIRQRVP H+ GSRELLA I SS
Sbjct: 518  SDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAFEEIRQRVPGHLVGSRELLACIASS 577


>KHN45011.1 Protein CHUP1, chloroplastic [Glycine soja]
          Length = 584

 Score =  653 bits (1685), Expect = 0.0
 Identities = 360/503 (71%), Positives = 399/503 (79%), Gaps = 30/503 (5%)
 Frame = +3

Query: 30   GSREAEDASKVVVVTASRPRRRVG------SEEDDTDGXXXXXXXXXXXXXVSENLIKDL 191
            GS++AE+   V+V   +RPRRRVG      SE+DD+ G             VSENLIK L
Sbjct: 89   GSQKAEEGKIVIV---ARPRRRVGDFGSRKSEDDDSHGKKKKELLQEKLE-VSENLIKSL 144

Query: 192  QSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVG-SSSGKKEPIGEHQSPK 368
            QSEVLAL+ ELD+VKSLNVELESQNTKLTQ+LAAAEAKI+ VG  ++GKKEPIGEH+SPK
Sbjct: 145  QSEVLALREELDRVKSLNVELESQNTKLTQNLAAAEAKISNVGIGNNGKKEPIGEHRSPK 204

Query: 369  FKDIQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPET---------------- 500
            FKDIQKLIA+KLERS+VK+E  PE++F KASI APT S A+PET                
Sbjct: 205  FKDIQKLIAEKLERSRVKKEGTPEIIFAKASISAPTPSYAVPETISVGRKSPPNTCLQPP 264

Query: 501  -------TSIGRKSXXXXXXXXXXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQE 659
                   TS+GR S                      A+LAN TQKAP IVELF  LKN++
Sbjct: 265  PPPPPPITSVGRNSPSNTCLQPPPPPPPPPIPTPPLARLAN-TQKAPTIVELFHSLKNKD 323

Query: 660  GNNRKDSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDA 839
            G  + DSKGSVNHQ+P   SAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDA
Sbjct: 324  G--KIDSKGSVNHQRPVVISAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDA 381

Query: 840  AYVDIEDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEIS 1019
            A+ DIE+VLKFVDWLDG+LS+LADE AVLKHFKWPEKKADAMREAAVEY ELKMLEQEIS
Sbjct: 382  AFTDIEEVLKFVDWLDGKLSSLADECAVLKHFKWPEKKADAMREAAVEYHELKMLEQEIS 441

Query: 1020 SYKDDPDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSK 1199
            SYKDDPDIPCGA+LKKMASLLDKSERSIQ+LIKLR+SV  SYQMY+IPTAWMLDSG+MSK
Sbjct: 442  SYKDDPDIPCGAALKKMASLLDKSERSIQRLIKLRSSVTHSYQMYNIPTAWMLDSGIMSK 501

Query: 1200 IKQASMTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA 1379
            IKQASMTLVK YMKR+TMELESIRNSDRES QDSLLLQGVHFAYRAHQF GGLDSET+CA
Sbjct: 502  IKQASMTLVKTYMKRVTMELESIRNSDRESIQDSLLLQGVHFAYRAHQFTGGLDSETMCA 561

Query: 1380 IEEIRQRVPRHMAGSRELLAGIP 1448
             EEIRQRVP ++ GSRELLAGIP
Sbjct: 562  FEEIRQRVPGNLTGSRELLAGIP 584


>XP_006594000.1 PREDICTED: protein CHUP1, chloroplastic-like isoform X3 [Glycine max]
            KRH19473.1 hypothetical protein GLYMA_13G118400 [Glycine
            max] KRH19474.1 hypothetical protein GLYMA_13G118400
            [Glycine max] KRH19475.1 hypothetical protein
            GLYMA_13G118400 [Glycine max]
          Length = 585

 Score =  653 bits (1685), Expect = 0.0
 Identities = 360/503 (71%), Positives = 399/503 (79%), Gaps = 30/503 (5%)
 Frame = +3

Query: 30   GSREAEDASKVVVVTASRPRRRVG------SEEDDTDGXXXXXXXXXXXXXVSENLIKDL 191
            GS++AE+   V+V   +RPRRRVG      SE+DD+ G             VSENLIK L
Sbjct: 90   GSQKAEEGKIVIV---ARPRRRVGDFGSRKSEDDDSHGKKKKELLQEKLE-VSENLIKSL 145

Query: 192  QSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVG-SSSGKKEPIGEHQSPK 368
            QSEVLAL+ ELD+VKSLNVELESQNTKLTQ+LAAAEAKI+ VG  ++GKKEPIGEH+SPK
Sbjct: 146  QSEVLALREELDRVKSLNVELESQNTKLTQNLAAAEAKISNVGIGNNGKKEPIGEHRSPK 205

Query: 369  FKDIQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPET---------------- 500
            FKDIQKLIA+KLERS+VK+E  PE++F KASI APT S A+PET                
Sbjct: 206  FKDIQKLIAEKLERSRVKKEGTPEIIFAKASISAPTPSYAVPETISVGRKSPPNTCLQPP 265

Query: 501  -------TSIGRKSXXXXXXXXXXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQE 659
                   TS+GR S                      A+LAN TQKAP IVELF  LKN++
Sbjct: 266  PPPPPPITSVGRNSPSNTCLPPPPPPPPPPIPTPPLARLAN-TQKAPTIVELFHSLKNKD 324

Query: 660  GNNRKDSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDA 839
            G  + DSKGSVNHQ+P   SAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDA
Sbjct: 325  G--KIDSKGSVNHQRPVVISAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDA 382

Query: 840  AYVDIEDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEIS 1019
            A+ DIE+VLKFVDWLDG+LS+LADE AVLKHFKWPEKKADAMREAAVEY ELKMLEQEIS
Sbjct: 383  AFTDIEEVLKFVDWLDGKLSSLADECAVLKHFKWPEKKADAMREAAVEYHELKMLEQEIS 442

Query: 1020 SYKDDPDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSK 1199
            SYKDDPDIPCGA+LKKMASLLDKSERSIQ+LIKLR+SV  SYQMY+IPTAWMLDSG+MSK
Sbjct: 443  SYKDDPDIPCGAALKKMASLLDKSERSIQRLIKLRSSVTHSYQMYNIPTAWMLDSGIMSK 502

Query: 1200 IKQASMTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA 1379
            IKQASMTLVK YMKR+TMELESIRNSDRES QDSLLLQGVHFAYRAHQF GGLDSET+CA
Sbjct: 503  IKQASMTLVKTYMKRVTMELESIRNSDRESIQDSLLLQGVHFAYRAHQFTGGLDSETMCA 562

Query: 1380 IEEIRQRVPRHMAGSRELLAGIP 1448
             EEIRQRVP ++ GSRELLAGIP
Sbjct: 563  FEEIRQRVPGNLTGSRELLAGIP 585


>KRH19467.1 hypothetical protein GLYMA_13G118400 [Glycine max] KRH19468.1
            hypothetical protein GLYMA_13G118400 [Glycine max]
            KRH19469.1 hypothetical protein GLYMA_13G118400 [Glycine
            max]
          Length = 584

 Score =  652 bits (1681), Expect = 0.0
 Identities = 358/502 (71%), Positives = 396/502 (78%), Gaps = 29/502 (5%)
 Frame = +3

Query: 30   GSREAEDASKVVVVTASRPRRRVG------SEEDDTDGXXXXXXXXXXXXXVSENLIKDL 191
            GS++AE+   V+V   +RPRRRVG      SE+DD+ G             VSENLIK L
Sbjct: 90   GSQKAEEGKIVIV---ARPRRRVGDFGSRKSEDDDSHGKKKKELLQEKLE-VSENLIKSL 145

Query: 192  QSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSSSGKKEPIGEHQSPKF 371
            QSEVLAL+ ELD+VKSLNVELESQNTKLTQ+LAAAEAKI+ VG  +  KEPIGEH+SPKF
Sbjct: 146  QSEVLALREELDRVKSLNVELESQNTKLTQNLAAAEAKISNVGIGNNGKEPIGEHRSPKF 205

Query: 372  KDIQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPET----------------- 500
            KDIQKLIA+KLERS+VK+E  PE++F KASI APT S A+PET                 
Sbjct: 206  KDIQKLIAEKLERSRVKKEGTPEIIFAKASISAPTPSYAVPETISVGRKSPPNTCLQPPP 265

Query: 501  ------TSIGRKSXXXXXXXXXXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEG 662
                  TS+GR S                      A+LAN TQKAP IVELF  LKN++G
Sbjct: 266  PPPPPITSVGRNSPSNTCLPPPPPPPPPPIPTPPLARLAN-TQKAPTIVELFHSLKNKDG 324

Query: 663  NNRKDSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAA 842
              + DSKGSVNHQ+P   SAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAA
Sbjct: 325  --KIDSKGSVNHQRPVVISAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAA 382

Query: 843  YVDIEDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISS 1022
            + DIE+VLKFVDWLDG+LS+LADE AVLKHFKWPEKKADAMREAAVEY ELKMLEQEISS
Sbjct: 383  FTDIEEVLKFVDWLDGKLSSLADECAVLKHFKWPEKKADAMREAAVEYHELKMLEQEISS 442

Query: 1023 YKDDPDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKI 1202
            YKDDPDIPCGA+LKKMASLLDKSERSIQ+LIKLR+SV  SYQMY+IPTAWMLDSG+MSKI
Sbjct: 443  YKDDPDIPCGAALKKMASLLDKSERSIQRLIKLRSSVTHSYQMYNIPTAWMLDSGIMSKI 502

Query: 1203 KQASMTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAI 1382
            KQASMTLVK YMKR+TMELESIRNSDRES QDSLLLQGVHFAYRAHQF GGLDSET+CA 
Sbjct: 503  KQASMTLVKTYMKRVTMELESIRNSDRESIQDSLLLQGVHFAYRAHQFTGGLDSETMCAF 562

Query: 1383 EEIRQRVPRHMAGSRELLAGIP 1448
            EEIRQRVP ++ GSRELLAGIP
Sbjct: 563  EEIRQRVPGNLTGSRELLAGIP 584


>XP_003609889.1 hydroxyproline-rich glycoprotein family protein [Medicago truncatula]
            AES92086.1 hydroxyproline-rich glycoprotein family
            protein [Medicago truncatula]
          Length = 574

 Score =  649 bits (1674), Expect = 0.0
 Identities = 362/484 (74%), Positives = 394/484 (81%), Gaps = 3/484 (0%)
 Frame = +3

Query: 12   VEGSQKGSREAEDASKVVVVTASRPRRRVGSEEDDTDGXXXXXXXXXXXXXVSENLIKDL 191
            VE S KGS+E E A KVVVV   R RRR+  EEDD D              VSENLIK L
Sbjct: 100  VESSHKGSKEGEVA-KVVVVAPPR-RRRI--EEDDPD--VKEKKELLEKLEVSENLIKSL 153

Query: 192  QSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSSSG--KKEPIGEHQSP 365
            QSE+ ALK EL++VK LN++LESQN KL Q+LA+AEAKI A G+SS   KKEPIGE QSP
Sbjct: 154  QSEIKALKDELNQVKGLNIDLESQNIKLNQNLASAEAKIVAFGTSSSTRKKEPIGERQSP 213

Query: 366  KFKDIQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRA-IPETTSIGRKSXXXXXXX 542
            KFKDIQK+IADKLE SKVK+EA PEV+FVK+SIPAP  + A I E TS+GRKS       
Sbjct: 214  KFKDIQKIIADKLEMSKVKKEANPEVIFVKSSIPAPIPNHAAIREITSLGRKSPPNHCLM 273

Query: 543  XXXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRKDSKGSVNHQKPAAFSA 722
                           AKLAN TQKAP +V+LF  LKNQ+   +KD KGS+NHQKP   SA
Sbjct: 274  PPPPPPPPPIPSRPLAKLAN-TQKAPAVVQLFHSLKNQD--TKKDLKGSINHQKPITNSA 330

Query: 723  HSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDIEDVLKFVDWLDGELST 902
            H+SIVGEIQNRSAHLLAIR DI+TKGEFIN LI KVVDA+YVDIEDVLKFVDWLDGELST
Sbjct: 331  HNSIVGEIQNRSAHLLAIREDIQTKGEFINGLINKVVDASYVDIEDVLKFVDWLDGELST 390

Query: 903  LADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCGASLKKMASLL 1082
            LADERAVLKHFKWPE+KAD MREAAVEYRELKMLEQEISSYKDDPDIPC ASLKK+ASLL
Sbjct: 391  LADERAVLKHFKWPERKADTMREAAVEYRELKMLEQEISSYKDDPDIPCVASLKKIASLL 450

Query: 1083 DKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQASMTLVKMYMKRLTMELE 1262
            DKSERSIQKLI LRNSV+RSYQMY+IPTAWMLDSG+ SKIKQ+SMTLVKMYMKRLTMELE
Sbjct: 451  DKSERSIQKLIVLRNSVIRSYQMYNIPTAWMLDSGISSKIKQSSMTLVKMYMKRLTMELE 510

Query: 1263 SIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIRQRVPRHMAGSRELLAG 1442
            SIRNSDRES+QDSLLLQGVHFAYRAHQFAGGLDSETLCA EEIRQRVP H+AGSRELLA 
Sbjct: 511  SIRNSDRESNQDSLLLQGVHFAYRAHQFAGGLDSETLCAFEEIRQRVPGHLAGSRELLAC 570

Query: 1443 IPSS 1454
            I SS
Sbjct: 571  IASS 574


>XP_013458360.1 hydroxyproline-rich glycoprotein family protein [Medicago truncatula]
            KEH32391.1 hydroxyproline-rich glycoprotein family
            protein [Medicago truncatula]
          Length = 573

 Score =  649 bits (1673), Expect = 0.0
 Identities = 362/483 (74%), Positives = 394/483 (81%), Gaps = 2/483 (0%)
 Frame = +3

Query: 12   VEGSQKGSREAEDASKVVVVTASRPRRRVGSEEDDTDGXXXXXXXXXXXXXVSENLIKDL 191
            VE S KGS+E E A KVVVV   R RRR+  EEDD D              VSENLIK L
Sbjct: 100  VESSHKGSKEGEVA-KVVVVAPPR-RRRI--EEDDPD--VKEKKELLEKLEVSENLIKSL 153

Query: 192  QSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVG-SSSGKKEPIGEHQSPK 368
            QSE+ ALK EL++VK LN++LESQN KL Q+LA+AEAKI A G SSS +KEPIGE QSPK
Sbjct: 154  QSEIKALKDELNQVKGLNIDLESQNIKLNQNLASAEAKIVAFGTSSSTRKEPIGERQSPK 213

Query: 369  FKDIQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRA-IPETTSIGRKSXXXXXXXX 545
            FKDIQK+IADKLE SKVK+EA PEV+FVK+SIPAP  + A I E TS+GRKS        
Sbjct: 214  FKDIQKIIADKLEMSKVKKEANPEVIFVKSSIPAPIPNHAAIREITSLGRKSPPNHCLMP 273

Query: 546  XXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRKDSKGSVNHQKPAAFSAH 725
                          AKLAN TQKAP +V+LF  LKNQ+   +KD KGS+NHQKP   SAH
Sbjct: 274  PPPPPPPPIPSRPLAKLAN-TQKAPAVVQLFHSLKNQD--TKKDLKGSINHQKPITNSAH 330

Query: 726  SSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDIEDVLKFVDWLDGELSTL 905
            +SIVGEIQNRSAHLLAIR DI+TKGEFIN LI KVVDA+YVDIEDVLKFVDWLDGELSTL
Sbjct: 331  NSIVGEIQNRSAHLLAIREDIQTKGEFINGLINKVVDASYVDIEDVLKFVDWLDGELSTL 390

Query: 906  ADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCGASLKKMASLLD 1085
            ADERAVLKHFKWPE+KAD MREAAVEYRELKMLEQEISSYKDDPDIPC ASLKK+ASLLD
Sbjct: 391  ADERAVLKHFKWPERKADTMREAAVEYRELKMLEQEISSYKDDPDIPCVASLKKIASLLD 450

Query: 1086 KSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQASMTLVKMYMKRLTMELES 1265
            KSERSIQKLI LRNSV+RSYQMY+IPTAWMLDSG+ SKIKQ+SMTLVKMYMKRLTMELES
Sbjct: 451  KSERSIQKLIVLRNSVIRSYQMYNIPTAWMLDSGISSKIKQSSMTLVKMYMKRLTMELES 510

Query: 1266 IRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIRQRVPRHMAGSRELLAGI 1445
            IRNSDRES+QDSLLLQGVHFAYRAHQFAGGLDSETLCA EEIRQRVP H+AGSRELLA I
Sbjct: 511  IRNSDRESNQDSLLLQGVHFAYRAHQFAGGLDSETLCAFEEIRQRVPGHLAGSRELLACI 570

Query: 1446 PSS 1454
             SS
Sbjct: 571  ASS 573


>XP_006593995.1 PREDICTED: protein CHUP1, chloroplastic-like isoform X1 [Glycine max]
            XP_006593996.1 PREDICTED: protein CHUP1,
            chloroplastic-like isoform X1 [Glycine max]
            XP_006593997.1 PREDICTED: protein CHUP1,
            chloroplastic-like isoform X1 [Glycine max]
            XP_006593998.1 PREDICTED: protein CHUP1,
            chloroplastic-like isoform X1 [Glycine max] KRH19476.1
            hypothetical protein GLYMA_13G118400 [Glycine max]
            KRH19477.1 hypothetical protein GLYMA_13G118400 [Glycine
            max] KRH19478.1 hypothetical protein GLYMA_13G118400
            [Glycine max]
          Length = 593

 Score =  646 bits (1666), Expect = 0.0
 Identities = 360/511 (70%), Positives = 399/511 (78%), Gaps = 38/511 (7%)
 Frame = +3

Query: 30   GSREAEDASKVVVVTASRPRRRVG------SEEDDTDGXXXXXXXXXXXXXVSENLIKDL 191
            GS++AE+   V+V   +RPRRRVG      SE+DD+ G             VSENLIK L
Sbjct: 90   GSQKAEEGKIVIV---ARPRRRVGDFGSRKSEDDDSHGKKKKELLQEKLE-VSENLIKSL 145

Query: 192  QSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVG-SSSGKKEPIGEHQSPK 368
            QSEVLAL+ ELD+VKSLNVELESQNTKLTQ+LAAAEAKI+ VG  ++GKKEPIGEH+SPK
Sbjct: 146  QSEVLALREELDRVKSLNVELESQNTKLTQNLAAAEAKISNVGIGNNGKKEPIGEHRSPK 205

Query: 369  FKDIQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPET---------------- 500
            FKDIQKLIA+KLERS+VK+E  PE++F KASI APT S A+PET                
Sbjct: 206  FKDIQKLIAEKLERSRVKKEGTPEIIFAKASISAPTPSYAVPETISVGRKSPPNTCLQPP 265

Query: 501  -------TSIGRKSXXXXXXXXXXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQE 659
                   TS+GR S                      A+LAN TQKAP IVELF  LKN++
Sbjct: 266  PPPPPPITSVGRNSPSNTCLPPPPPPPPPPIPTPPLARLAN-TQKAPTIVELFHSLKNKD 324

Query: 660  GNNRKDSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDA 839
            G  + DSKGSVNHQ+P   SAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDA
Sbjct: 325  G--KIDSKGSVNHQRPVVISAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDA 382

Query: 840  AYVDIEDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEIS 1019
            A+ DIE+VLKFVDWLDG+LS+LADE AVLKHFKWPEKKADAMREAAVEY ELKMLEQEIS
Sbjct: 383  AFTDIEEVLKFVDWLDGKLSSLADECAVLKHFKWPEKKADAMREAAVEYHELKMLEQEIS 442

Query: 1020 SYKDDPDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSK 1199
            SYKDDPDIPCGA+LKKMASLLDKSERSIQ+LIKLR+SV  SYQMY+IPTAWMLDSG+MSK
Sbjct: 443  SYKDDPDIPCGAALKKMASLLDKSERSIQRLIKLRSSVTHSYQMYNIPTAWMLDSGIMSK 502

Query: 1200 --------IKQASMTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGG 1355
                    IKQASMTLVK YMKR+TMELESIRNSDRES QDSLLLQGVHFAYRAHQF GG
Sbjct: 503  TSNIPSMQIKQASMTLVKTYMKRVTMELESIRNSDRESIQDSLLLQGVHFAYRAHQFTGG 562

Query: 1356 LDSETLCAIEEIRQRVPRHMAGSRELLAGIP 1448
            LDSET+CA EEIRQRVP ++ GSRELLAGIP
Sbjct: 563  LDSETMCAFEEIRQRVPGNLTGSRELLAGIP 593


>XP_006593999.1 PREDICTED: protein CHUP1, chloroplastic-like isoform X2 [Glycine max]
            KRH19470.1 hypothetical protein GLYMA_13G118400 [Glycine
            max] KRH19471.1 hypothetical protein GLYMA_13G118400
            [Glycine max] KRH19472.1 hypothetical protein
            GLYMA_13G118400 [Glycine max]
          Length = 592

 Score =  644 bits (1662), Expect = 0.0
 Identities = 358/510 (70%), Positives = 396/510 (77%), Gaps = 37/510 (7%)
 Frame = +3

Query: 30   GSREAEDASKVVVVTASRPRRRVG------SEEDDTDGXXXXXXXXXXXXXVSENLIKDL 191
            GS++AE+   V+V   +RPRRRVG      SE+DD+ G             VSENLIK L
Sbjct: 90   GSQKAEEGKIVIV---ARPRRRVGDFGSRKSEDDDSHGKKKKELLQEKLE-VSENLIKSL 145

Query: 192  QSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSSSGKKEPIGEHQSPKF 371
            QSEVLAL+ ELD+VKSLNVELESQNTKLTQ+LAAAEAKI+ VG  +  KEPIGEH+SPKF
Sbjct: 146  QSEVLALREELDRVKSLNVELESQNTKLTQNLAAAEAKISNVGIGNNGKEPIGEHRSPKF 205

Query: 372  KDIQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPET----------------- 500
            KDIQKLIA+KLERS+VK+E  PE++F KASI APT S A+PET                 
Sbjct: 206  KDIQKLIAEKLERSRVKKEGTPEIIFAKASISAPTPSYAVPETISVGRKSPPNTCLQPPP 265

Query: 501  ------TSIGRKSXXXXXXXXXXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEG 662
                  TS+GR S                      A+LAN TQKAP IVELF  LKN++G
Sbjct: 266  PPPPPITSVGRNSPSNTCLPPPPPPPPPPIPTPPLARLAN-TQKAPTIVELFHSLKNKDG 324

Query: 663  NNRKDSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAA 842
              + DSKGSVNHQ+P   SAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAA
Sbjct: 325  --KIDSKGSVNHQRPVVISAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAA 382

Query: 843  YVDIEDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISS 1022
            + DIE+VLKFVDWLDG+LS+LADE AVLKHFKWPEKKADAMREAAVEY ELKMLEQEISS
Sbjct: 383  FTDIEEVLKFVDWLDGKLSSLADECAVLKHFKWPEKKADAMREAAVEYHELKMLEQEISS 442

Query: 1023 YKDDPDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSK- 1199
            YKDDPDIPCGA+LKKMASLLDKSERSIQ+LIKLR+SV  SYQMY+IPTAWMLDSG+MSK 
Sbjct: 443  YKDDPDIPCGAALKKMASLLDKSERSIQRLIKLRSSVTHSYQMYNIPTAWMLDSGIMSKT 502

Query: 1200 -------IKQASMTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGL 1358
                   IKQASMTLVK YMKR+TMELESIRNSDRES QDSLLLQGVHFAYRAHQF GGL
Sbjct: 503  SNIPSMQIKQASMTLVKTYMKRVTMELESIRNSDRESIQDSLLLQGVHFAYRAHQFTGGL 562

Query: 1359 DSETLCAIEEIRQRVPRHMAGSRELLAGIP 1448
            DSET+CA EEIRQRVP ++ GSRELLAGIP
Sbjct: 563  DSETMCAFEEIRQRVPGNLTGSRELLAGIP 592


>XP_006600413.1 PREDICTED: protein CHUP1, chloroplastic-like isoform X1 [Glycine max]
            KRH02485.1 hypothetical protein GLYMA_17G041500 [Glycine
            max]
          Length = 567

 Score =  642 bits (1657), Expect = 0.0
 Identities = 355/490 (72%), Positives = 394/490 (80%), Gaps = 23/490 (4%)
 Frame = +3

Query: 48   DASKVVVVTASRPRRRVG------SEEDDTDGXXXXXXXXXXXXXVSENLIKDLQSEVLA 209
            + +KVVVV  +RPRRRVG      +E+DD DG             VSENLIK LQSEVLA
Sbjct: 85   EEAKVVVV--ARPRRRVGDFDLQKNEDDDPDGKKKKELQEKLE--VSENLIKSLQSEVLA 140

Query: 210  LKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVG-SSSGKKEPIGEHQSPKFKDIQK 386
            L+ ELD+VKSLNVELES+NTKLTQ+LAAAEAKI+ V   ++GKK PIGEHQSPKFKDIQK
Sbjct: 141  LREELDRVKSLNVELESRNTKLTQNLAAAEAKISTVDIGNNGKKGPIGEHQSPKFKDIQK 200

Query: 387  LIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPETTSIGRKSXXXXXXXXXXXXXXX 566
            LIA+KLERS+VK+E  PE++F KASI APT S AIPETTSIGRKS               
Sbjct: 201  LIAEKLERSRVKKEGTPEIIFAKASISAPTPSYAIPETTSIGRKSPPNTCLQPPPPVTSV 260

Query: 567  XXXXXXX----------------AKLANNTQKAPPIVELFRFLKNQEGNNRKDSKGSVNH 698
                                   A+LAN+ QK+P IVELF  LKN++   + DSKGSVNH
Sbjct: 261  GRKSPSNTCLQPPPPPPIPTRPLARLANS-QKSPAIVELFHSLKNKDW--KIDSKGSVNH 317

Query: 699  QKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDIEDVLKFVD 878
            Q+P   SAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLI+KVVDAA+ DIE+VLKFVD
Sbjct: 318  QRPVVISAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIRKVVDAAFTDIEEVLKFVD 377

Query: 879  WLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCGAS 1058
            WLD +LS+LADERAVLK FKWPEKKADAMREAAVEY ELKMLEQEISSYKDDPDIPCGA+
Sbjct: 378  WLDVKLSSLADERAVLKPFKWPEKKADAMREAAVEYHELKMLEQEISSYKDDPDIPCGAA 437

Query: 1059 LKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQASMTLVKMYM 1238
            LKKMASLLDKSERSIQ+LIKLR+SV  SYQMY+IPTAWMLDSG+MS+IKQASMTLVK YM
Sbjct: 438  LKKMASLLDKSERSIQRLIKLRSSVTHSYQMYNIPTAWMLDSGIMSEIKQASMTLVKTYM 497

Query: 1239 KRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIRQRVPRHMA 1418
            KR+TMELESIRNSDRES QDSLLLQG+HFAYRAHQF GGLDSET+CA EEIRQRVP H+A
Sbjct: 498  KRVTMELESIRNSDRESIQDSLLLQGMHFAYRAHQFTGGLDSETMCAFEEIRQRVPGHLA 557

Query: 1419 GSRELLAGIP 1448
            GSRELLAGIP
Sbjct: 558  GSRELLAGIP 567


>XP_006600414.1 PREDICTED: protein CHUP1, chloroplastic-like isoform X2 [Glycine max]
            KHN17796.1 Protein CHUP1, chloroplastic [Glycine soja]
            KRH02486.1 hypothetical protein GLYMA_17G041500 [Glycine
            max] KRH02487.1 hypothetical protein GLYMA_17G041500
            [Glycine max]
          Length = 566

 Score =  641 bits (1653), Expect = 0.0
 Identities = 353/489 (72%), Positives = 391/489 (79%), Gaps = 22/489 (4%)
 Frame = +3

Query: 48   DASKVVVVTASRPRRRVG------SEEDDTDGXXXXXXXXXXXXXVSENLIKDLQSEVLA 209
            + +KVVVV  +RPRRRVG      +E+DD DG             VSENLIK LQSEVLA
Sbjct: 85   EEAKVVVV--ARPRRRVGDFDLQKNEDDDPDGKKKKELQEKLE--VSENLIKSLQSEVLA 140

Query: 210  LKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSSSGKKEPIGEHQSPKFKDIQKL 389
            L+ ELD+VKSLNVELES+NTKLTQ+LAAAEAKI+ V   +  K PIGEHQSPKFKDIQKL
Sbjct: 141  LREELDRVKSLNVELESRNTKLTQNLAAAEAKISTVDIGNNGKGPIGEHQSPKFKDIQKL 200

Query: 390  IADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPETTSIGRKSXXXXXXXXXXXXXXXX 569
            IA+KLERS+VK+E  PE++F KASI APT S AIPETTSIGRKS                
Sbjct: 201  IAEKLERSRVKKEGTPEIIFAKASISAPTPSYAIPETTSIGRKSPPNTCLQPPPPVTSVG 260

Query: 570  XXXXXX----------------AKLANNTQKAPPIVELFRFLKNQEGNNRKDSKGSVNHQ 701
                                  A+LAN+ QK+P IVELF  LKN++   + DSKGSVNHQ
Sbjct: 261  RKSPSNTCLQPPPPPPIPTRPLARLANS-QKSPAIVELFHSLKNKDW--KIDSKGSVNHQ 317

Query: 702  KPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDIEDVLKFVDW 881
            +P   SAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLI+KVVDAA+ DIE+VLKFVDW
Sbjct: 318  RPVVISAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIRKVVDAAFTDIEEVLKFVDW 377

Query: 882  LDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCGASL 1061
            LD +LS+LADERAVLK FKWPEKKADAMREAAVEY ELKMLEQEISSYKDDPDIPCGA+L
Sbjct: 378  LDVKLSSLADERAVLKPFKWPEKKADAMREAAVEYHELKMLEQEISSYKDDPDIPCGAAL 437

Query: 1062 KKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQASMTLVKMYMK 1241
            KKMASLLDKSERSIQ+LIKLR+SV  SYQMY+IPTAWMLDSG+MS+IKQASMTLVK YMK
Sbjct: 438  KKMASLLDKSERSIQRLIKLRSSVTHSYQMYNIPTAWMLDSGIMSEIKQASMTLVKTYMK 497

Query: 1242 RLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIRQRVPRHMAG 1421
            R+TMELESIRNSDRES QDSLLLQG+HFAYRAHQF GGLDSET+CA EEIRQRVP H+AG
Sbjct: 498  RVTMELESIRNSDRESIQDSLLLQGMHFAYRAHQFTGGLDSETMCAFEEIRQRVPGHLAG 557

Query: 1422 SRELLAGIP 1448
            SRELLAGIP
Sbjct: 558  SRELLAGIP 566


>XP_007154485.1 hypothetical protein PHAVU_003G122900g [Phaseolus vulgaris]
            XP_007154486.1 hypothetical protein PHAVU_003G122900g
            [Phaseolus vulgaris] ESW26479.1 hypothetical protein
            PHAVU_003G122900g [Phaseolus vulgaris] ESW26480.1
            hypothetical protein PHAVU_003G122900g [Phaseolus
            vulgaris]
          Length = 567

 Score =  634 bits (1635), Expect = 0.0
 Identities = 352/504 (69%), Positives = 389/504 (77%), Gaps = 28/504 (5%)
 Frame = +3

Query: 21   SQKGSREAEDASKVVVVTASRPRRRVG------SEEDDTDGXXXXXXXXXXXXXVSENLI 182
            SQKG REAE+A  VVV   +R RRR+G      SE+DD DG             VS+NLI
Sbjct: 72   SQKG-REAEEAKVVVV---ARSRRRLGDFDLKKSEDDDPDGKKRKELQEKLE--VSDNLI 125

Query: 183  KDLQSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSSSGKKEPIGEHQS 362
            K LQSEVLALK ELDKVKSLNVELESQNTKLT++LAAAEAK A VG  +  KE IGEHQS
Sbjct: 126  KSLQSEVLALKEELDKVKSLNVELESQNTKLTRNLAAAEAKEATVGIGNSGKESIGEHQS 185

Query: 363  PKFKDIQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPETTSIGRKSXXXXXXX 542
            PKFKDIQKLIADKLE S+VK+E  PEV F KASIP+PT S +I ET SIGRKS       
Sbjct: 186  PKFKDIQKLIADKLELSRVKKEGAPEVNFAKASIPSPTPSFSIYETISIGRKSPPNSCLQ 245

Query: 543  XXXXXXXXXXXXXXXAK----------------------LANNTQKAPPIVELFRFLKNQ 656
                           +                         +NTQKAP +VELF+ L N+
Sbjct: 246  PLPPPPPPITSLGRNSAPRTCLQPPPPPPPPPIPSRPSARLSNTQKAPAVVELFQSLNNK 305

Query: 657  EGNNRKDSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVD 836
             G  + DSKG VNH +P   SAHSSIVGEIQNRSAHLLAIRADIETKGEF+NDLIKKVVD
Sbjct: 306  NG--KIDSKGPVNHPRPVVISAHSSIVGEIQNRSAHLLAIRADIETKGEFVNDLIKKVVD 363

Query: 837  AAYVDIEDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEI 1016
            AA+ DIE+VLKFV+WLDG+LS+LADERAVLKHFKWPEKKADAMREAAVEY ELKMLEQEI
Sbjct: 364  AAFTDIEEVLKFVNWLDGKLSSLADERAVLKHFKWPEKKADAMREAAVEYHELKMLEQEI 423

Query: 1017 SSYKDDPDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMS 1196
            SSYKDDPDIPCGA+LKKM SLLDKSER IQ+LIKLR+SV+ SYQ+Y+IPTAWMLDSG+M 
Sbjct: 424  SSYKDDPDIPCGAALKKMGSLLDKSERIIQRLIKLRSSVIHSYQVYNIPTAWMLDSGIMK 483

Query: 1197 KIKQASMTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLC 1376
             IKQASMTLVKMYMKR+TMELESIRNSDRES QDSLLLQGVHFAYRAHQFAGGLD+ET+C
Sbjct: 484  NIKQASMTLVKMYMKRVTMELESIRNSDRESIQDSLLLQGVHFAYRAHQFAGGLDAETMC 543

Query: 1377 AIEEIRQRVPRHMAGSRELLAGIP 1448
            A EE+RQRVP H+AGSRELL GIP
Sbjct: 544  AFEEMRQRVPGHLAGSRELLVGIP 567


>XP_019419024.1 PREDICTED: protein CHUP1, chloroplastic [Lupinus angustifolius]
            OIV95295.1 hypothetical protein TanjilG_07451 [Lupinus
            angustifolius]
          Length = 546

 Score =  627 bits (1617), Expect = 0.0
 Identities = 344/481 (71%), Positives = 376/481 (78%)
 Frame = +3

Query: 12   VEGSQKGSREAEDASKVVVVTASRPRRRVGSEEDDTDGXXXXXXXXXXXXXVSENLIKDL 191
            V GSQKGS+E E+   VV V      +    +E                  VSENLIK L
Sbjct: 91   VVGSQKGSKEVEEGKVVVGVQRVFVLKEKELQEK---------------LEVSENLIKHL 135

Query: 192  QSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSSSGKKEPIGEHQSPKF 371
            QSEVL LKAELDKVK+LNV+LESQN KLT+DL AAEAK+        K EPIGEH++PKF
Sbjct: 136  QSEVLELKAELDKVKTLNVKLESQNRKLTEDLVAAEAKVE-------KNEPIGEHKTPKF 188

Query: 372  KDIQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPETTSIGRKSXXXXXXXXXX 551
            KDIQKLIADKLE SKVK+EA  E  FVKASIP P  S  I ET+SIGRKS          
Sbjct: 189  KDIQKLIADKLEWSKVKKEATTEAFFVKASIPVPAASHVISETSSIGRKSPPKPCLPPPP 248

Query: 552  XXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRKDSKGSVNHQKPAAFSAHSS 731
                        AKLA + QKAP +V+LF  LKNQ  N +K+SKG VNHQKP   SAHSS
Sbjct: 249  PPPPPSIPSRPSAKLATS-QKAPSVVQLFHSLKNQ--NEKKESKGYVNHQKPLPSSAHSS 305

Query: 732  IVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDIEDVLKFVDWLDGELSTLAD 911
            IVGEIQNRSAHLLAIR DIETKGEFINDLIKKVVDA Y DIEDVLKFVDWLDGELS+LAD
Sbjct: 306  IVGEIQNRSAHLLAIRTDIETKGEFINDLIKKVVDARYKDIEDVLKFVDWLDGELSSLAD 365

Query: 912  ERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDDPDIPCGASLKKMASLLDKS 1091
            ERAVLKHFKWPE+KADAMREAAVEYRELK+LE EISSYKDDPDIPCG++LK+M SL DKS
Sbjct: 366  ERAVLKHFKWPERKADAMREAAVEYRELKILEHEISSYKDDPDIPCGSALKRMTSLFDKS 425

Query: 1092 ERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQASMTLVKMYMKRLTMELESIR 1271
            ER+IQ+LIKLRNS +RSYQ Y+IPTAWMLDSGMMSKIKQASMTLVK+YMKR+TMELESIR
Sbjct: 426  ERNIQRLIKLRNSAVRSYQEYNIPTAWMLDSGMMSKIKQASMTLVKIYMKRVTMELESIR 485

Query: 1272 NSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIRQRVPRHMAGSRELLAGIPS 1451
            NSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLC  EEIRQRVP H+AGS+ELLA I S
Sbjct: 486  NSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCTFEEIRQRVPGHLAGSQELLACIAS 545

Query: 1452 S 1454
            +
Sbjct: 546  T 546


>XP_016194601.1 PREDICTED: protein CHUP1, chloroplastic isoform X3 [Arachis ipaensis]
          Length = 621

 Score =  626 bits (1614), Expect = 0.0
 Identities = 350/499 (70%), Positives = 388/499 (77%), Gaps = 18/499 (3%)
 Frame = +3

Query: 6    VVVEGSQKGSREAE-------DASKVVVVTASRPRRRVGSE-------EDDTDGXXXXXX 143
            VVV GSQK   EA+        +    V   +RPRRRV  +       ED+ DG      
Sbjct: 117  VVVLGSQKAVEEAKVVVGRFVRSQHGSVEQFARPRRRVIGDSGLSRRIEDEADGVVKRKE 176

Query: 144  XXXXXXX-VSENLIKDLQSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVG 320
                    +SENLIKDL+SEV+ALKAELD+VK LNVELES+N KL++DLAAAEAK+ A  
Sbjct: 177  KELPEKLELSENLIKDLKSEVVALKAELDRVKGLNVELESKNKKLSEDLAAAEAKMVAAV 236

Query: 321  SSSGKKEPIGEHQSPKFKDIQKLIADKLERSKVKREAVPEVVFVKAS-IPAPT-TSRAIP 494
             +SGKKE IGEHQSPKFKDIQKLIADKLERSKVK+EA PE +F KAS IP+PT T     
Sbjct: 237  GTSGKKEAIGEHQSPKFKDIQKLIADKLERSKVKKEATPEAIFRKASSIPSPTATIHVNN 296

Query: 495  ETTSIGRKSXXXXXXXXXXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRK 674
            E+ SI RKS                           + QKAPP+VELF  LKN +   ++
Sbjct: 297  ESKSIERKSPPNQCLPPPPPPPLPPSMPSRPLAKLASAQKAPPLVELFHSLKNHD--MKR 354

Query: 675  DSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDI 854
            D KG +NH +P A SAHSSIVGEIQNRSAHLLAIR DIETKGEFINDLIK+V DAAY+DI
Sbjct: 355  DIKGPLNHPQPVAISAHSSIVGEIQNRSAHLLAIRVDIETKGEFINDLIKRVEDAAYMDI 414

Query: 855  EDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDD 1034
            E+VLKFVDWLDGELS+L DERAVLKHFKWPEKKADAMREAAVEYRELK+LEQEISSYKDD
Sbjct: 415  EEVLKFVDWLDGELSSLVDERAVLKHFKWPEKKADAMREAAVEYRELKLLEQEISSYKDD 474

Query: 1035 PDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQAS 1214
            PDIPCGA+LKKMASLLDKSE SIQ+LIKLRNSVMRSYQ Y+IPTAWMLDSG+MSKIKQAS
Sbjct: 475  PDIPCGAALKKMASLLDKSELSIQRLIKLRNSVMRSYQAYNIPTAWMLDSGIMSKIKQAS 534

Query: 1215 MTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIR 1394
            MTL KMYMKR+TMEL+S RN+DRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA EEIR
Sbjct: 535  MTLAKMYMKRVTMELKSNRNTDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAFEEIR 594

Query: 1395 QRVPRHM-AGSRELLAGIP 1448
            QRVP H+ AGSRELLAGIP
Sbjct: 595  QRVPGHLAAGSRELLAGIP 613


>XP_015945214.1 PREDICTED: protein CHUP1, chloroplastic isoform X3 [Arachis
            duranensis]
          Length = 621

 Score =  625 bits (1613), Expect = 0.0
 Identities = 351/499 (70%), Positives = 387/499 (77%), Gaps = 18/499 (3%)
 Frame = +3

Query: 6    VVVEGSQKGSREAE-------DASKVVVVTASRPRRRVGSE-------EDDTDGXXXXXX 143
            VVV GSQK   EA+        +    V   +RPRR+V  +       ED+ DG      
Sbjct: 117  VVVLGSQKAVEEAKVVVGRFVRSQHGSVEQFARPRRKVIGDSGLSRRIEDEADGVVKKKE 176

Query: 144  XXXXXXX-VSENLIKDLQSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVG 320
                    VSENLIKDL+SEV+ALKAELD+VK LNVELES+N KL++DLAAAEAK+ A  
Sbjct: 177  KELPEKLEVSENLIKDLKSEVVALKAELDRVKGLNVELESKNKKLSEDLAAAEAKMVAAV 236

Query: 321  SSSGKKEPIGEHQSPKFKDIQKLIADKLERSKVKREAVPEVVFVKAS-IPAPT-TSRAIP 494
             +SGKKE IGEHQSPKFKDIQKLIADKLERSKVK+EA PE +F KAS IP+PT T     
Sbjct: 237  GTSGKKEAIGEHQSPKFKDIQKLIADKLERSKVKKEATPEAIFRKASSIPSPTATIHVNN 296

Query: 495  ETTSIGRKSXXXXXXXXXXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRK 674
            E+ SI RKS                           + QKAPP+VELF  LKN +   ++
Sbjct: 297  ESKSIERKSPPNQCLPPPPPPPLPPSMPSRPLAKLASAQKAPPLVELFHSLKNHD--MKR 354

Query: 675  DSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDI 854
            D KG +NH +P A SAHSSIVGEIQNRSAHLLAIR DIETKGEFINDLIKKV DAAY+DI
Sbjct: 355  DIKGPLNHPQPVAISAHSSIVGEIQNRSAHLLAIRVDIETKGEFINDLIKKVEDAAYMDI 414

Query: 855  EDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDD 1034
            E+VLKFVDWLDGELS+L DERAVLKHFKWPEKKADAMREAAVEYRELK+LEQEISSYKDD
Sbjct: 415  EEVLKFVDWLDGELSSLVDERAVLKHFKWPEKKADAMREAAVEYRELKLLEQEISSYKDD 474

Query: 1035 PDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQAS 1214
             DIPCGA+LKKMASLLDKSE SIQ+LIKLRNSVMRSYQ Y+IPTAWMLDSG+MSKIKQAS
Sbjct: 475  SDIPCGAALKKMASLLDKSELSIQRLIKLRNSVMRSYQAYNIPTAWMLDSGIMSKIKQAS 534

Query: 1215 MTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIR 1394
            MTL KMYMKR+TMELES RN+DRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA EEIR
Sbjct: 535  MTLAKMYMKRVTMELESNRNTDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAFEEIR 594

Query: 1395 QRVPRHM-AGSRELLAGIP 1448
            QRVP H+ AGSRELLAGIP
Sbjct: 595  QRVPGHLAAGSRELLAGIP 613


>XP_016194585.1 PREDICTED: protein CHUP1, chloroplastic isoform X1 [Arachis ipaensis]
          Length = 633

 Score =  626 bits (1614), Expect = 0.0
 Identities = 350/499 (70%), Positives = 388/499 (77%), Gaps = 18/499 (3%)
 Frame = +3

Query: 6    VVVEGSQKGSREAE-------DASKVVVVTASRPRRRVGSE-------EDDTDGXXXXXX 143
            VVV GSQK   EA+        +    V   +RPRRRV  +       ED+ DG      
Sbjct: 129  VVVLGSQKAVEEAKVVVGRFVRSQHGSVEQFARPRRRVIGDSGLSRRIEDEADGVVKRKE 188

Query: 144  XXXXXXX-VSENLIKDLQSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVG 320
                    +SENLIKDL+SEV+ALKAELD+VK LNVELES+N KL++DLAAAEAK+ A  
Sbjct: 189  KELPEKLELSENLIKDLKSEVVALKAELDRVKGLNVELESKNKKLSEDLAAAEAKMVAAV 248

Query: 321  SSSGKKEPIGEHQSPKFKDIQKLIADKLERSKVKREAVPEVVFVKAS-IPAPT-TSRAIP 494
             +SGKKE IGEHQSPKFKDIQKLIADKLERSKVK+EA PE +F KAS IP+PT T     
Sbjct: 249  GTSGKKEAIGEHQSPKFKDIQKLIADKLERSKVKKEATPEAIFRKASSIPSPTATIHVNN 308

Query: 495  ETTSIGRKSXXXXXXXXXXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRK 674
            E+ SI RKS                           + QKAPP+VELF  LKN +   ++
Sbjct: 309  ESKSIERKSPPNQCLPPPPPPPLPPSMPSRPLAKLASAQKAPPLVELFHSLKNHD--MKR 366

Query: 675  DSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDI 854
            D KG +NH +P A SAHSSIVGEIQNRSAHLLAIR DIETKGEFINDLIK+V DAAY+DI
Sbjct: 367  DIKGPLNHPQPVAISAHSSIVGEIQNRSAHLLAIRVDIETKGEFINDLIKRVEDAAYMDI 426

Query: 855  EDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDD 1034
            E+VLKFVDWLDGELS+L DERAVLKHFKWPEKKADAMREAAVEYRELK+LEQEISSYKDD
Sbjct: 427  EEVLKFVDWLDGELSSLVDERAVLKHFKWPEKKADAMREAAVEYRELKLLEQEISSYKDD 486

Query: 1035 PDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQAS 1214
            PDIPCGA+LKKMASLLDKSE SIQ+LIKLRNSVMRSYQ Y+IPTAWMLDSG+MSKIKQAS
Sbjct: 487  PDIPCGAALKKMASLLDKSELSIQRLIKLRNSVMRSYQAYNIPTAWMLDSGIMSKIKQAS 546

Query: 1215 MTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIR 1394
            MTL KMYMKR+TMEL+S RN+DRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA EEIR
Sbjct: 547  MTLAKMYMKRVTMELKSNRNTDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAFEEIR 606

Query: 1395 QRVPRHM-AGSRELLAGIP 1448
            QRVP H+ AGSRELLAGIP
Sbjct: 607  QRVPGHLAAGSRELLAGIP 625


>XP_015945204.1 PREDICTED: protein CHUP1, chloroplastic isoform X1 [Arachis
            duranensis]
          Length = 633

 Score =  625 bits (1613), Expect = 0.0
 Identities = 351/499 (70%), Positives = 387/499 (77%), Gaps = 18/499 (3%)
 Frame = +3

Query: 6    VVVEGSQKGSREAE-------DASKVVVVTASRPRRRVGSE-------EDDTDGXXXXXX 143
            VVV GSQK   EA+        +    V   +RPRR+V  +       ED+ DG      
Sbjct: 129  VVVLGSQKAVEEAKVVVGRFVRSQHGSVEQFARPRRKVIGDSGLSRRIEDEADGVVKKKE 188

Query: 144  XXXXXXX-VSENLIKDLQSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVG 320
                    VSENLIKDL+SEV+ALKAELD+VK LNVELES+N KL++DLAAAEAK+ A  
Sbjct: 189  KELPEKLEVSENLIKDLKSEVVALKAELDRVKGLNVELESKNKKLSEDLAAAEAKMVAAV 248

Query: 321  SSSGKKEPIGEHQSPKFKDIQKLIADKLERSKVKREAVPEVVFVKAS-IPAPT-TSRAIP 494
             +SGKKE IGEHQSPKFKDIQKLIADKLERSKVK+EA PE +F KAS IP+PT T     
Sbjct: 249  GTSGKKEAIGEHQSPKFKDIQKLIADKLERSKVKKEATPEAIFRKASSIPSPTATIHVNN 308

Query: 495  ETTSIGRKSXXXXXXXXXXXXXXXXXXXXXXAKLANNTQKAPPIVELFRFLKNQEGNNRK 674
            E+ SI RKS                           + QKAPP+VELF  LKN +   ++
Sbjct: 309  ESKSIERKSPPNQCLPPPPPPPLPPSMPSRPLAKLASAQKAPPLVELFHSLKNHD--MKR 366

Query: 675  DSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDAAYVDI 854
            D KG +NH +P A SAHSSIVGEIQNRSAHLLAIR DIETKGEFINDLIKKV DAAY+DI
Sbjct: 367  DIKGPLNHPQPVAISAHSSIVGEIQNRSAHLLAIRVDIETKGEFINDLIKKVEDAAYMDI 426

Query: 855  EDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEISSYKDD 1034
            E+VLKFVDWLDGELS+L DERAVLKHFKWPEKKADAMREAAVEYRELK+LEQEISSYKDD
Sbjct: 427  EEVLKFVDWLDGELSSLVDERAVLKHFKWPEKKADAMREAAVEYRELKLLEQEISSYKDD 486

Query: 1035 PDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSKIKQAS 1214
             DIPCGA+LKKMASLLDKSE SIQ+LIKLRNSVMRSYQ Y+IPTAWMLDSG+MSKIKQAS
Sbjct: 487  SDIPCGAALKKMASLLDKSELSIQRLIKLRNSVMRSYQAYNIPTAWMLDSGIMSKIKQAS 546

Query: 1215 MTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAIEEIR 1394
            MTL KMYMKR+TMELES RN+DRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA EEIR
Sbjct: 547  MTLAKMYMKRVTMELESNRNTDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCAFEEIR 606

Query: 1395 QRVPRHM-AGSRELLAGIP 1448
            QRVP H+ AGSRELLAGIP
Sbjct: 607  QRVPGHLAAGSRELLAGIP 625


>XP_017411993.1 PREDICTED: protein CHUP1, chloroplastic isoform X3 [Vigna angularis]
          Length = 567

 Score =  619 bits (1596), Expect = 0.0
 Identities = 337/505 (66%), Positives = 384/505 (76%), Gaps = 26/505 (5%)
 Frame = +3

Query: 18   GSQKGSREAEDASKVVVVTASRPRRRVG------SEEDDTDGXXXXXXXXXXXXXVSENL 179
            G+  GSR+  +A +  VV  +RPRRR+G      S +DD DG             VSENL
Sbjct: 67   GAAVGSRKGREAEEAEVVVVARPRRRLGDFGLRKSGDDDPDGKKRKELQEKLE--VSENL 124

Query: 180  IKDLQSEVLALKAELDKVKSLNVELESQNTKLTQDLAAAEAKIAAVGSSSGKKEPIGEHQ 359
            IK LQSEVLALK ELD+VKSLNVELESQNTKLT++LA A+     +G+S  K+  IGEHQ
Sbjct: 125  IKSLQSEVLALKEELDRVKSLNVELESQNTKLTRNLAEAKQATVGIGNSGKKESVIGEHQ 184

Query: 360  SPKFKDIQKLIADKLERSKVKREAVPEVVFVKASIPAPTTSRAIPETTSIGRKSXXXXXX 539
            SPKFKDIQKLIADKLE S+VK+E  PEV F KASI +PT S +I ET SIGRKS      
Sbjct: 185  SPKFKDIQKLIADKLELSRVKKEGNPEVNFAKASILSPTRSFSIHETKSIGRKSPPNICL 244

Query: 540  XXXXXXXXXXXXXXXXAKLA--------------------NNTQKAPPIVELFRFLKNQE 659
                                                    ++T+K   IVELF  LK+++
Sbjct: 245  QPPPPLPPPIIGRNSAHSTCLQPPPPPPPPPIPSRPSARLSDTKKGAAIVELFHSLKSKD 304

Query: 660  GNNRKDSKGSVNHQKPAAFSAHSSIVGEIQNRSAHLLAIRADIETKGEFINDLIKKVVDA 839
            G  + DSKG VNHQ+P   SAHSSIVGEIQNRSAHLLAIR DIETKGEF+NDLIKKVVDA
Sbjct: 305  G--KIDSKGPVNHQRPVVISAHSSIVGEIQNRSAHLLAIRTDIETKGEFVNDLIKKVVDA 362

Query: 840  AYVDIEDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEIS 1019
            A+ DIE+VLKFV+WLDG+LS+LADERAVLKHFKWPEK+ADA+REAA+EY ELKMLEQEIS
Sbjct: 363  AFTDIEEVLKFVNWLDGKLSSLADERAVLKHFKWPEKRADALREAAIEYHELKMLEQEIS 422

Query: 1020 SYKDDPDIPCGASLKKMASLLDKSERSIQKLIKLRNSVMRSYQMYSIPTAWMLDSGMMSK 1199
            SYKDDPDIPCGA+LKKMASLLDKSER IQ+LIKLR+SV+ SYQ+Y+IPTAWMLDSG+MS 
Sbjct: 423  SYKDDPDIPCGAALKKMASLLDKSERRIQRLIKLRSSVIHSYQVYNIPTAWMLDSGIMSN 482

Query: 1200 IKQASMTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQFAGGLDSETLCA 1379
            IKQASMTLVKMYMKR+T+ELES+RNSDRES QDSLLLQGVHFAYRAHQFAGGLDSET+C 
Sbjct: 483  IKQASMTLVKMYMKRVTIELESVRNSDRESIQDSLLLQGVHFAYRAHQFAGGLDSETMCC 542

Query: 1380 IEEIRQRVPRHMAGSRELLAGIPSS 1454
             EEIRQRVP H+AGSRELLAGIP S
Sbjct: 543  FEEIRQRVPGHLAGSRELLAGIPLS 567

