BLASTX nr result

ID: Glycyrrhiza32_contig00027967 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00027967
         (1853 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KYP75352.1 hypothetical protein KK1_008076 [Cajanus cajan]            416   e-138
XP_003555274.1 PREDICTED: putative uncharacterized protein DDB_G...   414   e-137
XP_007143034.1 hypothetical protein PHAVU_007G038000g [Phaseolus...   412   e-136
KHN06753.1 hypothetical protein glysoja_021299 [Glycine soja]         412   e-136
XP_003536625.1 PREDICTED: putative uncharacterized protein DDB_G...   412   e-136
XP_015942069.1 PREDICTED: bromodomain and WD repeat-containing D...   409   e-134
XP_016175066.1 PREDICTED: hybrid signal transduction histidine k...   407   e-134
XP_014510893.1 PREDICTED: serine, glycine and glutamine-rich pro...   400   e-131
XP_017409476.1 PREDICTED: serine, glycine and glutamine-rich pro...   398   e-130
KHN42381.1 hypothetical protein glysoja_020093 [Glycine soja]         396   e-130
GAU24292.1 hypothetical protein TSUD_48800 [Trifolium subterraneum]   375   e-121
XP_019441465.1 PREDICTED: heavy metal-associated isoprenylated p...   370   e-119
KRH35766.1 hypothetical protein GLYMA_10G264000 [Glycine max]         368   e-119
AFK47709.1 unknown [Lotus japonicus]                                  365   e-118
XP_002284132.1 PREDICTED: heavy metal-associated isoprenylated p...   340   e-108
XP_018843651.1 PREDICTED: neurogenic protein mastermind-like [Ju...   337   e-107
XP_007045083.1 PREDICTED: myb-like protein I [Theobroma cacao] X...   333   e-105
GAV65585.1 HMA domain-containing protein [Cephalotus follicularis]    333   e-105
EOY00920.1 Heavy metal transport/detoxification superfamily prot...   329   e-104
EOY00919.1 Heavy metal transport/detoxification superfamily prot...   329   e-104

>KYP75352.1 hypothetical protein KK1_008076 [Cajanus cajan]
          Length = 397

 Score =  416 bits (1069), Expect = e-138
 Identities = 248/405 (61%), Positives = 267/405 (65%), Gaps = 7/405 (1%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTV GSVDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVLGSVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQ-QKF 817
            ATLIKKLVR+GKYAELWS                           L KG+EAFKNQ QKF
Sbjct: 61   ATLIKKLVRAGKYAELWS--QKTNQNQKQKNNNAKDEKNKGQKQGLPKGIEAFKNQHQKF 118

Query: 818  PAFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLR-QAADANNVKKGMGAISAVSX 994
            PAFSS                   MR +REKA Q+++L+ Q  +ANNV+KGMG+ISA + 
Sbjct: 119  PAFSSEEDEYYFETDEEDEDEDEEMRMLREKAIQMKMLKHQPPNANNVRKGMGSISAGAN 178

Query: 995  XXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGG-GESLNLGE 1171
                                     D+P     LDQKTMAALKLNNAHL G G +LNLGE
Sbjct: 179  NGKMNNACNANSGKKGGPNHNMAMKDNP---NGLDQKTMAALKLNNAHLNGEGLNLNLGE 235

Query: 1172 AKRASDIGAMMNLAGFNGNGANVGSATVLGG-NSNGLGGFPVQSNNMIPGSSAGIPNGGL 1348
            AKRA+DIGAMMNLAGF+GNGANVGSATVLGG NSNG GGFPVQSNNMIPGSSA  P+GGL
Sbjct: 236  AKRANDIGAMMNLAGFHGNGANVGSATVLGGNNSNGFGGFPVQSNNMIPGSSAAFPSGGL 295

Query: 1349 ATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMYHRSXXXXXXXXXX 1528
            A+GQYPSSLLMNMNGFN  NH SPSPL      QAR AMQQQPQMMYHRS          
Sbjct: 296  ASGQYPSSLLMNMNGFN--NHTSPSPLMMNMNMQARQAMQQQPQMMYHRSPFVPPNTGYY 353

Query: 1529 XXXXXXXXXXXXXXXXXXXV---DDHSSAAHMFSDDNTSSGCSIM 1654
                                   DDH +AAHMFSDDNTSS CSIM
Sbjct: 354  YNHSGYSPAHYSYSYGLPSYPGGDDH-TAAHMFSDDNTSSSCSIM 397


>XP_003555274.1 PREDICTED: putative uncharacterized protein DDB_G0286901 [Glycine
            max] KRG90987.1 hypothetical protein GLYMA_20G126300
            [Glycine max]
          Length = 407

 Score =  414 bits (1065), Expect = e-137
 Identities = 255/414 (61%), Positives = 269/414 (64%), Gaps = 16/414 (3%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            ATLIKKLVR+GK+AELWS                           LVKGLEAFKNQQKFP
Sbjct: 61   ATLIKKLVRAGKHAELWS--QKINQNQKQKNNNAKDDKNKGQKQALVKGLEAFKNQQKFP 118

Query: 821  AFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLR-QAADANNVKKGMGAISA-VSX 994
            AFSS                   MRF+REKANQLQ+L+ Q A+ANNV+KGMGAI+A  + 
Sbjct: 119  AFSSEEDEYYYDDEDDEEDEDEEMRFLREKANQLQMLKQQTANANNVRKGMGAIAAGANN 178

Query: 995  XXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGG-GESLNLGE 1171
                                     DSP     LDQKTMAALK NN HLGG G +LNLGE
Sbjct: 179  GKTNNGDNANSGKKGGPNHQNMGMKDSP--NGGLDQKTMAALKFNNGHLGGDGLNLNLGE 236

Query: 1172 AKRASDIGAMMNLAGFNGNGA--NVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIP 1336
            AKRA+DIGAMMNLAGFNGN    NVGSATVLGG  NSNGLGGFPVQS NNMIPGS+A   
Sbjct: 237  AKRANDIGAMMNLAGFNGNNCANNVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSAAAFS 296

Query: 1337 NGGLATGQYPSSLLMNMNGFNNINHPSPSPL-XXXXXXQARHAMQQQPQMMYHRSXXXXX 1513
            NGGL+ GQYPSSLLMNMNGFN  NHPSPSPL       QAR AMQQQPQMMYHRS     
Sbjct: 297  NGGLSGGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQQARQAMQQQPQMMYHRSPFVPP 354

Query: 1514 XXXXXXXXXXXXXXXXXXXXXXXXV-------DDHSSAAHMFSDDNTSSGCSIM 1654
                                            DDH SAAHMFSDDNTSS CSIM
Sbjct: 355  NTGYYYNHSSYSPAHYSYSYGLPSYPAAAGGGDDH-SAAHMFSDDNTSSSCSIM 407


>XP_007143034.1 hypothetical protein PHAVU_007G038000g [Phaseolus vulgaris]
            ESW15028.1 hypothetical protein PHAVU_007G038000g
            [Phaseolus vulgaris]
          Length = 400

 Score =  412 bits (1059), Expect = e-136
 Identities = 245/407 (60%), Positives = 264/407 (64%), Gaps = 9/407 (2%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            ATLIKKLVR+GKYAELWS                           LVKGL+AFKNQQKFP
Sbjct: 61   ATLIKKLVRAGKYAELWSQKINQNQKQKNNNAKDDKNKGQKQA--LVKGLDAFKNQQKFP 118

Query: 821  AFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXX 997
            AFSS                   MRF+REKANQLQ+L+Q AA+ANNV+K M  + A +  
Sbjct: 119  AFSSEEDEYYSEYDDEDEDEDEEMRFLREKANQLQMLKQQAANANNVRKNMAPMGAGAIN 178

Query: 998  XXXXXXXXXXXXXXXXXXXXXXXX-DSPXXXXXLDQKTMAALKLNNAHLGG-GESLNLGE 1171
                                     +SP     LDQKTMAALKLN  HLGG G +LNLGE
Sbjct: 179  GKMNNGGGNAGNGKKGGPNPNMGVKESPNVG--LDQKTMAALKLNGGHLGGEGLNLNLGE 236

Query: 1172 AKRASDIGAMMNLAGFNGNGANVGSATVLGGNS-NGLGGFPVQSNNMIPGSSAGIPNGGL 1348
            AKRA+DIGAMMN+AGFNGNG NV SATVLG N+ N +GGFPVQSNNMIPGSSA   NGG+
Sbjct: 237  AKRANDIGAMMNMAGFNGNGGNVSSATVLGANNPNAMGGFPVQSNNMIPGSSAAFSNGGM 296

Query: 1349 ATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMYHRSXXXXXXXXXX 1528
            ATGQYPSSLLMNM+GFN  NHPSPSPL      QAR AMQQQPQMMYHRS          
Sbjct: 297  ATGQYPSSLLMNMSGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMYHRSPVIPTNTGYY 354

Query: 1529 XXXXXXXXXXXXXXXXXXXV-----DDHSSAAHMFSDDNTSSGCSIM 1654
                                     DDH SAAHMFSDDNT+S CSIM
Sbjct: 355  YNHSNSYSPAQYSYSYGLPSYPGSGDDH-SAAHMFSDDNTNSSCSIM 400


>KHN06753.1 hypothetical protein glysoja_021299 [Glycine soja]
          Length = 407

 Score =  412 bits (1059), Expect = e-136
 Identities = 257/414 (62%), Positives = 270/414 (65%), Gaps = 16/414 (3%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            ATLIKKLVR+GK+AELWS                           LV+GLEAFKNQQKFP
Sbjct: 61   ATLIKKLVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFP 118

Query: 821  AFSS-XXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS- 991
            AFSS                    MRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S 
Sbjct: 119  AFSSEEDEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSN 178

Query: 992  XXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGGESLNLGE 1171
                                      DSP     LDQKTM+ALKLNN HL GGE LNLGE
Sbjct: 179  NGKMNNGCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGE 235

Query: 1172 AKRASDIGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN 1339
            AKRA+DIGAMMNLAGFNG NGANVGSATVLGG  NSNGLGGFPVQS NNMIPGSSA   N
Sbjct: 236  AKRANDIGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSN 295

Query: 1340 -GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHA-MQQQPQMMYHRSXXXXX 1513
             GGL+ GQYPSSLLMNMNGFN  NHPSPSPL      QAR A MQQQPQMMYHRS     
Sbjct: 296  GGGLSGGQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPP 353

Query: 1514 XXXXXXXXXXXXXXXXXXXXXXXXV-------DDHSSAAHMFSDDNTSSGCSIM 1654
                                            DDH SAAHMFSDDNTSS CSIM
Sbjct: 354  NTGYYYNHSSSYSPAHYSYSSYGLPGYLAAGGDDHHSAAHMFSDDNTSSSCSIM 407


>XP_003536625.1 PREDICTED: putative uncharacterized protein DDB_G0286901 [Glycine
            max] KRH35767.1 hypothetical protein GLYMA_10G264000
            [Glycine max]
          Length = 407

 Score =  412 bits (1059), Expect = e-136
 Identities = 257/414 (62%), Positives = 270/414 (65%), Gaps = 16/414 (3%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            ATLIKKLVR+GK+AELWS                           LV+GLEAFKNQQKFP
Sbjct: 61   ATLIKKLVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFP 118

Query: 821  AFSS-XXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS- 991
            AFSS                    MRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S 
Sbjct: 119  AFSSEEDEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSN 178

Query: 992  XXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGGESLNLGE 1171
                                      DSP     LDQKTM+ALKLNN HL GGE LNLGE
Sbjct: 179  NGKMNNGCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGE 235

Query: 1172 AKRASDIGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN 1339
            AKRA+DIGAMMNLAGFNG NGANVGSATVLGG  NSNGLGGFPVQS NNMIPGSSA   N
Sbjct: 236  AKRANDIGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSN 295

Query: 1340 -GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHA-MQQQPQMMYHRSXXXXX 1513
             GGL+ GQYPSSLLMNMNGFN  NHPSPSPL      QAR A MQQQPQMMYHRS     
Sbjct: 296  GGGLSGGQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPP 353

Query: 1514 XXXXXXXXXXXXXXXXXXXXXXXXV-------DDHSSAAHMFSDDNTSSGCSIM 1654
                                            DDH SAAHMFSDDNTSS CSIM
Sbjct: 354  NTGYYYNHSSSYSPAHYSYSSYGLPGYPAAGGDDHHSAAHMFSDDNTSSSCSIM 407


>XP_015942069.1 PREDICTED: bromodomain and WD repeat-containing DDB_G0285837 [Arachis
            duranensis]
          Length = 417

 Score =  409 bits (1050), Expect = e-134
 Identities = 250/420 (59%), Positives = 262/420 (62%), Gaps = 22/420 (5%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDA+QQKVTVSGSVD+
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDADQQKVTVSGSVDA 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXX---LVKGLEAFKNQQ 811
            ATLIKKLVR+GKYAE WS                              LVKGLEAFKNQQ
Sbjct: 61   ATLIKKLVRAGKYAEPWSQQKTIQNPKQKNNNNIVKDDKNKGGQKQQGLVKGLEAFKNQQ 120

Query: 812  -KFP-AFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLRQ-----AADANNVKKGM 970
             KFP AFSS                   MRFIREKANQL LLRQ     AA+ANN+KKG+
Sbjct: 121  QKFPSAFSSEEDDDYYDYDDEDEDDDEEMRFIREKANQLHLLRQQAAAAAAEANNLKKGV 180

Query: 971  GAISAVSXXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGG 1150
            GAIS  S                                  LDQKTMAALKLNN H+GGG
Sbjct: 181  GAISGGSNNVKMNNNAGNNNNVGKKGGPGNNMGLKDGHGGVLDQKTMAALKLNNGHMGGG 240

Query: 1151 ESLNLGEAKRASDIGAMMNLAGFNGN-----GANVGSATVLGGNSNGLGGFPVQSNNMIP 1315
            E LNLGEAKRASDIGAMMNLAGFNGN       NVGSATVLG NSNGLGGFPV SNNM P
Sbjct: 241  EGLNLGEAKRASDIGAMMNLAGFNGNNNNNVANNVGSATVLGANSNGLGGFPVLSNNMAP 300

Query: 1316 GSSAGI-PNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMYH 1492
            GS+A + PNG  +TGQYPSSLLMNMNGFN  NHPSPSPL      QAR AMQQQPQMMYH
Sbjct: 301  GSTAAVLPNGAFSTGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMYH 358

Query: 1493 RS------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXVDDHSSAAHMFSDDNTSSGCSIM 1654
            RS                                     D +SAAHMFSDDNTSS CSIM
Sbjct: 359  RSPFVPPNTGYYYNHSNNYSPANYSYALPNYYPHQCPATDDNSAAHMFSDDNTSS-CSIM 417


>XP_016175066.1 PREDICTED: hybrid signal transduction histidine kinase A [Arachis
            ipaensis]
          Length = 418

 Score =  407 bits (1046), Expect = e-134
 Identities = 249/421 (59%), Positives = 262/421 (62%), Gaps = 23/421 (5%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDA+QQKVTVSGSVD+
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDADQQKVTVSGSVDA 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXX---LVKGLEAFKNQQ 811
            ATLIKKLVR+GKYAE WS                              LVKGLEAFKNQQ
Sbjct: 61   ATLIKKLVRAGKYAEPWSQQKTNQNPKQKNNNNIVKDDKNKGGQKQQGLVKGLEAFKNQQ 120

Query: 812  -KFP-AFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLRQ------AADANNVKKG 967
             KFP AFSS                   MRFIREKANQL LLRQ      AA+ANN+KKG
Sbjct: 121  QKFPSAFSSEEDDDYYDYDDEDEDDDEEMRFIREKANQLHLLRQQAAAAAAAEANNLKKG 180

Query: 968  MGAISAVSXXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGG 1147
            +GAIS  S                                  LDQKTMAALKLNN H+GG
Sbjct: 181  VGAISGGSNNVKMNNNAGNNNNVGKKGGPGNNMGLKDGHGGVLDQKTMAALKLNNGHMGG 240

Query: 1148 GESLNLGEAKRASDIGAMMNLAGFNGN-----GANVGSATVLGGNSNGLGGFPVQSNNMI 1312
            GE LNLGEAKRASDIGAMMNLAGFNGN       NVG+ATVLG NSNGLGGFPV SNNM 
Sbjct: 241  GEGLNLGEAKRASDIGAMMNLAGFNGNNNNNVANNVGNATVLGANSNGLGGFPVLSNNMA 300

Query: 1313 PGSSAGI-PNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMY 1489
            PGS+A + PNG  +TGQYPSSLLMNMNGFN  NHPSPSPL      QAR AMQQQPQMMY
Sbjct: 301  PGSTAAVLPNGAFSTGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMY 358

Query: 1490 HRS------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXVDDHSSAAHMFSDDNTSSGCSI 1651
            HRS                                     D +SAAHMFSDDNTSS CSI
Sbjct: 359  HRSPFVPPNTGYYYNHSNNYSPANYSYALPNYYPHQCPATDDNSAAHMFSDDNTSS-CSI 417

Query: 1652 M 1654
            M
Sbjct: 418  M 418


>XP_014510893.1 PREDICTED: serine, glycine and glutamine-rich protein [Vigna radiata
            var. radiata]
          Length = 402

 Score =  400 bits (1027), Expect = e-131
 Identities = 240/409 (58%), Positives = 257/409 (62%), Gaps = 11/409 (2%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            ATLIKKLVR+GKYAELWS                           L KGL+AFKNQQKFP
Sbjct: 61   ATLIKKLVRAGKYAELWSQKTNQNQKQKNNNAKDDKNKGQKQG--LAKGLDAFKNQQKFP 118

Query: 821  AFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLRQAADANNVKKGMGAISAVSXXX 1000
            AFSS                   MRF+REKA+ LQ+L+Q A   NV+K MG + A +   
Sbjct: 119  AFSSEEDEYYSEYEDDDEEEDEEMRFLREKAHHLQMLKQQAANANVRKSMGGMGAGAING 178

Query: 1001 XXXXXXXXXXXXXXXXXXXXXXX---DSPXXXXXLDQKTMAALKLNNAHLGG-GESLNLG 1168
                                      +SP     LDQKTMAALKLN  H+GG G  LNLG
Sbjct: 179  KMNNGGGNGGGGGGKKGGPNPNMGMKESPNGG--LDQKTMAALKLNGGHVGGEGLGLNLG 236

Query: 1169 EAKRASDIGAMMNLAGFNGNGANVGSATVLGGN-SNGLGGFPVQSNNMIPGSSAGIPNGG 1345
            EAKRA+DIGAMMN+AGFNGNG NV SATVLG N S+G+GGFPVQSNNMIPGSSAG  NGG
Sbjct: 237  EAKRANDIGAMMNMAGFNGNGGNVTSATVLGANNSSGMGGFPVQSNNMIPGSSAGFSNGG 296

Query: 1346 LATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXX--QARHAMQQQPQMMYHRSXXXXXXX 1519
            +  GQYPSSLLMNMNGFNN  HPSPSPL        QAR AMQQQPQMMYHRS       
Sbjct: 297  IGAGQYPSSLLMNMNGFNN--HPSPSPLMMNMNMNMQARQAMQQQPQMMYHRSPLIPPNT 354

Query: 1520 XXXXXXXXXXXXXXXXXXXXXXV----DDHSSAAHMFSDDNTSSGCSIM 1654
                                       DDH SA HMFSDDNTSS CSIM
Sbjct: 355  GYYYNHSNSYSPAQYSYSYGLPSYPGGDDH-SATHMFSDDNTSSSCSIM 402


>XP_017409476.1 PREDICTED: serine, glycine and glutamine-rich protein [Vigna
            angularis] KOM28905.1 hypothetical protein
            LR48_Vigan609s003700 [Vigna angularis] BAT93876.1
            hypothetical protein VIGAN_08042400 [Vigna angularis var.
            angularis]
          Length = 404

 Score =  398 bits (1022), Expect = e-130
 Identities = 240/411 (58%), Positives = 256/411 (62%), Gaps = 13/411 (3%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            ATLIKKLVR+GKYAELWS                           L KGL+AFKNQQKFP
Sbjct: 61   ATLIKKLVRAGKYAELWSQKSNQNQKQKNNNAKDDKNKGQKQG--LPKGLDAFKNQQKFP 118

Query: 821  AFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLRQAADANNVKKGMG-----AISA 985
            AFSS                   MRF+REKA+ LQ+L+Q     NV+K MG     AI+ 
Sbjct: 119  AFSSEEDEYYSEYEDDDEDEDEEMRFLREKAHHLQMLKQQTANANVRKSMGGMGAGAING 178

Query: 986  VSXXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGG-GESLN 1162
                                        +SP     LDQKTMAALKLN  H+GG G  LN
Sbjct: 179  KMNNGGGNGGGGGGGGKKGGPNPNMGMKESPNVG--LDQKTMAALKLNGGHVGGEGLGLN 236

Query: 1163 LGEAKRASDIGAMMNLAGFNGNGANVGSATVLGGN-SNGLGGFPVQSNNMIPGSSAGIPN 1339
            LGEAKRA+DIGAMMN+AGFNGNG NV SATVLG N S+G+GGFPVQSNNMIPGSSAG  N
Sbjct: 237  LGEAKRANDIGAMMNMAGFNGNGGNVTSATVLGANNSSGMGGFPVQSNNMIPGSSAGFSN 296

Query: 1340 GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXX--QARHAMQQQPQMMYHRSXXXXX 1513
            GG+  GQYPSSLLMNMNGFNN  HPSPSPL        QAR AMQQQPQMMYHRS     
Sbjct: 297  GGIGAGQYPSSLLMNMNGFNN--HPSPSPLMMNMNMNMQARQAMQQQPQMMYHRSPLIPP 354

Query: 1514 XXXXXXXXXXXXXXXXXXXXXXXXV----DDHSSAAHMFSDDNTSSGCSIM 1654
                                         DDH SA HMFSDDNTSS CSIM
Sbjct: 355  NTGYYYNHSNSYSPAQYAYSYGLPSYPGGDDH-SATHMFSDDNTSSSCSIM 404


>KHN42381.1 hypothetical protein glysoja_020093 [Glycine soja]
          Length = 363

 Score =  396 bits (1017), Expect = e-130
 Identities = 235/355 (66%), Positives = 249/355 (70%), Gaps = 9/355 (2%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            ATLIKKLVR+GK+AELWS                           LVKGLEAFKNQQKFP
Sbjct: 61   ATLIKKLVRAGKHAELWS--QKINQNQKQKNNNAKDDKNKGQKQALVKGLEAFKNQQKFP 118

Query: 821  AFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLR-QAADANNVKKGMGAISA-VSX 994
            AFSS                   MRF+REKANQLQ+L+ Q A+ANNV+KGMGAI+A  + 
Sbjct: 119  AFSSEEDEYYYDDEDDEEDEDEEMRFLREKANQLQMLKQQTANANNVRKGMGAIAAGANN 178

Query: 995  XXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGG-GESLNLGE 1171
                                     DSP     LDQKTMAALK NN HLGG G +LNLGE
Sbjct: 179  GKTNNGDNANSGKKGGPNHQNMGMKDSP--NGGLDQKTMAALKFNNGHLGGDGLNLNLGE 236

Query: 1172 AKRASDIGAMMNLAGFNGNGA--NVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIP 1336
            AKRA+DIGAMMNLAGFNGN    NVGSATVLGG  NSNGLGGFPVQS NNMIPGS+A   
Sbjct: 237  AKRANDIGAMMNLAGFNGNNCANNVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSAAAFS 296

Query: 1337 NGGLATGQYPSSLLMNMNGFNNINHPSPSPL-XXXXXXQARHAMQQQPQMMYHRS 1498
            NGGL+ GQYPSSLLMNMNGFN  NHPSPSPL       QAR AMQQQPQMMYHRS
Sbjct: 297  NGGLSGGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQQARQAMQQQPQMMYHRS 349


>GAU24292.1 hypothetical protein TSUD_48800 [Trifolium subterraneum]
          Length = 399

 Score =  375 bits (962), Expect = e-121
 Identities = 228/411 (55%), Positives = 249/411 (60%), Gaps = 13/411 (3%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVD+
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDA 60

Query: 641  ATLIKKLVRSGKYAELWS-XXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKF 817
            ATLIKKLVRSGKYAELWS                            +VKGLEAFKNQQKF
Sbjct: 61   ATLIKKLVRSGKYAELWSQKTNNNQNQKQKNNNIVKDDKNKGQKQVVVKGLEAFKNQQKF 120

Query: 818  PAFSS---XXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLR-QAADANNVKKGMGAISA 985
            PAFSS                       R+IRE ANQ+Q++R Q  DANN KK +GA   
Sbjct: 121  PAFSSEEDGGYYGGYGDDDDEEEEDQETRYIREAANQIQMMRQQVVDANNAKKAIGA--- 177

Query: 986  VSXXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGGESLNL 1165
                                                +DQKT+AA+KLNN HL G ES+NL
Sbjct: 178  -----KMNNAGNVNGNSGKKGNSNQNMVGMKESANGVDQKTIAAMKLNNGHLVGNESMNL 232

Query: 1166 GEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSN--NMIPGSSAG-IP 1336
            GE+KR SDIGAMMNLAGFNGN   VG+AT+LGGNSNGLGGFPVQSN  NMI GSSA  IP
Sbjct: 233  GESKRVSDIGAMMNLAGFNGNNNVVGNATILGGNSNGLGGFPVQSNNTNMIQGSSAATIP 292

Query: 1337 NGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAM-QQQPQMMYHRSXXXXX 1513
            NGG  TGQ P S++MNMNGFNN     PS L      QARH M QQQPQMMYHRS     
Sbjct: 293  NGGFVTGQIPPSMMMNMNGFNN----HPSSLMNMNMQQARHVMQQQQPQMMYHRSPYVPP 348

Query: 1514 XXXXXXXXXXXXXXXXXXXXXXXXVDDH----SSAAHMFSDDNTSSGCSIM 1654
                                    +  +    +SAAHMFSDDNT+S CSIM
Sbjct: 349  NTGYYYNNYNNYIPPNATNYSSYAMPSYPTEDNSAAHMFSDDNTTSSCSIM 399


>XP_019441465.1 PREDICTED: heavy metal-associated isoprenylated plant protein 37-like
            [Lupinus angustifolius] OIW12890.1 hypothetical protein
            TanjilG_24823 [Lupinus angustifolius]
          Length = 402

 Score =  370 bits (949), Expect = e-119
 Identities = 231/415 (55%), Positives = 249/415 (60%), Gaps = 17/415 (4%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VD+
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDA 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXX---LVKGLEAFKNQQ 811
            ATLIKKL R+GKYA+LWS                              LVKGLE FKNQQ
Sbjct: 61   ATLIKKLARAGKYAQLWSQKSSNQNQKQNNNNNNCVKDDNKNKGQKQGLVKGLEDFKNQQ 120

Query: 812  KFPAFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLRQ-AADANNVKKGMGAISAV 988
            KFPAFSS                   MRF+RE+ NQLQ+LRQ A DANN  K   A++  
Sbjct: 121  KFPAFSSEEDDDFYDYDDDEDDDDEEMRFMRERVNQLQMLRQQAVDANNAAKNGVAVNNN 180

Query: 989  SXXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGG--ESLN 1162
                                               LDQKT+AALK+NN HLGGG  E LN
Sbjct: 181  GKINNNAGKKGGPNQNMVIKDNTGG----------LDQKTIAALKMNNGHLGGGGGEGLN 230

Query: 1163 LGEAKRASDIGAMMNLAGFNGNGA-NVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGIPN 1339
            +G++KRA+DIG MMNLAGFNGNGA N GSATVLG NSNGLGGF  QS NMIPGSSA IPN
Sbjct: 231  IGDSKRANDIGVMMNLAGFNGNGANNAGSATVLGPNSNGLGGFSAQS-NMIPGSSAVIPN 289

Query: 1340 GGL-ATG-QYPSSLLMNMNGFNNINHPSPSPL-XXXXXXQARHAMQQQPQMMYHRSXXXX 1510
            G   ATG QYPSSLLMNMNGFN  NHPSPSPL       QARHAMQQQPQMMYHRS    
Sbjct: 290  GAFAATGQQYPSSLLMNMNGFN--NHPSPSPLMMNNMNMQARHAMQQQPQMMYHRSPYIP 347

Query: 1511 XXXXXXXXXXXXXXXXXXXXXXXXXVD-------DHSSAAHMFSDDNTSSGCSIM 1654
                                                +SA H+FSDD T S CS+M
Sbjct: 348  PNTGYYYNHNLNNNHTPANYNYATMPSYPVGGGGSDNSATHIFSDDYTGSSCSVM 402


>KRH35766.1 hypothetical protein GLYMA_10G264000 [Glycine max]
          Length = 392

 Score =  368 bits (944), Expect = e-119
 Identities = 242/414 (58%), Positives = 255/414 (61%), Gaps = 16/414 (3%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQ               KVKKLLQRIEGVYQVQIDAEQQKVTVSG VDS
Sbjct: 1    MTKEEDFKLLKIQ---------------KVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDS 45

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            ATLIKKLVR+GK+AELWS                           LV+GLEAFKNQQKFP
Sbjct: 46   ATLIKKLVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFP 103

Query: 821  AFSS-XXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS- 991
            AFSS                    MRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S 
Sbjct: 104  AFSSEEDEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSN 163

Query: 992  XXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGGESLNLGE 1171
                                      DSP     LDQKTM+ALKLNN HL GGE LNLGE
Sbjct: 164  NGKMNNGCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGE 220

Query: 1172 AKRASDIGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN 1339
            AKRA+DIGAMMNLAGFNG NGANVGSATVLGG  NSNGLGGFPVQS NNMIPGSSA   N
Sbjct: 221  AKRANDIGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSN 280

Query: 1340 -GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHA-MQQQPQMMYHRSXXXXX 1513
             GGL+ GQYPSSLLMNMNGFN  NHPSPSPL      QAR A MQQQPQMMYHRS     
Sbjct: 281  GGGLSGGQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPP 338

Query: 1514 XXXXXXXXXXXXXXXXXXXXXXXXV-------DDHSSAAHMFSDDNTSSGCSIM 1654
                                            DDH SAAHMFSDDNTSS CSIM
Sbjct: 339  NTGYYYNHSSSYSPAHYSYSSYGLPGYPAAGGDDHHSAAHMFSDDNTSSSCSIM 392


>AFK47709.1 unknown [Lotus japonicus]
          Length = 400

 Score =  365 bits (937), Expect = e-118
 Identities = 233/419 (55%), Positives = 247/419 (58%), Gaps = 21/419 (5%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXX---LVKGLEA-FKN- 805
            A LIKKL RSGK+AELWS                              LVKGLEA FKN 
Sbjct: 61   AALIKKLNRSGKHAELWSQKANQNQKQKNNNNINNVKDDKNNKGQKQGLVKGLEAAFKNH 120

Query: 806  ---QQKFPAFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLRQAADA-----NNVK 961
               QQKFPAFSS                   +RFIREKANQLQLLRQ   A     NNVK
Sbjct: 121  QQQQQKFPAFSSEEDDEYYDYDDEDDDDEE-LRFIREKANQLQLLRQQQQAVVDANNNVK 179

Query: 962  KGMGAISAVSXXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHL 1141
            K + A S                              S       DQKTMAALKLNNAHL
Sbjct: 180  KAISAASNNGHNKMNNAAGKKGGQNQNMGGMKESNVGS-------DQKTMAALKLNNAHL 232

Query: 1142 GGGESLNLGEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGS 1321
            GGGESLNLGEAKRA+DIGAMMNLAGF  NG N G+ATVLGGNSNG+GGFPVQSNNM  G+
Sbjct: 233  GGGESLNLGEAKRANDIGAMMNLAGF--NGGNAGNATVLGGNSNGMGGFPVQSNNMFQGN 290

Query: 1322 S-AGIPNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMYHRS 1498
            S A +PNGG     Y  S+LMNMNGFNN      SP+      Q RHAMQQQPQMM+HRS
Sbjct: 291  SPAAVPNGG-----YAPSMLMNMNGFNN----HQSPMMNMNMMQTRHAMQQQPQMMFHRS 341

Query: 1499 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXV-------DDHSSAAHMFSDDNTSSGCSIM 1654
                                                  DH SAAHMFSDDNT+S CS+M
Sbjct: 342  PVIPPNTGYYFNHNNYNPAANYSYYASLPSYPGGDYDHDHHSAAHMFSDDNTTSSCSVM 400


>XP_002284132.1 PREDICTED: heavy metal-associated isoprenylated plant protein 37
            [Vitis vinifera]
          Length = 390

 Score =  340 bits (872), Expect = e-108
 Identities = 210/401 (52%), Positives = 237/401 (59%), Gaps = 3/401 (0%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVY V IDAEQQ+VTVSGSVDS
Sbjct: 1    MTKDEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYTVNIDAEQQRVTVSGSVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
             TLIKKLV++GK+AELWS                           L+KGLEAFK QQKFP
Sbjct: 61   GTLIKKLVKAGKHAELWS-QKSNQNQKQKTNCIKDDKNNKGQKQGLIKGLEAFKTQQKFP 119

Query: 821  AFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLR-QAADANNVKKGMGAISAVSXX 997
             FSS                   +RF++EKANQL LLR QA DA+N KKG GAI+A +  
Sbjct: 120  VFSS-EEDEDDFDDDEEDYEEEELRFLQEKANQLSLLRQQALDASNAKKGFGAIAASNNG 178

Query: 998  XXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGGESLNLGEAK 1177
                                     SP     +DQKT+AALK+NN HL GG ++N GE K
Sbjct: 179  KINNNVGNGNVQKKGNPNQNMGMKGSP---GGIDQKTIAALKMNNPHLVGGGNINSGEVK 235

Query: 1178 RASDIGAMMNLAGFNGNGANV-GSATVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLAT 1354
            R +DI +MM L GF+GNG NV  +A  LGGNSN LGGF +Q NN   GSS G PNGG AT
Sbjct: 236  RGNDINSMMGLGGFHGNGGNVAATAAALGGNSNALGGFQIQPNNGFQGSSTGFPNGGFAT 295

Query: 1355 G-QYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMYHRSXXXXXXXXXXX 1531
            G  +PS +LMN+NG N  NHPS   +      Q RHA  QQPQMMYHRS           
Sbjct: 296  GHHHPSPMLMNLNG-NQYNHPS-QMMMNMNMQQNRHAPMQQPQMMYHRSPFIPPSTGYYY 353

Query: 1532 XXXXXXXXXXXXXXXXXXVDDHSSAAHMFSDDNTSSGCSIM 1654
                                DH SA+HMFSD+NTSS CSIM
Sbjct: 354  NYSPALSPYTHCDTNYS--GDH-SASHMFSDENTSS-CSIM 390


>XP_018843651.1 PREDICTED: neurogenic protein mastermind-like [Juglans regia]
          Length = 378

 Score =  337 bits (864), Expect = e-107
 Identities = 214/401 (53%), Positives = 237/401 (59%), Gaps = 3/401 (0%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVY V IDAEQQKVTVSGSVD+
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYLVNIDAEQQKVTVSGSVDA 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            +TLIKKLVR+GK+AE WS                           L KGLEAFKNQQKFP
Sbjct: 61   STLIKKLVRAGKHAEPWS-QKNTQNQKQMNNCVKDAKNNKSQKPLLFKGLEAFKNQQKFP 119

Query: 821  AFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXX 997
            AFSS                   +RFIREKANQL LLRQ A DANN KKG+ AI A S  
Sbjct: 120  AFSS-EEEDDYFDDVEEDEEEDELRFIREKANQLNLLRQRAIDANNAKKGVAAIGAAS-N 177

Query: 998  XXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGGESLNLGEAK 1177
                                            +D KT+AALK+N+AHLGGG ++N GE +
Sbjct: 178  NGKMNNVGNGNIGNGKKANTEQNMGMRASPAGIDPKTLAALKINSAHLGGG-NVNAGEGR 236

Query: 1178 RASDIGAMMNLAGFNGNGANVGS--ATVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLA 1351
            R SD+  MM LAGF+GNG NV S  A  LGGNSNGLGGF         GSSAG P GG A
Sbjct: 237  RVSDLNGMMGLAGFHGNGLNVASAGAAALGGNSNGLGGF--------QGSSAGFPTGGYA 288

Query: 1352 TGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMYHRSXXXXXXXXXXX 1531
            TGQYPSS+LMNMNG    NHPSP  +      QAR+AMQQQPQMMYHRS           
Sbjct: 289  TGQYPSSMLMNMNGH---NHPSPMMM----NMQARNAMQQQPQMMYHRSPHVPPTTGYYY 341

Query: 1532 XXXXXXXXXXXXXXXXXXVDDHSSAAHMFSDDNTSSGCSIM 1654
                                  +SAA+MFSD+NT+S CSIM
Sbjct: 342  NYSPSPNPYSYTDPNH---TGRNSAAYMFSDENTNS-CSIM 378


>XP_007045083.1 PREDICTED: myb-like protein I [Theobroma cacao] XP_007045084.1
            PREDICTED: myb-like protein I [Theobroma cacao]
            XP_007045086.1 PREDICTED: myb-like protein I [Theobroma
            cacao] EOY00915.1 Heavy metal transport/detoxification
            superfamily protein isoform 1 [Theobroma cacao]
            EOY00916.1 Heavy metal transport/detoxification
            superfamily protein isoform 1 [Theobroma cacao]
            EOY00917.1 Heavy metal transport/detoxification
            superfamily protein isoform 1 [Theobroma cacao]
            EOY00918.1 Heavy metal transport/detoxification
            superfamily protein isoform 1 [Theobroma cacao]
          Length = 392

 Score =  333 bits (855), Expect = e-105
 Identities = 208/404 (51%), Positives = 234/404 (57%), Gaps = 6/404 (1%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQV IDAEQQKVTVSGSVDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVSIDAEQQKVTVSGSVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            ATLIKKLVR+GK+AE+WS                           L+KGLEAFK QQKFP
Sbjct: 61   ATLIKKLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKFP 119

Query: 821  AFSSXXXXXXXXXXXXXXXXXXXMRFIRE----KANQLQLLR-QAADANNVKKGMGAISA 985
            +F S                   ++F++     +  QL LLR QA DANN K G+G I+A
Sbjct: 120  SFVS-EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNITA 178

Query: 986  VSXXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGGESLNL 1165
             S                                  LDQKT+AALK+NNA L GG ++N 
Sbjct: 179  TS-NNNNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNINA 236

Query: 1166 GEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PNG 1342
             E KR  DI  +M L+GF+GNGANV  A  LGGN N +GGF VQSNN + GSSA I  NG
Sbjct: 237  AEGKRGHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQNG 296

Query: 1343 GLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMYHRSXXXXXXXX 1522
            G  TGQ PSS+LMNMNG+N      PS +      Q RHAMQQQPQMMYHRS        
Sbjct: 297  GYVTGQNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPSTG 351

Query: 1523 XXXXXXXXXXXXXXXXXXXXXVDDHSSAAHMFSDDNTSSGCSIM 1654
                                   DHS+A HMFSDDNTSS CSIM
Sbjct: 352  YYYNYGPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 392


>GAV65585.1 HMA domain-containing protein [Cephalotus follicularis]
          Length = 383

 Score =  333 bits (853), Expect = e-105
 Identities = 205/401 (51%), Positives = 235/401 (58%), Gaps = 3/401 (0%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 640
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGV+QV IDAEQQ+VT+SGSVDS
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVFQVNIDAEQQRVTISGSVDS 60

Query: 641  ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKFP 820
            ATLIKKLVR+GK+AELWS                           L+KGLE+ KNQQKFP
Sbjct: 61   ATLIKKLVRAGKHAELWSQKSNQNQKQKNNCIKEDKNNESQKQG-LIKGLESLKNQQKFP 119

Query: 821  AFSSXXXXXXXXXXXXXXXXXXXMRFIREKANQLQLLRQAA--DANNVKKGMGAISAVSX 994
            AFSS                   +RF+ +  +QL LLRQ A  +ANN KKG GAI+A + 
Sbjct: 120  AFSSEEDDDYLDDDEDDDEEVEQLRFLEKANHQLGLLRQQAAIEANNAKKG-GAIAAAAA 178

Query: 995  XXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGGESLNLGEA 1174
                                             LDQKTMAALK+NNAHLGGG ++N GE 
Sbjct: 179  NNGKMNTSVGNLNTGKKGNPNQNTGIK-VNPGGLDQKTMAALKMNNAHLGGG-NINTGEV 236

Query: 1175 KRASDIGAMMNLAGFNGNGANVGSA-TVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLA 1351
            KR +D+  MM L GF+GNGAN+G+A T LGGN+NGLGG  VQ N     S AG PNGG A
Sbjct: 237  KRGNDLSTMMGLTGFHGNGANIGNAATALGGNANGLGGIQVQPNGYQGSSGAGFPNGGYA 296

Query: 1352 TGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMYHRSXXXXXXXXXXX 1531
            TGQYPS++LMNMNG+N+       P       Q RH    QPQMMYHRS           
Sbjct: 297  TGQYPSAMLMNMNGYNH-------PASMMMNMQNRH---PQPQMMYHRSPYIPASTGYYH 346

Query: 1532 XXXXXXXXXXXXXXXXXXVDDHSSAAHMFSDDNTSSGCSIM 1654
                                 HS+A HMFSD+NTSS CSIM
Sbjct: 347  NYSPSPYSYTEQPNHRI---GHSAATHMFSDENTSS-CSIM 383


>EOY00920.1 Heavy metal transport/detoxification superfamily protein isoform 6
            [Theobroma cacao]
          Length = 393

 Score =  329 bits (843), Expect = e-104
 Identities = 208/405 (51%), Positives = 234/405 (57%), Gaps = 7/405 (1%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQ-TCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVD 637
            MTK+EDFKLLKIQ TCVLKVNIHCDGCKQKVKKLLQRIEGVYQV IDAEQQKVTVSGSVD
Sbjct: 1    MTKEEDFKLLKIQQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVSIDAEQQKVTVSGSVD 60

Query: 638  SATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKF 817
            SATLIKKLVR+GK+AE+WS                           L+KGLEAFK QQKF
Sbjct: 61   SATLIKKLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKF 119

Query: 818  PAFSSXXXXXXXXXXXXXXXXXXXMRFIRE----KANQLQLLR-QAADANNVKKGMGAIS 982
            P+F S                   ++F++     +  QL LLR QA DANN K G+G I+
Sbjct: 120  PSFVS-EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNIT 178

Query: 983  AVSXXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGGESLN 1162
            A S                                  LDQKT+AALK+NNA L GG ++N
Sbjct: 179  ATS-NNNNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNIN 236

Query: 1163 LGEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PN 1339
              E KR  DI  +M L+GF+GNGANV  A  LGGN N +GGF VQSNN + GSSA I  N
Sbjct: 237  AAEGKRGHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQN 296

Query: 1340 GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMYHRSXXXXXXX 1519
            GG  TGQ PSS+LMNMNG+N      PS +      Q RHAMQQQPQMMYHRS       
Sbjct: 297  GGYVTGQNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPST 351

Query: 1520 XXXXXXXXXXXXXXXXXXXXXXVDDHSSAAHMFSDDNTSSGCSIM 1654
                                    DHS+A HMFSDDNTSS CSIM
Sbjct: 352  GYYYNYGPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 393


>EOY00919.1 Heavy metal transport/detoxification superfamily protein isoform 5
            [Theobroma cacao]
          Length = 393

 Score =  329 bits (843), Expect = e-104
 Identities = 208/405 (51%), Positives = 234/405 (57%), Gaps = 7/405 (1%)
 Frame = +2

Query: 461  MTKKEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIE-GVYQVQIDAEQQKVTVSGSVD 637
            MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIE GVYQV IDAEQQKVTVSGSVD
Sbjct: 1    MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGGVYQVSIDAEQQKVTVSGSVD 60

Query: 638  SATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXXLVKGLEAFKNQQKF 817
            SATLIKKLVR+GK+AE+WS                           L+KGLEAFK QQKF
Sbjct: 61   SATLIKKLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKF 119

Query: 818  PAFSSXXXXXXXXXXXXXXXXXXXMRFIRE----KANQLQLLR-QAADANNVKKGMGAIS 982
            P+F S                   ++F++     +  QL LLR QA DANN K G+G I+
Sbjct: 120  PSFVS-EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNIT 178

Query: 983  AVSXXXXXXXXXXXXXXXXXXXXXXXXXXDSPXXXXXLDQKTMAALKLNNAHLGGGESLN 1162
            A S                                  LDQKT+AALK+NNA L GG ++N
Sbjct: 179  ATS-NNNNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNIN 236

Query: 1163 LGEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PN 1339
              E KR  DI  +M L+GF+GNGANV  A  LGGN N +GGF VQSNN + GSSA I  N
Sbjct: 237  AAEGKRGHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQN 296

Query: 1340 GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXXQARHAMQQQPQMMYHRSXXXXXXX 1519
            GG  TGQ PSS+LMNMNG+N      PS +      Q RHAMQQQPQMMYHRS       
Sbjct: 297  GGYVTGQNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPST 351

Query: 1520 XXXXXXXXXXXXXXXXXXXXXXVDDHSSAAHMFSDDNTSSGCSIM 1654
                                    DHS+A HMFSDDNTSS CSIM
Sbjct: 352  GYYYNYGPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 393

