BLASTX nr result

ID: Glycyrrhiza29_contig00016981 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza29_contig00016981
         (1377 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KYP75352.1 hypothetical protein KK1_008076 [Cajanus cajan]            405   e-136
XP_003555274.1 PREDICTED: putative uncharacterized protein DDB_G...   404   e-135
XP_007143034.1 hypothetical protein PHAVU_007G038000g [Phaseolus...   402   e-134
KHN06753.1 hypothetical protein glysoja_021299 [Glycine soja]         402   e-134
XP_003536625.1 PREDICTED: putative uncharacterized protein DDB_G...   402   e-134
XP_015942069.1 PREDICTED: bromodomain and WD repeat-containing D...   398   e-133
XP_016175066.1 PREDICTED: hybrid signal transduction histidine k...   397   e-132
XP_014510893.1 PREDICTED: serine, glycine and glutamine-rich pro...   389   e-129
XP_017409476.1 PREDICTED: serine, glycine and glutamine-rich pro...   387   e-129
KHN42381.1 hypothetical protein glysoja_020093 [Glycine soja]         385   e-128
GAU24292.1 hypothetical protein TSUD_48800 [Trifolium subterraneum]   364   e-120
XP_019441465.1 PREDICTED: heavy metal-associated isoprenylated p...   359   e-118
KRH35766.1 hypothetical protein GLYMA_10G264000 [Glycine max]         357   e-117
AFK47709.1 unknown [Lotus japonicus]                                  355   e-116
XP_002284132.1 PREDICTED: heavy metal-associated isoprenylated p...   330   e-106
XP_018843651.1 PREDICTED: neurogenic protein mastermind-like [Ju...   327   e-105
XP_007045083.1 PREDICTED: myb-like protein I [Theobroma cacao] X...   323   e-104
GAV65585.1 HMA domain-containing protein [Cephalotus follicularis]    322   e-103
EOY00920.1 Heavy metal transport/detoxification superfamily prot...   318   e-102
EOY00919.1 Heavy metal transport/detoxification superfamily prot...   318   e-102

>KYP75352.1 hypothetical protein KK1_008076 [Cajanus cajan]
          Length = 397

 Score =  405 bits (1042), Expect = e-136
 Identities = 248/399 (62%), Positives = 266/399 (66%), Gaps = 7/399 (1%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTV GSVDSATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVLGSVDSATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQ-QKFPAFSSX 1019
            LVR+GKYAELWS                          GL KG+EAFKNQ QKFPAFSS 
Sbjct: 67   LVRAGKYAELWS--QKTNQNQKQKNNNAKDEKNKGQKQGLPKGIEAFKNQHQKFPAFSSE 124

Query: 1018 XXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVSXXXXXXX 842
                             EMR +REKA Q+++L+ Q  +ANNV+KGMG+ISA +       
Sbjct: 125  EDEYYFETDEEDEDEDEEMRMLREKAIQMKMLKHQPPNANNVRKGMGSISAGANNGKMNN 184

Query: 841  XXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKRASD 665
                              KD+P    GLDQKTMAALKLNNAHL G G +LNLGEAKRA+D
Sbjct: 185  ACNANSGKKGGPNHNMAMKDNP---NGLDQKTMAALKLNNAHLNGEGLNLNLGEAKRAND 241

Query: 664  IGAMMNLAGFNGNGANVGSATVLGG-NSNGLGGFPVQSNNMIPGSSAGIPNGGLATGQYP 488
            IGAMMNLAGF+GNGANVGSATVLGG NSNG GGFPVQSNNMIPGSSA  P+GGLA+GQYP
Sbjct: 242  IGAMMNLAGFHGNGANVGSATVLGGNNSNGFGGFPVQSNNMIPGSSAAFPSGGLASGQYP 301

Query: 487  SSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXXXX 308
            SSLLMNMNGFN  NH SPSPL     MQAR AMQQQPQMMYHRS                
Sbjct: 302  SSLLMNMNGFN--NHTSPSPLMMNMNMQARQAMQQQPQMMYHRSPFVPPNTGYYYNHSGY 359

Query: 307  XXXXXXXXXXXXPV---DDHSSAAHMFSDDNTSSGCSIM 200
                             DDH +AAHMFSDDNTSS CSIM
Sbjct: 360  SPAHYSYSYGLPSYPGGDDH-TAAHMFSDDNTSSSCSIM 397


>XP_003555274.1 PREDICTED: putative uncharacterized protein DDB_G0286901 [Glycine
            max] KRG90987.1 hypothetical protein GLYMA_20G126300
            [Glycine max]
          Length = 407

 Score =  404 bits (1038), Expect = e-135
 Identities = 253/408 (62%), Positives = 266/408 (65%), Gaps = 16/408 (3%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSXX 1016
            LVR+GK+AELWS                           LVKGLEAFKNQQKFPAFSS  
Sbjct: 67   LVRAGKHAELWS--QKINQNQKQKNNNAKDDKNKGQKQALVKGLEAFKNQQKFPAFSSEE 124

Query: 1015 XXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISA-VSXXXXXXX 842
                            EMRF+REKANQLQ+L+ Q A+ANNV+KGMGAI+A  +       
Sbjct: 125  DEYYYDDEDDEEDEDEEMRFLREKANQLQMLKQQTANANNVRKGMGAIAAGANNGKTNNG 184

Query: 841  XXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKRASD 665
                              KDSP    GLDQKTMAALK NN HLGG G +LNLGEAKRA+D
Sbjct: 185  DNANSGKKGGPNHQNMGMKDSP--NGGLDQKTMAALKFNNGHLGGDGLNLNLGEAKRAND 242

Query: 664  IGAMMNLAGFNGNGA--NVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPNGGLAT 500
            IGAMMNLAGFNGN    NVGSATVLGG  NSNGLGGFPVQS NNMIPGS+A   NGGL+ 
Sbjct: 243  IGAMMNLAGFNGNNCANNVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSAAAFSNGGLSG 302

Query: 499  GQYPSSLLMNMNGFNNINHPSPSPL-XXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXX 323
            GQYPSSLLMNMNGFN  NHPSPSPL       QAR AMQQQPQMMYHRS           
Sbjct: 303  GQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQQARQAMQQQPQMMYHRSPFVPPNTGYYY 360

Query: 322  XXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200
                                      DDH SAAHMFSDDNTSS CSIM
Sbjct: 361  NHSSYSPAHYSYSYGLPSYPAAAGGGDDH-SAAHMFSDDNTSSSCSIM 407


>XP_007143034.1 hypothetical protein PHAVU_007G038000g [Phaseolus vulgaris]
            ESW15028.1 hypothetical protein PHAVU_007G038000g
            [Phaseolus vulgaris]
          Length = 400

 Score =  402 bits (1032), Expect = e-134
 Identities = 243/401 (60%), Positives = 261/401 (65%), Gaps = 9/401 (2%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSXX 1016
            LVR+GKYAELWS                           LVKGL+AFKNQQKFPAFSS  
Sbjct: 67   LVRAGKYAELWSQKINQNQKQKNNNAKDDKNKGQKQA--LVKGLDAFKNQQKFPAFSSEE 124

Query: 1015 XXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXXXXXXXX 839
                            EMRF+REKANQLQ+L+Q AA+ANNV+K M  + A +        
Sbjct: 125  DEYYSEYDDEDEDEDEEMRFLREKANQLQMLKQQAANANNVRKNMAPMGAGAINGKMNNG 184

Query: 838  XXXXXXXXXXXXXXXXXK-DSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKRASD 665
                               +SP     LDQKTMAALKLN  HLGG G +LNLGEAKRA+D
Sbjct: 185  GGNAGNGKKGGPNPNMGVKESPNVG--LDQKTMAALKLNGGHLGGEGLNLNLGEAKRAND 242

Query: 664  IGAMMNLAGFNGNGANVGSATVLGGNS-NGLGGFPVQSNNMIPGSSAGIPNGGLATGQYP 488
            IGAMMN+AGFNGNG NV SATVLG N+ N +GGFPVQSNNMIPGSSA   NGG+ATGQYP
Sbjct: 243  IGAMMNMAGFNGNGGNVSSATVLGANNPNAMGGFPVQSNNMIPGSSAAFSNGGMATGQYP 302

Query: 487  SSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXXXX 308
            SSLLMNM+GFN  NHPSPSPL     MQAR AMQQQPQMMYHRS                
Sbjct: 303  SSLLMNMSGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMYHRSPVIPTNTGYYYNHSNS 360

Query: 307  XXXXXXXXXXXXPV-----DDHSSAAHMFSDDNTSSGCSIM 200
                        P      DDH SAAHMFSDDNT+S CSIM
Sbjct: 361  YSPAQYSYSYGLPSYPGSGDDH-SAAHMFSDDNTNSSCSIM 400


>KHN06753.1 hypothetical protein glysoja_021299 [Glycine soja]
          Length = 407

 Score =  402 bits (1032), Expect = e-134
 Identities = 254/408 (62%), Positives = 266/408 (65%), Gaps = 16/408 (3%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VDSATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDSATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS-X 1019
            LVR+GK+AELWS                           LV+GLEAFKNQQKFPAFSS  
Sbjct: 67   LVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFPAFSSEE 124

Query: 1018 XXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS-XXXXXX 845
                             EMRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S       
Sbjct: 125  DEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSNNGKMNN 184

Query: 844  XXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRASD 665
                               KDSP     LDQKTM+ALKLNN HL GGE LNLGEAKRA+D
Sbjct: 185  GCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGEAKRAND 241

Query: 664  IGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN-GGLAT 500
            IGAMMNLAGFNG NGANVGSATVLGG  NSNGLGGFPVQS NNMIPGSSA   N GGL+ 
Sbjct: 242  IGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSNGGGLSG 301

Query: 499  GQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHA-MQQQPQMMYHRSXXXXXXXXXXX 323
            GQYPSSLLMNMNGFN  NHPSPSPL      QAR A MQQQPQMMYHRS           
Sbjct: 302  GQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPPNTGYYY 359

Query: 322  XXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200
                                      DDH SAAHMFSDDNTSS CSIM
Sbjct: 360  NHSSSYSPAHYSYSSYGLPGYLAAGGDDHHSAAHMFSDDNTSSSCSIM 407


>XP_003536625.1 PREDICTED: putative uncharacterized protein DDB_G0286901 [Glycine
            max] KRH35767.1 hypothetical protein GLYMA_10G264000
            [Glycine max]
          Length = 407

 Score =  402 bits (1032), Expect = e-134
 Identities = 254/408 (62%), Positives = 266/408 (65%), Gaps = 16/408 (3%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VDSATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDSATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS-X 1019
            LVR+GK+AELWS                           LV+GLEAFKNQQKFPAFSS  
Sbjct: 67   LVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFPAFSSEE 124

Query: 1018 XXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS-XXXXXX 845
                             EMRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S       
Sbjct: 125  DEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSNNGKMNN 184

Query: 844  XXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRASD 665
                               KDSP     LDQKTM+ALKLNN HL GGE LNLGEAKRA+D
Sbjct: 185  GCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGEAKRAND 241

Query: 664  IGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN-GGLAT 500
            IGAMMNLAGFNG NGANVGSATVLGG  NSNGLGGFPVQS NNMIPGSSA   N GGL+ 
Sbjct: 242  IGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSNGGGLSG 301

Query: 499  GQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHA-MQQQPQMMYHRSXXXXXXXXXXX 323
            GQYPSSLLMNMNGFN  NHPSPSPL      QAR A MQQQPQMMYHRS           
Sbjct: 302  GQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPPNTGYYY 359

Query: 322  XXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200
                                      DDH SAAHMFSDDNTSS CSIM
Sbjct: 360  NHSSSYSPAHYSYSSYGLPGYPAAGGDDHHSAAHMFSDDNTSSSCSIM 407


>XP_015942069.1 PREDICTED: bromodomain and WD repeat-containing DDB_G0285837 [Arachis
            duranensis]
          Length = 417

 Score =  398 bits (1023), Expect = e-133
 Identities = 248/414 (59%), Positives = 259/414 (62%), Gaps = 22/414 (5%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDA+QQKVTVSGSVD+ATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDADQQKVTVSGSVDAATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEAFKNQQ-KFP-A 1031
            LVR+GKYAE WS                              LVKGLEAFKNQQ KFP A
Sbjct: 67   LVRAGKYAEPWSQQKTIQNPKQKNNNNIVKDDKNKGGQKQQGLVKGLEAFKNQQQKFPSA 126

Query: 1030 FSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-----AADANNVKKGMGAISAV 866
            FSS                  EMRFIREKANQL LLRQ     AA+ANN+KKG+GAIS  
Sbjct: 127  FSSEEDDDYYDYDDEDEDDDEEMRFIREKANQLHLLRQQAAAAAAEANNLKKGVGAISGG 186

Query: 865  SXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLG 686
            S                                  LDQKTMAALKLNN H+GGGE LNLG
Sbjct: 187  SNNVKMNNNAGNNNNVGKKGGPGNNMGLKDGHGGVLDQKTMAALKLNNGHMGGGEGLNLG 246

Query: 685  EAKRASDIGAMMNLAGFNGN-----GANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI 521
            EAKRASDIGAMMNLAGFNGN       NVGSATVLG NSNGLGGFPV SNNM PGS+A +
Sbjct: 247  EAKRASDIGAMMNLAGFNGNNNNNVANNVGSATVLGANSNGLGGFPVLSNNMAPGSTAAV 306

Query: 520  -PNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRS---- 356
             PNG  +TGQYPSSLLMNMNGFN  NHPSPSPL     MQAR AMQQQPQMMYHRS    
Sbjct: 307  LPNGAFSTGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMYHRSPFVP 364

Query: 355  --XXXXXXXXXXXXXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200
                                          P  D +SAAHMFSDDNTSS CSIM
Sbjct: 365  PNTGYYYNHSNNYSPANYSYALPNYYPHQCPATDDNSAAHMFSDDNTSS-CSIM 417


>XP_016175066.1 PREDICTED: hybrid signal transduction histidine kinase A [Arachis
            ipaensis]
          Length = 418

 Score =  397 bits (1019), Expect = e-132
 Identities = 247/415 (59%), Positives = 259/415 (62%), Gaps = 23/415 (5%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDA+QQKVTVSGSVD+ATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDADQQKVTVSGSVDAATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEAFKNQQ-KFP-A 1031
            LVR+GKYAE WS                              LVKGLEAFKNQQ KFP A
Sbjct: 67   LVRAGKYAEPWSQQKTNQNPKQKNNNNIVKDDKNKGGQKQQGLVKGLEAFKNQQQKFPSA 126

Query: 1030 FSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ------AADANNVKKGMGAISA 869
            FSS                  EMRFIREKANQL LLRQ      AA+ANN+KKG+GAIS 
Sbjct: 127  FSSEEDDDYYDYDDEDEDDDEEMRFIREKANQLHLLRQQAAAAAAAEANNLKKGVGAISG 186

Query: 868  VSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNL 689
             S                                  LDQKTMAALKLNN H+GGGE LNL
Sbjct: 187  GSNNVKMNNNAGNNNNVGKKGGPGNNMGLKDGHGGVLDQKTMAALKLNNGHMGGGEGLNL 246

Query: 688  GEAKRASDIGAMMNLAGFNGN-----GANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAG 524
            GEAKRASDIGAMMNLAGFNGN       NVG+ATVLG NSNGLGGFPV SNNM PGS+A 
Sbjct: 247  GEAKRASDIGAMMNLAGFNGNNNNNVANNVGNATVLGANSNGLGGFPVLSNNMAPGSTAA 306

Query: 523  I-PNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRS--- 356
            + PNG  +TGQYPSSLLMNMNGFN  NHPSPSPL     MQAR AMQQQPQMMYHRS   
Sbjct: 307  VLPNGAFSTGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMYHRSPFV 364

Query: 355  ---XXXXXXXXXXXXXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200
                                           P  D +SAAHMFSDDNTSS CSIM
Sbjct: 365  PPNTGYYYNHSNNYSPANYSYALPNYYPHQCPATDDNSAAHMFSDDNTSS-CSIM 418


>XP_014510893.1 PREDICTED: serine, glycine and glutamine-rich protein [Vigna radiata
            var. radiata]
          Length = 402

 Score =  389 bits (1000), Expect = e-129
 Identities = 238/403 (59%), Positives = 254/403 (63%), Gaps = 11/403 (2%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSXX 1016
            LVR+GKYAELWS                           L KGL+AFKNQQKFPAFSS  
Sbjct: 67   LVRAGKYAELWSQKTNQNQKQKNNNAKDDKNKGQKQG--LAKGLDAFKNQQKFPAFSSEE 124

Query: 1015 XXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAADANNVKKGMGAISAVSXXXXXXXXX 836
                            EMRF+REKA+ LQ+L+Q A   NV+K MG + A +         
Sbjct: 125  DEYYSEYEDDDEEEDEEMRFLREKAHHLQMLKQQAANANVRKSMGGMGAGAINGKMNNGG 184

Query: 835  XXXXXXXXXXXXXXXXK---DSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKRAS 668
                                +SP     LDQKTMAALKLN  H+GG G  LNLGEAKRA+
Sbjct: 185  GNGGGGGGKKGGPNPNMGMKESPNGG--LDQKTMAALKLNGGHVGGEGLGLNLGEAKRAN 242

Query: 667  DIGAMMNLAGFNGNGANVGSATVLGGN-SNGLGGFPVQSNNMIPGSSAGIPNGGLATGQY 491
            DIGAMMN+AGFNGNG NV SATVLG N S+G+GGFPVQSNNMIPGSSAG  NGG+  GQY
Sbjct: 243  DIGAMMNMAGFNGNGGNVTSATVLGANNSSGMGGFPVQSNNMIPGSSAGFSNGGIGAGQY 302

Query: 490  PSSLLMNMNGFNNINHPSPSPLXXXXXM--QARHAMQQQPQMMYHRSXXXXXXXXXXXXX 317
            PSSLLMNMNGFNN  HPSPSPL     M  QAR AMQQQPQMMYHRS             
Sbjct: 303  PSSLLMNMNGFNN--HPSPSPLMMNMNMNMQARQAMQQQPQMMYHRSPLIPPNTGYYYNH 360

Query: 316  XXXXXXXXXXXXXXXPV----DDHSSAAHMFSDDNTSSGCSIM 200
                           P     DDH SA HMFSDDNTSS CSIM
Sbjct: 361  SNSYSPAQYSYSYGLPSYPGGDDH-SATHMFSDDNTSSSCSIM 402


>XP_017409476.1 PREDICTED: serine, glycine and glutamine-rich protein [Vigna
            angularis] KOM28905.1 hypothetical protein
            LR48_Vigan609s003700 [Vigna angularis] BAT93876.1
            hypothetical protein VIGAN_08042400 [Vigna angularis var.
            angularis]
          Length = 404

 Score =  387 bits (995), Expect = e-129
 Identities = 239/405 (59%), Positives = 254/405 (62%), Gaps = 13/405 (3%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSXX 1016
            LVR+GKYAELWS                           L KGL+AFKNQQKFPAFSS  
Sbjct: 67   LVRAGKYAELWSQKSNQNQKQKNNNAKDDKNKGQKQG--LPKGLDAFKNQQKFPAFSSEE 124

Query: 1015 XXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAADANNVKKGMG-----AISAVSXXXX 851
                            EMRF+REKA+ LQ+L+Q     NV+K MG     AI+       
Sbjct: 125  DEYYSEYEDDDEDEDEEMRFLREKAHHLQMLKQQTANANVRKSMGGMGAGAINGKMNNGG 184

Query: 850  XXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKR 674
                                 K+SP     LDQKTMAALKLN  H+GG G  LNLGEAKR
Sbjct: 185  GNGGGGGGGGKKGGPNPNMGMKESPNVG--LDQKTMAALKLNGGHVGGEGLGLNLGEAKR 242

Query: 673  ASDIGAMMNLAGFNGNGANVGSATVLGGN-SNGLGGFPVQSNNMIPGSSAGIPNGGLATG 497
            A+DIGAMMN+AGFNGNG NV SATVLG N S+G+GGFPVQSNNMIPGSSAG  NGG+  G
Sbjct: 243  ANDIGAMMNMAGFNGNGGNVTSATVLGANNSSGMGGFPVQSNNMIPGSSAGFSNGGIGAG 302

Query: 496  QYPSSLLMNMNGFNNINHPSPSPLXXXXXM--QARHAMQQQPQMMYHRSXXXXXXXXXXX 323
            QYPSSLLMNMNGFNN  HPSPSPL     M  QAR AMQQQPQMMYHRS           
Sbjct: 303  QYPSSLLMNMNGFNN--HPSPSPLMMNMNMNMQARQAMQQQPQMMYHRSPLIPPNTGYYY 360

Query: 322  XXXXXXXXXXXXXXXXXPV----DDHSSAAHMFSDDNTSSGCSIM 200
                             P     DDH SA HMFSDDNTSS CSIM
Sbjct: 361  NHSNSYSPAQYAYSYGLPSYPGGDDH-SATHMFSDDNTSSSCSIM 404


>KHN42381.1 hypothetical protein glysoja_020093 [Glycine soja]
          Length = 363

 Score =  385 bits (990), Expect = e-128
 Identities = 233/349 (66%), Positives = 246/349 (70%), Gaps = 9/349 (2%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSXX 1016
            LVR+GK+AELWS                           LVKGLEAFKNQQKFPAFSS  
Sbjct: 67   LVRAGKHAELWS--QKINQNQKQKNNNAKDDKNKGQKQALVKGLEAFKNQQKFPAFSSEE 124

Query: 1015 XXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISA-VSXXXXXXX 842
                            EMRF+REKANQLQ+L+ Q A+ANNV+KGMGAI+A  +       
Sbjct: 125  DEYYYDDEDDEEDEDEEMRFLREKANQLQMLKQQTANANNVRKGMGAIAAGANNGKTNNG 184

Query: 841  XXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKRASD 665
                              KDSP    GLDQKTMAALK NN HLGG G +LNLGEAKRA+D
Sbjct: 185  DNANSGKKGGPNHQNMGMKDSP--NGGLDQKTMAALKFNNGHLGGDGLNLNLGEAKRAND 242

Query: 664  IGAMMNLAGFNGNGA--NVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPNGGLAT 500
            IGAMMNLAGFNGN    NVGSATVLGG  NSNGLGGFPVQS NNMIPGS+A   NGGL+ 
Sbjct: 243  IGAMMNLAGFNGNNCANNVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSAAAFSNGGLSG 302

Query: 499  GQYPSSLLMNMNGFNNINHPSPSPL-XXXXXMQARHAMQQQPQMMYHRS 356
            GQYPSSLLMNMNGFN  NHPSPSPL       QAR AMQQQPQMMYHRS
Sbjct: 303  GQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQQARQAMQQQPQMMYHRS 349


>GAU24292.1 hypothetical protein TSUD_48800 [Trifolium subterraneum]
          Length = 399

 Score =  364 bits (935), Expect = e-120
 Identities = 225/405 (55%), Positives = 245/405 (60%), Gaps = 13/405 (3%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVD+ATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDAATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLV-KGLEAFKNQQKFPAFSSX 1019
            LVRSGKYAELWS                           +V KGLEAFKNQQKFPAFSS 
Sbjct: 67   LVRSGKYAELWSQKTNNNQNQKQKNNNIVKDDKNKGQKQVVVKGLEAFKNQQKFPAFSSE 126

Query: 1018 XXXXXXXXXXXXXXXXXE---MRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXXXX 851
                             E    R+IRE ANQ+Q++RQ   DANN KK +GA         
Sbjct: 127  EDGGYYGGYGDDDDEEEEDQETRYIREAANQIQMMRQQVVDANNAKKAIGA--------K 178

Query: 850  XXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRA 671
                                         G+DQKT+AA+KLNN HL G ES+NLGE+KR 
Sbjct: 179  MNNAGNVNGNSGKKGNSNQNMVGMKESANGVDQKTIAAMKLNNGHLVGNESMNLGESKRV 238

Query: 670  SDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSN--NMIPGSSAG-IPNGGLAT 500
            SDIGAMMNLAGFNGN   VG+AT+LGGNSNGLGGFPVQSN  NMI GSSA  IPNGG  T
Sbjct: 239  SDIGAMMNLAGFNGNNNVVGNATILGGNSNGLGGFPVQSNNTNMIQGSSAATIPNGGFVT 298

Query: 499  GQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAM-QQQPQMMYHRSXXXXXXXXXXX 323
            GQ P S++MNMNGFNN     PS L      QARH M QQQPQMMYHRS           
Sbjct: 299  GQIPPSMMMNMNGFNN----HPSSLMNMNMQQARHVMQQQQPQMMYHRSPYVPPNTGYYY 354

Query: 322  XXXXXXXXXXXXXXXXXPVDDH----SSAAHMFSDDNTSSGCSIM 200
                              +  +    +SAAHMFSDDNT+S CSIM
Sbjct: 355  NNYNNYIPPNATNYSSYAMPSYPTEDNSAAHMFSDDNTTSSCSIM 399


>XP_019441465.1 PREDICTED: heavy metal-associated isoprenylated plant protein 37-like
            [Lupinus angustifolius] OIW12890.1 hypothetical protein
            TanjilG_24823 [Lupinus angustifolius]
          Length = 402

 Score =  359 bits (922), Expect = e-118
 Identities = 228/409 (55%), Positives = 245/409 (59%), Gaps = 17/409 (4%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VD+ATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDAATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEAFKNQQKFPAFS 1025
            L R+GKYA+LWS                              LVKGLE FKNQQKFPAFS
Sbjct: 67   LARAGKYAQLWSQKSSNQNQKQNNNNNNCVKDDNKNKGQKQGLVKGLEDFKNQQKFPAFS 126

Query: 1024 SXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXXXXX 848
            S                  EMRF+RE+ NQLQ+LRQ A DANN  K   A++        
Sbjct: 127  SEEDDDFYDYDDDEDDDDEEMRFMRERVNQLQMLRQQAVDANNAAKNGVAVNNNGKINNN 186

Query: 847  XXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGG--ESLNLGEAKR 674
                                         LDQKT+AALK+NN HLGGG  E LN+G++KR
Sbjct: 187  AGKKGGPNQNMVIKDNTGG----------LDQKTIAALKMNNGHLGGGGGEGLNIGDSKR 236

Query: 673  ASDIGAMMNLAGFNGNGA-NVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGL-AT 500
            A+DIG MMNLAGFNGNGA N GSATVLG NSNGLGGF  QS NMIPGSSA IPNG   AT
Sbjct: 237  ANDIGVMMNLAGFNGNGANNAGSATVLGPNSNGLGGFSAQS-NMIPGSSAVIPNGAFAAT 295

Query: 499  G-QYPSSLLMNMNGFNNINHPSPSPL-XXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXX 326
            G QYPSSLLMNMNGFN  NHPSPSPL      MQARHAMQQQPQMMYHRS          
Sbjct: 296  GQQYPSSLLMNMNGFN--NHPSPSPLMMNNMNMQARHAMQQQPQMMYHRSPYIPPNTGYY 353

Query: 325  XXXXXXXXXXXXXXXXXXPVD-------DHSSAAHMFSDDNTSSGCSIM 200
                                          +SA H+FSDD T S CS+M
Sbjct: 354  YNHNLNNNHTPANYNYATMPSYPVGGGGSDNSATHIFSDDYTGSSCSVM 402


>KRH35766.1 hypothetical protein GLYMA_10G264000 [Glycine max]
          Length = 392

 Score =  357 bits (917), Expect = e-117
 Identities = 239/408 (58%), Positives = 251/408 (61%), Gaps = 16/408 (3%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQ               KVKKLLQRIEGVYQVQIDAEQQKVTVSG VDSATLIKK
Sbjct: 7    FKLLKIQ---------------KVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDSATLIKK 51

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS-X 1019
            LVR+GK+AELWS                           LV+GLEAFKNQQKFPAFSS  
Sbjct: 52   LVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFPAFSSEE 109

Query: 1018 XXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS-XXXXXX 845
                             EMRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S       
Sbjct: 110  DEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSNNGKMNN 169

Query: 844  XXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRASD 665
                               KDSP     LDQKTM+ALKLNN HL GGE LNLGEAKRA+D
Sbjct: 170  GCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGEAKRAND 226

Query: 664  IGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN-GGLAT 500
            IGAMMNLAGFNG NGANVGSATVLGG  NSNGLGGFPVQS NNMIPGSSA   N GGL+ 
Sbjct: 227  IGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSNGGGLSG 286

Query: 499  GQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHA-MQQQPQMMYHRSXXXXXXXXXXX 323
            GQYPSSLLMNMNGFN  NHPSPSPL      QAR A MQQQPQMMYHRS           
Sbjct: 287  GQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPPNTGYYY 344

Query: 322  XXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200
                                      DDH SAAHMFSDDNTSS CSIM
Sbjct: 345  NHSSSYSPAHYSYSSYGLPGYPAAGGDDHHSAAHMFSDDNTSSSCSIM 392


>AFK47709.1 unknown [Lotus japonicus]
          Length = 400

 Score =  355 bits (910), Expect = e-116
 Identities = 230/413 (55%), Positives = 243/413 (58%), Gaps = 21/413 (5%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSA LIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSAALIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEA-FKN----QQK 1040
            L RSGK+AELWS                              LVKGLEA FKN    QQK
Sbjct: 67   LNRSGKHAELWSQKANQNQKQKNNNNINNVKDDKNNKGQKQGLVKGLEAAFKNHQQQQQK 126

Query: 1039 FPAFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAADA-----NNVKKGMGAI 875
            FPAFSS                   +RFIREKANQLQLLRQ   A     NNVKK + A 
Sbjct: 127  FPAFSSEEDDEYYDYDDEDDDDEE-LRFIREKANQLQLLRQQQQAVVDANNNVKKAISAA 185

Query: 874  SAVSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESL 695
            S                              S       DQKTMAALKLNNAHLGGGESL
Sbjct: 186  SNNGHNKMNNAAGKKGGQNQNMGGMKESNVGS-------DQKTMAALKLNNAHLGGGESL 238

Query: 694  NLGEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSS-AGIP 518
            NLGEAKRA+DIGAMMNLAGF  NG N G+ATVLGGNSNG+GGFPVQSNNM  G+S A +P
Sbjct: 239  NLGEAKRANDIGAMMNLAGF--NGGNAGNATVLGGNSNGMGGFPVQSNNMFQGNSPAAVP 296

Query: 517  NGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXX 338
            NGG     Y  S+LMNMNGFNN      SP+     MQ RHAMQQQPQMM+HRS      
Sbjct: 297  NGG-----YAPSMLMNMNGFNN----HQSPMMNMNMMQTRHAMQQQPQMMFHRSPVIPPN 347

Query: 337  XXXXXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200
                                  P         DH SAAHMFSDDNT+S CS+M
Sbjct: 348  TGYYFNHNNYNPAANYSYYASLPSYPGGDYDHDHHSAAHMFSDDNTTSSCSVM 400


>XP_002284132.1 PREDICTED: heavy metal-associated isoprenylated plant protein 37
            [Vitis vinifera]
          Length = 390

 Score =  330 bits (847), Expect = e-106
 Identities = 209/395 (52%), Positives = 236/395 (59%), Gaps = 3/395 (0%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVY V IDAEQQ+VTVSGSVDS TLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYTVNIDAEQQRVTVSGSVDSGTLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSXX 1016
            LV++GK+AELWS                          GL+KGLEAFK QQKFP FSS  
Sbjct: 67   LVKAGKHAELWS-QKSNQNQKQKTNCIKDDKNNKGQKQGLIKGLEAFKTQQKFPVFSS-E 124

Query: 1015 XXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVSXXXXXXXX 839
                            E+RF++EKANQL LLR QA DA+N KKG GAI+A +        
Sbjct: 125  EDEDDFDDDEEDYEEEELRFLQEKANQLSLLRQQALDASNAKKGFGAIAASNNGKINNNV 184

Query: 838  XXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRASDIG 659
                             K SP    G+DQKT+AALK+NN HL GG ++N GE KR +DI 
Sbjct: 185  GNGNVQKKGNPNQNMGMKGSP---GGIDQKTIAALKMNNPHLVGGGNINSGEVKRGNDIN 241

Query: 658  AMMNLAGFNGNGANV-GSATVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLATG-QYPS 485
            +MM L GF+GNG NV  +A  LGGNSN LGGF +Q NN   GSS G PNGG ATG  +PS
Sbjct: 242  SMMGLGGFHGNGGNVAATAAALGGNSNALGGFQIQPNNGFQGSSTGFPNGGFATGHHHPS 301

Query: 484  SLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXXXXX 305
             +LMN+NG N  NHPS   +      Q RHA  QQPQMMYHRS                 
Sbjct: 302  PMLMNLNG-NQYNHPS-QMMMNMNMQQNRHAPMQQPQMMYHRSPFIPPSTGYYYNYSPAL 359

Query: 304  XXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200
                          DH SA+HMFSD+NTSS CSIM
Sbjct: 360  SPYTHCDTNYS--GDH-SASHMFSDENTSS-CSIM 390


>XP_018843651.1 PREDICTED: neurogenic protein mastermind-like [Juglans regia]
          Length = 378

 Score =  327 bits (837), Expect = e-105
 Identities = 212/395 (53%), Positives = 234/395 (59%), Gaps = 3/395 (0%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVY V IDAEQQKVTVSGSVD++TLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYLVNIDAEQQKVTVSGSVDASTLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSXX 1016
            LVR+GK+AE WS                           L KGLEAFKNQQKFPAFSS  
Sbjct: 67   LVRAGKHAEPWS-QKNTQNQKQMNNCVKDAKNNKSQKPLLFKGLEAFKNQQKFPAFSS-E 124

Query: 1015 XXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXXXXXXXX 839
                            E+RFIREKANQL LLRQ A DANN KKG+ AI A S        
Sbjct: 125  EEDDYFDDVEEDEEEDELRFIREKANQLNLLRQRAIDANNAKKGVAAIGAAS-NNGKMNN 183

Query: 838  XXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRASDIG 659
                                     G+D KT+AALK+N+AHLGGG ++N GE +R SD+ 
Sbjct: 184  VGNGNIGNGKKANTEQNMGMRASPAGIDPKTLAALKINSAHLGGG-NVNAGEGRRVSDLN 242

Query: 658  AMMNLAGFNGNGANVGS--ATVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLATGQYPS 485
             MM LAGF+GNG NV S  A  LGGNSNGLGGF         GSSAG P GG ATGQYPS
Sbjct: 243  GMMGLAGFHGNGLNVASAGAAALGGNSNGLGGF--------QGSSAGFPTGGYATGQYPS 294

Query: 484  SLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXXXXX 305
            S+LMNMNG    NHPSP  +     MQAR+AMQQQPQMMYHRS                 
Sbjct: 295  SMLMNMNGH---NHPSPMMM----NMQARNAMQQQPQMMYHRSPHVPPTTGYYYNYSPSP 347

Query: 304  XXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200
                            +SAA+MFSD+NT+S CSIM
Sbjct: 348  NPYSYTDPNH---TGRNSAAYMFSDENTNS-CSIM 378


>XP_007045083.1 PREDICTED: myb-like protein I [Theobroma cacao] XP_007045084.1
            PREDICTED: myb-like protein I [Theobroma cacao]
            XP_007045086.1 PREDICTED: myb-like protein I [Theobroma
            cacao] EOY00915.1 Heavy metal transport/detoxification
            superfamily protein isoform 1 [Theobroma cacao]
            EOY00916.1 Heavy metal transport/detoxification
            superfamily protein isoform 1 [Theobroma cacao]
            EOY00917.1 Heavy metal transport/detoxification
            superfamily protein isoform 1 [Theobroma cacao]
            EOY00918.1 Heavy metal transport/detoxification
            superfamily protein isoform 1 [Theobroma cacao]
          Length = 392

 Score =  323 bits (828), Expect = e-104
 Identities = 205/398 (51%), Positives = 231/398 (58%), Gaps = 6/398 (1%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQV IDAEQQKVTVSGSVDSATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVSIDAEQQKVTVSGSVDSATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSXX 1016
            LVR+GK+AE+WS                          GL+KGLEAFK QQKFP+F S  
Sbjct: 67   LVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKFPSFVS-E 124

Query: 1015 XXXXXXXXXXXXXXXXEMRFIRE----KANQLQLLR-QAADANNVKKGMGAISAVSXXXX 851
                            E++F++     +  QL LLR QA DANN K G+G I+A S    
Sbjct: 125  EDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNITATS-NNN 183

Query: 850  XXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRA 671
                                          LDQKT+AALK+NNA L GG ++N  E KR 
Sbjct: 184  NKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNINAAEGKRG 242

Query: 670  SDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PNGGLATGQ 494
             DI  +M L+GF+GNGANV  A  LGGN N +GGF VQSNN + GSSA I  NGG  TGQ
Sbjct: 243  HDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQNGGYVTGQ 302

Query: 493  YPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXX 314
             PSS+LMNMNG+N      PS +     +Q RHAMQQQPQMMYHRS              
Sbjct: 303  NPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPSTGYYYNYG 357

Query: 313  XXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200
                             DHS+A HMFSDDNTSS CSIM
Sbjct: 358  PPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 392


>GAV65585.1 HMA domain-containing protein [Cephalotus follicularis]
          Length = 383

 Score =  322 bits (826), Expect = e-103
 Identities = 202/395 (51%), Positives = 232/395 (58%), Gaps = 3/395 (0%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIKK 1196
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGV+QV IDAEQQ+VT+SGSVDSATLIKK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVFQVNIDAEQQRVTISGSVDSATLIKK 66

Query: 1195 LVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSXX 1016
            LVR+GK+AELWS                           L+KGLE+ KNQQKFPAFSS  
Sbjct: 67   LVRAGKHAELWSQKSNQNQKQKNNCIKEDKNNESQKQG-LIKGLESLKNQQKFPAFSSEE 125

Query: 1015 XXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAA--DANNVKKGMGAISAVSXXXXXXX 842
                            ++RF+ +  +QL LLRQ A  +ANN KKG GAI+A +       
Sbjct: 126  DDDYLDDDEDDDEEVEQLRFLEKANHQLGLLRQQAAIEANNAKKG-GAIAAAAANNGKMN 184

Query: 841  XXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRASDI 662
                                      GLDQKTMAALK+NNAHLGGG ++N GE KR +D+
Sbjct: 185  TSVGNLNTGKKGNPNQNTGIK-VNPGGLDQKTMAALKMNNAHLGGG-NINTGEVKRGNDL 242

Query: 661  GAMMNLAGFNGNGANVGSA-TVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLATGQYPS 485
              MM L GF+GNGAN+G+A T LGGN+NGLGG  VQ N     S AG PNGG ATGQYPS
Sbjct: 243  STMMGLTGFHGNGANIGNAATALGGNANGLGGIQVQPNGYQGSSGAGFPNGGYATGQYPS 302

Query: 484  SLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXXXXX 305
            ++LMNMNG+N+       P      MQ RH    QPQMMYHRS                 
Sbjct: 303  AMLMNMNGYNH-------PASMMMNMQNRH---PQPQMMYHRSPYIPASTGYYHNYSPSP 352

Query: 304  XXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200
                           HS+A HMFSD+NTSS CSIM
Sbjct: 353  YSYTEQPNHRI---GHSAATHMFSDENTSS-CSIM 383


>EOY00920.1 Heavy metal transport/detoxification superfamily protein isoform 6
            [Theobroma cacao]
          Length = 393

 Score =  318 bits (816), Expect = e-102
 Identities = 205/399 (51%), Positives = 231/399 (57%), Gaps = 7/399 (1%)
 Frame = -3

Query: 1375 FKLLKIQ-TCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLIK 1199
            FKLLKIQ TCVLKVNIHCDGCKQKVKKLLQRIEGVYQV IDAEQQKVTVSGSVDSATLIK
Sbjct: 7    FKLLKIQQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVSIDAEQQKVTVSGSVDSATLIK 66

Query: 1198 KLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSX 1019
            KLVR+GK+AE+WS                          GL+KGLEAFK QQKFP+F S 
Sbjct: 67   KLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKFPSFVS- 124

Query: 1018 XXXXXXXXXXXXXXXXXEMRFIRE----KANQLQLLR-QAADANNVKKGMGAISAVSXXX 854
                             E++F++     +  QL LLR QA DANN K G+G I+A S   
Sbjct: 125  EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNITATS-NN 183

Query: 853  XXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKR 674
                                           LDQKT+AALK+NNA L GG ++N  E KR
Sbjct: 184  NNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNINAAEGKR 242

Query: 673  ASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PNGGLATG 497
              DI  +M L+GF+GNGANV  A  LGGN N +GGF VQSNN + GSSA I  NGG  TG
Sbjct: 243  GHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQNGGYVTG 302

Query: 496  QYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXX 317
            Q PSS+LMNMNG+N      PS +     +Q RHAMQQQPQMMYHRS             
Sbjct: 303  QNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPSTGYYYNY 357

Query: 316  XXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200
                              DHS+A HMFSDDNTSS CSIM
Sbjct: 358  GPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 393


>EOY00919.1 Heavy metal transport/detoxification superfamily protein isoform 5
            [Theobroma cacao]
          Length = 393

 Score =  318 bits (816), Expect = e-102
 Identities = 205/399 (51%), Positives = 231/399 (57%), Gaps = 7/399 (1%)
 Frame = -3

Query: 1375 FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEG-VYQVQIDAEQQKVTVSGSVDSATLIK 1199
            FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEG VYQV IDAEQQKVTVSGSVDSATLIK
Sbjct: 7    FKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGGVYQVSIDAEQQKVTVSGSVDSATLIK 66

Query: 1198 KLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSSX 1019
            KLVR+GK+AE+WS                          GL+KGLEAFK QQKFP+F S 
Sbjct: 67   KLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKFPSFVS- 124

Query: 1018 XXXXXXXXXXXXXXXXXEMRFIRE----KANQLQLLR-QAADANNVKKGMGAISAVSXXX 854
                             E++F++     +  QL LLR QA DANN K G+G I+A S   
Sbjct: 125  EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNITATS-NN 183

Query: 853  XXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKR 674
                                           LDQKT+AALK+NNA L GG ++N  E KR
Sbjct: 184  NNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNINAAEGKR 242

Query: 673  ASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PNGGLATG 497
              DI  +M L+GF+GNGANV  A  LGGN N +GGF VQSNN + GSSA I  NGG  TG
Sbjct: 243  GHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQNGGYVTG 302

Query: 496  QYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXX 317
            Q PSS+LMNMNG+N      PS +     +Q RHAMQQQPQMMYHRS             
Sbjct: 303  QNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPSTGYYYNY 357

Query: 316  XXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200
                              DHS+A HMFSDDNTSS CSIM
Sbjct: 358  GPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 393

