BLASTX nr result
ID: Glycyrrhiza36_contig00027917
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza36_contig00027917
         (1757 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

KYP75352.1 hypothetical protein KK1_008076 [Cajanus cajan]            417   e-139
XP_003555274.1 PREDICTED: putative uncharacterized protein DDB_G...   416   e-138
XP_007143034.1 hypothetical protein PHAVU_007G038000g [Phaseolus...   414   e-137
KHN06753.1 hypothetical protein glysoja_021299 [Glycine soja]         414   e-137
XP_003536625.1 PREDICTED: putative uncharacterized protein DDB_G...   414   e-137
XP_015942069.1 PREDICTED: bromodomain and WD repeat-containing D...   410   e-135
XP_016175066.1 PREDICTED: hybrid signal transduction histidine k...   409   e-135
XP_014510893.1 PREDICTED: serine, glycine and glutamine-rich pro...   401   e-132
XP_017409476.1 PREDICTED: serine, glycine and glutamine-rich pro...   399   e-131
KHN42381.1 hypothetical protein glysoja_020093 [Glycine soja]         397   e-131
GAU24292.1 hypothetical protein TSUD_48800 [Trifolium subterraneum]   376   e-122
XP_019441465.1 PREDICTED: heavy metal-associated isoprenylated p...   371   e-120
KRH35766.1 hypothetical protein GLYMA_10G264000 [Glycine max]         369   e-120
AFK47709.1 unknown [Lotus japonicus]                                  367   e-119
XP_002284132.1 PREDICTED: heavy metal-associated isoprenylated p...   341   e-109
XP_018843651.1 PREDICTED: neurogenic protein mastermind-like [Ju...   338   e-108
XP_007045083.1 PREDICTED: myb-like protein I [Theobroma cacao] X...   335   e-106
GAV65585.1 HMA domain-containing protein [Cephalotus follicularis]    334   e-106
EOY00920.1 Heavy metal transport/detoxification superfamily prot...
330 e-105 EOY00919.1 Heavy metal transport/detoxification superfamily prot... 330 e-105 >KYP75352.1 hypothetical protein KK1_008076 [Cajanus cajan] Length = 397 Score = 417 bits (1073), Expect = e-139 Identities = 254/405 (62%), Positives = 272/405 (67%), Gaps = 7/405 (1%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTV GSVDS Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVLGSVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQ-QKF 1037 ATLIKKLVR+GKYAELWS GL KG+EAFKNQ QKF Sbjct: 61 ATLIKKLVRAGKYAELWS--QKTNQNQKQKNNNAKDEKNKGQKQGLPKGIEAFKNQHQKF 118 Query: 1036 PAFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVSX 860 PAFSS EMR +REKA Q+++L+ Q +ANNV+KGMG+ISA + Sbjct: 119 PAFSSEEDEYYFETDEEDEDEDEEMRMLREKAIQMKMLKHQPPNANNVRKGMGSISAGAN 178 Query: 859 XXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGE 683 KD+P GLDQKTMAALKLNNAHL G G +LNLGE Sbjct: 179 NGKMNNACNANSGKKGGPNHNMAMKDNP---NGLDQKTMAALKLNNAHLNGEGLNLNLGE 235 Query: 682 AKRASDIGAMMNLAGFNGNGANVGSATVLGG-NSNGLGGFPVQSNNMIPGSSAGIPNGGL 506 AKRA+DIGAMMNLAGF+GNGANVGSATVLGG NSNG GGFPVQSNNMIPGSSA P+GGL Sbjct: 236 AKRANDIGAMMNLAGFHGNGANVGSATVLGGNNSNGFGGFPVQSNNMIPGSSAAFPSGGL 295 Query: 505 ATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXX 326 A+GQYPSSLLMNMNGFN NH SPSPL MQAR AMQQQPQMMYHRS Sbjct: 296 ASGQYPSSLLMNMNGFN--NHTSPSPLMMNMNMQARQAMQQQPQMMYHRSPFVPPNTGYY 353 Query: 325 XXXXXXXXXXXXXXXXXXPV---DDHSSAAHMFSDDNTSSGCSIM 200 DDH +AAHMFSDDNTSS CSIM Sbjct: 354 YNHSGYSPAHYSYSYGLPSYPGGDDH-TAAHMFSDDNTSSSCSIM 397 >XP_003555274.1 PREDICTED: putative uncharacterized protein DDB_G0286901 [Glycine max] KRG90987.1 hypothetical protein GLYMA_20G126300 [Glycine max] Length = 407 Score = 416 bits (1069), Expect = e-138 Identities = 259/414 (62%), Positives = 272/414 (65%), Gaps = 16/414 (3%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 
MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 ATLIKKLVR+GK+AELWS LVKGLEAFKNQQKFP Sbjct: 61 ATLIKKLVRAGKHAELWS--QKINQNQKQKNNNAKDDKNKGQKQALVKGLEAFKNQQKFP 118 Query: 1033 AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISA-VSX 860 AFSS EMRF+REKANQLQ+L+ Q A+ANNV+KGMGAI+A + Sbjct: 119 AFSSEEDEYYYDDEDDEEDEDEEMRFLREKANQLQMLKQQTANANNVRKGMGAIAAGANN 178 Query: 859 XXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGE 683 KDSP GLDQKTMAALK NN HLGG G +LNLGE Sbjct: 179 GKTNNGDNANSGKKGGPNHQNMGMKDSP--NGGLDQKTMAALKFNNGHLGGDGLNLNLGE 236 Query: 682 AKRASDIGAMMNLAGFNGNGA--NVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIP 518 AKRA+DIGAMMNLAGFNGN NVGSATVLGG NSNGLGGFPVQS NNMIPGS+A Sbjct: 237 AKRANDIGAMMNLAGFNGNNCANNVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSAAAFS 296 Query: 517 NGGLATGQYPSSLLMNMNGFNNINHPSPSPL-XXXXXMQARHAMQQQPQMMYHRSXXXXX 341 NGGL+ GQYPSSLLMNMNGFN NHPSPSPL QAR AMQQQPQMMYHRS Sbjct: 297 NGGLSGGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQQARQAMQQQPQMMYHRSPFVPP 354 Query: 340 XXXXXXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200 DDH SAAHMFSDDNTSS CSIM Sbjct: 355 NTGYYYNHSSYSPAHYSYSYGLPSYPAAAGGGDDH-SAAHMFSDDNTSSSCSIM 407 >XP_007143034.1 hypothetical protein PHAVU_007G038000g [Phaseolus vulgaris] ESW15028.1 hypothetical protein PHAVU_007G038000g [Phaseolus vulgaris] Length = 400 Score = 414 bits (1063), Expect = e-137 Identities = 249/407 (61%), Positives = 267/407 (65%), Gaps = 9/407 (2%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 ATLIKKLVR+GKYAELWS LVKGL+AFKNQQKFP Sbjct: 61 ATLIKKLVRAGKYAELWSQKINQNQKQKNNNAKDDKNKGQKQA--LVKGLDAFKNQQKFP 118 Query: 1033 
AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXX 857 AFSS EMRF+REKANQLQ+L+Q AA+ANNV+K M + A + Sbjct: 119 AFSSEEDEYYSEYDDEDEDEDEEMRFLREKANQLQMLKQQAANANNVRKNMAPMGAGAIN 178 Query: 856 XXXXXXXXXXXXXXXXXXXXXXXK-DSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGE 683 +SP LDQKTMAALKLN HLGG G +LNLGE Sbjct: 179 GKMNNGGGNAGNGKKGGPNPNMGVKESPNVG--LDQKTMAALKLNGGHLGGEGLNLNLGE 236 Query: 682 AKRASDIGAMMNLAGFNGNGANVGSATVLGGNS-NGLGGFPVQSNNMIPGSSAGIPNGGL 506 AKRA+DIGAMMN+AGFNGNG NV SATVLG N+ N +GGFPVQSNNMIPGSSA NGG+ Sbjct: 237 AKRANDIGAMMNMAGFNGNGGNVSSATVLGANNPNAMGGFPVQSNNMIPGSSAAFSNGGM 296 Query: 505 ATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXX 326 ATGQYPSSLLMNM+GFN NHPSPSPL MQAR AMQQQPQMMYHRS Sbjct: 297 ATGQYPSSLLMNMSGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMYHRSPVIPTNTGYY 354 Query: 325 XXXXXXXXXXXXXXXXXXPV-----DDHSSAAHMFSDDNTSSGCSIM 200 P DDH SAAHMFSDDNT+S CSIM Sbjct: 355 YNHSNSYSPAQYSYSYGLPSYPGSGDDH-SAAHMFSDDNTNSSCSIM 400 >KHN06753.1 hypothetical protein glysoja_021299 [Glycine soja] Length = 407 Score = 414 bits (1063), Expect = e-137 Identities = 260/414 (62%), Positives = 272/414 (65%), Gaps = 16/414 (3%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VDS Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 ATLIKKLVR+GK+AELWS LV+GLEAFKNQQKFP Sbjct: 61 ATLIKKLVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFP 118 Query: 1033 AFSS-XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS- 863 AFSS EMRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S Sbjct: 119 AFSSEEDEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSN 178 Query: 862 XXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGE 683 KDSP LDQKTM+ALKLNN HL GGE LNLGE Sbjct: 179 NGKMNNGCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGE 235 Query: 682 AKRASDIGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN 515 AKRA+DIGAMMNLAGFNG 
NGANVGSATVLGG NSNGLGGFPVQS NNMIPGSSA N Sbjct: 236 AKRANDIGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSN 295 Query: 514 -GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHA-MQQQPQMMYHRSXXXXX 341 GGL+ GQYPSSLLMNMNGFN NHPSPSPL QAR A MQQQPQMMYHRS Sbjct: 296 GGGLSGGQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPP 353 Query: 340 XXXXXXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200 DDH SAAHMFSDDNTSS CSIM Sbjct: 354 NTGYYYNHSSSYSPAHYSYSSYGLPGYLAAGGDDHHSAAHMFSDDNTSSSCSIM 407 >XP_003536625.1 PREDICTED: putative uncharacterized protein DDB_G0286901 [Glycine max] KRH35767.1 hypothetical protein GLYMA_10G264000 [Glycine max] Length = 407 Score = 414 bits (1063), Expect = e-137 Identities = 260/414 (62%), Positives = 272/414 (65%), Gaps = 16/414 (3%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VDS Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 ATLIKKLVR+GK+AELWS LV+GLEAFKNQQKFP Sbjct: 61 ATLIKKLVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFP 118 Query: 1033 AFSS-XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS- 863 AFSS EMRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S Sbjct: 119 AFSSEEDEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSN 178 Query: 862 XXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGE 683 KDSP LDQKTM+ALKLNN HL GGE LNLGE Sbjct: 179 NGKMNNGCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGE 235 Query: 682 AKRASDIGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN 515 AKRA+DIGAMMNLAGFNG NGANVGSATVLGG NSNGLGGFPVQS NNMIPGSSA N Sbjct: 236 AKRANDIGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSN 295 Query: 514 -GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHA-MQQQPQMMYHRSXXXXX 341 GGL+ GQYPSSLLMNMNGFN NHPSPSPL QAR A MQQQPQMMYHRS Sbjct: 296 GGGLSGGQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPP 353 Query: 340 
XXXXXXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200 DDH SAAHMFSDDNTSS CSIM Sbjct: 354 NTGYYYNHSSSYSPAHYSYSSYGLPGYPAAGGDDHHSAAHMFSDDNTSSSCSIM 407 >XP_015942069.1 PREDICTED: bromodomain and WD repeat-containing DDB_G0285837 [Arachis duranensis] Length = 417 Score = 410 bits (1054), Expect = e-135 Identities = 254/420 (60%), Positives = 265/420 (63%), Gaps = 22/420 (5%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDA+QQKVTVSGSVD+ Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDADQQKVTVSGSVDA 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEAFKNQQ 1043 ATLIKKLVR+GKYAE WS LVKGLEAFKNQQ Sbjct: 61 ATLIKKLVRAGKYAEPWSQQKTIQNPKQKNNNNIVKDDKNKGGQKQQGLVKGLEAFKNQQ 120 Query: 1042 -KFP-AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-----AADANNVKKGM 884 KFP AFSS EMRFIREKANQL LLRQ AA+ANN+KKG+ Sbjct: 121 QKFPSAFSSEEDDDYYDYDDEDEDDDEEMRFIREKANQLHLLRQQAAAAAAEANNLKKGV 180 Query: 883 GAISAVSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGG 704 GAIS S LDQKTMAALKLNN H+GGG Sbjct: 181 GAISGGSNNVKMNNNAGNNNNVGKKGGPGNNMGLKDGHGGVLDQKTMAALKLNNGHMGGG 240 Query: 703 ESLNLGEAKRASDIGAMMNLAGFNGN-----GANVGSATVLGGNSNGLGGFPVQSNNMIP 539 E LNLGEAKRASDIGAMMNLAGFNGN NVGSATVLG NSNGLGGFPV SNNM P Sbjct: 241 EGLNLGEAKRASDIGAMMNLAGFNGNNNNNVANNVGSATVLGANSNGLGGFPVLSNNMAP 300 Query: 538 GSSAGI-PNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYH 362 GS+A + PNG +TGQYPSSLLMNMNGFN NHPSPSPL MQAR AMQQQPQMMYH Sbjct: 301 GSTAAVLPNGAFSTGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMYH 358 Query: 361 RS------XXXXXXXXXXXXXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 RS P D +SAAHMFSDDNTSS CSIM Sbjct: 359 RSPFVPPNTGYYYNHSNNYSPANYSYALPNYYPHQCPATDDNSAAHMFSDDNTSS-CSIM 417 >XP_016175066.1 PREDICTED: hybrid signal transduction histidine kinase A [Arachis ipaensis] Length = 418 Score = 409 bits (1050), Expect = e-135 Identities = 253/421 (60%), Positives = 265/421 (62%), Gaps = 23/421 (5%) Frame = -2 Query: 1393 
MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDA+QQKVTVSGSVD+ Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDADQQKVTVSGSVDA 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEAFKNQQ 1043 ATLIKKLVR+GKYAE WS LVKGLEAFKNQQ Sbjct: 61 ATLIKKLVRAGKYAEPWSQQKTNQNPKQKNNNNIVKDDKNKGGQKQQGLVKGLEAFKNQQ 120 Query: 1042 -KFP-AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ------AADANNVKKG 887 KFP AFSS EMRFIREKANQL LLRQ AA+ANN+KKG Sbjct: 121 QKFPSAFSSEEDDDYYDYDDEDEDDDEEMRFIREKANQLHLLRQQAAAAAAAEANNLKKG 180 Query: 886 MGAISAVSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG 707 +GAIS S LDQKTMAALKLNN H+GG Sbjct: 181 VGAISGGSNNVKMNNNAGNNNNVGKKGGPGNNMGLKDGHGGVLDQKTMAALKLNNGHMGG 240 Query: 706 GESLNLGEAKRASDIGAMMNLAGFNGN-----GANVGSATVLGGNSNGLGGFPVQSNNMI 542 GE LNLGEAKRASDIGAMMNLAGFNGN NVG+ATVLG NSNGLGGFPV SNNM Sbjct: 241 GEGLNLGEAKRASDIGAMMNLAGFNGNNNNNVANNVGNATVLGANSNGLGGFPVLSNNMA 300 Query: 541 PGSSAGI-PNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMY 365 PGS+A + PNG +TGQYPSSLLMNMNGFN NHPSPSPL MQAR AMQQQPQMMY Sbjct: 301 PGSTAAVLPNGAFSTGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMY 358 Query: 364 HRS------XXXXXXXXXXXXXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSI 203 HRS P D +SAAHMFSDDNTSS CSI Sbjct: 359 HRSPFVPPNTGYYYNHSNNYSPANYSYALPNYYPHQCPATDDNSAAHMFSDDNTSS-CSI 417 Query: 202 M 200 M Sbjct: 418 M 418 >XP_014510893.1 PREDICTED: serine, glycine and glutamine-rich protein [Vigna radiata var. 
radiata] Length = 402 Score = 401 bits (1031), Expect = e-132 Identities = 244/409 (59%), Positives = 260/409 (63%), Gaps = 11/409 (2%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 ATLIKKLVR+GKYAELWS L KGL+AFKNQQKFP Sbjct: 61 ATLIKKLVRAGKYAELWSQKTNQNQKQKNNNAKDDKNKGQKQG--LAKGLDAFKNQQKFP 118 Query: 1033 AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAADANNVKKGMGAISAVSXXX 854 AFSS EMRF+REKA+ LQ+L+Q A NV+K MG + A + Sbjct: 119 AFSSEEDEYYSEYEDDDEEEDEEMRFLREKAHHLQMLKQQAANANVRKSMGGMGAGAING 178 Query: 853 XXXXXXXXXXXXXXXXXXXXXXK---DSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLG 686 +SP LDQKTMAALKLN H+GG G LNLG Sbjct: 179 KMNNGGGNGGGGGGKKGGPNPNMGMKESPNGG--LDQKTMAALKLNGGHVGGEGLGLNLG 236 Query: 685 EAKRASDIGAMMNLAGFNGNGANVGSATVLGGN-SNGLGGFPVQSNNMIPGSSAGIPNGG 509 EAKRA+DIGAMMN+AGFNGNG NV SATVLG N S+G+GGFPVQSNNMIPGSSAG NGG Sbjct: 237 EAKRANDIGAMMNMAGFNGNGGNVTSATVLGANNSSGMGGFPVQSNNMIPGSSAGFSNGG 296 Query: 508 LATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXM--QARHAMQQQPQMMYHRSXXXXXXX 335 + GQYPSSLLMNMNGFNN HPSPSPL M QAR AMQQQPQMMYHRS Sbjct: 297 IGAGQYPSSLLMNMNGFNN--HPSPSPLMMNMNMNMQARQAMQQQPQMMYHRSPLIPPNT 354 Query: 334 XXXXXXXXXXXXXXXXXXXXXPV----DDHSSAAHMFSDDNTSSGCSIM 200 P DDH SA HMFSDDNTSS CSIM Sbjct: 355 GYYYNHSNSYSPAQYSYSYGLPSYPGGDDH-SATHMFSDDNTSSSCSIM 402 >XP_017409476.1 PREDICTED: serine, glycine and glutamine-rich protein [Vigna angularis] KOM28905.1 hypothetical protein LR48_Vigan609s003700 [Vigna angularis] BAT93876.1 hypothetical protein VIGAN_08042400 [Vigna angularis var. 
angularis] Length = 404 Score = 399 bits (1026), Expect = e-131 Identities = 245/411 (59%), Positives = 260/411 (63%), Gaps = 13/411 (3%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 ATLIKKLVR+GKYAELWS L KGL+AFKNQQKFP Sbjct: 61 ATLIKKLVRAGKYAELWSQKSNQNQKQKNNNAKDDKNKGQKQG--LPKGLDAFKNQQKFP 118 Query: 1033 AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAADANNVKKGMG-----AISA 869 AFSS EMRF+REKA+ LQ+L+Q NV+K MG AI+ Sbjct: 119 AFSSEEDEYYSEYEDDDEDEDEEMRFLREKAHHLQMLKQQTANANVRKSMGGMGAGAING 178 Query: 868 VSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLN 692 K+SP LDQKTMAALKLN H+GG G LN Sbjct: 179 KMNNGGGNGGGGGGGGKKGGPNPNMGMKESPNVG--LDQKTMAALKLNGGHVGGEGLGLN 236 Query: 691 LGEAKRASDIGAMMNLAGFNGNGANVGSATVLGGN-SNGLGGFPVQSNNMIPGSSAGIPN 515 LGEAKRA+DIGAMMN+AGFNGNG NV SATVLG N S+G+GGFPVQSNNMIPGSSAG N Sbjct: 237 LGEAKRANDIGAMMNMAGFNGNGGNVTSATVLGANNSSGMGGFPVQSNNMIPGSSAGFSN 296 Query: 514 GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXM--QARHAMQQQPQMMYHRSXXXXX 341 GG+ GQYPSSLLMNMNGFNN HPSPSPL M QAR AMQQQPQMMYHRS Sbjct: 297 GGIGAGQYPSSLLMNMNGFNN--HPSPSPLMMNMNMNMQARQAMQQQPQMMYHRSPLIPP 354 Query: 340 XXXXXXXXXXXXXXXXXXXXXXXPV----DDHSSAAHMFSDDNTSSGCSIM 200 P DDH SA HMFSDDNTSS CSIM Sbjct: 355 NTGYYYNHSNSYSPAQYAYSYGLPSYPGGDDH-SATHMFSDDNTSSSCSIM 404 >KHN42381.1 hypothetical protein glysoja_020093 [Glycine soja] Length = 363 Score = 397 bits (1021), Expect = e-131 Identities = 239/355 (67%), Positives = 252/355 (70%), Gaps = 9/355 (2%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 ATLIKKLVR+GK+AELWS 
LVKGLEAFKNQQKFP Sbjct: 61 ATLIKKLVRAGKHAELWS--QKINQNQKQKNNNAKDDKNKGQKQALVKGLEAFKNQQKFP 118 Query: 1033 AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISA-VSX 860 AFSS EMRF+REKANQLQ+L+ Q A+ANNV+KGMGAI+A + Sbjct: 119 AFSSEEDEYYYDDEDDEEDEDEEMRFLREKANQLQMLKQQTANANNVRKGMGAIAAGANN 178 Query: 859 XXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGE 683 KDSP GLDQKTMAALK NN HLGG G +LNLGE Sbjct: 179 GKTNNGDNANSGKKGGPNHQNMGMKDSP--NGGLDQKTMAALKFNNGHLGGDGLNLNLGE 236 Query: 682 AKRASDIGAMMNLAGFNGNGA--NVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIP 518 AKRA+DIGAMMNLAGFNGN NVGSATVLGG NSNGLGGFPVQS NNMIPGS+A Sbjct: 237 AKRANDIGAMMNLAGFNGNNCANNVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSAAAFS 296 Query: 517 NGGLATGQYPSSLLMNMNGFNNINHPSPSPL-XXXXXMQARHAMQQQPQMMYHRS 356 NGGL+ GQYPSSLLMNMNGFN NHPSPSPL QAR AMQQQPQMMYHRS Sbjct: 297 NGGLSGGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQQARQAMQQQPQMMYHRS 349 >GAU24292.1 hypothetical protein TSUD_48800 [Trifolium subterraneum] Length = 399 Score = 376 bits (966), Expect = e-122 Identities = 231/411 (56%), Positives = 251/411 (61%), Gaps = 13/411 (3%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVD+ Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDA 60 Query: 1213 ATLIKKLVRSGKYAELWS-XXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKF 1037 ATLIKKLVRSGKYAELWS +VKGLEAFKNQQKF Sbjct: 61 ATLIKKLVRSGKYAELWSQKTNNNQNQKQKNNNIVKDDKNKGQKQVVVKGLEAFKNQQKF 120 Query: 1036 PAFSS---XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISA 869 PAFSS E R+IRE ANQ+Q++R Q DANN KK +GA Sbjct: 121 PAFSSEEDGGYYGGYGDDDDEEEEDQETRYIREAANQIQMMRQQVVDANNAKKAIGA--- 177 Query: 868 VSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNL 689 G+DQKT+AA+KLNN HL G ES+NL Sbjct: 178 -----KMNNAGNVNGNSGKKGNSNQNMVGMKESANGVDQKTIAAMKLNNGHLVGNESMNL 232 Query: 688 GEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSN--NMIPGSSAG-IP 518 GE+KR SDIGAMMNLAGFNGN VG+AT+LGGNSNGLGGFPVQSN NMI GSSA IP Sbjct: 233 
GESKRVSDIGAMMNLAGFNGNNNVVGNATILGGNSNGLGGFPVQSNNTNMIQGSSAATIP 292 Query: 517 NGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAM-QQQPQMMYHRSXXXXX 341 NGG TGQ P S++MNMNGFNN PS L QARH M QQQPQMMYHRS Sbjct: 293 NGGFVTGQIPPSMMMNMNGFNN----HPSSLMNMNMQQARHVMQQQQPQMMYHRSPYVPP 348 Query: 340 XXXXXXXXXXXXXXXXXXXXXXXPVDDH----SSAAHMFSDDNTSSGCSIM 200 + + +SAAHMFSDDNT+S CSIM Sbjct: 349 NTGYYYNNYNNYIPPNATNYSSYAMPSYPTEDNSAAHMFSDDNTTSSCSIM 399 >XP_019441465.1 PREDICTED: heavy metal-associated isoprenylated plant protein 37-like [Lupinus angustifolius] OIW12890.1 hypothetical protein TanjilG_24823 [Lupinus angustifolius] Length = 402 Score = 371 bits (953), Expect = e-120 Identities = 234/415 (56%), Positives = 251/415 (60%), Gaps = 17/415 (4%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VD+ Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDA 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEAFKNQQ 1043 ATLIKKL R+GKYA+LWS LVKGLE FKNQQ Sbjct: 61 ATLIKKLARAGKYAQLWSQKSSNQNQKQNNNNNNCVKDDNKNKGQKQGLVKGLEDFKNQQ 120 Query: 1042 KFPAFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-AADANNVKKGMGAISAV 866 KFPAFSS EMRF+RE+ NQLQ+LRQ A DANN K A++ Sbjct: 121 KFPAFSSEEDDDFYDYDDDEDDDDEEMRFMRERVNQLQMLRQQAVDANNAAKNGVAVNNN 180 Query: 865 SXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGG--ESLN 692 LDQKT+AALK+NN HLGGG E LN Sbjct: 181 GKINNNAGKKGGPNQNMVIKDNTGG----------LDQKTIAALKMNNGHLGGGGGEGLN 230 Query: 691 LGEAKRASDIGAMMNLAGFNGNGA-NVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGIPN 515 +G++KRA+DIG MMNLAGFNGNGA N GSATVLG NSNGLGGF QS NMIPGSSA IPN Sbjct: 231 IGDSKRANDIGVMMNLAGFNGNGANNAGSATVLGPNSNGLGGFSAQS-NMIPGSSAVIPN 289 Query: 514 GGL-ATG-QYPSSLLMNMNGFNNINHPSPSPL-XXXXXMQARHAMQQQPQMMYHRSXXXX 344 G ATG QYPSSLLMNMNGFN NHPSPSPL MQARHAMQQQPQMMYHRS Sbjct: 290 GAFAATGQQYPSSLLMNMNGFN--NHPSPSPLMMNNMNMQARHAMQQQPQMMYHRSPYIP 347 Query: 343 XXXXXXXXXXXXXXXXXXXXXXXXPVD-------DHSSAAHMFSDDNTSSGCSIM 200 +SA H+FSDD 
T S CS+M Sbjct: 348 PNTGYYYNHNLNNNHTPANYNYATMPSYPVGGGGSDNSATHIFSDDYTGSSCSVM 402 >KRH35766.1 hypothetical protein GLYMA_10G264000 [Glycine max] Length = 392 Score = 369 bits (948), Expect = e-120 Identities = 245/414 (59%), Positives = 257/414 (62%), Gaps = 16/414 (3%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQ KVKKLLQRIEGVYQVQIDAEQQKVTVSG VDS Sbjct: 1 MTKEEDFKLLKIQ---------------KVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDS 45 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 ATLIKKLVR+GK+AELWS LV+GLEAFKNQQKFP Sbjct: 46 ATLIKKLVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFP 103 Query: 1033 AFSS-XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS- 863 AFSS EMRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S Sbjct: 104 AFSSEEDEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSN 163 Query: 862 XXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGE 683 KDSP LDQKTM+ALKLNN HL GGE LNLGE Sbjct: 164 NGKMNNGCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGE 220 Query: 682 AKRASDIGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN 515 AKRA+DIGAMMNLAGFNG NGANVGSATVLGG NSNGLGGFPVQS NNMIPGSSA N Sbjct: 221 AKRANDIGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSN 280 Query: 514 -GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHA-MQQQPQMMYHRSXXXXX 341 GGL+ GQYPSSLLMNMNGFN NHPSPSPL QAR A MQQQPQMMYHRS Sbjct: 281 GGGLSGGQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPP 338 Query: 340 XXXXXXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200 DDH SAAHMFSDDNTSS CSIM Sbjct: 339 NTGYYYNHSSSYSPAHYSYSSYGLPGYPAAGGDDHHSAAHMFSDDNTSSSCSIM 392 >AFK47709.1 unknown [Lotus japonicus] Length = 400 Score = 367 bits (941), Expect = e-119 Identities = 236/419 (56%), Positives = 249/419 (59%), Gaps = 21/419 (5%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS Sbjct: 1 
MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEA-FKN- 1049 A LIKKL RSGK+AELWS LVKGLEA FKN Sbjct: 61 AALIKKLNRSGKHAELWSQKANQNQKQKNNNNINNVKDDKNNKGQKQGLVKGLEAAFKNH 120 Query: 1048 ---QQKFPAFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAADA-----NNVK 893 QQKFPAFSS +RFIREKANQLQLLRQ A NNVK Sbjct: 121 QQQQQKFPAFSSEEDDEYYDYDDEDDDDEE-LRFIREKANQLQLLRQQQQAVVDANNNVK 179 Query: 892 KGMGAISAVSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHL 713 K + A S S DQKTMAALKLNNAHL Sbjct: 180 KAISAASNNGHNKMNNAAGKKGGQNQNMGGMKESNVGS-------DQKTMAALKLNNAHL 232 Query: 712 GGGESLNLGEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGS 533 GGGESLNLGEAKRA+DIGAMMNLAGF NG N G+ATVLGGNSNG+GGFPVQSNNM G+ Sbjct: 233 GGGESLNLGEAKRANDIGAMMNLAGF--NGGNAGNATVLGGNSNGMGGFPVQSNNMFQGN 290 Query: 532 S-AGIPNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRS 356 S A +PNGG Y S+LMNMNGFNN SP+ MQ RHAMQQQPQMM+HRS Sbjct: 291 SPAAVPNGG-----YAPSMLMNMNGFNN----HQSPMMNMNMMQTRHAMQQQPQMMFHRS 341 Query: 355 XXXXXXXXXXXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200 P DH SAAHMFSDDNT+S CS+M Sbjct: 342 PVIPPNTGYYFNHNNYNPAANYSYYASLPSYPGGDYDHDHHSAAHMFSDDNTTSSCSVM 400 >XP_002284132.1 PREDICTED: heavy metal-associated isoprenylated plant protein 37 [Vitis vinifera] Length = 390 Score = 341 bits (875), Expect = e-109 Identities = 214/401 (53%), Positives = 242/401 (60%), Gaps = 3/401 (0%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTK+EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVY V IDAEQQ+VTVSGSVDS Sbjct: 1 MTKDEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYTVNIDAEQQRVTVSGSVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 TLIKKLV++GK+AELWS GL+KGLEAFK QQKFP Sbjct: 61 GTLIKKLVKAGKHAELWS-QKSNQNQKQKTNCIKDDKNNKGQKQGLIKGLEAFKTQQKFP 119 Query: 1033 AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVSXX 857 FSS E+RF++EKANQL LLR QA DA+N KKG GAI+A + Sbjct: 120 
VFSS-EEDEDDFDDDEEDYEEEELRFLQEKANQLSLLRQQALDASNAKKGFGAIAASNNG 178 Query: 856 XXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAK 677 K SP G+DQKT+AALK+NN HL GG ++N GE K Sbjct: 179 KINNNVGNGNVQKKGNPNQNMGMKGSP---GGIDQKTIAALKMNNPHLVGGGNINSGEVK 235 Query: 676 RASDIGAMMNLAGFNGNGANV-GSATVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLAT 500 R +DI +MM L GF+GNG NV +A LGGNSN LGGF +Q NN GSS G PNGG AT Sbjct: 236 RGNDINSMMGLGGFHGNGGNVAATAAALGGNSNALGGFQIQPNNGFQGSSTGFPNGGFAT 295 Query: 499 G-QYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXX 323 G +PS +LMN+NG N NHPS + Q RHA QQPQMMYHRS Sbjct: 296 GHHHPSPMLMNLNG-NQYNHPS-QMMMNMNMQQNRHAPMQQPQMMYHRSPFIPPSTGYYY 353 Query: 322 XXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 DH SA+HMFSD+NTSS CSIM Sbjct: 354 NYSPALSPYTHCDTNYS--GDH-SASHMFSDENTSS-CSIM 390 >XP_018843651.1 PREDICTED: neurogenic protein mastermind-like [Juglans regia] Length = 378 Score = 338 bits (868), Expect = e-108 Identities = 218/401 (54%), Positives = 240/401 (59%), Gaps = 3/401 (0%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVY V IDAEQQKVTVSGSVD+ Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYLVNIDAEQQKVTVSGSVDA 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 +TLIKKLVR+GK+AE WS L KGLEAFKNQQKFP Sbjct: 61 STLIKKLVRAGKHAEPWS-QKNTQNQKQMNNCVKDAKNNKSQKPLLFKGLEAFKNQQKFP 119 Query: 1033 AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXX 857 AFSS E+RFIREKANQL LLRQ A DANN KKG+ AI A S Sbjct: 120 AFSS-EEEDDYFDDVEEDEEEDELRFIREKANQLNLLRQRAIDANNAKKGVAAIGAAS-N 177 Query: 856 XXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAK 677 G+D KT+AALK+N+AHLGGG ++N GE + Sbjct: 178 NGKMNNVGNGNIGNGKKANTEQNMGMRASPAGIDPKTLAALKINSAHLGGG-NVNAGEGR 236 Query: 676 RASDIGAMMNLAGFNGNGANVGS--ATVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLA 503 R SD+ MM LAGF+GNG NV S A LGGNSNGLGGF GSSAG P GG A Sbjct: 237 RVSDLNGMMGLAGFHGNGLNVASAGAAALGGNSNGLGGF--------QGSSAGFPTGGYA 288 Query: 502 
TGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXX 323 TGQYPSS+LMNMNG NHPSP + MQAR+AMQQQPQMMYHRS Sbjct: 289 TGQYPSSMLMNMNGH---NHPSPMMM----NMQARNAMQQQPQMMYHRSPHVPPTTGYYY 341 Query: 322 XXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 +SAA+MFSD+NT+S CSIM Sbjct: 342 NYSPSPNPYSYTDPNH---TGRNSAAYMFSDENTNS-CSIM 378 >XP_007045083.1 PREDICTED: myb-like protein I [Theobroma cacao] XP_007045084.1 PREDICTED: myb-like protein I [Theobroma cacao] XP_007045086.1 PREDICTED: myb-like protein I [Theobroma cacao] EOY00915.1 Heavy metal transport/detoxification superfamily protein isoform 1 [Theobroma cacao] EOY00916.1 Heavy metal transport/detoxification superfamily protein isoform 1 [Theobroma cacao] EOY00917.1 Heavy metal transport/detoxification superfamily protein isoform 1 [Theobroma cacao] EOY00918.1 Heavy metal transport/detoxification superfamily protein isoform 1 [Theobroma cacao] Length = 392 Score = 335 bits (859), Expect = e-106 Identities = 211/404 (52%), Positives = 237/404 (58%), Gaps = 6/404 (1%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQV IDAEQQKVTVSGSVDS Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVSIDAEQQKVTVSGSVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 ATLIKKLVR+GK+AE+WS GL+KGLEAFK QQKFP Sbjct: 61 ATLIKKLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKFP 119 Query: 1033 AFSSXXXXXXXXXXXXXXXXXXEMRFIRE----KANQLQLLR-QAADANNVKKGMGAISA 869 +F S E++F++ + QL LLR QA DANN K G+G I+A Sbjct: 120 SFVS-EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNITA 178 Query: 868 VSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNL 689 S LDQKT+AALK+NNA L GG ++N Sbjct: 179 TS-NNNNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNINA 236 Query: 688 GEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PNG 512 E KR DI +M L+GF+GNGANV A LGGN N +GGF VQSNN + GSSA I NG Sbjct: 237 
AEGKRGHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQNG 296 Query: 511 GLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXX 332 G TGQ PSS+LMNMNG+N PS + +Q RHAMQQQPQMMYHRS Sbjct: 297 GYVTGQNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPSTG 351 Query: 331 XXXXXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 DHS+A HMFSDDNTSS CSIM Sbjct: 352 YYYNYGPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 392 >GAV65585.1 HMA domain-containing protein [Cephalotus follicularis] Length = 383 Score = 334 bits (857), Expect = e-106 Identities = 208/401 (51%), Positives = 238/401 (59%), Gaps = 3/401 (0%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDS 1214 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGV+QV IDAEQQ+VT+SGSVDS Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVFQVNIDAEQQRVTISGSVDS 60 Query: 1213 ATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFP 1034 ATLIKKLVR+GK+AELWS L+KGLE+ KNQQKFP Sbjct: 61 ATLIKKLVRAGKHAELWSQKSNQNQKQKNNCIKEDKNNESQKQG-LIKGLESLKNQQKFP 119 Query: 1033 AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAA--DANNVKKGMGAISAVSX 860 AFSS ++RF+ + +QL LLRQ A +ANN KKG GAI+A + Sbjct: 120 AFSSEEDDDYLDDDEDDDEEVEQLRFLEKANHQLGLLRQQAAIEANNAKKG-GAIAAAAA 178 Query: 859 XXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEA 680 GLDQKTMAALK+NNAHLGGG ++N GE Sbjct: 179 NNGKMNTSVGNLNTGKKGNPNQNTGIK-VNPGGLDQKTMAALKMNNAHLGGG-NINTGEV 236 Query: 679 KRASDIGAMMNLAGFNGNGANVGSA-TVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLA 503 KR +D+ MM L GF+GNGAN+G+A T LGGN+NGLGG VQ N S AG PNGG A Sbjct: 237 KRGNDLSTMMGLTGFHGNGANIGNAATALGGNANGLGGIQVQPNGYQGSSGAGFPNGGYA 296 Query: 502 TGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXX 323 TGQYPS++LMNMNG+N+ P MQ RH QPQMMYHRS Sbjct: 297 TGQYPSAMLMNMNGYNH-------PASMMMNMQNRH---PQPQMMYHRSPYIPASTGYYH 346 Query: 322 XXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 HS+A HMFSD+NTSS CSIM Sbjct: 347 NYSPSPYSYTEQPNHRI---GHSAATHMFSDENTSS-CSIM 383 >EOY00920.1 Heavy metal transport/detoxification superfamily protein isoform 6 [Theobroma 
cacao] Length = 393 Score = 330 bits (847), Expect = e-105 Identities = 211/405 (52%), Positives = 237/405 (58%), Gaps = 7/405 (1%) Frame = -2 Query: 1393 MTKEEDFKLLKIQ-TCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVD 1217 MTKEEDFKLLKIQ TCVLKVNIHCDGCKQKVKKLLQRIEGVYQV IDAEQQKVTVSGSVD Sbjct: 1 MTKEEDFKLLKIQQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVSIDAEQQKVTVSGSVD 60 Query: 1216 SATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKF 1037 SATLIKKLVR+GK+AE+WS GL+KGLEAFK QQKF Sbjct: 61 SATLIKKLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKF 119 Query: 1036 PAFSSXXXXXXXXXXXXXXXXXXEMRFIRE----KANQLQLLR-QAADANNVKKGMGAIS 872 P+F S E++F++ + QL LLR QA DANN K G+G I+ Sbjct: 120 PSFVS-EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNIT 178 Query: 871 AVSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLN 692 A S LDQKT+AALK+NNA L GG ++N Sbjct: 179 ATS-NNNNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNIN 236 Query: 691 LGEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PN 515 E KR DI +M L+GF+GNGANV A LGGN N +GGF VQSNN + GSSA I N Sbjct: 237 AAEGKRGHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQN 296 Query: 514 GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXX 335 GG TGQ PSS+LMNMNG+N PS + +Q RHAMQQQPQMMYHRS Sbjct: 297 GGYVTGQNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPST 351 Query: 334 XXXXXXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 DHS+A HMFSDDNTSS CSIM Sbjct: 352 GYYYNYGPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 393 >EOY00919.1 Heavy metal transport/detoxification superfamily protein isoform 5 [Theobroma cacao] Length = 393 Score = 330 bits (847), Expect = e-105 Identities = 211/405 (52%), Positives = 237/405 (58%), Gaps = 7/405 (1%) Frame = -2 Query: 1393 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIE-GVYQVQIDAEQQKVTVSGSVD 1217 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIE GVYQV IDAEQQKVTVSGSVD Sbjct: 1 MTKEEDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGGVYQVSIDAEQQKVTVSGSVD 60 Query: 1216 SATLIKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKF 1037 
SATLIKKLVR+GK+AE+WS GL+KGLEAFK QQKF Sbjct: 61 SATLIKKLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKF 119 Query: 1036 PAFSSXXXXXXXXXXXXXXXXXXEMRFIRE----KANQLQLLR-QAADANNVKKGMGAIS 872 P+F S E++F++ + QL LLR QA DANN K G+G I+ Sbjct: 120 PSFVS-EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNIT 178 Query: 871 AVSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLN 692 A S LDQKT+AALK+NNA L GG ++N Sbjct: 179 ATS-NNNNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNIN 236 Query: 691 LGEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PN 515 E KR DI +M L+GF+GNGANV A LGGN N +GGF VQSNN + GSSA I N Sbjct: 237 AAEGKRGHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQN 296 Query: 514 GGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXX 335 GG TGQ PSS+LMNMNG+N PS + +Q RHAMQQQPQMMYHRS Sbjct: 297 GGYVTGQNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPST 351 Query: 334 XXXXXXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 DHS+A HMFSDDNTSS CSIM Sbjct: 352 GYYYNYGPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 393