BLASTX nr result
ID: Glycyrrhiza35_contig00024123
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00024123
         (1383 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

KYP75352.1 hypothetical protein KK1_008076 [Cajanus cajan]           410   e-137
XP_003555274.1 PREDICTED: putative uncharacterized protein DDB_G...  408   e-137
XP_007143034.1 hypothetical protein PHAVU_007G038000g [Phaseolus...  406   e-136
KHN06753.1 hypothetical protein glysoja_021299 [Glycine soja]        406   e-136
XP_003536625.1 PREDICTED: putative uncharacterized protein DDB_G...  406   e-136
XP_015942069.1 PREDICTED: bromodomain and WD repeat-containing D...  402   e-134
XP_016175066.1 PREDICTED: hybrid signal transduction histidine k...  401   e-134
XP_014510893.1 PREDICTED: serine, glycine and glutamine-rich pro...  394   e-131
XP_017409476.1 PREDICTED: serine, glycine and glutamine-rich pro...  392   e-130
KHN42381.1 hypothetical protein glysoja_020093 [Glycine soja]        390   e-130
GAU24292.1 hypothetical protein TSUD_48800 [Trifolium subterraneum]  369   e-121
XP_019441465.1 PREDICTED: heavy metal-associated isoprenylated p...  363   e-119
KRH35766.1 hypothetical protein GLYMA_10G264000 [Glycine max]        362   e-119
AFK47709.1 unknown [Lotus japonicus]                                 359   e-117
XP_002284132.1 PREDICTED: heavy metal-associated isoprenylated p...  335   e-108
XP_018843651.1 PREDICTED: neurogenic protein mastermind-like [Ju...  331   e-107
XP_007045083.1 PREDICTED: myb-like protein I [Theobroma cacao] X...  327   e-105
GAV65585.1 HMA domain-containing protein [Cephalotus follicularis]   327   e-105
EOY00920.1 Heavy metal transport/detoxification superfamily prot...
323 e-103 EOY00919.1 Heavy metal transport/detoxification superfamily prot... 323 e-103 >KYP75352.1 hypothetical protein KK1_008076 [Cajanus cajan] Length = 397 Score = 410 bits (1053), Expect = e-137 Identities = 250/401 (62%), Positives = 268/401 (66%), Gaps = 7/401 (1%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTV GSVDSATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVLGSVDSATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQ-QKFPAFS 1025 KKLVR+GKYAELWS GL KG+EAFKNQ QKFPAFS Sbjct: 65 KKLVRAGKYAELWS--QKTNQNQKQKNNNAKDEKNKGQKQGLPKGIEAFKNQHQKFPAFS 122 Query: 1024 SXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVSXXXXX 848 S EMR +REKA Q+++L+ Q +ANNV+KGMG+ISA + Sbjct: 123 SEEDEYYFETDEEDEDEDEEMRMLREKAIQMKMLKHQPPNANNVRKGMGSISAGANNGKM 182 Query: 847 XXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKRA 671 KD+P GLDQKTMAALKLNNAHL G G +LNLGEAKRA Sbjct: 183 NNACNANSGKKGGPNHNMAMKDNP---NGLDQKTMAALKLNNAHLNGEGLNLNLGEAKRA 239 Query: 670 SDIGAMMNLAGFNGNGANVGSATVLGG-NSNGLGGFPVQSNNMIPGSSAGIPNGGLATGQ 494 +DIGAMMNLAGF+GNGANVGSATVLGG NSNG GGFPVQSNNMIPGSSA P+GGLA+GQ Sbjct: 240 NDIGAMMNLAGFHGNGANVGSATVLGGNNSNGFGGFPVQSNNMIPGSSAAFPSGGLASGQ 299 Query: 493 YPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXX 314 YPSSLLMNMNGFN NH SPSPL MQAR AMQQQPQMMYHRS Sbjct: 300 YPSSLLMNMNGFN--NHTSPSPLMMNMNMQARQAMQQQPQMMYHRSPFVPPNTGYYYNHS 357 Query: 313 XXXXXXXXXXXXXXPV---DDHSSAAHMFSDDNTSSGCSIM 200 DDH +AAHMFSDDNTSS CSIM Sbjct: 358 GYSPAHYSYSYGLPSYPGGDDH-TAAHMFSDDNTSSSCSIM 397 >XP_003555274.1 PREDICTED: putative uncharacterized protein DDB_G0286901 [Glycine max] KRG90987.1 hypothetical protein GLYMA_20G126300 [Glycine max] Length = 407 Score = 408 bits (1049), Expect = e-137 Identities = 255/410 (62%), Positives = 268/410 (65%), Gaps = 16/410 (3%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 
EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GK+AELWS LVKGLEAFKNQQKFPAFSS Sbjct: 65 KKLVRAGKHAELWS--QKINQNQKQKNNNAKDDKNKGQKQALVKGLEAFKNQQKFPAFSS 122 Query: 1021 XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISA-VSXXXXX 848 EMRF+REKANQLQ+L+ Q A+ANNV+KGMGAI+A + Sbjct: 123 EEDEYYYDDEDDEEDEDEEMRFLREKANQLQMLKQQTANANNVRKGMGAIAAGANNGKTN 182 Query: 847 XXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKRA 671 KDSP GLDQKTMAALK NN HLGG G +LNLGEAKRA Sbjct: 183 NGDNANSGKKGGPNHQNMGMKDSP--NGGLDQKTMAALKFNNGHLGGDGLNLNLGEAKRA 240 Query: 670 SDIGAMMNLAGFNGNGA--NVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPNGGL 506 +DIGAMMNLAGFNGN NVGSATVLGG NSNGLGGFPVQS NNMIPGS+A NGGL Sbjct: 241 NDIGAMMNLAGFNGNNCANNVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSAAAFSNGGL 300 Query: 505 ATGQYPSSLLMNMNGFNNINHPSPSPL-XXXXXMQARHAMQQQPQMMYHRSXXXXXXXXX 329 + GQYPSSLLMNMNGFN NHPSPSPL QAR AMQQQPQMMYHRS Sbjct: 301 SGGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQQARQAMQQQPQMMYHRSPFVPPNTGY 358 Query: 328 XXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200 DDH SAAHMFSDDNTSS CSIM Sbjct: 359 YYNHSSYSPAHYSYSYGLPSYPAAAGGGDDH-SAAHMFSDDNTSSSCSIM 407 >XP_007143034.1 hypothetical protein PHAVU_007G038000g [Phaseolus vulgaris] ESW15028.1 hypothetical protein PHAVU_007G038000g [Phaseolus vulgaris] Length = 400 Score = 406 bits (1043), Expect = e-136 Identities = 245/403 (60%), Positives = 263/403 (65%), Gaps = 9/403 (2%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GKYAELWS LVKGL+AFKNQQKFPAFSS Sbjct: 65 KKLVRAGKYAELWSQKINQNQKQKNNNAKDDKNKGQKQA--LVKGLDAFKNQQKFPAFSS 122 Query: 1021 
XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXXXXXX 845 EMRF+REKANQLQ+L+Q AA+ANNV+K M + A + Sbjct: 123 EEDEYYSEYDDEDEDEDEEMRFLREKANQLQMLKQQAANANNVRKNMAPMGAGAINGKMN 182 Query: 844 XXXXXXXXXXXXXXXXXXXK-DSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKRA 671 +SP LDQKTMAALKLN HLGG G +LNLGEAKRA Sbjct: 183 NGGGNAGNGKKGGPNPNMGVKESPNVG--LDQKTMAALKLNGGHLGGEGLNLNLGEAKRA 240 Query: 670 SDIGAMMNLAGFNGNGANVGSATVLGGNS-NGLGGFPVQSNNMIPGSSAGIPNGGLATGQ 494 +DIGAMMN+AGFNGNG NV SATVLG N+ N +GGFPVQSNNMIPGSSA NGG+ATGQ Sbjct: 241 NDIGAMMNMAGFNGNGGNVSSATVLGANNPNAMGGFPVQSNNMIPGSSAAFSNGGMATGQ 300 Query: 493 YPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXX 314 YPSSLLMNM+GFN NHPSPSPL MQAR AMQQQPQMMYHRS Sbjct: 301 YPSSLLMNMSGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMYHRSPVIPTNTGYYYNHS 358 Query: 313 XXXXXXXXXXXXXXPV-----DDHSSAAHMFSDDNTSSGCSIM 200 P DDH SAAHMFSDDNT+S CSIM Sbjct: 359 NSYSPAQYSYSYGLPSYPGSGDDH-SAAHMFSDDNTNSSCSIM 400 >KHN06753.1 hypothetical protein glysoja_021299 [Glycine soja] Length = 407 Score = 406 bits (1043), Expect = e-136 Identities = 256/410 (62%), Positives = 268/410 (65%), Gaps = 16/410 (3%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VDSATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDSATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GK+AELWS LV+GLEAFKNQQKFPAFSS Sbjct: 65 KKLVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFPAFSS 122 Query: 1021 -XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS-XXXX 851 EMRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S Sbjct: 123 EEDEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSNNGKM 182 Query: 850 XXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRA 671 KDSP LDQKTM+ALKLNN HL GGE LNLGEAKRA Sbjct: 183 NNGCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGEAKRA 239 Query: 670 SDIGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN-GGL 506 +DIGAMMNLAGFNG NGANVGSATVLGG 
NSNGLGGFPVQS NNMIPGSSA N GGL Sbjct: 240 NDIGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSNGGGL 299 Query: 505 ATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHA-MQQQPQMMYHRSXXXXXXXXX 329 + GQYPSSLLMNMNGFN NHPSPSPL QAR A MQQQPQMMYHRS Sbjct: 300 SGGQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPPNTGY 357 Query: 328 XXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200 DDH SAAHMFSDDNTSS CSIM Sbjct: 358 YYNHSSSYSPAHYSYSSYGLPGYLAAGGDDHHSAAHMFSDDNTSSSCSIM 407 >XP_003536625.1 PREDICTED: putative uncharacterized protein DDB_G0286901 [Glycine max] KRH35767.1 hypothetical protein GLYMA_10G264000 [Glycine max] Length = 407 Score = 406 bits (1043), Expect = e-136 Identities = 256/410 (62%), Positives = 268/410 (65%), Gaps = 16/410 (3%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VDSATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDSATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GK+AELWS LV+GLEAFKNQQKFPAFSS Sbjct: 65 KKLVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFPAFSS 122 Query: 1021 -XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS-XXXX 851 EMRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S Sbjct: 123 EEDEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSNNGKM 182 Query: 850 XXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRA 671 KDSP LDQKTM+ALKLNN HL GGE LNLGEAKRA Sbjct: 183 NNGCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGEAKRA 239 Query: 670 SDIGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN-GGL 506 +DIGAMMNLAGFNG NGANVGSATVLGG NSNGLGGFPVQS NNMIPGSSA N GGL Sbjct: 240 NDIGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSNGGGL 299 Query: 505 ATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHA-MQQQPQMMYHRSXXXXXXXXX 329 + GQYPSSLLMNMNGFN NHPSPSPL QAR A MQQQPQMMYHRS Sbjct: 300 SGGQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPPNTGY 357 Query: 328 XXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200 
DDH SAAHMFSDDNTSS CSIM Sbjct: 358 YYNHSSSYSPAHYSYSSYGLPGYPAAGGDDHHSAAHMFSDDNTSSSCSIM 407 >XP_015942069.1 PREDICTED: bromodomain and WD repeat-containing DDB_G0285837 [Arachis duranensis] Length = 417 Score = 402 bits (1034), Expect = e-134 Identities = 250/416 (60%), Positives = 261/416 (62%), Gaps = 22/416 (5%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDA+QQKVTVSGSVD+ATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDADQQKVTVSGSVDAATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEAFKNQQ-KFP 1034 KKLVR+GKYAE WS LVKGLEAFKNQQ KFP Sbjct: 65 KKLVRAGKYAEPWSQQKTIQNPKQKNNNNIVKDDKNKGGQKQQGLVKGLEAFKNQQQKFP 124 Query: 1033 -AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-----AADANNVKKGMGAIS 872 AFSS EMRFIREKANQL LLRQ AA+ANN+KKG+GAIS Sbjct: 125 SAFSSEEDDDYYDYDDEDEDDDEEMRFIREKANQLHLLRQQAAAAAAEANNLKKGVGAIS 184 Query: 871 AVSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLN 692 S LDQKTMAALKLNN H+GGGE LN Sbjct: 185 GGSNNVKMNNNAGNNNNVGKKGGPGNNMGLKDGHGGVLDQKTMAALKLNNGHMGGGEGLN 244 Query: 691 LGEAKRASDIGAMMNLAGFNGN-----GANVGSATVLGGNSNGLGGFPVQSNNMIPGSSA 527 LGEAKRASDIGAMMNLAGFNGN NVGSATVLG NSNGLGGFPV SNNM PGS+A Sbjct: 245 LGEAKRASDIGAMMNLAGFNGNNNNNVANNVGSATVLGANSNGLGGFPVLSNNMAPGSTA 304 Query: 526 GI-PNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRS-- 356 + PNG +TGQYPSSLLMNMNGFN NHPSPSPL MQAR AMQQQPQMMYHRS Sbjct: 305 AVLPNGAFSTGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMYHRSPF 362 Query: 355 ----XXXXXXXXXXXXXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 P D +SAAHMFSDDNTSS CSIM Sbjct: 363 VPPNTGYYYNHSNNYSPANYSYALPNYYPHQCPATDDNSAAHMFSDDNTSS-CSIM 417 >XP_016175066.1 PREDICTED: hybrid signal transduction histidine kinase A [Arachis ipaensis] Length = 418 Score = 401 bits (1030), Expect = e-134 Identities = 249/417 (59%), Positives = 261/417 (62%), Gaps = 23/417 (5%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 
EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDA+QQKVTVSGSVD+ATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDADQQKVTVSGSVDAATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEAFKNQQ-KFP 1034 KKLVR+GKYAE WS LVKGLEAFKNQQ KFP Sbjct: 65 KKLVRAGKYAEPWSQQKTNQNPKQKNNNNIVKDDKNKGGQKQQGLVKGLEAFKNQQQKFP 124 Query: 1033 -AFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ------AADANNVKKGMGAI 875 AFSS EMRFIREKANQL LLRQ AA+ANN+KKG+GAI Sbjct: 125 SAFSSEEDDDYYDYDDEDEDDDEEMRFIREKANQLHLLRQQAAAAAAAEANNLKKGVGAI 184 Query: 874 SAVSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESL 695 S S LDQKTMAALKLNN H+GGGE L Sbjct: 185 SGGSNNVKMNNNAGNNNNVGKKGGPGNNMGLKDGHGGVLDQKTMAALKLNNGHMGGGEGL 244 Query: 694 NLGEAKRASDIGAMMNLAGFNGN-----GANVGSATVLGGNSNGLGGFPVQSNNMIPGSS 530 NLGEAKRASDIGAMMNLAGFNGN NVG+ATVLG NSNGLGGFPV SNNM PGS+ Sbjct: 245 NLGEAKRASDIGAMMNLAGFNGNNNNNVANNVGNATVLGANSNGLGGFPVLSNNMAPGST 304 Query: 529 AGI-PNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRS- 356 A + PNG +TGQYPSSLLMNMNGFN NHPSPSPL MQAR AMQQQPQMMYHRS Sbjct: 305 AAVLPNGAFSTGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQARQAMQQQPQMMYHRSP 362 Query: 355 -----XXXXXXXXXXXXXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 P D +SAAHMFSDDNTSS CSIM Sbjct: 363 FVPPNTGYYYNHSNNYSPANYSYALPNYYPHQCPATDDNSAAHMFSDDNTSS-CSIM 418 >XP_014510893.1 PREDICTED: serine, glycine and glutamine-rich protein [Vigna radiata var. 
radiata] Length = 402 Score = 394 bits (1011), Expect = e-131 Identities = 240/405 (59%), Positives = 256/405 (63%), Gaps = 11/405 (2%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GKYAELWS L KGL+AFKNQQKFPAFSS Sbjct: 65 KKLVRAGKYAELWSQKTNQNQKQKNNNAKDDKNKGQKQG--LAKGLDAFKNQQKFPAFSS 122 Query: 1021 XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAADANNVKKGMGAISAVSXXXXXXX 842 EMRF+REKA+ LQ+L+Q A NV+K MG + A + Sbjct: 123 EEDEYYSEYEDDDEEEDEEMRFLREKAHHLQMLKQQAANANVRKSMGGMGAGAINGKMNN 182 Query: 841 XXXXXXXXXXXXXXXXXXK---DSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKR 674 +SP LDQKTMAALKLN H+GG G LNLGEAKR Sbjct: 183 GGGNGGGGGGKKGGPNPNMGMKESPNGG--LDQKTMAALKLNGGHVGGEGLGLNLGEAKR 240 Query: 673 ASDIGAMMNLAGFNGNGANVGSATVLGGN-SNGLGGFPVQSNNMIPGSSAGIPNGGLATG 497 A+DIGAMMN+AGFNGNG NV SATVLG N S+G+GGFPVQSNNMIPGSSAG NGG+ G Sbjct: 241 ANDIGAMMNMAGFNGNGGNVTSATVLGANNSSGMGGFPVQSNNMIPGSSAGFSNGGIGAG 300 Query: 496 QYPSSLLMNMNGFNNINHPSPSPLXXXXXM--QARHAMQQQPQMMYHRSXXXXXXXXXXX 323 QYPSSLLMNMNGFNN HPSPSPL M QAR AMQQQPQMMYHRS Sbjct: 301 QYPSSLLMNMNGFNN--HPSPSPLMMNMNMNMQARQAMQQQPQMMYHRSPLIPPNTGYYY 358 Query: 322 XXXXXXXXXXXXXXXXXPV----DDHSSAAHMFSDDNTSSGCSIM 200 P DDH SA HMFSDDNTSS CSIM Sbjct: 359 NHSNSYSPAQYSYSYGLPSYPGGDDH-SATHMFSDDNTSSSCSIM 402 >XP_017409476.1 PREDICTED: serine, glycine and glutamine-rich protein [Vigna angularis] KOM28905.1 hypothetical protein LR48_Vigan609s003700 [Vigna angularis] BAT93876.1 hypothetical protein VIGAN_08042400 [Vigna angularis var. 
angularis] Length = 404 Score = 392 bits (1006), Expect = e-130 Identities = 241/407 (59%), Positives = 256/407 (62%), Gaps = 13/407 (3%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GKYAELWS L KGL+AFKNQQKFPAFSS Sbjct: 65 KKLVRAGKYAELWSQKSNQNQKQKNNNAKDDKNKGQKQG--LPKGLDAFKNQQKFPAFSS 122 Query: 1021 XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAADANNVKKGMG-----AISAVSXX 857 EMRF+REKA+ LQ+L+Q NV+K MG AI+ Sbjct: 123 EEDEYYSEYEDDDEDEDEEMRFLREKAHHLQMLKQQTANANVRKSMGGMGAGAINGKMNN 182 Query: 856 XXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEA 680 K+SP LDQKTMAALKLN H+GG G LNLGEA Sbjct: 183 GGGNGGGGGGGGKKGGPNPNMGMKESPNVG--LDQKTMAALKLNGGHVGGEGLGLNLGEA 240 Query: 679 KRASDIGAMMNLAGFNGNGANVGSATVLGGN-SNGLGGFPVQSNNMIPGSSAGIPNGGLA 503 KRA+DIGAMMN+AGFNGNG NV SATVLG N S+G+GGFPVQSNNMIPGSSAG NGG+ Sbjct: 241 KRANDIGAMMNMAGFNGNGGNVTSATVLGANNSSGMGGFPVQSNNMIPGSSAGFSNGGIG 300 Query: 502 TGQYPSSLLMNMNGFNNINHPSPSPLXXXXXM--QARHAMQQQPQMMYHRSXXXXXXXXX 329 GQYPSSLLMNMNGFNN HPSPSPL M QAR AMQQQPQMMYHRS Sbjct: 301 AGQYPSSLLMNMNGFNN--HPSPSPLMMNMNMNMQARQAMQQQPQMMYHRSPLIPPNTGY 358 Query: 328 XXXXXXXXXXXXXXXXXXXPV----DDHSSAAHMFSDDNTSSGCSIM 200 P DDH SA HMFSDDNTSS CSIM Sbjct: 359 YYNHSNSYSPAQYAYSYGLPSYPGGDDH-SATHMFSDDNTSSSCSIM 404 >KHN42381.1 hypothetical protein glysoja_020093 [Glycine soja] Length = 363 Score = 390 bits (1001), Expect = e-130 Identities = 235/351 (66%), Positives = 248/351 (70%), Gaps = 9/351 (2%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GK+AELWS LVKGLEAFKNQQKFPAFSS 
Sbjct: 65 KKLVRAGKHAELWS--QKINQNQKQKNNNAKDDKNKGQKQALVKGLEAFKNQQKFPAFSS 122 Query: 1021 XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISA-VSXXXXX 848 EMRF+REKANQLQ+L+ Q A+ANNV+KGMGAI+A + Sbjct: 123 EEDEYYYDDEDDEEDEDEEMRFLREKANQLQMLKQQTANANNVRKGMGAIAAGANNGKTN 182 Query: 847 XXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGG-GESLNLGEAKRA 671 KDSP GLDQKTMAALK NN HLGG G +LNLGEAKRA Sbjct: 183 NGDNANSGKKGGPNHQNMGMKDSP--NGGLDQKTMAALKFNNGHLGGDGLNLNLGEAKRA 240 Query: 670 SDIGAMMNLAGFNGNGA--NVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPNGGL 506 +DIGAMMNLAGFNGN NVGSATVLGG NSNGLGGFPVQS NNMIPGS+A NGGL Sbjct: 241 NDIGAMMNLAGFNGNNCANNVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSAAAFSNGGL 300 Query: 505 ATGQYPSSLLMNMNGFNNINHPSPSPL-XXXXXMQARHAMQQQPQMMYHRS 356 + GQYPSSLLMNMNGFN NHPSPSPL QAR AMQQQPQMMYHRS Sbjct: 301 SGGQYPSSLLMNMNGFN--NHPSPSPLMMNMNMQQARQAMQQQPQMMYHRS 349 >GAU24292.1 hypothetical protein TSUD_48800 [Trifolium subterraneum] Length = 399 Score = 369 bits (946), Expect = e-121 Identities = 227/407 (55%), Positives = 247/407 (60%), Gaps = 13/407 (3%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVD+ATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDAATLI 64 Query: 1201 KKLVRSGKYAELWS-XXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFS 1025 KKLVRSGKYAELWS +VKGLEAFKNQQKFPAFS Sbjct: 65 KKLVRSGKYAELWSQKTNNNQNQKQKNNNIVKDDKNKGQKQVVVKGLEAFKNQQKFPAFS 124 Query: 1024 S---XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVSXX 857 S E R+IRE ANQ+Q++R Q DANN KK +GA Sbjct: 125 SEEDGGYYGGYGDDDDEEEEDQETRYIREAANQIQMMRQQVVDANNAKKAIGA------- 177 Query: 856 XXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAK 677 G+DQKT+AA+KLNN HL G ES+NLGE+K Sbjct: 178 -KMNNAGNVNGNSGKKGNSNQNMVGMKESANGVDQKTIAAMKLNNGHLVGNESMNLGESK 236 Query: 676 RASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSN--NMIPGSSAG-IPNGGL 506 R SDIGAMMNLAGFNGN VG+AT+LGGNSNGLGGFPVQSN NMI GSSA IPNGG Sbjct: 237 
RVSDIGAMMNLAGFNGNNNVVGNATILGGNSNGLGGFPVQSNNTNMIQGSSAATIPNGGF 296 Query: 505 ATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAM-QQQPQMMYHRSXXXXXXXXX 329 TGQ P S++MNMNGFNN PS L QARH M QQQPQMMYHRS Sbjct: 297 VTGQIPPSMMMNMNGFNN----HPSSLMNMNMQQARHVMQQQQPQMMYHRSPYVPPNTGY 352 Query: 328 XXXXXXXXXXXXXXXXXXXPVDDH----SSAAHMFSDDNTSSGCSIM 200 + + +SAAHMFSDDNT+S CSIM Sbjct: 353 YYNNYNNYIPPNATNYSSYAMPSYPTEDNSAAHMFSDDNTTSSCSIM 399 >XP_019441465.1 PREDICTED: heavy metal-associated isoprenylated plant protein 37-like [Lupinus angustifolius] OIW12890.1 hypothetical protein TanjilG_24823 [Lupinus angustifolius] Length = 402 Score = 363 bits (933), Expect = e-119 Identities = 230/411 (55%), Positives = 247/411 (60%), Gaps = 17/411 (4%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSG VD+ATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDAATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEAFKNQQKFPA 1031 KKL R+GKYA+LWS LVKGLE FKNQQKFPA Sbjct: 65 KKLARAGKYAQLWSQKSSNQNQKQNNNNNNCVKDDNKNKGQKQGLVKGLEDFKNQQKFPA 124 Query: 1030 FSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXXX 854 FSS EMRF+RE+ NQLQ+LRQ A DANN K A++ Sbjct: 125 FSSEEDDDFYDYDDDEDDDDEEMRFMRERVNQLQMLRQQAVDANNAAKNGVAVNNNGKIN 184 Query: 853 XXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGG--ESLNLGEA 680 LDQKT+AALK+NN HLGGG E LN+G++ Sbjct: 185 NNAGKKGGPNQNMVIKDNTGG----------LDQKTIAALKMNNGHLGGGGGEGLNIGDS 234 Query: 679 KRASDIGAMMNLAGFNGNGA-NVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGL- 506 KRA+DIG MMNLAGFNGNGA N GSATVLG NSNGLGGF QS NMIPGSSA IPNG Sbjct: 235 KRANDIGVMMNLAGFNGNGANNAGSATVLGPNSNGLGGFSAQS-NMIPGSSAVIPNGAFA 293 Query: 505 ATG-QYPSSLLMNMNGFNNINHPSPSPL-XXXXXMQARHAMQQQPQMMYHRSXXXXXXXX 332 ATG QYPSSLLMNMNGFN NHPSPSPL MQARHAMQQQPQMMYHRS Sbjct: 294 ATGQQYPSSLLMNMNGFN--NHPSPSPLMMNNMNMQARHAMQQQPQMMYHRSPYIPPNTG 351 Query: 331 XXXXXXXXXXXXXXXXXXXXPVD-------DHSSAAHMFSDDNTSSGCSIM 200 +SA H+FSDD T S CS+M Sbjct: 352 
YYYNHNLNNNHTPANYNYATMPSYPVGGGGSDNSATHIFSDDYTGSSCSVM 402 >KRH35766.1 hypothetical protein GLYMA_10G264000 [Glycine max] Length = 392 Score = 362 bits (928), Expect = e-119 Identities = 241/410 (58%), Positives = 253/410 (61%), Gaps = 16/410 (3%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQ KVKKLLQRIEGVYQVQIDAEQQKVTVSG VDSATLI Sbjct: 5 EDFKLLKIQ---------------KVKKLLQRIEGVYQVQIDAEQQKVTVSGCVDSATLI 49 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GK+AELWS LV+GLEAFKNQQKFPAFSS Sbjct: 50 KKLVRAGKHAELWS--QKTNQNQKQKNNNAKDDKNKGQKQALVRGLEAFKNQQKFPAFSS 107 Query: 1021 -XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVS-XXXX 851 EMRF+REKANQLQ+L+ QAA+ANN +KGMGAI+A S Sbjct: 108 EEDEYYSEYDDDDDEDEDEEMRFLREKANQLQMLKQQAANANNARKGMGAIAAGSNNGKM 167 Query: 850 XXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRA 671 KDSP LDQKTM+ALKLNN HL GGE LNLGEAKRA Sbjct: 168 NNGCNANSGKKGGPNHQNMGMKDSP--NGRLDQKTMSALKLNNGHL-GGEGLNLGEAKRA 224 Query: 670 SDIGAMMNLAGFNG-NGANVGSATVLGG--NSNGLGGFPVQS-NNMIPGSSAGIPN-GGL 506 +DIGAMMNLAGFNG NGANVGSATVLGG NSNGLGGFPVQS NNMIPGSSA N GGL Sbjct: 225 NDIGAMMNLAGFNGNNGANVGSATVLGGNNNSNGLGGFPVQSNNNMIPGSSASFSNGGGL 284 Query: 505 ATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHA-MQQQPQMMYHRSXXXXXXXXX 329 + GQYPSSLLMNMNGFN NHPSPSPL QAR A MQQQPQMMYHRS Sbjct: 285 SGGQYPSSLLMNMNGFN--NHPSPSPLMMNMQQQARQAMMQQQPQMMYHRSPFVPPNTGY 342 Query: 328 XXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200 DDH SAAHMFSDDNTSS CSIM Sbjct: 343 YYNHSSSYSPAHYSYSSYGLPGYPAAGGDDHHSAAHMFSDDNTSSSCSIM 392 >AFK47709.1 unknown [Lotus japonicus] Length = 400 Score = 359 bits (921), Expect = e-117 Identities = 232/415 (55%), Positives = 245/415 (59%), Gaps = 21/415 (5%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSA LI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSAALI 64 Query: 1201 
KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXG---LVKGLEA-FKN----Q 1046 KKL RSGK+AELWS LVKGLEA FKN Q Sbjct: 65 KKLNRSGKHAELWSQKANQNQKQKNNNNINNVKDDKNNKGQKQGLVKGLEAAFKNHQQQQ 124 Query: 1045 QKFPAFSSXXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAADA-----NNVKKGMG 881 QKFPAFSS +RFIREKANQLQLLRQ A NNVKK + Sbjct: 125 QKFPAFSSEEDDEYYDYDDEDDDDEE-LRFIREKANQLQLLRQQQQAVVDANNNVKKAIS 183 Query: 880 AISAVSXXXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGE 701 A S S DQKTMAALKLNNAHLGGGE Sbjct: 184 AASNNGHNKMNNAAGKKGGQNQNMGGMKESNVGS-------DQKTMAALKLNNAHLGGGE 236 Query: 700 SLNLGEAKRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSS-AG 524 SLNLGEAKRA+DIGAMMNLAGF NG N G+ATVLGGNSNG+GGFPVQSNNM G+S A Sbjct: 237 SLNLGEAKRANDIGAMMNLAGF--NGGNAGNATVLGGNSNGMGGFPVQSNNMFQGNSPAA 294 Query: 523 IPNGGLATGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXX 344 +PNGG Y S+LMNMNGFNN SP+ MQ RHAMQQQPQMM+HRS Sbjct: 295 VPNGG-----YAPSMLMNMNGFNN----HQSPMMNMNMMQTRHAMQQQPQMMFHRSPVIP 345 Query: 343 XXXXXXXXXXXXXXXXXXXXXXXXPV-------DDHSSAAHMFSDDNTSSGCSIM 200 P DH SAAHMFSDDNT+S CS+M Sbjct: 346 PNTGYYFNHNNYNPAANYSYYASLPSYPGGDYDHDHHSAAHMFSDDNTTSSCSVM 400 >XP_002284132.1 PREDICTED: heavy metal-associated isoprenylated plant protein 37 [Vitis vinifera] Length = 390 Score = 335 bits (858), Expect = e-108 Identities = 211/397 (53%), Positives = 238/397 (59%), Gaps = 3/397 (0%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVY V IDAEQQ+VTVSGSVDS TLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYTVNIDAEQQRVTVSGSVDSGTLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLV++GK+AELWS GL+KGLEAFK QQKFP FSS Sbjct: 65 KKLVKAGKHAELWS-QKSNQNQKQKTNCIKDDKNNKGQKQGLIKGLEAFKTQQKFPVFSS 123 Query: 1021 XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLR-QAADANNVKKGMGAISAVSXXXXXX 845 E+RF++EKANQL LLR QA DA+N KKG GAI+A + Sbjct: 124 -EEDEDDFDDDEEDYEEEELRFLQEKANQLSLLRQQALDASNAKKGFGAIAASNNGKINN 182 Query: 844 
XXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRASD 665 K SP G+DQKT+AALK+NN HL GG ++N GE KR +D Sbjct: 183 NVGNGNVQKKGNPNQNMGMKGSP---GGIDQKTIAALKMNNPHLVGGGNINSGEVKRGND 239 Query: 664 IGAMMNLAGFNGNGANV-GSATVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLATG-QY 491 I +MM L GF+GNG NV +A LGGNSN LGGF +Q NN GSS G PNGG ATG + Sbjct: 240 INSMMGLGGFHGNGGNVAATAAALGGNSNALGGFQIQPNNGFQGSSTGFPNGGFATGHHH 299 Query: 490 PSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXXX 311 PS +LMN+NG N NHPS + Q RHA QQPQMMYHRS Sbjct: 300 PSPMLMNLNG-NQYNHPS-QMMMNMNMQQNRHAPMQQPQMMYHRSPFIPPSTGYYYNYSP 357 Query: 310 XXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 DH SA+HMFSD+NTSS CSIM Sbjct: 358 ALSPYTHCDTNYS--GDH-SASHMFSDENTSS-CSIM 390 >XP_018843651.1 PREDICTED: neurogenic protein mastermind-like [Juglans regia] Length = 378 Score = 331 bits (848), Expect = e-107 Identities = 214/397 (53%), Positives = 236/397 (59%), Gaps = 3/397 (0%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCK KVKKLLQRIEGVY V IDAEQQKVTVSGSVD++TLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKHKVKKLLQRIEGVYLVNIDAEQQKVTVSGSVDASTLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GK+AE WS L KGLEAFKNQQKFPAFSS Sbjct: 65 KKLVRAGKHAEPWS-QKNTQNQKQMNNCVKDAKNNKSQKPLLFKGLEAFKNQQKFPAFSS 123 Query: 1021 XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQ-AADANNVKKGMGAISAVSXXXXXX 845 E+RFIREKANQL LLRQ A DANN KKG+ AI A S Sbjct: 124 -EEEDDYFDDVEEDEEEDELRFIREKANQLNLLRQRAIDANNAKKGVAAIGAAS-NNGKM 181 Query: 844 XXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRASD 665 G+D KT+AALK+N+AHLGGG ++N GE +R SD Sbjct: 182 NNVGNGNIGNGKKANTEQNMGMRASPAGIDPKTLAALKINSAHLGGG-NVNAGEGRRVSD 240 Query: 664 IGAMMNLAGFNGNGANVGS--ATVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLATGQY 491 + MM LAGF+GNG NV S A LGGNSNGLGGF GSSAG P GG ATGQY Sbjct: 241 LNGMMGLAGFHGNGLNVASAGAAALGGNSNGLGGF--------QGSSAGFPTGGYATGQY 292 Query: 490 PSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXXX 311 PSS+LMNMNG NHPSP + 
MQAR+AMQQQPQMMYHRS Sbjct: 293 PSSMLMNMNGH---NHPSPMMM----NMQARNAMQQQPQMMYHRSPHVPPTTGYYYNYSP 345 Query: 310 XXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 +SAA+MFSD+NT+S CSIM Sbjct: 346 SPNPYSYTDPNH---TGRNSAAYMFSDENTNS-CSIM 378 >XP_007045083.1 PREDICTED: myb-like protein I [Theobroma cacao] XP_007045084.1 PREDICTED: myb-like protein I [Theobroma cacao] XP_007045086.1 PREDICTED: myb-like protein I [Theobroma cacao] EOY00915.1 Heavy metal transport/detoxification superfamily protein isoform 1 [Theobroma cacao] EOY00916.1 Heavy metal transport/detoxification superfamily protein isoform 1 [Theobroma cacao] EOY00917.1 Heavy metal transport/detoxification superfamily protein isoform 1 [Theobroma cacao] EOY00918.1 Heavy metal transport/detoxification superfamily protein isoform 1 [Theobroma cacao] Length = 392 Score = 327 bits (839), Expect = e-105 Identities = 207/400 (51%), Positives = 233/400 (58%), Gaps = 6/400 (1%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQV IDAEQQKVTVSGSVDSATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVSIDAEQQKVTVSGSVDSATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GK+AE+WS GL+KGLEAFK QQKFP+F S Sbjct: 65 KKLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKFPSFVS 123 Query: 1021 XXXXXXXXXXXXXXXXXXEMRFIRE----KANQLQLLR-QAADANNVKKGMGAISAVSXX 857 E++F++ + QL LLR QA DANN K G+G I+A S Sbjct: 124 -EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNITATS-N 181 Query: 856 XXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAK 677 LDQKT+AALK+NNA L GG ++N E K Sbjct: 182 NNNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNINAAEGK 240 Query: 676 RASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PNGGLAT 500 R DI +M L+GF+GNGANV A LGGN N +GGF VQSNN + GSSA I NGG T Sbjct: 241 RGHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQNGGYVT 300 Query: 499 GQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXX 320 GQ PSS+LMNMNG+N 
PS + +Q RHAMQQQPQMMYHRS Sbjct: 301 GQNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPSTGYYYN 355 Query: 319 XXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 DHS+A HMFSDDNTSS CSIM Sbjct: 356 YGPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 392 >GAV65585.1 HMA domain-containing protein [Cephalotus follicularis] Length = 383 Score = 327 bits (837), Expect = e-105 Identities = 204/397 (51%), Positives = 234/397 (58%), Gaps = 3/397 (0%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATLI 1202 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGV+QV IDAEQQ+VT+SGSVDSATLI Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGVFQVNIDAEQQRVTISGSVDSATLI 64 Query: 1201 KKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFSS 1022 KKLVR+GK+AELWS L+KGLE+ KNQQKFPAFSS Sbjct: 65 KKLVRAGKHAELWSQKSNQNQKQKNNCIKEDKNNESQKQG-LIKGLESLKNQQKFPAFSS 123 Query: 1021 XXXXXXXXXXXXXXXXXXEMRFIREKANQLQLLRQAA--DANNVKKGMGAISAVSXXXXX 848 ++RF+ + +QL LLRQ A +ANN KKG GAI+A + Sbjct: 124 EEDDDYLDDDEDDDEEVEQLRFLEKANHQLGLLRQQAAIEANNAKKG-GAIAAAAANNGK 182 Query: 847 XXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEAKRAS 668 GLDQKTMAALK+NNAHLGGG ++N GE KR + Sbjct: 183 MNTSVGNLNTGKKGNPNQNTGIK-VNPGGLDQKTMAALKMNNAHLGGG-NINTGEVKRGN 240 Query: 667 DIGAMMNLAGFNGNGANVGSA-TVLGGNSNGLGGFPVQSNNMIPGSSAGIPNGGLATGQY 491 D+ MM L GF+GNGAN+G+A T LGGN+NGLGG VQ N S AG PNGG ATGQY Sbjct: 241 DLSTMMGLTGFHGNGANIGNAATALGGNANGLGGIQVQPNGYQGSSGAGFPNGGYATGQY 300 Query: 490 PSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXXXXXX 311 PS++LMNMNG+N+ P MQ RH QPQMMYHRS Sbjct: 301 PSAMLMNMNGYNH-------PASMMMNMQNRH---PQPQMMYHRSPYIPASTGYYHNYSP 350 Query: 310 XXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 HS+A HMFSD+NTSS CSIM Sbjct: 351 SPYSYTEQPNHRI---GHSAATHMFSDENTSS-CSIM 383 >EOY00920.1 Heavy metal transport/detoxification superfamily protein isoform 6 [Theobroma cacao] Length = 393 Score = 323 bits (827), Expect = e-103 Identities = 207/401 (51%), Positives = 233/401 (58%), Gaps = 7/401 (1%) Frame = -3 Query: 1381 
EDFKLLKIQ-TCVLKVNIHCDGCKQKVKKLLQRIEGVYQVQIDAEQQKVTVSGSVDSATL 1205 EDFKLLKIQ TCVLKVNIHCDGCKQKVKKLLQRIEGVYQV IDAEQQKVTVSGSVDSATL Sbjct: 5 EDFKLLKIQQTCVLKVNIHCDGCKQKVKKLLQRIEGVYQVSIDAEQQKVTVSGSVDSATL 64 Query: 1204 IKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFS 1025 IKKLVR+GK+AE+WS GL+KGLEAFK QQKFP+F Sbjct: 65 IKKLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKFPSFV 123 Query: 1024 SXXXXXXXXXXXXXXXXXXEMRFIRE----KANQLQLLR-QAADANNVKKGMGAISAVSX 860 S E++F++ + QL LLR QA DANN K G+G I+A S Sbjct: 124 S-EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNITATS- 181 Query: 859 XXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEA 680 LDQKT+AALK+NNA L GG ++N E Sbjct: 182 NNNNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNINAAEG 240 Query: 679 KRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PNGGLA 503 KR DI +M L+GF+GNGANV A LGGN N +GGF VQSNN + GSSA I NGG Sbjct: 241 KRGHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQNGGYV 300 Query: 502 TGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXX 323 TGQ PSS+LMNMNG+N PS + +Q RHAMQQQPQMMYHRS Sbjct: 301 TGQNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPSTGYYY 355 Query: 322 XXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 DHS+A HMFSDDNTSS CSIM Sbjct: 356 NYGPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 393 >EOY00919.1 Heavy metal transport/detoxification superfamily protein isoform 5 [Theobroma cacao] Length = 393 Score = 323 bits (827), Expect = e-103 Identities = 207/401 (51%), Positives = 233/401 (58%), Gaps = 7/401 (1%) Frame = -3 Query: 1381 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIE-GVYQVQIDAEQQKVTVSGSVDSATL 1205 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIE GVYQV IDAEQQKVTVSGSVDSATL Sbjct: 5 EDFKLLKIQTCVLKVNIHCDGCKQKVKKLLQRIEGGVYQVSIDAEQQKVTVSGSVDSATL 64 Query: 1204 IKKLVRSGKYAELWSXXXXXXXXXXXXXXXXXXXXXXXXXXGLVKGLEAFKNQQKFPAFS 1025 IKKLVR+GK+AE+WS GL+KGLEAFK QQKFP+F Sbjct: 65 IKKLVRAGKHAEVWS-QKSNQNQKPKNNCIKDDKNNKGPKQGLIKGLEAFKTQQKFPSFV 123 Query: 1024 SXXXXXXXXXXXXXXXXXXEMRFIRE----KANQLQLLR-QAADANNVKKGMGAISAVSX 
860 S E++F++ + QL LLR QA DANN K G+G I+A S Sbjct: 124 S-EEDDDYMDDYDEENEEDELQFLKPSQLGQLGQLGLLRQQALDANNAKNGIGNITATS- 181 Query: 859 XXXXXXXXXXXXXXXXXXXXXXXXKDSPXXXXGLDQKTMAALKLNNAHLGGGESLNLGEA 680 LDQKT+AALK+NNA L GG ++N E Sbjct: 182 NNNNKMNYNLINVNDGKKGNQNQNMGMKVNPGVLDQKTLAALKMNNAQL-GGLNINAAEG 240 Query: 679 KRASDIGAMMNLAGFNGNGANVGSATVLGGNSNGLGGFPVQSNNMIPGSSAGI-PNGGLA 503 KR DI +M L+GF+GNGANV A LGGN N +GGF VQSNN + GSSA I NGG Sbjct: 241 KRGHDINPIMGLSGFHGNGANVADAAALGGNPNAVGGFQVQSNNGLQGSSAAIFQNGGYV 300 Query: 502 TGQYPSSLLMNMNGFNNINHPSPSPLXXXXXMQARHAMQQQPQMMYHRSXXXXXXXXXXX 323 TGQ PSS+LMNMNG+N PS + +Q RHAMQQQPQMMYHRS Sbjct: 301 TGQNPSSVLMNMNGYN-----YPSSMMNMMNLQNRHAMQQQPQMMYHRSPVIPPSTGYYY 355 Query: 322 XXXXXXXXXXXXXXXXXPVDDHSSAAHMFSDDNTSSGCSIM 200 DHS+A HMFSDDNTSS CSIM Sbjct: 356 NYGPPPYSYPEAPSYNA---DHSAATHMFSDDNTSSSCSIM 393