BLASTX nr result
ID: Mentha22_contig00033886
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00033886
         (430 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                       Score    E
Sequences producing significant alignments:                            (bits) Value

gb|EXC34665.1| hypothetical protein L484_020433 [Morus notabilis]         86   4e-15
emb|CBI32285.3| unnamed protein product [Vitis vinifera]                  85   9e-15
ref|XP_007020458.1| Sequence-specific DNA binding,sequence-speci...       85   1e-14
ref|XP_007020457.1| NDX1 homeobox protein, putative isoform 2 [T...       85   1e-14
ref|XP_007020456.1| Sequence-specific DNA binding,sequence-speci...       85   1e-14
ref|XP_002520708.1| conserved hypothetical protein [Ricinus comm...       84   2e-14
ref|XP_002320379.1| hypothetical protein POPTR_0014s13140g [Popu...       83   3e-14
gb|EYU46643.1| hypothetical protein MIMGU_mgv1a001710mg [Mimulus...       79   8e-13
emb|CAN67843.1| hypothetical protein VITISV_016666 [Vitis vinifera]       79   8e-13
gb|EMT17752.1| hypothetical protein F775_26868 [Aegilops tauschii]        75   7e-12
dbj|BAK07519.1| predicted protein [Hordeum vulgare subsp. vulgare]        74   2e-11
gb|EMS55397.1| hypothetical protein TRIUR3_27485 [Triticum urartu]        74   3e-11
ref|XP_006850717.1| hypothetical protein AMTR_s00025p00031700 [A...       72   1e-10
ref|XP_006479839.1| PREDICTED: uncharacterized protein LOC102620...       71   2e-10
ref|XP_006479838.1| PREDICTED: uncharacterized protein LOC102620...       71   2e-10
ref|XP_006479836.1| PREDICTED: uncharacterized protein LOC102620...       71   2e-10
ref|XP_006444197.1| hypothetical protein CICLE_v10018730mg [Citr...       71   2e-10
ref|XP_006366379.1| PREDICTED: uncharacterized protein LOC102594...       69   9e-10
ref|XP_006654130.1| PREDICTED: uncharacterized protein LOC102716...
68 1e-09 ref|XP_004247476.1| PREDICTED: uncharacterized protein LOC101264... 68 1e-09 >gb|EXC34665.1| hypothetical protein L484_020433 [Morus notabilis] Length = 965 Score = 86.3 bits (212), Expect = 4e-15 Identities = 45/94 (47%), Positives = 61/94 (64%) Frame = +3 Query: 102 PTRTLSFEPGQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIE 281 P+ + EPGQ V++VD G E+AKG V QV G W NLD+ RTCVVDVK+ LK+ Sbjct: 869 PSEFVQCEPGQQVVIVDAAGEEIAKGKVFQVHGKWYGKNLDELRTCVVDVKD--LKVKRG 926 Query: 282 ARVQHPLELDNCYSFYEAKERFGSIRVMWDSNKL 383 R+ HP + SF EA+ + G +RV+WDS+K+ Sbjct: 927 TRLPHP-SVATGGSFEEAETKIGVMRVLWDSSKI 959 >emb|CBI32285.3| unnamed protein product [Vitis vinifera] Length = 878 Score = 85.1 bits (209), Expect = 9e-15 Identities = 45/88 (51%), Positives = 60/88 (68%) Frame = +3 Query: 126 PGQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLE 305 PGQYV+L+DG G+++ KG V QV+G W NL++S+TCVVDV E LK +R+ HP E Sbjct: 790 PGQYVVLLDGQGDDIGKGKVHQVQGKWYGKNLEESQTCVVDVME--LKAERWSRLPHPSE 847 Query: 306 LDNCYSFYEAKERFGSIRVMWDSNKLSL 389 SF EA+ + G +RV WDSNKL + Sbjct: 848 TTGT-SFDEAETKLGVMRVSWDSNKLCI 874 >ref|XP_007020458.1| Sequence-specific DNA binding,sequence-specific DNA binding transcription factors, putative isoform 3 [Theobroma cacao] gi|508720086|gb|EOY11983.1| Sequence-specific DNA binding,sequence-specific DNA binding transcription factors, putative isoform 3 [Theobroma cacao] Length = 874 Score = 84.7 bits (208), Expect = 1e-14 Identities = 44/89 (49%), Positives = 60/89 (67%) Frame = +3 Query: 123 EPGQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPL 302 +PGQ+V+LVDG G E+ KG V QV+G WC +L++S TCVVD + LK ++ +P Sbjct: 785 KPGQFVVLVDGRGEEIGKGKVHQVQGKWCGKSLEESGTCVVDAVD--LKADKWVKLPYPS 842 Query: 303 ELDNCYSFYEAKERFGSIRVMWDSNKLSL 389 E SF EA+ +FG +RVMWDSNK+ L Sbjct: 843 EATGT-SFEEAETKFGVMRVMWDSNKIFL 870 >ref|XP_007020457.1| NDX1 homeobox protein, putative isoform 2 [Theobroma cacao] gi|508720085|gb|EOY11982.1| NDX1 homeobox protein, putative isoform 2 [Theobroma cacao] Length = 926 
Score = 84.7 bits (208), Expect = 1e-14 Identities = 44/89 (49%), Positives = 60/89 (67%) Frame = +3 Query: 123 EPGQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPL 302 +PGQ+V+LVDG G E+ KG V QV+G WC +L++S TCVVD + LK ++ +P Sbjct: 837 KPGQFVVLVDGRGEEIGKGKVHQVQGKWCGKSLEESGTCVVDAVD--LKADKWVKLPYPS 894 Query: 303 ELDNCYSFYEAKERFGSIRVMWDSNKLSL 389 E SF EA+ +FG +RVMWDSNK+ L Sbjct: 895 EATGT-SFEEAETKFGVMRVMWDSNKIFL 922 >ref|XP_007020456.1| Sequence-specific DNA binding,sequence-specific DNA binding transcription factors, putative isoform 1 [Theobroma cacao] gi|508720084|gb|EOY11981.1| Sequence-specific DNA binding,sequence-specific DNA binding transcription factors, putative isoform 1 [Theobroma cacao] Length = 1035 Score = 84.7 bits (208), Expect = 1e-14 Identities = 44/89 (49%), Positives = 60/89 (67%) Frame = +3 Query: 123 EPGQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPL 302 +PGQ+V+LVDG G E+ KG V QV+G WC +L++S TCVVD + LK ++ +P Sbjct: 946 KPGQFVVLVDGRGEEIGKGKVHQVQGKWCGKSLEESGTCVVDAVD--LKADKWVKLPYPS 1003 Query: 303 ELDNCYSFYEAKERFGSIRVMWDSNKLSL 389 E SF EA+ +FG +RVMWDSNK+ L Sbjct: 1004 EATGT-SFEEAETKFGVMRVMWDSNKIFL 1031 >ref|XP_002520708.1| conserved hypothetical protein [Ricinus communis] gi|223540093|gb|EEF41670.1| conserved hypothetical protein [Ricinus communis] Length = 957 Score = 84.3 bits (207), Expect = 2e-14 Identities = 51/127 (40%), Positives = 72/127 (56%) Frame = +3 Query: 3 SLAARGSAENEATNIEVTASVDEEDMGTSRRNNPTRTLSFEPGQYVLLVDGIGNEVAKGS 182 S A GSAEN ++ +D ++ + +PGQYV+LVD G+E+ KG Sbjct: 839 STARIGSAENAEISLAQFFGIDAAEL-----------VQCKPGQYVVLVDKQGDEIGKGK 887 Query: 183 VVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLELDNCYSFYEAKERFGSIRV 362 V QV+G W +L++S TCVVDV E LK R+ +P E SF EA+ + G +RV Sbjct: 888 VYQVQGKWYGKSLEESETCVVDVTE--LKAERWVRLPYPSEATGT-SFSEAETKLGVMRV 944 Query: 363 MWDSNKL 383 +WDSNK+ Sbjct: 945 LWDSNKI 951 >ref|XP_002320379.1| hypothetical protein POPTR_0014s13140g [Populus trichocarpa] gi|222861152|gb|EEE98694.1| hypothetical protein 
POPTR_0014s13140g [Populus trichocarpa] Length = 1326 Score = 83.2 bits (204), Expect = 3e-14 Identities = 47/101 (46%), Positives = 63/101 (62%) Frame = +3 Query: 123 EPGQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPL 302 +PGQ+V+LVDG G E+ KG V QV+G W L++S CVVDV E LK R+ +P Sbjct: 1079 KPGQFVVLVDGQGEEIGKGKVYQVQGKWYGRILEESEMCVVDVTE--LKTEKWVRLPYPS 1136 Query: 303 ELDNCYSFYEAKERFGSIRVMWDSNKLSLYFEPGEHIILTG 425 E SFYEA+++ G +RV+WDSNK+ + H LTG Sbjct: 1137 ETTG-MSFYEAEQKIGVMRVLWDSNKIYISI----HKSLTG 1172 >gb|EYU46643.1| hypothetical protein MIMGU_mgv1a001710mg [Mimulus guttatus] Length = 770 Score = 78.6 bits (192), Expect = 8e-13 Identities = 44/101 (43%), Positives = 58/101 (57%) Frame = +3 Query: 81 GTSRRNNPTRTLSFEPGQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEF 260 G+ +P T FE GQYV+LV + K V Q+ GNWC +LD S CVVD+ E Sbjct: 669 GSGNLESPLNT-DFEAGQYVILVGEKAETIGKAKVFQIGGNWCSSDLDVSGLCVVDIME- 726 Query: 261 SLKIPIEARVQHPLELDNCYSFYEAKERFGSIRVMWDSNKL 383 L I A++ HP++ YSF +AK R G + V+WD NKL Sbjct: 727 -LLIDRYAQLPHPVDATG-YSFDQAKRRLGRMLVLWDLNKL 765 >emb|CAN67843.1| hypothetical protein VITISV_016666 [Vitis vinifera] Length = 1134 Score = 78.6 bits (192), Expect = 8e-13 Identities = 52/133 (39%), Positives = 72/133 (54%), Gaps = 8/133 (6%) Frame = +3 Query: 12 ARGSAENEATNIEVT-ASVDEEDMGTSR--RNNPTRTLSFEPGQYVLLVDGIGNEVAKGS 182 ARG A V+ A D + T+ NP + EPGQYV+L+DG G+++ KG Sbjct: 926 ARGGTHQSAIGGSVSRAGADNAEAATAEFVDINPAEFVRREPGQYVVLLDGQGDDIGKGK 985 Query: 183 VVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLELDNCYSFYEAKERFGSI-- 356 V QV+G W NL++S+TCVVDV E LK +R+ HP E SF EA+ + G I Sbjct: 986 VHQVQGKWYGKNLEESQTCVVDVME--LKAERWSRLPHPSETTGT-SFDEAETKLGEILP 1042 Query: 357 ---RVMWDSNKLS 386 + W+S+ S Sbjct: 1043 STCLISWESDNXS 1055 >gb|EMT17752.1| hypothetical protein F775_26868 [Aegilops tauschii] Length = 779 Score = 75.5 bits (184), Expect = 7e-12 Identities = 52/131 (39%), Positives = 75/131 (57%), Gaps = 4/131 (3%) Frame = +3 Query: 6 LAARGSAENEATNIEVTASVDE---EDMGTSRRNNPTRTLSFEPGQYVLLVDGIGNEVAK 176 L 
A G + ++ ++ VT E +DM TSR TR+LSFEPG+ VLL+D GNE+ + Sbjct: 650 LNALGLSNSKGSSRLVTPDSSEPSTQDMMTSRPF--TRSLSFEPGRPVLLIDNEGNEIGR 707 Query: 177 GSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLELDNCYSFYEAKERFGSI 356 G + QV G +L +S TC++DV E LK+ + HP E +F EA+ R G + Sbjct: 708 GEIFQVDGRAQGKSLAESHTCIIDVTE--LKVEKWRELPHPSEASG-RTFQEAESRHGGV 764 Query: 357 -RVMWDSNKLS 386 RV WD +L+ Sbjct: 765 MRVAWDVVRLA 775 >dbj|BAK07519.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 915 Score = 73.9 bits (180), Expect = 2e-11 Identities = 52/131 (39%), Positives = 76/131 (58%), Gaps = 4/131 (3%) Frame = +3 Query: 6 LAARGSAENEATNIEVTASVDE---EDMGTSRRNNPTRTLSFEPGQYVLLVDGIGNEVAK 176 L A G + ++ ++ +T E +DM TSR TR+LSFEPG+ VLL+D GNEV + Sbjct: 786 LNALGLSNSKGSSRLLTPDSSEPSTQDMTTSRPF--TRSLSFEPGRPVLLIDNEGNEVGR 843 Query: 177 GSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLELDNCYSFYEAKERFGSI 356 G + QV+G +L +S C+VDV E LK+ + + HP E +F EA+ R G + Sbjct: 844 GEIFQVEGRAQGKSLAESYICIVDVTE--LKVEKWSELPHPSEASG-RAFLEAESRHGGV 900 Query: 357 -RVMWDSNKLS 386 RV WD +L+ Sbjct: 901 MRVAWDVVRLA 911 >gb|EMS55397.1| hypothetical protein TRIUR3_27485 [Triticum urartu] Length = 956 Score = 73.6 bits (179), Expect = 3e-11 Identities = 51/131 (38%), Positives = 74/131 (56%), Gaps = 4/131 (3%) Frame = +3 Query: 6 LAARGSAENEATNIEVTASVDE---EDMGTSRRNNPTRTLSFEPGQYVLLVDGIGNEVAK 176 L A G + ++ ++ VT E +DM TSR TR+LSFEPG+ VLL+D GNE+ + Sbjct: 827 LNALGLSNSKGSSRLVTPDSSEPSTQDMMTSRPF--TRSLSFEPGRPVLLIDNEGNEIGR 884 Query: 177 GSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLELDNCYSFYEAKERFGSI 356 G + QV G +L +S C++DV E LK+ + HP E +F EA+ R G + Sbjct: 885 GEIFQVDGRAQGKSLAESHVCIIDVTE--LKVEKWRELPHPSEASG-RTFQEAESRHGGV 941 Query: 357 -RVMWDSNKLS 386 RV WD +L+ Sbjct: 942 MRVAWDVVRLA 952 >ref|XP_006850717.1| hypothetical protein AMTR_s00025p00031700 [Amborella trichopoda] gi|548854388|gb|ERN12298.1| hypothetical protein AMTR_s00025p00031700 [Amborella trichopoda] Length = 1048 Score = 71.6 bits (174), Expect = 1e-10 Identities = 39/92 (42%), Positives = 
55/92 (59%) Frame = +3 Query: 108 RTLSFEPGQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEAR 287 R L FE GQ V L D G EV +G + Q++G W NL +S C+V+V E LK+ + R Sbjct: 956 RYLRFEAGQCVSLTDDDGKEVCRGRICQMEGRWYGKNLVESGLCIVEVNE--LKVDRQTR 1013 Query: 288 VQHPLELDNCYSFYEAKERFGSIRVMWDSNKL 383 +QHP E +F EA+ + G ++V WD NK+ Sbjct: 1014 LQHPSEAGGS-TFDEAELKTGKMKVAWDVNKI 1044 >ref|XP_006479839.1| PREDICTED: uncharacterized protein LOC102620367 isoform X4 [Citrus sinensis] Length = 932 Score = 70.9 bits (172), Expect = 2e-10 Identities = 40/85 (47%), Positives = 52/85 (61%) Frame = +3 Query: 129 GQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLEL 308 GQ V+L+DG G E+ G V QV G W NL++S TC VDV E LK A + HP E Sbjct: 843 GQLVVLLDGQGEEIGSGRVHQVYGKWTGRNLEESGTCAVDVVE--LKAERWAPLPHPSEA 900 Query: 309 DNCYSFYEAKERFGSIRVMWDSNKL 383 SF EA+ + G +RV+WD+NK+ Sbjct: 901 AGS-SFGEAEAKLGVMRVLWDTNKM 924 >ref|XP_006479838.1| PREDICTED: uncharacterized protein LOC102620367 isoform X3 [Citrus sinensis] Length = 954 Score = 70.9 bits (172), Expect = 2e-10 Identities = 40/85 (47%), Positives = 52/85 (61%) Frame = +3 Query: 129 GQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLEL 308 GQ V+L+DG G E+ G V QV G W NL++S TC VDV E LK A + HP E Sbjct: 865 GQLVVLLDGQGEEIGSGRVHQVYGKWTGRNLEESGTCAVDVVE--LKAERWAPLPHPSEA 922 Query: 309 DNCYSFYEAKERFGSIRVMWDSNKL 383 SF EA+ + G +RV+WD+NK+ Sbjct: 923 AGS-SFGEAEAKLGVMRVLWDTNKM 946 >ref|XP_006479836.1| PREDICTED: uncharacterized protein LOC102620367 isoform X1 [Citrus sinensis] gi|568852343|ref|XP_006479837.1| PREDICTED: uncharacterized protein LOC102620367 isoform X2 [Citrus sinensis] Length = 957 Score = 70.9 bits (172), Expect = 2e-10 Identities = 40/85 (47%), Positives = 52/85 (61%) Frame = +3 Query: 129 GQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLEL 308 GQ V+L+DG G E+ G V QV G W NL++S TC VDV E LK A + HP E Sbjct: 868 GQLVVLLDGQGEEIGSGRVHQVYGKWTGRNLEESGTCAVDVVE--LKAERWAPLPHPSEA 925 Query: 309 DNCYSFYEAKERFGSIRVMWDSNKL 383 SF EA+ + G +RV+WD+NK+ Sbjct: 
926 AGS-SFGEAEAKLGVMRVLWDTNKM 949 >ref|XP_006444197.1| hypothetical protein CICLE_v10018730mg [Citrus clementina] gi|567903420|ref|XP_006444198.1| hypothetical protein CICLE_v10018730mg [Citrus clementina] gi|567903422|ref|XP_006444199.1| hypothetical protein CICLE_v10018730mg [Citrus clementina] gi|557546459|gb|ESR57437.1| hypothetical protein CICLE_v10018730mg [Citrus clementina] gi|557546460|gb|ESR57438.1| hypothetical protein CICLE_v10018730mg [Citrus clementina] gi|557546461|gb|ESR57439.1| hypothetical protein CICLE_v10018730mg [Citrus clementina] Length = 957 Score = 70.9 bits (172), Expect = 2e-10 Identities = 40/85 (47%), Positives = 52/85 (61%) Frame = +3 Query: 129 GQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLEL 308 GQ V+L+DG G E+ G V QV G W NL++S TC VDV E LK A + HP E Sbjct: 868 GQLVVLLDGQGEEIGSGRVHQVYGKWTGRNLEESGTCAVDVVE--LKAERWAPLPHPSEA 925 Query: 309 DNCYSFYEAKERFGSIRVMWDSNKL 383 SF EA+ + G +RV+WD+NK+ Sbjct: 926 AGS-SFGEAEAKLGVMRVLWDTNKM 949 >ref|XP_006366379.1| PREDICTED: uncharacterized protein LOC102594863 [Solanum tuberosum] Length = 934 Score = 68.6 bits (166), Expect = 9e-10 Identities = 35/85 (41%), Positives = 54/85 (63%) Frame = +3 Query: 129 GQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLEL 308 G YV+L++ E+ +G V QV G W + +L++ TCVVDV LK+ A++ +P EL Sbjct: 847 GDYVVLINEKAEEIGRGKVCQVSGKWYQRDLEELGTCVVDV--IDLKVERSAKLPYPSEL 904 Query: 309 DNCYSFYEAKERFGSIRVMWDSNKL 383 SF +A+ +FG +RV+W S+KL Sbjct: 905 TGT-SFDQAERKFGFMRVLWQSSKL 928 >ref|XP_006654130.1| PREDICTED: uncharacterized protein LOC102716870 [Oryza brachyantha] Length = 933 Score = 68.2 bits (165), Expect = 1e-09 Identities = 43/95 (45%), Positives = 54/95 (56%), Gaps = 1/95 (1%) Frame = +3 Query: 105 TRTLSFEPGQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEA 284 TR+ SFEPG+ V L+D G EV +G + QV+G L SR CVVDV E LKI Sbjct: 838 TRSFSFEPGRLVSLIDNDGKEVGRGKIFQVEGRIQGKGLQDSRVCVVDVIE--LKIEKWR 895 Query: 285 RVQHPLELDNCYSFYEAKERFGSI-RVMWDSNKLS 386 + HP E +F EA+ R G + RV WD +LS 
Sbjct: 896 ELPHPTEASG-RTFQEAESRNGGVMRVAWDVIRLS 929 >ref|XP_004247476.1| PREDICTED: uncharacterized protein LOC101264065 [Solanum lycopersicum] Length = 934 Score = 68.2 bits (165), Expect = 1e-09 Identities = 34/85 (40%), Positives = 54/85 (63%) Frame = +3 Query: 129 GQYVLLVDGIGNEVAKGSVVQVKGNWCRYNLDQSRTCVVDVKEFSLKIPIEARVQHPLEL 308 G YV+L++ E+ +G V QV G W + +L++ TCVVD+ LK+ A++ +P EL Sbjct: 847 GDYVVLINEKAEEIGRGKVCQVSGKWYQRDLEELGTCVVDI--IDLKVERSAKLPYPSEL 904 Query: 309 DNCYSFYEAKERFGSIRVMWDSNKL 383 SF +A+ +FG +RV+W S+KL Sbjct: 905 TGT-SFDQAERKFGFMRVLWQSSKL 928