BLASTX nr result
ID: Mentha26_contig00016637
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00016637
         (472 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|EYU18323.1| hypothetical protein MIMGU_mgv1a0010871mg, partia...  108   8e-22
dbj|BAL63045.1| peptidyl serine alpha-galactosyltransferase [Nic...   81   2e-13
ref|XP_006344223.1| PREDICTED: uncharacterized protein LOC102606...   80   4e-13
ref|XP_004238851.1| PREDICTED: uncharacterized protein LOC101257...   77   3e-12
ref|XP_002271170.1| PREDICTED: uncharacterized protein LOC100242...   76   5e-12
ref|XP_002298591.2| hypothetical protein POPTR_0001s36250g [Popu...   69   9e-10
ref|XP_004304697.1| PREDICTED: uncharacterized protein LOC101294...   67   3e-09
ref|XP_007031710.1| F28J7.5 protein isoform 1 [Theobroma cacao] ...   64   2e-08
ref|XP_004173585.1| PREDICTED: uncharacterized LOC101221472, par...   62   8e-08
ref|XP_004145689.1| PREDICTED: uncharacterized protein LOC101221...   62   8e-08
ref|XP_007217047.1| hypothetical protein PRUPE_ppa001424mg [Prun...   62   1e-07
ref|XP_002526934.1| conserved hypothetical protein [Ricinus comm...   61   2e-07
gb|EXC31392.1| hypothetical protein L484_017674 [Morus notabilis]     60   2e-07
ref|NP_566148.2| uncharacterized protein [Arabidopsis thaliana] ...   58   2e-06
gb|AAF01555.1|AC009325_25 unknown protein [Arabidopsis thaliana]...
58 2e-06 >gb|EYU18323.1| hypothetical protein MIMGU_mgv1a0010871mg, partial [Mimulus guttatus] Length = 883 Score = 108 bits (270), Expect = 8e-22 Identities = 66/156 (42%), Positives = 92/156 (58%), Gaps = 3/156 (1%) Frame = -1 Query: 466 FQRDLLSIECARSLHEAIQN-YHQKKCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXXXX 290 FQRDLLSIEC ++L+EA+Q+ Y ++KC + N+LS P R+ + P +L P Sbjct: 732 FQRDLLSIECGKALNEALQSHYERRKCPDPNTLSNPVRE-QAKPAPNPPSLSPPIRKKTP 790 Query: 289 XXXXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTETENEAKEFKRDSEAKSENKEQSPP 110 PDR + E+T+ RK+GK++ +D D T D K+E++E +PP Sbjct: 791 EITSLPPPDRGPLNEITASRKIGKIDDVD----DALTHE-------DVAVKNESREITPP 839 Query: 109 AETN--QTFTSMRGWILGLWAFSIVGFLVVMYVMIS 8 ETN QTF MR WI+GLW FSI+ F VVM +MIS Sbjct: 840 IETNENQTFGFMRFWIIGLWGFSILSFFVVMAMMIS 875 >dbj|BAL63045.1| peptidyl serine alpha-galactosyltransferase [Nicotiana tabacum] Length = 898 Score = 80.9 bits (198), Expect = 2e-13 Identities = 57/182 (31%), Positives = 85/182 (46%), Gaps = 28/182 (15%) Frame = -1 Query: 463 QRDLLSIECARSLHEAIQNYHQ-KKCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXXXXX 287 QRDLLSIECA +L+EA++ +H+ +KC + NS+S ++DT + T + +A D Sbjct: 680 QRDLLSIECATTLNEALRIHHERRKCPDPNSISTTNQDTAN-ETRTNAETRANDDESRTN 738 Query: 286 XXXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTET------------------------ 179 + D T D + D ET Sbjct: 739 AETRTNDDESRTNAETRTNDDETRTNDDETRIDAETRTDAETRTSAEARMAVETTTSRKF 798 Query: 178 ---ENEAKEFKRDSEAKSENKEQSPPAETNQTFTSMRGWILGLWAFSIVGFLVVMYVMIS 8 +N+A+ +RD K+ +++ S P +N TF+SMR WI+ LWA SI FL VM VM+ Sbjct: 799 GKVDNDAQGLRRDDVPKNNSQQSSQPDMSNGTFSSMRFWIMALWAVSIFAFLGVMSVMLK 858 Query: 7 SR 2 R Sbjct: 859 GR 860 >ref|XP_006344223.1| PREDICTED: uncharacterized protein LOC102606280 [Solanum tuberosum] Length = 905 Score = 79.7 bits (195), Expect = 4e-13 Identities = 56/169 (33%), Positives = 82/169 (48%), Gaps = 15/169 (8%) Frame = -1 Query: 463 QRDLLSIECARSLHEAIQNYHQK-KCLEFNSLSPPSRD---------TRDPPLLTTSALK 314 QRDLLSIECA +L+EA+ +H++ KC + N++S P R+ TR T A Sbjct: 700 QRDLLSIECATTLNEALMLHHERRKCPDPNTISTPKRERENQDRVDETRTNAETRTRAET 
759 Query: 313 APDPXXXXXXXXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTETE-----NEAKEFKRD 149 D S + T E + + + ++ T + +E + + D Sbjct: 760 RTDAETRTSAETRTSAETRTSAETRTDAETRTNAEARMAVETTTSTKFGNVDEVQALRND 819 Query: 148 SEAKSENKEQSPPAETNQTFTSMRGWILGLWAFSIVGFLVVMYVMISSR 2 K+ ++E S +N TFTSMR WI+ LWA SI GFL VM VM+ R Sbjct: 820 EIPKNSSQESSQVETSNGTFTSMRFWIMVLWAVSIFGFLGVMSVMLRGR 868 >ref|XP_004238851.1| PREDICTED: uncharacterized protein LOC101257369 [Solanum lycopersicum] Length = 912 Score = 76.6 bits (187), Expect = 3e-12 Identities = 56/176 (31%), Positives = 87/176 (49%), Gaps = 22/176 (12%) Frame = -1 Query: 463 QRDLLSIECARSLHEAIQNYHQK-KCLEFNSLSPPSRD---------TR-DPPLLTTSAL 317 QRDLLS+ECA +L+EA++ +H++ KC + N++S P D TR + SA Sbjct: 700 QRDLLSVECATTLNEALRLHHERRKCPDPNTISTPKHDRVNQDRVDETRTNAETRRASAE 759 Query: 316 KAPDPXXXXXXXXXRSPDRETVGEVTSGRKVGKLEKIDVSSH-----DTETE------NE 170 + + D +T E + + ++I ++ +T T +E Sbjct: 760 TRTNAETRTSAESRTNADTKTDAETRTNSETRADDEIRTNAEARMAVETTTSTKFGGVDE 819 Query: 169 AKEFKRDSEAKSENKEQSPPAETNQTFTSMRGWILGLWAFSIVGFLVVMYVMISSR 2 + F+ D K+ ++E S +N TFTSMR WI+ LW SI GFL VM VM+ R Sbjct: 820 VQAFRHDEMPKNSSQESSQVETSNGTFTSMRFWIMVLWGVSIFGFLGVMSVMLKGR 875 >ref|XP_002271170.1| PREDICTED: uncharacterized protein LOC100242361 [Vitis vinifera] gi|296081317|emb|CBI17699.3| unnamed protein product [Vitis vinifera] Length = 817 Score = 75.9 bits (185), Expect = 5e-12 Identities = 52/158 (32%), Positives = 74/158 (46%), Gaps = 1/158 (0%) Frame = -1 Query: 472 EVFQRDLLSIECARSLHEAIQNYHQKK-CLEFNSLSPPSRDTRDPPLLTTSALKAPDPXX 296 ++ QRDLLSIECA+ L+EA+ YH+++ C + NSLS + DT Sbjct: 675 DILQRDLLSIECAKKLNEALYLYHKRRNCPDPNSLSKSAWDTAT---------------- 718 Query: 295 XXXXXXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTETENEAKEFKRDSEAKSENKEQS 116 E T RK G+ E V+ D N +K+ S Sbjct: 719 ----------------EATMSRKFGRFEGSYVARSDHGPMNISKQ-------------SS 749 Query: 115 PPAETNQTFTSMRGWILGLWAFSIVGFLVVMYVMISSR 2 P T++ F+S R W++GLWAFS++GFL VM V+ R Sbjct: 750 LPVVTDRAFSSFRFWLVGLWAFSVLGFLAVMLVVFLGR 787 >ref|XP_002298591.2| hypothetical 
protein POPTR_0001s36250g [Populus trichocarpa] gi|550349003|gb|EEE83396.2| hypothetical protein POPTR_0001s36250g [Populus trichocarpa] Length = 804 Score = 68.6 bits (166), Expect = 9e-10 Identities = 48/158 (30%), Positives = 77/158 (48%), Gaps = 1/158 (0%) Frame = -1 Query: 472 EVFQRDLLSIECARSLHEAIQNYHQKKCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXXX 293 ++ QRDLLSIEC ++L++A++ +H+K R+ DP L+TS Sbjct: 677 DILQRDLLSIECGKTLNDALELHHKK------------RNCPDPHSLSTSK--------- 715 Query: 292 XXXXXXRSPDRETVGEVTSGRKVGKLEKID-VSSHDTETENEAKEFKRDSEAKSENKEQS 116 R+T E +S RK G+ + + V S+ T+N ++E S Sbjct: 716 ----------RDTGKEDSSSRKFGRFDGSNAVRSNPVPTKN--------------SEETS 751 Query: 115 PPAETNQTFTSMRGWILGLWAFSIVGFLVVMYVMISSR 2 PP + F S+R W++ LW S +GFL VM+++ S R Sbjct: 752 PPVPKDGLFGSLRFWVVALWMISGLGFLAVMFMVFSGR 789 >ref|XP_004304697.1| PREDICTED: uncharacterized protein LOC101294199 [Fragaria vesca subsp. vesca] Length = 819 Score = 67.0 bits (162), Expect = 3e-09 Identities = 48/159 (30%), Positives = 74/159 (46%), Gaps = 3/159 (1%) Frame = -1 Query: 469 VFQRDLLSIECARSLHEAIQNYHQK-KCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXXX 293 + QRDLLSIEC ++L+EA++ +H++ KC + NSLS + D ++ Sbjct: 673 ILQRDLLSIECIKTLNEALRLHHERRKCPDPNSLSNSNSDAQE----------------- 715 Query: 292 XXXXXXRSPDRETVGEVTSGRKVGKLEKIDV--SSHDTETENEAKEFKRDSEAKSENKEQ 119 E+ RK GK+ V S+HD K+++ E Sbjct: 716 ---------------ELVVSRKFGKMNVSSVVESNHDQ---------------KNQSGEH 745 Query: 118 SPPAETNQTFTSMRGWILGLWAFSIVGFLVVMYVMISSR 2 S P ET+ F+S+R W++ WAF + FL V V+ S R Sbjct: 746 SEPTETDGMFSSVRFWVIAFWAFCGLVFLTVASVLFSGR 784 >ref|XP_007031710.1| F28J7.5 protein isoform 1 [Theobroma cacao] gi|508710739|gb|EOY02636.1| F28J7.5 protein isoform 1 [Theobroma cacao] Length = 820 Score = 64.3 bits (155), Expect = 2e-08 Identities = 44/152 (28%), Positives = 73/152 (48%) Frame = -1 Query: 463 QRDLLSIECARSLHEAIQNYHQKKCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXXXXXX 284 QRDLLSIECA++L+EA+ +H++ R+ DP L+T Sbjct: 676 QRDLLSIECAKTLNEALLLHHKR------------RNCPDPTALST-------------- 709 Query: 283 
XXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTETENEAKEFKRDSEAKSENKEQSPPAE 104 P+ +T ++T+ RK G D + K + ++ ++E S P Sbjct: 710 -----PELDTTKDITNSRKFGTFAGND-------------DIKSNPVPRNHSQESSLPRV 751 Query: 103 TNQTFTSMRGWILGLWAFSIVGFLVVMYVMIS 8 + F+++R WI+ LW FS +GF++VM V+ S Sbjct: 752 RDGLFSTLRFWIILLWVFSGLGFMLVMLVVFS 783 >ref|XP_004173585.1| PREDICTED: uncharacterized LOC101221472, partial [Cucumis sativus] Length = 384 Score = 62.0 bits (149), Expect = 8e-08 Identities = 48/155 (30%), Positives = 69/155 (44%) Frame = -1 Query: 466 FQRDLLSIECARSLHEAIQNYHQKKCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXXXXX 287 F RDLLSIEC R+L+EA+ +H+K R+ DP LL Sbjct: 235 FARDLLSIECIRTLNEALYLHHKK------------RNCSDPNLLA-------------- 268 Query: 286 XXXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTETENEAKEFKRDSEAKSENKEQSPPA 107 +P+ + EV RK+GKL+ E+ K D + ++E S A Sbjct: 269 -----NPNLDDESEVGVSRKIGKLD-------------ESYTGKEDHLSTDSSQESSQAA 310 Query: 106 ETNQTFTSMRGWILGLWAFSIVGFLVVMYVMISSR 2 + + F S+R WI+ LW S + FLVV+ S R Sbjct: 311 KEDGIFGSLRLWIIALWVISGLVFLVVIISKFSGR 345 >ref|XP_004145689.1| PREDICTED: uncharacterized protein LOC101221472 [Cucumis sativus] Length = 800 Score = 62.0 bits (149), Expect = 8e-08 Identities = 48/155 (30%), Positives = 69/155 (44%) Frame = -1 Query: 466 FQRDLLSIECARSLHEAIQNYHQKKCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXXXXX 287 F RDLLSIEC R+L+EA+ +H+K R+ DP LL Sbjct: 651 FARDLLSIECIRTLNEALYLHHKK------------RNCSDPNLLA-------------- 684 Query: 286 XXXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTETENEAKEFKRDSEAKSENKEQSPPA 107 +P+ + EV RK+GKL+ E+ K D + ++E S A Sbjct: 685 -----NPNLDDESEVGVSRKIGKLD-------------ESYTGKEDHLSTDSSQESSQAA 726 Query: 106 ETNQTFTSMRGWILGLWAFSIVGFLVVMYVMISSR 2 + + F S+R WI+ LW S + FLVV+ S R Sbjct: 727 KEDGIFGSLRLWIIALWVISGLVFLVVIISKFSGR 761 >ref|XP_007217047.1| hypothetical protein PRUPE_ppa001424mg [Prunus persica] gi|462413197|gb|EMJ18246.1| hypothetical protein PRUPE_ppa001424mg [Prunus persica] Length = 831 Score = 61.6 bits (148), Expect = 1e-07 Identities = 45/155 (29%), Positives = 69/155 (44%), Gaps = 1/155 (0%) 
Frame = -1 Query: 463 QRDLLSIECARSLHEAIQNYHQKK-CLEFNSLSPPSRDTRDPPLLTTSALKAPDPXXXXX 287 Q DLLSIEC ++L+EA++ +H+++ C + NSLS + D + Sbjct: 684 QTDLLSIECIKTLNEALRLHHERRNCPDPNSLSNSNSDAAE------------------- 724 Query: 286 XXXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTETENEAKEFKRDSEAKSENKEQSPPA 107 E+ RK GKL+ V + N ++E S P Sbjct: 725 -------------EIVVSRKFGKLDASRVVGSNRAEMNHSQEI-------------SEPT 758 Query: 106 ETNQTFTSMRGWILGLWAFSIVGFLVVMYVMISSR 2 T+ F+S+R W++ LWAF +GFL V V+ S R Sbjct: 759 LTDGLFSSVRFWVVALWAFCGLGFLTVASVLFSGR 793 >ref|XP_002526934.1| conserved hypothetical protein [Ricinus communis] gi|223533686|gb|EEF35421.1| conserved hypothetical protein [Ricinus communis] Length = 817 Score = 60.8 bits (146), Expect = 2e-07 Identities = 47/154 (30%), Positives = 71/154 (46%), Gaps = 1/154 (0%) Frame = -1 Query: 472 EVFQRDLLSIECARSLHEAIQNYHQK-KCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXX 296 ++ QRD LSIECAR L+EA+ +H+K KC + +SLS + D Sbjct: 668 DILQRDRLSIECARKLNEALFLHHKKRKCPDASSLSNSNSD------------------- 708 Query: 295 XXXXXXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTETENEAKEFKRDSEAKSENKEQS 116 T E S RK GK+++ +V+ R + ++E S Sbjct: 709 -------------TAKEAISSRKFGKIDEGNVA--------------RSNIPIRHSQETS 741 Query: 115 PPAETNQTFTSMRGWILGLWAFSIVGFLVVMYVM 14 PA + F S+R W++ LWA S VGF+ VM ++ Sbjct: 742 LPAMKDGLFGSLRIWVIVLWAVSGVGFIAVMLMV 775 >gb|EXC31392.1| hypothetical protein L484_017674 [Morus notabilis] Length = 811 Score = 60.5 bits (145), Expect = 2e-07 Identities = 47/157 (29%), Positives = 72/157 (45%), Gaps = 2/157 (1%) Frame = -1 Query: 472 EVFQRDLLSIECARSLHEAIQNYHQ-KKCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXX 296 ++ QRDLLSIEC R+++EA++ +H+ +KC + N SPP+ D TT Sbjct: 679 DIMQRDLLSIECIRTINEALRLHHERRKCQDPN--SPPATLNSDNTTTTT---------- 726 Query: 295 XXXXXXXRSPDRETVGEVTSGRKVGKLE-KIDVSSHDTETENEAKEFKRDSEAKSENKEQ 119 EV RK GK++ V S+ ET + ++E Sbjct: 727 ----------------EVAYSRKFGKVDTSYTVKSNKAET--------------NTSREL 756 Query: 118 SPPAETNQTFTSMRGWILGLWAFSIVGFLVVMYVMIS 8 S P T+ F + W++ LWA S +GFL V+ + S Sbjct: 757 
SEPTRTDGGFRPLAFWLVVLWAVSGLGFLAVLLCLFS 793 >ref|NP_566148.2| uncharacterized protein [Arabidopsis thaliana] gi|18175797|gb|AAL59929.1| unknown protein [Arabidopsis thaliana] gi|20465701|gb|AAM20319.1| unknown protein [Arabidopsis thaliana] gi|332640186|gb|AEE73707.1| uncharacterized protein AT3G01720 [Arabidopsis thaliana] gi|377652301|dbj|BAL63044.1| peptidyl serine alpha-galactosyltransferase [Arabidopsis thaliana] Length = 802 Score = 57.8 bits (138), Expect = 2e-06 Identities = 40/154 (25%), Positives = 67/154 (43%) Frame = -1 Query: 463 QRDLLSIECARSLHEAIQNYHQKKCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXXXXXX 284 QRDLLSIEC + L+EA+ +H+++ P+P Sbjct: 676 QRDLLSIECGQKLNEALFLHHKRR-------------------------NCPEPGS---- 706 Query: 283 XXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTETENEAKEFKRDSEAKSENKEQSPPAE 104 E+ +++ RKVG +E + ++ E KE S +E Sbjct: 707 --------ESTEKISVSRKVGNIET------------------KQTQGSDETKESSGSSE 740 Query: 103 TNQTFTSMRGWILGLWAFSIVGFLVVMYVMISSR 2 + F++++ W++ LW S VGFLVVM ++ S+R Sbjct: 741 SEGRFSTLKLWVIALWLISGVGFLVVMLLVFSTR 774 >gb|AAF01555.1|AC009325_25 unknown protein [Arabidopsis thaliana] gi|6091716|gb|AAF03428.1|AC010797_4 unknown protein [Arabidopsis thaliana] Length = 814 Score = 57.8 bits (138), Expect = 2e-06 Identities = 40/154 (25%), Positives = 67/154 (43%) Frame = -1 Query: 463 QRDLLSIECARSLHEAIQNYHQKKCLEFNSLSPPSRDTRDPPLLTTSALKAPDPXXXXXX 284 QRDLLSIEC + L+EA+ +H+++ P+P Sbjct: 688 QRDLLSIECGQKLNEALFLHHKRR-------------------------NCPEPGS---- 718 Query: 283 XXXRSPDRETVGEVTSGRKVGKLEKIDVSSHDTETENEAKEFKRDSEAKSENKEQSPPAE 104 E+ +++ RKVG +E + ++ E KE S +E Sbjct: 719 --------ESTEKISVSRKVGNIET------------------KQTQGSDETKESSGSSE 752 Query: 103 TNQTFTSMRGWILGLWAFSIVGFLVVMYVMISSR 2 + F++++ W++ LW S VGFLVVM ++ S+R Sbjct: 753 SEGRFSTLKLWVIALWLISGVGFLVVMLLVFSTR 786