BLASTX nr result
ID: Mentha26_contig00006801
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00006801 (1073 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

gb|EYU40115.1| hypothetical protein MIMGU_mgv1a009496mg [Mimulus...  146  2e-32
ref|XP_006470446.1| PREDICTED: surfeit locus protein 1-like [Cit...  131  4e-28
ref|XP_007209306.1| hypothetical protein PRUPE_ppa007867mg [Prun...  129  2e-27
ref|XP_006446373.1| hypothetical protein CICLE_v10015784mg [Citr...  127  6e-27
ref|XP_006406674.1| hypothetical protein EUTSA_v10021015mg [Eutr...  125  2e-26
ref|XP_002317864.2| hypothetical protein POPTR_0012s04250g [Popu...  125  2e-26
ref|XP_006841105.1| hypothetical protein AMTR_s00086p00076960 [A...  125  4e-26
ref|XP_003619849.1| Surfeit locus protein [Medicago truncatula] ...  124  9e-26
ref|XP_002883096.1| hypothetical protein ARALYDRAFT_479277 [Arab...  123  1e-25
ref|XP_007036878.1| Surfeit locus 1 cytochrome c oxidase biogene...  123  2e-25
ref|NP_566592.1| Surfeit locus 1 cytochrome c oxidase biogenesis...  122  2e-25
ref|XP_004137509.1| PREDICTED: surfeit locus protein 1-like [Cuc...  122  3e-25
ref|XP_003534137.1| PREDICTED: surfeit locus protein 1-like [Gly...  120  7e-25
ref|XP_002282742.1| PREDICTED: surfeit locus protein 1 isoform 1...  120  1e-24
ref|XP_004299519.1| PREDICTED: surfeit locus protein 1-like [Fra...  120  1e-24
ref|XP_004170396.1| PREDICTED: surfeit locus protein 1-like, par...  119  3e-24
ref|XP_007152675.1| hypothetical protein PHAVU_004G149700g [Phas...  118  4e-24
ref|XP_006298040.1| hypothetical protein CARUB_v10014085mg [Caps...
118 4e-24 ref|XP_004235793.1| PREDICTED: surfeit locus protein 1-like [Sol... 118 4e-24 ref|XP_006341513.1| PREDICTED: surfeit locus protein 1-like [Sol... 118 5e-24 >gb|EYU40115.1| hypothetical protein MIMGU_mgv1a009496mg [Mimulus guttatus] Length = 340 Score = 146 bits (368), Expect = 2e-32 Identities = 99/280 (35%), Positives = 135/280 (48%), Gaps = 39/280 (13%) Frame = +3 Query: 36 AAPSKWDPKRSGLSNAARRAIATSPQPAQEENCRSTFIEKWWFLIPVSLFGSAAWNFSRA 215 A P W P S +S +A AI+ PQP QE RST+ + F+ FG W R Sbjct: 21 AIPPNWAPHSSPISTSAA-AISAEPQPEQEIKRRSTWSKLLLFIPGAMTFGLGTWQIFRR 79 Query: 216 RGEEKVRDYRKSRLELGALNGSTDICS---AENLEFRRVECEGVYDENNSILVHKYLKRK 386 + + K +YR+SRLEL L G+ + S ++LEFRRV+ +GV+D+ SI V + Sbjct: 80 QEKIKTLEYRQSRLELEPLKGNDIVPSNGSLDSLEFRRVQFKGVFDDKKSIYVGPRSRSI 139 Query: 387 SGERINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKA----------------- 515 SG NGY S+QSPILVNRGWVP SW +KA Sbjct: 140 SGVTENGYYLITPLVPFHGDPESVQSPILVNRGWVPRSWRDKALLNVPEDEPPAKSSSSA 199 Query: 516 -------------------SDLRSNSLPISTARGQKVKFVGVISKGERPNKVWRSNNAAK 638 + N LP +T V+ +GVI E P+ +N+ Sbjct: 200 SIQESAKVSWWRFWSNDKQEVVEENLLPSAT----PVEVIGVIRGSENPSIFVPANDPNA 255 Query: 639 SEWFTVDVPSISRACGLPETTLHVVEIDSSEERNMRKPYP 758 ++WF VDVP+++R CGLPE TL+V +I +E N PYP Sbjct: 256 AQWFYVDVPALARVCGLPENTLYVEDI--NEHVNPSSPYP 293 >ref|XP_006470446.1| PREDICTED: surfeit locus protein 1-like [Citrus sinensis] Length = 350 Score = 131 bits (330), Expect = 4e-28 Identities = 86/269 (31%), Positives = 139/269 (51%), Gaps = 41/269 (15%) Frame = +3 Query: 75 SNAARRAIATSPQPAQ----EENCR-----STFIEKWWFLIPVSL-FGSAAWNFSRARGE 224 S++A A++++PQ + +EN R S+ KW +P ++ FG W R + + Sbjct: 37 SSSAAAALSSAPQLSSSSQDQENVRKGSAPSSTWSKWLLFVPGAISFGLGTWQILRRQDK 96 Query: 225 EKVRDYRKSRLELGALNGSTDICSAENL---EFRRVECEGVYDENNSILVHKYLKRKSGE 395 K+ +YR++RL++ L + E+L EFRRV C+GV+DE SI V + SG Sbjct: 97 IKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRVICQGVFDEQRSIYVGPRSRSISGV 156 Query: 396 RINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKASDL-RSNSLPISTARG---- 560 NGY +S++SP+LVNRGWVP SW +K+S++ R + P++ 
A Sbjct: 157 TENGYYVITPLMPIPNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQS 216 Query: 561 -----------------------QKVKFVGVISKGERPNKVWRSNNAAKSEWFTVDVPSI 671 V+ VGV+ E+P+ +N+ + +WF VDVP+I Sbjct: 217 QQSSWWWFWLKKPNIVEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAI 276 Query: 672 SRACGLPETTLHVVEIDSSEERNMRKPYP 758 +RACG+PE ++++ +I +E N PYP Sbjct: 277 ARACGIPENSVYIEDI--NENVNPSNPYP 303 >ref|XP_007209306.1| hypothetical protein PRUPE_ppa007867mg [Prunus persica] gi|462405041|gb|EMJ10505.1| hypothetical protein PRUPE_ppa007867mg [Prunus persica] Length = 353 Score = 129 bits (325), Expect = 2e-27 Identities = 86/253 (33%), Positives = 122/253 (48%), Gaps = 35/253 (13%) Frame = +3 Query: 105 SPQPAQEENCRSTFIEKWWFLIPVSL-FGSAAWNFSRARGEEKVRDYRKSRLELGALNGS 281 S Q + E R + KW +P ++ FG W R + + K+ DYR+ RLE+ +N + Sbjct: 59 SSQATERERSRWS---KWLLFLPGAVSFGLGTWQIFRRQEKIKMLDYRQKRLEMEPVNFN 115 Query: 282 TDICSAE---NLEFRRVECEGVYDENNSILVHKYLKRKSGERINGYNXXXXXXXXXXXXR 452 S+E +LEFRRV C+G +DE SI V + SG NGY Sbjct: 116 NVSLSSEELDHLEFRRVICKGYFDEERSIYVGPRSRSISGVTENGYYVITPLVPVSDKPE 175 Query: 453 SIQSPILVNRGWVPLSWGNKASDLR------SNSLPISTARGQK---------------- 566 +Q PILVNRGWVP SW K+S++ SN P S ++ Sbjct: 176 RVQPPILVNRGWVPRSWKEKSSEVHEDGEQPSNVAPSSVQENERRSWWRFWMKKSKVVEV 235 Query: 567 ---------VKFVGVISKGERPNKVWRSNNAAKSEWFTVDVPSISRACGLPETTLHVVEI 719 V+ VGV+ E+P+ N+ S+WF VDVP+I+R CGLPE T+++ +I Sbjct: 236 DQQTPAFAPVEIVGVVRGSEKPSIFVPPNDPKSSQWFYVDVPAIARTCGLPEDTVYIEDI 295 Query: 720 DSSEERNMRKPYP 758 +E N PYP Sbjct: 296 --NENVNPSNPYP 306 >ref|XP_006446373.1| hypothetical protein CICLE_v10015784mg [Citrus clementina] gi|557548984|gb|ESR59613.1| hypothetical protein CICLE_v10015784mg [Citrus clementina] Length = 350 Score = 127 bits (320), Expect = 6e-27 Identities = 86/269 (31%), Positives = 137/269 (50%), Gaps = 41/269 (15%) Frame = +3 Query: 75 SNAARRAIATSPQPAQ----EENCR-----STFIEKWWFLIPVSL-FGSAAWNFSRARGE 224 S++A A++++PQ + +EN R S+ KW +P ++ FG W R + + Sbjct: 37 
SSSAAAALSSAPQLSSSSQDQENVRKGSAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDK 96 Query: 225 EKVRDYRKSRLELGALNGSTDICSAENL---EFRRVECEGVYDENNSILVHKYLKRKSGE 395 K+ +YR++RL++ L + E+L EFRRV C+GV+DE SI V + SG Sbjct: 97 IKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRVICQGVFDEQRSIYVGPRSRSISGV 156 Query: 396 RINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKASDL-RSNSLPISTARG---- 560 NGY +S++SP+LVNRGWVP SW +K+S++ R + P++ A Sbjct: 157 TENGYYVITPLMPIPNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQS 216 Query: 561 -----------------------QKVKFVGVISKGERPNKVWRSNNAAKSEWFTVDVPSI 671 V+ VGV+ E+P+ +N+ + +WF VDVP+I Sbjct: 217 QQSSWWWFWLKKPNIVEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAI 276 Query: 672 SRACGLPETTLHVVEIDSSEERNMRKPYP 758 + ACGL E T+++ D++E N PYP Sbjct: 277 ACACGLSENTVYIE--DTNENVNPSNPYP 303 >ref|XP_006406674.1| hypothetical protein EUTSA_v10021015mg [Eutrema salsugineum] gi|557107820|gb|ESQ48127.1| hypothetical protein EUTSA_v10021015mg [Eutrema salsugineum] Length = 356 Score = 125 bits (315), Expect = 2e-26 Identities = 91/279 (32%), Positives = 137/279 (49%), Gaps = 34/279 (12%) Frame = +3 Query: 24 SRHLAAPSKWDPKRSGLSNAARRAIATSPQPAQE-ENCRSTFIEKWWFLIPVSL-FGSAA 197 SRH +A + DP S S+AA + +SP P +E + ST K+ +P ++ FG + Sbjct: 37 SRHFSAVA--DPSFS--SSAALGSQTSSPAPLKENKRGSSTKWSKFLLFLPGAITFGLGS 92 Query: 198 WNFSRARGEEKVRDYRKSRLELGALNGSTDICSAENL---EFRRVECEGVYDENNSILVH 368 W R + K +Y++ RL L + +T + +NL EFRRV C+GV+DE SI + Sbjct: 93 WQIVRREEKIKTLEYQQQRLNLEPMKLNTQLPPDKNLDTLEFRRVTCKGVFDEQKSIYLG 152 Query: 369 KYLKRKSGERINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKASDLRSNSLPIS 548 + SG NG+ S+QSPILVNRGWVP SW K+ + + S Sbjct: 153 PRSRSISGVTENGFYVITPLMPIPNDLDSMQSPILVNRGWVPRSWREKSPETTDANFVTS 212 Query: 549 TARGQK-----------------------------VKFVGVISKGERPNKVWRSNNAAKS 641 + K V+ VGVI GE P+ +N+ + Sbjct: 213 DSTEAKPLSHEQNSWWKFWSKKPVIIKEHGSAIKPVEVVGVIRGGENPSIFVPANDPSTG 272 Query: 642 EWFTVDVPSISRACGLPETTLHVVEIDSSEERNMRKPYP 758 +WF VDVP+++RA GLPE T++V ++ +R+ +PYP Sbjct: 273 QWFYVDVPAMARAIGLPENTIYVEDVHEDIDRS--RPYP 309 >ref|XP_002317864.2| hypothetical 
protein POPTR_0012s04250g [Populus trichocarpa] gi|550326363|gb|EEE96084.2| hypothetical protein POPTR_0012s04250g [Populus trichocarpa] Length = 344 Score = 125 bits (315), Expect = 2e-26 Identities = 87/274 (31%), Positives = 134/274 (48%), Gaps = 35/274 (12%) Frame = +3 Query: 42 PSKWDPKRSGLSNAARRAIAT--SPQPAQEENCRSTFIEKWWFLIPVSL-FGSAAWNFSR 212 P W P S S A A S QP ++E+ + + KW +P ++ FG W R Sbjct: 28 PKYWIPSSSSSSPFCSSASAATISAQPPEKES--GSRLSKWLLFLPGAITFGLGTWQVLR 85 Query: 213 ARGEEKVRDYRKSRLELGALNGSTDICSAE---NLEFRRVECEGVYDENNSILVHKYLKR 383 + + K+ +YR+ RL + + + S+E +LEFRRV C+GV+ + SI V + Sbjct: 86 RQDKIKMLEYREGRLAMEPMKFNDISPSSEQLDDLEFRRVACKGVFYDKMSIYVGPRSRN 145 Query: 384 KSGERINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKA-----SDLRSNSLPIS 548 SG NGY +QSPILVNRGWVP SW + + D + + + ++ Sbjct: 146 ISGITENGYYIITPLMPVSKNPECVQSPILVNRGWVPRSWKDNSLEVSQDDEQPSDIAMA 205 Query: 549 TARGQK------------------------VKFVGVISKGERPNKVWRSNNAAKSEWFTV 656 +A+G + V+ VGV+ E+P+ +N+ + +WF V Sbjct: 206 SAQGSEKSSWWRFWSRKPKTIEEKIPSIAPVEVVGVVRGSEKPSIFVPANDPSSFQWFYV 265 Query: 657 DVPSISRACGLPETTLHVVEIDSSEERNMRKPYP 758 DVP+I+R CGLPE T++V +I +E N PYP Sbjct: 266 DVPAIARVCGLPENTIYVEDI--NENFNSGCPYP 297 >ref|XP_006841105.1| hypothetical protein AMTR_s00086p00076960 [Amborella trichopoda] gi|548842999|gb|ERN02780.1| hypothetical protein AMTR_s00086p00076960 [Amborella trichopoda] Length = 343 Score = 125 bits (313), Expect = 4e-26 Identities = 84/273 (30%), Positives = 126/273 (46%), Gaps = 42/273 (15%) Frame = +3 Query: 66 SGLSNAARRAIATSPQPAQE----ENCRSTFIEKWWFLIPVSLFGSAAWNFSRARGEEKV 233 S + ++ ++++S QE E+ R + + FL FG W R + + ++ Sbjct: 26 SAFISTSQLSLSSSSTQTQEGINGESERKRWSSLFLFLPGAITFGLGTWQLFRRQEKIEM 85 Query: 234 RDYRKSRLELGAL---------NGSTDICSAENLEFRRVECEGVYDENNSILVHKYLKRK 386 +YR+ RL L L NGS ++LEFRRV CEGV+DE+ S+ + + Sbjct: 86 LEYRRGRLALEPLTWTSISSQFNGSRSDGEMDSLEFRRVLCEGVFDESKSVYIGPRSRSI 145 Query: 387 SGERINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKASDL-------------- 524 SG NGY S+Q PILVNRGWVP SW NK + Sbjct: 146 
SGVTENGYYVVTPLMPVKNKSDSVQLPILVNRGWVPRSWRNKFVEAAEEAKQPSHTTLSG 205 Query: 525 ---------------RSNSLPISTARGQKVKFVGVISKGERPNKVWRSNNAAKSEWFTVD 659 +S + + + VK +GV+ E+P+ N+ +WF VD Sbjct: 206 IEESKGSFWSKFWPKKSEVVEVQEPKVDAVKVIGVVRGSEKPSIFVPENDPGSGQWFYVD 265 Query: 660 VPSISRACGLPETTLHVVEIDSSEERNMRKPYP 758 VP+I+RACG+PE T++V +I +E N PYP Sbjct: 266 VPAIARACGIPENTVYVEDI--NENVNPSYPYP 296 >ref|XP_003619849.1| Surfeit locus protein [Medicago truncatula] gi|355494864|gb|AES76067.1| Surfeit locus protein [Medicago truncatula] Length = 333 Score = 124 bits (310), Expect = 9e-26 Identities = 82/234 (35%), Positives = 117/234 (50%), Gaps = 32/234 (13%) Frame = +3 Query: 153 KWWFLIPVSL-FGSAAWNFSRARGEEKVRDYRKSRLELGALNGSTDICSAE---NLEFRR 320 KWW +P ++ FG +W R + K+ +YR RL++ L S S+E +LEFR+ Sbjct: 55 KWWLYLPGAIAFGLGSWQIVRREDKIKMLEYRGKRLQMEPLKFSGAYPSSEELDSLEFRK 114 Query: 321 VECEGVYDENNSILVHKYLKRKSGERINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLS 500 V C+GV+D+ SI V + SG NGY S+ SPILVNRGWVP S Sbjct: 115 VVCKGVFDDKKSIYVGPRSRSISGVTENGYYVITPLMPVHDHPDSVSSPILVNRGWVPRS 174 Query: 501 WGNKASDLR-----SNSLPI-STARGQKV----------------------KFVGVISKG 596 W +K + ++ LP S A G + + VGV+ Sbjct: 175 WKDKFLEASHDEQFADPLPSPSQADGTRSWWRFWSKEPVSSEDQVPSITPNEVVGVVRGS 234 Query: 597 ERPNKVWRSNNAAKSEWFTVDVPSISRACGLPETTLHVVEIDSSEERNMRKPYP 758 E P+ +N+ S+WF +DVPSI+R+CGLPE T++V +I +E N PYP Sbjct: 235 ENPSIFVPANDPGSSQWFYIDVPSIARSCGLPENTVYVDDI--NENVNPSNPYP 286 >ref|XP_002883096.1| hypothetical protein ARALYDRAFT_479277 [Arabidopsis lyrata subsp. lyrata] gi|297328936|gb|EFH59355.1| hypothetical protein ARALYDRAFT_479277 [Arabidopsis lyrata subsp. 
lyrata] Length = 354 Score = 123 bits (309), Expect = 1e-25 Identities = 89/277 (32%), Positives = 133/277 (48%), Gaps = 32/277 (11%) Frame = +3 Query: 24 SRHLAAPSKWDPKRSGLSNAARRAIATSPQPAQEENCRSTFIEKWWFLIPVSLFGSAAWN 203 SRH +A + S ++AA + ++S P QE S + + FL FG +W Sbjct: 37 SRHFSAVAD----SSSSTSAALGSQSSSSAPPQENKRGSKWSQLLLFLPGAITFGLGSWQ 92 Query: 204 FSRARGEEKVRDYRKSRLELGALNGSTDICSAENL---EFRRVECEGVYDENNSILVHKY 374 R + K +Y++ RL + + + D +NL EFRRV C+GV+DE SI + Sbjct: 93 IVRREEKFKTLEYQQRRLNMEPMKLNIDHPPDKNLDALEFRRVSCKGVFDEQRSIYLGPR 152 Query: 375 LKRKSGERINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKA-----SDLRSNS- 536 + SG NG+ S+QSPILVNRGWVP SW K+ +D +N Sbjct: 153 SRSISGVTENGFYLITPLMPIPGDLDSMQSPILVNRGWVPRSWREKSPESTEADFAANQS 212 Query: 537 -------------------LPISTARG----QKVKFVGVISKGERPNKVWRSNNAAKSEW 647 P+ T + V+ VGVI GE P+ SN+ + +W Sbjct: 213 TKAESPSNEPKSWWKFWSKTPVITKEHVSVVKPVEVVGVIRGGENPSIFVPSNDPSSGQW 272 Query: 648 FTVDVPSISRACGLPETTLHVVEIDSSEERNMRKPYP 758 F VDVP+++RA GLPE T++V ++ +R+ +PYP Sbjct: 273 FYVDVPAMARAVGLPENTIYVEDVHEHVDRS--RPYP 307 >ref|XP_007036878.1| Surfeit locus 1 cytochrome c oxidase biogenesis protein isoform 1 [Theobroma cacao] gi|590665998|ref|XP_007036879.1| Surfeit locus 1 cytochrome c oxidase biogenesis protein isoform 1 [Theobroma cacao] gi|590666002|ref|XP_007036880.1| Surfeit locus 1 cytochrome c oxidase biogenesis protein isoform 1 [Theobroma cacao] gi|590666009|ref|XP_007036882.1| Surfeit locus 1 cytochrome c oxidase biogenesis protein isoform 1 [Theobroma cacao] gi|508774123|gb|EOY21379.1| Surfeit locus 1 cytochrome c oxidase biogenesis protein isoform 1 [Theobroma cacao] gi|508774124|gb|EOY21380.1| Surfeit locus 1 cytochrome c oxidase biogenesis protein isoform 1 [Theobroma cacao] gi|508774125|gb|EOY21381.1| Surfeit locus 1 cytochrome c oxidase biogenesis protein isoform 1 [Theobroma cacao] gi|508774127|gb|EOY21383.1| Surfeit locus 1 cytochrome c oxidase biogenesis protein isoform 1 [Theobroma cacao] Length = 337 Score = 123 bits (308), Expect = 2e-25 
Identities = 90/278 (32%), Positives = 135/278 (48%), Gaps = 33/278 (11%) Frame = +3 Query: 24 SRHLAAPSKWDPKRSGLSNAARRAIATSPQPAQEENCRSTFIEKWWFLIPVSL-FGSAAW 200 S L P W P S S AA A+++S QE+ ST+ +W+ +P ++ FG W Sbjct: 21 SNQLLPPKYWVPPAS-FSTAA--AVSSSQSHDQEKG--STW-SRWFLFLPGAITFGLGTW 74 Query: 201 NFSRARGEEKVRDYRKSRLELGALNGSTDICSAENLE---FRRVECEGVYDENNSILVHK 371 R + + K+ +YR+ RL++ L + S+ENLE FRRV C GV+D+ SI V Sbjct: 75 QIFRRQDKIKMLEYRQKRLQMEPLKLNNMPPSSENLESLEFRRVVCRGVFDDGRSIYVGP 134 Query: 372 YLKRKSGERINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKASDL---RSNSLP 542 + SG NGY S+Q+P+LVNRGWVP SW +K+ ++ R S Sbjct: 135 RSRSISGVTENGYYVITPLVPIANNAESVQAPVLVNRGWVPRSWRDKSFEVPQEREKSSS 194 Query: 543 ISTARGQK--------------------------VKFVGVISKGERPNKVWRSNNAAKSE 644 I Q+ ++ +GV+ E+P+ +N+ + Sbjct: 195 IEAVPAQQSEQSWWWQFWSKKPKVVEDQAPAITSIEVIGVVRGSEKPSIFVPANDPNSRQ 254 Query: 645 WFTVDVPSISRACGLPETTLHVVEIDSSEERNMRKPYP 758 WF VDVP+I+ A GLPE +L + +I +E N PYP Sbjct: 255 WFYVDVPAIAVASGLPEDSLLIEDI--NENVNPSNPYP 290 >ref|NP_566592.1| Surfeit locus 1 cytochrome c oxidase biogenesis protein [Arabidopsis thaliana] gi|75203836|sp|Q9SE51.1|SURF1_ARATH RecName: Full=Surfeit locus protein 1; Short=Surfeit 1; AltName: Full=Cytochrome c oxidase assembly protein SURF1; AltName: Full=Protein EMBRYO DEFECTIVE 3121; AltName: Full=Surfeit locus 1 cytochrome c oxidase biogenesis protein gi|6630873|gb|AAF19609.1|AF182953_1 Surfeit 1 [Arabidopsis thaliana] gi|89000977|gb|ABD59078.1| At3g17910 [Arabidopsis thaliana] gi|332642502|gb|AEE76023.1| Surfeit locus 1 cytochrome c oxidase biogenesis protein [Arabidopsis thaliana] Length = 354 Score = 122 bits (307), Expect = 2e-25 Identities = 88/277 (31%), Positives = 131/277 (47%), Gaps = 32/277 (11%) Frame = +3 Query: 24 SRHLAAPSKWDPKRSGLSNAARRAIATSPQPAQEENCRSTFIEKWWFLIPVSLFGSAAWN 203 SRH +A + S S+AA + ++S P QE S + + FL FG +W Sbjct: 37 SRHFSAVAD----SSSSSSAALGSQSSSSAPPQENKRGSKWSQLLLFLPGAITFGLGSWQ 92 Query: 204 FSRARGEEKVRDYRKSRLELGALNGSTDICSAENL---EFRRVECEGVYDENNSILVHKY 374 R + K 
+Y++ RL + + + D +NL EFRRV C+GV+DE SI + Sbjct: 93 IVRREEKFKTLEYQQQRLNMEPIKLNIDHPLDKNLNALEFRRVSCKGVFDEQRSIYLGPR 152 Query: 375 LKRKSGERINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKASDLRS-------- 530 + SG NG+ S+QSPILVNRGWVP SW K+ + Sbjct: 153 SRSISGITENGFFVITPLMPIPGDLDSMQSPILVNRGWVPRSWREKSQESAEAEFIANQS 212 Query: 531 -----------------NSLPISTARG----QKVKFVGVISKGERPNKVWRSNNAAKSEW 647 + P+ T + V+ VGVI GE P+ SN+ + +W Sbjct: 213 TKAKSPSNEPKSWWKFWSKTPVITKEHISAVKPVEVVGVIRGGENPSIFVPSNDPSTGQW 272 Query: 648 FTVDVPSISRACGLPETTLHVVEIDSSEERNMRKPYP 758 F VDVP+++RA GLPE T++V ++ +R+ +PYP Sbjct: 273 FYVDVPAMARAVGLPENTIYVEDVHEHVDRS--RPYP 307 >ref|XP_004137509.1| PREDICTED: surfeit locus protein 1-like [Cucumis sativus] Length = 345 Score = 122 bits (305), Expect = 3e-25 Identities = 85/269 (31%), Positives = 129/269 (47%), Gaps = 34/269 (12%) Frame = +3 Query: 54 DPKRSGLSNAARRAIATSPQPAQEENCRSTFIEKWWFLIPVSL-FGSAAWNFSRARGEEK 230 DP S LS QP Q++ R + + KW +P +L FG W R + + + Sbjct: 45 DPNSSSLS-----------QPQQKQ--RESRLSKWLLFLPGALTFGLGTWQIFRRQEKIE 91 Query: 231 VRDYRKSRLELGALNGSTDIC---SAENLEFRRVECEGVYDENNSILVHKYLKRKSGERI 401 + DYR+ RL + +N + + ++LEFRRV C+GV+DE SI V + SG Sbjct: 92 MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 151 Query: 402 NGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKA-------SDLRSNSLPISTARG 560 NG+ S+QSP+LVNRGW P +W KA S+ S+ +P G Sbjct: 152 NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 211 Query: 561 QK-----------------------VKFVGVISKGERPNKVWRSNNAAKSEWFTVDVPSI 671 ++ V+ +GV+ E+P+ +N+ +WF VDVP+I Sbjct: 212 ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI 271 Query: 672 SRACGLPETTLHVVEIDSSEERNMRKPYP 758 +R+ GLPE T++V +I +E N PYP Sbjct: 272 ARSSGLPEDTIYVEDI--NENVNPSDPYP 298 >ref|XP_003534137.1| PREDICTED: surfeit locus protein 1-like [Glycine max] Length = 333 Score = 120 bits (302), Expect = 7e-25 Identities = 85/264 (32%), Positives = 128/264 (48%), Gaps = 33/264 (12%) Frame = +3 Query: 66 SGLSNAARRAIATSPQ--PAQEENCRSTFIEKWWFLIPVSL-FGSAAWNFSRARGEEKVR 
236 S + AA +++ S P+ E+ R +W +P ++ FG W R + K+ Sbjct: 27 SSAAGAAVSSVSDSDPTLPSSSESQRKA--SRWLLFLPGAITFGLGTWQIIRREEKIKML 84 Query: 237 DYRKSRLELGALNGSTDICSAE---NLEFRRVECEGVYDENNSILVHKYLKRKSGERING 407 +YR++RL++ L S+ S E +LEFR+V C+G +D+ SI V + SG NG Sbjct: 85 EYRENRLQMEPLKFSSAYSSNEELDSLEFRKVVCKGYFDDKKSIYVGPRSRSISGITENG 144 Query: 408 YNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNK---ASD------------------- 521 Y S+ PILVNRGWVP SW +K AS+ Sbjct: 145 YYIITPLMPVPNCPDSVSFPILVNRGWVPRSWKDKFLEASEDEDLEDALPSPSHDDGTKS 204 Query: 522 -----LRSNSLPISTARGQKVKFVGVISKGERPNKVWRSNNAAKSEWFTVDVPSISRACG 686 R + A ++ VGV+ + E+P+ +N+ S+WF VDVP I+RACG Sbjct: 205 WWRFWSRKPVIEDQVASVTPIEVVGVVRESEKPSIFVPANDPKASQWFYVDVPGIARACG 264 Query: 687 LPETTLHVVEIDSSEERNMRKPYP 758 LPE T++V +I +E+ N PYP Sbjct: 265 LPENTIYVEDI--NEDVNPSNPYP 286 >ref|XP_002282742.1| PREDICTED: surfeit locus protein 1 isoform 1 [Vitis vinifera] gi|359491038|ref|XP_003634208.1| PREDICTED: surfeit locus protein 1 isoform 2 [Vitis vinifera] gi|297734345|emb|CBI15592.3| unnamed protein product [Vitis vinifera] Length = 349 Score = 120 bits (301), Expect = 1e-24 Identities = 89/264 (33%), Positives = 127/264 (48%), Gaps = 33/264 (12%) Frame = +3 Query: 66 SGLSNAARRAIATSPQPAQEENCRSTFIEKWWFLIPVSL-FGSAAWNFSRARGEEKVRDY 242 + +S+A+ + T PQ + E R KW +P ++ FG +W R + + + DY Sbjct: 43 ASVSSASSVSSLTEPQSSGGEQRRGW--TKWLLFVPGAVTFGLGSWQILRRQDKINMLDY 100 Query: 243 RKSRLELGALNGSTDICSAE---NLEFRRVECEGVYDENNSILVHKYLKRKSGERINGYN 413 R+ RL+L + GS E +LEFRRV+ +G +DE SI V + SG NGY Sbjct: 101 RRKRLDLEPIPGSNLYSLNEKLDSLEFRRVKAKGFFDEKKSIYVGPRSRSISGVTENGYY 160 Query: 414 XXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNK----------ASDLRSNSLPIS----- 548 S+QSPILVNRGWVP SW +K + ++ S S+ S Sbjct: 161 LITPLMPIPDDPDSVQSPILVNRGWVPRSWRDKFLQDLPVDEQSKNIASPSIQESERSSW 220 Query: 549 ---------TARGQ-----KVKFVGVISKGERPNKVWRSNNAAKSEWFTVDVPSISRACG 686 T Q V+ VGV+ E+P+ N+ +WF VDVP+ISRA G Sbjct: 221 WRFWSKKPKTVEDQVPAVTPVEVVGVVRGSEKPSIFVPENDLCSRQWFYVDVPAISRASG 280 Query: 687 LPETTLHVVEIDSSEERNMRKPYP 758 L E T++V +I +E 
N PYP Sbjct: 281 LAENTIYVDDI--NENVNPSNPYP 302 >ref|XP_004299519.1| PREDICTED: surfeit locus protein 1-like [Fragaria vesca subsp. vesca] Length = 351 Score = 120 bits (300), Expect = 1e-24 Identities = 86/269 (31%), Positives = 125/269 (46%), Gaps = 38/269 (14%) Frame = +3 Query: 66 SGLSNAARRAIATSPQ-----PAQEENCRSTFIEKWWFLIPVSL-FGSAAWNFSRARGEE 227 S LS+++ A ++ P+ +Q + KW +P ++ FG W R + + Sbjct: 38 SSLSSSSTAAASSEPEFQSAISSQAPERERSRWSKWLLFLPGAITFGLGTWQIVRRQDKI 97 Query: 228 KVRDYRKSRLELGAL---NGSTDICSAENLEFRRVECEGVYDENNSILVHKYLKRKSGER 398 ++ +YR+ RLE+ L + S ENLEFRRV C+G +DE SI V + SG Sbjct: 98 QMLEYRRKRLEMEPLQFNHVSPSTKELENLEFRRVLCKGHFDEKGSIYVGPRSRSISGVT 157 Query: 399 INGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNKASDL------RSNSLPISTARG 560 NGY S Q PILVNRGWVP SW K+S+L SN+ P Sbjct: 158 ENGYYVITPLLPVSDEAESSQPPILVNRGWVPRSWKEKSSELPQDDEQPSNTTPSIGKEE 217 Query: 561 QKVKF-----------------------VGVISKGERPNKVWRSNNAAKSEWFTVDVPSI 671 ++ + VGVI E+P+ N+ +WF VDVP+I Sbjct: 218 ERASWWRFWTKKPKVVEDQPPTQAPDEVVGVIRGSEKPSIFVPPNDPNSGQWFYVDVPAI 277 Query: 672 SRACGLPETTLHVVEIDSSEERNMRKPYP 758 +R GLPE T+++ +I E+ N PYP Sbjct: 278 ARTFGLPEDTIYIEDI--HEDVNPSNPYP 304 >ref|XP_004170396.1| PREDICTED: surfeit locus protein 1-like, partial [Cucumis sativus] Length = 289 Score = 119 bits (297), Expect = 3e-24 Identities = 77/242 (31%), Positives = 119/242 (49%), Gaps = 34/242 (14%) Frame = +3 Query: 135 RSTFIEKWWFLIPVSL-FGSAAWNFSRARGEEKVRDYRKSRLELGALNGSTDIC---SAE 302 R + + KW +P +L FG W R + + ++ DYR+ RL + +N + + + Sbjct: 3 RESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRRKRLLMEPVNINNLLSLEDKLD 62 Query: 303 NLEFRRVECEGVYDENNSILVHKYLKRKSGERINGYNXXXXXXXXXXXXRSIQSPILVNR 482 +LEFRRV C+GV+DE SI V + SG NG+ S+QSP+LVNR Sbjct: 63 DLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNR 122 Query: 483 GWVPLSWGNKA-------SDLRSNSLPISTARGQK-----------------------VK 572 GW P +W KA S+ S+ +P G++ V+ Sbjct: 123 GWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGGERSSWWKFWSKKTESLENEITPITPVE 182 Query: 573 
FVGVISKGERPNKVWRSNNAAKSEWFTVDVPSISRACGLPETTLHVVEIDSSEERNMRKP 752 +GV+ E+P+ +N+ +WF VDVP+I+R+ GLPE T++V +I +E N P Sbjct: 183 VIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDI--NENVNPSDP 240 Query: 753 YP 758 YP Sbjct: 241 YP 242 >ref|XP_007152675.1| hypothetical protein PHAVU_004G149700g [Phaseolus vulgaris] gi|561025984|gb|ESW24669.1| hypothetical protein PHAVU_004G149700g [Phaseolus vulgaris] Length = 333 Score = 118 bits (296), Expect = 4e-24 Identities = 83/267 (31%), Positives = 129/267 (48%), Gaps = 35/267 (13%) Frame = +3 Query: 63 RSGLSNAARRAIATSPQ--PAQEENCRSTFIEKWWFLIPVSL-FGSAAWNFSRARGEEKV 233 RS S AA +++ S P+ E+ R + +W +P ++ FG W R + K+ Sbjct: 24 RSFSSAAAVSSVSDSDPSLPSSSESQRKS--SRWLLFLPGAITFGLGTWQIIRREEKIKM 81 Query: 234 RDYRKSRLELGALNGSTDICSAE---NLEFRRVECEGVYDENNSILVHKYLKRKSGERIN 404 +YR+ RL++ L S+ E +LEFR+V C+G +++ SI V + SG N Sbjct: 82 LEYREKRLQMEPLKFSSTYSFNEELDSLEFRKVACKGYFEDRKSIYVGPRSRSISGVTEN 141 Query: 405 GYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNK-----ASDLRSNSLPISTARGQ-- 563 GY S+ PILVNRGWVP SW +K ++ ++SLP + + Sbjct: 142 GYYIITPLMPVPDCPDSVSFPILVNRGWVPRSWKDKFLEASQDEILADSLPSPSHTDETT 201 Query: 564 ----------------------KVKFVGVISKGERPNKVWRSNNAAKSEWFTVDVPSISR 677 ++ VGV+ E+P+ +N+ S+WF VDVP+I+R Sbjct: 202 SWWRLWSKKPPVIIEDQVLSVTPIEVVGVVRGSEKPSIFVPANDPKSSQWFYVDVPAIAR 261 Query: 678 ACGLPETTLHVVEIDSSEERNMRKPYP 758 CGLPE T++V +I +E N PYP Sbjct: 262 TCGLPENTIYVEDI--NENVNPSNPYP 286 >ref|XP_006298040.1| hypothetical protein CARUB_v10014085mg [Capsella rubella] gi|482566749|gb|EOA30938.1| hypothetical protein CARUB_v10014085mg [Capsella rubella] Length = 348 Score = 118 bits (296), Expect = 4e-24 Identities = 86/277 (31%), Positives = 132/277 (47%), Gaps = 32/277 (11%) Frame = +3 Query: 24 SRHLAAPSKWDPKRSGLSNAARRAIATSPQPAQEENCRSTFIEKWWFLIPVSLFGSAAWN 203 +RH +A + S ++AA + ++S P QE+ S + FL FG +W Sbjct: 31 ARHFSAVAD----SSSSNSAALGSQSSSSAPPQEKKRGSKLSQLLLFLPGAITFGLGSWQ 86 Query: 204 FSRARGEEKVRDYRKSRLELGALNGSTDICSAENL---EFRRVECEGVYDENNSILVHKY 374 R + K +Y++ RL + + +T+ +NL 
EFRRV C+GV+DE SI + Sbjct: 87 IVRRDEKFKTLEYQQKRLNMEPMKLNTEHPPEKNLDALEFRRVSCKGVFDEQRSIYLGPR 146 Query: 375 LKRKSGERINGYNXXXXXXXXXXXXRSIQSPILVNRGWVPLSWGNK------ASDLRSNS 536 + SG NG+ S+QSPILVNRGWVP SW K A + + S Sbjct: 147 SRSISGITENGFYVITPLMPIPGDLDSMQSPILVNRGWVPRSWREKLPESTDADFITNQS 206 Query: 537 LPISTARGQK-----------------------VKFVGVISKGERPNKVWRSNNAAKSEW 647 + ++ V+ VGVI GE P+ SN+ + +W Sbjct: 207 TKAKSISDEQNSWWKYWSKSPMITEPQVPVVKPVEVVGVIRGGENPSIFVPSNDPSTGQW 266 Query: 648 FTVDVPSISRACGLPETTLHVVEIDSSEERNMRKPYP 758 F VDVP++++A GLPE T++V ++ EE + +PYP Sbjct: 267 FYVDVPAMAQAVGLPENTIYVEDV--HEEIDRSRPYP 301 >ref|XP_004235793.1| PREDICTED: surfeit locus protein 1-like [Solanum lycopersicum] Length = 334 Score = 118 bits (296), Expect = 4e-24 Identities = 81/257 (31%), Positives = 132/257 (51%), Gaps = 31/257 (12%) Frame = +3 Query: 81 AARRAIATSPQPAQEENCRSTFIEKWWFLIPVSLFGSAAWNFSRARGEEKVRDYRKSRLE 260 A+ AI+ + E+ S + + F+ V FG +W R + + ++ +YR++RL+ Sbjct: 33 ASAPAISVTETQPPEKGGPSKWSKLLLFIPGVITFGLGSWQIIRRQDKIEMLEYRQNRLQ 92 Query: 261 LGALNGSTDICSAENL---EFRRVECEGVYDENNSILVHKYLKRKSGERINGYNXXXXXX 431 + LN + S+ENL EF RV C GV+DE SI + + SG NGY Sbjct: 93 MDPLNCNEVSPSSENLDSLEFCRVLCRGVFDEKKSIFIGPRSRSISGVTENGYYVITPLM 152 Query: 432 XXXXXXRSIQSPILVNRGWVPLSWGNKASDLRS-NSLPISTA----------------RG 560 +S+Q+PILVNRGWVP +W +K+ ++ + + +STA + Sbjct: 153 PLANDPKSVQTPILVNRGWVPRNWRDKSLEVAAADDQSLSTAPPPQESGKSSWWMFSSKK 212 Query: 561 QKVK-----------FVGVISKGERPNKVWRSNNAAKSEWFTVDVPSISRACGLPETTLH 707 +KV+ +GVI E+P+ +N+ + +WF VDVP+I+RA GLPE TL+ Sbjct: 213 KKVEEDQVPTLKSTEVIGVIRGSEKPSIFVPANDPSSFQWFYVDVPAIARASGLPENTLY 272 Query: 708 VVEIDSSEERNMRKPYP 758 + I+ + + + PYP Sbjct: 273 IEAINDNVDPS--NPYP 287 >ref|XP_006341513.1| PREDICTED: surfeit locus protein 1-like [Solanum tuberosum] Length = 334 Score = 118 bits (295), Expect = 5e-24 Identities = 86/262 (32%), Positives = 125/262 (47%), Gaps = 33/262 (12%) Frame = +3 Query: 72 LSNAARRAIATSPQPAQ--EENCRSTFIEKWWFLIPVSLFGSAAWNFSRARGEEKVRDYR 245 LS+AA A A S Q E S + F+ V FG 
+W R + + ++ +YR Sbjct: 28 LSSAAASAPAISVTETQPPERGGPSKWSNLLLFVPGVITFGLGSWQIIRRQDKIEMLEYR 87 Query: 246 KSRLELGALNGSTDICSAEN---LEFRRVECEGVYDENNSILVHKYLKRKSGERINGYNX 416 ++RL + LN + S EN LEF RV C GV+DE SI + + SG NGY Sbjct: 88 QNRLRMDPLNCNEVSPSGENVDSLEFCRVLCRGVFDEKKSIFIGPRSRSISGVTENGYYV 147 Query: 417 XXXXXXXXXXXRSIQSPILVNRGWVPLSWGNK------ASDLRSNSLPISTARGQK---- 566 +S+Q+PILVNRGWVP +W +K A + S+ P S G+ Sbjct: 148 ITPLMPLANDPKSVQAPILVNRGWVPRNWRDKSLEMAAADEQPSSIAPPSQESGKSSWWM 207 Query: 567 ------------------VKFVGVISKGERPNKVWRSNNAAKSEWFTVDVPSISRACGLP 692 + +GVI E+P+ +N+ +WF VDV +I+RACGLP Sbjct: 208 FSSKKNKVEEDQVPTVKPTEVIGVIRGSEKPSIFVPANDPNSFQWFYVDVSAIARACGLP 267 Query: 693 ETTLHVVEIDSSEERNMRKPYP 758 E TL++ I+ + + + PYP Sbjct: 268 ENTLYIEAINDNVDPS--NPYP 287