BLASTX nr result
ID: Mentha23_contig00034476
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha23_contig00034476 (452 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583... 139 3e-31 ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260... 138 7e-31 ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr... 135 6e-30 ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-... 134 1e-29 emb|CBI22685.3| unnamed protein product [Vitis vinifera] 134 2e-29 ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264... 134 2e-29 ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot... 131 1e-28 gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis] 128 7e-28 ref|XP_006413289.1| hypothetical protein EUTSA_v10025027mg [Eutr... 125 6e-27 ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot... 125 6e-27 ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225... 125 8e-27 ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210... 125 8e-27 ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm... 117 1e-24 ref|XP_002865912.1| hydroxyproline-rich glycoprotein family prot... 117 2e-24 ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Caps... 115 5e-24 ref|XP_006827570.1| hypothetical protein AMTR_s00009p00224560 [A... 115 8e-24 ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr... 114 2e-23 ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein... 113 2e-23 ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun... 113 3e-23 ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu... 110 3e-22 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 139 bits (351), Expect = 3e-31 Identities = 65/100 (65%), Positives = 80/100 (80%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 MS V N+VDTVNAAA+AIV+AE+RVQPST+QKRRWGSCWS+YWCFGS KHSKRIGHAV+V Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP+ G + PV+E+ + +T+V+PFI FL SD Sbjct: 61 PEPAAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSD 100 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum lycopersicum] Length = 470 Score = 138 bits (348), Expect = 7e-31 Identities = 65/100 (65%), Positives = 79/100 (79%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 MS V N+VDTVNAAA+AIV+AE+RVQPST+QKRRWGSCWS+YWCFGS KHSKRIGHAV+V Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP G + PV+E+ + +T+V+PFI FL SD Sbjct: 61 PEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSD 100 >ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] gi|557523850|gb|ESR35217.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] Length = 500 Score = 135 bits (340), Expect = 6e-30 Identities = 64/100 (64%), Positives = 78/100 (78%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 MS VH+SV+TVNAAATAIVSAE+R++P+ IQKRRWGSCWS+YWCFGS K SKRI HAV+V Sbjct: 1 MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLV 60 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP TG +AP +E+ + + +VLPFI FLQSD Sbjct: 61 PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSD 100 >ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis] Length = 500 Score = 134 bits (337), Expect = 1e-29 Identities = 63/100 (63%), Positives = 78/100 (78%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 MS VH+SV+TVNAAATAIVSAE+R++P+ IQKRRWGSCWS+YWCFGS K SKRI HAV++ Sbjct: 1 MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLL 60 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP TG +AP +E+ + + +VLPFI FLQSD Sbjct: 61 PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSD 100 >emb|CBI22685.3| unnamed protein product [Vitis vinifera] Length = 295 Score = 134 bits (336), Expect = 2e-29 Identities = 64/100 (64%), Positives = 78/100 (78%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 M V+NSV+T+NAAATAIVSAE+RVQP+T+QKRRWGSC S+YWCFGS +HSKRIGHAV+V Sbjct: 1 MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP G AP SE+ + +++VLPFI FLQSD Sbjct: 61 PEPMVPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQSD 100 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 134 bits (336), Expect = 2e-29 Identities = 64/100 (64%), Positives = 78/100 (78%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 M V+NSV+T+NAAATAIVSAE+RVQP+T+QKRRWGSC S+YWCFGS +HSKRIGHAV+V Sbjct: 1 MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP G AP SE+ + +++VLPFI FLQSD Sbjct: 61 PEPMVPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQSD 100 >ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 131 bits (329), Expect = 1e-28 Identities = 61/100 (61%), Positives = 77/100 (77%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 M V++SV+TVNAAATAIVSA++RVQP+T+QK+RWGSCW +YWCFGS K+SKRIGHAV+V Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP G S +E+ S P+ ++LPFI FLQSD Sbjct: 61 PEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSD 100 >gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis] Length = 521 Score = 128 bits (322), Expect = 7e-28 Identities = 61/100 (61%), Positives = 74/100 (74%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 M V+NSV+T+NAAATAIVSAE R QP+ + KRRWGSCWS+YWCFGS K+SKRIGHAV+V Sbjct: 1 MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFGSHKNSKRIGHAVLV 60 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP G +AP E+ + + +VLPFI FLQSD Sbjct: 61 PEPVLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSD 100 >ref|XP_006413289.1| hypothetical protein EUTSA_v10025027mg [Eutrema salsugineum] gi|557114459|gb|ESQ54742.1| hypothetical protein EUTSA_v10025027mg [Eutrema salsugineum] Length = 489 Score = 125 bits (314), Expect = 6e-27 Identities = 62/101 (61%), Positives = 77/101 (76%), Gaps = 2/101 (1%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 M V+NSVDTVNAAA+AIVSAE+RVQPS++QK+RWGSCWS+YWCFGS K++KRIGHAV+V Sbjct: 1 MRNVNNSVDTVNAAASAIVSAESRVQPSSVQKKRWGSCWSLYWCFGSQKNNKRIGHAVLV 60 Query: 332 SEPSPTG--VSAPVSESWSRPSTLVLPFIXXXXXXXXFLQS 448 EP +G APV S + +++ LPFI FLQS Sbjct: 61 PEPVSSGSVPVAPVQNSSTNSTSIFLPFIAPPSSPASFLQS 101 >ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 125 bits (314), Expect = 6e-27 Identities = 61/104 (58%), Positives = 77/104 (74%), Gaps = 4/104 (3%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQ----KRRWGSCWSIYWCFGSCKHSKRIGH 319 M V++SV+TVNAAATAIVSA++RVQP+T+Q K+RWGSCW +YWCFGS K+SKRIGH Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60 Query: 320 AVIVSEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 AV+V EP G S +E+ S P+ ++LPFI FLQSD Sbjct: 61 AVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSD 104 >ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225370 [Cucumis sativus] Length = 497 Score = 125 bits (313), Expect = 8e-27 Identities = 63/102 (61%), Positives = 75/102 (73%), Gaps = 2/102 (1%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCF--GSCKHSKRIGHAV 325 M+ ++NSVDTVNAAATAIVSAE RVQP+T KRRWGSCWS+YWCF GS K +KRIGHAV Sbjct: 1 MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60 Query: 326 IVSEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 +V EP+ G AP E + +T+VLPFI FLQS+ Sbjct: 61 LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSE 102 >ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210841 [Cucumis sativus] Length = 497 Score = 125 bits (313), Expect = 8e-27 Identities = 63/102 (61%), Positives = 75/102 (73%), Gaps = 2/102 (1%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCF--GSCKHSKRIGHAV 325 M+ ++NSVDTVNAAATAIVSAE RVQP+T KRRWGSCWS+YWCF GS K +KRIGHAV Sbjct: 1 MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60 Query: 326 IVSEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 +V EP+ G AP E + +T+VLPFI FLQS+ Sbjct: 61 LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSE 102 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 117 bits (294), Expect = 1e-24 Identities = 56/96 (58%), Positives = 71/96 (73%) Frame = +2 Query: 164 HNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIVSEPS 343 ++SVDT+NAAATAIVSAE+RVQP+T+QKRRWG CWS+YWCFGS K +KRIGHAV+ EP Sbjct: 19 NSSVDTINAAATAIVSAESRVQPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPE 77 Query: 344 PTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 G +E+ S+ + + +PFI FLQSD Sbjct: 78 VQGAVVTSAENQSQSTAITVPFIAPPSSPASFLQSD 113 >ref|XP_002865912.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] gi|297311747|gb|EFH42171.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] Length = 437 Score = 117 bits (292), Expect = 2e-24 Identities = 57/100 (57%), Positives = 75/100 (75%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 M V+NSV+TVNAAATAIV+AE+RVQPS++QK RWG CWS+Y CFG+ K++KRIG+AV+V Sbjct: 1 MRNVNNSVETVNAAATAIVTAESRVQPSSVQKGRWGKCWSLYSCFGTQKNNKRIGNAVLV 60 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP +GV ++ + +T+VLPFI FLQSD Sbjct: 61 PEPVASGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSD 100 >ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Capsella rubella] gi|482549191|gb|EOA13385.1| hypothetical protein CARUB_v10026425mg [Capsella rubella] Length = 437 Score = 115 bits (289), Expect = 5e-24 Identities = 56/100 (56%), Positives = 74/100 (74%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 M V+NSV+TVNAAATAI++AE+RVQPS++QKRRW CWS+Y CFGS K++KRIG+AV+V Sbjct: 1 MRNVNNSVETVNAAATAIITAESRVQPSSVQKRRWAKCWSLYSCFGSQKNNKRIGNAVLV 60 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP +GV ++ + +T+VLPFI FL SD Sbjct: 61 PEPVASGVPVVTVQNSATSTTVVLPFIAPPSSPASFLPSD 100 >ref|XP_006827570.1| hypothetical protein AMTR_s00009p00224560 [Amborella trichopoda] gi|548832190|gb|ERM94986.1| hypothetical protein AMTR_s00009p00224560 [Amborella trichopoda] Length = 501 Score = 115 bits (287), Expect = 8e-24 Identities = 54/99 (54%), Positives = 68/99 (68%) Frame = +2 Query: 155 SGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIVS 334 + V+NSV+TVNAAA+AIV+A+ RVQ +T+QKRRWG CWS+YWCFGS +H KRI AV+V Sbjct: 3 NNVNNSVETVNAAASAIVTADHRVQQATVQKRRWGGCWSVYWCFGSPRHGKRISRAVLVP 62 Query: 335 EPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP P+G AP S P LPF+ FL S+ Sbjct: 63 EPIPSGDGAPPPNDPSHPPPPPLPFVAPPSSPASFLHSE 101 >ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|222858882|gb|EEE96429.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 507 Score = 114 bits (284), Expect = 2e-23 Identities = 60/106 (56%), Positives = 77/106 (72%), Gaps = 6/106 (5%) Frame = +2 Query: 152 MSGVHNS-VDTVNAAATAIVSAETRVQPST--IQKRRWGSCWSIYWCF---GSCKHSKRI 313 M V+NS ++TVNAAATAIVSAE+RVQPS+ +QKRRWG CWS+YWCF GS K+SKRI Sbjct: 1 MRSVNNSSIETVNAAATAIVSAESRVQPSSSSVQKRRWGGCWSLYWCFGSHGSHKNSKRI 60 Query: 314 GHAVIVSEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 GHAV+V EP G + +E+ ++ + ++LPFI FLQSD Sbjct: 61 GHAVLVPEPEVPGAVSSSTENQTQSTPILLPFIAPPSSPASFLQSD 106 >ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|10177409|dbj|BAB10540.1| unnamed protein product [Arabidopsis thaliana] gi|40823427|gb|AAR92282.1| At5g52430 [Arabidopsis thaliana] gi|56381929|gb|AAV85683.1| At5g52430 [Arabidopsis thaliana] gi|110738650|dbj|BAF01250.1| hypothetical protein [Arabidopsis thaliana] gi|332008830|gb|AED96213.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 438 Score = 113 bits (283), Expect = 2e-23 Identities = 56/97 (57%), Positives = 73/97 (75%) Frame = +2 Query: 161 VHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIVSEP 340 V+NSV+TVNAAATAIV+AE+RVQPS+ QK RWG CWS+Y CFG+ K++KRIG+AV+V EP Sbjct: 5 VNNSVETVNAAATAIVTAESRVQPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEP 64 Query: 341 SPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 +GV ++ + +T+VLPFI FLQSD Sbjct: 65 VTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSD 101 >ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] gi|462415503|gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 113 bits (282), Expect = 3e-23 Identities = 55/100 (55%), Positives = 70/100 (70%) Frame = +2 Query: 152 MSGVHNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIV 331 M V++SVDT+NAAATAIVSAE R QP+T+ KRRWGSCWS+YWCFG K +KRIGHAV+V Sbjct: 1 MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHK-NKRIGHAVLV 59 Query: 332 SEPSPTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 EP G + ++ + + +V+PFI FL SD Sbjct: 60 PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSD 99 >ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346901|gb|EEE82832.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 453 Score = 110 bits (274), Expect = 3e-22 Identities = 52/96 (54%), Positives = 66/96 (68%) Frame = +2 Query: 164 HNSVDTVNAAATAIVSAETRVQPSTIQKRRWGSCWSIYWCFGSCKHSKRIGHAVIVSEPS 343 +N+++T+NAAATAI SAE RV +T+QKRRWGSCWSIY CFG KH K+IGHAV+ EPS Sbjct: 12 NNTLETINAAATAIASAENRVPQATVQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPS 71 Query: 344 PTGVSAPVSESWSRPSTLVLPFIXXXXXXXXFLQSD 451 G AP SE+ ++ + LPF F QS+ Sbjct: 72 APGNGAPASENPTQAPAVTLPFAAPPSSPASFFQSE 107