BLASTX nr result

ID: Mentha22_contig00008372 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00008372
         (309 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ACN41026.1| unknown [Picea sitchensis]                             106   4e-21
gb|ADE76557.1| unknown [Picea sitchensis]                             102   6e-20
ref|XP_006288474.1| hypothetical protein CARUB_v10001733mg [Caps...   100   3e-19
ref|XP_002320878.1| NC domain-containing family protein [Populus...   100   3e-19
ref|XP_006396322.1| hypothetical protein EUTSA_v10028905mg [Eutr...    99   5e-19
ref|XP_001761156.1| predicted protein [Physcomitrella patens] gi...    99   5e-19
ref|XP_006841547.1| hypothetical protein AMTR_s00003p00168330 [A...    99   8e-19
ref|XP_002875007.1| hypothetical protein ARALYDRAFT_490474 [Arab...    99   8e-19
ref|XP_004230000.1| PREDICTED: uncharacterized protein LOC101257...    98   1e-18
ref|NP_680550.1| NC domain-containing protein-like protein [Arab...    98   1e-18
ref|XP_006491314.1| PREDICTED: uncharacterized protein LOC102607...    97   2e-18
ref|XP_006444800.1| hypothetical protein CICLE_v10021778mg [Citr...    97   2e-18
ref|XP_004252614.1| PREDICTED: uncharacterized protein LOC101248...    97   2e-18
ref|XP_006360725.1| PREDICTED: uncharacterized protein LOC102583...    97   2e-18
ref|XP_006339735.1| PREDICTED: uncharacterized protein LOC102589...    97   2e-18
ref|XP_002892081.1| hypothetical protein ARALYDRAFT_470154 [Arab...    97   2e-18
ref|XP_002523304.1| conserved hypothetical protein [Ricinus comm...    97   2e-18
ref|XP_002302649.1| NC domain-containing family protein [Populus...    96   4e-18
ref|XP_002266683.1| PREDICTED: uncharacterized protein LOC100253...    96   5e-18
ref|XP_006400172.1| hypothetical protein EUTSA_v10014595mg [Eutr...    96   7e-18

>gb|ACN41026.1| unknown [Picea sitchensis]
          Length = 279

 Score =  106 bits (264), Expect = 4e-21
 Identities = 53/100 (53%), Positives = 64/100 (64%)
 Frame = -3

Query: 304 ASKAKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGG 125
           +S     C  C    F    SGV +SCL+CFL GG LYR+ YGVS      ++ A+ RGG
Sbjct: 62  SSSIPTQCLRCPDCGFQRENSGVMLSCLDCFLAGGPLYRFEYGVSP----AIFLAKARGG 117

Query: 124 TCNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           TC  AES P E+V+ RA +LLQ NGFG YH+F NNCEDFA
Sbjct: 118 TCTLAESDPPELVIHRAMYLLQ-NGFGNYHIFQNNCEDFA 156


>gb|ADE76557.1| unknown [Picea sitchensis]
          Length = 277

 Score =  102 bits (254), Expect = 6e-20
 Identities = 54/93 (58%), Positives = 61/93 (65%)
 Frame = -3

Query: 283 CNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGTCNTAES 104
           C  CG   F    SGVT+SCL+CFL GG LYR+ YGVS      V+ A+ RGGTC  AES
Sbjct: 72  CLDCG---FERENSGVTLSCLDCFLAGGNLYRFEYGVS----AAVFLAKARGGTCTLAES 124

Query: 103 GPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
            P E V  RA +LLQ NGFG Y +F NNCEDFA
Sbjct: 125 DPLETVNHRAMYLLQ-NGFGNYDIFENNCEDFA 156


>ref|XP_006288474.1| hypothetical protein CARUB_v10001733mg [Capsella rubella]
           gi|482557180|gb|EOA21372.1| hypothetical protein
           CARUB_v10001733mg [Capsella rubella]
          Length = 265

 Score =  100 bits (248), Expect = 3e-19
 Identities = 50/97 (51%), Positives = 63/97 (64%)
 Frame = -3

Query: 295 AKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGTCN 116
           ++A C       + +  SGV +SCL+CFL  G LYR+ YGVS      ++ +RLRGGTC 
Sbjct: 74  SEAPCPTFPDCGYKQPKSGVVLSCLDCFLKKGSLYRFEYGVS----SSIFLSRLRGGTCT 129

Query: 115 TAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           TA S P + V+ RA HLLQ NGFG Y +F NNCEDFA
Sbjct: 130 TAPSDPLQTVIHRAMHLLQ-NGFGNYDIFQNNCEDFA 165


>ref|XP_002320878.1| NC domain-containing family protein [Populus trichocarpa]
           gi|222861651|gb|EEE99193.1| NC domain-containing family
           protein [Populus trichocarpa]
          Length = 261

 Score =  100 bits (248), Expect = 3e-19
 Identities = 52/99 (52%), Positives = 62/99 (62%)
 Frame = -3

Query: 301 SKAKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGT 122
           S   +SC       F +  SGV +SCL+CFL  G LY + YGV       V+ A++RGGT
Sbjct: 64  SSIPSSCETFPDCGFRQPDSGVVLSCLDCFLKKGSLYSFEYGVPP----TVFIAKVRGGT 119

Query: 121 CNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           C TA S P E V+ RA +LLQ NGFG Y VFHNNCEDFA
Sbjct: 120 CTTAASDPPETVIHRAMYLLQ-NGFGNYDVFHNNCEDFA 157


>ref|XP_006396322.1| hypothetical protein EUTSA_v10028905mg [Eutrema salsugineum]
           gi|557097339|gb|ESQ37775.1| hypothetical protein
           EUTSA_v10028905mg [Eutrema salsugineum]
          Length = 253

 Score = 99.4 bits (246), Expect = 5e-19
 Identities = 51/97 (52%), Positives = 62/97 (63%)
 Frame = -3

Query: 295 AKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGTCN 116
           A+A C       F +  SGV +SCL+CFL  G LYR+ YGVS      ++ +R RGGTC 
Sbjct: 72  AEAPCPTYPDCGFKQPKSGVVLSCLDCFLKNGSLYRFVYGVS----SSLFLSRFRGGTCT 127

Query: 115 TAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           TA S P + V+ RA HLLQ NGFG Y +F NNCEDFA
Sbjct: 128 TAPSDPLQTVVHRAMHLLQ-NGFGNYDIFQNNCEDFA 163


>ref|XP_001761156.1| predicted protein [Physcomitrella patens]
           gi|162687496|gb|EDQ73878.1| predicted protein
           [Physcomitrella patens]
          Length = 272

 Score = 99.4 bits (246), Expect = 5e-19
 Identities = 54/102 (52%), Positives = 66/102 (64%), Gaps = 1/102 (0%)
 Frame = -3

Query: 307 KASKAKASCNVCGLMDFAERTS-GVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLR 131
           +   A A C+ CG+    E TS GV +SCL+CFL G  LYR+ Y V      + + A+ R
Sbjct: 65  RPESATAKCDKCGM----EGTSNGVVLSCLDCFLVGCPLYRFEYNVDP----VTFFAKAR 116

Query: 130 GGTCNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           GGTC  A+S  AEVVL RA +LL  NGFG YH+FHNNCEDFA
Sbjct: 117 GGTCTLAKSDTAEVVLHRANYLLN-NGFGLYHIFHNNCEDFA 157


>ref|XP_006841547.1| hypothetical protein AMTR_s00003p00168330 [Amborella trichopoda]
           gi|548843568|gb|ERN03222.1| hypothetical protein
           AMTR_s00003p00168330 [Amborella trichopoda]
          Length = 267

 Score = 98.6 bits (244), Expect = 8e-19
 Identities = 52/98 (53%), Positives = 60/98 (61%)
 Frame = -3

Query: 298 KAKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGTC 119
           ++   C  CG      R  GV  SCL+CFL GG LYR+ YGV+  +    + A+ RGGTC
Sbjct: 68  RSDTPCEKCG---DNTRLDGVISSCLDCFLSGGDLYRFEYGVTSIF----FLAKARGGTC 120

Query: 118 NTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
             A S P E VL RA HLLQ NGFG YH F NNCEDFA
Sbjct: 121 TLACSDPPEKVLHRATHLLQ-NGFGFYHAFRNNCEDFA 157


>ref|XP_002875007.1| hypothetical protein ARALYDRAFT_490474 [Arabidopsis lyrata subsp.
           lyrata] gi|297320844|gb|EFH51266.1| hypothetical protein
           ARALYDRAFT_490474 [Arabidopsis lyrata subsp. lyrata]
          Length = 263

 Score = 98.6 bits (244), Expect = 8e-19
 Identities = 50/97 (51%), Positives = 61/97 (62%)
 Frame = -3

Query: 295 AKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGTCN 116
           ++A C       + +  SGV +SCL+CFL  G LYR+ YGVS      ++  R RGGTC 
Sbjct: 72  SEAPCPTFPDCGYKQPKSGVVLSCLDCFLKKGSLYRFEYGVS----SSIFLTRFRGGTCT 127

Query: 115 TAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           TA S P + V+ RA HLLQ NGFG Y VF NNCEDFA
Sbjct: 128 TAPSDPLQTVIHRAMHLLQ-NGFGNYDVFQNNCEDFA 163


>ref|XP_004230000.1| PREDICTED: uncharacterized protein LOC101257602 [Solanum
           lycopersicum]
          Length = 254

 Score = 97.8 bits (242), Expect = 1e-18
 Identities = 52/99 (52%), Positives = 62/99 (62%)
 Frame = -3

Query: 301 SKAKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGT 122
           S   +SC +     F    SGV +SCL CFL  G LY + YGVS      V+ +++RGGT
Sbjct: 57  SGISSSCPIFPDCGFRLPNSGVVLSCLNCFLRNGSLYSFEYGVSPS----VFLSKVRGGT 112

Query: 121 CNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           C TA S P E+V+ RA HLLQ NGFG Y VF NNCEDFA
Sbjct: 113 CTTAVSDPPEMVIHRAMHLLQ-NGFGNYDVFQNNCEDFA 150


>ref|NP_680550.1| NC domain-containing protein-like protein [Arabidopsis thaliana]
           gi|332656554|gb|AEE81954.1| NC domain-containing
           protein-like protein [Arabidopsis thaliana]
          Length = 263

 Score = 97.8 bits (242), Expect = 1e-18
 Identities = 50/97 (51%), Positives = 61/97 (62%)
 Frame = -3

Query: 295 AKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGTCN 116
           ++A C       +    SGV +SCL+CFL  G LYR+ YGVS      ++  R RGGTC 
Sbjct: 72  SEAPCPTYPDCGYKRPKSGVVLSCLDCFLKKGSLYRFDYGVS----SSIFLTRFRGGTCT 127

Query: 115 TAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           TA S P + V+ RA HLLQ NGFG Y+VF NNCEDFA
Sbjct: 128 TAPSDPLQTVIHRAMHLLQ-NGFGNYNVFQNNCEDFA 163


>ref|XP_006491314.1| PREDICTED: uncharacterized protein LOC102607118 [Citrus sinensis]
          Length = 259

 Score = 97.4 bits (241), Expect = 2e-18
 Identities = 50/95 (52%), Positives = 63/95 (66%)
 Frame = -3

Query: 289 ASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGTCNTA 110
           +SC +     F +  SGV +SCL+CFL  G LY + YGV+      V+ A++RGGTC TA
Sbjct: 66  SSCLIFPDCGFRQPNSGVILSCLDCFLGNGSLYCFEYGVAP----SVFLAKVRGGTCTTA 121

Query: 109 ESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
            S P E V+ RA +LLQ NGFG Y+VF NNCEDFA
Sbjct: 122 TSDPPETVIHRAMYLLQ-NGFGNYNVFQNNCEDFA 155


>ref|XP_006444800.1| hypothetical protein CICLE_v10021778mg [Citrus clementina]
           gi|567904626|ref|XP_006444801.1| hypothetical protein
           CICLE_v10021778mg [Citrus clementina]
           gi|557547062|gb|ESR58040.1| hypothetical protein
           CICLE_v10021778mg [Citrus clementina]
           gi|557547063|gb|ESR58041.1| hypothetical protein
           CICLE_v10021778mg [Citrus clementina]
          Length = 259

 Score = 97.4 bits (241), Expect = 2e-18
 Identities = 50/95 (52%), Positives = 63/95 (66%)
 Frame = -3

Query: 289 ASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGTCNTA 110
           +SC +     F +  SGV +SCL+CFL  G LY + YGV+      V+ A++RGGTC TA
Sbjct: 66  SSCLIFPDCGFRQPNSGVILSCLDCFLGNGSLYCFEYGVAP----SVFLAKVRGGTCTTA 121

Query: 109 ESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
            S P E V+ RA +LLQ NGFG Y+VF NNCEDFA
Sbjct: 122 TSDPPETVIHRAMYLLQ-NGFGNYNVFQNNCEDFA 155


>ref|XP_004252614.1| PREDICTED: uncharacterized protein LOC101248754 [Solanum
           lycopersicum]
          Length = 385

 Score = 97.4 bits (241), Expect = 2e-18
 Identities = 53/103 (51%), Positives = 66/103 (64%), Gaps = 3/103 (2%)
 Frame = -3

Query: 304 ASKAKASCNVCGLMD--FAERTSGVTMSCLECFL-DGGILYRYRYGVSKFYYGLVYSARL 134
           AS    S     + D  F ++ SGV +SCL+CFL + G+LYR+ YG S      V+  +L
Sbjct: 66  ASSTNVSSTCTNIPDCGFQQKASGVVLSCLDCFLGEEGLLYRFDYGTSPS----VFLTKL 121

Query: 133 RGGTCNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           RGGTC TA+S P E V+ RA +LLQ NGFG Y VF NNCEDFA
Sbjct: 122 RGGTCTTAQSDPPEAVIHRAMYLLQ-NGFGNYDVFKNNCEDFA 163


>ref|XP_006360725.1| PREDICTED: uncharacterized protein LOC102583473 [Solanum tuberosum]
          Length = 280

 Score = 97.1 bits (240), Expect = 2e-18
 Identities = 53/103 (51%), Positives = 66/103 (64%), Gaps = 3/103 (2%)
 Frame = -3

Query: 304 ASKAKASCNVCGLMD--FAERTSGVTMSCLECFL-DGGILYRYRYGVSKFYYGLVYSARL 134
           AS    S     + D  F ++ SGV +SCL+CFL + G+LYR+ YG S      V+  +L
Sbjct: 66  ASSTNVSSTCTNIPDCGFQQKESGVVLSCLDCFLGEEGLLYRFDYGASPS----VFLTKL 121

Query: 133 RGGTCNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           RGGTC TA+S P E V+ RA +LLQ NGFG Y VF NNCEDFA
Sbjct: 122 RGGTCTTAQSDPPEAVIHRAMYLLQ-NGFGSYDVFKNNCEDFA 163


>ref|XP_006339735.1| PREDICTED: uncharacterized protein LOC102589709 [Solanum tuberosum]
          Length = 254

 Score = 97.1 bits (240), Expect = 2e-18
 Identities = 52/99 (52%), Positives = 62/99 (62%)
 Frame = -3

Query: 301 SKAKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGT 122
           S   +SC +     F    SGV +SCL CFL  G LY + YGVS      V+ +++RGGT
Sbjct: 57  SGLSSSCPIFPDCGFRLPNSGVVLSCLNCFLRTGSLYSFEYGVSPS----VFLSKVRGGT 112

Query: 121 CNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           C TA S P E+V+ RA HLLQ NGFG Y VF NNCEDFA
Sbjct: 113 CTTAVSDPPEMVIHRAMHLLQ-NGFGNYDVFQNNCEDFA 150


>ref|XP_002892081.1| hypothetical protein ARALYDRAFT_470154 [Arabidopsis lyrata subsp.
           lyrata] gi|297337923|gb|EFH68340.1| hypothetical protein
           ARALYDRAFT_470154 [Arabidopsis lyrata subsp. lyrata]
          Length = 255

 Score = 97.1 bits (240), Expect = 2e-18
 Identities = 49/104 (47%), Positives = 68/104 (65%), Gaps = 3/104 (2%)
 Frame = -3

Query: 307 KASKAKASCNVCGLMD---FAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSAR 137
           ++S + +S ++C +     F +  SGV +SCL+CFL  G LY + YGVS      V+  +
Sbjct: 51  ESSSSSSSDDICSIFPDCGFRQPDSGVVLSCLDCFLKNGSLYCFEYGVSP----SVFLTK 106

Query: 136 LRGGTCNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           +RGGTC TA+S P + V+ RA +LLQ NGFG Y +F NNCEDFA
Sbjct: 107 VRGGTCTTAQSDPTDSVIHRAMYLLQ-NGFGNYDIFKNNCEDFA 149


>ref|XP_002523304.1| conserved hypothetical protein [Ricinus communis]
           gi|223537392|gb|EEF39020.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 258

 Score = 97.1 bits (240), Expect = 2e-18
 Identities = 50/99 (50%), Positives = 61/99 (61%)
 Frame = -3

Query: 301 SKAKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGT 122
           S   +SC       F +  SGV +SCL+CFL  G LY + YGV       V+ A++RGGT
Sbjct: 61  SSIASSCETFPDCGFRQPNSGVVLSCLDCFLRNGSLYSFEYGVPP----SVFLAKVRGGT 116

Query: 121 CNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           C TA S P E V+ RA +LLQ NGFG Y +F NNCEDFA
Sbjct: 117 CTTAASDPPEAVIHRAMYLLQ-NGFGNYDIFQNNCEDFA 154


>ref|XP_002302649.1| NC domain-containing family protein [Populus trichocarpa]
           gi|222844375|gb|EEE81922.1| NC domain-containing family
           protein [Populus trichocarpa]
          Length = 261

 Score = 96.3 bits (238), Expect = 4e-18
 Identities = 51/99 (51%), Positives = 61/99 (61%)
 Frame = -3

Query: 301 SKAKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGT 122
           S   +SC       F +  SGV +SCL+CFL  G LY + YGV       V+ A++RGGT
Sbjct: 64  SSIPSSCETFPDCGFRQLDSGVVLSCLDCFLKKGSLYCFEYGVPP----TVFLAKVRGGT 119

Query: 121 CNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           C TA S P E V+ RA +LLQ NGFG Y VF NNCEDFA
Sbjct: 120 CTTAASDPPETVIHRAMYLLQ-NGFGNYDVFQNNCEDFA 157


>ref|XP_002266683.1| PREDICTED: uncharacterized protein LOC100253490 [Vitis vinifera]
           gi|147767788|emb|CAN66977.1| hypothetical protein
           VITISV_022080 [Vitis vinifera]
          Length = 262

 Score = 95.9 bits (237), Expect = 5e-18
 Identities = 49/99 (49%), Positives = 63/99 (63%)
 Frame = -3

Query: 301 SKAKASCNVCGLMDFAERTSGVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGT 122
           S   ++C+      F +  SGV +SCL+CFL  G LY + YGV+      V+ A++RGGT
Sbjct: 64  SSIPSTCSTFPDCGFRQPNSGVVLSCLDCFLGKGSLYSFEYGVTP----SVFLAKVRGGT 119

Query: 121 CNTAESGPAEVVLERAEHLLQTNGFGEYHVFHNNCEDFA 5
           C TA S P + V+ RA +LLQ NGFG Y VF NNCEDFA
Sbjct: 120 CTTATSDPPDAVIHRAMYLLQ-NGFGNYDVFQNNCEDFA 157


>ref|XP_006400172.1| hypothetical protein EUTSA_v10014595mg [Eutrema salsugineum]
           gi|557101262|gb|ESQ41625.1| hypothetical protein
           EUTSA_v10014595mg [Eutrema salsugineum]
          Length = 226

 Score = 95.5 bits (236), Expect = 7e-18
 Identities = 44/79 (55%), Positives = 56/79 (70%)
 Frame = -3

Query: 241 GVTMSCLECFLDGGILYRYRYGVSKFYYGLVYSARLRGGTCNTAESGPAEVVLERAEHLL 62
           GV  SCL+CF+ GG L+ + YGVS      ++ ++LRGGTC TA S P + V+ RA+ LL
Sbjct: 85  GVISSCLDCFIAGGDLFLFDYGVSP----AIFMSKLRGGTCTTATSDPPDQVISRAKSLL 140

Query: 61  QTNGFGEYHVFHNNCEDFA 5
             NGFG+YHVF NNCEDFA
Sbjct: 141 SRNGFGDYHVFENNCEDFA 159


Top