BLASTX nr result

ID: Mentha28_contig00006083 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00006083
         (739 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus...   179   7e-43
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   138   2e-30
ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579...   134   3e-29
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   132   1e-28
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   127   3e-27
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...   127   4e-27
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   124   4e-26
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   124   4e-26
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     120   4e-25
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   119   8e-25
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   119   1e-24
ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295...   119   1e-24
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   119   1e-24
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   119   1e-24
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   118   2e-24
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   116   7e-24
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   116   7e-24
ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295...   115   1e-23
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...   115   2e-23
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   115   2e-23

>gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus guttatus]
          Length = 208

 Score =  179 bits (455), Expect = 7e-43
 Identities = 102/201 (50%), Positives = 138/201 (68%), Gaps = 10/201 (4%)
 Frame = -2

Query: 576 YQPEV-QGYPLAPASVVPRSDEEFG--NNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVIL 406
           Y  EV Q YPLAP S VPRSDEE+   NN  + E+MKK KR+KC  Y          +IL
Sbjct: 5   YNQEVHQAYPLAP-STVPRSDEEYSGTNNYRAQEEMKKNKRMKCFAYIACFAVFQIIIIL 63

Query: 405 IFSLIIMRVRTPKVRMDNVTVTSG-ANGDVRFGARVLVKNTNFGRYKFESTLATIRTADN 229
           I +L +MRV++PK+R+ ++TVT    +G+VR  ARVLVKNTNFGRYKF+S LATIR+  +
Sbjct: 64  ILALTVMRVKSPKLRLGDITVTKDHVSGNVRLTARVLVKNTNFGRYKFDSGLATIRSGAS 123

Query: 228 NVVQFPIQEARARARSTKKIAVVASLGASAT------GTLELTVEAKLRGKVEFMRVIKR 67
           NV QF I E+RARARSTKK+ V   L +S +      G   L VE++LRGKVE ++V+K+
Sbjct: 124 NVGQFVIPESRARARSTKKMYVTVDLNSSNSSNNSMGGVWTLNVESQLRGKVELLKVVKK 183

Query: 66  KKTADMNCTLTLVLATNSVQN 4
            K+A MNC + + L ++++Q+
Sbjct: 184 TKSAYMNCVVVINLRSSTIQD 204


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  138 bits (347), Expect = 2e-30
 Identities = 87/212 (41%), Positives = 126/212 (59%), Gaps = 21/212 (9%)
 Frame = -2

Query: 573 QPEVQGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSL 394
           + + Q +PLAPA+  PRSDEE  + +   +++K+KKRIK   Y          VILIF+L
Sbjct: 3   EKDQQVHPLAPANGHPRSDEESASLQ--SKELKRKKRIKYAVYIAAFAVFQTVVILIFAL 60

Query: 393 IIMRVRTPKVRMDNVTVTS--------GANGDVRFGARVLVKNTNFGRYKFESTLATIRT 238
            +MRV+ PKVR+  VTV +         A+ ++RF  +V VKNTNFG YKF++   +   
Sbjct: 61  TVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLY 120

Query: 237 ADNNVVQFPIQEARARARSTKKIAVVASLGASA-------------TGTLELTVEAKLRG 97
               V +  I +ARARARSTKK+ V   + +SA             +  L L  +AKL+G
Sbjct: 121 DGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKG 180

Query: 96  KVEFMRVIKRKKTADMNCTLTLVLATNSVQNL 1
           KVE M+V+K+KK+ +MNCTL   ++T S+Q+L
Sbjct: 181 KVELMKVMKKKKSPEMNCTLIFNVSTRSLQDL 212


>ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579067 [Solanum tuberosum]
          Length = 197

 Score =  134 bits (338), Expect = 3e-29
 Identities = 69/193 (35%), Positives = 117/193 (60%), Gaps = 6/193 (3%)
 Frame = -2

Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382
           Q YPLAP++++PRSD EF  N       ++KK+++              +IL+F    +R
Sbjct: 5   QKYPLAPSNIMPRSDAEFATNNFQSNNQRRKKKLRST---FLLTIFLTGIILLFCFTFLR 61

Query: 381 VRTPKVRMDNVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV-QFPIQ 205
           +++PK+R++N+ +T+  +G + F A+V ++N NF RY ++STL TI TA+   + +F I 
Sbjct: 62  IKSPKIRIENIRITNDGDGRINFSAQVFLRNRNFWRYGYDSTLGTINTAEGTTIGRFVIP 121

Query: 204 EARARARSTKKIAV-----VASLGASATGTLELTVEAKLRGKVEFMRVIKRKKTADMNCT 40
           +   R RSTK I V     + S   + +G L +  EAK+RGKV+  RV + KK  D++CT
Sbjct: 122 DGEVRRRSTKTIYVMENFILPSRLNNTSGILPVISEAKIRGKVKVFRVFRWKKNVDLSCT 181

Query: 39  LTLVLATNSVQNL 1
           +++ L  +++Q+L
Sbjct: 182 MSINLTISAIQDL 194


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  132 bits (333), Expect = 1e-28
 Identities = 79/204 (38%), Positives = 113/204 (55%), Gaps = 19/204 (9%)
 Frame = -2

Query: 555 YPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVR 376
           YPL PA+      +E     HS E +KKKKR+KCL Y          +IL+F+L +MR+R
Sbjct: 9   YPLVPAANGHERSDEESVAAHSKE-LKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRIR 67

Query: 375 TPKVRMD-------NVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQ 217
            PK R+        NV   +  + D++   +  VKNTNFG +K+E  L T       V +
Sbjct: 68  NPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVGR 127

Query: 216 FPIQEARARARSTKKIAVVASLGAS------------ATGTLELTVEAKLRGKVEFMRVI 73
             IQ+ARARARSTKK+ VV  L ++            + G L LT  +KL GK+  M+VI
Sbjct: 128 ATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKVI 187

Query: 72  KRKKTADMNCTLTLVLATNSVQNL 1
           K+KK+  MNCT+ + + T +V+N+
Sbjct: 188 KKKKSTQMNCTMDVAIDTRTVRNI 211


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  127 bits (320), Expect = 3e-27
 Identities = 79/205 (38%), Positives = 118/205 (57%), Gaps = 18/205 (8%)
 Frame = -2

Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382
           Q YPLAPA+   RSD   G +  S++++K++KR K   Y          V+ +F L +M+
Sbjct: 7   QAYPLAPANGYTRSD---GESLVSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMK 63

Query: 381 VRTPKVRMDNVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 220
           V+TPKVR+  + V S        + D  F  ++ VKNTN+G YKF+++ AT       V 
Sbjct: 64  VKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVG 123

Query: 219 QFPIQEARARARSTKKIAVVASLGASA------------TGTLELTVEAKLRGKVEFMRV 76
           Q  I +++AR RSTKKI+V   L  +A            +G L LT +AKL GKVE M +
Sbjct: 124 QVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLI 183

Query: 75  IKRKKTADMNCTLTLVLATNSVQNL 1
           +K+KK+A M+CT+   L+T +V++L
Sbjct: 184 MKKKKSATMDCTIAFDLSTKTVKSL 208


>ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca
           subsp. vesca]
          Length = 200

 Score =  127 bits (319), Expect = 4e-27
 Identities = 74/195 (37%), Positives = 111/195 (56%), Gaps = 8/195 (4%)
 Frame = -2

Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382
           Q YPLAP++   RSD E      S++++K+KKRIKC  Y          V  +F L +++
Sbjct: 7   QVYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAVGAVFGLTVLK 62

Query: 381 VRTPKVRMDNVTVTSGANGDVR-----FGARVLVKNTNFGRYKFESTLATIRTADNNVVQ 217
           V+TPKVR+D  +  SG           F  ++ VKNTN+G YKF+  + T +     V  
Sbjct: 63  VKTPKVRLDTTSTLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVVTFKYQGTPVGT 122

Query: 216 FPIQEARARARSTKKIAVVASLGASA---TGTLELTVEAKLRGKVEFMRVIKRKKTADMN 46
           F + + +A  R TKKI    SL  +A   +G L LT EAKL GKV  M ++K+KK+A MN
Sbjct: 123 FTVPKGKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMFIMKKKKSASMN 182

Query: 45  CTLTLVLATNSVQNL 1
           CT+ + ++  +V+++
Sbjct: 183 CTIQIDVSGQTVKSV 197


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score =  124 bits (310), Expect = 4e-26
 Identities = 78/211 (36%), Positives = 117/211 (55%), Gaps = 26/211 (12%)
 Frame = -2

Query: 555 YPLAP-ASVVPRSDEEFGNNRH-SDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382
           YPL P A    RSD+E   +   S E+++ KKR++CL Y          VI +F+L +M+
Sbjct: 13  YPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVVVITVFALTVMK 72

Query: 381 VRTPKVRMDNVTVTSGANG-----------DVRFGARVLVKNTNFGRYKFESTLATIRTA 235
           +++PK R+   ++T    G           DV FG    VKNTNFG +++E  +      
Sbjct: 73  IKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFG----VKNTNFGHFEYEDGIVVFTYR 128

Query: 234 DNNVVQFPIQEARARARSTKKIAV----VASLGASA---------TGTLELTVEAKLRGK 94
           D  + Q  ++E R RARST+K+ V    + S G  A         TG + +T+ +KL GK
Sbjct: 129 DVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITISSKLDGK 188

Query: 93  VEFMRVIKRKKTADMNCTLTLVLATNSVQNL 1
           +  M++IK+KK+A MNCT+ +VLAT SVQN+
Sbjct: 189 IHLMKIIKKKKSAQMNCTMEVVLATKSVQNV 219


>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  124 bits (310), Expect = 4e-26
 Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 19/206 (9%)
 Frame = -2

Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382
           Q YPLAP++   RSD E      S++++K+KKRIKC  Y          V+ +F L IM+
Sbjct: 7   QSYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMK 62

Query: 381 VRTPKVRMDNVTVTSGANGDVR------FGARVLVKNTNFGRYKFESTLATIRTADNNVV 220
           V+TPKVR+   T+T   + D        F  ++ VKNTN+G YKF+  + T       V 
Sbjct: 63  VKTPKVRLGTSTLTDFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVG 122

Query: 219 QFPIQEARARARSTKKIAVVASLGASAT-------------GTLELTVEAKLRGKVEFMR 79
              + + +A  R TKKI V   L  +A              G L LT EAKL GKVE M 
Sbjct: 123 TVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELML 182

Query: 78  VIKRKKTADMNCTLTLVLATNSVQNL 1
           ++K+KK+A MNCT+ + ++  +V++L
Sbjct: 183 IMKKKKSASMNCTIQIDVSGKTVKSL 208


>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score =  120 bits (302), Expect = 4e-25
 Identities = 76/204 (37%), Positives = 114/204 (55%), Gaps = 20/204 (9%)
 Frame = -2

Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382
           Q YPLAPA+  PRSDEE  N     +++K++KRIK   Y          V L+F L++MR
Sbjct: 7   QVYPLAPANGHPRSDEESSNL--DAKELKRRKRIKLAIYAFIFTASQIIVTLVFVLVVMR 64

Query: 381 VRTPKVRMDN------VTVTSGANG--DVRFGARVLVKNTNFGRYKFESTLATIRTADNN 226
           V++PK+R+ +      +   SG+    D+ F  ++ VKNTN+G YKF++T A        
Sbjct: 65  VKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTTAAFAYEGET 124

Query: 225 VVQFPIQEARARARSTKKIAVVASLGAS------------ATGTLELTVEAKLRGKVEFM 82
           V Q  I + +A  RSTKK+ V  SL +S            + G L L   AK+ GKV+ M
Sbjct: 125 VGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTAKMTGKVKLM 184

Query: 81  RVIKRKKTADMNCTLTLVLATNSV 10
            ++K+KK+A+MNCT+ + +   +V
Sbjct: 185 LIMKKKKSANMNCTINIHVKEKTV 208


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  119 bits (299), Expect = 8e-25
 Identities = 66/179 (36%), Positives = 102/179 (56%), Gaps = 20/179 (11%)
 Frame = -2

Query: 477 KKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTV--------TSGANGD 322
           ++K+ IKCL Y          +IL+F +++MR+R PKVR+  VTV        +S  +  
Sbjct: 10  RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69

Query: 321 VRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVVASLGAS 142
           +   A+V VKNTNFG +KF+++  TI      V +  I +ARARARST K+ V  S+ + 
Sbjct: 70  MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSD 129

Query: 141 ------------ATGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNL 1
                        +GT+ L+  AKL GK+   +V K+KK+A+MNCT+ +  ++  +QNL
Sbjct: 130 KMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNL 188


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  119 bits (298), Expect = 1e-24
 Identities = 73/205 (35%), Positives = 115/205 (56%), Gaps = 18/205 (8%)
 Frame = -2

Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382
           Q YPLAPA+   RSD   G +  S +++K++KRI+  TY          V+ +F L +M+
Sbjct: 7   QAYPLAPANGYTRSD---GESLVSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMK 63

Query: 381 VRTPKVRMDNV------TVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 220
           V+TPKVR+  +      +V +  + D  F  ++ VKNTN+G YKF+++  T       V 
Sbjct: 64  VKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVG 123

Query: 219 QFPIQEARARARSTKKIAVVASLGASA------------TGTLELTVEAKLRGKVEFMRV 76
           Q  + + +A  RSTKK+ V  SL A+             +G L L  +AKL GKVE M +
Sbjct: 124 QVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLI 183

Query: 75  IKRKKTADMNCTLTLVLATNSVQNL 1
           +K+KK++ M+C +   L+T +V++L
Sbjct: 184 MKKKKSSTMDCMIGFDLSTKTVKSL 208


>ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  119 bits (298), Expect = 1e-24
 Identities = 72/205 (35%), Positives = 114/205 (55%), Gaps = 18/205 (8%)
 Frame = -2

Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382
           Q YP AP++   RSD   G +  S++++K+KKRIK  TY          V+ +F L +M+
Sbjct: 7   QAYPTAPSNGYARSD---GESLVSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMK 63

Query: 381 VRTPKVRMDNVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 220
           V+TPK R  ++ V +        + D  F  ++ +KNTN+G YKF++  AT       + 
Sbjct: 64  VKTPKARWGSIDVETLNYVPATPSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIG 123

Query: 219 QFPIQEARARARSTKKIAVVASLGASA------------TGTLELTVEAKLRGKVEFMRV 76
           +  I +++A  RSTKKI V  SL  +A            +G L LT + +L+GKVE M +
Sbjct: 124 KVDIPKSKAGMRSTKKIDVEVSLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLI 183

Query: 75  IKRKKTADMNCTLTLVLATNSVQNL 1
           +K+ K A M+CT+   L++ +VQ+L
Sbjct: 184 MKKNKNASMDCTIAFDLSSKTVQSL 208


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  119 bits (297), Expect = 1e-24
 Identities = 71/177 (40%), Positives = 102/177 (57%), Gaps = 19/177 (10%)
 Frame = -2

Query: 474 KKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTVTSGANG-------DVR 316
           K+   KCL Y          +ILIF+L +MR++ PKVR   VTV + + G       D+R
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 315 FGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVVASLGAS-- 142
             A+V VKNTNFG +K+E++   I      V +  I +ARARAR TKK  V   + +S  
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 141 ----------ATGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNL 1
                     A+G L L+ EAKL GKV  M+VIK+KK+++M+CT+ + + T +VQ+L
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDL 182


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  119 bits (297), Expect = 1e-24
 Identities = 70/184 (38%), Positives = 108/184 (58%), Gaps = 20/184 (10%)
 Frame = -2

Query: 492 SDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTV-----TSGAN 328
           S  ++K+KKR+K   Y          VIL+FSL +MR++ PK R+ ++TV     TS  N
Sbjct: 15  SAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPN 74

Query: 327 G---DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVVA 157
               +++F A V VKNTNFG +KF++T  +       V +  + + RA+ARSTKK+ V  
Sbjct: 75  PPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTV 134

Query: 156 SLGAS------------ATGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNS 13
            L ++            ++G L LT   KL GKV  M++IK+KK+A MNCT+T+ LA+ +
Sbjct: 135 DLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRA 194

Query: 12  VQNL 1
           +Q++
Sbjct: 195 IQDI 198


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  118 bits (296), Expect = 2e-24
 Identities = 70/182 (38%), Positives = 109/182 (59%), Gaps = 20/182 (10%)
 Frame = -2

Query: 486 EQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTV------TSGANG 325
           +++K+KKR+KCL Y          +IL+F+L +MR++ PK R+ +V V       S  + 
Sbjct: 14  KELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSF 73

Query: 324 DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQE--ARARARSTKKIAVVASL 151
           +++F A+V VKNTNFG YKFE++  T     + V +  + +  ARARARSTKK+ V   L
Sbjct: 74  NMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTMDL 133

Query: 150 GASA------------TGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQ 7
            ++             +G L LT ++ L GKV  M+VIK+KK+ +MNCT+T+ LA   V+
Sbjct: 134 NSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKLVR 193

Query: 6   NL 1
           ++
Sbjct: 194 DI 195


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  116 bits (291), Expect = 7e-24
 Identities = 68/181 (37%), Positives = 108/181 (59%), Gaps = 19/181 (10%)
 Frame = -2

Query: 486 EQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTVTS---GANGDVR 316
           E+ K+ + +KC  Y          +IL+F+L +MR++TP  R+ +VTV S    A+G   
Sbjct: 5   EKYKRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPH 64

Query: 315 FGARVL----VKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVVASLG 148
           F  R++    VKN NFG ++F++T A +      V    I ++RARAR TK++ V   + 
Sbjct: 65  FNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVS 124

Query: 147 ASA------------TGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQN 4
           +SA            +GTL LT  A+LRGKV  M+++K++KTA+MNCT+T+ L +++VQ+
Sbjct: 125 SSAVSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQD 184

Query: 3   L 1
           L
Sbjct: 185 L 185


>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  116 bits (291), Expect = 7e-24
 Identities = 73/205 (35%), Positives = 117/205 (57%), Gaps = 18/205 (8%)
 Frame = -2

Query: 561 QGYPLAP---ASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLI 391
           + YP AP      + RSD E  +  HSD +++KKKRIKCL Y          VI +F+L 
Sbjct: 7   EAYPFAPYANGQAMARSDAE-SSRAHSDHELRKKKRIKCLIYIAVFAVFQIIVITVFALT 65

Query: 390 IMRVRTPKVRMDNVTV-----TSGANG--DVRFGARVLVKNTNFGRYKFESTLATIRTAD 232
           +M++++PK R+ ++TV     ++ AN    + F A V VKN NFGRYK++ T  +     
Sbjct: 66  VMKIKSPKFRIKSITVQDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQTSISFIYEG 125

Query: 231 NNVVQFPIQEARARARSTKK------IAVVASLGAS--ATGTLELTVEAKLRGKVEFMRV 76
             V    + +A AR ++T+K      +  V S  AS  + G++ L+  +K+ GKV  M +
Sbjct: 126 TQVGDAVVPKATARTKATRKEIVSGAVKTVNSNLASDISAGSVTLSTYSKINGKVYLMNM 185

Query: 75  IKRKKTADMNCTLTLVLATNSVQNL 1
           IK+KK+A+M CT+ + L++  VQ++
Sbjct: 186 IKKKKSAEMKCTMVVHLSSKQVQDI 210


>ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca
           subsp. vesca]
          Length = 212

 Score =  115 bits (289), Expect = 1e-23
 Identities = 69/182 (37%), Positives = 103/182 (56%), Gaps = 18/182 (9%)
 Frame = -2

Query: 492 SDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTVTS------GA 331
           S+E++K++KRIK  TY          V+ +F L +M+V+TPKVR+    V +        
Sbjct: 28  SEEELKRQKRIKLFTYIGIFIGFQIIVMTVFGLTVMKVKTPKVRLGATNVQNLNFVPTSP 87

Query: 330 NGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVVASL 151
           + D  F  ++ +KNTN+G YKF++  AT       V Q    +++A  RSTKKI    SL
Sbjct: 88  SFDTTFATQIRIKNTNWGPYKFDAGTATFMYQGVAVGQVSFPKSKAGMRSTKKINAEVSL 147

Query: 150 GAS------------ATGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQ 7
            ++            ++G L LT EAKL GKVE M ++K+KK+A MNCT+ L L+T ++Q
Sbjct: 148 NSNEIPSTSNLGSELSSGVLTLTSEAKLTGKVELMLIMKKKKSATMNCTMKLDLSTKTIQ 207

Query: 6   NL 1
            L
Sbjct: 208 AL 209


>gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus]
          Length = 213

 Score =  115 bits (288), Expect = 2e-23
 Identities = 73/208 (35%), Positives = 107/208 (51%), Gaps = 19/208 (9%)
 Frame = -2

Query: 567 EVQGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLII 388
           E +  PL  A+   RSD E G   H   + +KKKR KC  Y          VI IFS+ +
Sbjct: 3   EKEHQPLPYANGHGRSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTV 62

Query: 387 MRVRTPKVRMDNVTVT---SGANGDVRF----GARVLVKNTNFGRYKFESTLATIRTADN 229
           M++RTPK R+ +  +T   +G  G   F     A   VKN NFGRYK+ +T         
Sbjct: 63  MKIRTPKFRIRSAHLTTFHAGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGT 122

Query: 228 NVVQFPIQEARARARSTKKIAVVASLGAS------------ATGTLELTVEAKLRGKVEF 85
            V Q  ++++RA  RSTKK  VV  L  +              G +++T +A++ G+VE 
Sbjct: 123 PVGQVFVRDSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNAGVVQITSQARMAGRVEL 182

Query: 84  MRVIKRKKTADMNCTLTLVLATNSVQNL 1
           + V+K+ K+ DMNC + +V AT  ++NL
Sbjct: 183 IFVMKKNKSTDMNCNMEIVTATQQIRNL 210


>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  115 bits (287), Expect = 2e-23
 Identities = 72/203 (35%), Positives = 109/203 (53%), Gaps = 16/203 (7%)
 Frame = -2

Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382
           Q YPLA  +   RSD E      S++++K+KKRIKC  Y          +  +F L +++
Sbjct: 10  QTYPLASENGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAIGAVFGLTVLK 65

Query: 381 VRTPKVRMDNVTVTSGANGDVRFGA----RVLVKNTNFGRYKFESTLATIRTADNNVVQF 214
           V+TPKVR+   T++   +    F +    ++ VKNTN+G YKF+  + T       V   
Sbjct: 66  VKTPKVRLGTSTLSDVTSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVTFMYQGAPVGTV 125

Query: 213 PIQEARARARSTKKIAVVASLGASAT------------GTLELTVEAKLRGKVEFMRVIK 70
            + + +A  R TKKI V  SL  +A             G L LT EAKL GKVE M ++K
Sbjct: 126 VVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKLTGKVELMLIMK 185

Query: 69  RKKTADMNCTLTLVLATNSVQNL 1
           +KK+A MNCT+ + ++  +V++L
Sbjct: 186 KKKSASMNCTIQIDVSGKTVKSL 208


Top