BLASTX nr result

ID: Mentha23_contig00009995 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00009995
         (648 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus...   206   7e-51
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   150   3e-34
ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579...   145   1e-32
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   145   1e-32
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   142   9e-32
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   141   2e-31
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   138   2e-30
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...   138   2e-30
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   137   3e-30
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   137   4e-30
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]   135   1e-29
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   135   1e-29
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   132   7e-29
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   132   9e-29
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   130   3e-28
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     130   4e-28
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   129   8e-28
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...   127   3e-27
ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295...   127   4e-27
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   126   7e-27

>gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus guttatus]
          Length = 208

 Score =  206 bits (523), Expect = 7e-51
 Identities = 111/207 (53%), Positives = 150/207 (72%), Gaps = 10/207 (4%)
 Frame = -1

Query: 648 DKYQPEV-QGYPLAPATVVPRSDEEYG--NNRRSDEQMRKKKRMKCLAYVAVFAVLQVAV 478
           +KY  EV Q YPLAP+TV PRSDEEY   NN R+ E+M+K KRMKC AY+A FAV Q+ +
Sbjct: 3   EKYNQEVHQAYPLAPSTV-PRSDEEYSGTNNYRAQEEMKKNKRMKCFAYIACFAVFQIII 61

Query: 477 ILVFALVIMRVRTPKVRMDDVTVTSG-ANGDVRFGARVLVKNTNFGRYKFESTLGSITAA 301
           IL+ AL +MRV++PK+R+ D+TVT    +G+VR  ARVLVKNTNFGRYKF+S L +I + 
Sbjct: 62  ILILALTVMRVKSPKLRLGDITVTKDHVSGNVRLTARVLVKNTNFGRYKFDSGLATIRSG 121

Query: 300 DNNVVQFPIQEARARARSTKKIAFVESLSASGS------GTLELTVEAKLRGKVEFFRVI 139
            +NV QF I E+RARARSTKK+     L++S S      G   L VE++LRGKVE  +V+
Sbjct: 122 ASNVGQFVIPESRARARSTKKMYVTVDLNSSNSSNNSMGGVWTLNVESQLRGKVELLKVV 181

Query: 138 KRRKTADMSCTLTVVLATNSVQNLRCK 58
           K+ K+A M+C + + L ++++Q+ RCK
Sbjct: 182 KKTKSAYMNCVVVINLRSSTIQDSRCK 208


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  150 bits (380), Expect = 3e-34
 Identities = 89/215 (41%), Positives = 132/215 (61%), Gaps = 21/215 (9%)
 Frame = -1

Query: 639 QPEVQGYPLAPATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILVFAL 460
           + + Q +PLAPA   PRSDEE  + +   +++++KKR+K   Y+A FAV Q  VIL+FAL
Sbjct: 3   EKDQQVHPLAPANGHPRSDEESASLQ--SKELKRKKRIKYAVYIAAFAVFQTVVILIFAL 60

Query: 459 VIMRVRTPKVRMDDVTV--------TSGANGDVRFGARVLVKNTNFGRYKFESTLGSITA 304
            +MRV+ PKVR+  VTV         + A+ ++RF  +V VKNTNFG YKF++   S   
Sbjct: 61  TVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLY 120

Query: 303 ADNNVVQFPIQEARARARSTKKIAFVESLSAS-------------GSGTLELTVEAKLRG 163
               V +  I +ARARARSTKK+     +++S              S  L L  +AKL+G
Sbjct: 121 DGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKG 180

Query: 162 KVEFFRVIKRRKTADMSCTLTVVLATNSVQNLRCK 58
           KVE  +V+K++K+ +M+CTL   ++T S+Q+L+CK
Sbjct: 181 KVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215


>ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579067 [Solanum tuberosum]
          Length = 197

 Score =  145 bits (366), Expect = 1e-32
 Identities = 71/196 (36%), Positives = 122/196 (62%), Gaps = 6/196 (3%)
 Frame = -1

Query: 627 QGYPLAPATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILVFALVIMR 448
           Q YPLAP+ ++PRSD E+  N       R+KK+++    + +F      +IL+F    +R
Sbjct: 5   QKYPLAPSNIMPRSDAEFATNNFQSNNQRRKKKLRSTFLLTIFLT---GIILLFCFTFLR 61

Query: 447 VRTPKVRMDDVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLGSITAADNNVV-QFPIQ 271
           +++PK+R++++ +T+  +G + F A+V ++N NF RY ++STLG+I  A+   + +F I 
Sbjct: 62  IKSPKIRIENIRITNDGDGRINFSAQVFLRNRNFWRYGYDSTLGTINTAEGTTIGRFVIP 121

Query: 270 EARARARSTKKIAFVE-----SLSASGSGTLELTVEAKLRGKVEFFRVIKRRKTADMSCT 106
           +   R RSTK I  +E     S   + SG L +  EAK+RGKV+ FRV + +K  D+SCT
Sbjct: 122 DGEVRRRSTKTIYVMENFILPSRLNNTSGILPVISEAKIRGKVKVFRVFRWKKNVDLSCT 181

Query: 105 LTVVLATNSVQNLRCK 58
           +++ L  +++Q+L C+
Sbjct: 182 MSINLTISAIQDLDCQ 197


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  145 bits (365), Expect = 1e-32
 Identities = 85/208 (40%), Positives = 123/208 (59%), Gaps = 20/208 (9%)
 Frame = -1

Query: 621 YPLAPATVV-PRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILVFALVIMRV 445
           YPL PA     RSDEE  +     ++++KKKRMKCL Y+ +FAV Q  +IL+FAL +MR+
Sbjct: 9   YPLVPAANGHERSDEE--SVAAHSKELKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRI 66

Query: 444 RTPKVRMDDVTVTSGANG-------DVRFGARVLVKNTNFGRYKFESTLGSITAADNNVV 286
           R PK R+   + T+   G       D++   +  VKNTNFG +K+E  L +       V 
Sbjct: 67  RNPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVG 126

Query: 285 QFPIQEARARARSTKKIAFVESLSASG------------SGTLELTVEAKLRGKVEFFRV 142
           +  IQ+ARARARSTKK+  V  LS++G            +G L LT  +KL GK+   +V
Sbjct: 127 RATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKV 186

Query: 141 IKRRKTADMSCTLTVVLATNSVQNLRCK 58
           IK++K+  M+CT+ V + T +V+N+ CK
Sbjct: 187 IKKKKSTQMNCTMDVAIDTRTVRNIICK 214


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score =  142 bits (358), Expect = 9e-32
 Identities = 86/214 (40%), Positives = 126/214 (58%), Gaps = 26/214 (12%)
 Frame = -1

Query: 621 YPLAP-ATVVPRSDEEYGNNRR-SDEQMRKKKRMKCLAYVAVFAVLQVAVILVFALVIMR 448
           YPL P A    RSD+E   +   S E++R KKRM+CL YV++FAV QV VI VFAL +M+
Sbjct: 13  YPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVVVITVFALTVMK 72

Query: 447 VRTPKVRMDDVTVTSGANG-----------DVRFGARVLVKNTNFGRYKFESTLGSITAA 301
           +++PK R+   ++T    G           DV FG    VKNTNFG +++E  +   T  
Sbjct: 73  IKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFG----VKNTNFGHFEYEDGIVVFTYR 128

Query: 300 DNNVVQFPIQEARARARSTKKIAFVE-SLSASG------------SGTLELTVEAKLRGK 160
           D  + Q  ++E R RARST+K+      L++ G            +G + +T+ +KL GK
Sbjct: 129 DVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITISSKLDGK 188

Query: 159 VEFFRVIKRRKTADMSCTLTVVLATNSVQNLRCK 58
           +   ++IK++K+A M+CT+ VVLAT SVQN+ CK
Sbjct: 189 IHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVCK 222


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  141 bits (355), Expect = 2e-31
 Identities = 80/180 (44%), Positives = 111/180 (61%), Gaps = 19/180 (10%)
 Frame = -1

Query: 540 KKKRMKCLAYVAVFAVLQVAVILVFALVIMRVRTPKVRMDDVTVTSGANG-------DVR 382
           K+   KCLAYVAVF V Q A+IL+FAL +MR++ PKVR   VTV + + G       D+R
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 381 FGARVLVKNTNFGRYKFESTLGSITAADNNVVQFPIQEARARARSTKKIAFVESLSAS-- 208
             A+V VKNTNFG +K+E++   I      V +  I +ARARAR TKK      +S+S  
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 207 ----------GSGTLELTVEAKLRGKVEFFRVIKRRKTADMSCTLTVVLATNSVQNLRCK 58
                      SG L L+ EAKL GKV   +VIK++K+++MSCT+ + + T +VQ+L+CK
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKCK 185


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  138 bits (347), Expect = 2e-30
 Identities = 79/208 (37%), Positives = 123/208 (59%), Gaps = 18/208 (8%)
 Frame = -1

Query: 627 QGYPLAPATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILVFALVIMR 448
           Q YPLAPA    RSD   G +  S+++++++KR K   Y+ +F V+Q+ V+ VF L +M+
Sbjct: 7   QAYPLAPANGYTRSD---GESLVSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMK 63

Query: 447 VRTPKVRMDDVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLGSITAADNNVV 286
           V+TPKVR+  + V S        + D  F  ++ VKNTN+G YKF+++  +       V 
Sbjct: 64  VKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVG 123

Query: 285 QFPIQEARARARSTKKIAFVESLSAS------------GSGTLELTVEAKLRGKVEFFRV 142
           Q  I +++AR RSTKKI+    L+ +             SG L LT +AKL GKVE   +
Sbjct: 124 QVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLI 183

Query: 141 IKRRKTADMSCTLTVVLATNSVQNLRCK 58
           +K++K+A M CT+   L+T +V++L+CK
Sbjct: 184 MKKKKSATMDCTIAFDLSTKTVKSLQCK 211


>ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca
           subsp. vesca]
          Length = 200

 Score =  138 bits (347), Expect = 2e-30
 Identities = 79/205 (38%), Positives = 122/205 (59%), Gaps = 8/205 (3%)
 Frame = -1

Query: 648 DKYQPEVQGYPLAPATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILV 469
           DK+Q   Q YPLAP+    RSD E      S++++++KKR+KC AY+ +F V Q+AV  V
Sbjct: 3   DKHQ---QVYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAVGAV 55

Query: 468 FALVIMRVRTPKVRMDDVTVTSGANGDV-----RFGARVLVKNTNFGRYKFESTLGSITA 304
           F L +++V+TPKVR+D  +  SG           F  ++ VKNTN+G YKF+  + +   
Sbjct: 56  FGLTVLKVKTPKVRLDTTSTLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVVTFKY 115

Query: 303 ADNNVVQFPIQEARARARSTKKIAFVESLSA---SGSGTLELTVEAKLRGKVEFFRVIKR 133
               V  F + + +A  R TKKI    SL+    + SG L LT EAKL GKV    ++K+
Sbjct: 116 QGTPVGTFTVPKGKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMFIMKK 175

Query: 132 RKTADMSCTLTVVLATNSVQNLRCK 58
           +K+A M+CT+ + ++  +V+++ CK
Sbjct: 176 KKSASMNCTIQIDVSGQTVKSVVCK 200


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  137 bits (345), Expect = 3e-30
 Identities = 78/184 (42%), Positives = 119/184 (64%), Gaps = 20/184 (10%)
 Frame = -1

Query: 552 EQMRKKKRMKCLAYVAVFAVLQVAVILVFALVIMRVRTPKVR-----MDDVTV-TSGANG 391
           +++++KKRMKCLAYVA F + Q A+ILVFAL +MR++ PK R     +DD+T   S  + 
Sbjct: 14  KELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSF 73

Query: 390 DVRFGARVLVKNTNFGRYKFESTLGSITAADNNVVQFPIQE--ARARARSTKKIAFVESL 217
           +++F A+V VKNTNFG YKFE++  +     + V +  + +  ARARARSTKK+     L
Sbjct: 74  NMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTMDL 133

Query: 216 SASG------------SGTLELTVEAKLRGKVEFFRVIKRRKTADMSCTLTVVLATNSVQ 73
           +++G            SG L LT ++ L GKV   +VIK++K+ +M+CT+TV LA   V+
Sbjct: 134 NSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKLVR 193

Query: 72  NLRC 61
           +++C
Sbjct: 194 DIKC 197


>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  137 bits (344), Expect = 4e-30
 Identities = 81/207 (39%), Positives = 124/207 (59%), Gaps = 18/207 (8%)
 Frame = -1

Query: 627 QGYPLAP---ATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILVFALV 457
           + YP AP      + RSD E  +   SD ++RKKKR+KCL Y+AVFAV Q+ VI VFAL 
Sbjct: 7   EAYPFAPYANGQAMARSDAE-SSRAHSDHELRKKKRIKCLIYIAVFAVFQIIVITVFALT 65

Query: 456 IMRVRTPKVR-----MDDVTVTSGANG--DVRFGARVLVKNTNFGRYKFESTLGSITAAD 298
           +M++++PK R     + D+T ++ AN    + F A V VKN NFGRYK++ T  S     
Sbjct: 66  VMKIKSPKFRIKSITVQDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQTSISFIYEG 125

Query: 297 NNVVQFPIQEARARARSTKK------IAFVESLSAS--GSGTLELTVEAKLRGKVEFFRV 142
             V    + +A AR ++T+K      +  V S  AS   +G++ L+  +K+ GKV    +
Sbjct: 126 TQVGDAVVPKATARTKATRKEIVSGAVKTVNSNLASDISAGSVTLSTYSKINGKVYLMNM 185

Query: 141 IKRRKTADMSCTLTVVLATNSVQNLRC 61
           IK++K+A+M CT+ V L++  VQ+++C
Sbjct: 186 IKKKKSAEMKCTMVVHLSSKQVQDIKC 212


>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score =  135 bits (340), Expect = 1e-29
 Identities = 73/181 (40%), Positives = 115/181 (63%), Gaps = 18/181 (9%)
 Frame = -1

Query: 546 MRKKKRMKCLAYVAVFAVLQVAVILVFALVIMRVRTPKVRMDDVTV------TSGANGDV 385
           +R+KK +KCLAYVA F V Q  +IL+F L+++++R PKVR+  ++V      T+  + D+
Sbjct: 8   VRRKKSLKCLAYVAAFVVFQTGIILLFVLLVLKIRDPKVRIASISVENQHFSTNSFSMDL 67

Query: 384 RFGARVLVKNTNFGRYKFESTLGSITAADNNVVQFPIQEARARARSTKKIAFVESLSAS- 208
           +  ARV VKNTNFG +KF+++  +I+     V +  I +ARAR+RSTK+      +S+S 
Sbjct: 68  K--ARVTVKNTNFGHFKFDNSTATISYFGTAVGEATILKARARSRSTKRFNITVPISSSK 125

Query: 207 -----------GSGTLELTVEAKLRGKVEFFRVIKRRKTADMSCTLTVVLATNSVQNLRC 61
                       SG L L+  AKL GK+  F++ K++K+A+MSCT+ +   T+S++NL C
Sbjct: 126 VNNHRQLRRDLNSGVLNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENLSC 185

Query: 60  K 58
           K
Sbjct: 186 K 186


>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  135 bits (339), Expect = 1e-29
 Identities = 79/209 (37%), Positives = 122/209 (58%), Gaps = 19/209 (9%)
 Frame = -1

Query: 627 QGYPLAPATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILVFALVIMR 448
           Q YPLAP+    RSD E      S++++++KKR+KC AY+ +F V Q+AV+ VF L IM+
Sbjct: 7   QSYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMK 62

Query: 447 VRTPKVRMDDVTVT------SGANGDVRFGARVLVKNTNFGRYKFESTLGSITAADNNVV 286
           V+TPKVR+   T+T      +  + D  F  ++ VKNTN+G YKF+  + +       V 
Sbjct: 63  VKTPKVRLGTSTLTDFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVG 122

Query: 285 QFPIQEARARARSTKKI--------AFVESLSAS-----GSGTLELTVEAKLRGKVEFFR 145
              + + +A  R TKKI        A + S S++       G L LT EAKL GKVE   
Sbjct: 123 TVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELML 182

Query: 144 VIKRRKTADMSCTLTVVLATNSVQNLRCK 58
           ++K++K+A M+CT+ + ++  +V++L CK
Sbjct: 183 IMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  132 bits (333), Expect = 7e-29
 Identities = 70/182 (38%), Positives = 111/182 (60%), Gaps = 20/182 (10%)
 Frame = -1

Query: 543 RKKKRMKCLAYVAVFAVLQVAVILVFALVIMRVRTPKVRMDDVTV--------TSGANGD 388
           R+K+ +KCLAY+    + Q  +IL+F +++MR+R PKVR+  VTV        +S  +  
Sbjct: 10  RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69

Query: 387 VRFGARVLVKNTNFGRYKFESTLGSITAADNNVVQFPIQEARARARSTKKIAFVESLSAS 208
           +   A+V VKNTNFG +KF+++  +I+     V +  I +ARARARST K+    S+S+ 
Sbjct: 70  MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSD 129

Query: 207 ------------GSGTLELTVEAKLRGKVEFFRVIKRRKTADMSCTLTVVLATNSVQNLR 64
                       GSGT+ L+  AKL GK+  F+V K++K+A+M+CT+ V  ++  +QNL 
Sbjct: 130 KMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLM 189

Query: 63  CK 58
           C+
Sbjct: 190 CQ 191


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  132 bits (332), Expect = 9e-29
 Identities = 74/187 (39%), Positives = 113/187 (60%), Gaps = 20/187 (10%)
 Frame = -1

Query: 558 SDEQMRKKKRMKCLAYVAVFAVLQVAVILVFALVIMRVRTPKVR-----MDDVTVTSGAN 394
           S  ++++KKRMK  AY A F V Q  VILVF+L +MR++ PK R     ++D+  TS  N
Sbjct: 15  SAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPN 74

Query: 393 G---DVRFGARVLVKNTNFGRYKFESTLGSITAADNNVVQFPIQEARARARSTKKIAFVE 223
               +++F A V VKNTNFG +KF++T  S       V +  + + RA+ARSTKK+    
Sbjct: 75  PPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTV 134

Query: 222 SLSAS------------GSGTLELTVEAKLRGKVEFFRVIKRRKTADMSCTLTVVLATNS 79
            L+++             SG L LT   KL GKV   ++IK++K+A M+CT+TV LA+ +
Sbjct: 135 DLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRA 194

Query: 78  VQNLRCK 58
           +Q+++C+
Sbjct: 195 IQDIKCQ 201


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  130 bits (328), Expect = 3e-28
 Identities = 74/208 (35%), Positives = 122/208 (58%), Gaps = 18/208 (8%)
 Frame = -1

Query: 627 QGYPLAPATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILVFALVIMR 448
           Q YPLAPA    RSD   G +  S ++++++KR++   Y+ +F V Q+ V+ VF L +M+
Sbjct: 7   QAYPLAPANGYTRSD---GESLVSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMK 63

Query: 447 VRTPKVRMDDV------TVTSGANGDVRFGARVLVKNTNFGRYKFESTLGSITAADNNVV 286
           V+TPKVR+ ++      +V +  + D  F  ++ VKNTN+G YKF+++  +       V 
Sbjct: 64  VKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVG 123

Query: 285 QFPIQEARARARSTKKIAFVESLSASG------------SGTLELTVEAKLRGKVEFFRV 142
           Q  + + +A  RSTKK+    SL+A+G            SG L L  +AKL GKVE   +
Sbjct: 124 QVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLI 183

Query: 141 IKRRKTADMSCTLTVVLATNSVQNLRCK 58
           +K++K++ M C +   L+T +V++L+CK
Sbjct: 184 MKKKKSSTMDCMIGFDLSTKTVKSLQCK 211


>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score =  130 bits (327), Expect = 4e-28
 Identities = 80/216 (37%), Positives = 124/216 (57%), Gaps = 20/216 (9%)
 Frame = -1

Query: 648 DKYQPEVQGYPLAPATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILV 469
           ++YQ   Q YPLAPA   PRSDEE  N     ++++++KR+K   Y  +F   Q+ V LV
Sbjct: 3   ERYQ---QVYPLAPANGHPRSDEESSN--LDAKELKRRKRIKLAIYAFIFTASQIIVTLV 57

Query: 468 FALVIMRVRTPKVRMDD------VTVTSGA--NGDVRFGARVLVKNTNFGRYKFESTLGS 313
           F LV+MRV++PK+R+ D      +   SG+  + D+ F  ++ VKNTN+G YKF++T  +
Sbjct: 58  FVLVVMRVKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTTAA 117

Query: 312 ITAADNNVVQFPIQEARARARSTKKIAFVESLSAS------------GSGTLELTVEAKL 169
                  V Q  I + +A  RSTKK+    SLS+S              G L L   AK+
Sbjct: 118 FAYEGETVGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTAKM 177

Query: 168 RGKVEFFRVIKRRKTADMSCTLTVVLATNSVQNLRC 61
            GKV+   ++K++K+A+M+CT+ + +   +V NL+C
Sbjct: 178 TGKVKLMLIMKKKKSANMNCTINIHVKEKTV-NLKC 212


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  129 bits (324), Expect = 8e-28
 Identities = 74/184 (40%), Positives = 112/184 (60%), Gaps = 19/184 (10%)
 Frame = -1

Query: 552 EQMRKKKRMKCLAYVAVFAVLQVAVILVFALVIMRVRTPKVRMDDVTVTS---GANGDVR 382
           E+ ++ + MKC AY+    V Q  +ILVFAL +MR++TP  R+  VTV S    A+G   
Sbjct: 5   EKYKRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPH 64

Query: 381 FGARVL----VKNTNFGRYKFESTLGSITAADNNVVQFPIQEARARARSTKKIAFVESLS 214
           F  R++    VKN NFG ++F++T  ++T     V    I ++RARAR TK++     +S
Sbjct: 65  FNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVS 124

Query: 213 ASG------------SGTLELTVEAKLRGKVEFFRVIKRRKTADMSCTLTVVLATNSVQN 70
           +S             SGTL LT  A+LRGKV   +++K+RKTA+M+CT+TV L +++VQ+
Sbjct: 125 SSAVSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQD 184

Query: 69  LRCK 58
           L C+
Sbjct: 185 LDCE 188


>gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus]
          Length = 213

 Score =  127 bits (319), Expect = 3e-27
 Identities = 74/211 (35%), Positives = 113/211 (53%), Gaps = 19/211 (9%)
 Frame = -1

Query: 633 EVQGYPLAPATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILVFALVI 454
           E +  PL  A    RSD E G       + RKKKR KC  Y+A+F + Q+ VI +F++ +
Sbjct: 3   EKEHQPLPYANGHGRSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTV 62

Query: 453 MRVRTPKVRMDDVTVT---SGANGDVRF----GARVLVKNTNFGRYKFESTLGSITAADN 295
           M++RTPK R+    +T   +G  G   F     A   VKN NFGRYK+ +T         
Sbjct: 63  MKIRTPKFRIRSAHLTTFHAGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGT 122

Query: 294 NVVQFPIQEARARARSTKKIAFVESLSAS------------GSGTLELTVEAKLRGKVEF 151
            V Q  ++++RA  RSTKK   V  L+ +             +G +++T +A++ G+VE 
Sbjct: 123 PVGQVFVRDSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNAGVVQITSQARMAGRVEL 182

Query: 150 FRVIKRRKTADMSCTLTVVLATNSVQNLRCK 58
             V+K+ K+ DM+C + +V AT  ++NL CK
Sbjct: 183 IFVMKKNKSTDMNCNMEIVTATQQIRNLVCK 213


>ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  127 bits (318), Expect = 4e-27
 Identities = 71/208 (34%), Positives = 116/208 (55%), Gaps = 18/208 (8%)
 Frame = -1

Query: 627 QGYPLAPATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILVFALVIMR 448
           Q YP AP+    RSD   G +  S++++++KKR+K   Y+ +F V Q+ V+ VF L +M+
Sbjct: 7   QAYPTAPSNGYARSD---GESLVSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMK 63

Query: 447 VRTPKVRMDDVT------VTSGANGDVRFGARVLVKNTNFGRYKFESTLGSITAADNNVV 286
           V+TPK R   +       V +  + D  F  ++ +KNTN+G YKF++   +       + 
Sbjct: 64  VKTPKARWGSIDVETLNYVPATPSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIG 123

Query: 285 QFPIQEARARARSTKKIAFVESLSAS------------GSGTLELTVEAKLRGKVEFFRV 142
           +  I +++A  RSTKKI    SL+ +             SG L LT + +L+GKVE   +
Sbjct: 124 KVDIPKSKAGMRSTKKIDVEVSLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLI 183

Query: 141 IKRRKTADMSCTLTVVLATNSVQNLRCK 58
           +K+ K A M CT+   L++ +VQ+L+CK
Sbjct: 184 MKKNKNASMDCTIAFDLSSKTVQSLQCK 211


>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  126 bits (316), Expect = 7e-27
 Identities = 77/214 (35%), Positives = 120/214 (56%), Gaps = 17/214 (7%)
 Frame = -1

Query: 648 DKYQPEVQGYPLAPATVVPRSDEEYGNNRRSDEQMRKKKRMKCLAYVAVFAVLQVAVILV 469
           +K Q   Q YPLA      RSD E      S++++++KKR+KC AY+ +F V Q+A+  V
Sbjct: 3   EKSQKTHQTYPLASENGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAIGAV 58

Query: 468 FALVIMRVRTPKVR-----MDDVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLGSITA 304
           F L +++V+TPKVR     + DVT +S  +    F  ++ VKNTN+G YKF+  + +   
Sbjct: 59  FGLTVLKVKTPKVRLGTSTLSDVT-SSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVTFMY 117

Query: 303 ADNNVVQFPIQEARARARSTKKIAFVESLSAS------------GSGTLELTVEAKLRGK 160
               V    + + +A  R TKKI    SL+ +              G L LT EAKL GK
Sbjct: 118 QGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKLTGK 177

Query: 159 VEFFRVIKRRKTADMSCTLTVVLATNSVQNLRCK 58
           VE   ++K++K+A M+CT+ + ++  +V++L CK
Sbjct: 178 VELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211


Top