BLASTX nr result

ID: Mentha29_contig00006279 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00006279
         (817 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus...   181   3e-43
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   134   3e-29
ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579...   125   2e-26
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...   124   4e-26
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   120   5e-25
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   118   2e-24
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     118   3e-24
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   118   3e-24
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   115   1e-23
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   114   4e-23
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   114   6e-23
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   112   1e-22
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   112   2e-22
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   110   6e-22
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   109   1e-21
ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295...   109   1e-21
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   109   1e-21
ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295...   108   3e-21
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...   107   5e-21
ref|XP_004298841.1| PREDICTED: uncharacterized protein LOC101294...   107   7e-21

>gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus guttatus]
          Length = 208

 Score =  181 bits (459), Expect = 3e-43
 Identities = 104/209 (49%), Positives = 136/209 (65%), Gaps = 10/209 (4%)
 Frame = +2

Query: 95  MADKYQPEV-QGYPLAPASVVPRSDEEYG--NNRHSDEQMKKKKRIKCLTYXXXXXXXXX 265
           MA+KY  EV Q YPLAP S VPRSDEEY   NN  + E+MKK KR+KC  Y         
Sbjct: 1   MAEKYNQEVHQAYPLAP-STVPRSDEEYSGTNNYRAQEEMKKNKRMKCFAYIACFAVFQI 59

Query: 266 XXILIFSLIIMRVRTPKVRMDDVTVTSG-ANGDVRFGARVLVKNTNFGRYKFESTLAAIR 442
             ILI +L +MRV++PK+R+ D+TVT    +G+VR  ARVLVKNTNFGRYKF+S LA IR
Sbjct: 60  IIILILALTVMRVKSPKLRLGDITVTKDHVSGNVRLTARVLVKNTNFGRYKFDSGLATIR 119

Query: 443 TADNNVVPFPIQEARARARSTKK------IXXXXXXXXXXXXTLELTVEAKLRGKVEFMR 604
           +  +NV  F I E+RARARSTKK      +               L VE++LRGKVE ++
Sbjct: 120 SGASNVGQFVIPESRARARSTKKMYVTVDLNSSNSSNNSMGGVWTLNVESQLRGKVELLK 179

Query: 605 VIKRKKTADMNCTLTLVLATNSVQNLRCK 691
           V+K+ K+A MNC + + L ++++Q+ RCK
Sbjct: 180 VVKKTKSAYMNCVVVINLRSSTIQDSRCK 208


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  134 bits (338), Expect = 3e-29
 Identities = 89/220 (40%), Positives = 125/220 (56%), Gaps = 21/220 (9%)
 Frame = +2

Query: 95  MADKYQPEVQGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXI 274
           MA+K Q   Q +PLAPA+  PRSDEE  + +   +++K+KKRIK   Y           I
Sbjct: 1   MAEKDQ---QVHPLAPANGHPRSDEESASLQ--SKELKRKKRIKYAVYIAAFAVFQTVVI 55

Query: 275 LIFSLIIMRVRTPKVRMDDVTVTS--------GANGDVRFGARVLVKNTNFGRYKFESTL 430
           LIF+L +MRV+ PKVR+  VTV +         A+ ++RF  +V VKNTNFG YKF++  
Sbjct: 56  LIFALTVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNAT 115

Query: 431 AAIRTADNNVVPFPIQEARARARSTKKIXXXXXXXXXXXXT-------------LELTVE 571
            +       V    I +ARARARSTKK+            +             L L  +
Sbjct: 116 MSFLYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQ 175

Query: 572 AKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRCK 691
           AKL+GKVE M+V+K+KK+ +MNCTL   ++T S+Q+L+CK
Sbjct: 176 AKLKGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215


>ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579067 [Solanum tuberosum]
          Length = 197

 Score =  125 bits (313), Expect = 2e-26
 Identities = 64/196 (32%), Positives = 110/196 (56%), Gaps = 6/196 (3%)
 Frame = +2

Query: 122 QGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXILIFSLIIMR 301
           Q YPLAP++++PRSD E+  N       ++KK+++               IL+F    +R
Sbjct: 5   QKYPLAPSNIMPRSDAEFATNNFQSNNQRRKKKLRST---FLLTIFLTGIILLFCFTFLR 61

Query: 302 VRTPKVRMDDVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLAAIRTADNNVV-PFPIQ 478
           +++PK+R++++ +T+  +G + F A+V ++N NF RY ++STL  I TA+   +  F I 
Sbjct: 62  IKSPKIRIENIRITNDGDGRINFSAQVFLRNRNFWRYGYDSTLGTINTAEGTTIGRFVIP 121

Query: 479 EARARARSTKKIXXXXXXXXXXXXT-----LELTVEAKLRGKVEFMRVIKRKKTADMNCT 643
           +   R RSTK I                  L +  EAK+RGKV+  RV + KK  D++CT
Sbjct: 122 DGEVRRRSTKTIYVMENFILPSRLNNTSGILPVISEAKIRGKVKVFRVFRWKKNVDLSCT 181

Query: 644 LTLVLATNSVQNLRCK 691
           +++ L  +++Q+L C+
Sbjct: 182 MSINLTISAIQDLDCQ 197


>ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca
           subsp. vesca]
          Length = 200

 Score =  124 bits (311), Expect = 4e-26
 Identities = 75/207 (36%), Positives = 112/207 (54%), Gaps = 8/207 (3%)
 Frame = +2

Query: 95  MADKYQPEVQGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXI 274
           MADK+Q   Q YPLAP++   RSD E      S++++K+KKRIKC  Y            
Sbjct: 1   MADKHQ---QVYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAVG 53

Query: 275 LIFSLIIMRVRTPKVRMDDVTVTSGANGDVR-----FGARVLVKNTNFGRYKFESTLAAI 439
            +F L +++V+TPKVR+D  +  SG           F  ++ VKNTN+G YKF+  +   
Sbjct: 54  AVFGLTVLKVKTPKVRLDTTSTLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVVTF 113

Query: 440 RTADNNVVPFPIQEARARARSTKKIXXXXXXXXXXXXT---LELTVEAKLRGKVEFMRVI 610
           +     V  F + + +A  R TKKI            +   L LT EAKL GKV  M ++
Sbjct: 114 KYQGTPVGTFTVPKGKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMFIM 173

Query: 611 KRKKTADMNCTLTLVLATNSVQNLRCK 691
           K+KK+A MNCT+ + ++  +V+++ CK
Sbjct: 174 KKKKSASMNCTIQIDVSGQTVKSVVCK 200


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  120 bits (302), Expect = 5e-25
 Identities = 76/207 (36%), Positives = 105/207 (50%), Gaps = 19/207 (9%)
 Frame = +2

Query: 128 YPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXILIFSLIIMRVR 307
           YPL PA+      +E     HS E +KKKKR+KCL Y           IL+F+L +MR+R
Sbjct: 9   YPLVPAANGHERSDEESVAAHSKE-LKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRIR 67

Query: 308 TPKVRMDDVTVTSGANG-------DVRFGARVLVKNTNFGRYKFESTLAAIRTADNNVVP 466
            PK R+   + T+   G       D++   +  VKNTNFG +K+E  L         V  
Sbjct: 68  NPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVGR 127

Query: 467 FPIQEARARARSTKKIXXXXXXXXXXXXT------------LELTVEAKLRGKVEFMRVI 610
             IQ+ARARARSTKK+                         L LT  +KL GK+  M+VI
Sbjct: 128 ATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKVI 187

Query: 611 KRKKTADMNCTLTLVLATNSVQNLRCK 691
           K+KK+  MNCT+ + + T +V+N+ CK
Sbjct: 188 KKKKSTQMNCTMDVAIDTRTVRNIICK 214


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  118 bits (296), Expect = 2e-24
 Identities = 74/208 (35%), Positives = 112/208 (53%), Gaps = 18/208 (8%)
 Frame = +2

Query: 122 QGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXILIFSLIIMR 301
           Q YPLAPA+   RSD   G +  S++++K++KR K   Y           + +F L +M+
Sbjct: 7   QAYPLAPANGYTRSD---GESLVSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMK 63

Query: 302 VRTPKVRMDDVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLAAIRTADNNVV 463
           V+TPKVR+  + V S        + D  F  ++ VKNTN+G YKF+++ A        V 
Sbjct: 64  VKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVG 123

Query: 464 PFPIQEARARARSTKKIXXXXXXXXXXXXT------------LELTVEAKLRGKVEFMRV 607
              I +++AR RSTKKI            +            L LT +AKL GKVE M +
Sbjct: 124 QVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLI 183

Query: 608 IKRKKTADMNCTLTLVLATNSVQNLRCK 691
           +K+KK+A M+CT+   L+T +V++L+CK
Sbjct: 184 MKKKKSATMDCTIAFDLSTKTVKSLQCK 211


>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score =  118 bits (295), Expect = 3e-24
 Identities = 78/218 (35%), Positives = 116/218 (53%), Gaps = 20/218 (9%)
 Frame = +2

Query: 95  MADKYQPEVQGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXI 274
           MA++YQ   Q YPLAPA+  PRSDEE  N     +++K++KRIK   Y            
Sbjct: 1   MAERYQ---QVYPLAPANGHPRSDEESSNL--DAKELKRRKRIKLAIYAFIFTASQIIVT 55

Query: 275 LIFSLIIMRVRTPKVRMDD------VTVTSGANG--DVRFGARVLVKNTNFGRYKFESTL 430
           L+F L++MRV++PK+R+ D      +   SG+    D+ F  ++ VKNTN+G YKF++T 
Sbjct: 56  LVFVLVVMRVKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTT 115

Query: 431 AAIRTADNNVVPFPIQEARARARSTKKIXXXXXXXXXXXXT------------LELTVEA 574
           AA       V    I + +A  RSTKK+                         L L   A
Sbjct: 116 AAFAYEGETVGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTA 175

Query: 575 KLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRC 688
           K+ GKV+ M ++K+KK+A+MNCT+ + +   +V NL+C
Sbjct: 176 KMTGKVKLMLIMKKKKSANMNCTINIHVKEKTV-NLKC 212


>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  118 bits (295), Expect = 3e-24
 Identities = 73/209 (34%), Positives = 107/209 (51%), Gaps = 19/209 (9%)
 Frame = +2

Query: 122 QGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXILIFSLIIMR 301
           Q YPLAP++   RSD E      S++++K+KKRIKC  Y           + +F L IM+
Sbjct: 7   QSYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMK 62

Query: 302 VRTPKVRMDDVTVTSGANGDVR------FGARVLVKNTNFGRYKFESTLAAIRTADNNVV 463
           V+TPKVR+   T+T   + D        F  ++ VKNTN+G YKF+  +         V 
Sbjct: 63  VKTPKVRLGTSTLTDFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVG 122

Query: 464 PFPIQEARARARSTKKIXXXXXXXXXXXXT-------------LELTVEAKLRGKVEFMR 604
              + + +A  R TKKI            +             L LT EAKL GKVE M 
Sbjct: 123 TVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELML 182

Query: 605 VIKRKKTADMNCTLTLVLATNSVQNLRCK 691
           ++K+KK+A MNCT+ + ++  +V++L CK
Sbjct: 183 IMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score =  115 bits (289), Expect = 1e-23
 Identities = 72/214 (33%), Positives = 110/214 (51%), Gaps = 26/214 (12%)
 Frame = +2

Query: 128 YPLAP-ASVVPRSDEEYGNNRH-SDEQMKKKKRIKCLTYXXXXXXXXXXXILIFSLIIMR 301
           YPL P A    RSD+E   +   S E+++ KKR++CL Y           I +F+L +M+
Sbjct: 13  YPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVVVITVFALTVMK 72

Query: 302 VRTPKVRMDDVTVTSGANG-----------DVRFGARVLVKNTNFGRYKFESTLAAIRTA 448
           +++PK R+   ++T    G           DV FG    VKNTNFG +++E  +      
Sbjct: 73  IKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFG----VKNTNFGHFEYEDGIVVFTYR 128

Query: 449 DNNVVPFPIQEARARARSTKKIXXXXXXXXXXXXT-------------LELTVEAKLRGK 589
           D  +    ++E R RARST+K+                          + +T+ +KL GK
Sbjct: 129 DVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITISSKLDGK 188

Query: 590 VEFMRVIKRKKTADMNCTLTLVLATNSVQNLRCK 691
           +  M++IK+KK+A MNCT+ +VLAT SVQN+ CK
Sbjct: 189 IHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVCK 222


>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  114 bits (285), Expect = 4e-23
 Identities = 69/213 (32%), Positives = 112/213 (52%), Gaps = 15/213 (7%)
 Frame = +2

Query: 95  MADKYQPEVQGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXI 274
           MA++ Q      P A    + RSD E  +  HSD +++KKKRIKCL Y           I
Sbjct: 1   MAERNQEAYPFAPYANGQAMARSDAE-SSRAHSDHELRKKKRIKCLIYIAVFAVFQIIVI 59

Query: 275 LIFSLIIMRVRTPKVR-----MDDVTVTSGANG--DVRFGARVLVKNTNFGRYKFESTLA 433
            +F+L +M++++PK R     + D+T ++ AN    + F A V VKN NFGRYK++ T  
Sbjct: 60  TVFALTVMKIKSPKFRIKSITVQDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQTSI 119

Query: 434 AIRTADNNVVPFPIQEARARARSTK--------KIXXXXXXXXXXXXTLELTVEAKLRGK 589
           +       V    + +A AR ++T+        K             ++ L+  +K+ GK
Sbjct: 120 SFIYEGTQVGDAVVPKATARTKATRKEIVSGAVKTVNSNLASDISAGSVTLSTYSKINGK 179

Query: 590 VEFMRVIKRKKTADMNCTLTLVLATNSVQNLRC 688
           V  M +IK+KK+A+M CT+ + L++  VQ+++C
Sbjct: 180 VYLMNMIKKKKSAEMKCTMVVHLSSKQVQDIKC 212


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  114 bits (284), Expect = 6e-23
 Identities = 70/180 (38%), Positives = 97/180 (53%), Gaps = 19/180 (10%)
 Frame = +2

Query: 209 KKKRIKCLTYXXXXXXXXXXXILIFSLIIMRVRTPKVRMDDVTVTSGANG-------DVR 367
           K+   KCL Y           ILIF+L +MR++ PKVR   VTV + + G       D+R
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 368 FGARVLVKNTNFGRYKFESTLAAIRTADNNVVPFPIQEARARARSTKKIXXXXXXXXXXX 547
             A+V VKNTNFG +K+E++   I      V    I +ARARAR TKK            
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 548 XT------------LELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRCK 691
            T            L L+ EAKL GKV  M+VIK+KK+++M+CT+ + + T +VQ+L+CK
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKCK 185


>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  112 bits (281), Expect = 1e-22
 Identities = 72/215 (33%), Positives = 109/215 (50%), Gaps = 16/215 (7%)
 Frame = +2

Query: 95  MADKYQPEVQGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXI 274
           MA+K Q   Q YPLA  +   RSD E      S++++K+KKRIKC  Y            
Sbjct: 1   MAEKSQKTHQTYPLASENGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAIG 56

Query: 275 LIFSLIIMRVRTPKVRMDDVTVTSGANGDVRFGA----RVLVKNTNFGRYKFESTLAAIR 442
            +F L +++V+TPKVR+   T++   +    F +    ++ VKNTN+G YKF+  +    
Sbjct: 57  AVFGLTVLKVKTPKVRLGTSTLSDVTSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVTFM 116

Query: 443 TADNNVVPFPIQEARARARSTKKIXXXXXXXXXXXXT------------LELTVEAKLRG 586
                V    + + +A  R TKKI            +            L LT EAKL G
Sbjct: 117 YQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKLTG 176

Query: 587 KVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRCK 691
           KVE M ++K+KK+A MNCT+ + ++  +V++L CK
Sbjct: 177 KVELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  112 bits (279), Expect = 2e-22
 Identities = 67/187 (35%), Positives = 101/187 (54%), Gaps = 20/187 (10%)
 Frame = +2

Query: 191 SDEQMKKKKRIKCLTYXXXXXXXXXXXILIFSLIIMRVRTPKVRMDDVTV-----TSGAN 355
           S  ++K+KKR+K   Y           IL+FSL +MR++ PK R+  +TV     TS  N
Sbjct: 15  SAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPN 74

Query: 356 G---DVRFGARVLVKNTNFGRYKFESTLAAIRTADNNVVPFPIQEARARARSTKKIXXXX 526
               +++F A V VKNTNFG +KF++T  +       V    + + RA+ARSTKK+    
Sbjct: 75  PPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTV 134

Query: 527 XXXXXXXXT------------LELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNS 670
                                L LT   KL GKV  M++IK+KK+A MNCT+T+ LA+ +
Sbjct: 135 DLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRA 194

Query: 671 VQNLRCK 691
           +Q+++C+
Sbjct: 195 IQDIKCQ 201


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  110 bits (275), Expect = 6e-22
 Identities = 68/184 (36%), Positives = 103/184 (55%), Gaps = 20/184 (10%)
 Frame = +2

Query: 197 EQMKKKKRIKCLTYXXXXXXXXXXXILIFSLIIMRVRTPKVRM-----DDVTVT-SGANG 358
           +++K+KKR+KCL Y           IL+F+L +MR++ PK R+     DD+T   S  + 
Sbjct: 14  KELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSF 73

Query: 359 DVRFGARVLVKNTNFGRYKFESTLAAIRTADNNVVPFPIQE--ARARARSTKKIXXXXXX 532
           +++F A+V VKNTNFG YKFE++        + V    + +  ARARARSTKK+      
Sbjct: 74  NMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTMDL 133

Query: 533 XXXXXXT------------LELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQ 676
                              L LT ++ L GKV  M+VIK+KK+ +MNCT+T+ LA   V+
Sbjct: 134 NSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKLVR 193

Query: 677 NLRC 688
           +++C
Sbjct: 194 DIKC 197


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  109 bits (273), Expect = 1e-21
 Identities = 67/208 (32%), Positives = 110/208 (52%), Gaps = 18/208 (8%)
 Frame = +2

Query: 122 QGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXILIFSLIIMR 301
           Q YPLAPA+   RSD   G +  S +++K++KRI+  TY           + +F L +M+
Sbjct: 7   QAYPLAPANGYTRSD---GESLVSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMK 63

Query: 302 VRTPKVRMDDV------TVTSGANGDVRFGARVLVKNTNFGRYKFESTLAAIRTADNNVV 463
           V+TPKVR+ ++      +V +  + D  F  ++ VKNTN+G YKF+++          V 
Sbjct: 64  VKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVG 123

Query: 464 PFPIQEARARARSTKKIXXXXXXXXXXXXT------------LELTVEAKLRGKVEFMRV 607
              + + +A  RSTKK+            +            L L  +AKL GKVE M +
Sbjct: 124 QVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLI 183

Query: 608 IKRKKTADMNCTLTLVLATNSVQNLRCK 691
           +K+KK++ M+C +   L+T +V++L+CK
Sbjct: 184 MKKKKSSTMDCMIGFDLSTKTVKSLQCK 211


>ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  109 bits (273), Expect = 1e-21
 Identities = 67/208 (32%), Positives = 106/208 (50%), Gaps = 18/208 (8%)
 Frame = +2

Query: 122 QGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXILIFSLIIMR 301
           Q YP AP++   RSD   G +  S++++K+KKRIK  TY           + +F L +M+
Sbjct: 7   QAYPTAPSNGYARSD---GESLVSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMK 63

Query: 302 VRTPKVRMDDVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLAAIRTADNNVV 463
           V+TPK R   + V +        + D  F  ++ +KNTN+G YKF++  A        + 
Sbjct: 64  VKTPKARWGSIDVETLNYVPATPSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIG 123

Query: 464 PFPIQEARARARSTKKIXXXXXXXXXXXXT------------LELTVEAKLRGKVEFMRV 607
              I +++A  RSTKKI                         L LT + +L+GKVE M +
Sbjct: 124 KVDIPKSKAGMRSTKKIDVEVSLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLI 183

Query: 608 IKRKKTADMNCTLTLVLATNSVQNLRCK 691
           +K+ K A M+CT+   L++ +VQ+L+CK
Sbjct: 184 MKKNKNASMDCTIAFDLSSKTVQSLQCK 211


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  109 bits (272), Expect = 1e-21
 Identities = 63/182 (34%), Positives = 95/182 (52%), Gaps = 20/182 (10%)
 Frame = +2

Query: 206 KKKKRIKCLTYXXXXXXXXXXXILIFSLIIMRVRTPKVRMDDVTV--------TSGANGD 361
           ++K+ IKCL Y           IL+F +++MR+R PKVR+  VTV        +S  +  
Sbjct: 10  RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69

Query: 362 VRFGARVLVKNTNFGRYKFESTLAAIRTADNNVVPFPIQEARARARSTKKIXXXXXXXXX 541
           +   A+V VKNTNFG +KF+++   I      V    I +ARARARST K+         
Sbjct: 70  MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSD 129

Query: 542 XXX------------TLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLR 685
                          T+ L+  AKL GK+   +V K+KK+A+MNCT+ +  ++  +QNL 
Sbjct: 130 KMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLM 189

Query: 686 CK 691
           C+
Sbjct: 190 CQ 191


>ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca
           subsp. vesca]
          Length = 212

 Score =  108 bits (269), Expect = 3e-21
 Identities = 71/217 (32%), Positives = 107/217 (49%), Gaps = 18/217 (8%)
 Frame = +2

Query: 95  MADKYQPEVQGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXI 274
           MA+K++     + LA         +E      S+E++K++KRIK  TY           +
Sbjct: 1   MAEKFK-----HALASVKGYATKKDEQLPTFQSEEELKRQKRIKLFTYIGIFIGFQIIVM 55

Query: 275 LIFSLIIMRVRTPKVRMDDVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLAA 436
            +F L +M+V+TPKVR+    V +        + D  F  ++ +KNTN+G YKF++  A 
Sbjct: 56  TVFGLTVMKVKTPKVRLGATNVQNLNFVPTSPSFDTTFATQIRIKNTNWGPYKFDAGTAT 115

Query: 437 IRTADNNVVPFPIQEARARARSTKKIXXXXXXXXXXXXT------------LELTVEAKL 580
                  V      +++A  RSTKKI            +            L LT EAKL
Sbjct: 116 FMYQGVAVGQVSFPKSKAGMRSTKKINAEVSLNSNEIPSTSNLGSELSSGVLTLTSEAKL 175

Query: 581 RGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRCK 691
            GKVE M ++K+KK+A MNCT+ L L+T ++Q L CK
Sbjct: 176 TGKVELMLIMKKKKSATMNCTMKLDLSTKTIQALECK 212


>gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus]
          Length = 213

 Score =  107 bits (267), Expect = 5e-21
 Identities = 69/211 (32%), Positives = 101/211 (47%), Gaps = 19/211 (9%)
 Frame = +2

Query: 116 EVQGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXILIFSLII 295
           E +  PL  A+   RSD E G   H   + +KKKR KC  Y           I IFS+ +
Sbjct: 3   EKEHQPLPYANGHGRSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTV 62

Query: 296 MRVRTPKVRMDDVTVT---SGANGDVRF----GARVLVKNTNFGRYKFESTLAAIRTADN 454
           M++RTPK R+    +T   +G  G   F     A   VKN NFGRYK+ +T         
Sbjct: 63  MKIRTPKFRIRSAHLTTFHAGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGT 122

Query: 455 NVVPFPIQEARARARSTKKI------------XXXXXXXXXXXXTLELTVEAKLRGKVEF 598
            V    ++++RA  RSTKK                          +++T +A++ G+VE 
Sbjct: 123 PVGQVFVRDSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNAGVVQITSQARMAGRVEL 182

Query: 599 MRVIKRKKTADMNCTLTLVLATNSVQNLRCK 691
           + V+K+ K+ DMNC + +V AT  ++NL CK
Sbjct: 183 IFVMKKNKSTDMNCNMEIVTATQQIRNLVCK 213


>ref|XP_004298841.1| PREDICTED: uncharacterized protein LOC101294558 [Fragaria vesca
           subsp. vesca]
          Length = 203

 Score =  107 bits (266), Expect = 7e-21
 Identities = 69/208 (33%), Positives = 111/208 (53%), Gaps = 9/208 (4%)
 Frame = +2

Query: 95  MADKYQPEVQGYPLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXXI 274
           MA+K Q   Q Y  + A+   RS ++  +   SDE++K++KRIK  TY           +
Sbjct: 1   MAEKNQ---QAY--SSANGYTRSTDQESSPFQSDEELKRQKRIKLFTYIGIFIVFQIVVM 55

Query: 275 LIFSLIIMRVRTPKVRMDDVT------VTSGANGDVRFGARVLVKNTNFGRYKFESTLAA 436
            +F L +M+V+TPK R  ++T      V +  + D  F  ++ +KNTN+G YKF++  A 
Sbjct: 56  TVFGLTVMKVKTPKARWGEITVKTLNSVPAAPSFDTTFETQIRIKNTNWGPYKFDAGTAT 115

Query: 437 IRTADNNVVPFPIQEARARARSTKKIXXXXXXXXXXXXT---LELTVEAKLRGKVEFMRV 607
                  +    I +++A  R TKKI            +   L LT EAKL GKV  M +
Sbjct: 116 FLYQGVTIGKVDIPKSKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMGM 175

Query: 608 IKRKKTADMNCTLTLVLATNSVQNLRCK 691
           +K+KK+A MNCT+ + ++  +V+++ CK
Sbjct: 176 MKKKKSASMNCTIQIDVSGPTVKSVVCK 203

