BLASTX nr result

ID: Mentha25_contig00006339 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00006339
         (822 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus...   185   2e-44
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   135   1e-29
ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579...   130   6e-28
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...   126   8e-27
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   125   1e-26
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   122   1e-25
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   120   6e-25
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   118   3e-24
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     117   7e-24
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   115   3e-23
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   115   3e-23
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   114   4e-23
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   113   7e-23
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   113   7e-23
ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295...   113   1e-22
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   112   1e-22
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   112   1e-22
ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295...   112   2e-22
ref|XP_004298841.1| PREDICTED: uncharacterized protein LOC101294...   109   1e-21
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]   109   1e-21

>gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus guttatus]
          Length = 208

 Score =  185 bits (469), Expect = 2e-44
 Identities = 106/209 (50%), Positives = 141/209 (67%), Gaps = 10/209 (4%)
 Frame = -2

Query: 725 MADKYQPEV-QGYPLAPASVVPRSDEEYG--NNHHSDEQMKKKKRIKCLTYXXXXXXXXX 555
           MA+KY  EV Q YPLAP S VPRSDEEY   NN+ + E+MKK KR+KC  Y         
Sbjct: 1   MAEKYNQEVHQAYPLAP-STVPRSDEEYSGTNNYRAQEEMKKNKRMKCFAYIACFAVFQI 59

Query: 554 XVILIFSLIIMRVRTPKVRMDNVTVTSG-ANGDVRFGARVLVKNTNFGRYKFESTLATIR 378
            +ILI +L +MRV++PK+R+ ++TVT    +G+VR  ARVLVKNTNFGRYKF+S LATIR
Sbjct: 60  IIILILALTVMRVKSPKLRLGDITVTKDHVSGNVRLTARVLVKNTNFGRYKFDSGLATIR 119

Query: 377 TADNNVVQFPIQEARARARSTKK------IXXXXXXXXXXXGTLELTVEAKLRGKVEFMR 216
           +  +NV QF I E+RARARSTKK      +           G   L VE++LRGKVE ++
Sbjct: 120 SGASNVGQFVIPESRARARSTKKMYVTVDLNSSNSSNNSMGGVWTLNVESQLRGKVELLK 179

Query: 215 VIKRKKTADMNCTLTLVLATNSVQNLRCK 129
           V+K+ K+A MNC + + L ++++Q+ RCK
Sbjct: 180 VVKKTKSAYMNCVVVINLRSSTIQDSRCK 208


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  135 bits (341), Expect = 1e-29
 Identities = 90/220 (40%), Positives = 126/220 (57%), Gaps = 21/220 (9%)
 Frame = -2

Query: 725 MADKYQPEVQGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVI 546
           MA+K Q   Q +PLAPA+  PRSDEE  +     +++K+KKRIK   Y          VI
Sbjct: 1   MAEKDQ---QVHPLAPANGHPRSDEESASLQ--SKELKRKKRIKYAVYIAAFAVFQTVVI 55

Query: 545 LIFSLIIMRVRTPKVRMDNVTVTS--------GANGDVRFGARVLVKNTNFGRYKFESTL 390
           LIF+L +MRV+ PKVR+  VTV +         A+ ++RF  +V VKNTNFG YKF++  
Sbjct: 56  LIFALTVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNAT 115

Query: 389 ATIRTADNNVVQFPIQEARARARSTKKIXXXXXXXXXXXGT-------------LELTVE 249
            +       V +  I +ARARARSTKK+            +             L L  +
Sbjct: 116 MSFLYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQ 175

Query: 248 AKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRCK 129
           AKL+GKVE M+V+K+KK+ +MNCTL   ++T S+Q+L+CK
Sbjct: 176 AKLKGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215


>ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579067 [Solanum tuberosum]
          Length = 197

 Score =  130 bits (327), Expect = 6e-28
 Identities = 66/196 (33%), Positives = 114/196 (58%), Gaps = 6/196 (3%)
 Frame = -2

Query: 698 QGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 519
           Q YPLAP++++PRSD E+  N+      ++KK+++              +IL+F    +R
Sbjct: 5   QKYPLAPSNIMPRSDAEFATNNFQSNNQRRKKKLRST---FLLTIFLTGIILLFCFTFLR 61

Query: 518 VRTPKVRMDNVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV-QFPIQ 342
           +++PK+R++N+ +T+  +G + F A+V ++N NF RY ++STL TI TA+   + +F I 
Sbjct: 62  IKSPKIRIENIRITNDGDGRINFSAQVFLRNRNFWRYGYDSTLGTINTAEGTTIGRFVIP 121

Query: 341 EARARARSTKKIXXXXXXXXXXXGT-----LELTVEAKLRGKVEFMRVIKRKKTADMNCT 177
           +   R RSTK I                  L +  EAK+RGKV+  RV + KK  D++CT
Sbjct: 122 DGEVRRRSTKTIYVMENFILPSRLNNTSGILPVISEAKIRGKVKVFRVFRWKKNVDLSCT 181

Query: 176 LTLVLATNSVQNLRCK 129
           +++ L  +++Q+L C+
Sbjct: 182 MSINLTISAIQDLDCQ 197


>ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca
           subsp. vesca]
          Length = 200

 Score =  126 bits (317), Expect = 8e-27
 Identities = 77/207 (37%), Positives = 114/207 (55%), Gaps = 8/207 (3%)
 Frame = -2

Query: 725 MADKYQPEVQGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVI 546
           MADK+Q   Q YPLAP++   RSD E      S++++K+KKRIKC  Y          V 
Sbjct: 1   MADKHQ---QVYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAVG 53

Query: 545 LIFSLIIMRVRTPKVRMDNVTVTSGANGDVR-----FGARVLVKNTNFGRYKFESTLATI 381
            +F L +++V+TPKVR+D  +  SG           F  ++ VKNTN+G YKF+  + T 
Sbjct: 54  AVFGLTVLKVKTPKVRLDTTSTLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVVTF 113

Query: 380 RTADNNVVQFPIQEARARARSTKKIXXXXXXXXXXXGT---LELTVEAKLRGKVEFMRVI 210
           +     V  F + + +A  R TKKI            +   L LT EAKL GKV  M ++
Sbjct: 114 KYQGTPVGTFTVPKGKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMFIM 173

Query: 209 KRKKTADMNCTLTLVLATNSVQNLRCK 129
           K+KK+A MNCT+ + ++  +V+++ CK
Sbjct: 174 KKKKSASMNCTIQIDVSGQTVKSVVCK 200


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  125 bits (315), Expect = 1e-26
 Identities = 80/208 (38%), Positives = 112/208 (53%), Gaps = 20/208 (9%)
 Frame = -2

Query: 692 YPLAPASVV-PRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRV 516
           YPL PA+    RSDEE    H   +++KKKKR+KCL Y          +IL+F+L +MR+
Sbjct: 9   YPLVPAANGHERSDEESVAAH--SKELKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRI 66

Query: 515 RTPKVRMD-------NVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 357
           R PK R+        NV   +  + D++   +  VKNTNFG +K+E  L T       V 
Sbjct: 67  RNPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVG 126

Query: 356 QFPIQEARARARSTKKI------------XXXXXXXXXXXGTLELTVEAKLRGKVEFMRV 213
           +  IQ+ARARARSTKK+                       G L LT  +KL GK+  M+V
Sbjct: 127 RATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKV 186

Query: 212 IKRKKTADMNCTLTLVLATNSVQNLRCK 129
           IK+KK+  MNCT+ + + T +V+N+ CK
Sbjct: 187 IKKKKSTQMNCTMDVAIDTRTVRNIICK 214


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  122 bits (307), Expect = 1e-25
 Identities = 77/208 (37%), Positives = 115/208 (55%), Gaps = 18/208 (8%)
 Frame = -2

Query: 698 QGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 519
           Q YPLAPA+   RSD   G +  S++++K++KR K   Y          V+ +F L +M+
Sbjct: 7   QAYPLAPANGYTRSD---GESLVSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMK 63

Query: 518 VRTPKVRMDNVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 357
           V+TPKVR+  + V S        + D  F  ++ VKNTN+G YKF+++ AT       V 
Sbjct: 64  VKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVG 123

Query: 356 QFPIQEARARARSTKKIXXXXXXXXXXXGT------------LELTVEAKLRGKVEFMRV 213
           Q  I +++AR RSTKKI            +            L LT +AKL GKVE M +
Sbjct: 124 QVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLI 183

Query: 212 IKRKKTADMNCTLTLVLATNSVQNLRCK 129
           +K+KK+A M+CT+   L+T +V++L+CK
Sbjct: 184 MKKKKSATMDCTIAFDLSTKTVKSLQCK 211


>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  120 bits (301), Expect = 6e-25
 Identities = 75/209 (35%), Positives = 109/209 (52%), Gaps = 19/209 (9%)
 Frame = -2

Query: 698 QGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 519
           Q YPLAP++   RSD E      S++++K+KKRIKC  Y          V+ +F L IM+
Sbjct: 7   QSYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMK 62

Query: 518 VRTPKVRMDNVTVTSGANGDVR------FGARVLVKNTNFGRYKFESTLATIRTADNNVV 357
           V+TPKVR+   T+T   + D        F  ++ VKNTN+G YKF+  + T       V 
Sbjct: 63  VKTPKVRLGTSTLTDFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVG 122

Query: 356 QFPIQEARARARSTKKIXXXXXXXXXXXGT-------------LELTVEAKLRGKVEFMR 216
              + + +A  R TKKI            +             L LT EAKL GKVE M 
Sbjct: 123 TVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELML 182

Query: 215 VIKRKKTADMNCTLTLVLATNSVQNLRCK 129
           ++K+KK+A MNCT+ + ++  +V++L CK
Sbjct: 183 IMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score =  118 bits (295), Expect = 3e-24
 Identities = 74/214 (34%), Positives = 112/214 (52%), Gaps = 26/214 (12%)
 Frame = -2

Query: 692 YPLAP-ASVVPRSDEEYGNNHH-SDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 519
           YPL P A    RSD+E   +   S E+++ KKR++CL Y          VI +F+L +M+
Sbjct: 13  YPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVVVITVFALTVMK 72

Query: 518 VRTPKVRMDNVTVTSGANG-----------DVRFGARVLVKNTNFGRYKFESTLATIRTA 372
           +++PK R+   ++T    G           DV FG    VKNTNFG +++E  +      
Sbjct: 73  IKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFG----VKNTNFGHFEYEDGIVVFTYR 128

Query: 371 DNNVVQFPIQEARARARSTKKIXXXXXXXXXXXGT-------------LELTVEAKLRGK 231
           D  + Q  ++E R RARST+K+                          + +T+ +KL GK
Sbjct: 129 DVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITISSKLDGK 188

Query: 230 VEFMRVIKRKKTADMNCTLTLVLATNSVQNLRCK 129
           +  M++IK+KK+A MNCT+ +VLAT SVQN+ CK
Sbjct: 189 IHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVCK 222


>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score =  117 bits (292), Expect = 7e-24
 Identities = 78/218 (35%), Positives = 117/218 (53%), Gaps = 20/218 (9%)
 Frame = -2

Query: 725 MADKYQPEVQGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVI 546
           MA++YQ   Q YPLAPA+  PRSDEE  N     +++K++KRIK   Y          V 
Sbjct: 1   MAERYQ---QVYPLAPANGHPRSDEESSNL--DAKELKRRKRIKLAIYAFIFTASQIIVT 55

Query: 545 LIFSLIIMRVRTPKVRMDN------VTVTSGANG--DVRFGARVLVKNTNFGRYKFESTL 390
           L+F L++MRV++PK+R+ +      +   SG+    D+ F  ++ VKNTN+G YKF++T 
Sbjct: 56  LVFVLVVMRVKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTT 115

Query: 389 ATIRTADNNVVQFPIQEARARARSTKKIXXXXXXXXXXXGT------------LELTVEA 246
           A        V Q  I + +A  RSTKK+                         L L   A
Sbjct: 116 AAFAYEGETVGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTA 175

Query: 245 KLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRC 132
           K+ GKV+ M ++K+KK+A+MNCT+ + +   +V NL+C
Sbjct: 176 KMTGKVKLMLIMKKKKSANMNCTINIHVKEKTV-NLKC 212


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  115 bits (287), Expect = 3e-23
 Identities = 70/180 (38%), Positives = 99/180 (55%), Gaps = 19/180 (10%)
 Frame = -2

Query: 611 KKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTVTSGANG-------DVR 453
           K+   KCL Y          +ILIF+L +MR++ PKVR   VTV + + G       D+R
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 452 FGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIXXXXXXXXXXX 273
             A+V VKNTNFG +K+E++   I      V +  I +ARARAR TKK            
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 272 GT------------LELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRCK 129
            T            L L+ EAKL GKV  M+VIK+KK+++M+CT+ + + T +VQ+L+CK
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKCK 185


>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  115 bits (287), Expect = 3e-23
 Identities = 73/215 (33%), Positives = 111/215 (51%), Gaps = 16/215 (7%)
 Frame = -2

Query: 725 MADKYQPEVQGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVI 546
           MA+K Q   Q YPLA  +   RSD E      S++++K+KKRIKC  Y          + 
Sbjct: 1   MAEKSQKTHQTYPLASENGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAIG 56

Query: 545 LIFSLIIMRVRTPKVRMDNVTVTSGANGDVRFGA----RVLVKNTNFGRYKFESTLATIR 378
            +F L +++V+TPKVR+   T++   +    F +    ++ VKNTN+G YKF+  + T  
Sbjct: 57  AVFGLTVLKVKTPKVRLGTSTLSDVTSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVTFM 116

Query: 377 TADNNVVQFPIQEARARARSTKKIXXXXXXXXXXXGT------------LELTVEAKLRG 234
                V    + + +A  R TKKI            +            L LT EAKL G
Sbjct: 117 YQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKLTG 176

Query: 233 KVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRCK 129
           KVE M ++K+KK+A MNCT+ + ++  +V++L CK
Sbjct: 177 KVELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  114 bits (285), Expect = 4e-23
 Identities = 71/213 (33%), Positives = 115/213 (53%), Gaps = 15/213 (7%)
 Frame = -2

Query: 725 MADKYQPEVQGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVI 546
           MA++ Q      P A    + RSD E  +  HSD +++KKKRIKCL Y          VI
Sbjct: 1   MAERNQEAYPFAPYANGQAMARSDAE-SSRAHSDHELRKKKRIKCLIYIAVFAVFQIIVI 59

Query: 545 LIFSLIIMRVRTPKVRMDNVTV-----TSGANG--DVRFGARVLVKNTNFGRYKFESTLA 387
            +F+L +M++++PK R+ ++TV     ++ AN    + F A V VKN NFGRYK++ T  
Sbjct: 60  TVFALTVMKIKSPKFRIKSITVQDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQTSI 119

Query: 386 TIRTADNNVVQFPIQEARARARSTK--------KIXXXXXXXXXXXGTLELTVEAKLRGK 231
           +       V    + +A AR ++T+        K            G++ L+  +K+ GK
Sbjct: 120 SFIYEGTQVGDAVVPKATARTKATRKEIVSGAVKTVNSNLASDISAGSVTLSTYSKINGK 179

Query: 230 VEFMRVIKRKKTADMNCTLTLVLATNSVQNLRC 132
           V  M +IK+KK+A+M CT+ + L++  VQ+++C
Sbjct: 180 VYLMNMIKKKKSAEMKCTMVVHLSSKQVQDIKC 212


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  113 bits (283), Expect = 7e-23
 Identities = 68/187 (36%), Positives = 104/187 (55%), Gaps = 20/187 (10%)
 Frame = -2

Query: 629 SDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTV-----TSGAN 465
           S  ++K+KKR+K   Y          VIL+FSL +MR++ PK R+ ++TV     TS  N
Sbjct: 15  SAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPN 74

Query: 464 G---DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIXXXX 294
               +++F A V VKNTNFG +KF++T  +       V +  + + RA+ARSTKK+    
Sbjct: 75  PPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTV 134

Query: 293 XXXXXXXGT------------LELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNS 150
                                L LT   KL GKV  M++IK+KK+A MNCT+T+ LA+ +
Sbjct: 135 DLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRA 194

Query: 149 VQNLRCK 129
           +Q+++C+
Sbjct: 195 IQDIKCQ 201


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  113 bits (283), Expect = 7e-23
 Identities = 68/184 (36%), Positives = 105/184 (57%), Gaps = 20/184 (10%)
 Frame = -2

Query: 623 EQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTV------TSGANG 462
           +++K+KKR+KCL Y          +IL+F+L +MR++ PK R+ +V V       S  + 
Sbjct: 14  KELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSF 73

Query: 461 DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQE--ARARARSTKKIXXXXXX 288
           +++F A+V VKNTNFG YKFE++  T     + V +  + +  ARARARSTKK+      
Sbjct: 74  NMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTMDL 133

Query: 287 XXXXXGT------------LELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQ 144
                              L LT ++ L GKV  M+VIK+KK+ +MNCT+T+ LA   V+
Sbjct: 134 NSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKLVR 193

Query: 143 NLRC 132
           +++C
Sbjct: 194 DIKC 197


>ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca
           subsp. vesca]
          Length = 212

 Score =  113 bits (282), Expect = 1e-22
 Identities = 74/217 (34%), Positives = 110/217 (50%), Gaps = 18/217 (8%)
 Frame = -2

Query: 725 MADKYQPEVQGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVI 546
           MA+K++     + LA         +E      S+E++K++KRIK  TY          V+
Sbjct: 1   MAEKFK-----HALASVKGYATKKDEQLPTFQSEEELKRQKRIKLFTYIGIFIGFQIIVM 55

Query: 545 LIFSLIIMRVRTPKVRMDNVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLAT 384
            +F L +M+V+TPKVR+    V +        + D  F  ++ +KNTN+G YKF++  AT
Sbjct: 56  TVFGLTVMKVKTPKVRLGATNVQNLNFVPTSPSFDTTFATQIRIKNTNWGPYKFDAGTAT 115

Query: 383 IRTADNNVVQFPIQEARARARSTKKIXXXXXXXXXXXGT------------LELTVEAKL 240
                  V Q    +++A  RSTKKI            +            L LT EAKL
Sbjct: 116 FMYQGVAVGQVSFPKSKAGMRSTKKINAEVSLNSNEIPSTSNLGSELSSGVLTLTSEAKL 175

Query: 239 RGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRCK 129
            GKVE M ++K+KK+A MNCT+ L L+T ++Q L CK
Sbjct: 176 TGKVELMLIMKKKKSATMNCTMKLDLSTKTIQALECK 212


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  112 bits (281), Expect = 1e-22
 Identities = 64/182 (35%), Positives = 98/182 (53%), Gaps = 20/182 (10%)
 Frame = -2

Query: 614 KKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTV--------TSGANGD 459
           ++K+ IKCL Y          +IL+F +++MR+R PKVR+  VTV        +S  +  
Sbjct: 10  RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69

Query: 458 VRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIXXXXXXXXX 279
           +   A+V VKNTNFG +KF+++  TI      V +  I +ARARARST K+         
Sbjct: 70  MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSD 129

Query: 278 XXG------------TLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLR 135
                          T+ L+  AKL GK+   +V K+KK+A+MNCT+ +  ++  +QNL 
Sbjct: 130 KMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLM 189

Query: 134 CK 129
           C+
Sbjct: 190 CQ 191


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  112 bits (281), Expect = 1e-22
 Identities = 70/208 (33%), Positives = 112/208 (53%), Gaps = 18/208 (8%)
 Frame = -2

Query: 698 QGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 519
           Q YPLAPA+   RSD   G +  S +++K++KRI+  TY          V+ +F L +M+
Sbjct: 7   QAYPLAPANGYTRSD---GESLVSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMK 63

Query: 518 VRTPKVRMDNV------TVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 357
           V+TPKVR+  +      +V +  + D  F  ++ VKNTN+G YKF+++  T       V 
Sbjct: 64  VKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVG 123

Query: 356 QFPIQEARARARSTKKIXXXXXXXXXXXGT------------LELTVEAKLRGKVEFMRV 213
           Q  + + +A  RSTKK+            +            L L  +AKL GKVE M +
Sbjct: 124 QVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLI 183

Query: 212 IKRKKTADMNCTLTLVLATNSVQNLRCK 129
           +K+KK++ M+C +   L+T +V++L+CK
Sbjct: 184 MKKKKSSTMDCMIGFDLSTKTVKSLQCK 211


>ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  112 bits (280), Expect = 2e-22
 Identities = 69/208 (33%), Positives = 110/208 (52%), Gaps = 18/208 (8%)
 Frame = -2

Query: 698 QGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 519
           Q YP AP++   RSD   G +  S++++K+KKRIK  TY          V+ +F L +M+
Sbjct: 7   QAYPTAPSNGYARSD---GESLVSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMK 63

Query: 518 VRTPKVRMDNVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 357
           V+TPK R  ++ V +        + D  F  ++ +KNTN+G YKF++  AT       + 
Sbjct: 64  VKTPKARWGSIDVETLNYVPATPSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIG 123

Query: 356 QFPIQEARARARSTKKIXXXXXXXXXXXGT------------LELTVEAKLRGKVEFMRV 213
           +  I +++A  RSTKKI                         L LT + +L+GKVE M +
Sbjct: 124 KVDIPKSKAGMRSTKKIDVEVSLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLI 183

Query: 212 IKRKKTADMNCTLTLVLATNSVQNLRCK 129
           +K+ K A M+CT+   L++ +VQ+L+CK
Sbjct: 184 MKKNKNASMDCTIAFDLSSKTVQSLQCK 211


>ref|XP_004298841.1| PREDICTED: uncharacterized protein LOC101294558 [Fragaria vesca
           subsp. vesca]
          Length = 203

 Score =  109 bits (273), Expect = 1e-21
 Identities = 71/208 (34%), Positives = 113/208 (54%), Gaps = 9/208 (4%)
 Frame = -2

Query: 725 MADKYQPEVQGYPLAPASVVPRSDEEYGNNHHSDEQMKKKKRIKCLTYXXXXXXXXXXVI 546
           MA+K Q   Q Y  + A+   RS ++  +   SDE++K++KRIK  TY          V+
Sbjct: 1   MAEKNQ---QAY--SSANGYTRSTDQESSPFQSDEELKRQKRIKLFTYIGIFIVFQIVVM 55

Query: 545 LIFSLIIMRVRTPKVRMDNVT------VTSGANGDVRFGARVLVKNTNFGRYKFESTLAT 384
            +F L +M+V+TPK R   +T      V +  + D  F  ++ +KNTN+G YKF++  AT
Sbjct: 56  TVFGLTVMKVKTPKARWGEITVKTLNSVPAAPSFDTTFETQIRIKNTNWGPYKFDAGTAT 115

Query: 383 IRTADNNVVQFPIQEARARARSTKKIXXXXXXXXXXXGT---LELTVEAKLRGKVEFMRV 213
                  + +  I +++A  R TKKI            +   L LT EAKL GKV  M +
Sbjct: 116 FLYQGVTIGKVDIPKSKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMGM 175

Query: 212 IKRKKTADMNCTLTLVLATNSVQNLRCK 129
           +K+KK+A MNCT+ + ++  +V+++ CK
Sbjct: 176 MKKKKSASMNCTIQIDVSGPTVKSVVCK 203


>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score =  109 bits (272), Expect = 1e-21
 Identities = 64/181 (35%), Positives = 103/181 (56%), Gaps = 18/181 (9%)
 Frame = -2

Query: 617 MKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTV------TSGANGDV 456
           +++KK +KCL Y          +IL+F L+++++R PKVR+ +++V      T+  + D+
Sbjct: 8   VRRKKSLKCLAYVAAFVVFQTGIILLFVLLVLKIRDPKVRIASISVENQHFSTNSFSMDL 67

Query: 455 RFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIXXXXXXXXXX 276
           +  ARV VKNTNFG +KF+++ ATI      V +  I +ARAR+RSTK+           
Sbjct: 68  K--ARVTVKNTNFGHFKFDNSTATISYFGTAVGEATILKARARSRSTKRFNITVPISSSK 125

Query: 275 XGT------------LELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNLRC 132
                          L L+  AKL GK+   ++ K+KK+A+M+CT+ L   T+S++NL C
Sbjct: 126 VNNHRQLRRDLNSGVLNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENLSC 185

Query: 131 K 129
           K
Sbjct: 186 K 186


Top