BLASTX nr result

ID: Mentha26_contig00000825

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00000825
         (496 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus...   165   7e-39
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   121   9e-26
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   117   2e-24
ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579...   115   8e-24
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...   114   2e-23
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   112   7e-23
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     107   2e-21
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   106   3e-21
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   106   3e-21
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   106   4e-21
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   105   8e-21
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   103   2e-20
ref|XP_004298841.1| PREDICTED: uncharacterized protein LOC101294...   103   2e-20
ref|XP_004309174.1| PREDICTED: uncharacterized protein LOC101303...   103   3e-20
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   102   7e-20
ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295...   102   7e-20
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   101   9e-20
ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295...   101   9e-20
ref|XP_007203004.1| hypothetical protein PRUPE_ppa017380mg, part...   101   9e-20
ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r...   100   2e-19

>gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus guttatus]
          Length = 208

 Score =  165 bits (417), Expect = 7e-39
 Identities = 94/173 (54%), Positives = 120/173 (69%), Gaps = 9/173 (5%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYG--NNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRV 321
           PLAP S VPRSDEEY   NN  + E+MKK KR+KC  Y          +ILI +L VMRV
Sbjct: 14  PLAP-STVPRSDEEYSGTNNYRAQEEMKKNKRMKCFAYIACFAVFQIIIILILALTVMRV 72

Query: 320 RTPKVRMDNVTVTSG-ANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFSIQE 144
           ++PK+R+ ++TVT    +G+VR  ARVLVKNTNFGRYKF+S LATIR+  +NVGQF I E
Sbjct: 73  KSPKLRLGDITVTKDHVSGNVRLTARVLVKNTNFGRYKFDSGLATIRSGASNVGQFVIPE 132

Query: 143 ARARARSTKKIAVVASLGASAT------GTLELTVEAKLRGKVEFMRVIKRKK 3
           +RARARSTKK+ V   L +S +      G   L VE++LRGKVE ++V+K+ K
Sbjct: 133 SRARARSTKKMYVTVDLNSSNSSNNSMGGVWTLNVESQLRGKVELLKVVKKTK 185


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  121 bits (304), Expect = 9e-26
 Identities = 79/185 (42%), Positives = 108/185 (58%), Gaps = 21/185 (11%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRT 315
           PLAPA+  PRSDEE  + +   +++K+KKRIK   Y          VILIF+L VMRV+ 
Sbjct: 10  PLAPANGHPRSDEESASLQ--SKELKRKKRIKYAVYIAAFAVFQTVVILIFALTVMRVKN 67

Query: 314 PKVRMDNVTVTS--------GANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQ 159
           PKVR+  VTV +         A+ ++RF  +V VKNTNFG YKF++   +       VG+
Sbjct: 68  PKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGE 127

Query: 158 FSIQEARARARSTKKIAVVASLGASA-------------TGTLELTVEAKLRGKVEFMRV 18
             I +ARARARSTKK+ V   + +SA             +  L L  +AKL+GKVE M+V
Sbjct: 128 AIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKV 187

Query: 17  IKRKK 3
           +K+KK
Sbjct: 188 MKKKK 192


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  117 bits (292), Expect = 2e-24
 Identities = 73/183 (39%), Positives = 100/183 (54%), Gaps = 19/183 (10%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRT 315
           PL PA+      +E     HS E +KKKKR+KCL Y          +IL+F+L VMR+R 
Sbjct: 10  PLVPAANGHERSDEESVAAHSKE-LKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRIRN 68

Query: 314 PKVRMD-------NVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQF 156
           PK R+        NV   +  + D++   +  VKNTNFG +K+E  L T       VG+ 
Sbjct: 69  PKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVGRA 128

Query: 155 SIQEARARARSTKKIAVVASLGAS------------ATGTLELTVEAKLRGKVEFMRVIK 12
           +IQ+ARARARSTKK+ VV  L ++            + G L LT  +KL GK+  M+VIK
Sbjct: 129 TIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKVIK 188

Query: 11  RKK 3
           +KK
Sbjct: 189 KKK 191


>ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579067 [Solanum tuberosum]
          Length = 197

 Score =  115 bits (287), Expect = 8e-24
 Identities = 61/170 (35%), Positives = 101/170 (59%), Gaps = 6/170 (3%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRT 315
           PLAP++++PRSD E+  N       ++KK+++              +IL+F    +R+++
Sbjct: 8   PLAPSNIMPRSDAEFATNNFQSNNQRRKKKLRST---FLLTIFLTGIILLFCFTFLRIKS 64

Query: 314 PKVRMDNVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTAD-NNVGQFSIQEAR 138
           PK+R++N+ +T+  +G + F A+V ++N NF RY ++STL TI TA+   +G+F I +  
Sbjct: 65  PKIRIENIRITNDGDGRINFSAQVFLRNRNFWRYGYDSTLGTINTAEGTTIGRFVIPDGE 124

Query: 137 ARARSTKKIAV-----VASLGASATGTLELTVEAKLRGKVEFMRVIKRKK 3
            R RSTK I V     + S   + +G L +  EAK+RGKV+  RV + KK
Sbjct: 125 VRRRSTKTIYVMENFILPSRLNNTSGILPVISEAKIRGKVKVFRVFRWKK 174


>ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca
           subsp. vesca]
          Length = 200

 Score =  114 bits (284), Expect = 2e-23
 Identities = 68/172 (39%), Positives = 96/172 (55%), Gaps = 8/172 (4%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRT 315
           PLAP++   RSD E      S++++K+KKRIKC  Y          V  +F L V++V+T
Sbjct: 10  PLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAVGAVFGLTVLKVKT 65

Query: 314 PKVRMDNVTVTSGANGDVR-----FGARVLVKNTNFGRYKFESTLATIRTADNNVGQFSI 150
           PKVR+D  +  SG           F  ++ VKNTN+G YKF+  + T +     VG F++
Sbjct: 66  PKVRLDTTSTLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVVTFKYQGTPVGTFTV 125

Query: 149 QEARARARSTKKIAVVASLGASA---TGTLELTVEAKLRGKVEFMRVIKRKK 3
            + +A  R TKKI    SL  +A   +G L LT EAKL GKV  M ++K+KK
Sbjct: 126 PKGKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMFIMKKKK 177


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  112 bits (279), Expect = 7e-23
 Identities = 72/182 (39%), Positives = 103/182 (56%), Gaps = 18/182 (9%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRT 315
           PLAPA+   RSD   G +  S++++K++KR K   Y          V+ +F L VM+V+T
Sbjct: 10  PLAPANGYTRSD---GESLVSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMKVKT 66

Query: 314 PKVRMDNVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFS 153
           PKVR+  + V S        + D  F  ++ VKNTN+G YKF+++ AT       VGQ S
Sbjct: 67  PKVRLGGINVQSLNSVPATPSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVGQVS 126

Query: 152 IQEARARARSTKKIAVVASLGASA------------TGTLELTVEAKLRGKVEFMRVIKR 9
           I +++AR RSTKKI+V   L  +A            +G L LT +AKL GKVE M ++K+
Sbjct: 127 IPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLIMKK 186

Query: 8   KK 3
           KK
Sbjct: 187 KK 188


>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score =  107 bits (267), Expect = 2e-21
 Identities = 70/184 (38%), Positives = 101/184 (54%), Gaps = 20/184 (10%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRT 315
           PLAPA+  PRSDEE  N     +++K++KRIK   Y          V L+F L+VMRV++
Sbjct: 10  PLAPANGHPRSDEESSNL--DAKELKRRKRIKLAIYAFIFTASQIIVTLVFVLVVMRVKS 67

Query: 314 PKVRMDN------VTVTSGANG--DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQ 159
           PK+R+ +      +   SG+    D+ F  ++ VKNTN+G YKF++T A        VGQ
Sbjct: 68  PKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTTAAFAYEGETVGQ 127

Query: 158 FSIQEARARARSTKKIAVVASLGAS------------ATGTLELTVEAKLRGKVEFMRVI 15
             I + +A  RSTKK+ V  SL +S            + G L L   AK+ GKV+ M ++
Sbjct: 128 VVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTAKMTGKVKLMLIM 187

Query: 14  KRKK 3
           K+KK
Sbjct: 188 KKKK 191


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  106 bits (265), Expect = 3e-21
 Identities = 67/182 (36%), Positives = 101/182 (55%), Gaps = 18/182 (9%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRT 315
           PLAPA+   RSD   G +  S +++K++KRI+  TY          V+ +F L VM+V+T
Sbjct: 10  PLAPANGYTRSD---GESLVSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMKVKT 66

Query: 314 PKVRMDNV------TVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFS 153
           PKVR+  +      +V +  + D  F  ++ VKNTN+G YKF+++  T       VGQ +
Sbjct: 67  PKVRLGEINVQDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVGQVT 126

Query: 152 IQEARARARSTKKIAVVASLGASA------------TGTLELTVEAKLRGKVEFMRVIKR 9
           + + +A  RSTKK+ V  SL A+             +G L L  +AKL GKVE M ++K+
Sbjct: 127 VPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLIMKK 186

Query: 8   KK 3
           KK
Sbjct: 187 KK 188


>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  106 bits (265), Expect = 3e-21
 Identities = 68/183 (37%), Positives = 96/183 (52%), Gaps = 19/183 (10%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRT 315
           PLAP++   RSD E      S++++K+KKRIKC  Y          V+ +F L +M+V+T
Sbjct: 10  PLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMKVKT 65

Query: 314 PKVRMDNVTVT------SGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFS 153
           PKVR+   T+T      +  + D  F  ++ VKNTN+G YKF+  + T       VG   
Sbjct: 66  PKVRLGTSTLTDFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVGTVV 125

Query: 152 IQEARARARSTKKIAVVASLGASAT-------------GTLELTVEAKLRGKVEFMRVIK 12
           + + +A  R TKKI V   L  +A              G L LT EAKL GKVE M ++K
Sbjct: 126 VPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELMLIMK 185

Query: 11  RKK 3
           +KK
Sbjct: 186 KKK 188


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  106 bits (264), Expect = 4e-21
 Identities = 66/157 (42%), Positives = 88/157 (56%), Gaps = 19/157 (12%)
 Frame = -3

Query: 416 KKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRTPKVRMDNVTVTSGANG-------DVR 258
           K+   KCL Y          +ILIF+L VMR++ PKVR   VTV + + G       D+R
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 257 FGARVLVKNTNFGRYKFESTLATIRTADNNVGQFSIQEARARARSTKKIAVVASLGAS-- 84
             A+V VKNTNFG +K+E++   I      VG+ +I +ARARAR TKK  V   + +S  
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 83  ----------ATGTLELTVEAKLRGKVEFMRVIKRKK 3
                     A+G L L+ EAKL GKV  M+VIK+KK
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKK 162


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  105 bits (261), Expect = 8e-21
 Identities = 64/162 (39%), Positives = 95/162 (58%), Gaps = 20/162 (12%)
 Frame = -3

Query: 428 EQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRTPKVRMDNVTV------TSGANG 267
           +++K+KKR+KCL Y          +IL+F+L VMR++ PK R+ +V V       S  + 
Sbjct: 14  KELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSF 73

Query: 266 DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFSIQE--ARARARSTKKIAVVASL 93
           +++F A+V VKNTNFG YKFE++  T     + VG+  + +  ARARARSTKK+ V   L
Sbjct: 74  NMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTMDL 133

Query: 92  GASA------------TGTLELTVEAKLRGKVEFMRVIKRKK 3
            ++             +G L LT ++ L GKV  M+VIK+KK
Sbjct: 134 NSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKK 175


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  103 bits (258), Expect = 2e-20
 Identities = 60/159 (37%), Positives = 89/159 (55%), Gaps = 20/159 (12%)
 Frame = -3

Query: 419 KKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRTPKVRMDNVTV--------TSGANGD 264
           ++K+ IKCL Y          +IL+F ++VMR+R PKVR+  VTV        +S  +  
Sbjct: 10  RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69

Query: 263 VRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFSIQEARARARSTKKIAVVASLGAS 84
           +   A+V VKNTNFG +KF+++  TI      VG+ +I +ARARARST K+ V  S+ + 
Sbjct: 70  MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSD 129

Query: 83  ------------ATGTLELTVEAKLRGKVEFMRVIKRKK 3
                        +GT+ L+  AKL GK+   +V K+KK
Sbjct: 130 KMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKK 168


>ref|XP_004298841.1| PREDICTED: uncharacterized protein LOC101294558 [Fragaria vesca
           subsp. vesca]
          Length = 203

 Score =  103 bits (257), Expect = 2e-20
 Identities = 62/164 (37%), Positives = 92/164 (56%), Gaps = 9/164 (5%)
 Frame = -3

Query: 467 RSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRTPKVRMDNVT 288
           RS ++  +   SDE++K++KRIK  TY          V+ +F L VM+V+TPK R   +T
Sbjct: 17  RSTDQESSPFQSDEELKRQKRIKLFTYIGIFIVFQIVVMTVFGLTVMKVKTPKARWGEIT 76

Query: 287 ------VTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFSIQEARARAR 126
                 V +  + D  F  ++ +KNTN+G YKF++  AT       +G+  I +++A  R
Sbjct: 77  VKTLNSVPAAPSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIGKVDIPKSKAGMR 136

Query: 125 STKKIAVVASLGASA---TGTLELTVEAKLRGKVEFMRVIKRKK 3
            TKKI    SL  +A   +G L LT EAKL GKV  M ++K+KK
Sbjct: 137 GTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMGMMKKKK 180


>ref|XP_004309174.1| PREDICTED: uncharacterized protein LOC101303468 [Fragaria vesca
           subsp. vesca]
          Length = 178

 Score =  103 bits (256), Expect = 3e-20
 Identities = 60/153 (39%), Positives = 91/153 (59%), Gaps = 11/153 (7%)
 Frame = -3

Query: 428 EQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRTPKVRMDNVTVTS--GANGDVR- 258
           ++  ++KR +CL            +I++F +IVMRV+TPKVR+++V VT+   +N  ++ 
Sbjct: 3   DEESRRKRTRCLACIAFGVIAQTIIIVLFVVIVMRVKTPKVRLESVGVTTLTASNSSLKA 62

Query: 257 -FGARVLVKNTNFGRYKFESTLATIRTADNNVGQFSIQEARARARSTKKIAVVASLGAS- 84
              A V VKN NFG YKFES  AT      N+G+ +I + +A+A+ TKKI V  SL +  
Sbjct: 63  SIDALVTVKNKNFGHYKFESAKATFSYKGTNIGEGTISKDKAKAKKTKKINVTVSLNSDK 122

Query: 83  ------ATGTLELTVEAKLRGKVEFMRVIKRKK 3
                 ++G + LT  AKL GKV  + +IK+KK
Sbjct: 123 ITASDISSGNVTLTAYAKLDGKVHLLNIIKKKK 155


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  102 bits (253), Expect = 7e-20
 Identities = 63/164 (38%), Positives = 92/164 (56%), Gaps = 20/164 (12%)
 Frame = -3

Query: 434 SDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRTPKVRMDNVTV-----TSGAN 270
           S  ++K+KKR+K   Y          VIL+FSL VMR++ PK R+ ++TV     TS  N
Sbjct: 15  SAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPN 74

Query: 269 G---DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFSIQEARARARSTKKIAVVA 99
               +++F A V VKNTNFG +KF++T  +       VG+  + + RA+ARSTKK+ V  
Sbjct: 75  PPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTV 134

Query: 98  SLGAS------------ATGTLELTVEAKLRGKVEFMRVIKRKK 3
            L ++            ++G L LT   KL GKV  M++IK+KK
Sbjct: 135 DLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKK 178


>ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  102 bits (253), Expect = 7e-20
 Identities = 64/182 (35%), Positives = 99/182 (54%), Gaps = 18/182 (9%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRT 315
           P AP++   RSD   G +  S++++K+KKRIK  TY          V+ +F L VM+V+T
Sbjct: 10  PTAPSNGYARSD---GESLVSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMKVKT 66

Query: 314 PKVRMDNVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFS 153
           PK R  ++ V +        + D  F  ++ +KNTN+G YKF++  AT       +G+  
Sbjct: 67  PKARWGSIDVETLNYVPATPSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIGKVD 126

Query: 152 IQEARARARSTKKIAVVASLGASA------------TGTLELTVEAKLRGKVEFMRVIKR 9
           I +++A  RSTKKI V  SL  +A            +G L LT + +L+GKVE M ++K+
Sbjct: 127 IPKSKAGMRSTKKIDVEVSLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLIMKK 186

Query: 8   KK 3
            K
Sbjct: 187 NK 188


>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  101 bits (252), Expect = 9e-20
 Identities = 66/179 (36%), Positives = 100/179 (55%), Gaps = 15/179 (8%)
 Frame = -3

Query: 494 PLAPASVVPRSDEEYGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRT 315
           P A    + RSD E  +  HSD +++KKKRIKCL Y          VI +F+L VM++++
Sbjct: 13  PYANGQAMARSDAE-SSRAHSDHELRKKKRIKCLIYIAVFAVFQIIVITVFALTVMKIKS 71

Query: 314 PKVRMDNVTV-----TSGANG--DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQF 156
           PK R+ ++TV     ++ AN    + F A V VKN NFGRYK++ T  +       VG  
Sbjct: 72  PKFRIKSITVQDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQTSISFIYEGTQVGDA 131

Query: 155 SIQEARARARSTKK------IAVVASLGAS--ATGTLELTVEAKLRGKVEFMRVIKRKK 3
            + +A AR ++T+K      +  V S  AS  + G++ L+  +K+ GKV  M +IK+KK
Sbjct: 132 VVPKATARTKATRKEIVSGAVKTVNSNLASDISAGSVTLSTYSKINGKVYLMNMIKKKK 190


>ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca
           subsp. vesca]
          Length = 212

 Score =  101 bits (252), Expect = 9e-20
 Identities = 62/162 (38%), Positives = 90/162 (55%), Gaps = 18/162 (11%)
 Frame = -3

Query: 434 SDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRTPKVRMDNVTVTS------GA 273
           S+E++K++KRIK  TY          V+ +F L VM+V+TPKVR+    V +        
Sbjct: 28  SEEELKRQKRIKLFTYIGIFIGFQIIVMTVFGLTVMKVKTPKVRLGATNVQNLNFVPTSP 87

Query: 272 NGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFSIQEARARARSTKKIAVVASL 93
           + D  F  ++ +KNTN+G YKF++  AT       VGQ S  +++A  RSTKKI    SL
Sbjct: 88  SFDTTFATQIRIKNTNWGPYKFDAGTATFMYQGVAVGQVSFPKSKAGMRSTKKINAEVSL 147

Query: 92  GAS------------ATGTLELTVEAKLRGKVEFMRVIKRKK 3
            ++            ++G L LT EAKL GKVE M ++K+KK
Sbjct: 148 NSNEIPSTSNLGSELSSGVLTLTSEAKLTGKVELMLIMKKKK 189


>ref|XP_007203004.1| hypothetical protein PRUPE_ppa017380mg, partial [Prunus persica]
           gi|462398535|gb|EMJ04203.1| hypothetical protein
           PRUPE_ppa017380mg, partial [Prunus persica]
          Length = 192

 Score =  101 bits (252), Expect = 9e-20
 Identities = 64/161 (39%), Positives = 90/161 (55%), Gaps = 20/161 (12%)
 Frame = -3

Query: 428 EQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIVMRVRTPKVRMDNVTV------TSGANG 267
           +++K+KKRIK   Y          VI  F+L VMRV++PK+R+  ++V      +S  + 
Sbjct: 32  QELKRKKRIKLAIYISAFVVVQIIVITTFALTVMRVQSPKLRLGAISVQTLNASSSTPSF 91

Query: 266 DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVGQFSIQEARARARSTKKIAVVASLGA 87
           D+ F  +V +KNTNFGRYKF++T       D  VGQ  I +++A  RSTKKI V  SL +
Sbjct: 92  DMTFTTQVRIKNTNFGRYKFDATNVRFMYEDRAVGQVRIPKSKAGMRSTKKIDVTVSLNS 151

Query: 86  S--------------ATGTLELTVEAKLRGKVEFMRVIKRK 6
                           TG L L+ EA+L GKVE M V+K+K
Sbjct: 152 KELPSRSRYNLGNELKTGVLSLSSEARLAGKVELMFVMKKK 192


>ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 226

 Score =  100 bits (249), Expect = 2e-19
 Identities = 61/152 (40%), Positives = 86/152 (56%), Gaps = 19/152 (12%)
 Frame = -3

Query: 401 KCLTYXXXXXXXXXXVILIFSLIVMRVRTPKVRMDNVTV-------TSGANGDVRFGARV 243
           KCL Y          +IL+F+L VMR+R+PKVR   VTV       +S  + D++  A+V
Sbjct: 11  KCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFSTVNSSSPSFDMKLMAQV 70

Query: 242 LVKNTNFGRYKFESTLATIRTADNNVGQFSIQEARARARSTKKIAVVASLGASA------ 81
            VKNTNFG +K+E++  TI      VG+ +I + RARAR TKK  +   + +S       
Sbjct: 71  AVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDISSSRLSSNSN 130

Query: 80  ------TGTLELTVEAKLRGKVEFMRVIKRKK 3
                  G L L+ +AKL+GKV  M+VIK+KK
Sbjct: 131 LGNDINAGVLPLSSQAKLKGKVHLMKVIKKKK 162

