BLASTX nr result

ID: Mentha26_contig00035379 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00035379
         (700 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg [Mimulus...   139   9e-31
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...   134   2e-29
gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus...   117   3e-24
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   114   4e-23
ref|XP_002513626.1| conserved hypothetical protein [Ricinus comm...   112   2e-22
ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578...   107   5e-21
gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus...   106   8e-21
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   105   1e-20
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   105   2e-20
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   104   3e-20
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   103   7e-20
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   103   7e-20
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...    97   5e-18
ref|XP_007210080.1| hypothetical protein PRUPE_ppa019661mg [Prun...    95   3e-17
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...    94   3e-17
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...    93   7e-17
ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r...    92   1e-16
ref|XP_007210107.1| hypothetical protein PRUPE_ppa021960mg [Prun...    92   2e-16
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...    91   3e-16
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...    90   8e-16

>gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg [Mimulus guttatus]
          Length = 202

 Score =  139 bits (350), Expect = 9e-31
 Identities = 85/203 (41%), Positives = 119/203 (58%), Gaps = 3/203 (1%)
 Frame = -3

Query: 698 GHGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFR 519
           GHG  RSD EA  GGA  + R++N R KCFLY              F+ TVMKIRTPKFR
Sbjct: 6   GHG--RSDAEA--GGAATEPRKKN-RTKCFLYIALFVIFQIGVITIFSLTVMKIRTPKFR 60

Query: 518 VLSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXX 339
           + SA   ++  A  PA+ S S T++AE  +KN+NFGR+K+  T V F+Y G         
Sbjct: 61  IRSAHL-TNFNAGTPASPSFSATVNAEFTVKNANFGRYKYRNTTVDFFYRGTPVGQVLVR 119

Query: 338 EGHANWRSTXXXXXXXXXXVASSA---RLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKR 168
           +  A WRST          + ++    +L  DL+AGVV++ S+A MRG V +  ++KK +
Sbjct: 120 DSRAGWRSTKKFNVAVNLSLTNAQANPQLASDLNAGVVQISSQARMRGRVELIFVMKKNK 179

Query: 167 SSFMNCTMGIMIGARKLRNIVCE 99
           S+ MNCTM I+   ++LRNI+C+
Sbjct: 180 STDMNCTMEIVTATQQLRNILCK 202


>gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus]
          Length = 213

 Score =  134 bits (338), Expect = 2e-29
 Identities = 78/203 (38%), Positives = 114/203 (56%), Gaps = 3/203 (1%)
 Frame = -3

Query: 698 GHGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFR 519
           GHG  RSD EA       + +R+ KR KCF+Y              F+ TVMKIRTPKFR
Sbjct: 14  GHG--RSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTVMKIRTPKFR 71

Query: 518 VLSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXX 339
           + SA   ++  A  P + S S T++AE  +KN+NFGR+K+  T V F+Y G         
Sbjct: 72  IRSAHL-TTFHAGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGTPVGQVFVR 130

Query: 338 EGHANWRSTXXXXXXXXXXVASSA---RLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKR 168
           +  A WRST          +A++    +L  DL+AGVV++ S+A M G V +  ++KK +
Sbjct: 131 DSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNAGVVQITSQARMAGRVELIFVMKKNK 190

Query: 167 SSFMNCTMGIMIGARKLRNIVCE 99
           S+ MNC M I+   +++RN+VC+
Sbjct: 191 STDMNCNMEIVTATQQIRNLVCK 213


>gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus]
          Length = 214

 Score =  117 bits (294), Expect = 3e-24
 Identities = 74/202 (36%), Positives = 110/202 (54%), Gaps = 3/202 (1%)
 Frame = -3

Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           HG  RSD EA  GGA      + KR +C +Y              F+ TVMKIR P+FR+
Sbjct: 18  HG--RSDTEA--GGAAASELHKRKRTQCLIYIGLLAIIQIAVVIVFSLTVMKIRNPRFRI 73

Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336
            SA   ++  A  PA+ + +  ++AE  +KN+NFGR+K+  T V F Y G         E
Sbjct: 74  RSAHL-TNFNAGTPASPAFTGKLNAEFSVKNANFGRYKYMDTTVDFVYRGTRVGEVFVRE 132

Query: 335 GHANWRSTXXXXXXXXXXVASSA---RLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165
             A WR+T          +A++    +L  DL+AGVV + S A M G+V +  ++KK RS
Sbjct: 133 SRAGWRTTKKFNVAVDLSLANARANPQLASDLNAGVVPISSEARMSGSVELLFVLKKNRS 192

Query: 164 SFMNCTMGIMIGARKLRNIVCE 99
           + +NCTM I+   +++RNI+C+
Sbjct: 193 TGLNCTMEIVTATQQIRNILCK 214


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  114 bits (284), Expect = 4e-23
 Identities = 72/202 (35%), Positives = 105/202 (51%), Gaps = 3/202 (1%)
 Frame = -3

Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           +G ERSD E+    + E +++  KR KC LY              F  TVM+IR PKFRV
Sbjct: 16  NGHERSDEESVAAHSKELKKK--KRMKCLLYIVLFAVFQTGIILLFALTVMRIRNPKFRV 73

Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336
            S  S ++      A+ S  L M+ +  +KN+NFG FK+E  LV F Y G         +
Sbjct: 74  RSG-SFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVGRATIQK 132

Query: 335 GHANWRSTXXXXXXXXXXV---ASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165
             A  RST               ++  L RD+SAGV+ L S +++ G + +  ++KKK+S
Sbjct: 133 ARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKVIKKKKS 192

Query: 164 SFMNCTMGIMIGARKLRNIVCE 99
           + MNCTM + I  R +RNI+C+
Sbjct: 193 TQMNCTMDVAIDTRTVRNIICK 214


>ref|XP_002513626.1| conserved hypothetical protein [Ricinus communis]
           gi|223547534|gb|EEF49029.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 217

 Score =  112 bits (279), Expect = 2e-22
 Identities = 70/206 (33%), Positives = 104/206 (50%), Gaps = 8/206 (3%)
 Frame = -3

Query: 692 GQERSDGEAADGGATEQRR-RRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           GQ RSD E+   G  + +  R+ KR KC  +              F FTV++ + PKFRV
Sbjct: 15  GQTRSDEESGTAGTAQTKELRKKKRMKCIAFVVAFTIFQTGIILLFVFTVLRFKDPKFRV 74

Query: 515 LSATSDSS----GEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXX 348
            SA+ D +     +A AP   S +LTM+ + G+KN+NFG FK+E + V F Y G      
Sbjct: 75  RSASFDDTFHVGTDAAAP---SFNLTMNTQFGVKNTNFGHFKYETSTVTFEYRGTVVGLV 131

Query: 347 XXXEGHANWRSTXXXXXXXXXXV---ASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVK 177
              +  A  RST                   L  D+S+G + L S + + G + +  ++K
Sbjct: 132 NVDKARARARSTRKFDAIVVLRTDRLPDGFELSSDISSGKIPLSSSSRLDGEIHLMKVIK 191

Query: 176 KKRSSFMNCTMGIMIGARKLRNIVCE 99
           KK+S+ MNCTM + I  R L++IVC+
Sbjct: 192 KKKSAEMNCTMNVDIQTRTLQDIVCK 217


>ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578608 [Solanum tuberosum]
          Length = 204

 Score =  107 bits (266), Expect = 5e-21
 Identities = 59/179 (32%), Positives = 99/179 (55%)
 Frame = -3

Query: 635 RRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSATSDSSGEAVAPANGSLS 456
           RR KRNK  +Y              F+  +MKIRTPKF V SAT D     +   N S +
Sbjct: 31  RRKKRNKILVYVALFIVFQIAVLLFFSLYIMKIRTPKFSVRSATFD----LMVTENASFN 86

Query: 455 LTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHANWRSTXXXXXXXXXXVA 276
           +TM+AEL +KN+NFG + ++ + +YFYY+          +G A ++S+           +
Sbjct: 87  ITMNAELSVKNANFGPYNYKNSTIYFYYNDVSIGEAFVYQGKAGFKSS-KKFNVIVNLSS 145

Query: 275 SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSFMNCTMGIMIGARKLRNIVCE 99
             ++L  DL++G + L S++++ G V++   +KKK+S+ MNC + I +  + +R+I C+
Sbjct: 146 KESKLRNDLNSGTLILTSKSKLEGKVKLIFFMKKKKSTEMNCAIIIGLAGKVVRDIQCD 204


>gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus guttatus]
          Length = 214

 Score =  106 bits (264), Expect = 8e-21
 Identities = 67/200 (33%), Positives = 101/200 (50%), Gaps = 6/200 (3%)
 Frame = -3

Query: 683 RSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSAT 504
           R D E A     ++  +R KR KCF Y              F  T+MK+RTPKF V SAT
Sbjct: 19  RVDEEVAS--VAQKNEKRKKRVKCFTYVAVFIVIQSVIFMIFGLTIMKVRTPKFHVRSAT 76

Query: 503 SDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHAN 324
             +   +    N S ++ M A+L ++N NFG++K++ + V F++ G            AN
Sbjct: 77  FGAFEVSTLDTNPSFNINMIADLSVRNRNFGQYKYQNSTVEFFFRGTKVGEARIVRSRAN 136

Query: 323 WRSTXXXXXXXXXXVASSARLDRDLSA------GVVRLMSRAEMRGNVRMAMMVKKKRSS 162
            RST            SSA +  ++ A       ++ L SR+ +RG V +  ++KK +S+
Sbjct: 137 ARST---RRFLATVDLSSAGVPTEVLANEFRTHALIPLTSRSTLRGKVEIMKLMKKNKST 193

Query: 161 FMNCTMGIMIGARKLRNIVC 102
            MNCTM IMI +++L NI C
Sbjct: 194 NMNCTMEIMISSKQLGNISC 213


>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  105 bits (263), Expect = 1e-20
 Identities = 64/199 (32%), Positives = 107/199 (53%)
 Frame = -3

Query: 698 GHGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFR 519
           G    RSD E++   +  + R++ KR KC +Y              F  TVMKI++PKFR
Sbjct: 17  GQAMARSDAESSRAHSDHELRKK-KRIKCLIYIAVFAVFQIIVITVFALTVMKIKSPKFR 75

Query: 518 VLSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXX 339
           + S T      + + AN SLS++  AE+ +KN NFGR+K+++T + F Y+G         
Sbjct: 76  IKSITVQDLTTSNS-ANPSLSMSFVAEVSVKNPNFGRYKYDQTSISFIYEGTQVGDAVVP 134

Query: 338 EGHANWRSTXXXXXXXXXXVASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSF 159
           +  A  ++T            +S  L  D+SAG V L + +++ G V +  M+KKK+S+ 
Sbjct: 135 KATARTKATRKEIVSGAVKTVNS-NLASDISAGSVTLSTYSKINGKVYLMNMIKKKKSAE 193

Query: 158 MNCTMGIMIGARKLRNIVC 102
           M CTM + + ++++++I C
Sbjct: 194 MKCTMVVHLSSKQVQDIKC 212


>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  105 bits (261), Expect = 2e-20
 Identities = 65/204 (31%), Positives = 107/204 (52%), Gaps = 5/204 (2%)
 Frame = -3

Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           +G  RSDGE+     +E   +R KR KCF Y              F  T+MK++TPK R+
Sbjct: 15  NGYTRSDGESL----SEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMKVKTPKVRL 70

Query: 515 LSAT-SDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXX 339
            ++T +D +    AP   S   T + ++ +KN+N+G +KF++ +V F Y G         
Sbjct: 71  GTSTLTDFTSSDTAP---SFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVGTVVVP 127

Query: 338 EGHANWRSTXXXXXXXXXXVA----SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKK 171
           +G A  R T           A    SS+ L  +LS GV+ L S A++ G V + +++KKK
Sbjct: 128 KGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELMLIMKKK 187

Query: 170 RSSFMNCTMGIMIGARKLRNIVCE 99
           +S+ MNCT+ I +  + ++++ C+
Sbjct: 188 KSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  104 bits (259), Expect = 3e-20
 Identities = 64/202 (31%), Positives = 105/202 (51%), Gaps = 3/202 (1%)
 Frame = -3

Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           +G  RSDGE+     +E   +R KR KCF Y              F  TV+K++TPK R+
Sbjct: 18  NGYTRSDGESL----SEDELKRKKRIKCFAYIGIFIVFQMAIGAVFGLTVLKVKTPKVRL 73

Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336
            ++T       V  +  S S T + ++ +KN+N+G +KF++ +V F Y G         +
Sbjct: 74  GTSTLSD----VTSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVTFMYQGAPVGTVVVPK 129

Query: 335 GHANWRSTXXXXXXXXXXVA---SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165
           G A  R T           A   SS+ L  +LS GV+ L S A++ G V + +++KKK+S
Sbjct: 130 GKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKLTGKVELMLIMKKKKS 189

Query: 164 SFMNCTMGIMIGARKLRNIVCE 99
           + MNCT+ I +  + ++++ C+
Sbjct: 190 ASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  103 bits (256), Expect = 7e-20
 Identities = 58/182 (31%), Positives = 94/182 (51%), Gaps = 3/182 (1%)
 Frame = -3

Query: 635 RRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSATSDSSGEAVAPANGSLS 456
           +R KR K F Y              F+ TVM+I+ PKFRV S T +       P   S +
Sbjct: 20  KRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNPPSFN 79

Query: 455 LTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHANWRST---XXXXXXXXX 285
           +  +AE+ +KN+NFG FKF+ T + F Y G         +G A  RST            
Sbjct: 80  MKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVDLNSN 139

Query: 284 XVASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSFMNCTMGIMIGARKLRNIV 105
            + +++ L  D+S+G + L +  ++ G V +  ++KKK+S+ MNCTM + + +R +++I 
Sbjct: 140 NIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAIQDIK 199

Query: 104 CE 99
           C+
Sbjct: 200 CQ 201


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score =  103 bits (256), Expect = 7e-20
 Identities = 63/200 (31%), Positives = 103/200 (51%), Gaps = 5/200 (2%)
 Frame = -3

Query: 683 RSDGEAADGGA-TEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSA 507
           RSD EAA     + +  R  KR +C LY              F  TVMKI++PKFRV +A
Sbjct: 24  RSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVVVITVFALTVMKIKSPKFRVRTA 83

Query: 506 TSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHA 327
            S +  E  + +N S +L M    G+KN+NFG F++E  +V F Y           E   
Sbjct: 84  -SITGFEVGSASNPSFNLEMDVHFGVKNTNFGHFEYEDGIVVFTYRDVRIGQTNVEEERV 142

Query: 326 NWRSTXXXXXXXXXXVA----SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSF 159
             RST           +    +++RL  D+S G++ +   +++ G + +  ++KKK+S+ 
Sbjct: 143 RARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITISSKLDGKIHLMKIIKKKKSAQ 202

Query: 158 MNCTMGIMIGARKLRNIVCE 99
           MNCTM +++  + ++N+VC+
Sbjct: 203 MNCTMEVVLATKSVQNVVCK 222


>ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca
           subsp. vesca]
          Length = 200

 Score = 97.1 bits (240), Expect = 5e-18
 Identities = 64/199 (32%), Positives = 100/199 (50%)
 Frame = -3

Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           +G  RSDGE+     +E   +R KR KCF Y              F  TV+K++TPK R 
Sbjct: 15  NGYTRSDGESL----SEDELKRKKRIKCFAYIGIFIVFQMAVGAVFGLTVLKVKTPKVR- 69

Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336
           L  TS  SG  V  +  S S T + ++ +KN+N+G +KF+  +V F Y G         +
Sbjct: 70  LDTTSTLSG--VTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVVTFKYQGTPVGTFTVPK 127

Query: 335 GHANWRSTXXXXXXXXXXVASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSFM 156
           G A  R T           A+        S+G + L S A++ G V +  ++KKK+S+ M
Sbjct: 128 GKAGMRGTKKIDASVSLNTAALN------SSGELTLTSEAKLTGKVTLMFIMKKKKSASM 181

Query: 155 NCTMGIMIGARKLRNIVCE 99
           NCT+ I +  + ++++VC+
Sbjct: 182 NCTIQIDVSGQTVKSVVCK 200


>ref|XP_007210080.1| hypothetical protein PRUPE_ppa019661mg [Prunus persica]
           gi|462405815|gb|EMJ11279.1| hypothetical protein
           PRUPE_ppa019661mg [Prunus persica]
          Length = 211

 Score = 94.7 bits (234), Expect = 3e-17
 Identities = 61/202 (30%), Positives = 103/202 (50%), Gaps = 3/202 (1%)
 Frame = -3

Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           +G  R+D E+A    +E+  +R KR K  +Y               + TVMK++TPKFR+
Sbjct: 12  NGYSRNDAESATL-QSEEELKRQKRIKMAIYISIFVVFQIIVITTMSLTVMKVKTPKFRL 70

Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNS-NFGRFKFERTLVYFYYDGXXXXXXXXX 339
            S  +  S ++V PA  S  +  + ++ IKNS N+G +KF    V F Y G         
Sbjct: 71  GSDINVQSFKSV-PATPSFDMKFTTQIRIKNSANWGSYKFNTANVTFQYQGATVGVIDIA 129

Query: 338 EGHANWRSTXXXXXXXXXXVA--SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165
           +G   W ST           +  + + L+ +LS+GV+ L S   + G V +  ++KKK++
Sbjct: 130 KGKVGWLSTIKRNAEVSLSSSGLTGSNLESELSSGVLTLNSVGRLNGKVAIMFIMKKKKA 189

Query: 164 SFMNCTMGIMIGARKLRNIVCE 99
           + MNCT+   + A+ L+++ C+
Sbjct: 190 TNMNCTIAFDVAAKTLKSLECK 211


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score = 94.4 bits (233), Expect = 3e-17
 Identities = 59/202 (29%), Positives = 100/202 (49%), Gaps = 3/202 (1%)
 Frame = -3

Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           +G  RSDGE+     ++   +R KR + F Y              F  TVMK++TPK R+
Sbjct: 15  NGYTRSDGESL---VSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMKVKTPKVRL 71

Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336
                        PA  S   T + ++ +KN+N+G +KF+ + V F Y G         +
Sbjct: 72  GEINVQDFNSV--PATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVGQVTVPK 129

Query: 335 GHANWRSTXXXXXXXXXXV---ASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165
           G A  RST               SS+ L  +L++GV+ L S+A++ G V + +++KKK+S
Sbjct: 130 GKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLIMKKKKS 189

Query: 164 SFMNCTMGIMIGARKLRNIVCE 99
           S M+C +G  +  + ++++ C+
Sbjct: 190 STMDCMIGFDLSTKTVKSLQCK 211


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score = 93.2 bits (230), Expect = 7e-17
 Identities = 56/202 (27%), Positives = 100/202 (49%), Gaps = 3/202 (1%)
 Frame = -3

Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           +G  RSDGE+     +E   +R KR K F+Y              F  TVMK++TPK R+
Sbjct: 15  NGYTRSDGESL---VSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMKVKTPKVRL 71

Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336
                 S      PA  S   + + ++ +KN+N+G +KF+ +   F Y G         +
Sbjct: 72  GGINVQSLNSV--PATPSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVGQVSIPK 129

Query: 335 GHANWRSTXXXXXXXXXXV---ASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165
             A  RST               SS+ +  +L++G++ L S+A++ G V + +++KKK+S
Sbjct: 130 SKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLIMKKKKS 189

Query: 164 SFMNCTMGIMIGARKLRNIVCE 99
           + M+CT+   +  + ++++ C+
Sbjct: 190 ATMDCTIAFDLSTKTVKSLQCK 211


>ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 226

 Score = 92.4 bits (228), Expect = 1e-16
 Identities = 56/175 (32%), Positives = 87/175 (49%), Gaps = 3/175 (1%)
 Frame = -3

Query: 638 RRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSATSDSSGEAVAPANGSL 459
           RR     KC  Y              F  TVM+IR+PK R  + T +S    V  ++ S 
Sbjct: 4   RREGSNAKCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFS-TVNSSSPSF 62

Query: 458 SLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHANWRSTXXXXXXXXXXV 279
            + + A++ +KN+NFG FK+E + V   Y G         +G A  R T           
Sbjct: 63  DMKLMAQVAVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDISS 122

Query: 278 A---SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSFMNCTMGIMIGAR 123
           +   S++ L  D++AGV+ L S+A+++G V +  ++KKK+S  M+CTMGI +  R
Sbjct: 123 SRLSSNSNLGNDINAGVLPLSSQAKLKGKVHLMKVIKKKKSGEMSCTMGINLATR 177


>ref|XP_007210107.1| hypothetical protein PRUPE_ppa021960mg [Prunus persica]
           gi|462405842|gb|EMJ11306.1| hypothetical protein
           PRUPE_ppa021960mg [Prunus persica]
          Length = 212

 Score = 91.7 bits (226), Expect = 2e-16
 Identities = 61/202 (30%), Positives = 99/202 (49%), Gaps = 3/202 (1%)
 Frame = -3

Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           HG +RSD E+ +        +R K+ K  +Y               + TVMK++TP+FR 
Sbjct: 16  HGYQRSDAESLENA---DELKRKKKIKMAIYIGIFVVFQIIVITTMSLTVMKVKTPRFR- 71

Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNS-NFGRFKFERTLVYFYYDGXXXXXXXXX 339
           L   +  S E+V PA  S     + ++ IKNS N+G +KF    V F Y G         
Sbjct: 72  LGNINVQSFESV-PATPSFDTKFTTQIKIKNSANWGSYKFNAANVTFQYQGETVAVINIA 130

Query: 338 EGHANWRSTXXXXXXXXXXVA--SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165
           +G A W ST           +  + + L  +LS+GV+ L S   + G V +  ++KKK++
Sbjct: 131 KGKAGWLSTIKRNAEVSLNSSGITGSNLGSELSSGVLTLNSVGRLNGKVAIMFIMKKKKA 190

Query: 164 SFMNCTMGIMIGARKLRNIVCE 99
           + MNCT+   + A+ L+++ C+
Sbjct: 191 TNMNCTIAFDVAAKTLKSLQCK 212


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score = 91.3 bits (225), Expect = 3e-16
 Identities = 58/203 (28%), Positives = 100/203 (49%), Gaps = 4/203 (1%)
 Frame = -3

Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516
           +G  RSD E+A   + E +R+  KR K  +Y              F  TVM+++ PK R+
Sbjct: 15  NGHPRSDEESASLQSKELKRK--KRIKYAVYIAAFAVFQTVVILIFALTVMRVKNPKVRI 72

Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336
              T ++   +   A  S +L    ++ +KN+NFG +KF+   + F YDG         +
Sbjct: 73  GKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGEAIIPK 132

Query: 335 GHANWRST----XXXXXXXXXXVASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKR 168
             A  RST               +++  L  +LS+ V+ L S+A+++G V +  ++KKK+
Sbjct: 133 ARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKVMKKKK 192

Query: 167 SSFMNCTMGIMIGARKLRNIVCE 99
           S  MNCT+   +  R L+++ C+
Sbjct: 193 SPEMNCTLIFNVSTRSLQDLKCK 215


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score = 89.7 bits (221), Expect = 8e-16
 Identities = 53/187 (28%), Positives = 92/187 (49%), Gaps = 3/187 (1%)
 Frame = -3

Query: 650 TEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSATSDSSGEAVAPA 471
           T  RR+RN   KC  Y              F   VM+IR PK R+   T ++     + +
Sbjct: 7   TTSRRKRNI--KCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSS 64

Query: 470 NGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHANWRSTXXXXXXX 291
           + S S+ ++A++ +KN+NFG FKF+ + +   Y G         +  A  RST       
Sbjct: 65  SPSFSMNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTV 124

Query: 290 XXXVASSAR---LDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSFMNCTMGIMIGARK 120
                  +R   L  D+ +G + L S A++ G + +  + KKK+S+ MNCTM +   +++
Sbjct: 125 SVSSDKMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQ 184

Query: 119 LRNIVCE 99
           ++N++C+
Sbjct: 185 IQNLMCQ 191

