BLASTX nr result
ID: Mentha26_contig00035379
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00035379 (700 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg [Mimulus...    139   9e-31
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...    134   2e-29
gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus...    117   3e-24
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...    114   4e-23
ref|XP_002513626.1| conserved hypothetical protein [Ricinus comm...    112   2e-22
ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578...    107   5e-21
gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus...    106   8e-21
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...    105   1e-20
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...    105   2e-20
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...    104   3e-20
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...    103   7e-20
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...    103   7e-20
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...     97   5e-18
ref|XP_007210080.1| hypothetical protein PRUPE_ppa019661mg [Prun...     95   3e-17
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...     94   3e-17
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...     93   7e-17
ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r...     92   1e-16
ref|XP_007210107.1| hypothetical protein PRUPE_ppa021960mg [Prun...     92   2e-16
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...
91 3e-16 ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r... 90 8e-16 >gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg [Mimulus guttatus] Length = 202 Score = 139 bits (350), Expect = 9e-31 Identities = 85/203 (41%), Positives = 119/203 (58%), Gaps = 3/203 (1%) Frame = -3 Query: 698 GHGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFR 519 GHG RSD EA GGA + R++N R KCFLY F+ TVMKIRTPKFR Sbjct: 6 GHG--RSDAEA--GGAATEPRKKN-RTKCFLYIALFVIFQIGVITIFSLTVMKIRTPKFR 60 Query: 518 VLSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXX 339 + SA ++ A PA+ S S T++AE +KN+NFGR+K+ T V F+Y G Sbjct: 61 IRSAHL-TNFNAGTPASPSFSATVNAEFTVKNANFGRYKYRNTTVDFFYRGTPVGQVLVR 119 Query: 338 EGHANWRSTXXXXXXXXXXVASSA---RLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKR 168 + A WRST + ++ +L DL+AGVV++ S+A MRG V + ++KK + Sbjct: 120 DSRAGWRSTKKFNVAVNLSLTNAQANPQLASDLNAGVVQISSQARMRGRVELIFVMKKNK 179 Query: 167 SSFMNCTMGIMIGARKLRNIVCE 99 S+ MNCTM I+ ++LRNI+C+ Sbjct: 180 STDMNCTMEIVTATQQLRNILCK 202 >gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus] Length = 213 Score = 134 bits (338), Expect = 2e-29 Identities = 78/203 (38%), Positives = 114/203 (56%), Gaps = 3/203 (1%) Frame = -3 Query: 698 GHGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFR 519 GHG RSD EA + +R+ KR KCF+Y F+ TVMKIRTPKFR Sbjct: 14 GHG--RSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTVMKIRTPKFR 71 Query: 518 VLSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXX 339 + SA ++ A P + S S T++AE +KN+NFGR+K+ T V F+Y G Sbjct: 72 IRSAHL-TTFHAGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGTPVGQVFVR 130 Query: 338 EGHANWRSTXXXXXXXXXXVASSA---RLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKR 168 + A WRST +A++ +L DL+AGVV++ S+A M G V + ++KK + Sbjct: 131 DSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNAGVVQITSQARMAGRVELIFVMKKNK 190 Query: 167 SSFMNCTMGIMIGARKLRNIVCE 99 S+ MNC M I+ +++RN+VC+ Sbjct: 191 STDMNCNMEIVTATQQIRNLVCK 213 >gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus] Length = 214 Score = 117 bits (294), 
Expect = 3e-24 Identities = 74/202 (36%), Positives = 110/202 (54%), Gaps = 3/202 (1%) Frame = -3 Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 HG RSD EA GGA + KR +C +Y F+ TVMKIR P+FR+ Sbjct: 18 HG--RSDTEA--GGAAASELHKRKRTQCLIYIGLLAIIQIAVVIVFSLTVMKIRNPRFRI 73 Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336 SA ++ A PA+ + + ++AE +KN+NFGR+K+ T V F Y G E Sbjct: 74 RSAHL-TNFNAGTPASPAFTGKLNAEFSVKNANFGRYKYMDTTVDFVYRGTRVGEVFVRE 132 Query: 335 GHANWRSTXXXXXXXXXXVASSA---RLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165 A WR+T +A++ +L DL+AGVV + S A M G+V + ++KK RS Sbjct: 133 SRAGWRTTKKFNVAVDLSLANARANPQLASDLNAGVVPISSEARMSGSVELLFVLKKNRS 192 Query: 164 SFMNCTMGIMIGARKLRNIVCE 99 + +NCTM I+ +++RNI+C+ Sbjct: 193 TGLNCTMEIVTATQQIRNILCK 214 >ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao] gi|508776114|gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao] Length = 214 Score = 114 bits (284), Expect = 4e-23 Identities = 72/202 (35%), Positives = 105/202 (51%), Gaps = 3/202 (1%) Frame = -3 Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 +G ERSD E+ + E +++ KR KC LY F TVM+IR PKFRV Sbjct: 16 NGHERSDEESVAAHSKELKKK--KRMKCLLYIVLFAVFQTGIILLFALTVMRIRNPKFRV 73 Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336 S S ++ A+ S L M+ + +KN+NFG FK+E LV F Y G + Sbjct: 74 RSG-SFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVGRATIQK 132 Query: 335 GHANWRSTXXXXXXXXXXV---ASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165 A RST ++ L RD+SAGV+ L S +++ G + + ++KKK+S Sbjct: 133 ARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKVIKKKKS 192 Query: 164 SFMNCTMGIMIGARKLRNIVCE 99 + MNCTM + I R +RNI+C+ Sbjct: 193 TQMNCTMDVAIDTRTVRNIICK 214 >ref|XP_002513626.1| conserved hypothetical protein [Ricinus communis] gi|223547534|gb|EEF49029.1| conserved hypothetical protein [Ricinus communis] Length = 217 Score = 112 bits (279), Expect = 2e-22 Identities = 70/206 (33%), Positives = 104/206 
(50%), Gaps = 8/206 (3%) Frame = -3 Query: 692 GQERSDGEAADGGATEQRR-RRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 GQ RSD E+ G + + R+ KR KC + F FTV++ + PKFRV Sbjct: 15 GQTRSDEESGTAGTAQTKELRKKKRMKCIAFVVAFTIFQTGIILLFVFTVLRFKDPKFRV 74 Query: 515 LSATSDSS----GEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXX 348 SA+ D + +A AP S +LTM+ + G+KN+NFG FK+E + V F Y G Sbjct: 75 RSASFDDTFHVGTDAAAP---SFNLTMNTQFGVKNTNFGHFKYETSTVTFEYRGTVVGLV 131 Query: 347 XXXEGHANWRSTXXXXXXXXXXV---ASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVK 177 + A RST L D+S+G + L S + + G + + ++K Sbjct: 132 NVDKARARARSTRKFDAIVVLRTDRLPDGFELSSDISSGKIPLSSSSRLDGEIHLMKVIK 191 Query: 176 KKRSSFMNCTMGIMIGARKLRNIVCE 99 KK+S+ MNCTM + I R L++IVC+ Sbjct: 192 KKKSAEMNCTMNVDIQTRTLQDIVCK 217 >ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578608 [Solanum tuberosum] Length = 204 Score = 107 bits (266), Expect = 5e-21 Identities = 59/179 (32%), Positives = 99/179 (55%) Frame = -3 Query: 635 RRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSATSDSSGEAVAPANGSLS 456 RR KRNK +Y F+ +MKIRTPKF V SAT D + N S + Sbjct: 31 RRKKRNKILVYVALFIVFQIAVLLFFSLYIMKIRTPKFSVRSATFD----LMVTENASFN 86 Query: 455 LTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHANWRSTXXXXXXXXXXVA 276 +TM+AEL +KN+NFG + ++ + +YFYY+ +G A ++S+ + Sbjct: 87 ITMNAELSVKNANFGPYNYKNSTIYFYYNDVSIGEAFVYQGKAGFKSS-KKFNVIVNLSS 145 Query: 275 SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSFMNCTMGIMIGARKLRNIVCE 99 ++L DL++G + L S++++ G V++ +KKK+S+ MNC + I + + +R+I C+ Sbjct: 146 KESKLRNDLNSGTLILTSKSKLEGKVKLIFFMKKKKSTEMNCAIIIGLAGKVVRDIQCD 204 >gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus guttatus] Length = 214 Score = 106 bits (264), Expect = 8e-21 Identities = 67/200 (33%), Positives = 101/200 (50%), Gaps = 6/200 (3%) Frame = -3 Query: 683 RSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSAT 504 R D E A ++ +R KR KCF Y F T+MK+RTPKF V SAT Sbjct: 19 RVDEEVAS--VAQKNEKRKKRVKCFTYVAVFIVIQSVIFMIFGLTIMKVRTPKFHVRSAT 76 Query: 503 SDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHAN 324 + + N 
S ++ M A+L ++N NFG++K++ + V F++ G AN Sbjct: 77 FGAFEVSTLDTNPSFNINMIADLSVRNRNFGQYKYQNSTVEFFFRGTKVGEARIVRSRAN 136 Query: 323 WRSTXXXXXXXXXXVASSARLDRDLSA------GVVRLMSRAEMRGNVRMAMMVKKKRSS 162 RST SSA + ++ A ++ L SR+ +RG V + ++KK +S+ Sbjct: 137 ARST---RRFLATVDLSSAGVPTEVLANEFRTHALIPLTSRSTLRGKVEIMKLMKKNKST 193 Query: 161 FMNCTMGIMIGARKLRNIVC 102 MNCTM IMI +++L NI C Sbjct: 194 NMNCTMEIMISSKQLGNISC 213 >ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca subsp. vesca] Length = 213 Score = 105 bits (263), Expect = 1e-20 Identities = 64/199 (32%), Positives = 107/199 (53%) Frame = -3 Query: 698 GHGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFR 519 G RSD E++ + + R++ KR KC +Y F TVMKI++PKFR Sbjct: 17 GQAMARSDAESSRAHSDHELRKK-KRIKCLIYIAVFAVFQIIVITVFALTVMKIKSPKFR 75 Query: 518 VLSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXX 339 + S T + + AN SLS++ AE+ +KN NFGR+K+++T + F Y+G Sbjct: 76 IKSITVQDLTTSNS-ANPSLSMSFVAEVSVKNPNFGRYKYDQTSISFIYEGTQVGDAVVP 134 Query: 338 EGHANWRSTXXXXXXXXXXVASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSF 159 + A ++T +S L D+SAG V L + +++ G V + M+KKK+S+ Sbjct: 135 KATARTKATRKEIVSGAVKTVNS-NLASDISAGSVTLSTYSKINGKVYLMNMIKKKKSAE 193 Query: 158 MNCTMGIMIGARKLRNIVC 102 M CTM + + ++++++I C Sbjct: 194 MKCTMVVHLSSKQVQDIKC 212 >ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca subsp. 
vesca] Length = 211 Score = 105 bits (261), Expect = 2e-20 Identities = 65/204 (31%), Positives = 107/204 (52%), Gaps = 5/204 (2%) Frame = -3 Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 +G RSDGE+ +E +R KR KCF Y F T+MK++TPK R+ Sbjct: 15 NGYTRSDGESL----SEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMKVKTPKVRL 70 Query: 515 LSAT-SDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXX 339 ++T +D + AP S T + ++ +KN+N+G +KF++ +V F Y G Sbjct: 71 GTSTLTDFTSSDTAP---SFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVGTVVVP 127 Query: 338 EGHANWRSTXXXXXXXXXXVA----SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKK 171 +G A R T A SS+ L +LS GV+ L S A++ G V + +++KKK Sbjct: 128 KGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELMLIMKKK 187 Query: 170 RSSFMNCTMGIMIGARKLRNIVCE 99 +S+ MNCT+ I + + ++++ C+ Sbjct: 188 KSASMNCTIQIDVSGKTVKSLECK 211 >ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca subsp. vesca] Length = 211 Score = 104 bits (259), Expect = 3e-20 Identities = 64/202 (31%), Positives = 105/202 (51%), Gaps = 3/202 (1%) Frame = -3 Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 +G RSDGE+ +E +R KR KCF Y F TV+K++TPK R+ Sbjct: 18 NGYTRSDGESL----SEDELKRKKRIKCFAYIGIFIVFQMAIGAVFGLTVLKVKTPKVRL 73 Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336 ++T V + S S T + ++ +KN+N+G +KF++ +V F Y G + Sbjct: 74 GTSTLSD----VTSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVTFMYQGAPVGTVVVPK 129 Query: 335 GHANWRSTXXXXXXXXXXVA---SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165 G A R T A SS+ L +LS GV+ L S A++ G V + +++KKK+S Sbjct: 130 GKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKLTGKVELMLIMKKKKS 189 Query: 164 SFMNCTMGIMIGARKLRNIVCE 99 + MNCT+ I + + ++++ C+ Sbjct: 190 ASMNCTIQIDVSGKTVKSLECK 211 >ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776113|gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 
Score = 103 bits (256), Expect = 7e-20 Identities = 58/182 (31%), Positives = 94/182 (51%), Gaps = 3/182 (1%) Frame = -3 Query: 635 RRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSATSDSSGEAVAPANGSLS 456 +R KR K F Y F+ TVM+I+ PKFRV S T + P S + Sbjct: 20 KRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNPPSFN 79 Query: 455 LTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHANWRST---XXXXXXXXX 285 + +AE+ +KN+NFG FKF+ T + F Y G +G A RST Sbjct: 80 MKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVDLNSN 139 Query: 284 XVASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSFMNCTMGIMIGARKLRNIV 105 + +++ L D+S+G + L + ++ G V + ++KKK+S+ MNCTM + + +R +++I Sbjct: 140 NIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAIQDIK 199 Query: 104 CE 99 C+ Sbjct: 200 CQ 201 >ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca subsp. vesca] Length = 222 Score = 103 bits (256), Expect = 7e-20 Identities = 63/200 (31%), Positives = 103/200 (51%), Gaps = 5/200 (2%) Frame = -3 Query: 683 RSDGEAADGGA-TEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSA 507 RSD EAA + + R KR +C LY F TVMKI++PKFRV +A Sbjct: 24 RSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVVVITVFALTVMKIKSPKFRVRTA 83 Query: 506 TSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHA 327 S + E + +N S +L M G+KN+NFG F++E +V F Y E Sbjct: 84 -SITGFEVGSASNPSFNLEMDVHFGVKNTNFGHFEYEDGIVVFTYRDVRIGQTNVEEERV 142 Query: 326 NWRSTXXXXXXXXXXVA----SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSF 159 RST + +++RL D+S G++ + +++ G + + ++KKK+S+ Sbjct: 143 RARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITISSKLDGKIHLMKIIKKKKSAQ 202 Query: 158 MNCTMGIMIGARKLRNIVCE 99 MNCTM +++ + ++N+VC+ Sbjct: 203 MNCTMEVVLATKSVQNVVCK 222 >ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca subsp. 
vesca] Length = 200 Score = 97.1 bits (240), Expect = 5e-18 Identities = 64/199 (32%), Positives = 100/199 (50%) Frame = -3 Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 +G RSDGE+ +E +R KR KCF Y F TV+K++TPK R Sbjct: 15 NGYTRSDGESL----SEDELKRKKRIKCFAYIGIFIVFQMAVGAVFGLTVLKVKTPKVR- 69 Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336 L TS SG V + S S T + ++ +KN+N+G +KF+ +V F Y G + Sbjct: 70 LDTTSTLSG--VTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVVTFKYQGTPVGTFTVPK 127 Query: 335 GHANWRSTXXXXXXXXXXVASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSFM 156 G A R T A+ S+G + L S A++ G V + ++KKK+S+ M Sbjct: 128 GKAGMRGTKKIDASVSLNTAALN------SSGELTLTSEAKLTGKVTLMFIMKKKKSASM 181 Query: 155 NCTMGIMIGARKLRNIVCE 99 NCT+ I + + ++++VC+ Sbjct: 182 NCTIQIDVSGQTVKSVVCK 200 >ref|XP_007210080.1| hypothetical protein PRUPE_ppa019661mg [Prunus persica] gi|462405815|gb|EMJ11279.1| hypothetical protein PRUPE_ppa019661mg [Prunus persica] Length = 211 Score = 94.7 bits (234), Expect = 3e-17 Identities = 61/202 (30%), Positives = 103/202 (50%), Gaps = 3/202 (1%) Frame = -3 Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 +G R+D E+A +E+ +R KR K +Y + TVMK++TPKFR+ Sbjct: 12 NGYSRNDAESATL-QSEEELKRQKRIKMAIYISIFVVFQIIVITTMSLTVMKVKTPKFRL 70 Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNS-NFGRFKFERTLVYFYYDGXXXXXXXXX 339 S + S ++V PA S + + ++ IKNS N+G +KF V F Y G Sbjct: 71 GSDINVQSFKSV-PATPSFDMKFTTQIRIKNSANWGSYKFNTANVTFQYQGATVGVIDIA 129 Query: 338 EGHANWRSTXXXXXXXXXXVA--SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165 +G W ST + + + L+ +LS+GV+ L S + G V + ++KKK++ Sbjct: 130 KGKVGWLSTIKRNAEVSLSSSGLTGSNLESELSSGVLTLNSVGRLNGKVAIMFIMKKKKA 189 Query: 164 SFMNCTMGIMIGARKLRNIVCE 99 + MNCT+ + A+ L+++ C+ Sbjct: 190 TNMNCTIAFDVAAKTLKSLECK 211 >ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca subsp. 
vesca] Length = 211 Score = 94.4 bits (233), Expect = 3e-17 Identities = 59/202 (29%), Positives = 100/202 (49%), Gaps = 3/202 (1%) Frame = -3 Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 +G RSDGE+ ++ +R KR + F Y F TVMK++TPK R+ Sbjct: 15 NGYTRSDGESL---VSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMKVKTPKVRL 71 Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336 PA S T + ++ +KN+N+G +KF+ + V F Y G + Sbjct: 72 GEINVQDFNSV--PATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVGQVTVPK 129 Query: 335 GHANWRSTXXXXXXXXXXV---ASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165 G A RST SS+ L +L++GV+ L S+A++ G V + +++KKK+S Sbjct: 130 GKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLIMKKKKS 189 Query: 164 SFMNCTMGIMIGARKLRNIVCE 99 S M+C +G + + ++++ C+ Sbjct: 190 STMDCMIGFDLSTKTVKSLQCK 211 >ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca subsp. vesca] Length = 211 Score = 93.2 bits (230), Expect = 7e-17 Identities = 56/202 (27%), Positives = 100/202 (49%), Gaps = 3/202 (1%) Frame = -3 Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 +G RSDGE+ +E +R KR K F+Y F TVMK++TPK R+ Sbjct: 15 NGYTRSDGESL---VSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMKVKTPKVRL 71 Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336 S PA S + + ++ +KN+N+G +KF+ + F Y G + Sbjct: 72 GGINVQSLNSV--PATPSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVGQVSIPK 129 Query: 335 GHANWRSTXXXXXXXXXXV---ASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165 A RST SS+ + +L++G++ L S+A++ G V + +++KKK+S Sbjct: 130 SKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLIMKKKKS 189 Query: 164 SFMNCTMGIMIGARKLRNIVCE 99 + M+CT+ + + ++++ C+ Sbjct: 190 ATMDCTIAFDLSTKTVKSLQCK 211 >ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 226 Score = 92.4 bits (228), Expect = 1e-16 
Identities = 56/175 (32%), Positives = 87/175 (49%), Gaps = 3/175 (1%) Frame = -3 Query: 638 RRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSATSDSSGEAVAPANGSL 459 RR KC Y F TVM+IR+PK R + T +S V ++ S Sbjct: 4 RREGSNAKCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFS-TVNSSSPSF 62 Query: 458 SLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHANWRSTXXXXXXXXXXV 279 + + A++ +KN+NFG FK+E + V Y G +G A R T Sbjct: 63 DMKLMAQVAVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDISS 122 Query: 278 A---SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSFMNCTMGIMIGAR 123 + S++ L D++AGV+ L S+A+++G V + ++KKK+S M+CTMGI + R Sbjct: 123 SRLSSNSNLGNDINAGVLPLSSQAKLKGKVHLMKVIKKKKSGEMSCTMGINLATR 177 >ref|XP_007210107.1| hypothetical protein PRUPE_ppa021960mg [Prunus persica] gi|462405842|gb|EMJ11306.1| hypothetical protein PRUPE_ppa021960mg [Prunus persica] Length = 212 Score = 91.7 bits (226), Expect = 2e-16 Identities = 61/202 (30%), Positives = 99/202 (49%), Gaps = 3/202 (1%) Frame = -3 Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 HG +RSD E+ + +R K+ K +Y + TVMK++TP+FR Sbjct: 16 HGYQRSDAESLENA---DELKRKKKIKMAIYIGIFVVFQIIVITTMSLTVMKVKTPRFR- 71 Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNS-NFGRFKFERTLVYFYYDGXXXXXXXXX 339 L + S E+V PA S + ++ IKNS N+G +KF V F Y G Sbjct: 72 LGNINVQSFESV-PATPSFDTKFTTQIKIKNSANWGSYKFNAANVTFQYQGETVAVINIA 130 Query: 338 EGHANWRSTXXXXXXXXXXVA--SSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRS 165 +G A W ST + + + L +LS+GV+ L S + G V + ++KKK++ Sbjct: 131 KGKAGWLSTIKRNAEVSLNSSGITGSNLGSELSSGVLTLNSVGRLNGKVAIMFIMKKKKA 190 Query: 164 SFMNCTMGIMIGARKLRNIVCE 99 + MNCT+ + A+ L+++ C+ Sbjct: 191 TNMNCTIAFDVAAKTLKSLQCK 212 >ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777615|gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 215 Score = 91.3 bits (225), Expect = 3e-16 Identities = 58/203 (28%), Positives = 100/203 (49%), Gaps = 4/203 (1%) Frame = -3 
Query: 695 HGQERSDGEAADGGATEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRV 516 +G RSD E+A + E +R+ KR K +Y F TVM+++ PK R+ Sbjct: 15 NGHPRSDEESASLQSKELKRK--KRIKYAVYIAAFAVFQTVVILIFALTVMRVKNPKVRI 72 Query: 515 LSATSDSSGEAVAPANGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXE 336 T ++ + A S +L ++ +KN+NFG +KF+ + F YDG + Sbjct: 73 GKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGEAIIPK 132 Query: 335 GHANWRST----XXXXXXXXXXVASSARLDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKR 168 A RST +++ L +LS+ V+ L S+A+++G V + ++KKK+ Sbjct: 133 ARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKVMKKKK 192 Query: 167 SSFMNCTMGIMIGARKLRNIVCE 99 S MNCT+ + R L+++ C+ Sbjct: 193 SPEMNCTLIFNVSTRSLQDLKCK 215 >ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776108|gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 191 Score = 89.7 bits (221), Expect = 8e-16 Identities = 53/187 (28%), Positives = 92/187 (49%), Gaps = 3/187 (1%) Frame = -3 Query: 650 TEQRRRRNKRNKCFLYXXXXXXXXXXXXXXFTFTVMKIRTPKFRVLSATSDSSGEAVAPA 471 T RR+RN KC Y F VM+IR PK R+ T ++ + + Sbjct: 7 TTSRRKRNI--KCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSS 64 Query: 470 NGSLSLTMSAELGIKNSNFGRFKFERTLVYFYYDGXXXXXXXXXEGHANWRSTXXXXXXX 291 + S S+ ++A++ +KN+NFG FKF+ + + Y G + A RST Sbjct: 65 SPSFSMNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTV 124 Query: 290 XXXVASSAR---LDRDLSAGVVRLMSRAEMRGNVRMAMMVKKKRSSFMNCTMGIMIGARK 120 +R L D+ +G + L S A++ G + + + KKK+S+ MNCTM + +++ Sbjct: 125 SVSSDKMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQ 184 Query: 119 LRNIVCE 99 ++N++C+ Sbjct: 185 IQNLMCQ 191
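
Each alignment in the report above opens with the same summary fields: a ">" header, Length, Score, Expect, Identities, Positives, an optional Gaps field, and Frame. The following is a minimal Python sketch of how those fields could be pulled out of the flattened text. It is not part of the BLAST output. The names `HIT_RE` and `parse_hits` are hypothetical, and the pattern assumes the legacy BLASTX 2.2.x layout shown here, with whitespace collapsed to single spaces before matching.

```python
import re

# Hypothetical pattern for one per-hit summary, applied after all runs of
# whitespace (including line breaks) have been collapsed to single spaces.
# The optional group covers hits that report no "Gaps" field.
HIT_RE = re.compile(
    r">(?P<accession>\S+) (?P<description>.*?) Length = (?P<length>\d+) "
    r"Score = (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), "
    r"Expect = (?P<evalue>[\de.+-]+) "
    r"Identities = (?P<ident>\d+)/(?P<aln_len>\d+) \(\d+%\), "
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:, Gaps = (?P<gaps>\d+)/\d+ \(\d+%\))?"
    r" Frame = (?P<frame>[+-]\d)"
)


def parse_hits(report_text):
    """Return one dict per alignment summary found in a flattened report."""
    flat = " ".join(report_text.split())  # collapse spaces and line breaks
    hits = []
    for m in HIT_RE.finditer(flat):
        evalue = m.group("evalue")
        if evalue.startswith("e"):  # legacy BLAST may print "e-100" for 1e-100
            evalue = "1" + evalue
        ident = int(m.group("ident"))
        aln_len = int(m.group("aln_len"))
        hits.append({
            "accession": m.group("accession"),
            "description": m.group("description"),
            "subject_length": int(m.group("length")),
            "bit_score": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": ident,
            "positives": int(m.group("pos")),
            "alignment_length": aln_len,
            "gaps": int(m.group("gaps") or 0),
            "frame": int(m.group("frame")),
            # Unrounded fraction; the report itself prints a whole percent.
            "identity_fraction": ident / aln_len,
        })
    return hits


# Two summaries copied from the report above; the second has no Gaps field
# and carries a line break inside the summary.
sample = (
    ">gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg "
    "[Mimulus guttatus] Length = 202 Score = 139 bits (350), Expect = 9e-31 "
    "Identities = 85/203 (41%), Positives = 119/203 (58%), Gaps = 3/203 (1%) "
    "Frame = -3 "
    ">ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578608 "
    "[Solanum tuberosum] Length = 204 Score = 107 bits (266),\n"
    "Expect = 5e-21 Identities = 59/179 (32%), Positives = 99/179 (55%) "
    "Frame = -3"
)

for hit in parse_hits(sample):
    print(hit["accession"], hit["bit_score"], hit["evalue"])
```

The sketch reads only the summary fields. The Query/Sbjct rows are left alone because their column alignment is not recoverable once whitespace has been collapsed. For reports that still have their original line layout, an established BLAST parser is likely a better fit than a hand-written pattern.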