BLASTX nr result
ID: Mentha23_contig00023222
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00023222
(528 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus...   167   1e-39
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   108   6e-22
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...   106   3e-21
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   105   5e-21
ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579...   105   7e-21
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     103   2e-20
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   101   9e-20
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...    99   6e-19
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...    99   8e-19
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...    98   1e-18
ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295...    95   1e-17
ref|XP_004298841.1| PREDICTED: uncharacterized protein LOC101294...    92   7e-17
ref|XP_007203004.1| hypothetical protein PRUPE_ppa017380mg, part...    92   1e-16
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...    91   2e-16
ref|XP_004288775.1| PREDICTED: uncharacterized protein LOC101312...    90   4e-16
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...    89   6e-16
ref|XP_004309174.1| PREDICTED: uncharacterized protein LOC101303...    89   6e-16
ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-r...    89   8e-16
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...    89   8e-16
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...    88   1e-15

>gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus guttatus]
Length = 208

Score = 167 bits (423), Expect = 1e-39
Identities = 97/184 (52%), Positives = 125/184 (67%), Gaps = 10/184 (5%)
Frame = +3

Query: 6   MADKYQPEV-QGYPLAPASVGPRSDEEYG--NNHHSGEQMKKKKRIKCLTYXXXXXXXXX 176
MA+KY EV Q YPLAP++V PRSDEEY NN+ + E+MKK KR+KC Y
Sbjct: 1   MAEKYNQEVHQAYPLAPSTV-PRSDEEYSGTNNYRAQEEMKKNKRMKCFAYIACFAVFQI 59

Query: 177 XXXXXXSLIIMRVRTPKVRMDDVTVTSV-ANGDVRFGARVLVKNTNFGRYKFESTLATIR 353
+L +MRV++PK+R+ D+TVT +G+VR ARVLVKNTNFGRYKF+S LATIR
Sbjct: 60  IIILILALTVMRVKSPKLRLGDITVTKDHVSGNVRLTARVLVKNTNFGRYKFDSGLATIR 119

Query: 354 TADNNVVQFPIQEARARARSTKKIAVMASLGASAS------GTLELTVEAKLRGKVEFMR 515
+ +NV QF I E+RARARSTKK+ V L +S S G L VE++LRGKVE ++
Sbjct: 120 SGASNVGQFVIPESRARARSTKKMYVTVDLNSSNSSNNSMGGVWTLNVESQLRGKVELLK 179

Query: 516 VIKR 527
V+K+
Sbjct: 180 VVKK 183

>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
gi|508777615|gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
Length = 215

Score = 108 bits (271), Expect = 6e-22
Identities = 76/195 (38%), Positives = 107/195 (54%), Gaps = 21/195 (10%)
Frame = +3

Query: 6   MADKYQPEVQGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXX 185
MA+K Q Q +PLAPA+ PRSDEE + +++K+KKRIK Y
Sbjct: 1   MAEKDQ---QVHPLAPANGHPRSDEESASLQ--SKELKRKKRIKYAVYIAAFAVFQTVVI 55

Query: 186 XXXSLIIMRVRTPKVRMDDVTVTSV--------ANGDVRFGARVLVKNTNFGRYKFESTL 341
+L +MRV+ PKVR+ VTV ++ A+ ++RF +V VKNTNFG YKF++
Sbjct: 56  LIFALTVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNAT 115

Query: 342 ATIRTADNNVVQFPIQEARARARSTKKIAVMASLGASA-------------SGTLELTVE 482
+ V + I +ARARARSTKK+ V + +SA S L L +
Sbjct: 116 MSFLYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQ 175

Query: 483 AKLRGKVEFMRVIKR 527
AKL+GKVE M+V+K+
Sbjct: 176 AKLKGKVELMKVMKK 190

>ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca subsp. vesca]
Length = 200

Score = 106 bits (265), Expect = 3e-21
Identities = 69/182 (37%), Positives = 97/182 (53%), Gaps = 8/182 (4%)
Frame = +3

Query: 6   MADKYQPEVQGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXX 185
MADK+Q Q YPLAP++ RSD E S +++K+KKRIKC Y
Sbjct: 1   MADKHQ---QVYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAVG 53

Query: 186 XXXSLIIMRVRTPKVRMDDVTV-----TSVANGDVRFGARVLVKNTNFGRYKFESTLATI 350
L +++V+TPKVR+D + +S + F ++ VKNTN+G YKF+ + T
Sbjct: 54  AVFGLTVLKVKTPKVRLDTTSTLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVVTF 113

Query: 351 RTADNNVVQFPIQEARARARSTKKIAVMASLGASA---SGTLELTVEAKLRGKVEFMRVI 521
+ V F + + +A R TKKI SL +A SG L LT EAKL GKV M ++
Sbjct: 114 KYQGTPVGTFTVPKGKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMFIM 173

Query: 522 KR 527
K+
Sbjct: 174 KK 175

>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
gi|508776114|gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
Length = 214

Score = 105 bits (263), Expect = 5e-21
Identities = 69/183 (37%), Positives = 97/183 (53%), Gaps = 20/183 (10%)
Frame = +3

Query: 39  YPLAPASVG-PRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLIIMRV 215
YPL PA+ G RSDEE H +++KKKKR+KCL Y +L +MR+
Sbjct: 9   YPLVPAANGHERSDEESVAAH--SKELKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRI 66

Query: 216 RTPKVRMDDVTVTSVANG-------DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 374
R PK R+ + T+ G D++ + VKNTNFG +K+E L T V
Sbjct: 67  RNPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVG 126

Query: 375 QFPIQEARARARSTKKIAVMASLGAS------------ASGTLELTVEAKLRGKVEFMRV 518
+ IQ+ARARARSTKK+ V+ L ++ ++G L LT +KL GK+ M+V
Sbjct: 127 RATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKV 186

Query: 519 IKR 527
IK+
Sbjct: 187 IKK 189

>ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579067 [Solanum tuberosum]
Length = 197

Score = 105 bits (262), Expect = 7e-21
Identities = 58/170 (34%), Positives = 95/170 (55%), Gaps = 6/170 (3%)
Frame = +3

Query: 33  QGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLIIMR 212
Q YPLAP+++ PRSD E+ N+ ++KK+++ +R
Sbjct: 5   QKYPLAPSNIMPRSDAEFATNNFQSNNQRRKKKLRST---FLLTIFLTGIILLFCFTFLR 61

Query: 213 VRTPKVRMDDVTVTSVANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV-QFPIQ 389
+++PK+R++++ +T+ +G + F A+V ++N NF RY ++STL TI TA+ + +F I
Sbjct: 62  IKSPKIRIENIRITNDGDGRINFSAQVFLRNRNFWRYGYDSTLGTINTAEGTTIGRFVIP 121

Query: 390 EARARARSTKKIAVM-----ASLGASASGTLELTVEAKLRGKVEFMRVIK 524
+ R RSTK I VM S + SG L + EAK+RGKV+ RV +
Sbjct: 122 DGEVRRRSTKTIYVMENFILPSRLNNTSGILPVISEAKIRGKVKVFRVFR 171

>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
Length = 212

Score = 103 bits (257), Expect = 2e-20
Identities = 69/194 (35%), Positives = 100/194 (51%), Gaps = 20/194 (10%)
Frame = +3

Query: 6   MADKYQPEVQGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXX 185
MA++YQ Q YPLAPA+ PRSDEE N +++K++KRIK Y
Sbjct: 1   MAERYQ---QVYPLAPANGHPRSDEESSNL--DAKELKRRKRIKLAIYAFIFTASQIIVT 55

Query: 186 XXXSLIIMRVRTPKVRMDD--------VTVTSVANGDVRFGARVLVKNTNFGRYKFESTL 341
L++MRV++PK+R+ D S + D+ F ++ VKNTN+G YKF++T
Sbjct: 56  LVFVLVVMRVKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTT 115

Query: 342 ATIRTADNNVVQFPIQEARARARSTKKIAVMASLGAS------------ASGTLELTVEA 485
A V Q I + +A RSTKK+ V SL +S + G L L A
Sbjct: 116 AAFAYEGETVGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTA 175

Query: 486 KLRGKVEFMRVIKR 527
K+ GKV+ M ++K+
Sbjct: 176 KMTGKVKLMLIMKK 189

>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca subsp. vesca]
Length = 211

Score = 101 bits (252), Expect = 9e-20
Identities = 68/183 (37%), Positives = 97/183 (53%), Gaps = 18/183 (9%)
Frame = +3

Query: 33  QGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLIIMR 212
Q YPLAPA+ RSD G + S +++K++KR K Y L +M+
Sbjct: 7   QAYPLAPANGYTRSD---GESLVSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMK 63

Query: 213 VRTPKVRMDDVTVTSV------ANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 374
V+TPKVR+ + V S+ + D F ++ VKNTN+G YKF+++ AT V
Sbjct: 64  VKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVG 123

Query: 375 QFPIQEARARARSTKKIAVMASLGASA------------SGTLELTVEAKLRGKVEFMRV 518
Q I +++AR RSTKKI+V L +A SG L LT +AKL GKVE M +
Sbjct: 124 QVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLI 183

Query: 519 IKR 527
+K+
Sbjct: 184 MKK 186

>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca subsp. vesca]
Length = 211

Score = 99.0 bits (245), Expect = 6e-19
Identities = 64/183 (34%), Positives = 96/183 (52%), Gaps = 18/183 (9%)
Frame = +3

Query: 33  QGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLIIMR 212
Q YPLAPA+ RSD G + S +++K++KRI+ TY L +M+
Sbjct: 7   QAYPLAPANGYTRSD---GESLVSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMK 63

Query: 213 VRTPKVRMDDV------TVTSVANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 374
V+TPKVR+ ++ +V + + D F ++ VKNTN+G YKF+++ T V
Sbjct: 64  VKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVG 123

Query: 375 QFPIQEARARARSTKKIAVMASLGASA------------SGTLELTVEAKLRGKVEFMRV 518
Q + + +A RSTKK+ V SL A+ SG L L +AKL GKVE M +
Sbjct: 124 QVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLI 183

Query: 519 IKR 527
+K+
Sbjct: 184 MKK 186

>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca subsp. vesca]
Length = 211

Score = 98.6 bits (244), Expect = 8e-19
Identities = 67/190 (35%), Positives = 94/190 (49%), Gaps = 16/190 (8%)
Frame = +3

Query: 6   MADKYQPEVQGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXX 185
MA+K Q Q YPLA + RSD E S +++K+KKRIKC Y
Sbjct: 1   MAEKSQKTHQTYPLASENGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAIG 56

Query: 186 XXXSLIIMRVRTPKVRMDDVTVTSVANGDVRFGA----RVLVKNTNFGRYKFESTLATIR 353
L +++V+TPKVR+ T++ V + F + ++ VKNTN+G YKF+ + T
Sbjct: 57  AVFGLTVLKVKTPKVRLGTSTLSDVTSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVTFM 116

Query: 354 TADNNVVQFPIQEARARARSTKKIAVMASLGASA------------SGTLELTVEAKLRG 497
V + + +A R TKKI V SL +A G L LT EAKL G
Sbjct: 117 YQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKLTG 176

Query: 498 KVEFMRVIKR 527
KVE M ++K+
Sbjct: 177 KVELMLIMKK 186

>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca subsp. vesca]
Length = 211

Score = 98.2 bits (243), Expect = 1e-18
Identities = 66/184 (35%), Positives = 89/184 (48%), Gaps = 19/184 (10%)
Frame = +3

Query: 33  QGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLIIMR 212
Q YPLAP++ RSD E S +++K+KKRIKC Y L IM+
Sbjct: 7   QSYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMK 62

Query: 213 VRTPKVRMDDVTVTSVANGDVR------FGARVLVKNTNFGRYKFESTLATIRTADNNVV 374
V+TPKVR+ T+T + D F ++ VKNTN+G YKF+ + T V
Sbjct: 63  VKTPKVRLGTSTLTDFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVG 122

Query: 375 QFPIQEARARARSTKKIAVMASLGASA-------------SGTLELTVEAKLRGKVEFMR 515
+ + +A R TKKI V L +A G L LT EAKL GKVE M
Sbjct: 123 TVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELML 182

Query: 516 VIKR 527
++K+
Sbjct: 183 IMKK 186

>ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca subsp. vesca]
Length = 211

Score = 94.7 bits (234), Expect = 1e-17
Identities = 62/183 (33%), Positives = 94/183 (51%), Gaps = 18/183 (9%)
Frame = +3

Query: 33  QGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLIIMR 212
Q YP AP++ RSD G + S +++K+KKRIK TY L +M+
Sbjct: 7   QAYPTAPSNGYARSD---GESLVSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMK 63

Query: 213 VRTPKVRMDDVTVTSV------ANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 374
V+TPK R + V ++ + D F ++ +KNTN+G YKF++ AT +
Sbjct: 64  VKTPKARWGSIDVETLNYVPATPSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIG 123

Query: 375 QFPIQEARARARSTKKIAVMASLGASA------------SGTLELTVEAKLRGKVEFMRV 518
+ I +++A RSTKKI V SL +A SG L LT + +L+GKVE M +
Sbjct: 124 KVDIPKSKAGMRSTKKIDVEVSLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLI 183

Query: 519 IKR 527
+K+
Sbjct: 184 MKK 186

>ref|XP_004298841.1| PREDICTED: uncharacterized protein LOC101294558 [Fragaria vesca subsp. vesca]
Length = 203

Score = 92.0 bits (227), Expect = 7e-17
Identities = 63/183 (34%), Positives = 96/183 (52%), Gaps = 9/183 (4%)
Frame = +3

Query: 6   MADKYQPEVQGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXX 185
MA+K Q Q Y + A+ RS ++ + S E++K++KRIK TY
Sbjct: 1   MAEKNQ---QAY--SSANGYTRSTDQESSPFQSDEELKRQKRIKLFTYIGIFIVFQIVVM 55

Query: 186 XXXSLIIMRVRTPKVRMDDVTVTSV------ANGDVRFGARVLVKNTNFGRYKFESTLAT 347
L +M+V+TPK R ++TV ++ + D F ++ +KNTN+G YKF++ AT
Sbjct: 56  TVFGLTVMKVKTPKARWGEITVKTLNSVPAAPSFDTTFETQIRIKNTNWGPYKFDAGTAT 115

Query: 348 IRTADNNVVQFPIQEARARARSTKKIAVMASLGASA---SGTLELTVEAKLRGKVEFMRV 518
+ + I +++A R TKKI SL +A SG L LT EAKL GKV M +
Sbjct: 116 FLYQGVTIGKVDIPKSKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMGM 175

Query: 519 IKR 527
+K+
Sbjct: 176 MKK 178

>ref|XP_007203004.1| hypothetical protein PRUPE_ppa017380mg, partial [Prunus persica]
gi|462398535|gb|EMJ04203.1| hypothetical protein PRUPE_ppa017380mg, partial [Prunus persica]
Length = 192

Score = 91.7 bits (226), Expect = 1e-16
Identities = 66/187 (35%), Positives = 96/187 (51%), Gaps = 22/187 (11%)
Frame = +3

Query: 33  QGYPLAPASVG--PRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLII 206
Q P A A+ G P DEE + +++K+KKRIK Y +L +
Sbjct: 7   QASPSAAATNGQHPTVDEE--SAILPSQELKRKKRIKLAIYISAFVVVQIIVITTFALTV 64

Query: 207 MRVRTPKVRMDDVTV------TSVANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNN 368
MRV++PK+R+ ++V +S + D+ F +V +KNTNFGRYKF++T D
Sbjct: 65  MRVQSPKLRLGAISVQTLNASSSTPSFDMTFTTQVRIKNTNFGRYKFDATNVRFMYEDRA 124

Query: 369 VVQFPIQEARARARSTKKIAVMASLGAS--------------ASGTLELTVEAKLRGKVE 506
V Q I +++A RSTKKI V SL + +G L L+ EA+L GKVE
Sbjct: 125 VGQVRIPKSKAGMRSTKKIDVTVSLNSKELPSRSRYNLGNELKTGVLSLSSEARLAGKVE 184

Query: 507 FMRVIKR 527
M V+K+
Sbjct: 185 LMFVMKK 191

>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao]
gi|508777612|gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao]
Length = 185

Score = 90.9 bits (224), Expect = 2e-16
Identities = 59/155 (38%), Positives = 79/155 (50%), Gaps = 19/155 (12%)
Frame = +3

Query: 120 KKKRIKCLTYXXXXXXXXXXXXXXXSLIIMRVRTPKVRMDDVTVTSVANG-------DVR 278
K+ KCL Y +L +MR++ PKVR VTV + + G D+R
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 279 FGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVMASLGAS-- 452
A+V VKNTNFG +K+E++ I V + I +ARARAR TKK V + +S
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 453 ----------ASGTLELTVEAKLRGKVEFMRVIKR 527
ASG L L+ EAKL GKV M+VIK+
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKK 160

>ref|XP_004288775.1| PREDICTED: uncharacterized protein LOC101312197 [Fragaria vesca subsp. vesca]
Length = 210

Score = 89.7 bits (221), Expect = 4e-16
Identities = 59/179 (32%), Positives = 89/179 (49%), Gaps = 17/179 (9%)
Frame = +3

Query: 42  PLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLIIMRVRT 221
PL+PA+ RSD E E +K+KKRIK Y +L +MRV+T
Sbjct: 6   PLSPANANLRSDHEAAALQSQAE-LKRKKRIKVGIYVTVFVVVQIIVGTIIALTVMRVKT 64

Query: 222 PKVRMDDVTVTSVA------NGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFP 383
P++R+ ++ V ++ + +V F ++ VKNTN+G YKF+++ T + Q
Sbjct: 65  PRLRLGEIKVQNITAVPATPSFEVNFTTQIRVKNTNWGPYKFDASNVTFEYEGETLAQLG 124

Query: 384 IQEARARARSTKKIAVMASLGASA-----------SGTLELTVEAKLRGKVEFMRVIKR 527
I + +A STKK V SL + +G L LT A+L GKVE M V+K+
Sbjct: 125 IPKGKAGMLSTKKYDVSVSLNSKGLKNSNLGSDLNTGVLSLTSTARLTGKVELMMVMKK 183

>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
gi|508721844|gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
Length = 259

Score = 89.0 bits (219), Expect = 6e-16
Identities = 59/160 (36%), Positives = 88/160 (55%), Gaps = 20/160 (12%)
Frame = +3

Query: 108 EQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLIIMRVRTPKVRM-----DDVTVT-SVANG 269
+++K+KKR+KCL Y +L +MR++ PK R+ DD+T S +
Sbjct: 14  KELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSF 73

Query: 270 DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQE--ARARARSTKKIAVMASL 443
+++F A+V VKNTNFG YKFE++ T + V + + + ARARARSTKK+ V L
Sbjct: 74  NMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTMDL 133

Query: 444 GASA------------SGTLELTVEAKLRGKVEFMRVIKR 527
++ SG L LT ++ L GKV M+VIK+
Sbjct: 134 NSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKK 173

>ref|XP_004309174.1| PREDICTED: uncharacterized protein LOC101303468 [Fragaria vesca subsp. vesca]
Length = 178

Score = 89.0 bits (219), Expect = 6e-16
Identities = 55/151 (36%), Positives = 82/151 (54%), Gaps = 11/151 (7%)
Frame = +3

Query: 108 EQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLIIMRVRTPKVRMDDVTVTSV--ANGDVR- 278
++ ++KR +CL +I+MRV+TPKVR++ V VT++ +N ++
Sbjct: 3   DEESRRKRTRCLACIAFGVIAQTIIIVLFVVIVMRVKTPKVRLESVGVTTLTASNSSLKA 62

Query: 279 -FGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVMASLGAS- 452
A V VKN NFG YKFES AT N+ + I + +A+A+ TKKI V SL +
Sbjct: 63  SIDALVTVKNKNFGHYKFESAKATFSYKGTNIGEGTISKDKAKAKKTKKINVTVSLNSDK 122

Query: 453 ------ASGTLELTVEAKLRGKVEFMRVIKR 527
+SG + LT AKL GKV + +IK+
Sbjct: 123 ITASDISSGNVTLTAYAKLDGKVHLLNIIKK 153

>ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
gi|508777616|gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
Length = 213

Score = 88.6 bits (218), Expect = 8e-16
Identities = 61/188 (32%), Positives = 89/188 (47%), Gaps = 19/188 (10%)
Frame = +3

Query: 21  QPEVQGYPLAPASVGPRSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXXXXXXXXSL 200
Q + Q PLAP PRSD E+G + Q +K+K KCL Y +
Sbjct: 2   QEDPQAKPLAPVEYYPRSDMEFGGIKPTASQ-RKEKSSKCLVYVLVGMVIQGAVLLIFAS 60

Query: 201 IIMRVRTPKVRMDDVTVTSVANGD-------VRFGARVLVKNTNFGRYKFESTLATIRTA 359
I++R RTP V + VTV ++ G+ + V V+N+NFG +KFE+T T+
Sbjct: 61  IVLRARTPDVEIVSVTVRNLKYGNSSAPSFNLTLVTEVTVENSNFGDFKFENTTGTVWCG 120

Query: 360 DNNVVQFPIQEARARARSTKKIAVMASLGA------------SASGTLELTVEAKLRGKV 503
V + I RA+AR+T+++ V + + +SG LEL KL GKV
Sbjct: 121 SVVVGKMKIPTGRAQARATERLNVSVDVSSLPLPDTKNVSCNISSGLLELNSHVKLSGKV 180

Query: 504 EFMRVIKR 527
M +KR
Sbjct: 181 SIMNFMKR 188

>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
gi|508776113|gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
Length = 201

Score = 88.6 bits (218), Expect = 8e-16
Identities = 54/162 (33%), Positives = 83/162 (51%), Gaps = 20/162 (12%)
Frame = +3

Query: 102 SGEQMKKKKRIKCLTYXXXXXXXXXXXXXXXSLIIMRVRTPKVRMDDVTVTSVA------ 263
S ++K+KKR+K Y SL +MR++ PK R+ +TV +A
Sbjct: 15  SAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPN 74

Query: 264 --NGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVMA 437
+ +++F A V VKNTNFG +KF++T + V + + + RA+ARSTKK+ V
Sbjct: 75  PPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTV 134

Query: 438 SLGAS------------ASGTLELTVEAKLRGKVEFMRVIKR 527
L ++ +SG L LT KL GKV M++IK+
Sbjct: 135 DLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKK 176

>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca subsp. vesca]
Length = 213

Score = 88.2 bits (217), Expect = 1e-15
Identities = 59/192 (30%), Positives = 97/192 (50%), Gaps = 18/192 (9%)
Frame = +3

Query: 6   MADKYQPEVQGYPLAPASVGP---RSDEEYGNNHHSGEQMKKKKRIKCLTYXXXXXXXXX 176
MA++ Q + YP AP + G RSD E + HS +++KKKRIKCL Y
Sbjct: 1   MAERNQ---EAYPFAPYANGQAMARSDAE-SSRAHSDHELRKKKRIKCLIYIAVFAVFQI 56

Query: 177 XXXXXXSLIIMRVRTPKVRMDDVTVTSVANGD-------VRFGARVLVKNTNFGRYKFES 335
+L +M++++PK R+ +TV + + + F A V VKN NFGRYK++
Sbjct: 57  IVITVFALTVMKIKSPKFRIKSITVQDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQ 116

Query: 336 TLATIRTADNNVVQFPIQEARARARSTKKIAVMASL--------GASASGTLELTVEAKL 491
T + V + +A AR ++T+K V ++ ++G++ L+ +K+
Sbjct: 117 TSISFIYEGTQVGDAVVPKATARTKATRKEIVSGAVKTVNSNLASDISAGSVTLSTYSKI 176

Query: 492 RGKVEFMRVIKR 527
GKV M +IK+
Sbjct: 177 NGKVYLMNMIKK 188