BLASTX nr result
ID: Mentha28_contig00006083
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00006083 (739 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus...   179   7e-43
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   138   2e-30
ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579...   134   3e-29
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   132   1e-28
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   127   3e-27
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...   127   4e-27
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   124   4e-26
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   124   4e-26
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     120   4e-25
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   119   8e-25
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   119   1e-24
ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295...   119   1e-24
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   119   1e-24
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   119   1e-24
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   118   2e-24
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   116   7e-24
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   116   7e-24
ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295...   115   1e-23
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...   115   2e-23
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   115   2e-23

>gb|EYU40054.1| hypothetical protein MIMGU_mgv1a018513mg [Mimulus guttatus] Length = 208 Score = 179 bits (455), Expect = 7e-43 Identities = 102/201 (50%), Positives = 138/201 (68%), Gaps = 10/201 (4%) Frame = -2 Query: 576 YQPEV-QGYPLAPASVVPRSDEEFG--NNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVIL 406 Y EV Q YPLAP S VPRSDEE+ NN + E+MKK KR+KC Y +IL Sbjct: 5 YNQEVHQAYPLAP-STVPRSDEEYSGTNNYRAQEEMKKNKRMKCFAYIACFAVFQIIIIL 63 Query: 405 IFSLIIMRVRTPKVRMDNVTVTSG-ANGDVRFGARVLVKNTNFGRYKFESTLATIRTADN 229 I +L +MRV++PK+R+ ++TVT +G+VR ARVLVKNTNFGRYKF+S LATIR+ + Sbjct: 64 ILALTVMRVKSPKLRLGDITVTKDHVSGNVRLTARVLVKNTNFGRYKFDSGLATIRSGAS 123 Query: 228 NVVQFPIQEARARARSTKKIAVVASLGASAT------GTLELTVEAKLRGKVEFMRVIKR 67 NV QF I E+RARARSTKK+ V L +S + G L VE++LRGKVE ++V+K+ Sbjct: 124 NVGQFVIPESRARARSTKKMYVTVDLNSSNSSNNSMGGVWTLNVESQLRGKVELLKVVKK 183 Query: 66 KKTADMNCTLTLVLATNSVQN 4 K+A MNC + + L ++++Q+ Sbjct: 184 TKSAYMNCVVVINLRSSTIQD 204 >ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777615|gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 215 Score = 138 bits (347), Expect = 2e-30 Identities = 87/212 (41%), Positives = 126/212 (59%), Gaps = 21/212 (9%) Frame = -2 Query: 573 QPEVQGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSL 394 + + Q +PLAPA+ PRSDEE + + +++K+KKRIK Y VILIF+L Sbjct: 3 EKDQQVHPLAPANGHPRSDEESASLQ--SKELKRKKRIKYAVYIAAFAVFQTVVILIFAL 60 Query: 393 IIMRVRTPKVRMDNVTVTS--------GANGDVRFGARVLVKNTNFGRYKFESTLATIRT 238 +MRV+ PKVR+ VTV + A+ ++RF +V VKNTNFG YKF++ + Sbjct: 61 TVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLY 120 Query: 237 ADNNVVQFPIQEARARARSTKKIAVVASLGASA-------------TGTLELTVEAKLRG 97 V + I +ARARARSTKK+ V + +SA + L L +AKL+G Sbjct: 121 DGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKG 180 Query: 96
KVEFMRVIKRKKTADMNCTLTLVLATNSVQNL 1 KVE M+V+K+KK+ +MNCTL ++T S+Q+L Sbjct: 181 KVELMKVMKKKKSPEMNCTLIFNVSTRSLQDL 212 >ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579067 [Solanum tuberosum] Length = 197 Score = 134 bits (338), Expect = 3e-29 Identities = 69/193 (35%), Positives = 117/193 (60%), Gaps = 6/193 (3%) Frame = -2 Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382 Q YPLAP++++PRSD EF N ++KK+++ +IL+F +R Sbjct: 5 QKYPLAPSNIMPRSDAEFATNNFQSNNQRRKKKLRST---FLLTIFLTGIILLFCFTFLR 61 Query: 381 VRTPKVRMDNVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV-QFPIQ 205 +++PK+R++N+ +T+ +G + F A+V ++N NF RY ++STL TI TA+ + +F I Sbjct: 62 IKSPKIRIENIRITNDGDGRINFSAQVFLRNRNFWRYGYDSTLGTINTAEGTTIGRFVIP 121 Query: 204 EARARARSTKKIAV-----VASLGASATGTLELTVEAKLRGKVEFMRVIKRKKTADMNCT 40 + R RSTK I V + S + +G L + EAK+RGKV+ RV + KK D++CT Sbjct: 122 DGEVRRRSTKTIYVMENFILPSRLNNTSGILPVISEAKIRGKVKVFRVFRWKKNVDLSCT 181 Query: 39 LTLVLATNSVQNL 1 +++ L +++Q+L Sbjct: 182 MSINLTISAIQDL 194 >ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao] gi|508776114|gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao] Length = 214 Score = 132 bits (333), Expect = 1e-28 Identities = 79/204 (38%), Positives = 113/204 (55%), Gaps = 19/204 (9%) Frame = -2 Query: 555 YPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVR 376 YPL PA+ +E HS E +KKKKR+KCL Y +IL+F+L +MR+R Sbjct: 9 YPLVPAANGHERSDEESVAAHSKE-LKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRIR 67 Query: 375 TPKVRMD-------NVTVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQ 217 PK R+ NV + + D++ + VKNTNFG +K+E L T V + Sbjct: 68 NPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVGR 127 Query: 216 FPIQEARARARSTKKIAVVASLGAS------------ATGTLELTVEAKLRGKVEFMRVI 73 IQ+ARARARSTKK+ VV L ++ + G L LT +KL GK+ M+VI Sbjct: 128 ATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKVI 187 Query: 72 KRKKTADMNCTLTLVLATNSVQNL 1 K+KK+ MNCT+ + + T +V+N+ Sbjct: 188 KKKKSTQMNCTMDVAIDTRTVRNI 211 >ref|XP_004300831.1| 
PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca subsp. vesca] Length = 211 Score = 127 bits (320), Expect = 3e-27 Identities = 79/205 (38%), Positives = 118/205 (57%), Gaps = 18/205 (8%) Frame = -2 Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382 Q YPLAPA+ RSD G + S++++K++KR K Y V+ +F L +M+ Sbjct: 7 QAYPLAPANGYTRSD---GESLVSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMK 63 Query: 381 VRTPKVRMDNVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 220 V+TPKVR+ + V S + D F ++ VKNTN+G YKF+++ AT V Sbjct: 64 VKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVG 123 Query: 219 QFPIQEARARARSTKKIAVVASLGASA------------TGTLELTVEAKLRGKVEFMRV 76 Q I +++AR RSTKKI+V L +A +G L LT +AKL GKVE M + Sbjct: 124 QVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLI 183 Query: 75 IKRKKTADMNCTLTLVLATNSVQNL 1 +K+KK+A M+CT+ L+T +V++L Sbjct: 184 MKKKKSATMDCTIAFDLSTKTVKSL 208 >ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca subsp. vesca] Length = 200 Score = 127 bits (319), Expect = 4e-27 Identities = 74/195 (37%), Positives = 111/195 (56%), Gaps = 8/195 (4%) Frame = -2 Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382 Q YPLAP++ RSD E S++++K+KKRIKC Y V +F L +++ Sbjct: 7 QVYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAVGAVFGLTVLK 62 Query: 381 VRTPKVRMDNVTVTSGANGDVR-----FGARVLVKNTNFGRYKFESTLATIRTADNNVVQ 217 V+TPKVR+D + SG F ++ VKNTN+G YKF+ + T + V Sbjct: 63 VKTPKVRLDTTSTLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVVTFKYQGTPVGT 122 Query: 216 FPIQEARARARSTKKIAVVASLGASA---TGTLELTVEAKLRGKVEFMRVIKRKKTADMN 46 F + + +A R TKKI SL +A +G L LT EAKL GKV M ++K+KK+A MN Sbjct: 123 FTVPKGKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMFIMKKKKSASMN 182 Query: 45 CTLTLVLATNSVQNL 1 CT+ + ++ +V+++ Sbjct: 183 CTIQIDVSGQTVKSV 197 >ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca subsp. 
vesca] Length = 222 Score = 124 bits (310), Expect = 4e-26 Identities = 78/211 (36%), Positives = 117/211 (55%), Gaps = 26/211 (12%) Frame = -2 Query: 555 YPLAP-ASVVPRSDEEFGNNRH-SDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382 YPL P A RSD+E + S E+++ KKR++CL Y VI +F+L +M+ Sbjct: 13 YPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVVVITVFALTVMK 72 Query: 381 VRTPKVRMDNVTVTSGANG-----------DVRFGARVLVKNTNFGRYKFESTLATIRTA 235 +++PK R+ ++T G DV FG VKNTNFG +++E + Sbjct: 73 IKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFG----VKNTNFGHFEYEDGIVVFTYR 128 Query: 234 DNNVVQFPIQEARARARSTKKIAV----VASLGASA---------TGTLELTVEAKLRGK 94 D + Q ++E R RARST+K+ V + S G A TG + +T+ +KL GK Sbjct: 129 DVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITISSKLDGK 188 Query: 93 VEFMRVIKRKKTADMNCTLTLVLATNSVQNL 1 + M++IK+KK+A MNCT+ +VLAT SVQN+ Sbjct: 189 IHLMKIIKKKKSAQMNCTMEVVLATKSVQNV 219 >ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca subsp. vesca] Length = 211 Score = 124 bits (310), Expect = 4e-26 Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 19/206 (9%) Frame = -2 Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382 Q YPLAP++ RSD E S++++K+KKRIKC Y V+ +F L IM+ Sbjct: 7 QSYPLAPSNGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTIMK 62 Query: 381 VRTPKVRMDNVTVTSGANGDVR------FGARVLVKNTNFGRYKFESTLATIRTADNNVV 220 V+TPKVR+ T+T + D F ++ VKNTN+G YKF+ + T V Sbjct: 63 VKTPKVRLGTSTLTDFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMPVG 122 Query: 219 QFPIQEARARARSTKKIAVVASLGASAT-------------GTLELTVEAKLRGKVEFMR 79 + + +A R TKKI V L +A G L LT EAKL GKVE M Sbjct: 123 TVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVELML 182 Query: 78 VIKRKKTADMNCTLTLVLATNSVQNL 1 ++K+KK+A MNCT+ + ++ +V++L Sbjct: 183 IMKKKKSASMNCTIQIDVSGKTVKSL 208 >gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis] Length = 212 Score = 120 bits (302), Expect = 4e-25 Identities = 76/204 (37%), Positives = 114/204 (55%), Gaps = 20/204 (9%) Frame = -2 Query: 561 
QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382 Q YPLAPA+ PRSDEE N +++K++KRIK Y V L+F L++MR Sbjct: 7 QVYPLAPANGHPRSDEESSNL--DAKELKRRKRIKLAIYAFIFTASQIIVTLVFVLVVMR 64 Query: 381 VRTPKVRMDN------VTVTSGANG--DVRFGARVLVKNTNFGRYKFESTLATIRTADNN 226 V++PK+R+ + + SG+ D+ F ++ VKNTN+G YKF++T A Sbjct: 65 VKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTTAAFAYEGET 124 Query: 225 VVQFPIQEARARARSTKKIAVVASLGAS------------ATGTLELTVEAKLRGKVEFM 82 V Q I + +A RSTKK+ V SL +S + G L L AK+ GKV+ M Sbjct: 125 VGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTAKMTGKVKLM 184 Query: 81 RVIKRKKTADMNCTLTLVLATNSV 10 ++K+KK+A+MNCT+ + + +V Sbjct: 185 LIMKKKKSANMNCTINIHVKEKTV 208 >ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776108|gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 191 Score = 119 bits (299), Expect = 8e-25 Identities = 66/179 (36%), Positives = 102/179 (56%), Gaps = 20/179 (11%) Frame = -2 Query: 477 KKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTV--------TSGANGD 322 ++K+ IKCL Y +IL+F +++MR+R PKVR+ VTV +S + Sbjct: 10 RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69 Query: 321 VRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVVASLGAS 142 + A+V VKNTNFG +KF+++ TI V + I +ARARARST K+ V S+ + Sbjct: 70 MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSD 129 Query: 141 ------------ATGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNL 1 +GT+ L+ AKL GK+ +V K+KK+A+MNCT+ + ++ +QNL Sbjct: 130 KMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNL 188 >ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca subsp. 
vesca] Length = 211 Score = 119 bits (298), Expect = 1e-24 Identities = 73/205 (35%), Positives = 115/205 (56%), Gaps = 18/205 (8%) Frame = -2 Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382 Q YPLAPA+ RSD G + S +++K++KRI+ TY V+ +F L +M+ Sbjct: 7 QAYPLAPANGYTRSD---GESLVSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMK 63 Query: 381 VRTPKVRMDNV------TVTSGANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 220 V+TPKVR+ + +V + + D F ++ VKNTN+G YKF+++ T V Sbjct: 64 VKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVG 123 Query: 219 QFPIQEARARARSTKKIAVVASLGASA------------TGTLELTVEAKLRGKVEFMRV 76 Q + + +A RSTKK+ V SL A+ +G L L +AKL GKVE M + Sbjct: 124 QVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLI 183 Query: 75 IKRKKTADMNCTLTLVLATNSVQNL 1 +K+KK++ M+C + L+T +V++L Sbjct: 184 MKKKKSSTMDCMIGFDLSTKTVKSL 208 >ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca subsp. vesca] Length = 211 Score = 119 bits (298), Expect = 1e-24 Identities = 72/205 (35%), Positives = 114/205 (55%), Gaps = 18/205 (8%) Frame = -2 Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382 Q YP AP++ RSD G + S++++K+KKRIK TY V+ +F L +M+ Sbjct: 7 QAYPTAPSNGYARSD---GESLVSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMK 63 Query: 381 VRTPKVRMDNVTVTS------GANGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVV 220 V+TPK R ++ V + + D F ++ +KNTN+G YKF++ AT + Sbjct: 64 VKTPKARWGSIDVETLNYVPATPSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIG 123 Query: 219 QFPIQEARARARSTKKIAVVASLGASA------------TGTLELTVEAKLRGKVEFMRV 76 + I +++A RSTKKI V SL +A +G L LT + +L+GKVE M + Sbjct: 124 KVDIPKSKAGMRSTKKIDVEVSLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLI 183 Query: 75 IKRKKTADMNCTLTLVLATNSVQNL 1 +K+ K A M+CT+ L++ +VQ+L Sbjct: 184 MKKNKNASMDCTIAFDLSSKTVQSL 208 >ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 185 Score = 
119 bits (297), Expect = 1e-24 Identities = 71/177 (40%), Positives = 102/177 (57%), Gaps = 19/177 (10%) Frame = -2 Query: 474 KKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTVTSGANG-------DVR 316 K+ KCL Y +ILIF+L +MR++ PKVR VTV + + G D+R Sbjct: 6 KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65 Query: 315 FGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVVASLGAS-- 142 A+V VKNTNFG +K+E++ I V + I +ARARAR TKK V + +S Sbjct: 66 LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125 Query: 141 ----------ATGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQNL 1 A+G L L+ EAKL GKV M+VIK+KK+++M+CT+ + + T +VQ+L Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDL 182 >ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776113|gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 Score = 119 bits (297), Expect = 1e-24 Identities = 70/184 (38%), Positives = 108/184 (58%), Gaps = 20/184 (10%) Frame = -2 Query: 492 SDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTV-----TSGAN 328 S ++K+KKR+K Y VIL+FSL +MR++ PK R+ ++TV TS N Sbjct: 15 SAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPN 74 Query: 327 G---DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVVA 157 +++F A V VKNTNFG +KF++T + V + + + RA+ARSTKK+ V Sbjct: 75 PPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTV 134 Query: 156 SLGAS------------ATGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNS 13 L ++ ++G L LT KL GKV M++IK+KK+A MNCT+T+ LA+ + Sbjct: 135 DLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRA 194 Query: 12 VQNL 1 +Q++ Sbjct: 195 IQDI 198 >ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508721844|gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 259 Score = 118 bits (296), Expect = 2e-24 
Identities = 70/182 (38%), Positives = 109/182 (59%), Gaps = 20/182 (10%) Frame = -2 Query: 486 EQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTV------TSGANG 325 +++K+KKR+KCL Y +IL+F+L +MR++ PK R+ +V V S + Sbjct: 14 KELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSF 73 Query: 324 DVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQE--ARARARSTKKIAVVASL 151 +++F A+V VKNTNFG YKFE++ T + V + + + ARARARSTKK+ V L Sbjct: 74 NMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTMDL 133 Query: 150 GASA------------TGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQ 7 ++ +G L LT ++ L GKV M+VIK+KK+ +MNCT+T+ LA V+ Sbjct: 134 NSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKLVR 193 Query: 6 NL 1 ++ Sbjct: 194 DI 195 >ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777614|gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 188 Score = 116 bits (291), Expect = 7e-24 Identities = 68/181 (37%), Positives = 108/181 (59%), Gaps = 19/181 (10%) Frame = -2 Query: 486 EQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTVTS---GANGDVR 316 E+ K+ + +KC Y +IL+F+L +MR++TP R+ +VTV S A+G Sbjct: 5 EKYKRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPH 64 Query: 315 FGARVL----VKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVVASLG 148 F R++ VKN NFG ++F++T A + V I ++RARAR TK++ V + Sbjct: 65 FNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVS 124 Query: 147 ASA------------TGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQN 4 +SA +GTL LT A+LRGKV M+++K++KTA+MNCT+T+ L +++VQ+ Sbjct: 125 SSAVSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQD 184 Query: 3 L 1 L Sbjct: 185 L 185 >ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca subsp. 
vesca] Length = 213 Score = 116 bits (291), Expect = 7e-24 Identities = 73/205 (35%), Positives = 117/205 (57%), Gaps = 18/205 (8%) Frame = -2 Query: 561 QGYPLAP---ASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLI 391 + YP AP + RSD E + HSD +++KKKRIKCL Y VI +F+L Sbjct: 7 EAYPFAPYANGQAMARSDAE-SSRAHSDHELRKKKRIKCLIYIAVFAVFQIIVITVFALT 65 Query: 390 IMRVRTPKVRMDNVTV-----TSGANG--DVRFGARVLVKNTNFGRYKFESTLATIRTAD 232 +M++++PK R+ ++TV ++ AN + F A V VKN NFGRYK++ T + Sbjct: 66 VMKIKSPKFRIKSITVQDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQTSISFIYEG 125 Query: 231 NNVVQFPIQEARARARSTKK------IAVVASLGAS--ATGTLELTVEAKLRGKVEFMRV 76 V + +A AR ++T+K + V S AS + G++ L+ +K+ GKV M + Sbjct: 126 TQVGDAVVPKATARTKATRKEIVSGAVKTVNSNLASDISAGSVTLSTYSKINGKVYLMNM 185 Query: 75 IKRKKTADMNCTLTLVLATNSVQNL 1 IK+KK+A+M CT+ + L++ VQ++ Sbjct: 186 IKKKKSAEMKCTMVVHLSSKQVQDI 210 >ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca subsp. vesca] Length = 212 Score = 115 bits (289), Expect = 1e-23 Identities = 69/182 (37%), Positives = 103/182 (56%), Gaps = 18/182 (9%) Frame = -2 Query: 492 SDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMRVRTPKVRMDNVTVTS------GA 331 S+E++K++KRIK TY V+ +F L +M+V+TPKVR+ V + Sbjct: 28 SEEELKRQKRIKLFTYIGIFIGFQIIVMTVFGLTVMKVKTPKVRLGATNVQNLNFVPTSP 87 Query: 330 NGDVRFGARVLVKNTNFGRYKFESTLATIRTADNNVVQFPIQEARARARSTKKIAVVASL 151 + D F ++ +KNTN+G YKF++ AT V Q +++A RSTKKI SL Sbjct: 88 SFDTTFATQIRIKNTNWGPYKFDAGTATFMYQGVAVGQVSFPKSKAGMRSTKKINAEVSL 147 Query: 150 GAS------------ATGTLELTVEAKLRGKVEFMRVIKRKKTADMNCTLTLVLATNSVQ 7 ++ ++G L LT EAKL GKVE M ++K+KK+A MNCT+ L L+T ++Q Sbjct: 148 NSNEIPSTSNLGSELSSGVLTLTSEAKLTGKVELMLIMKKKKSATMNCTMKLDLSTKTIQ 207 Query: 6 NL 1 L Sbjct: 208 AL 209 >gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus] Length = 213 Score = 115 bits (288), Expect = 2e-23 Identities = 73/208 (35%), Positives = 107/208 (51%), Gaps = 19/208 (9%) Frame = -2 Query: 567 EVQGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLII 388 E + PL A+ RSD E G H + 
+KKKR KC Y VI IFS+ + Sbjct: 3 EKEHQPLPYANGHGRSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTV 62 Query: 387 MRVRTPKVRMDNVTVT---SGANGDVRF----GARVLVKNTNFGRYKFESTLATIRTADN 229 M++RTPK R+ + +T +G G F A VKN NFGRYK+ +T Sbjct: 63 MKIRTPKFRIRSAHLTTFHAGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGT 122 Query: 228 NVVQFPIQEARARARSTKKIAVVASLGAS------------ATGTLELTVEAKLRGKVEF 85 V Q ++++RA RSTKK VV L + G +++T +A++ G+VE Sbjct: 123 PVGQVFVRDSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNAGVVQITSQARMAGRVEL 182 Query: 84 MRVIKRKKTADMNCTLTLVLATNSVQNL 1 + V+K+ K+ DMNC + +V AT ++NL Sbjct: 183 IFVMKKNKSTDMNCNMEIVTATQQIRNL 210 >ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca subsp. vesca] Length = 211 Score = 115 bits (287), Expect = 2e-23 Identities = 72/203 (35%), Positives = 109/203 (53%), Gaps = 16/203 (7%) Frame = -2 Query: 561 QGYPLAPASVVPRSDEEFGNNRHSDEQMKKKKRIKCLTYXXXXXXXXXXVILIFSLIIMR 382 Q YPLA + RSD E S++++K+KKRIKC Y + +F L +++ Sbjct: 10 QTYPLASENGYTRSDGE----SLSEDELKRKKRIKCFAYIGIFIVFQMAIGAVFGLTVLK 65 Query: 381 VRTPKVRMDNVTVTSGANGDVRFGA----RVLVKNTNFGRYKFESTLATIRTADNNVVQF 214 V+TPKVR+ T++ + F + ++ VKNTN+G YKF+ + T V Sbjct: 66 VKTPKVRLGTSTLSDVTSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVTFMYQGAPVGTV 125 Query: 213 PIQEARARARSTKKIAVVASLGASAT------------GTLELTVEAKLRGKVEFMRVIK 70 + + +A R TKKI V SL +A G L LT EAKL GKVE M ++K Sbjct: 126 VVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKLTGKVELMLIMK 185 Query: 69 RKKTADMNCTLTLVLATNSVQNL 1 +KK+A MNCT+ + ++ +V++L Sbjct: 186 KKKSASMNCTIQIDVSGKTVKSL 208