BLASTX nr result
ID: Rehmannia23_contig00018326
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia23_contig00018326 (813 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao] 210 6e-52 ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302... 192 1e-46 ref|XP_002513626.1| conserved hypothetical protein [Ricinus comm... 183 7e-44 ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293... 171 3e-40 ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578... 170 6e-40 ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303... 169 8e-40 gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich g... 168 2e-39 gb|EXC05941.1| hypothetical protein L484_014209 [Morus notabilis] 166 7e-39 ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295... 165 2e-38 ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296... 164 3e-38 gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich g... 161 3e-37 ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296... 159 9e-37 gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich g... 157 3e-36 gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich g... 156 7e-36 emb|CBI22611.3| unnamed protein product [Vitis vinifera] 156 9e-36 ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295... 155 2e-35 ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295... 155 2e-35 gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich g... 154 3e-35 gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich g... 150 7e-34 gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis] 148 2e-33 >gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao] Length = 214 Score = 210 bits (534), Expect = 6e-52 Identities = 103/216 (47%), Positives = 152/216 (70%), Gaps = 4/216 (1%) Frame = +1 Query: 136 EKEQQATHPMAPT-NGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVILI 312 E + Q+ +P+ P NG+ RSD E S AAH ++ +KKKR KCLLYIVLFA+FQ +IL+ Sbjct: 2 EAKSQSPYPLVPAANGHERSD-EESVAAH--SKELKKKKRMKCLLYIVLFAVFQTGIILL 58 Query: 313 FVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNTTVEF 492 F LT+MR+R PKFRVRS + T FNVGT +PS MN + V+N NFG +KY V F Sbjct: 59 FALTVMRIRNPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTF 118 Query: 493 LYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN---LGVNSELGSDLRAGIVPITSQARMR 663 YRGTPVG+ ++ +RA RST+K +V V+L+ L +ELG D+ AG++ +TS +++ Sbjct: 119 AYRGTPVGRATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLD 178 Query: 664 GRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 G++ L+ V+++ +ST MNC+M++ + T+ +RNI+C+ Sbjct: 179 GKIHLMKVIKKKKSTQMNCTMDVAIDTRTVRNIICK 214 >ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca subsp. vesca] Length = 222 Score = 192 bits (488), Expect = 1e-46 Identities = 98/222 (44%), Positives = 144/222 (64%), Gaps = 8/222 (3%) Frame = +1 Query: 130 MAE-KEQQATHP---MAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQA 297 MAE KE AT P M Y RSD E++++A A R KKR +CLLY+ +FA+FQ Sbjct: 1 MAENKEAAATSPYPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQV 60 Query: 298 AVILIFVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRN 477 VI +F LT+M++++PKFRVR+A++T F VG+ NPS M+ V+N NFG ++Y + Sbjct: 61 VVITVFALTVMKIKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFGVKNTNFGHFEYED 120 Query: 478 TTVEFLYRGTPVGQVIVRGSRANWRSTRKFEV-RVDL---NLGVNSELGSDLRAGIVPIT 645 V F YR +GQ V R RSTRK +V VDL L NS LGSD+ GI+PIT Sbjct: 121 GIVVFTYRDVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPIT 180 Query: 646 SQARMRGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 +++ G++ L+ ++++ +S MNC+ME+V+AT+ ++N+VC+ Sbjct: 181 ISSKLDGKIHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVCK 222 >ref|XP_002513626.1| conserved hypothetical protein [Ricinus communis] gi|223547534|gb|EEF49029.1| conserved hypothetical protein [Ricinus communis] Length = 217 Score = 183 bits (464), Expect = 7e-44 Identities = 100/219 (45%), Positives = 143/219 (65%), Gaps = 5/219 (2%) Frame = +1 Query: 130 MAEKEQQATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVIL 309 MAEKEQ T +A +G TRSD ES +A + RKKKR KC+ ++V F IFQ +IL Sbjct: 1 MAEKEQAPTPLVA--DGQTRSDEESGTAGTAQTKELRKKKRMKCIAFVVAFTIFQTGIIL 58 Query: 310 IFVLTIMRVRTPKFRVRSAALTN-FNVGT-PENPSLTATMNAELAVRNANFGRYKYRNTT 483 +FV T++R + PKFRVRSA+ + F+VGT PS TMN + V+N NFG +KY +T Sbjct: 59 LFVFTVLRFKDPKFRVRSASFDDTFHVGTDAAAPSFNLTMNTQFGVKNTNFGHFKYETST 118 Query: 484 VEFLYRGTPVGQVIVRGSRANWRSTRKFEVRVDL---NLGVNSELGSDLRAGIVPITSQA 654 V F YRGT VG V V +RA RSTRKF+ V L L EL SD+ +G +P++S + Sbjct: 119 VTFEYRGTVVGLVNVDKARARARSTRKFDAIVVLRTDRLPDGFELSSDISSGKIPLSSSS 178 Query: 655 RMRGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 R+ G + L+ V+++ +S +MNC+M + + T+ +++IVC+ Sbjct: 179 RLDGEIHLMKVIKKKKSAEMNCTMNVDIQTRTLQDIVCK 217 >ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca subsp. vesca] Length = 211 Score = 171 bits (433), Expect = 3e-40 Identities = 87/218 (39%), Positives = 141/218 (64%), Gaps = 4/218 (1%) Frame = +1 Query: 130 MAEKEQQATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVIL 309 MAE+ Q+ +P+AP+NGYTRSD ES S ++KKR KC YI +F +FQ AV+ Sbjct: 1 MAERTHQS-YPLAPSNGYTRSDGESLSEDE-----LKRKKRIKCFAYIGIFIVFQIAVMT 54 Query: 310 IFVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNTTVE 489 +F LTIM+V+TPK R+ ++ LT+F + PS T N ++ V+N N+G YK+ V Sbjct: 55 VFGLTIMKVKTPKVRLGTSTLTDFT-SSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVT 113 Query: 490 FLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLNLGV----NSELGSDLRAGIVPITSQAR 657 F+Y+G PVG V+V +A R T+K V V LN +S L ++L G++ +TS+A+ Sbjct: 114 FMYQGMPVGTVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAK 173 Query: 658 MRGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 + G+V+L+ +M++ +S MNC+++I V+ + ++++ C+ Sbjct: 174 LTGKVELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211 >ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578608 [Solanum tuberosum] Length = 204 Score = 170 bits (430), Expect = 6e-40 Identities = 91/212 (42%), Positives = 141/212 (66%) Frame = +1 Query: 133 AEKEQQATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVILI 312 AE+EQQ TNG+ + E+ ++ + R+KKRNK L+Y+ LF +FQ AV+L Sbjct: 3 AEEEQQLQ-----TNGHAKPAEETPNSTQ--SNELRRKKRNKILVYVALFIVFQIAVLLF 55 Query: 313 FVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNTTVEF 492 F L IM++RTPKF VRSA F++ EN S TMNAEL+V+NANFG Y Y+N+T+ F Sbjct: 56 FSLYIMKIRTPKFSVRSAT---FDLMVTENASFNITMNAELSVKNANFGPYNYKNSTIYF 112 Query: 493 LYRGTPVGQVIVRGSRANWRSTRKFEVRVDLNLGVNSELGSDLRAGIVPITSQARMRGRV 672 Y +G+ V +A ++S++KF V V+L+ S+L +DL +G + +TS++++ G+V Sbjct: 113 YYNDVSIGEAFVYQGKAGFKSSKKFNVIVNLS-SKESKLRNDLNSGTLILTSKSKLEGKV 171 Query: 673 DLIFVMRRNRSTDMNCSMEIVVATQQIRNIVC 768 LIF M++ +ST+MNC++ I +A + +R+I C Sbjct: 172 KLIFFMKKKKSTEMNCAIIIGLAGKVVRDIQC 203 >ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca subsp. vesca] Length = 213 Score = 169 bits (429), Expect = 8e-40 Identities = 90/216 (41%), Positives = 143/216 (66%), Gaps = 3/216 (1%) Frame = +1 Query: 130 MAEKEQQATHPMAP-TNGYT--RSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAA 300 MAE+ Q+A +P AP NG RSDAESS A RKKKR KCL+YI +FA+FQ Sbjct: 1 MAERNQEA-YPFAPYANGQAMARSDAESSRAHSD--HELRKKKRIKCLIYIAVFAVFQII 57 Query: 301 VILIFVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNT 480 VI +F LT+M++++PKFR++S + + NPSL+ + AE++V+N NFGRYKY T Sbjct: 58 VITVFALTVMKIKSPKFRIKSITVQDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQT 117 Query: 481 TVEFLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLNLGVNSELGSDLRAGIVPITSQARM 660 ++ F+Y GT VG +V + A ++TRK E+ VNS L SD+ AG V +++ +++ Sbjct: 118 SISFIYEGTQVGDAVVPKATARTKATRK-EIVSGAVKTVNSNLASDISAGSVTLSTYSKI 176 Query: 661 RGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVC 768 G+V L+ ++++ +S +M C+M + ++++Q+++I C Sbjct: 177 NGKVYLMNMIKKKKSAEMKCTMVVHLSSKQVQDIKC 212 >gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 215 Score = 168 bits (425), Expect = 2e-39 Identities = 87/219 (39%), Positives = 138/219 (63%), Gaps = 5/219 (2%) Frame = +1 Query: 130 MAEKEQQATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVIL 309 MAEK+QQ HP+AP NG+ RSD ES+S ++ ++KKR K +YI FA+FQ VIL Sbjct: 1 MAEKDQQV-HPLAPANGHPRSDEESASLQ---SKELKRKKRIKYAVYIAAFAVFQTVVIL 56 Query: 310 IFVLTIMRVRTPKFRVRSAALTNFNVGTPENP-SLTATMNAELAVRNANFGRYKYRNTTV 486 IF LT+MRV+ PK R+ + E S ++ V+N NFG YK+ N T+ Sbjct: 57 IFALTVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATM 116 Query: 487 EFLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN----LGVNSELGSDLRAGIVPITSQA 654 FLY G VG+ I+ +RA RST+K +V V++N + LGS+L + ++ + SQA Sbjct: 117 SFLYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQA 176 Query: 655 RMRGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 +++G+V+L+ VM++ +S +MNC++ V+T+ ++++ C+ Sbjct: 177 KLKGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215 >gb|EXC05941.1| hypothetical protein L484_014209 [Morus notabilis] Length = 220 Score = 166 bits (421), Expect = 7e-39 Identities = 80/215 (37%), Positives = 135/215 (62%), Gaps = 7/215 (3%) Frame = +1 Query: 148 QATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVILIFVLTI 327 Q T P+AP + Y RSD E S + H RKK+RN+ LL++ FA+ +I+++ + + Sbjct: 9 QPTQPLAPPHAYIRSDMEMESLS--AQEHIRKKRRNR-LLFVTSFAVTLVILIIVYAIVV 65 Query: 328 MRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNTTVEFLYRGT 507 R +TPKFR+RSA+ T+F VG +PS + MN++ +RN NFGRYKY + TV F YRG Sbjct: 66 TRYKTPKFRLRSASFTSFQVGNSTDPSFSFVMNSQFTIRNRNFGRYKYEDATVVFEYRGL 125 Query: 508 PVGQVIVRGSRANWRSTRKFEVRVDL-------NLGVNSELGSDLRAGIVPITSQARMRG 666 VGQ + +R R+T+K V L + G +LG D+ G++ + S + ++G Sbjct: 126 AVGQAYIDDARVRPRTTKKVNATVVLDSSSLVGDSGAFDQLGKDIGEGVLVLNSSSELKG 185 Query: 667 RVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 +V ++ V+R+ + + +NC+M +V+A++ ++N++C+ Sbjct: 186 KVRVLKVIRKTKYSRLNCTMNVVIASRSVQNLICQ 220 >ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca subsp. vesca] Length = 211 Score = 165 bits (418), Expect = 2e-38 Identities = 85/219 (38%), Positives = 140/219 (63%), Gaps = 5/219 (2%) Frame = +1 Query: 130 MAEKEQQA--THPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAV 303 MAEK Q+ T+P+A NGYTRSD ES S ++KKR KC YI +F +FQ A+ Sbjct: 1 MAEKSQKTHQTYPLASENGYTRSDGESLSEDE-----LKRKKRIKCFAYIGIFIVFQMAI 55 Query: 304 ILIFVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNTT 483 +F LT+++V+TPK R+ ++ L++ T S ++T N ++ V+N N+G YK+ Sbjct: 56 GAVFGLTVLKVKTPKVRLGTSTLSDV---TSSTTSFSSTFNTQIRVKNTNWGPYKFDQGV 112 Query: 484 VEFLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN---LGVNSELGSDLRAGIVPITSQA 654 V F+Y+G PVG V+V +A R T+K V V LN L +S L S+L G++ +TS+A Sbjct: 113 VTFMYQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEA 172 Query: 655 RMRGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 ++ G+V+L+ +M++ +S MNC+++I V+ + ++++ C+ Sbjct: 173 KLTGKVELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211 >ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca subsp. vesca] Length = 211 Score = 164 bits (416), Expect = 3e-38 Identities = 85/217 (39%), Positives = 138/217 (63%), Gaps = 3/217 (1%) Frame = +1 Query: 130 MAEKEQQATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVIL 309 MAEK QA +P+AP NGYTRSD ES + +++KR + YI +F +FQ V+ Sbjct: 1 MAEKTHQA-YPLAPANGYTRSDGESLVSKD----ELKRRKRIRLFTYIGIFIVFQIIVMT 55 Query: 310 IFVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNTTVE 489 +F LT+M+V+TPK R+ + +FN P PS T ++ V+N N+G YK+ +TV Sbjct: 56 VFGLTVMKVKTPKVRLGEINVQDFN-SVPATPSFDTTFTTQIRVKNTNWGPYKFDASTVT 114 Query: 490 FLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN---LGVNSELGSDLRAGIVPITSQARM 660 F+Y+G VGQV V +A RST+K V V LN L +S LGS+L +G++ + SQA++ Sbjct: 115 FMYQGVAVGQVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKL 174 Query: 661 RGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 G+V+L+ +M++ +S+ M+C + ++T+ ++++ C+ Sbjct: 175 SGKVELMLIMKKKKSSTMDCMIGFDLSTKTVKSLQCK 211 >gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 185 Score = 161 bits (407), Expect = 3e-37 Identities = 68/180 (37%), Positives = 123/180 (68%), Gaps = 3/180 (1%) Frame = +1 Query: 241 KKKRNKCLLYIVLFAIFQAAVILIFVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTAT 420 K+ KCL Y+ +F +FQ A+ILIF LT+MR++ PK R + + NF+ G +P Sbjct: 6 KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65 Query: 421 MNAELAVRNANFGRYKYRNTTVEFLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN---L 591 + A++ V+N NFG +KY N+++ LY G PVG+ + +RA R T+KF+V +D++ L Sbjct: 66 LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125 Query: 592 GVNSELGSDLRAGIVPITSQARMRGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 NS LG+D+ +G++P++S+A++ G+V L+ V+++ +S++M+C+M I + T+ ++++ C+ Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKCK 185 >ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca subsp. vesca] Length = 211 Score = 159 bits (403), Expect = 9e-37 Identities = 82/217 (37%), Positives = 138/217 (63%), Gaps = 3/217 (1%) Frame = +1 Query: 130 MAEKEQQATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVIL 309 MAEK QA +P+AP NGYTRSD ES + +++KR K +YI +F + Q V+ Sbjct: 1 MAEKTNQA-YPLAPANGYTRSDGESLVSED----ELKRQKRRKLFMYIGIFIVVQIIVMT 55 Query: 310 IFVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNTTVE 489 +F LT+M+V+TPK R+ + + N P PS + ++ V+N N+G YK+ +T Sbjct: 56 VFGLTVMKVKTPKVRLGGINVQSLN-SVPATPSFDTSFTTQIRVKNTNWGPYKFDASTAT 114 Query: 490 FLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN---LGVNSELGSDLRAGIVPITSQARM 660 F+Y+G VGQV + S+A RST+K V V LN L +S +G++L +GI+ +TSQA++ Sbjct: 115 FMYQGVAVGQVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKL 174 Query: 661 RGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 G+V+L+ +M++ +S M+C++ ++T+ ++++ C+ Sbjct: 175 TGKVELMLIMKKKKSATMDCTIAFDLSTKTVKSLQCK 211 >gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 259 Score = 157 bits (398), Expect = 3e-36 Identities = 77/186 (41%), Positives = 124/186 (66%), Gaps = 5/186 (2%) Frame = +1 Query: 226 ARHQRKKKRNKCLLYIVLFAIFQAAVILIFVLTIMRVRTPKFRVRSAALTNFNVGTPENP 405 ++ ++KKR KCL Y+ F IFQ A+IL+F LT+MR++ PKFR+RS + + +P Sbjct: 13 SKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNN-SSP 71 Query: 406 SLTATMNAELAVRNANFGRYKYRNTTVEFLYRGTPVGQVIVR--GSRANWRSTRKFEVRV 579 S A++ V+N NFG YK+ N+TV F Y+G+ VG+ +V +RA RST+K V + Sbjct: 72 SFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTM 131 Query: 580 DLN---LGVNSELGSDLRAGIVPITSQARMRGRVDLIFVMRRNRSTDMNCSMEIVVATQQ 750 DLN + +S+LGSDL +G + +TSQ+ + G+V L+ V+++ +S +MNC+M + +A + Sbjct: 132 DLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKL 191 Query: 751 IRNIVC 768 +R+I C Sbjct: 192 VRDIKC 197 >gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 226 Score = 156 bits (395), Expect = 7e-36 Identities = 72/174 (41%), Positives = 118/174 (67%), Gaps = 3/174 (1%) Frame = +1 Query: 235 QRKKKRNKCLLYIVLFAIFQAAVILIFVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLT 414 +R+ KCL Y+ F +FQ A+IL+F LT+MR+R+PK R + + +F+ +PS Sbjct: 4 RREGSNAKCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFSTVNSSSPSFD 63 Query: 415 ATMNAELAVRNANFGRYKYRNTTVEFLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN-- 588 + A++AV+N NFG +KY N+TV LY G PVG+ + RA R T+KF + VD++ Sbjct: 64 MKLMAQVAVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDISSS 123 Query: 589 -LGVNSELGSDLRAGIVPITSQARMRGRVDLIFVMRRNRSTDMNCSMEIVVATQ 747 L NS LG+D+ AG++P++SQA+++G+V L+ V+++ +S +M+C+M I +AT+ Sbjct: 124 RLSSNSNLGNDINAGVLPLSSQAKLKGKVHLMKVIKKKKSGEMSCTMGINLATR 177 >emb|CBI22611.3| unnamed protein product [Vitis vinifera] Length = 297 Score = 156 bits (394), Expect = 9e-36 Identities = 84/229 (36%), Positives = 137/229 (59%), Gaps = 4/229 (1%) Frame = +1 Query: 94 FPNFSSSFREN*MAEKEQQATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYI 273 F NFS + + MA+K+QQ HP+ PT G ++D ES R+ K + + Y+ Sbjct: 78 FLNFSRAKAKK-MAQKKQQV-HPIEPTGGPAKTDVESEEL--------RRMKCTRYIAYL 127 Query: 274 VLFAIFQAAVILIFVLTIMRVRTPKFRVRSAALTNFN-VGTPENPSLTATMNAELAVRNA 450 FA+F+ VI++ V+T+MR+R+PKFR R+ ++ N N +PS NA++AV+N Sbjct: 128 SAFALFETIVIMVCVVTLMRIRSPKFRFRAVSIENLNYTSDTTSPSFNIRFNAKVAVKNT 187 Query: 451 NFGRYKYRNTTVEFLYRGTPVGQVIVRGSRANWRSTRKFEVRVDL---NLGVNSELGSDL 621 NFG +K++N+T+ YRG VG + +RA RST+K V VD+ N+ NS L SD+ Sbjct: 188 NFGHFKFKNSTITLAYRGDHVGDAKISKARARARSTKKMNVTVDVTSNNVSSNSNLASDI 247 Query: 622 RAGIVPITSQARMRGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVC 768 +G + +T Q ++ G+V L+ V ++ +S MNC+++I + + I+ C Sbjct: 248 NSGFLTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKINLENKVIQEWKC 296 >ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca subsp. vesca] Length = 200 Score = 155 bits (392), Expect = 2e-35 Identities = 84/215 (39%), Positives = 134/215 (62%), Gaps = 1/215 (0%) Frame = +1 Query: 130 MAEKEQQATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVIL 309 MA+K QQ +P+AP+NGYTRSD ES S ++KKR KC YI +F +FQ AV Sbjct: 1 MADKHQQV-YPLAPSNGYTRSDGESLSEDE-----LKRKKRIKCFAYIGIFIVFQMAVGA 54 Query: 310 IFVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNTTVE 489 +F LT+++V+TPK R+ + + + T S ++T N ++ V+N N+G YK+ V Sbjct: 55 VFGLTVLKVKTPKVRLDTTS--TLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVVT 112 Query: 490 FLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN-LGVNSELGSDLRAGIVPITSQARMRG 666 F Y+GTPVG V +A R T+K + V LN +NS +G + +TS+A++ G Sbjct: 113 FKYQGTPVGTFTVPKGKAGMRGTKKIDASVSLNTAALNS-------SGELTLTSEAKLTG 165 Query: 667 RVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 +V L+F+M++ +S MNC+++I V+ Q ++++VC+ Sbjct: 166 KVTLMFIMKKKKSASMNCTIQIDVSGQTVKSVVCK 200 >ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca subsp. vesca] Length = 211 Score = 155 bits (391), Expect = 2e-35 Identities = 80/217 (36%), Positives = 136/217 (62%), Gaps = 3/217 (1%) Frame = +1 Query: 130 MAEKEQQATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVIL 309 MAE QA +P AP+NGY RSD ES + ++KKR K YI +F +FQ V+ Sbjct: 1 MAENTHQA-YPTAPSNGYARSDGESLVSED----ELKRKKRIKLFTYIGIFIVFQIIVMT 55 Query: 310 IFVLTIMRVRTPKFRVRSAALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNTTVE 489 +F LT+M+V+TPK R S + N P PS T ++ ++N N+G YK+ T Sbjct: 56 VFGLTVMKVKTPKARWGSIDVETLNY-VPATPSFDTTFETQIRIKNTNWGPYKFDAGTAT 114 Query: 490 FLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN---LGVNSELGSDLRAGIVPITSQARM 660 FLY+G +G+V + S+A RST+K +V V LN L +S LG++L +G++ +TSQ ++ Sbjct: 115 FLYQGVTIGKVDIPKSKAGMRSTKKIDVEVSLNTNALPNSSALGTELSSGVLTLTSQVQL 174 Query: 661 RGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVCR 771 +G+V+L+ +M++N++ M+C++ ++++ ++++ C+ Sbjct: 175 KGKVELMLIMKKNKNASMDCTIAFDLSSKTVQSLQCK 211 >gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 Score = 154 bits (390), Expect = 3e-35 Identities = 83/207 (40%), Positives = 125/207 (60%), Gaps = 4/207 (1%) Frame = +1 Query: 163 MAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVILIFVLTIMRVRT 342 MA N Y + + + SAA ++KKR K Y F +FQ VIL+F LT+MR++ Sbjct: 1 MAEQN-YQQKNIDMESAAE-----LKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKN 54 Query: 343 PKFRVRSAALTNFN-VGTPENPSLTATMNAELAVRNANFGRYKYRNTTVEFLYRGTPVGQ 519 PKFRVRS + + TP PS NAE+AV+N NFG +K+ NTT+ F Y G VG+ Sbjct: 55 PKFRVRSITVEDIAYTSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGE 114 Query: 520 VIVRGSRANWRSTRKFEVRVDL---NLGVNSELGSDLRAGIVPITSQARMRGRVDLIFVM 690 V RA RST+K V VDL N+ NS L SD+ +G + +T+ ++ G+V L+ ++ Sbjct: 115 AFVAKGRAKARSTKKMNVTVDLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLI 174 Query: 691 RRNRSTDMNCSMEIVVATQQIRNIVCR 771 ++ +S MNC+M + +A++ I++I C+ Sbjct: 175 KKKKSAQMNCTMTVNLASRAIQDIKCQ 201 >gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 191 Score = 150 bits (378), Expect = 7e-34 Identities = 70/182 (38%), Positives = 120/182 (65%), Gaps = 4/182 (2%) Frame = +1 Query: 238 RKKKRNKCLLYIVLFAIFQAAVILIFVLTIMRVRTPKFRVRSAALTNFNVGTPEN-PSLT 414 R+K+ KCL YIV I Q +IL+FV+ +MR+R PK R+ + N N+ + + PS + Sbjct: 10 RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69 Query: 415 ATMNAELAVRNANFGRYKYRNTTVEFLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN-- 588 +NA++ V+N NFG +K++N+T+ YRGTPVG+ + +RA RST K V V ++ Sbjct: 70 MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSD 129 Query: 589 -LGVNSELGSDLRAGIVPITSQARMRGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIV 765 + NS L SD+ +G + ++S A++ G++ L V ++ +S +MNC+ME+ +++QI+N++ Sbjct: 130 KMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLM 189 Query: 766 CR 771 C+ Sbjct: 190 CQ 191 >gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis] Length = 212 Score = 148 bits (374), Expect = 2e-33 Identities = 83/217 (38%), Positives = 127/217 (58%), Gaps = 4/217 (1%) Frame = +1 Query: 130 MAEKEQQATHPMAPTNGYTRSDAESSSAAHGGARHQRKKKRNKCLLYIVLFAIFQAAVIL 309 MAE+ QQ +P+AP NG+ RSD ESS+ A+ +++KR K +Y +F Q V L Sbjct: 1 MAERYQQV-YPLAPANGHPRSDEESSNL---DAKELKRRKRIKLAIYAFIFTASQIIVTL 56 Query: 310 IFVLTIMRVRTPKFRVRSA-ALTNFNVGTPENPSLTATMNAELAVRNANFGRYKYRNTTV 486 +FVL +MRV++PK R+ + PS + +L V+N N+G YK+ NTT Sbjct: 57 VFVLVVMRVKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTTA 116 Query: 487 EFLYRGTPVGQVIVRGSRANWRSTRKFEVRVDLN---LGVNSELGSDLRAGIVPITSQAR 657 F Y G VGQV++ +A RST+K V V L+ L N+ LGS+L GI+ + A+ Sbjct: 117 AFAYEGETVGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTAK 176 Query: 658 MRGRVDLIFVMRRNRSTDMNCSMEIVVATQQIRNIVC 768 M G+V L+ +M++ +S +MNC++ I V + + N+ C Sbjct: 177 MTGKVKLMLIMKKKKSANMNCTINIHVKEKTV-NLKC 212