BLASTX nr result
ID: Catharanthus22_contig00018813
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00018813
         (703 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich g...   162   1e-37
ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578...   159   6e-37
gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich g...   153   4e-35
gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao]   147   3e-33
gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich g...   146   7e-33
emb|CBI22611.3| unnamed protein product [Vitis vinifera]              138   1e-30
gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich g...   134   2e-29
gb|EMJ20358.1| hypothetical protein PRUPE_ppa016330mg [Prunus pe...   132   8e-29
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   132   1e-28
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   132   1e-28
gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich g...   130   5e-28
gb|EMJ11911.1| hypothetical protein PRUPE_ppa022983mg [Prunus pe...   130   5e-28
gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich g...   129   1e-27
gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]     128   2e-27
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   128   2e-27
ref|XP_002513626.1| conserved hypothetical protein [Ricinus comm...   127   3e-27
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   127   5e-27
gb|EOY13742.1| Late embryogenesis abundant hydroxyproline-rich g...   125   2e-26
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296... 
124 3e-26 emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera] 124 3e-26 >gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 Score = 162 bits (409), Expect = 1e-37 Identities = 87/185 (47%), Positives = 122/185 (65%), Gaps = 3/185 (1%) Frame = -1 Query: 547 RESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATLEI 368 +++ MES+++ L+ K MK + Y AF+VFQT +IL+F+LT+M+I+ PKFR+RS T+E Sbjct: 8 QKNIDMESAAE-LKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVED 66 Query: 367 LALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIVGESKAGIRA 188 +A TP PSFN + NAE+ VKN NFG +KF N+ I F Y G VGEA V + +A R+ Sbjct: 67 IAYTSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARS 126 Query: 187 TKYISVVVDLLSSR---NSQLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMDCTL 17 TK ++V VDL S+ NS LA D++SG L L +KLSGKV K M+CT+ Sbjct: 127 TKKMNVTVDLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTM 186 Query: 16 TIGVA 2 T+ +A Sbjct: 187 TVNLA 191 >ref|XP_006362675.1| PREDICTED: uncharacterized protein LOC102578608 [Solanum tuberosum] Length = 204 Score = 159 bits (403), Expect = 6e-37 Identities = 89/192 (46%), Positives = 119/192 (61%) Frame = -1 Query: 577 QSYRHGGGDDRESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPK 398 Q +G E T + S +LR K K +Y F+VFQ A++L F+L IMKIRTPK Sbjct: 8 QLQTNGHAKPAEETPNSTQSNELRRKKRNKILVYVALFIVFQIAVLLFFSLYIMKIRTPK 67 Query: 397 FRIRSATLEILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAI 218 F +RSAT +++ EN SFN MNAE+ VKN NFG Y ++NS IYFYY +GEA Sbjct: 68 FSVRSATFDLMVT----ENASFNITMNAELSVKNANFGPYNYKNSTIYFYYNDVSIGEAF 123 Query: 217 VGESKAGIRATKYISVVVDLLSSRNSQLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKF 38 V + KAG +++K +V+V+ LSS+ S+L DLNSG L L +SKL GKV+ K Sbjct: 124 VYQGKAGFKSSKKFNVIVN-LSSKESKLRNDLNSGTLILTSKSKLEGKVKLIFFMKKKKS 182 Query: 37 TCMDCTLTIGVA 2 T M+C + IG+A Sbjct: 183 TEMNCAIIIGLA 194 >gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 259 Score = 153 
bits (387), Expect = 4e-35 Identities = 85/178 (47%), Positives = 118/178 (66%), Gaps = 5/178 (2%) Frame = -1 Query: 520 SQDLRNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATLEILALRRTPEN 341 S++L+ K MKC Y AF++FQTAIIL+FALT+M+I+ PKFRIRS ++ L + + Sbjct: 13 SKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNS--S 70 Query: 340 PSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIV--GESKAGIRATKYISVV 167 PSFN + A++ VKN NFG YKF+NS + F YKGS VGEA+V G ++A R+TK ++V Sbjct: 71 PSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVT 130 Query: 166 VDLLS---SRNSQLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMDCTLTIGVA 2 +DL S + +S L DLNSG L L +S L+GKV K M+CT+T+ +A Sbjct: 131 MDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLA 188 >gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao] Length = 214 Score = 147 bits (371), Expect = 3e-33 Identities = 80/204 (39%), Positives = 122/204 (59%), Gaps = 3/204 (1%) Frame = -1 Query: 607 SKDEQVYPLPQSYRHGGGDDRESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFA 428 +K + YPL + D ES + S++L+ K MKC LY + F VFQT IILLFA Sbjct: 3 AKSQSPYPLVPAANGHERSDEESVA--AHSKELKKKKRMKCLLYIVLFAVFQTGIILLFA 60 Query: 427 LTIMKIRTPKFRIRSATLEILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFY 248 LT+M+IR PKFR+RS + + T +PSF+ +MN + VKN NFG +K++ + F Sbjct: 61 LTVMRIRNPKFRVRSGSFTTFNV-GTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFA 119 Query: 247 YKGSLVGEAIVGESKAGIRATKYISVVVDLLSS---RNSQLAGDLNSGILKLKIESKLSG 77 Y+G+ VG A + +++A R+TK + VVV+L S+ ++L D+++G+L L SKL G Sbjct: 120 YRGTPVGRATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDG 179 Query: 76 KVEXXXXXXXXKFTCMDCTLTIGV 5 K+ K T M+CT+ + + Sbjct: 180 KIHLMKVIKKKKSTQMNCTMDVAI 203 >gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 215 Score = 146 bits (368), Expect = 7e-33 Identities = 82/205 (40%), Positives = 124/205 (60%), Gaps = 4/205 (1%) Frame = -1 Query: 604 KDEQVYPLPQSYRHGGGDDRESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFAL 425 KD+QV+PL + H D+ ES +++S ++L+ K +K +Y AF VFQT +IL+FAL Sbjct: 4 
KDQQVHPLAPANGHPRSDE-ESASLQS--KELKRKKRIKYAVYIAAFAVFQTVVILIFAL 60 Query: 424 TIMKIRTPKFRIRSATLEILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYY 245 T+M+++ PK RI T+E + T SFN R ++ VKN NFG YKF N+ + F Y Sbjct: 61 TVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLY 120 Query: 244 KGSLVGEAIVGESKAGIRATKYISVVVDL----LSSRNSQLAGDLNSGILKLKIESKLSG 77 G +VGEAI+ +++A R+TK + V V++ L+S + L +L+S +L L ++KL G Sbjct: 121 DGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKG 180 Query: 76 KVEXXXXXXXXKFTCMDCTLTIGVA 2 KVE K M+CTL V+ Sbjct: 181 KVELMKVMKKKKSPEMNCTLIFNVS 205 >emb|CBI22611.3| unnamed protein product [Vitis vinifera] Length = 297 Score = 138 bits (348), Expect = 1e-30 Identities = 79/203 (38%), Positives = 120/203 (59%), Gaps = 3/203 (1%) Frame = -1 Query: 604 KDEQVYPLPQSYRHGGGDDRESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFAL 425 K +QV+P+ + GG + S++LR K + Y AF +F+T +I++ + Sbjct: 92 KKQQVHPIEPT----GGPAKTDV----ESEELRRMKCTRYIAYLSAFALFETIVIMVCVV 143 Query: 424 TIMKIRTPKFRIRSATLEILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYY 245 T+M+IR+PKFR R+ ++E L +PSFN R NA++ VKN NFG +KF+NS I Y Sbjct: 144 TLMRIRSPKFRFRAVSIENLNYTSDTTSPSFNIRFNAKVAVKNTNFGHFKFKNSTITLAY 203 Query: 244 KGSLVGEAIVGESKAGIRATKYISVVVDLLS---SRNSQLAGDLNSGILKLKIESKLSGK 74 +G VG+A + +++A R+TK ++V VD+ S S NS LA D+NSG L L + KL+GK Sbjct: 204 RGDHVGDAKISKARARARSTKKMNVTVDVTSNNVSSNSNLASDINSGFLTLTGQGKLNGK 263 Query: 73 VEXXXXXXXXKFTCMDCTLTIGV 5 V K M+CT+ I + Sbjct: 264 VHLMKVFKKKKSPQMNCTIKINL 286 >gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 226 Score = 134 bits (338), Expect = 2e-29 Identities = 74/172 (43%), Positives = 106/172 (61%), Gaps = 3/172 (1%) Frame = -1 Query: 508 RNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATLEILALRRTPENPSFN 329 R + KC Y AF+VFQTAIILLFALT+M+IR+PK R + T+E + + +PSF+ Sbjct: 5 REGSNAKCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFSTVNS-SSPSFD 63 Query: 328 FRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIVGESKAGIRATKYISVVVDLLSS 149 ++ A++ VKN NFG +K++NS + Y G VGEA 
+ + +A R TK ++ VD+ SS Sbjct: 64 MKLMAQVAVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDISSS 123 Query: 148 R---NSQLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMDCTLTIGVA 2 R NS L D+N+G+L L ++KL GKV K M CT+ I +A Sbjct: 124 RLSSNSNLGNDINAGVLPLSSQAKLKGKVHLMKVIKKKKSGEMSCTMGINLA 175 >gb|EMJ20358.1| hypothetical protein PRUPE_ppa016330mg [Prunus persica] Length = 189 Score = 132 bits (333), Expect = 8e-29 Identities = 72/177 (40%), Positives = 109/177 (61%), Gaps = 3/177 (1%) Frame = -1 Query: 532 MESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATLEILALRR 353 M ++D R ++ +C+LY A ++ QT II+LF + +M+I+TPK R+ S ++ L Sbjct: 1 MTRGNEDSRRKRN-RCFLYIAAGIILQTIIIVLFVVFVMRIKTPKVRLDSVAVDSLTANS 59 Query: 352 TPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIVGESKAGIRATKYIS 173 +P +PSF ++NA + VKN NFG YKF+ SK+ F YKG+ VGE + ++KA + TK I+ Sbjct: 60 SPSSPSFKVQINALVTVKNKNFGHYKFEGSKVTFSYKGTAVGEGTIAKAKAKAKRTKKIN 119 Query: 172 VVVDLLS---SRNSQLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMDCTLTI 11 V V L S S +SQL+ DL+SG L L +KL GKV K ++CT+ + Sbjct: 120 VTVSLNSNKVSSHSQLSSDLSSGNLTLTAYAKLDGKVHLFKVIKKKKSANLNCTVHV 176 >ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca subsp. 
vesca] Length = 222 Score = 132 bits (332), Expect = 1e-28 Identities = 72/201 (35%), Positives = 118/201 (58%), Gaps = 5/201 (2%) Frame = -1 Query: 589 YPL-PQSYRHGGGDDRESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFALTIMK 413 YPL P + + D + + S+++LR+ K M+C LY F VFQ +I +FALT+MK Sbjct: 13 YPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVVVITVFALTVMK 72 Query: 412 IRTPKFRIRSATLEILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSL 233 I++PKFR+R+A++ + + NPSFN M+ GVKN NFG +++++ + F Y+ Sbjct: 73 IKSPKFRVRTASITGFEV-GSASNPSFNLEMDVHFGVKNTNFGHFEYEDGIVVFTYRDVR 131 Query: 232 VGEAIVGESKAGIRATKYISVVVDLLSSR----NSQLAGDLNSGILKLKIESKLSGKVEX 65 +G+ V E + R+T+ + V L+SR NS+L D+++GI+ + I SKL GK+ Sbjct: 132 IGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITISSKLDGKIHL 191 Query: 64 XXXXXXXKFTCMDCTLTIGVA 2 K M+CT+ + +A Sbjct: 192 MKIIKKKKSAQMNCTMEVVLA 212 >ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca subsp. vesca] Length = 211 Score = 132 bits (331), Expect = 1e-28 Identities = 83/206 (40%), Positives = 117/206 (56%), Gaps = 5/206 (2%) Frame = -1 Query: 604 KDEQVYPLPQSYRHGGGDDRESTTMESSSQD-LRNNKSMKCYLYFIAFLVFQTAIILLFA 428 + Q YPL S + D ES S+D L+ K +KC+ Y F+VFQ A++ +F Sbjct: 4 RTHQSYPLAPSNGYTRSDG------ESLSEDELKRKKRIKCFAYIGIFIVFQIAVMTVFG 57 Query: 427 LTIMKIRTPKFRIRSATLEILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFY 248 LTIMK++TPK R+ ++TL T PSF+ N +I VKN N+G YKF + F Sbjct: 58 LTIMKVKTPKVRLGTSTLTDFTSSDTA--PSFDTTFNTQIRVKNTNWGPYKFDQGVVTFM 115 Query: 247 YKGSLVGEAIVGESKAGIRATKYISVVVDL----LSSRNSQLAGDLNSGILKLKIESKLS 80 Y+G VG +V + KAG+R TK I+V V L L S +S L+ +L+ G+L L E+KL+ Sbjct: 116 YQGMPVGTVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLT 175 Query: 79 GKVEXXXXXXXXKFTCMDCTLTIGVA 2 GKVE K M+CT+ I V+ Sbjct: 176 GKVELMLIMKKKKSASMNCTIQIDVS 201 >gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 191 Score = 130 bits (326), Expect = 5e-28 Identities = 67/169 (39%), Positives = 104/169 (61%), Gaps = 3/169 (1%) 
Frame = -1 Query: 508 RNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATLEILALRRTPENPSFN 329 R +++KC Y +A ++ QT IILLF + +M+IR PK R+ T+E L L + +PSF+ Sbjct: 10 RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69 Query: 328 FRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIVGESKAGIRATKYISVVVDLLS- 152 +NA++ VKN NFG +KFQNS + Y+G+ VGEA + +++A R+T ++V V + S Sbjct: 70 MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSD 129 Query: 151 --SRNSQLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMDCTLTI 11 SRNS L+ D+ SG + L +KL GK+ K M+CT+ + Sbjct: 130 KMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEV 178 >gb|EMJ11911.1| hypothetical protein PRUPE_ppa022983mg [Prunus persica] Length = 209 Score = 130 bits (326), Expect = 5e-28 Identities = 73/186 (39%), Positives = 107/186 (57%), Gaps = 3/186 (1%) Frame = -1 Query: 550 DRESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATLE 371 D EST ++S ++L+ K +K Y Y + F+V Q ++ +F LT+MK++TPKFR+ ++ Sbjct: 18 DEESTALQS--EELKRQKRIKMYKYIVIFIVVQLIVLPVFGLTVMKVKTPKFRL--GNIK 73 Query: 370 ILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIVGESKAGIR 191 + L P PSF +I VKN N+G YKF + F YKG VG+ +V +SKA +R Sbjct: 74 VQNLSSVPSTPSFEASFATQIRVKNTNWGPYKFDAGTVTFMYKGVTVGQVVVPKSKAKMR 133 Query: 190 ATKYISVVVDLLS---SRNSQLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMDCT 20 +TK I V V L S +S L +L SG+L L + KL+GKV K MDCT Sbjct: 134 STKKIDVTVSLNSYGLPSSSNLGTELKSGVLTLSSKGKLTGKVVLMLMMKKRKSATMDCT 193 Query: 19 LTIGVA 2 +T ++ Sbjct: 194 MTFDLS 199 >gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 185 Score = 129 bits (323), Expect = 1e-27 Identities = 71/165 (43%), Positives = 101/165 (61%), Gaps = 3/165 (1%) Frame = -1 Query: 490 KCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATLEILALRRTPENPSFNFRMNAE 311 KC Y F+VFQTAIIL+FALT+M+I+ PK R + T+E + + +P F+ R+ A+ Sbjct: 11 KCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNS-SSPFFDMRLMAQ 69 Query: 310 IGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIVGESKAGIRATKYISVVVDLLSSR---NS 140 + VKN NFG +K++NS I Y G VGEA + +++A R TK V 
+D+ SS+ NS Sbjct: 70 VTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKLSTNS 129 Query: 139 QLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMDCTLTIGV 5 L D+ SG+L L E+KLSGKV K + M CT+ I + Sbjct: 130 NLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINI 174 >gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis] Length = 213 Score = 128 bits (321), Expect = 2e-27 Identities = 73/184 (39%), Positives = 112/184 (60%), Gaps = 1/184 (0%) Frame = -1 Query: 553 DDRESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATL 374 + S ES++++L++ K M+ A +V T +IL+F T+M+I+ P+ RIRS + Sbjct: 17 ESASSADHESNAKELQHKKRMRRLGGVTAIVVLLTVVILVFPQTVMRIKGPELRIRSVAI 76 Query: 373 EILALRRTPEN-PSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIVGESKAG 197 E L + + N PS + + ++EIGVKN NFG++KF S I F YKG+ VG+A V + KA Sbjct: 77 EDLTISNSDTNSPSLSMKFDSEIGVKNTNFGEFKFDESSITFVYKGTEVGDASVEKGKAK 136 Query: 196 IRATKYISVVVDLLSSRNSQLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMDCTL 17 R+TK ++V ++ + NS LA D+ SG L L +SKL+GKV K M+CT+ Sbjct: 137 ARSTKKMNVTAEV--NANSNLANDVRSGFLTLTSQSKLNGKVHLMKVIKKKKTAEMNCTI 194 Query: 16 TIGV 5 TI + Sbjct: 195 TINL 198 >ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca subsp. 
vesca] Length = 211 Score = 128 bits (321), Expect = 2e-27 Identities = 79/205 (38%), Positives = 116/205 (56%), Gaps = 4/205 (1%) Frame = -1 Query: 604 KDEQVYPLPQSYRHGGGDDRESTTMESSSQD-LRNNKSMKCYLYFIAFLVFQTAIILLFA 428 K Q YPL + D ES S+D L+ K +KC+ Y F+VFQ AI +F Sbjct: 7 KTHQTYPLASENGYTRSDG------ESLSEDELKRKKRIKCFAYIGIFIVFQMAIGAVFG 60 Query: 427 LTIMKIRTPKFRIRSATLEILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFY 248 LT++K++TPK R+ ++TL + T SF+ N +I VKN N+G YKF + F Sbjct: 61 LTVLKVKTPKVRLGTSTLSDV----TSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVTFM 116 Query: 247 YKGSLVGEAIVGESKAGIRATKYISVVVDLLSS---RNSQLAGDLNSGILKLKIESKLSG 77 Y+G+ VG +V + KAG+R TK I+V V L ++ +S L+ +L+ G+L L E+KL+G Sbjct: 117 YQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKLTG 176 Query: 76 KVEXXXXXXXXKFTCMDCTLTIGVA 2 KVE K M+CT+ I V+ Sbjct: 177 KVELMLIMKKKKSASMNCTIQIDVS 201 >ref|XP_002513626.1| conserved hypothetical protein [Ricinus communis] gi|223547534|gb|EEF49029.1| conserved hypothetical protein [Ricinus communis] Length = 217 Score = 127 bits (320), Expect = 3e-27 Identities = 65/187 (34%), Positives = 108/187 (57%), Gaps = 4/187 (2%) Frame = -1 Query: 553 DDRESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATL 374 D+ T + +++LR K MKC + +AF +FQT IILLF T+++ + PKFR+RSA+ Sbjct: 20 DEESGTAGTAQTKELRKKKRMKCIAFVVAFTIFQTGIILLFVFTVLRFKDPKFRVRSASF 79 Query: 373 -EILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIVGESKAG 197 + + PSFN MN + GVKN NFG +K++ S + F Y+G++VG V +++A Sbjct: 80 DDTFHVGTDAAAPSFNLTMNTQFGVKNTNFGHFKYETSTVTFEYRGTVVGLVNVDKARAR 139 Query: 196 IRATKYISVVVDLLSSR---NSQLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMD 26 R+T+ +V L + R +L+ D++SG + L S+L G++ K M+ Sbjct: 140 ARSTRKFDAIVVLRTDRLPDGFELSSDISSGKIPLSSSSRLDGEIHLMKVIKKKKSAEMN 199 Query: 25 CTLTIGV 5 CT+ + + Sbjct: 200 CTMNVDI 206 >ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca subsp. 
vesca] Length = 213 Score = 127 bits (318), Expect = 5e-27 Identities = 75/201 (37%), Positives = 116/201 (57%), Gaps = 3/201 (1%) Frame = -1 Query: 604 KDEQVYPLPQSYRHGGG---DDRESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILL 434 ++++ YP Y +G D ES+ S +LR K +KC +Y F VFQ +I + Sbjct: 4 RNQEAYPFAP-YANGQAMARSDAESSRAHSD-HELRKKKRIKCLIYIAVFAVFQIIVITV 61 Query: 433 FALTIMKIRTPKFRIRSATLEILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIY 254 FALT+MKI++PKFRI+S T++ L + NPS + AE+ VKN NFG YK+ + I Sbjct: 62 FALTVMKIKSPKFRIKSITVQDLTTSNS-ANPSLSMSFVAEVSVKNPNFGRYKYDQTSIS 120 Query: 253 FYYKGSLVGEAIVGESKAGIRATKYISVVVDLLSSRNSQLAGDLNSGILKLKIESKLSGK 74 F Y+G+ VG+A+V ++ A +AT+ +V + + NS LA D+++G + L SK++GK Sbjct: 121 FIYEGTQVGDAVVPKATARTKATRK-EIVSGAVKTVNSNLASDISAGSVTLSTYSKINGK 179 Query: 73 VEXXXXXXXXKFTCMDCTLTI 11 V K M CT+ + Sbjct: 180 VYLMNMIKKKKSAEMKCTMVV 200 >gb|EOY13742.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 192 Score = 125 bits (313), Expect = 2e-26 Identities = 69/182 (37%), Positives = 107/182 (58%), Gaps = 5/182 (2%) Frame = -1 Query: 532 MESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATLEILALRR 353 M+S Q R +++KC+ +A ++ +T IILLF L +M+IR PK R+ T+E L Sbjct: 1 MKSGDQTSRGKRNIKCWAIVVAGVIAKTIIILLFVLIVMRIRNPKVRLGGVTVENLRASS 60 Query: 352 TPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIVGESKAGIRATKYIS 173 + +PSF+ ++NA++ VKN NFG +KF+NS + Y GS VG+A + E A R+TK + Sbjct: 61 SSSSPSFSTKLNAQVSVKNTNFGHFKFKNSTLTISYNGSPVGKATIVEGLARARSTKKFN 120 Query: 172 VVVDLLS----SRNS-QLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMDCTLTIG 8 V + + S SRNS QL+ D+ SG + L +KL GK+ K M+CT+ + Sbjct: 121 VTILVSSNNKISRNSDQLSSDIESGTINLSSHAKLEGKIHLFKIFKKKKSAEMNCTMDVN 180 Query: 7 VA 2 + Sbjct: 181 TS 182 >ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca subsp. 
vesca] Length = 211 Score = 124 bits (311), Expect = 3e-26 Identities = 72/199 (36%), Positives = 108/199 (54%), Gaps = 3/199 (1%) Frame = -1 Query: 604 KDEQVYPLPQSYRHGGGDDRESTTMESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFAL 425 K Q YPL + G R S +L+ K ++ + Y F+VFQ ++ +F L Sbjct: 4 KTHQAYPLAPA----NGYTRSDGESLVSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGL 59 Query: 424 TIMKIRTPKFRIRSATLEILALRRTPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYY 245 T+MK++TPK R+ + + P PSF+ +I VKN N+G YKF S + F Y Sbjct: 60 TVMKVKTPKVRL--GEINVQDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMY 117 Query: 244 KGSLVGEAIVGESKAGIRATKYISVVVDLLSS---RNSQLAGDLNSGILKLKIESKLSGK 74 +G VG+ V + KAG+R+TK ++V V L ++ +S L +LNSG+L L ++KLSGK Sbjct: 118 QGVAVGQVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGK 177 Query: 73 VEXXXXXXXXKFTCMDCTL 17 VE K + MDC + Sbjct: 178 VELMLIMKKKKSSTMDCMI 196 >emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera] Length = 186 Score = 124 bits (311), Expect = 3e-26 Identities = 73/177 (41%), Positives = 102/177 (57%), Gaps = 3/177 (1%) Frame = -1 Query: 532 MESSSQDLRNNKSMKCYLYFIAFLVFQTAIILLFALTIMKIRTPKFRIRSATLEILALRR 353 M S + +R KS+KC Y AF+VFQT IILLF L ++KIR PK RI S ++E + Sbjct: 1 MTSDTGSVRRKKSLKCLAYVAAFVVFQTGIILLFVLLVLKIRDPKVRIASISVE----NQ 56 Query: 352 TPENPSFNFRMNAEIGVKNNNFGDYKFQNSKIYFYYKGSLVGEAIVGESKAGIRATKYIS 173 SF+ + A + VKN NFG +KF NS Y G+ VGEA + +++A R+TK + Sbjct: 57 HFSTNSFSMDLKARVTVKNTNFGHFKFDNSTATISYFGTAVGEATILKARARSRSTKRFN 116 Query: 172 VVVDLLSSR---NSQLAGDLNSGILKLKIESKLSGKVEXXXXXXXXKFTCMDCTLTI 11 + V + SS+ + QL DLNSG+L L +KLSGK+ K M CT+ + Sbjct: 117 ITVPISSSKVNNHRQLRRDLNSGVLNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMEL 173