BLASTX nr result
ID: Paeonia22_contig00016007
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Paeonia22_contig00016007 (640 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r... 153 4e-35 ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r... 145 1e-32 ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r... 139 7e-31 gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus... 139 1e-30 gb|EYU40055.1| hypothetical protein MIMGU_mgv1a018374mg [Mimulus... 137 3e-30 emb|CBI22611.3| unnamed protein product [Vitis vinifera] 137 3e-30 ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r... 124 2e-26 ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294... 123 4e-26 emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera] 122 1e-25 ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r... 119 6e-25 ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r... 119 1e-24 ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r... 118 1e-24 gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis] 117 2e-24 ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296... 114 2e-23 ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296... 112 1e-22 ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-r... 112 1e-22 ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295... 111 2e-22 gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus... 111 2e-22 ref|XP_007051691.1| Late embryogenesis abundant (LEA) hydroxypro... 110 4e-22 ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306... 110 5e-22 >ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 185 Score = 153 bits (387), Expect = 4e-35 Identities = 87/192 (45%), Positives = 110/192 (57%) Frame = +1 Query: 46 MKGAHNHKSSRTKCXXXXXXXXXXXXXXXXXXXLTIMKIKSPKLRFGSVAVQNFNNGGGT 225 MKG K S KC LT+M+IK+PK+RFG+V V+NF+ G + Sbjct: 1 MKG--EGKRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSS 58 Query: 226 TNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDARKTKR 405 + VKNTNFGHFKYEN++ I YGG+ VGE I K R AR+TK+ Sbjct: 59 S-----PFFDMRLMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKK 113 Query: 406 FNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXXSAEMSCNMVVN 585 F+VTID++S KLS N NL DI SG L L+SEAKLSG S+EMSC M +N Sbjct: 114 FDVTIDISSSKLSTNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGIN 173 Query: 586 LANGSVQELKCK 621 + +VQ+LKCK Sbjct: 174 IGTRTVQDLKCK 185 >ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776113|gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 Score = 145 bits (365), Expect = 1e-32 Identities = 74/159 (46%), Positives = 99/159 (62%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 LT+M+IK+PK R S+ V++ T+ VKNTNFGHFK++NTT Sbjct: 47 LTVMRIKNPKFRVRSITVEDI----AYTSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTT 102 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504 + YGGV VGE F+ KGR AR TK+ NVT+DL S + AN NLA+DI+SGFL LT+ Sbjct: 103 ISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVDLNSNNIPANSNLASDISSGFLTLTTHT 162 Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 KLSG SA+M+C M VNLA+ ++Q++KC+ Sbjct: 163 KLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAIQDIKCQ 201 >ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 226 Score = 139 bits (350), Expect = 7e-31 Identities = 72/149 (48%), Positives = 96/149 (64%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 LT+M+I+SPK+RFG+V V++F+ ++ VKNTNFGHFKYEN+T Sbjct: 32 LTVMRIRSPKVRFGAVTVESFSTVNSSS-----PSFDMKLMAQVAVKNTNFGHFKYENST 86 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504 TI YGG+ VGE I KGR AR+TK+FN+ +D++S +LS+N NL DIN+G L L+S+A Sbjct: 87 VTILYGGMPVGEAAIFKGRARARQTKKFNINVDISSSRLSSNSNLGNDINAGVLPLSSQA 146 Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLA 591 KL G S EMSC M +NLA Sbjct: 147 KLKGKVHLMKVIKKKKSGEMSCTMGINLA 175 >gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus guttatus] Length = 192 Score = 139 bits (349), Expect = 1e-30 Identities = 74/186 (39%), Positives = 100/186 (53%), Gaps = 1/186 (0%) Frame = +1 Query: 67 KSSRTKCXXXXXXXXXXXXXXXXXXXLTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXX 246 K S KC LT+MKIKSPK+R ++AV++F++ N Sbjct: 7 KKSSKKCLAYVAVFVVFQAAVIMVLALTVMKIKSPKIRLNAIAVESFSSSNNGNNAGPTP 66 Query: 247 XXXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDL 426 +KNTNFG FKY+N T I Y GV +GE IP+GR ARKT +FNV+ DL Sbjct: 67 SINMKLLTQLTIKNTNFGQFKYDNATLAILYNGVPLGEAVIPRGRVKARKTLKFNVSFDL 126 Query: 427 TSEKLSA-NPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSV 603 S++L+ N NL DINSG L L+S+A+++G S M+C+ +VNLA V Sbjct: 127 NSDRLNGNNTNLGNDINSGVLRLSSQARVNGKVHLMKIIKKNKSGNMNCDWIVNLATRMV 186 Query: 604 QELKCK 621 + L CK Sbjct: 187 ENLNCK 192 >gb|EYU40055.1| hypothetical protein MIMGU_mgv1a018374mg [Mimulus guttatus] Length = 183 Score = 137 bits (345), Expect = 3e-30 Identities = 71/160 (44%), Positives = 97/160 (60%), Gaps = 1/160 (0%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 LT++KIKSPK+RF ++AV++F + G N +KNTNFG FKY+N T Sbjct: 25 LTVLKIKSPKIRFNAIAVESFTSNNGN-NAGPTPSINMRLLTQLTIKNTNFGQFKYDNAT 83 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSA-NPNLATDINSGFLALTSE 501 I Y GV +GE IP+GR ARKT +FNV+ DL S++L+ N NL DINSG L L+S+ Sbjct: 84 LAILYNGVPLGEAVIPRGRVKARKTLKFNVSFDLNSDRLNGNNTNLGNDINSGVLRLSSQ 143 Query: 502 AKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 A+++G S M+C+ +VNLA V+ L CK Sbjct: 144 ARVNGKVHLMKIIKKNKSGNMNCDWIVNLATRMVENLNCK 183 >emb|CBI22611.3| unnamed protein product [Vitis vinifera] Length = 297 Score = 137 bits (345), Expect = 3e-30 Identities = 71/158 (44%), Positives = 98/158 (62%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 +T+M+I+SPK RF +V+++N N TT+ VKNTNFGHFK++N+T Sbjct: 143 VTLMRIRSPKFRFRAVSIENLNYTSDTTSPSFNIRFNAKVA----VKNTNFGHFKFKNST 198 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504 T+AY G VG+ I K R AR TK+ NVT+D+TS +S+N NLA+DINSGFL LT + Sbjct: 199 ITLAYRGDHVGDAKISKARARARSTKKMNVTVDVTSNNVSSNSNLASDINSGFLTLTGQG 258 Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKC 618 KL+G S +M+C + +NL N +QE KC Sbjct: 259 KLNGKVHLMKVFKKKKSPQMNCTIKINLENKVIQEWKC 296 >ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508721844|gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 259 Score = 124 bits (312), Expect = 2e-26 Identities = 74/188 (39%), Positives = 99/188 (52%), Gaps = 4/188 (2%) Frame = +1 Query: 67 KSSRTKCXXXXXXXXXXXXXXXXXXXLTIMKIKSPKLRFGSVAVQN--FNNGGGTTNXXX 240 + R KC LT+M+IK+PK R SV V + FNN + N Sbjct: 18 RKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSFNMKF 77 Query: 241 XXXXXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDAR--KTKRFNV 414 VKNTNFGH+K+EN+T T AY G VGE + KGR AR TK+ NV Sbjct: 78 IAQVT--------VKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNV 129 Query: 415 TIDLTSEKLSANPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXXSAEMSCNMVVNLAN 594 T+DL S ++ + +L +D+NSGFL LTS++ L+G S EM+C M VNLA Sbjct: 130 TMDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQ 189 Query: 595 GSVQELKC 618 V+++KC Sbjct: 190 KLVRDIKC 197 >ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294764 [Fragaria vesca subsp. vesca] Length = 182 Score = 123 bits (309), Expect = 4e-26 Identities = 66/159 (41%), Positives = 87/159 (54%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 LT+MKIK PK+RF + V NFN+ T VKNTNFGHFKY N+T Sbjct: 29 LTVMKIKGPKVRFQTATVSNFNSDSSTA-----ASFSGDLVTKFAVKNTNFGHFKYPNST 83 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504 +I Y G +G +P + AR T+R ++TI + S KLS NL T I +G + LTSE+ Sbjct: 84 VSILYEGQVIGTAAVPSQKAKARSTRRTDITISIDSSKLSGTTNLTTAIGAGVVPLTSES 143 Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 L G S +MSC M++NL +V +LKCK Sbjct: 144 TLKGKVEVMKIIKKNKSGKMSCTMLLNLKTRTVDDLKCK 182 >emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera] Length = 186 Score = 122 bits (305), Expect = 1e-25 Identities = 66/159 (41%), Positives = 94/159 (59%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 L ++KI+ PK+R S++V+N + + + VKNTNFGHFK++N+T Sbjct: 36 LLVLKIRDPKVRIASISVENQHFSTNSFSMDLKARVT--------VKNTNFGHFKFDNST 87 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504 ATI+Y G AVGE I K R +R TKRFN+T+ ++S K++ + L D+NSG L L+S A Sbjct: 88 ATISYFGTAVGEATILKARARSRSTKRFNITVPISSSKVNNHRQLRRDLNSGVLNLSSTA 147 Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 KLSG SAEMSC M ++ S++ L CK Sbjct: 148 KLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENLSCK 186 >ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776108|gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 191 Score = 119 bits (299), Expect = 6e-25 Identities = 62/159 (38%), Positives = 92/159 (57%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 + +M+I++PK+R G V V+N N +++ VKNTNFGHFK++N+T Sbjct: 37 MLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFSMNLNAQVT----VKNTNFGHFKFQNST 92 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504 TI+Y G VGE I K R AR T + NVT+ ++S+K+S N L++D+ SG + L+S A Sbjct: 93 LTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSDKMSRNSALSSDVGSGTINLSSHA 152 Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 KL G SAEM+C M V ++ +Q L C+ Sbjct: 153 KLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLMCQ 191 >ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777614|gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 188 Score = 119 bits (297), Expect = 1e-24 Identities = 67/160 (41%), Positives = 91/160 (56%), Gaps = 1/160 (0%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFN-NGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENT 321 LT+M+IK+P R SV VQ+ N N G + VKN NFGHF+++NT Sbjct: 35 LTVMRIKTPSARLRSVTVQSLNYNASGVPHFNMRLIMEIA------VKNKNFGHFRFDNT 88 Query: 322 TATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSE 501 TA + +G V VG+ I K R ARKTKR NVT+D++S +S L T ++SG L LT Sbjct: 89 TANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSDEDELRTKLSSGTLTLTGV 148 Query: 502 AKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 A+L G +AEM+C M VNL + +VQ+L C+ Sbjct: 149 ARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDCE 188 >ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777615|gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 215 Score = 118 bits (296), Expect = 1e-24 Identities = 62/160 (38%), Positives = 90/160 (56%), Gaps = 1/160 (0%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 LT+M++K+PK+R G V V+ T+N VKNTNFGH+K++N T Sbjct: 60 LTVMRVKNPKVRIGKVTVETME----TSNTEAAASFNLRFITQVTVKNTNFGHYKFDNAT 115 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKL-SANPNLATDINSGFLALTSE 501 + Y GV VGE IPK R AR TK+ +VT+++ S L S L ++++S L L S+ Sbjct: 116 MSFLYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQ 175 Query: 502 AKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 AKL G S EM+C ++ N++ S+Q+LKCK Sbjct: 176 AKLKGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215 >gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis] Length = 213 Score = 117 bits (294), Expect = 2e-24 Identities = 63/158 (39%), Positives = 89/158 (56%) Frame = +1 Query: 148 TIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTTA 327 T+M+IK P+LR SVA+++ TN VKNTNFG FK++ ++ Sbjct: 60 TVMRIKGPELRIRSVAIEDLTISNSDTNSPSLSMKFDSEIG---VKNTNFGEFKFDESSI 116 Query: 328 TIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAK 507 T Y G VG+ + KG+ AR TK+ NVT ++ +AN NLA D+ SGFL LTS++K Sbjct: 117 TFVYKGTEVGDASVEKGKAKARSTKKMNVTAEV-----NANSNLANDVRSGFLTLTSQSK 171 Query: 508 LSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 L+G +AEM+C + +NL N VQ+ KCK Sbjct: 172 LNGKVHLMKVIKKKKTAEMNCTITINLENKVVQDFKCK 209 >ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca subsp. vesca] Length = 211 Score = 114 bits (286), Expect = 2e-23 Identities = 60/159 (37%), Positives = 92/159 (57%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 LT+MK+K+PK+R G + VQ+FN+ T + VKNTN+G +K++ +T Sbjct: 59 LTVMKVKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIR------VKNTNWGPYKFDAST 112 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504 T Y GVAVG+ +PKG+ R TK+ NV + L + L ++ NL +++NSG L L S+A Sbjct: 113 VTFMYQGVAVGQVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQA 172 Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 KLSG S+ M C + +L+ +V+ L+CK Sbjct: 173 KLSGKVELMLIMKKKKSSTMDCMIGFDLSTKTVKSLQCK 211 >ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca subsp. vesca] Length = 211 Score = 112 bits (280), Expect = 1e-22 Identities = 59/159 (37%), Positives = 92/159 (57%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 LT+MK+K+PK+R G + VQ+ N+ T + VKNTN+G +K++ +T Sbjct: 59 LTVMKVKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIR------VKNTNWGPYKFDAST 112 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504 AT Y GVAVG+ IPK + R TK+ +V++ L + L ++ + T++NSG L LTS+A Sbjct: 113 ATFMYQGVAVGQVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQA 172 Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 KL+G SA M C + +L+ +V+ L+CK Sbjct: 173 KLTGKVELMLIMKKKKSATMDCTIAFDLSTKTVKSLQCK 211 >ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508721845|gb|EOY13742.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 192 Score = 112 bits (279), Expect = 1e-22 Identities = 66/161 (40%), Positives = 94/161 (58%), Gaps = 2/161 (1%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 L +M+I++PK+R G V V+N +++ VKNTNFGHFK++N+T Sbjct: 36 LIVMRIRNPKVRLGGVTVENLRASSSSSSPSFSTKLNAQVS----VKNTNFGHFKFKNST 91 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTS-EKLSANPN-LATDINSGFLALTS 498 TI+Y G VG+ I +G AR TK+FNVTI ++S K+S N + L++DI SG + L+S Sbjct: 92 LTISYNGSPVGKATIVEGLARARSTKKFNVTILVSSNNKISRNSDQLSSDIESGTINLSS 151 Query: 499 EAKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 AKL G SAEM+C M VN + +Q+L CK Sbjct: 152 HAKLEGKIHLFKIFKKKKSAEMNCTMDVNTSLKQIQKLTCK 192 >ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca subsp. vesca] Length = 212 Score = 111 bits (278), Expect = 2e-22 Identities = 59/159 (37%), Positives = 90/159 (56%) Frame = +1 Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324 LT+MK+K+PK+R G+ VQN N + + +KNTN+G +K++ T Sbjct: 60 LTVMKVKTPKVRLGATNVQNLNFVPTSPSFDTTFATQIR------IKNTNWGPYKFDAGT 113 Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504 AT Y GVAVG+ PK + R TK+ N + L S ++ + NL ++++SG L LTSEA Sbjct: 114 ATFMYQGVAVGQVSFPKSKAGMRSTKKINAEVSLNSNEIPSTSNLGSELSSGVLTLTSEA 173 Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 KL+G SA M+C M ++L+ ++Q L+CK Sbjct: 174 KLTGKVELMLIMKKKKSATMNCTMKLDLSTKTIQALECK 212 >gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus] Length = 214 Score = 111 bits (277), Expect = 2e-22 Identities = 61/186 (32%), Positives = 89/186 (47%) Frame = +1 Query: 64 HKSSRTKCXXXXXXXXXXXXXXXXXXXLTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXX 243 HK RT+C LT+MKI++P+ R S + NFN G + Sbjct: 34 HKRKRTQCLIYIGLLAIIQIAVVIVFSLTVMKIRNPRFRIRSAHLTNFNAGTPAS----- 88 Query: 244 XXXXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDARKTKRFNVTID 423 VKN NFG +KY +TT Y G VGE F+ + R R TK+FNV +D Sbjct: 89 PAFTGKLNAEFSVKNANFGRYKYMDTTVDFVYRGTRVGEVFVRESRAGWRTTKKFNVAVD 148 Query: 424 LTSEKLSANPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSV 603 L+ ANP LA+D+N+G + ++SEA++SG S ++C M + A + Sbjct: 149 LSLANARANPQLASDLNAGVVPISSEARMSGSVELLFVLKKNRSTGLNCTMEIVTATQQI 208 Query: 604 QELKCK 621 + + CK Sbjct: 209 RNILCK 214 >ref|XP_007051691.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 [Theobroma cacao] gi|590721708|ref|XP_007051692.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 [Theobroma cacao] gi|508703952|gb|EOX95848.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 [Theobroma cacao] gi|508703953|gb|EOX95849.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 [Theobroma cacao] Length = 220 Score = 110 bits (275), Expect = 4e-22 Identities = 56/158 (35%), Positives = 88/158 (55%) Frame = +1 Query: 148 TIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTTA 327 T+ ++K P ++ VAV + GTT VKN N FKY+NTT Sbjct: 61 TVFRVKDPVIKMNGVAVTHLELINGTT---PKPGSNISLIADVSVKNPNVASFKYKNTTT 117 Query: 328 TIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAK 507 T+ Y G VGE P GR AR+T R N+++D+ +++L A+PNL D+NSG L ++S ++ Sbjct: 118 TLYYYGTIVGEARGPAGRAKARRTMRMNISVDIITDRLLASPNLVADVNSGTLTMSSYSR 177 Query: 508 LSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 + G + +M+C+M VN+++ ++QE KCK Sbjct: 178 IGGRVNMLNIIKKHVTVKMNCSMTVNISSQAIQEQKCK 215 >ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306460 [Fragaria vesca subsp. vesca] Length = 219 Score = 110 bits (274), Expect = 5e-22 Identities = 57/158 (36%), Positives = 82/158 (51%) Frame = +1 Query: 148 TIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTTA 327 T+ ++K PK+ V V GTT VKN N FKY NTT Sbjct: 60 TVFRVKEPKIMMNKVTVTKLELVNGTT---PKPGTNISLTADVSVKNPNVASFKYSNTTT 116 Query: 328 TIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAK 507 T+ Y G VGE P GR AR+T R N+T+D+ ++ L+ NPNL TD+ SG L ++S ++ Sbjct: 117 TLYYHGTVVGEARGPPGRAKARRTMRMNITVDIITDILTTNPNLKTDVGSGLLTMSSYSR 176 Query: 508 LSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621 + G +M+C M VN+++ ++QE KCK Sbjct: 177 IPGRVNMLNIVKKHVVVKMNCTMTVNISSQAIQEQKCK 214