BLASTX nr result

ID: Paeonia24_contig00018389 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00018389
         (724 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   152   8e-35
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   145   1e-32
ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r...   138   2e-30
gb|EYU40055.1| hypothetical protein MIMGU_mgv1a018374mg [Mimulus...   137   5e-30
emb|CBI22611.3| unnamed protein product [Vitis vinifera]              136   6e-30
gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus...   136   8e-30
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   124   4e-26
ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294...   122   9e-26
gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]     121   3e-25
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]   121   3e-25
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   119   1e-24
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   118   2e-24
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   115   2e-23
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   114   4e-23
ref|XP_007051691.1| Late embryogenesis abundant (LEA) hydroxypro...   113   6e-23
ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306...   113   7e-23
gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus...   112   1e-22
ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295...   112   2e-22
ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-r...   111   2e-22
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   111   2e-22

>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  152 bits (385), Expect = 8e-35
 Identities = 89/189 (47%), Positives = 112/189 (59%)
 Frame = -3

Query: 680 MKGAHNHKSSRTKCXXXXXXXXXXXXXXXXXXALTIMKIKSPKLRFGSVAVQNFNNGGGT 501
           MKG    K S  KC                  ALT+M+IK+PK+RFG+V V+NF+ G  +
Sbjct: 1   MKG--EGKRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSS 58

Query: 500 TNXXXXXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDARKTKRFNV 321
           +              VKNTNFGHFKYEN++  I YGG+ VGE  I K R  AR+TK+F+V
Sbjct: 59  S--PFFDMRLMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDV 116

Query: 320 TIDLTSEKLSANPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXKSAEMSCNMVVNLAN 141
           TID++S KLS N NL  DI SG L L+SEAKLSG           KS+EMSC M +N+  
Sbjct: 117 TIDISSSKLSTNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGT 176

Query: 140 GSVQELKCK 114
            +VQ+LKCK
Sbjct: 177 RTVQDLKCK 185


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  145 bits (366), Expect = 1e-32
 Identities = 76/156 (48%), Positives = 100/156 (64%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           LT+M+IK+PK R  S+ V++      T N             VKNTNFGHFK++NTT + 
Sbjct: 47  LTVMRIKNPKFRVRSITVEDIAYTS-TPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISF 105

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLS 222
            YGGV VGE F+ KGR  AR TK+ NVT+DL S  + AN NLA+DI+SGFL LT+  KLS
Sbjct: 106 DYGGVQVGEAFVAKGRAKARSTKKMNVTVDLNSNNIPANSNLASDISSGFLTLTTHTKLS 165

Query: 221 GXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
           G           KSA+M+C M VNLA+ ++Q++KC+
Sbjct: 166 GKVHLMKLIKKKKSAQMNCTMTVNLASRAIQDIKCQ 201


>ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 226

 Score =  138 bits (348), Expect = 2e-30
 Identities = 73/146 (50%), Positives = 97/146 (66%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           LT+M+I+SPK+RFG+V V++F+    ++              VKNTNFGHFKYEN+T TI
Sbjct: 32  LTVMRIRSPKVRFGAVTVESFSTVNSSS--PSFDMKLMAQVAVKNTNFGHFKYENSTVTI 89

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLS 222
            YGG+ VGE  I KGR  AR+TK+FN+ +D++S +LS+N NL  DIN+G L L+S+AKL 
Sbjct: 90  LYGGMPVGEAAIFKGRARARQTKKFNINVDISSSRLSSNSNLGNDINAGVLPLSSQAKLK 149

Query: 221 GXXXXXXXXXXXKSAEMSCNMVVNLA 144
           G           KS EMSC M +NLA
Sbjct: 150 GKVHLMKVIKKKKSGEMSCTMGINLA 175


>gb|EYU40055.1| hypothetical protein MIMGU_mgv1a018374mg [Mimulus guttatus]
          Length = 183

 Score =  137 bits (344), Expect = 5e-30
 Identities = 73/159 (45%), Positives = 98/159 (61%), Gaps = 3/159 (1%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNF--NNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTA 408
           LT++KIKSPK+RF ++AV++F  NNG                  +KNTNFG FKY+N T 
Sbjct: 25  LTVLKIKSPKIRFNAIAVESFTSNNGNNAGPTPSINMRLLTQLTIKNTNFGQFKYDNATL 84

Query: 407 TIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSA-NPNLATDINSGFLALTSEA 231
            I Y GV +GE  IP+GR  ARKT +FNV+ DL S++L+  N NL  DINSG L L+S+A
Sbjct: 85  AILYNGVPLGEAVIPRGRVKARKTLKFNVSFDLNSDRLNGNNTNLGNDINSGVLRLSSQA 144

Query: 230 KLSGXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
           +++G           KS  M+C+ +VNLA   V+ L CK
Sbjct: 145 RVNGKVHLMKIIKKNKSGNMNCDWIVNLATRMVENLNCK 183


>emb|CBI22611.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  136 bits (343), Expect = 6e-30
 Identities = 71/155 (45%), Positives = 98/155 (63%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           +T+M+I+SPK RF +V+++N N    TT+              KNTNFGHFK++N+T T+
Sbjct: 143 VTLMRIRSPKFRFRAVSIENLNYTSDTTSPSFNIRFNAKVAV-KNTNFGHFKFKNSTITL 201

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLS 222
           AY G  VG+  I K R  AR TK+ NVT+D+TS  +S+N NLA+DINSGFL LT + KL+
Sbjct: 202 AYRGDHVGDAKISKARARARSTKKMNVTVDVTSNNVSSNSNLASDINSGFLTLTGQGKLN 261

Query: 221 GXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKC 117
           G           KS +M+C + +NL N  +QE KC
Sbjct: 262 GKVHLMKVFKKKKSPQMNCTIKINLENKVIQEWKC 296


>gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus guttatus]
          Length = 192

 Score =  136 bits (342), Expect = 8e-30
 Identities = 78/186 (41%), Positives = 102/186 (54%), Gaps = 4/186 (2%)
 Frame = -3

Query: 659 KSSRTKCXXXXXXXXXXXXXXXXXXALTIMKIKSPKLRFGSVAVQNF---NNGGGTTNXX 489
           K S  KC                  ALT+MKIKSPK+R  ++AV++F   NNG       
Sbjct: 7   KKSSKKCLAYVAVFVVFQAAVIMVLALTVMKIKSPKIRLNAIAVESFSSSNNGNNAGPTP 66

Query: 488 XXXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDL 309
                      +KNTNFG FKY+N T  I Y GV +GE  IP+GR  ARKT +FNV+ DL
Sbjct: 67  SINMKLLTQLTIKNTNFGQFKYDNATLAILYNGVPLGEAVIPRGRVKARKTLKFNVSFDL 126

Query: 308 TSEKLSA-NPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXKSAEMSCNMVVNLANGSV 132
            S++L+  N NL  DINSG L L+S+A+++G           KS  M+C+ +VNLA   V
Sbjct: 127 NSDRLNGNNTNLGNDINSGVLRLSSQARVNGKVHLMKIIKKNKSGNMNCDWIVNLATRMV 186

Query: 131 QELKCK 114
           + L CK
Sbjct: 187 ENLNCK 192


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  124 bits (310), Expect = 4e-26
 Identities = 75/185 (40%), Positives = 100/185 (54%), Gaps = 4/185 (2%)
 Frame = -3

Query: 659 KSSRTKCXXXXXXXXXXXXXXXXXXALTIMKIKSPKLRFGSVAVQN--FNNGGGTTNXXX 486
           +  R KC                  ALT+M+IK+PK R  SV V +  FNN   + N   
Sbjct: 18  RKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSFNMKF 77

Query: 485 XXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDAR--KTKRFNVTID 312
                      KNTNFGH+K+EN+T T AY G  VGE  + KGR  AR   TK+ NVT+D
Sbjct: 78  IAQVTV-----KNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTMD 132

Query: 311 LTSEKLSANPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXKSAEMSCNMVVNLANGSV 132
           L S  ++ + +L +D+NSGFL LTS++ L+G           KS EM+C M VNLA   V
Sbjct: 133 LNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKLV 192

Query: 131 QELKC 117
           +++KC
Sbjct: 193 RDIKC 197


>ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294764 [Fragaria vesca
           subsp. vesca]
          Length = 182

 Score =  122 bits (307), Expect = 9e-26
 Identities = 67/156 (42%), Positives = 88/156 (56%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           LT+MKIK PK+RF +  V NFN+   T               VKNTNFGHFKY N+T +I
Sbjct: 29  LTVMKIKGPKVRFQTATVSNFNSDSSTA--ASFSGDLVTKFAVKNTNFGHFKYPNSTVSI 86

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLS 222
            Y G  +G   +P  +  AR T+R ++TI + S KLS   NL T I +G + LTSE+ L 
Sbjct: 87  LYEGQVIGTAAVPSQKAKARSTRRTDITISIDSSKLSGTTNLTTAIGAGVVPLTSESTLK 146

Query: 221 GXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
           G           KS +MSC M++NL   +V +LKCK
Sbjct: 147 GKVEVMKIIKKNKSGKMSCTMLLNLKTRTVDDLKCK 182


>gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]
          Length = 213

 Score =  121 bits (303), Expect = 3e-25
 Identities = 64/155 (41%), Positives = 90/155 (58%)
 Frame = -3

Query: 578 TIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATIA 399
           T+M+IK P+LR  SVA+++       TN             VKNTNFG FK++ ++ T  
Sbjct: 60  TVMRIKGPELRIRSVAIEDLTISNSDTNSPSLSMKFDSEIGVKNTNFGEFKFDESSITFV 119

Query: 398 YGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLSG 219
           Y G  VG+  + KG+  AR TK+ NVT +     ++AN NLA D+ SGFL LTS++KL+G
Sbjct: 120 YKGTEVGDASVEKGKAKARSTKKMNVTAE-----VNANSNLANDVRSGFLTLTSQSKLNG 174

Query: 218 XXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
                      K+AEM+C + +NL N  VQ+ KCK
Sbjct: 175 KVHLMKVIKKKKTAEMNCTITINLENKVVQDFKCK 209


>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score =  121 bits (303), Expect = 3e-25
 Identities = 66/156 (42%), Positives = 94/156 (60%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           L ++KI+ PK+R  S++V+N +    + +              KNTNFGHFK++N+TATI
Sbjct: 36  LLVLKIRDPKVRIASISVENQHFSTNSFSMDLKARVTV-----KNTNFGHFKFDNSTATI 90

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLS 222
           +Y G AVGE  I K R  +R TKRFN+T+ ++S K++ +  L  D+NSG L L+S AKLS
Sbjct: 91  SYFGTAVGEATILKARARSRSTKRFNITVPISSSKVNNHRQLRRDLNSGVLNLSSTAKLS 150

Query: 221 GXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
           G           KSAEMSC M ++    S++ L CK
Sbjct: 151 GKIHLFKIFKKKKSAEMSCTMELHTNTSSIENLSCK 186


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  119 bits (297), Expect = 1e-24
 Identities = 62/156 (39%), Positives = 92/156 (58%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           + +M+I++PK+R G V V+N N    +++              KNTNFGHFK++N+T TI
Sbjct: 37  MLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFSMNLNAQVTV-KNTNFGHFKFQNSTLTI 95

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLS 222
           +Y G  VGE  I K R  AR T + NVT+ ++S+K+S N  L++D+ SG + L+S AKL 
Sbjct: 96  SYRGTPVGEATIVKARARARSTTKLNVTVSVSSDKMSRNSALSSDVGSGTINLSSHAKLD 155

Query: 221 GXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
           G           KSAEM+C M V  ++  +Q L C+
Sbjct: 156 GKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLMCQ 191


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  118 bits (295), Expect = 2e-24
 Identities = 67/157 (42%), Positives = 91/157 (57%), Gaps = 1/157 (0%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFN-NGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTAT 405
           LT+M+IK+P  R  SV VQ+ N N  G  +              KN NFGHF+++NTTA 
Sbjct: 35  LTVMRIKTPSARLRSVTVQSLNYNASGVPHFNMRLIMEIAV---KNKNFGHFRFDNTTAN 91

Query: 404 IAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKL 225
           + +G V VG+  I K R  ARKTKR NVT+D++S  +S    L T ++SG L LT  A+L
Sbjct: 92  VTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSDEDELRTKLSSGTLTLTGVARL 151

Query: 224 SGXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
            G           K+AEM+C M VNL + +VQ+L C+
Sbjct: 152 RGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDCE 188


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  115 bits (287), Expect = 2e-23
 Identities = 62/157 (39%), Positives = 89/157 (56%), Gaps = 1/157 (0%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           LT+M++K+PK+R G V V+       T               VKNTNFGH+K++N T + 
Sbjct: 60  LTVMRVKNPKVRIGKVTVETMETSN-TEAAASFNLRFITQVTVKNTNFGHYKFDNATMSF 118

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKL-SANPNLATDINSGFLALTSEAKL 225
            Y GV VGE  IPK R  AR TK+ +VT+++ S  L S    L ++++S  L L S+AKL
Sbjct: 119 LYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKL 178

Query: 224 SGXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
            G           KS EM+C ++ N++  S+Q+LKCK
Sbjct: 179 KGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  114 bits (284), Expect = 4e-23
 Identities = 60/156 (38%), Positives = 92/156 (58%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           LT+MK+K+PK+R G + VQ+FN+   T +              KNTN+G +K++ +T T 
Sbjct: 59  LTVMKVKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIRV---KNTNWGPYKFDASTVTF 115

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLS 222
            Y GVAVG+  +PKG+   R TK+ NV + L +  L ++ NL +++NSG L L S+AKLS
Sbjct: 116 MYQGVAVGQVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLS 175

Query: 221 GXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
           G           KS+ M C +  +L+  +V+ L+CK
Sbjct: 176 GKVELMLIMKKKKSSTMDCMIGFDLSTKTVKSLQCK 211


>ref|XP_007051691.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein
           family, putative isoform 1 [Theobroma cacao]
           gi|590721708|ref|XP_007051692.1| Late embryogenesis
           abundant (LEA) hydroxyproline-rich glycoprotein family,
           putative isoform 1 [Theobroma cacao]
           gi|508703952|gb|EOX95848.1| Late embryogenesis abundant
           (LEA) hydroxyproline-rich glycoprotein family, putative
           isoform 1 [Theobroma cacao] gi|508703953|gb|EOX95849.1|
           Late embryogenesis abundant (LEA) hydroxyproline-rich
           glycoprotein family, putative isoform 1 [Theobroma
           cacao]
          Length = 220

 Score =  113 bits (283), Expect = 6e-23
 Identities = 56/155 (36%), Positives = 88/155 (56%)
 Frame = -3

Query: 578 TIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATIA 399
           T+ ++K P ++   VAV +     GTT              VKN N   FKY+NTT T+ 
Sbjct: 61  TVFRVKDPVIKMNGVAVTHLELINGTTPKPGSNISLIADVSVKNPNVASFKYKNTTTTLY 120

Query: 398 YGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLSG 219
           Y G  VGE   P GR  AR+T R N+++D+ +++L A+PNL  D+NSG L ++S +++ G
Sbjct: 121 YYGTIVGEARGPAGRAKARRTMRMNISVDIITDRLLASPNLVADVNSGTLTMSSYSRIGG 180

Query: 218 XXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
                       + +M+C+M VN+++ ++QE KCK
Sbjct: 181 RVNMLNIIKKHVTVKMNCSMTVNISSQAIQEQKCK 215


>ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306460 [Fragaria vesca
           subsp. vesca]
          Length = 219

 Score =  113 bits (282), Expect = 7e-23
 Identities = 57/155 (36%), Positives = 82/155 (52%)
 Frame = -3

Query: 578 TIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATIA 399
           T+ ++K PK+    V V       GTT              VKN N   FKY NTT T+ 
Sbjct: 60  TVFRVKEPKIMMNKVTVTKLELVNGTTPKPGTNISLTADVSVKNPNVASFKYSNTTTTLY 119

Query: 398 YGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLSG 219
           Y G  VGE   P GR  AR+T R N+T+D+ ++ L+ NPNL TD+ SG L ++S +++ G
Sbjct: 120 YHGTVVGEARGPPGRAKARRTMRMNITVDIITDILTTNPNLKTDVGSGLLTMSSYSRIPG 179

Query: 218 XXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
                         +M+C M VN+++ ++QE KCK
Sbjct: 180 RVNMLNIVKKHVVVKMNCTMTVNISSQAIQEQKCK 214


>gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus]
          Length = 214

 Score =  112 bits (281), Expect = 1e-22
 Identities = 62/183 (33%), Positives = 91/183 (49%)
 Frame = -3

Query: 662 HKSSRTKCXXXXXXXXXXXXXXXXXXALTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXX 483
           HK  RT+C                  +LT+MKI++P+ R  S  + NFN   GT      
Sbjct: 34  HKRKRTQCLIYIGLLAIIQIAVVIVFSLTVMKIRNPRFRIRSAHLTNFN--AGTPASPAF 91

Query: 482 XXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTS 303
                    VKN NFG +KY +TT    Y G  VGE F+ + R   R TK+FNV +DL+ 
Sbjct: 92  TGKLNAEFSVKNANFGRYKYMDTTVDFVYRGTRVGEVFVRESRAGWRTTKKFNVAVDLSL 151

Query: 302 EKLSANPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXKSAEMSCNMVVNLANGSVQEL 123
               ANP LA+D+N+G + ++SEA++SG           +S  ++C M +  A   ++ +
Sbjct: 152 ANARANPQLASDLNAGVVPISSEARMSGSVELLFVLKKNRSTGLNCTMEIVTATQQIRNI 211

Query: 122 KCK 114
            CK
Sbjct: 212 LCK 214


>ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca
           subsp. vesca]
          Length = 212

 Score =  112 bits (279), Expect = 2e-22
 Identities = 60/156 (38%), Positives = 89/156 (57%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           LT+MK+K+PK+R G+  VQN N                    +KNTN+G +K++  TAT 
Sbjct: 60  LTVMKVKTPKVRLGATNVQNLNF---VPTSPSFDTTFATQIRIKNTNWGPYKFDAGTATF 116

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLS 222
            Y GVAVG+   PK +   R TK+ N  + L S ++ +  NL ++++SG L LTSEAKL+
Sbjct: 117 MYQGVAVGQVSFPKSKAGMRSTKKINAEVSLNSNEIPSTSNLGSELSSGVLTLTSEAKLT 176

Query: 221 GXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
           G           KSA M+C M ++L+  ++Q L+CK
Sbjct: 177 GKVELMLIMKKKKSATMNCTMKLDLSTKTIQALECK 212


>ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721845|gb|EOY13742.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 192

 Score =  111 bits (278), Expect = 2e-22
 Identities = 67/158 (42%), Positives = 95/158 (60%), Gaps = 2/158 (1%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           L +M+I++PK+R G V V+N      +++             VKNTNFGHFK++N+T TI
Sbjct: 36  LIVMRIRNPKVRLGGVTVENLR-ASSSSSSPSFSTKLNAQVSVKNTNFGHFKFKNSTLTI 94

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTS-EKLSANPN-LATDINSGFLALTSEAK 228
           +Y G  VG+  I +G   AR TK+FNVTI ++S  K+S N + L++DI SG + L+S AK
Sbjct: 95  SYNGSPVGKATIVEGLARARSTKKFNVTILVSSNNKISRNSDQLSSDIESGTINLSSHAK 154

Query: 227 LSGXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
           L G           KSAEM+C M VN +   +Q+L CK
Sbjct: 155 LEGKIHLFKIFKKKKSAEMNCTMDVNTSLKQIQKLTCK 192


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  111 bits (278), Expect = 2e-22
 Identities = 59/156 (37%), Positives = 92/156 (58%)
 Frame = -3

Query: 581 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXVKNTNFGHFKYENTTATI 402
           LT+MK+K+PK+R G + VQ+ N+   T +              KNTN+G +K++ +TAT 
Sbjct: 59  LTVMKVKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIRV---KNTNWGPYKFDASTATF 115

Query: 401 AYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLS 222
            Y GVAVG+  IPK +   R TK+ +V++ L +  L ++  + T++NSG L LTS+AKL+
Sbjct: 116 MYQGVAVGQVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLT 175

Query: 221 GXXXXXXXXXXXKSAEMSCNMVVNLANGSVQELKCK 114
           G           KSA M C +  +L+  +V+ L+CK
Sbjct: 176 GKVELMLIMKKKKSATMDCTIAFDLSTKTVKSLQCK 211


Top