BLASTX nr result

ID: Paeonia22_contig00016007 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00016007
         (640 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   153   4e-35
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   145   1e-32
ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r...   139   7e-31
gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus...   139   1e-30
gb|EYU40055.1| hypothetical protein MIMGU_mgv1a018374mg [Mimulus...   137   3e-30
emb|CBI22611.3| unnamed protein product [Vitis vinifera]              137   3e-30
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   124   2e-26
ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294...   123   4e-26
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]   122   1e-25
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   119   6e-25
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   119   1e-24
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   118   1e-24
gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]     117   2e-24
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   114   2e-23
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   112   1e-22
ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-r...   112   1e-22
ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295...   111   2e-22
gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus...   111   2e-22
ref|XP_007051691.1| Late embryogenesis abundant (LEA) hydroxypro...   110   4e-22
ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306...   110   5e-22

>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  153 bits (387), Expect = 4e-35
 Identities = 87/192 (45%), Positives = 110/192 (57%)
 Frame = +1

Query: 46  MKGAHNHKSSRTKCXXXXXXXXXXXXXXXXXXXLTIMKIKSPKLRFGSVAVQNFNNGGGT 225
           MKG    K S  KC                   LT+M+IK+PK+RFG+V V+NF+ G  +
Sbjct: 1   MKG--EGKRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSS 58

Query: 226 TNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDARKTKR 405
           +                 VKNTNFGHFKYEN++  I YGG+ VGE  I K R  AR+TK+
Sbjct: 59  S-----PFFDMRLMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKK 113

Query: 406 FNVTIDLTSEKLSANPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXXSAEMSCNMVVN 585
           F+VTID++S KLS N NL  DI SG L L+SEAKLSG            S+EMSC M +N
Sbjct: 114 FDVTIDISSSKLSTNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGIN 173

Query: 586 LANGSVQELKCK 621
           +   +VQ+LKCK
Sbjct: 174 IGTRTVQDLKCK 185


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  145 bits (365), Expect = 1e-32
 Identities = 74/159 (46%), Positives = 99/159 (62%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           LT+M+IK+PK R  S+ V++       T+                VKNTNFGHFK++NTT
Sbjct: 47  LTVMRIKNPKFRVRSITVEDI----AYTSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTT 102

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504
            +  YGGV VGE F+ KGR  AR TK+ NVT+DL S  + AN NLA+DI+SGFL LT+  
Sbjct: 103 ISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVDLNSNNIPANSNLASDISSGFLTLTTHT 162

Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           KLSG            SA+M+C M VNLA+ ++Q++KC+
Sbjct: 163 KLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAIQDIKCQ 201


>ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 226

 Score =  139 bits (350), Expect = 7e-31
 Identities = 72/149 (48%), Positives = 96/149 (64%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           LT+M+I+SPK+RFG+V V++F+    ++                 VKNTNFGHFKYEN+T
Sbjct: 32  LTVMRIRSPKVRFGAVTVESFSTVNSSS-----PSFDMKLMAQVAVKNTNFGHFKYENST 86

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504
            TI YGG+ VGE  I KGR  AR+TK+FN+ +D++S +LS+N NL  DIN+G L L+S+A
Sbjct: 87  VTILYGGMPVGEAAIFKGRARARQTKKFNINVDISSSRLSSNSNLGNDINAGVLPLSSQA 146

Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLA 591
           KL G            S EMSC M +NLA
Sbjct: 147 KLKGKVHLMKVIKKKKSGEMSCTMGINLA 175


>gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus guttatus]
          Length = 192

 Score =  139 bits (349), Expect = 1e-30
 Identities = 74/186 (39%), Positives = 100/186 (53%), Gaps = 1/186 (0%)
 Frame = +1

Query: 67  KSSRTKCXXXXXXXXXXXXXXXXXXXLTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXX 246
           K S  KC                   LT+MKIKSPK+R  ++AV++F++     N     
Sbjct: 7   KKSSKKCLAYVAVFVVFQAAVIMVLALTVMKIKSPKIRLNAIAVESFSSSNNGNNAGPTP 66

Query: 247 XXXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDL 426
                      +KNTNFG FKY+N T  I Y GV +GE  IP+GR  ARKT +FNV+ DL
Sbjct: 67  SINMKLLTQLTIKNTNFGQFKYDNATLAILYNGVPLGEAVIPRGRVKARKTLKFNVSFDL 126

Query: 427 TSEKLSA-NPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSV 603
            S++L+  N NL  DINSG L L+S+A+++G            S  M+C+ +VNLA   V
Sbjct: 127 NSDRLNGNNTNLGNDINSGVLRLSSQARVNGKVHLMKIIKKNKSGNMNCDWIVNLATRMV 186

Query: 604 QELKCK 621
           + L CK
Sbjct: 187 ENLNCK 192


>gb|EYU40055.1| hypothetical protein MIMGU_mgv1a018374mg [Mimulus guttatus]
          Length = 183

 Score =  137 bits (345), Expect = 3e-30
 Identities = 71/160 (44%), Positives = 97/160 (60%), Gaps = 1/160 (0%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           LT++KIKSPK+RF ++AV++F +  G  N                +KNTNFG FKY+N T
Sbjct: 25  LTVLKIKSPKIRFNAIAVESFTSNNGN-NAGPTPSINMRLLTQLTIKNTNFGQFKYDNAT 83

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSA-NPNLATDINSGFLALTSE 501
             I Y GV +GE  IP+GR  ARKT +FNV+ DL S++L+  N NL  DINSG L L+S+
Sbjct: 84  LAILYNGVPLGEAVIPRGRVKARKTLKFNVSFDLNSDRLNGNNTNLGNDINSGVLRLSSQ 143

Query: 502 AKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           A+++G            S  M+C+ +VNLA   V+ L CK
Sbjct: 144 ARVNGKVHLMKIIKKNKSGNMNCDWIVNLATRMVENLNCK 183


>emb|CBI22611.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  137 bits (345), Expect = 3e-30
 Identities = 71/158 (44%), Positives = 98/158 (62%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           +T+M+I+SPK RF +V+++N N    TT+                VKNTNFGHFK++N+T
Sbjct: 143 VTLMRIRSPKFRFRAVSIENLNYTSDTTSPSFNIRFNAKVA----VKNTNFGHFKFKNST 198

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504
            T+AY G  VG+  I K R  AR TK+ NVT+D+TS  +S+N NLA+DINSGFL LT + 
Sbjct: 199 ITLAYRGDHVGDAKISKARARARSTKKMNVTVDVTSNNVSSNSNLASDINSGFLTLTGQG 258

Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKC 618
           KL+G            S +M+C + +NL N  +QE KC
Sbjct: 259 KLNGKVHLMKVFKKKKSPQMNCTIKINLENKVIQEWKC 296


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  124 bits (312), Expect = 2e-26
 Identities = 74/188 (39%), Positives = 99/188 (52%), Gaps = 4/188 (2%)
 Frame = +1

Query: 67  KSSRTKCXXXXXXXXXXXXXXXXXXXLTIMKIKSPKLRFGSVAVQN--FNNGGGTTNXXX 240
           +  R KC                   LT+M+IK+PK R  SV V +  FNN   + N   
Sbjct: 18  RKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSSPSFNMKF 77

Query: 241 XXXXXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDAR--KTKRFNV 414
                        VKNTNFGH+K+EN+T T AY G  VGE  + KGR  AR   TK+ NV
Sbjct: 78  IAQVT--------VKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNV 129

Query: 415 TIDLTSEKLSANPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXXSAEMSCNMVVNLAN 594
           T+DL S  ++ + +L +D+NSGFL LTS++ L+G            S EM+C M VNLA 
Sbjct: 130 TMDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQ 189

Query: 595 GSVQELKC 618
             V+++KC
Sbjct: 190 KLVRDIKC 197


>ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294764 [Fragaria vesca
           subsp. vesca]
          Length = 182

 Score =  123 bits (309), Expect = 4e-26
 Identities = 66/159 (41%), Positives = 87/159 (54%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           LT+MKIK PK+RF +  V NFN+   T                  VKNTNFGHFKY N+T
Sbjct: 29  LTVMKIKGPKVRFQTATVSNFNSDSSTA-----ASFSGDLVTKFAVKNTNFGHFKYPNST 83

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504
            +I Y G  +G   +P  +  AR T+R ++TI + S KLS   NL T I +G + LTSE+
Sbjct: 84  VSILYEGQVIGTAAVPSQKAKARSTRRTDITISIDSSKLSGTTNLTTAIGAGVVPLTSES 143

Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
            L G            S +MSC M++NL   +V +LKCK
Sbjct: 144 TLKGKVEVMKIIKKNKSGKMSCTMLLNLKTRTVDDLKCK 182


>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score =  122 bits (305), Expect = 1e-25
 Identities = 66/159 (41%), Positives = 94/159 (59%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           L ++KI+ PK+R  S++V+N +    + +                VKNTNFGHFK++N+T
Sbjct: 36  LLVLKIRDPKVRIASISVENQHFSTNSFSMDLKARVT--------VKNTNFGHFKFDNST 87

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504
           ATI+Y G AVGE  I K R  +R TKRFN+T+ ++S K++ +  L  D+NSG L L+S A
Sbjct: 88  ATISYFGTAVGEATILKARARSRSTKRFNITVPISSSKVNNHRQLRRDLNSGVLNLSSTA 147

Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           KLSG            SAEMSC M ++    S++ L CK
Sbjct: 148 KLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENLSCK 186


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  119 bits (299), Expect = 6e-25
 Identities = 62/159 (38%), Positives = 92/159 (57%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           + +M+I++PK+R G V V+N N    +++                VKNTNFGHFK++N+T
Sbjct: 37  MLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFSMNLNAQVT----VKNTNFGHFKFQNST 92

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504
            TI+Y G  VGE  I K R  AR T + NVT+ ++S+K+S N  L++D+ SG + L+S A
Sbjct: 93  LTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSDKMSRNSALSSDVGSGTINLSSHA 152

Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           KL G            SAEM+C M V  ++  +Q L C+
Sbjct: 153 KLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLMCQ 191


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  119 bits (297), Expect = 1e-24
 Identities = 67/160 (41%), Positives = 91/160 (56%), Gaps = 1/160 (0%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFN-NGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENT 321
           LT+M+IK+P  R  SV VQ+ N N  G  +                VKN NFGHF+++NT
Sbjct: 35  LTVMRIKTPSARLRSVTVQSLNYNASGVPHFNMRLIMEIA------VKNKNFGHFRFDNT 88

Query: 322 TATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSE 501
           TA + +G V VG+  I K R  ARKTKR NVT+D++S  +S    L T ++SG L LT  
Sbjct: 89  TANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSDEDELRTKLSSGTLTLTGV 148

Query: 502 AKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           A+L G            +AEM+C M VNL + +VQ+L C+
Sbjct: 149 ARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDCE 188


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  118 bits (296), Expect = 1e-24
 Identities = 62/160 (38%), Positives = 90/160 (56%), Gaps = 1/160 (0%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           LT+M++K+PK+R G V V+       T+N                VKNTNFGH+K++N T
Sbjct: 60  LTVMRVKNPKVRIGKVTVETME----TSNTEAAASFNLRFITQVTVKNTNFGHYKFDNAT 115

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKL-SANPNLATDINSGFLALTSE 501
            +  Y GV VGE  IPK R  AR TK+ +VT+++ S  L S    L ++++S  L L S+
Sbjct: 116 MSFLYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQ 175

Query: 502 AKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           AKL G            S EM+C ++ N++  S+Q+LKCK
Sbjct: 176 AKLKGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215


>gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]
          Length = 213

 Score =  117 bits (294), Expect = 2e-24
 Identities = 63/158 (39%), Positives = 89/158 (56%)
 Frame = +1

Query: 148 TIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTTA 327
           T+M+IK P+LR  SVA+++       TN                VKNTNFG FK++ ++ 
Sbjct: 60  TVMRIKGPELRIRSVAIEDLTISNSDTNSPSLSMKFDSEIG---VKNTNFGEFKFDESSI 116

Query: 328 TIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAK 507
           T  Y G  VG+  + KG+  AR TK+ NVT ++     +AN NLA D+ SGFL LTS++K
Sbjct: 117 TFVYKGTEVGDASVEKGKAKARSTKKMNVTAEV-----NANSNLANDVRSGFLTLTSQSK 171

Query: 508 LSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           L+G            +AEM+C + +NL N  VQ+ KCK
Sbjct: 172 LNGKVHLMKVIKKKKTAEMNCTITINLENKVVQDFKCK 209


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  114 bits (286), Expect = 2e-23
 Identities = 60/159 (37%), Positives = 92/159 (57%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           LT+MK+K+PK+R G + VQ+FN+   T +                VKNTN+G +K++ +T
Sbjct: 59  LTVMKVKTPKVRLGEINVQDFNSVPATPSFDTTFTTQIR------VKNTNWGPYKFDAST 112

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504
            T  Y GVAVG+  +PKG+   R TK+ NV + L +  L ++ NL +++NSG L L S+A
Sbjct: 113 VTFMYQGVAVGQVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQA 172

Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           KLSG            S+ M C +  +L+  +V+ L+CK
Sbjct: 173 KLSGKVELMLIMKKKKSSTMDCMIGFDLSTKTVKSLQCK 211


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  112 bits (280), Expect = 1e-22
 Identities = 59/159 (37%), Positives = 92/159 (57%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           LT+MK+K+PK+R G + VQ+ N+   T +                VKNTN+G +K++ +T
Sbjct: 59  LTVMKVKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIR------VKNTNWGPYKFDAST 112

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504
           AT  Y GVAVG+  IPK +   R TK+ +V++ L +  L ++  + T++NSG L LTS+A
Sbjct: 113 ATFMYQGVAVGQVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQA 172

Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           KL+G            SA M C +  +L+  +V+ L+CK
Sbjct: 173 KLTGKVELMLIMKKKKSATMDCTIAFDLSTKTVKSLQCK 211


>ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721845|gb|EOY13742.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 192

 Score =  112 bits (279), Expect = 1e-22
 Identities = 66/161 (40%), Positives = 94/161 (58%), Gaps = 2/161 (1%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           L +M+I++PK+R G V V+N      +++                VKNTNFGHFK++N+T
Sbjct: 36  LIVMRIRNPKVRLGGVTVENLRASSSSSSPSFSTKLNAQVS----VKNTNFGHFKFKNST 91

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTS-EKLSANPN-LATDINSGFLALTS 498
            TI+Y G  VG+  I +G   AR TK+FNVTI ++S  K+S N + L++DI SG + L+S
Sbjct: 92  LTISYNGSPVGKATIVEGLARARSTKKFNVTILVSSNNKISRNSDQLSSDIESGTINLSS 151

Query: 499 EAKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
            AKL G            SAEM+C M VN +   +Q+L CK
Sbjct: 152 HAKLEGKIHLFKIFKKKKSAEMNCTMDVNTSLKQIQKLTCK 192


>ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca
           subsp. vesca]
          Length = 212

 Score =  111 bits (278), Expect = 2e-22
 Identities = 59/159 (37%), Positives = 90/159 (56%)
 Frame = +1

Query: 145 LTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTT 324
           LT+MK+K+PK+R G+  VQN N    + +                +KNTN+G +K++  T
Sbjct: 60  LTVMKVKTPKVRLGATNVQNLNFVPTSPSFDTTFATQIR------IKNTNWGPYKFDAGT 113

Query: 325 ATIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEA 504
           AT  Y GVAVG+   PK +   R TK+ N  + L S ++ +  NL ++++SG L LTSEA
Sbjct: 114 ATFMYQGVAVGQVSFPKSKAGMRSTKKINAEVSLNSNEIPSTSNLGSELSSGVLTLTSEA 173

Query: 505 KLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           KL+G            SA M+C M ++L+  ++Q L+CK
Sbjct: 174 KLTGKVELMLIMKKKKSATMNCTMKLDLSTKTIQALECK 212


>gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus]
          Length = 214

 Score =  111 bits (277), Expect = 2e-22
 Identities = 61/186 (32%), Positives = 89/186 (47%)
 Frame = +1

Query: 64  HKSSRTKCXXXXXXXXXXXXXXXXXXXLTIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXX 243
           HK  RT+C                   LT+MKI++P+ R  S  + NFN G   +     
Sbjct: 34  HKRKRTQCLIYIGLLAIIQIAVVIVFSLTVMKIRNPRFRIRSAHLTNFNAGTPAS----- 88

Query: 244 XXXXXXXXXXXXVKNTNFGHFKYENTTATIAYGGVAVGEFFIPKGRTDARKTKRFNVTID 423
                       VKN NFG +KY +TT    Y G  VGE F+ + R   R TK+FNV +D
Sbjct: 89  PAFTGKLNAEFSVKNANFGRYKYMDTTVDFVYRGTRVGEVFVRESRAGWRTTKKFNVAVD 148

Query: 424 LTSEKLSANPNLATDINSGFLALTSEAKLSGXXXXXXXXXXXXSAEMSCNMVVNLANGSV 603
           L+     ANP LA+D+N+G + ++SEA++SG            S  ++C M +  A   +
Sbjct: 149 LSLANARANPQLASDLNAGVVPISSEARMSGSVELLFVLKKNRSTGLNCTMEIVTATQQI 208

Query: 604 QELKCK 621
           + + CK
Sbjct: 209 RNILCK 214


>ref|XP_007051691.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein
           family, putative isoform 1 [Theobroma cacao]
           gi|590721708|ref|XP_007051692.1| Late embryogenesis
           abundant (LEA) hydroxyproline-rich glycoprotein family,
           putative isoform 1 [Theobroma cacao]
           gi|508703952|gb|EOX95848.1| Late embryogenesis abundant
           (LEA) hydroxyproline-rich glycoprotein family, putative
           isoform 1 [Theobroma cacao] gi|508703953|gb|EOX95849.1|
           Late embryogenesis abundant (LEA) hydroxyproline-rich
           glycoprotein family, putative isoform 1 [Theobroma
           cacao]
          Length = 220

 Score =  110 bits (275), Expect = 4e-22
 Identities = 56/158 (35%), Positives = 88/158 (55%)
 Frame = +1

Query: 148 TIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTTA 327
           T+ ++K P ++   VAV +     GTT                 VKN N   FKY+NTT 
Sbjct: 61  TVFRVKDPVIKMNGVAVTHLELINGTT---PKPGSNISLIADVSVKNPNVASFKYKNTTT 117

Query: 328 TIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAK 507
           T+ Y G  VGE   P GR  AR+T R N+++D+ +++L A+PNL  D+NSG L ++S ++
Sbjct: 118 TLYYYGTIVGEARGPAGRAKARRTMRMNISVDIITDRLLASPNLVADVNSGTLTMSSYSR 177

Query: 508 LSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           + G            + +M+C+M VN+++ ++QE KCK
Sbjct: 178 IGGRVNMLNIIKKHVTVKMNCSMTVNISSQAIQEQKCK 215


>ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306460 [Fragaria vesca
           subsp. vesca]
          Length = 219

 Score =  110 bits (274), Expect = 5e-22
 Identities = 57/158 (36%), Positives = 82/158 (51%)
 Frame = +1

Query: 148 TIMKIKSPKLRFGSVAVQNFNNGGGTTNXXXXXXXXXXXXXXXXVKNTNFGHFKYENTTA 327
           T+ ++K PK+    V V       GTT                 VKN N   FKY NTT 
Sbjct: 60  TVFRVKEPKIMMNKVTVTKLELVNGTT---PKPGTNISLTADVSVKNPNVASFKYSNTTT 116

Query: 328 TIAYGGVAVGEFFIPKGRTDARKTKRFNVTIDLTSEKLSANPNLATDINSGFLALTSEAK 507
           T+ Y G  VGE   P GR  AR+T R N+T+D+ ++ L+ NPNL TD+ SG L ++S ++
Sbjct: 117 TLYYHGTVVGEARGPPGRAKARRTMRMNITVDIITDILTTNPNLKTDVGSGLLTMSSYSR 176

Query: 508 LSGXXXXXXXXXXXXSAEMSCNMVVNLANGSVQELKCK 621
           + G              +M+C M VN+++ ++QE KCK
Sbjct: 177 IPGRVNMLNIVKKHVVVKMNCTMTVNISSQAIQEQKCK 214


Top