BLASTX nr result

ID: Akebia27_contig00028304 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00028304
         (828 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prun...   172   2e-40
emb|CBI22611.3| unnamed protein product [Vitis vinifera]              167   3e-39
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   166   1e-38
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   161   2e-37
ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295...   161   2e-37
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   159   2e-36
ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295...   156   1e-35
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   155   2e-35
gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus...   154   3e-35
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   153   9e-35
gb|EYU40055.1| hypothetical protein MIMGU_mgv1a018374mg [Mimulus...   151   3e-34
ref|XP_007209620.1| hypothetical protein PRUPE_ppa011689mg [Prun...   149   1e-33
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   148   2e-33
ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294...   148   3e-33
ref|XP_007219159.1| hypothetical protein PRUPE_ppa016330mg [Prun...   145   2e-32
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   144   3e-32
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   144   4e-32
ref|XP_004300832.1| PREDICTED: uncharacterized protein LOC101296...   143   9e-32
gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]     142   2e-31
ref|XP_004298841.1| PREDICTED: uncharacterized protein LOC101294...   142   2e-31

>ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prunus persica]
           gi|462406447|gb|EMJ11911.1| hypothetical protein
           PRUPE_ppa022983mg [Prunus persica]
          Length = 209

 Score =  172 bits (435), Expect = 2e-40
 Identities = 86/192 (44%), Positives = 120/192 (62%)
 Frame = +1

Query: 139 DEERAMLGPEELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNF 318
           DEE   L  EEL       +  YIV+FIV+Q+IV  V G TV+KVK P  ++ ++ +QN 
Sbjct: 18  DEESTALQSEELKRQKRIKMYKYIVIFIVVQLIVLPVFGLTVMKVKTPKFRLGNIKVQNL 77

Query: 319 NYDPTTPSANMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRR 498
           +  P+TPS      T++RVKN NWG YKF   T+   Y+G TVG+  +P  KAK+RST++
Sbjct: 78  SSVPSTPSFEASFATQIRVKNTNWGPYKFDAGTVTFMYKGVTVGQVVVPKSKAKMRSTKK 137

Query: 499 VNVTFDVTSEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTIN 678
           ++VT  + S  L  +SNL ++L SG LT ++  KL GKV LMLMMKK KS  M+CT T +
Sbjct: 138 IDVTVSLNSYGLPSSSNLGTELKSGVLTLSSKGKLTGKVVLMLMMKKRKSATMDCTMTFD 197

Query: 679 TPNQSAHDIICK 714
              ++   + CK
Sbjct: 198 LSTKTLKTLQCK 209


>emb|CBI22611.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  167 bits (424), Expect = 3e-39
 Identities = 87/184 (47%), Positives = 119/184 (64%), Gaps = 2/184 (1%)
 Frame = +1

Query: 166 EELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNFNY--DPTTP 339
           EEL        I Y+  F + + IV  V   T+++++ P  +  +VSI+N NY  D T+P
Sbjct: 113 EELRRMKCTRYIAYLSAFALFETIVIMVCVVTLMRIRSPKFRFRAVSIENLNYTSDTTSP 172

Query: 340 SANMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVTFDV 519
           S N+R   +V VKN N+G +KF+NST+ L+YRG+ VG+A+I   +A+ RST+++NVT DV
Sbjct: 173 SFNIRFNAKVAVKNTNFGHFKFKNSTITLAYRGDHVGDAKISKARARARSTKKMNVTVDV 232

Query: 520 TSEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQSAH 699
           TS  +S NSNLASD+NSG LT     KLNGKV LM + KK KS QMNCT  IN  N+   
Sbjct: 233 TSNNVSSNSNLASDINSGFLTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKINLENKVIQ 292

Query: 700 DIIC 711
           +  C
Sbjct: 293 EWKC 296


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  166 bits (419), Expect = 1e-38
 Identities = 84/193 (43%), Positives = 121/193 (62%)
 Frame = +1

Query: 136 SDEERAMLGPEELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQN 315
           SD E +++  +EL       L TYI +FIV Q+IV  V G TV+KVK P +++  +++Q+
Sbjct: 20  SDGE-SLVSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMKVKTPKVRLGEINVQD 78

Query: 316 FNYDPTTPSANMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTR 495
           FN  P TPS +    T++RVKN NWG YKF  ST+   Y+G  VG+  +P  KA +RST+
Sbjct: 79  FNSVPATPSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVGQVTVPKGKAGMRSTK 138

Query: 496 RVNVTFDVTSEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTI 675
           ++NV   + +  L  +SNL S+LNSG LT N+  KL+GKV LML+MKK KS  M+C    
Sbjct: 139 KMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLIMKKKKSSTMDCMIGF 198

Query: 676 NTPNQSAHDIICK 714
           +   ++   + CK
Sbjct: 199 DLSTKTVKSLQCK 211


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  161 bits (408), Expect = 2e-37
 Identities = 81/184 (44%), Positives = 115/184 (62%), Gaps = 2/184 (1%)
 Frame = +1

Query: 169 ELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNFNYD--PTTPS 342
           EL       L  Y   F+V Q IV  V   TV+++K P  ++ S+++++  Y   P  PS
Sbjct: 18  ELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNPPS 77

Query: 343 ANMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVTFDVT 522
            NM+   EV VKN N+G +KF N+T+   Y G  VGEA +   +AK RST+++NVT D+ 
Sbjct: 78  FNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVDLN 137

Query: 523 SEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQSAHD 702
           S  +  NSNLASD++SG LT   +TKL+GKV LM ++KK KS QMNCT T+N  +++  D
Sbjct: 138 SNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAIQD 197

Query: 703 IICK 714
           I C+
Sbjct: 198 IKCQ 201


>ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca
           subsp. vesca]
          Length = 212

 Score =  161 bits (408), Expect = 2e-37
 Identities = 78/183 (42%), Positives = 111/183 (60%)
 Frame = +1

Query: 166 EELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNFNYDPTTPSA 345
           EEL       L TYI +FI  Q+IV  V G TV+KVK P +++ + ++QN N+ PT+PS 
Sbjct: 30  EELKRQKRIKLFTYIGIFIGFQIIVMTVFGLTVMKVKTPKVRLGATNVQNLNFVPTSPSF 89

Query: 346 NMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVTFDVTS 525
           +    T++R+KN NWG YKF   T    Y+G  VG+   P  KA +RST+++N    + S
Sbjct: 90  DTTFATQIRIKNTNWGPYKFDAGTATFMYQGVAVGQVSFPKSKAGMRSTKKINAEVSLNS 149

Query: 526 EQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQSAHDI 705
            ++   SNL S+L+SG LT  +  KL GKV LML+MKK KS  MNCT  ++   ++   +
Sbjct: 150 NEIPSTSNLGSELSSGVLTLTSEAKLTGKVELMLIMKKKKSATMNCTMKLDLSTKTIQAL 209

Query: 706 ICK 714
            CK
Sbjct: 210 ECK 212


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  159 bits (401), Expect = 2e-36
 Identities = 79/193 (40%), Positives = 119/193 (61%)
 Frame = +1

Query: 136 SDEERAMLGPEELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQN 315
           SD E +++  +EL       L  YI +FIV+Q+IV  V G TV+KVK P +++  +++Q+
Sbjct: 20  SDGE-SLVSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMKVKTPKVRLGGINVQS 78

Query: 316 FNYDPTTPSANMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTR 495
            N  P TPS +    T++RVKN NWG YKF  ST    Y+G  VG+  IP  KA++RST+
Sbjct: 79  LNSVPATPSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVGQVSIPKSKARMRSTK 138

Query: 496 RVNVTFDVTSEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTI 675
           +++V+  + +  L  +S + ++LNSG LT  +  KL GKV LML+MKK KS  M+CT   
Sbjct: 139 KISVSVILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLIMKKKKSATMDCTIAF 198

Query: 676 NTPNQSAHDIICK 714
           +   ++   + CK
Sbjct: 199 DLSTKTVKSLQCK 211


>ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  156 bits (394), Expect = 1e-35
 Identities = 78/193 (40%), Positives = 117/193 (60%)
 Frame = +1

Query: 136 SDEERAMLGPEELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQN 315
           SD E +++  +EL       L TYI +FIV Q+IV  V G TV+KVK P  +  S+ ++ 
Sbjct: 20  SDGE-SLVSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMKVKTPKARWGSIDVET 78

Query: 316 FNYDPTTPSANMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTR 495
            NY P TPS +    T++R+KN NWG YKF   T    Y+G T+G+  IP  KA +RST+
Sbjct: 79  LNYVPATPSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIGKVDIPKSKAGMRSTK 138

Query: 496 RVNVTFDVTSEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTI 675
           +++V   + +  L  +S L ++L+SG LT  +  +L GKV LML+MKKNK+  M+CT   
Sbjct: 139 KIDVEVSLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLIMKKNKNASMDCTIAF 198

Query: 676 NTPNQSAHDIICK 714
           +  +++   + CK
Sbjct: 199 DLSSKTVQSLQCK 211


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  155 bits (391), Expect = 2e-35
 Identities = 78/194 (40%), Positives = 118/194 (60%), Gaps = 2/194 (1%)
 Frame = +1

Query: 136 SDEERAMLGPEELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQN 315
           SD    M   +EL        + Y+  F++ Q  +  V   TV+++K P  +I SV + +
Sbjct: 4   SDVAFPMEQSKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDD 63

Query: 316 FNYDPTTPSANMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIP--AIKAKLRS 489
             ++ ++PS NM+ I +V VKN N+G YKF NST+  +Y+G  VGEA +     +A+ RS
Sbjct: 64  LTFNNSSPSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARS 123

Query: 490 TRRVNVTFDVTSEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTF 669
           T+++NVT D+ S  ++ +S+L SDLNSG LT  + + LNGKV LM ++KK KS +MNCT 
Sbjct: 124 TKKMNVTMDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTM 183

Query: 670 TINTPNQSAHDIIC 711
           T+N   +   DI C
Sbjct: 184 TVNLAQKLVRDIKC 197


>gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus guttatus]
          Length = 192

 Score =  154 bits (390), Expect = 3e-35
 Identities = 73/179 (40%), Positives = 120/179 (67%), Gaps = 7/179 (3%)
 Frame = +1

Query: 199 ITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNF------NYDPTTPSANMRLI 360
           + Y+ VF+V Q  V  VL  TV+K+K P I++ ++++++F      N    TPS NM+L+
Sbjct: 14  LAYVAVFVVFQAAVIMVLALTVMKIKSPKIRLNAIAVESFSSSNNGNNAGPTPSINMKLL 73

Query: 361 TEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVTFDVTSEQLSG 540
           T++ +KN N+G++K+ N+T+ + Y G  +GEA IP  + K R T + NV+FD+ S++L+G
Sbjct: 74  TQLTIKNTNFGQFKYDNATLAILYNGVPLGEAVIPRGRVKARKTLKFNVSFDLNSDRLNG 133

Query: 541 -NSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQSAHDIICK 714
            N+NL +D+NSG L  ++  ++NGKV LM ++KKNKSG MNC + +N   +   ++ CK
Sbjct: 134 NNTNLGNDINSGVLRLSSQARVNGKVHLMKIIKKNKSGNMNCDWIVNLATRMVENLNCK 192


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  153 bits (386), Expect = 9e-35
 Identities = 85/196 (43%), Positives = 115/196 (58%), Gaps = 3/196 (1%)
 Frame = +1

Query: 136 SDEERAMLGPEELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQN 315
           SDEE A L  +EL          YI  F V Q +V  +   TV++VK P ++I  V+++ 
Sbjct: 20  SDEESASLQSKELKRKKRIKYAVYIAAFAVFQTVVILIFALTVMRVKNPKVRIGKVTVET 79

Query: 316 FNYDPTTPSA--NMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRS 489
                T  +A  N+R IT+V VKN N+G YKF N+TM   Y G  VGEA IP  +A+ RS
Sbjct: 80  METSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGEAIIPKARARARS 139

Query: 490 TRRVNVTFDVTSEQL-SGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCT 666
           T++++VT +V S  L S  + L S+L+S  LT N+  KL GKV LM +MKK KS +MNCT
Sbjct: 140 TKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKVMKKKKSPEMNCT 199

Query: 667 FTINTPNQSAHDIICK 714
              N   +S  D+ CK
Sbjct: 200 LIFNVSTRSLQDLKCK 215


>gb|EYU40055.1| hypothetical protein MIMGU_mgv1a018374mg [Mimulus guttatus]
          Length = 183

 Score =  151 bits (382), Expect = 3e-34
 Identities = 73/175 (41%), Positives = 117/175 (66%), Gaps = 6/175 (3%)
 Frame = +1

Query: 208 IVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNF-----NYDPTTPSANMRLITEVR 372
           + VF++ Q  V  VL  TVLK+K P I+  ++++++F     N    TPS NMRL+T++ 
Sbjct: 9   VAVFVLFQAAVIMVLALTVLKIKSPKIRFNAIAVESFTSNNGNNAGPTPSINMRLLTQLT 68

Query: 373 VKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVTFDVTSEQLSG-NSN 549
           +KN N+G++K+ N+T+ + Y G  +GEA IP  + K R T + NV+FD+ S++L+G N+N
Sbjct: 69  IKNTNFGQFKYDNATLAILYNGVPLGEAVIPRGRVKARKTLKFNVSFDLNSDRLNGNNTN 128

Query: 550 LASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQSAHDIICK 714
           L +D+NSG L  ++  ++NGKV LM ++KKNKSG MNC + +N   +   ++ CK
Sbjct: 129 LGNDINSGVLRLSSQARVNGKVHLMKIIKKNKSGNMNCDWIVNLATRMVENLNCK 183


>ref|XP_007209620.1| hypothetical protein PRUPE_ppa011689mg [Prunus persica]
           gi|462405355|gb|EMJ10819.1| hypothetical protein
           PRUPE_ppa011689mg [Prunus persica]
          Length = 200

 Score =  149 bits (377), Expect = 1e-33
 Identities = 77/194 (39%), Positives = 113/194 (58%), Gaps = 1/194 (0%)
 Frame = +1

Query: 133 GSDEERAMLGPEELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITS-VSI 309
           G  E  A    EEL        + Y  +FIV Q+IV  +   TV+K K P +K+ S V I
Sbjct: 6   GDHEANAYHQAEELKRQKKIKRLKYFGIFIVFQVIVITIFSLTVMKAKTPKLKLASNVYI 65

Query: 310 QNFNYDPTTPSANMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRS 489
           Q   Y P TPS +M  IT+VRV+N NWG +KFR+ T++ +Y+G  VG+  IP  K  LRS
Sbjct: 66  QTLTYSPATPSFDMSFITQVRVRNPNWGPFKFRDGTVVFTYQGVVVGQVYIPNGKVGLRS 125

Query: 490 TRRVNVTFDVTSEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTF 669
           T+++ V  +V S  L G S L ++L++G L   +  +L GKV LML+MK  K+ +++C+ 
Sbjct: 126 TKKITVLVNVNSNALPGKSALGNELSNGLLLLTSTAELKGKVELMLIMKTKKTAELSCSM 185

Query: 670 TINTPNQSAHDIIC 711
             N   +S  ++ C
Sbjct: 186 VFNLAARSLQNLDC 199


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  148 bits (374), Expect = 2e-33
 Identities = 73/188 (38%), Positives = 115/188 (61%), Gaps = 1/188 (0%)
 Frame = +1

Query: 154 MLGPEELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNFNYDPT 333
           ML  E+           YI+  +V Q I+  V   TV+++K P+ ++ SV++Q+ NY+ +
Sbjct: 1   MLESEKYKRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNAS 60

Query: 334 -TPSANMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVT 510
             P  NMRLI E+ VKNKN+G ++F N+T  +++    VG+ +I   +A+ R T+R+NVT
Sbjct: 61  GVPHFNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVT 120

Query: 511 FDVTSEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQ 690
            DV+S  +S    L + L+SG LT     +L GKVTLM +MKK K+ +MNCT T+N  + 
Sbjct: 121 VDVSSSAVSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSH 180

Query: 691 SAHDIICK 714
           +  D+ C+
Sbjct: 181 AVQDLDCE 188


>ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294764 [Fragaria vesca
           subsp. vesca]
          Length = 182

 Score =  148 bits (373), Expect = 3e-33
 Identities = 67/173 (38%), Positives = 114/173 (65%), Gaps = 1/173 (0%)
 Frame = +1

Query: 199 ITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNFNYDPTTPSA-NMRLITEVRV 375
           + Y+ +FIV Q+IV  +   TV+K+K P ++  + ++ NFN D +T ++ +  L+T+  V
Sbjct: 10  LAYVAIFIVFQIIVITIFALTVMKIKGPKVRFQTATVSNFNSDSSTAASFSGDLVTKFAV 69

Query: 376 KNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVTFDVTSEQLSGNSNLA 555
           KN N+G +K+ NST+ + Y G+ +G A +P+ KAK RSTRR ++T  + S +LSG +NL 
Sbjct: 70  KNTNFGHFKYPNSTVSILYEGQVIGTAAVPSQKAKARSTRRTDITISIDSSKLSGTTNLT 129

Query: 556 SDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQSAHDIICK 714
           + + +G +   + + L GKV +M ++KKNKSG+M+CT  +N   ++  D+ CK
Sbjct: 130 TAIGAGVVPLTSESTLKGKVEVMKIIKKNKSGKMSCTMLLNLKTRTVDDLKCK 182


>ref|XP_007219159.1| hypothetical protein PRUPE_ppa016330mg [Prunus persica]
           gi|462415621|gb|EMJ20358.1| hypothetical protein
           PRUPE_ppa016330mg [Prunus persica]
          Length = 189

 Score =  145 bits (366), Expect = 2e-32
 Identities = 71/172 (41%), Positives = 109/172 (63%), Gaps = 2/172 (1%)
 Frame = +1

Query: 205 YIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNF--NYDPTTPSANMRLITEVRVK 378
           YI   I+LQ I+  +    V+++K P +++ SV++ +   N  P++PS  +++   V VK
Sbjct: 18  YIAAGIILQTIIIVLFVVFVMRIKTPKVRLDSVAVDSLTANSSPSSPSFKVQINALVTVK 77

Query: 379 NKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVTFDVTSEQLSGNSNLAS 558
           NKN+G YKF  S +  SY+G  VGE  I   KAK + T+++NVT  + S ++S +S L+S
Sbjct: 78  NKNFGHYKFEGSKVTFSYKGTAVGEGTIAKAKAKAKRTKKINVTVSLNSNKVSSHSQLSS 137

Query: 559 DLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQSAHDIICK 714
           DL+SGNLT  AY KL+GKV L  ++KK KS  +NCT  ++T  +  H + CK
Sbjct: 138 DLSSGNLTLTAYAKLDGKVHLFKVIKKKKSANLNCTVHVDTKAKVVHVLTCK 189


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  144 bits (364), Expect = 3e-32
 Identities = 66/174 (37%), Positives = 116/174 (66%), Gaps = 2/174 (1%)
 Frame = +1

Query: 199 ITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNFNYDPTT--PSANMRLITEVR 372
           + YIV  ++ Q I+  +    V++++ P +++  V+++N N + ++  PS +M L  +V 
Sbjct: 18  LAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFSMNLNAQVT 77

Query: 373 VKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVTFDVTSEQLSGNSNL 552
           VKN N+G +KF+NST+ +SYRG  VGEA I   +A+ RST ++NVT  V+S+++S NS L
Sbjct: 78  VKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSDKMSRNSAL 137

Query: 553 ASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQSAHDIICK 714
           +SD+ SG +  +++ KL+GK+ L  + KK KS +MNCT  + T ++   +++C+
Sbjct: 138 SSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLMCQ 191


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  144 bits (363), Expect = 4e-32
 Identities = 68/173 (39%), Positives = 115/173 (66%), Gaps = 1/173 (0%)
 Frame = +1

Query: 199 ITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNFNY-DPTTPSANMRLITEVRV 375
           + Y+ VF+V Q  +  +   TV+++K P ++  +V+++NF+  + ++P  +MRL+ +V V
Sbjct: 13  LAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMRLMAQVTV 72

Query: 376 KNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVTFDVTSEQLSGNSNLA 555
           KN N+G +K+ NS++ + Y G  VGEA I   +A+ R T++ +VT D++S +LS NSNL 
Sbjct: 73  KNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKLSTNSNLG 132

Query: 556 SDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQSAHDIICK 714
           +D+ SG L  ++  KL+GKV LM ++KK KS +M+CT  IN   ++  D+ CK
Sbjct: 133 NDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKCK 185


>ref|XP_004300832.1| PREDICTED: uncharacterized protein LOC101296778 [Fragaria vesca
           subsp. vesca]
          Length = 237

 Score =  143 bits (360), Expect = 9e-32
 Identities = 71/194 (36%), Positives = 112/194 (57%)
 Frame = +1

Query: 133 GSDEERAMLGPEELAXXXXXXLITYIVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQ 312
           G D        +E+       L   I + IV Q+I+  V   TV+KVK P +++ +++IQ
Sbjct: 44  GEDHSTTFQFNDEIKRQKRMKLYKCIGILIVFQIIILTVFALTVMKVKTPKVRLGAINIQ 103

Query: 313 NFNYDPTTPSANMRLITEVRVKNKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRST 492
           + N  P TPS +    T++RVKN NWG +KF  ST   SY+G  VG+  IP  KA++RST
Sbjct: 104 SLNSVPATPSFDASFTTQIRVKNPNWGPFKFDASTATFSYQGVPVGQVVIPKSKARMRST 163

Query: 493 RRVNVTFDVTSEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFT 672
           +++ VT  V S+ L  +S+L S+L +G LT ++  K++GKV +M + KK K+ QM C   
Sbjct: 164 KKIGVTVSVNSKALPSSSDLGSELKNGVLTLSSQAKVSGKVEIMSVTKKRKTAQMYCAIV 223

Query: 673 INTPNQSAHDIICK 714
            +   ++   + C+
Sbjct: 224 FDLSTKAIQTLQCE 237


>gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]
          Length = 213

 Score =  142 bits (358), Expect = 2e-31
 Identities = 74/172 (43%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
 Frame = +1

Query: 208 IVVFIVLQMIVFAVLGTTVLKVKRPNIKITSVSIQNF---NYDPTTPSANMRLITEVRVK 378
           +   +VL  +V  V   TV+++K P ++I SV+I++    N D  +PS +M+  +E+ VK
Sbjct: 43  VTAIVVLLTVVILVFPQTVMRIKGPELRIRSVAIEDLTISNSDTNSPSLSMKFDSEIGVK 102

Query: 379 NKNWGEYKFRNSTMILSYRGETVGEAQIPAIKAKLRSTRRVNVTFDVTSEQLSGNSNLAS 558
           N N+GE+KF  S++   Y+G  VG+A +   KAK RST+++NVT +V +     NSNLA+
Sbjct: 103 NTNFGEFKFDESSITFVYKGTEVGDASVEKGKAKARSTKKMNVTAEVNA-----NSNLAN 157

Query: 559 DLNSGNLTFNAYTKLNGKVTLMLMMKKNKSGQMNCTFTINTPNQSAHDIICK 714
           D+ SG LT  + +KLNGKV LM ++KK K+ +MNCT TIN  N+   D  CK
Sbjct: 158 DVRSGFLTLTSQSKLNGKVHLMKVIKKKKTAEMNCTITINLENKVVQDFKCK 209


>ref|XP_004298841.1| PREDICTED: uncharacterized protein LOC101294558 [Fragaria vesca
           subsp. vesca]
          Length = 203

 Score =  142 bits (358), Expect = 2e-31
 Identities = 75/210 (35%), Positives = 113/210 (53%), Gaps = 1/210 (0%)
 Frame = +1

Query: 88  QKK*KTFSFIKEHNMGSDEERAML-GPEELAXXXXXXLITYIVVFIVLQMIVFAVLGTTV 264
           +K  + +S    +   +D+E +     EEL       L TYI +FIV Q++V  V G TV
Sbjct: 3   EKNQQAYSSANGYTRSTDQESSPFQSDEELKRQKRIKLFTYIGIFIVFQIVVMTVFGLTV 62

Query: 265 LKVKRPNIKITSVSIQNFNYDPTTPSANMRLITEVRVKNKNWGEYKFRNSTMILSYRGET 444
           +KVK P  +   ++++  N  P  PS +    T++R+KN NWG YKF   T    Y+G T
Sbjct: 63  MKVKTPKARWGEITVKTLNSVPAAPSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVT 122

Query: 445 VGEAQIPAIKAKLRSTRRVNVTFDVTSEQLSGNSNLASDLNSGNLTFNAYTKLNGKVTLM 624
           +G+  IP  KA +R T++++ +  + +  L+         +SG LT  +  KL GKVTLM
Sbjct: 123 IGKVDIPKSKAGMRGTKKIDASVSLNTAALN---------SSGELTLTSEAKLTGKVTLM 173

Query: 625 LMMKKNKSGQMNCTFTINTPNQSAHDIICK 714
            MMKK KS  MNCT  I+    +   ++CK
Sbjct: 174 GMMKKKKSASMNCTIQIDVSGPTVKSVVCK 203


Top