BLASTX nr result

ID: Akebia27_contig00024544 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00024544
         (800 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI22611.3| unnamed protein product [Vitis vinifera]              150   7e-34
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   149   2e-33
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   146   8e-33
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   145   2e-32
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   142   1e-31
ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prun...   139   1e-30
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   137   6e-30
ref|XP_007219159.1| hypothetical protein PRUPE_ppa016330mg [Prun...   135   1e-29
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   135   2e-29
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   132   1e-28
ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295...   132   1e-28
ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294...   132   2e-28
ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r...   129   1e-27
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   129   2e-27
ref|XP_004300832.1| PREDICTED: uncharacterized protein LOC101296...   128   3e-27
ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-r...   127   4e-27
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     127   5e-27
gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus...   125   1e-26
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   125   1e-26
gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]     125   2e-26

>emb|CBI22611.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  150 bits (378), Expect = 7e-34
 Identities = 82/190 (43%), Positives = 120/190 (63%), Gaps = 1/190 (0%)
 Frame = -1

Query: 713 RVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNST 534
           + ++ S+E  R K  + I  +    +F+T+           ++ P  +  +V+IENLN T
Sbjct: 107 KTDVESEELRRMKCTRYIAYLSAFALFETIVIMVCVVTLMRIRSPKFRFRAVSIENLNYT 166

Query: 533 S-ATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHM 357
           S  T+ SFN+R   KV VKN N+G +KF N  +TL+Y G  VGDA+I + +A+ RST+ M
Sbjct: 167 SDTTSPSFNIRFNAKVAVKNTNFGHFKFKNSTITLAYRGDHVGDAKISKARARARSTKKM 226

Query: 356 DVTIDVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINT 177
           +VT+DVTS+ +S NSNLASD++ G LT     KLNG+V +M + KK KS QMNCTI IN 
Sbjct: 227 NVTVDVTSNNVSSNSNLASDINSGFLTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKINL 286

Query: 176 RSQTVQQIIC 147
            ++ +Q+  C
Sbjct: 287 ENKVIQEWKC 296


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  149 bits (375), Expect = 2e-33
 Identities = 77/183 (42%), Positives = 111/183 (60%), Gaps = 1/183 (0%)
 Frame = -1

Query: 692 ERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNT-S 516
           E  RKK  KL       +VFQT+           +K P  ++ S+ +E++  TS  N  S
Sbjct: 18  ELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNPPS 77

Query: 515 FNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVT 336
           FNM+   +V VKN N+G +KF N  ++  YGG  VG+A + + +AK RST+ M+VT+D+ 
Sbjct: 78  FNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVDLN 137

Query: 335 SDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQTVQQ 156
           S+ +  NSNLASD+  G LT   + KL+G+V +M LIKK KS QMNCT+ +N  S+ +Q 
Sbjct: 138 SNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAIQD 197

Query: 155 IIC 147
           I C
Sbjct: 198 IKC 200


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  146 bits (369), Expect = 8e-33
 Identities = 80/195 (41%), Positives = 116/195 (59%), Gaps = 2/195 (1%)
 Frame = -1

Query: 725 NDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIEN 546
           +DEE  +L S+E  RKK  K  + I    VFQT+           VK P ++I  V +E 
Sbjct: 20  SDEESASLQSKELKRKKRIKYAVYIAAFAVFQTVVILIFALTVMRVKNPKVRIGKVTVET 79

Query: 545 LN-STSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRS 369
           +  S +    SFN+R IT+VTVKN N+G YKF N  M+  Y G  VG+A IP+ +A+ RS
Sbjct: 80  METSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGEAIIPKARARARS 139

Query: 368 TRHMDVTIDVTSDWL-SRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCT 192
           T+ +DVT++V S  L S  + L S+L    LT N+  KL G+V +M ++KK KS +MNCT
Sbjct: 140 TKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKVMKKKKSPEMNCT 199

Query: 191 IAINTRSQTVQQIIC 147
           +  N  ++++Q + C
Sbjct: 200 LIFNVSTRSLQDLKC 214


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  145 bits (365), Expect = 2e-32
 Identities = 69/180 (38%), Positives = 118/180 (65%), Gaps = 1/180 (0%)
 Frame = -1

Query: 683 RKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLN-STSATNTSFNM 507
           RK+N K +  IV  ++ QT+           ++ P +++  V +ENLN ++S+++ SF+M
Sbjct: 11  RKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFSM 70

Query: 506 RLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSDW 327
            L  +VTVKN N+G +KF N  +T+SY G  VG+A I + +A+ RST  ++VT+ V+SD 
Sbjct: 71  NLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSDK 130

Query: 326 LSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQTVQQIIC 147
           +SRNS L+SD+  G +  +++ KL+G++ +  + KK KS +MNCT+ + T S+ +Q ++C
Sbjct: 131 MSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLMC 190


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  142 bits (358), Expect = 1e-31
 Identities = 69/179 (38%), Positives = 114/179 (63%)
 Frame = -1

Query: 683 RKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNTSFNMR 504
           ++ N K +  + V +VFQT            +K P ++  +V +EN ++ ++++  F+MR
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 503 LITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSDWL 324
           L+ +VTVKN N+G +K+ N  + + YGG  VG+A I + +A+ R T+  DVTID++S  L
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 323 SRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQTVQQIIC 147
           S NSNL +D+  G L  ++  KL+G+V +M +IKK KS +M+CT+ IN  ++TVQ + C
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKC 184


>ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prunus persica]
           gi|462406447|gb|EMJ11911.1| hypothetical protein
           PRUPE_ppa022983mg [Prunus persica]
          Length = 209

 Score =  139 bits (350), Expect = 1e-30
 Identities = 73/192 (38%), Positives = 113/192 (58%)
 Frame = -1

Query: 722 DEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENL 543
           DEE   L S+E  R+K  K+   IV+ IV Q +           VK P  ++ ++ ++NL
Sbjct: 18  DEESTALQSEELKRQKRIKMYKYIVIFIVVQLIVLPVFGLTVMKVKTPKFRLGNIKVQNL 77

Query: 542 NSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTR 363
           +S  +T  SF     T++ VKN NWG YKF  G +T  Y G TVG   +P+ KAK+RST+
Sbjct: 78  SSVPST-PSFEASFATQIRVKNTNWGPYKFDAGTVTFMYKGVTVGQVVVPKSKAKMRSTK 136

Query: 362 HMDVTIDVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAI 183
            +DVT+ + S  L  +SNL ++L  G LT ++  KL G+V +M ++KK KS  M+CT+  
Sbjct: 137 KIDVTVSLNSYGLPSSSNLGTELKSGVLTLSSKGKLTGKVVLMLMMKKRKSATMDCTMTF 196

Query: 182 NTRSQTVQQIIC 147
           +  ++T++ + C
Sbjct: 197 DLSTKTLKTLQC 208


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  137 bits (344), Expect = 6e-30
 Identities = 68/186 (36%), Positives = 110/186 (59%)
 Frame = -1

Query: 704 LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSAT 525
           L S++  R +N K    I+  +VFQT+           +K P+ ++ SV +++LN  ++ 
Sbjct: 2   LESEKYKRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASG 61

Query: 524 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345
              FNMRLI ++ VKNKN+G ++F N    +++G   VGD +I + +A+ R T+ M+VT+
Sbjct: 62  VPHFNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTV 121

Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165
           DV+S  +S    L + L  G LT     +L G+VT+M L+KK K+ +MNCT+ +N  S  
Sbjct: 122 DVSSSAVSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHA 181

Query: 164 VQQIIC 147
           VQ + C
Sbjct: 182 VQDLDC 187


>ref|XP_007219159.1| hypothetical protein PRUPE_ppa016330mg [Prunus persica]
           gi|462415621|gb|EMJ20358.1| hypothetical protein
           PRUPE_ppa016330mg [Prunus persica]
          Length = 189

 Score =  135 bits (341), Expect = 1e-29
 Identities = 69/186 (37%), Positives = 117/186 (62%), Gaps = 1/186 (0%)
 Frame = -1

Query: 701 GSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATN 522
           G+++  RK+N +  + I   I+ QT+           +K P +++ SVA+++L + S+ +
Sbjct: 4   GNEDSRRKRN-RCFLYIAAGIILQTIIIVLFVVFVMRIKTPKVRLDSVAVDSLTANSSPS 62

Query: 521 T-SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345
           + SF +++   VTVKNKN+G YKF    +T SY G  VG+  I + KAK + T+ ++VT+
Sbjct: 63  SPSFKVQINALVTVKNKNFGHYKFEGSKVTFSYKGTAVGEGTIAKAKAKAKRTKKINVTV 122

Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165
            + S+ +S +S L+SDL  GNLT  AY KL+G+V +  +IKK KS  +NCT+ ++T+++ 
Sbjct: 123 SLNSNKVSSHSQLSSDLSSGNLTLTAYAKLDGKVHLFKVIKKKKSANLNCTVHVDTKAKV 182

Query: 164 VQQIIC 147
           V  + C
Sbjct: 183 VHVLTC 188


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  135 bits (339), Expect = 2e-29
 Identities = 72/186 (38%), Positives = 115/186 (61%), Gaps = 2/186 (1%)
 Frame = -1

Query: 698 SQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNT 519
           S+E  RKK  K +  +   ++FQT            +K P  +I SV +++L   +++  
Sbjct: 13  SKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSS-P 71

Query: 518 SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPE--RKAKLRSTRHMDVTI 345
           SFNM+ I +VTVKN N+G YKF N  +T +Y G+ VG+A + +   +A+ RST+ M+VT+
Sbjct: 72  SFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTM 131

Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165
           D+ S+ ++ +S+L SDL+ G LT  +   LNG+V +M +IKK KS +MNCT+ +N   + 
Sbjct: 132 DLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKL 191

Query: 164 VQQIIC 147
           V+ I C
Sbjct: 192 VRDIKC 197


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  132 bits (333), Expect = 1e-28
 Identities = 68/193 (35%), Positives = 115/193 (59%)
 Frame = -1

Query: 725 NDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIEN 546
           +DEE V   S+E  +KK  K ++ IV+  VFQT            ++ P  ++ S +   
Sbjct: 21  SDEESVAAHSKELKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRIRNPKFRVRSGSFTT 80

Query: 545 LNSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRST 366
            N  +  + SF++++ T+ TVKN N+G +K+  G++T +Y G  VG A I + +A+ RST
Sbjct: 81  FNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVGRATIQKARARARST 140

Query: 365 RHMDVTIDVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIA 186
           + +DV ++++S+ L   + L  D+  G LT  +  KL+G++ +M +IKK KS QMNCT+ 
Sbjct: 141 KKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKVIKKKKSTQMNCTMD 200

Query: 185 INTRSQTVQQIIC 147
           +   ++TV+ IIC
Sbjct: 201 VAIDTRTVRNIIC 213


>ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  132 bits (333), Expect = 1e-28
 Identities = 72/186 (38%), Positives = 105/186 (56%)
 Frame = -1

Query: 704 LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSAT 525
           +   E  RKK  KL   I + IVFQ +           VK P  +  S+ +E LN   AT
Sbjct: 26  VSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMKVKTPKARWGSIDVETLNYVPAT 85

Query: 524 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345
             SF+    T++ +KN NWG YKF  G  T  Y G T+G   IP+ KA +RST+ +DV +
Sbjct: 86  -PSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIGKVDIPKSKAGMRSTKKIDVEV 144

Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165
            + ++ L  +S L ++L  G LT  + V+L G+V +M ++KKNK+  M+CTIA +  S+T
Sbjct: 145 SLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLIMKKNKNASMDCTIAFDLSSKT 204

Query: 164 VQQIIC 147
           VQ + C
Sbjct: 205 VQSLQC 210


>ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294764 [Fragaria vesca
           subsp. vesca]
          Length = 182

 Score =  132 bits (331), Expect = 2e-28
 Identities = 65/176 (36%), Positives = 103/176 (58%)
 Frame = -1

Query: 674 NFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNTSFNMRLIT 495
           N K +  + + IVFQ +           +K P ++  +  + N NS S+T  SF+  L+T
Sbjct: 6   NKKCLAYVAIFIVFQIIVITIFALTVMKIKGPKVRFQTATVSNFNSDSSTAASFSGDLVT 65

Query: 494 KVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSDWLSRN 315
           K  VKN N+G +K+ N  +++ Y G  +G A +P +KAK RSTR  D+TI + S  LS  
Sbjct: 66  KFAVKNTNFGHFKYPNSTVSILYEGQVIGTAAVPSQKAKARSTRRTDITISIDSSKLSGT 125

Query: 314 SNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQTVQQIIC 147
           +NL + +  G +   +   L G+V VM +IKKNKSG+M+CT+ +N +++TV  + C
Sbjct: 126 TNLTTAIGAGVVPLTSESTLKGKVEVMKIIKKNKSGKMSCTMLLNLKTRTVDDLKC 181


>ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 226

 Score =  129 bits (324), Expect = 1e-27
 Identities = 60/170 (35%), Positives = 106/170 (62%)
 Frame = -1

Query: 689 RARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNTSFN 510
           R    N K +  +   +VFQT            ++ P ++  +V +E+ ++ ++++ SF+
Sbjct: 4   RREGSNAKCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFSTVNSSSPSFD 63

Query: 509 MRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSD 330
           M+L+ +V VKN N+G +K+ N  +T+ YGG  VG+A I + +A+ R T+  ++ +D++S 
Sbjct: 64  MKLMAQVAVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDISSS 123

Query: 329 WLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAIN 180
            LS NSNL +D++ G L  ++  KL G+V +M +IKK KSG+M+CT+ IN
Sbjct: 124 RLSSNSNLGNDINAGVLPLSSQAKLKGKVHLMKVIKKKKSGEMSCTMGIN 173


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  129 bits (323), Expect = 2e-27
 Identities = 66/186 (35%), Positives = 106/186 (56%)
 Frame = -1

Query: 704 LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSAT 525
           +   E  R+K  +L   I + IVFQ +           VK P +++  + +++ NS  AT
Sbjct: 26  VSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMKVKTPKVRLGEINVQDFNSVPAT 85

Query: 524 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345
             SF+    T++ VKN NWG YKF    +T  Y G  VG   +P+ KA +RST+ M+V +
Sbjct: 86  -PSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVGQVTVPKGKAGMRSTKKMNVEV 144

Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165
            + ++ L  +SNL S+L+ G LT N+  KL+G+V +M ++KK KS  M+C I  +  ++T
Sbjct: 145 SLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLIMKKKKSSTMDCMIGFDLSTKT 204

Query: 164 VQQIIC 147
           V+ + C
Sbjct: 205 VKSLQC 210


>ref|XP_004300832.1| PREDICTED: uncharacterized protein LOC101296778 [Fragaria vesca
           subsp. vesca]
          Length = 237

 Score =  128 bits (321), Expect = 3e-27
 Identities = 71/194 (36%), Positives = 108/194 (55%)
 Frame = -1

Query: 728 GNDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIE 549
           G D       + E  R+K  KL   I +LIVFQ +           VK P +++ ++ I+
Sbjct: 44  GEDHSTTFQFNDEIKRQKRMKLYKCIGILIVFQIIILTVFALTVMKVKTPKVRLGAINIQ 103

Query: 548 NLNSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRS 369
           +LNS  AT  SF+    T++ VKN NWG +KF     T SY G  VG   IP+ KA++RS
Sbjct: 104 SLNSVPAT-PSFDASFTTQIRVKNPNWGPFKFDASTATFSYQGVPVGQVVIPKSKARMRS 162

Query: 368 TRHMDVTIDVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTI 189
           T+ + VT+ V S  L  +S+L S+L  G LT ++  K++G+V +M + KK K+ QM C I
Sbjct: 163 TKKIGVTVSVNSKALPSSSDLGSELKNGVLTLSSQAKVSGKVEIMSVTKKRKTAQMYCAI 222

Query: 188 AINTRSQTVQQIIC 147
             +  ++ +Q + C
Sbjct: 223 VFDLSTKAIQTLQC 236


>ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721845|gb|EOY13742.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 192

 Score =  127 bits (320), Expect = 4e-27
 Identities = 67/188 (35%), Positives = 118/188 (62%), Gaps = 3/188 (1%)
 Frame = -1

Query: 701 GSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLN-STSAT 525
           G Q    K+N K   ++V  ++ +T+           ++ P +++  V +ENL  S+S++
Sbjct: 4   GDQTSRGKRNIKCWAIVVAGVIAKTIIILLFVLIVMRIRNPKVRLGGVTVENLRASSSSS 63

Query: 524 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345
           + SF+ +L  +V+VKN N+G +KF N  +T+SY G+ VG A I E  A+ RST+  +VTI
Sbjct: 64  SPSFSTKLNAQVSVKNTNFGHFKFKNSTLTISYNGSPVGKATIVEGLARARSTKKFNVTI 123

Query: 344 DVTS-DWLSRNSN-LASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRS 171
            V+S + +SRNS+ L+SD++ G +  +++ KL G++ +  + KK KS +MNCT+ +NT  
Sbjct: 124 LVSSNNKISRNSDQLSSDIESGTINLSSHAKLEGKIHLFKIFKKKKSAEMNCTMDVNTSL 183

Query: 170 QTVQQIIC 147
           + +Q++ C
Sbjct: 184 KQIQKLTC 191


>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score =  127 bits (319), Expect = 5e-27
 Identities = 68/189 (35%), Positives = 105/189 (55%), Gaps = 1/189 (0%)
 Frame = -1

Query: 725 NDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISS-VAIE 549
           +DEE  NL ++E  R+K  KL +   +    Q +           VK P +++S     +
Sbjct: 20  SDEESSNLDAKELKRRKRIKLAIYAFIFTASQIIVTLVFVLVVMRVKSPKLRLSDKFEFQ 79

Query: 548 NLNSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRS 369
            + + S +  SF++   T++ VKN NWG YKF N     +Y G TVG   IP+ KA +RS
Sbjct: 80  TIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTTAAFAYEGETVGQVVIPKGKAGMRS 139

Query: 368 TRHMDVTIDVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTI 189
           T+ + V++ ++S  L  N+NL S+L  G LT     K+ G+V +M ++KK KS  MNCTI
Sbjct: 140 TKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTAKMTGKVKLMLIMKKKKSANMNCTI 199

Query: 188 AINTRSQTV 162
            I+ + +TV
Sbjct: 200 NIHVKEKTV 208


>gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus guttatus]
          Length = 192

 Score =  125 bits (315), Expect = 1e-26
 Identities = 63/190 (33%), Positives = 114/190 (60%), Gaps = 6/190 (3%)
 Frame = -1

Query: 698 SQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNT 519
           S+   +K + K +  + V +VFQ             +K P I+++++A+E+ +S++  N 
Sbjct: 2   SKGDGKKSSKKCLAYVAVFVVFQAAVIMVLALTVMKIKSPKIRLNAIAVESFSSSNNGNN 61

Query: 518 -----SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMD 354
                S NM+L+T++T+KN N+G++K+ N  + + Y G  +G+A IP  + K R T   +
Sbjct: 62  AGPTPSINMKLLTQLTIKNTNFGQFKYDNATLAILYNGVPLGEAVIPRGRVKARKTLKFN 121

Query: 353 VTIDVTSDWLS-RNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINT 177
           V+ D+ SD L+  N+NL +D++ G L  ++  ++NG+V +M +IKKNKSG MNC   +N 
Sbjct: 122 VSFDLNSDRLNGNNTNLGNDINSGVLRLSSQARVNGKVHLMKIIKKNKSGNMNCDWIVNL 181

Query: 176 RSQTVQQIIC 147
            ++ V+ + C
Sbjct: 182 ATRMVENLNC 191


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  125 bits (315), Expect = 1e-26
 Identities = 66/186 (35%), Positives = 106/186 (56%)
 Frame = -1

Query: 704 LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSAT 525
           +   E  R+K  KL M I + IV Q +           VK P +++  + +++LNS  AT
Sbjct: 26  VSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMKVKTPKVRLGGINVQSLNSVPAT 85

Query: 524 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345
             SF+    T++ VKN NWG YKF     T  Y G  VG   IP+ KA++RST+ + V++
Sbjct: 86  -PSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVGQVSIPKSKARMRSTKKISVSV 144

Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165
            + ++ L  +S + ++L+ G LT  +  KL G+V +M ++KK KS  M+CTIA +  ++T
Sbjct: 145 ILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLIMKKKKSATMDCTIAFDLSTKT 204

Query: 164 VQQIIC 147
           V+ + C
Sbjct: 205 VKSLQC 210


>gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]
          Length = 213

 Score =  125 bits (314), Expect = 2e-26
 Identities = 74/186 (39%), Positives = 111/186 (59%), Gaps = 2/186 (1%)
 Frame = -1

Query: 698 SQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLN-STSATN 522
           ++E   KK  + +  +  ++V  T+           +K P ++I SVAIE+L  S S TN
Sbjct: 28  AKELQHKKRMRRLGGVTAIVVLLTVVILVFPQTVMRIKGPELRIRSVAIEDLTISNSDTN 87

Query: 521 T-SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345
           + S +M+  +++ VKN N+GE+KF    +T  Y G  VGDA + + KAK RST+ M+VT 
Sbjct: 88  SPSLSMKFDSEIGVKNTNFGEFKFDESSITFVYKGTEVGDASVEKGKAKARSTKKMNVTA 147

Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165
           +V +     NSNLA+D+  G LT  +  KLNG+V +M +IKK K+ +MNCTI IN  ++ 
Sbjct: 148 EVNA-----NSNLANDVRSGFLTLTSQSKLNGKVHLMKVIKKKKTAEMNCTITINLENKV 202

Query: 164 VQQIIC 147
           VQ   C
Sbjct: 203 VQDFKC 208

