BLASTX nr result

ID: Akebia23_contig00046619 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00046619
         (770 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI22611.3| unnamed protein product [Vitis vinifera]              152   1e-34
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   151   2e-34
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   148   2e-33
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   146   9e-33
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   144   3e-32
ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prun...   140   5e-31
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   139   1e-30
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   136   9e-30
ref|XP_007219159.1| hypothetical protein PRUPE_ppa016330mg [Prun...   135   2e-29
ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294...   135   2e-29
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   134   3e-29
ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295...   134   3e-29
ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r...   133   6e-29
gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus...   130   7e-28
ref|XP_004300832.1| PREDICTED: uncharacterized protein LOC101296...   130   7e-28
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   130   7e-28
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     129   9e-28
gb|EYU40055.1| hypothetical protein MIMGU_mgv1a018374mg [Mimulus...   128   3e-27
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   127   3e-27
ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-r...   127   4e-27

>emb|CBI22611.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  152 bits (385), Expect = 1e-34
 Identities = 82/190 (43%), Positives = 120/190 (63%), Gaps = 1/190 (0%)
 Frame = +2

Query: 59  RVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNST 238
           + ++ S+E  R K  + I  +    +F+T+            + P  +  +V+IENLN T
Sbjct: 107 KTDVESEELRRMKCTRYIAYLSAFALFETIVIMVCVVTLMRIRSPKFRFRAVSIENLNYT 166

Query: 239 S-ATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHM 415
           S  T+ SFN+R   KV VKN N+G +KF N  +TL+Y G  VGDA+I + +A+ RST+ M
Sbjct: 167 SDTTSPSFNIRFNAKVAVKNTNFGHFKFKNSTITLAYRGDHVGDAKISKARARARSTKKM 226

Query: 416 DVTIDVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINI 595
           +VT+DVTS+ +S NSNLASD++ G LT     KLNG+V +M + KK KS QMNCTI IN+
Sbjct: 227 NVTVDVTSNNVSSNSNLASDINSGFLTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKINL 286

Query: 596 RSQTVQQIIC 625
            ++ +Q+  C
Sbjct: 287 ENKVIQEWKC 296


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  151 bits (382), Expect = 2e-34
 Identities = 77/183 (42%), Positives = 111/183 (60%), Gaps = 1/183 (0%)
 Frame = +2

Query: 80  ERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSATNT-S 256
           E  RKK  KL       +VFQT+            K P  ++ S+ +E++  TS  N  S
Sbjct: 18  ELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNPPS 77

Query: 257 FNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVT 436
           FNM+   +V VKN N+G +KF N  ++  YGG  VG+A + + +AK RST+ M+VT+D+ 
Sbjct: 78  FNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVDLN 137

Query: 437 SDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQTVQQ 616
           S+ +  NSNLASD+  G LT   + KL+G+V +M LIKK KS QMNCT+ +N+ S+ +Q 
Sbjct: 138 SNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAIQD 197

Query: 617 IIC 625
           I C
Sbjct: 198 IKC 200


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  148 bits (374), Expect = 2e-33
 Identities = 79/195 (40%), Positives = 116/195 (59%), Gaps = 2/195 (1%)
 Frame = +2

Query: 47  NDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIEN 226
           +DEE  +L S+E  RKK  K  + I    VFQT+            K P ++I  V +E 
Sbjct: 20  SDEESASLQSKELKRKKRIKYAVYIAAFAVFQTVVILIFALTVMRVKNPKVRIGKVTVET 79

Query: 227 LN-STSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRS 403
           +  S +    SFN+R IT+VTVKN N+G YKF N  M+  Y G  VG+A IP+ +A+ RS
Sbjct: 80  METSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGEAIIPKARARARS 139

Query: 404 TRHMDVTIDVTSDRL-SRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCT 580
           T+ +DVT++V S  L S  + L S+L    LT N+  KL G+V +M ++KK KS +MNCT
Sbjct: 140 TKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKVMKKKKSPEMNCT 199

Query: 581 IAINIRSQTVQQIIC 625
           +  N+ ++++Q + C
Sbjct: 200 LIFNVSTRSLQDLKC 214


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  146 bits (368), Expect = 9e-33
 Identities = 70/179 (39%), Positives = 115/179 (64%)
 Frame = +2

Query: 89  RKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSATNTSFNMR 268
           ++ N K +  + V +VFQT             K P ++  +V +EN ++ ++++  F+MR
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 269 LITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSDRL 448
           L+ +VTVKN N+G +K+ N  + + YGG  VG+A I + +A+ R T+  DVTID++S +L
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 449 SRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQTVQQIIC 625
           S NSNL +D+  G L  ++  KL+G+V +M +IKK KS +M+CT+ INI ++TVQ + C
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKC 184


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  144 bits (364), Expect = 3e-32
 Identities = 68/180 (37%), Positives = 117/180 (65%), Gaps = 1/180 (0%)
 Frame = +2

Query: 89  RKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLN-STSATNTSFNM 265
           RK+N K +  IV  ++ QT+            + P +++  V +ENLN ++S+++ SF+M
Sbjct: 11  RKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFSM 70

Query: 266 RLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSDR 445
            L  +VTVKN N+G +KF N  +T+SY G  VG+A I + +A+ RST  ++VT+ V+SD+
Sbjct: 71  NLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSDK 130

Query: 446 LSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQTVQQIIC 625
           +SRNS L+SD+  G +  +++ KL+G++ +  + KK KS +MNCT+ +   S+ +Q ++C
Sbjct: 131 MSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLMC 190


>ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prunus persica]
           gi|462406447|gb|EMJ11911.1| hypothetical protein
           PRUPE_ppa022983mg [Prunus persica]
          Length = 209

 Score =  140 bits (353), Expect = 5e-31
 Identities = 72/192 (37%), Positives = 113/192 (58%)
 Frame = +2

Query: 50  DEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENL 229
           DEE   L S+E  R+K  K+   IV+ IV Q +            K P  ++ ++ ++NL
Sbjct: 18  DEESTALQSEELKRQKRIKMYKYIVIFIVVQLIVLPVFGLTVMKVKTPKFRLGNIKVQNL 77

Query: 230 NSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTR 409
           +S  +T  SF     T++ VKN NWG YKF  G +T  Y G TVG   +P+ KAK+RST+
Sbjct: 78  SSVPST-PSFEASFATQIRVKNTNWGPYKFDAGTVTFMYKGVTVGQVVVPKSKAKMRSTK 136

Query: 410 HMDVTIDVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAI 589
            +DVT+ + S  L  +SNL ++L  G LT ++  KL G+V +M ++KK KS  M+CT+  
Sbjct: 137 KIDVTVSLNSYGLPSSSNLGTELKSGVLTLSSKGKLTGKVVLMLMMKKRKSATMDCTMTF 196

Query: 590 NIRSQTVQQIIC 625
           ++ ++T++ + C
Sbjct: 197 DLSTKTLKTLQC 208


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  139 bits (349), Expect = 1e-30
 Identities = 68/186 (36%), Positives = 110/186 (59%)
 Frame = +2

Query: 68  LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSAT 247
           L S++  R +N K    I+  +VFQT+            K P+ ++ SV +++LN  ++ 
Sbjct: 2   LESEKYKRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASG 61

Query: 248 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 427
              FNMRLI ++ VKNKN+G ++F N    +++G   VGD +I + +A+ R T+ M+VT+
Sbjct: 62  VPHFNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTV 121

Query: 428 DVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQT 607
           DV+S  +S    L + L  G LT     +L G+VT+M L+KK K+ +MNCT+ +N+ S  
Sbjct: 122 DVSSSAVSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHA 181

Query: 608 VQQIIC 625
           VQ + C
Sbjct: 182 VQDLDC 187


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  136 bits (342), Expect = 9e-30
 Identities = 72/186 (38%), Positives = 115/186 (61%), Gaps = 2/186 (1%)
 Frame = +2

Query: 74  SQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSATNT 253
           S+E  RKK  K +  +   ++FQT             K P  +I SV +++L   +++  
Sbjct: 13  SKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSS-P 71

Query: 254 SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPE--RKAKLRSTRHMDVTI 427
           SFNM+ I +VTVKN N+G YKF N  +T +Y G+ VG+A + +   +A+ RST+ M+VT+
Sbjct: 72  SFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTM 131

Query: 428 DVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQT 607
           D+ S+ ++ +S+L SDL+ G LT  +   LNG+V +M +IKK KS +MNCT+ +N+  + 
Sbjct: 132 DLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKL 191

Query: 608 VQQIIC 625
           V+ I C
Sbjct: 192 VRDIKC 197


>ref|XP_007219159.1| hypothetical protein PRUPE_ppa016330mg [Prunus persica]
           gi|462415621|gb|EMJ20358.1| hypothetical protein
           PRUPE_ppa016330mg [Prunus persica]
          Length = 189

 Score =  135 bits (340), Expect = 2e-29
 Identities = 68/186 (36%), Positives = 116/186 (62%), Gaps = 1/186 (0%)
 Frame = +2

Query: 71  GSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSATN 250
           G+++  RK+N +  + I   I+ QT+            K P +++ SVA+++L + S+ +
Sbjct: 4   GNEDSRRKRN-RCFLYIAAGIILQTIIIVLFVVFVMRIKTPKVRLDSVAVDSLTANSSPS 62

Query: 251 T-SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 427
           + SF +++   VTVKNKN+G YKF    +T SY G  VG+  I + KAK + T+ ++VT+
Sbjct: 63  SPSFKVQINALVTVKNKNFGHYKFEGSKVTFSYKGTAVGEGTIAKAKAKAKRTKKINVTV 122

Query: 428 DVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQT 607
            + S+++S +S L+SDL  GNLT  AY KL+G+V +  +IKK KS  +NCT+ ++ +++ 
Sbjct: 123 SLNSNKVSSHSQLSSDLSSGNLTLTAYAKLDGKVHLFKVIKKKKSANLNCTVHVDTKAKV 182

Query: 608 VQQIIC 625
           V  + C
Sbjct: 183 VHVLTC 188


>ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294764 [Fragaria vesca
           subsp. vesca]
          Length = 182

 Score =  135 bits (339), Expect = 2e-29
 Identities = 65/176 (36%), Positives = 104/176 (59%)
 Frame = +2

Query: 98  NFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSATNTSFNMRLIT 277
           N K +  + + IVFQ +            K P ++  +  + N NS S+T  SF+  L+T
Sbjct: 6   NKKCLAYVAIFIVFQIIVITIFALTVMKIKGPKVRFQTATVSNFNSDSSTAASFSGDLVT 65

Query: 278 KVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSDRLSRN 457
           K  VKN N+G +K+ N  +++ Y G  +G A +P +KAK RSTR  D+TI + S +LS  
Sbjct: 66  KFAVKNTNFGHFKYPNSTVSILYEGQVIGTAAVPSQKAKARSTRRTDITISIDSSKLSGT 125

Query: 458 SNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQTVQQIIC 625
           +NL + +  G +   +   L G+V VM +IKKNKSG+M+CT+ +N++++TV  + C
Sbjct: 126 TNLTTAIGAGVVPLTSESTLKGKVEVMKIIKKNKSGKMSCTMLLNLKTRTVDDLKC 181


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  134 bits (338), Expect = 3e-29
 Identities = 69/193 (35%), Positives = 115/193 (59%)
 Frame = +2

Query: 47  NDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIEN 226
           +DEE V   S+E  +KK  K ++ IV+  VFQT             + P  ++ S +   
Sbjct: 21  SDEESVAAHSKELKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRIRNPKFRVRSGSFTT 80

Query: 227 LNSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRST 406
            N  +  + SF++++ T+ TVKN N+G +K+  G++T +Y G  VG A I + +A+ RST
Sbjct: 81  FNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVGRATIQKARARARST 140

Query: 407 RHMDVTIDVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIA 586
           + +DV ++++S+ L   + L  D+  G LT  +  KL+G++ +M +IKK KS QMNCT+ 
Sbjct: 141 KKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKVIKKKKSTQMNCTMD 200

Query: 587 INIRSQTVQQIIC 625
           + I ++TV+ IIC
Sbjct: 201 VAIDTRTVRNIIC 213


>ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  134 bits (338), Expect = 3e-29
 Identities = 71/186 (38%), Positives = 105/186 (56%)
 Frame = +2

Query: 68  LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSAT 247
           +   E  RKK  KL   I + IVFQ +            K P  +  S+ +E LN   AT
Sbjct: 26  VSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMKVKTPKARWGSIDVETLNYVPAT 85

Query: 248 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 427
             SF+    T++ +KN NWG YKF  G  T  Y G T+G   IP+ KA +RST+ +DV +
Sbjct: 86  -PSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIGKVDIPKSKAGMRSTKKIDVEV 144

Query: 428 DVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQT 607
            + ++ L  +S L ++L  G LT  + V+L G+V +M ++KKNK+  M+CTIA ++ S+T
Sbjct: 145 SLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLIMKKNKNASMDCTIAFDLSSKT 204

Query: 608 VQQIIC 625
           VQ + C
Sbjct: 205 VQSLQC 210


>ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 226

 Score =  133 bits (335), Expect = 6e-29
 Identities = 61/174 (35%), Positives = 109/174 (62%)
 Frame = +2

Query: 83  RARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSATNTSFN 262
           R    N K +  +   +VFQT             + P ++  +V +E+ ++ ++++ SF+
Sbjct: 4   RREGSNAKCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFSTVNSSSPSFD 63

Query: 263 MRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSD 442
           M+L+ +V VKN N+G +K+ N  +T+ YGG  VG+A I + +A+ R T+  ++ +D++S 
Sbjct: 64  MKLMAQVAVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDISSS 123

Query: 443 RLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQ 604
           RLS NSNL +D++ G L  ++  KL G+V +M +IKK KSG+M+CT+ IN+ ++
Sbjct: 124 RLSSNSNLGNDINAGVLPLSSQAKLKGKVHLMKVIKKKKSGEMSCTMGINLATR 177


>gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus guttatus]
          Length = 192

 Score =  130 bits (326), Expect = 7e-28
 Identities = 64/190 (33%), Positives = 115/190 (60%), Gaps = 6/190 (3%)
 Frame = +2

Query: 74  SQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSATNT 253
           S+   +K + K +  + V +VFQ              K P I+++++A+E+ +S++  N 
Sbjct: 2   SKGDGKKSSKKCLAYVAVFVVFQAAVIMVLALTVMKIKSPKIRLNAIAVESFSSSNNGNN 61

Query: 254 -----SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMD 418
                S NM+L+T++T+KN N+G++K+ N  + + Y G  +G+A IP  + K R T   +
Sbjct: 62  AGPTPSINMKLLTQLTIKNTNFGQFKYDNATLAILYNGVPLGEAVIPRGRVKARKTLKFN 121

Query: 419 VTIDVTSDRLS-RNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINI 595
           V+ D+ SDRL+  N+NL +D++ G L  ++  ++NG+V +M +IKKNKSG MNC   +N+
Sbjct: 122 VSFDLNSDRLNGNNTNLGNDINSGVLRLSSQARVNGKVHLMKIIKKNKSGNMNCDWIVNL 181

Query: 596 RSQTVQQIIC 625
            ++ V+ + C
Sbjct: 182 ATRMVENLNC 191


>ref|XP_004300832.1| PREDICTED: uncharacterized protein LOC101296778 [Fragaria vesca
           subsp. vesca]
          Length = 237

 Score =  130 bits (326), Expect = 7e-28
 Identities = 70/194 (36%), Positives = 108/194 (55%)
 Frame = +2

Query: 44  GNDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIE 223
           G D       + E  R+K  KL   I +LIVFQ +            K P +++ ++ I+
Sbjct: 44  GEDHSTTFQFNDEIKRQKRMKLYKCIGILIVFQIIILTVFALTVMKVKTPKVRLGAINIQ 103

Query: 224 NLNSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRS 403
           +LNS  AT  SF+    T++ VKN NWG +KF     T SY G  VG   IP+ KA++RS
Sbjct: 104 SLNSVPAT-PSFDASFTTQIRVKNPNWGPFKFDASTATFSYQGVPVGQVVIPKSKARMRS 162

Query: 404 TRHMDVTIDVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTI 583
           T+ + VT+ V S  L  +S+L S+L  G LT ++  K++G+V +M + KK K+ QM C I
Sbjct: 163 TKKIGVTVSVNSKALPSSSDLGSELKNGVLTLSSQAKVSGKVEIMSVTKKRKTAQMYCAI 222

Query: 584 AINIRSQTVQQIIC 625
             ++ ++ +Q + C
Sbjct: 223 VFDLSTKAIQTLQC 236


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  130 bits (326), Expect = 7e-28
 Identities = 65/186 (34%), Positives = 106/186 (56%)
 Frame = +2

Query: 68  LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSAT 247
           +   E  R+K  +L   I + IVFQ +            K P +++  + +++ NS  AT
Sbjct: 26  VSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMKVKTPKVRLGEINVQDFNSVPAT 85

Query: 248 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 427
             SF+    T++ VKN NWG YKF    +T  Y G  VG   +P+ KA +RST+ M+V +
Sbjct: 86  -PSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVGQVTVPKGKAGMRSTKKMNVEV 144

Query: 428 DVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQT 607
            + ++ L  +SNL S+L+ G LT N+  KL+G+V +M ++KK KS  M+C I  ++ ++T
Sbjct: 145 SLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLIMKKKKSSTMDCMIGFDLSTKT 204

Query: 608 VQQIIC 625
           V+ + C
Sbjct: 205 VKSLQC 210


>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score =  129 bits (325), Expect = 9e-28
 Identities = 67/189 (35%), Positives = 106/189 (56%), Gaps = 1/189 (0%)
 Frame = +2

Query: 47  NDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISS-VAIE 223
           +DEE  NL ++E  R+K  KL +   +    Q +            K P +++S     +
Sbjct: 20  SDEESSNLDAKELKRRKRIKLAIYAFIFTASQIIVTLVFVLVVMRVKSPKLRLSDKFEFQ 79

Query: 224 NLNSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRS 403
            + + S +  SF++   T++ VKN NWG YKF N     +Y G TVG   IP+ KA +RS
Sbjct: 80  TIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTTAAFAYEGETVGQVVIPKGKAGMRS 139

Query: 404 TRHMDVTIDVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTI 583
           T+ + V++ ++S +L  N+NL S+L  G LT     K+ G+V +M ++KK KS  MNCTI
Sbjct: 140 TKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTAKMTGKVKLMLIMKKKKSANMNCTI 199

Query: 584 AINIRSQTV 610
            I+++ +TV
Sbjct: 200 NIHVKEKTV 208


>gb|EYU40055.1| hypothetical protein MIMGU_mgv1a018374mg [Mimulus guttatus]
          Length = 183

 Score =  128 bits (321), Expect = 3e-27
 Identities = 62/179 (34%), Positives = 107/179 (59%), Gaps = 5/179 (2%)
 Frame = +2

Query: 104 KLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSATNT----SFNMRL 271
           K +  + V ++FQ              K P I+ +++A+E+  S +  N     S NMRL
Sbjct: 4   KCLACVAVFVLFQAAVIMVLALTVLKIKSPKIRFNAIAVESFTSNNGNNAGPTPSINMRL 63

Query: 272 ITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSDRLS 451
           +T++T+KN N+G++K+ N  + + Y G  +G+A IP  + K R T   +V+ D+ SDRL+
Sbjct: 64  LTQLTIKNTNFGQFKYDNATLAILYNGVPLGEAVIPRGRVKARKTLKFNVSFDLNSDRLN 123

Query: 452 -RNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQTVQQIIC 625
             N+NL +D++ G L  ++  ++NG+V +M +IKKNKSG MNC   +N+ ++ V+ + C
Sbjct: 124 GNNTNLGNDINSGVLRLSSQARVNGKVHLMKIIKKNKSGNMNCDWIVNLATRMVENLNC 182


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  127 bits (320), Expect = 3e-27
 Identities = 65/186 (34%), Positives = 106/186 (56%)
 Frame = +2

Query: 68  LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLNSTSAT 247
           +   E  R+K  KL M I + IV Q +            K P +++  + +++LNS  AT
Sbjct: 26  VSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMKVKTPKVRLGGINVQSLNSVPAT 85

Query: 248 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 427
             SF+    T++ VKN NWG YKF     T  Y G  VG   IP+ KA++RST+ + V++
Sbjct: 86  -PSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVGQVSIPKSKARMRSTKKISVSV 144

Query: 428 DVTSDRLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRSQT 607
            + ++ L  +S + ++L+ G LT  +  KL G+V +M ++KK KS  M+CTIA ++ ++T
Sbjct: 145 ILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLIMKKKKSATMDCTIAFDLSTKT 204

Query: 608 VQQIIC 625
           V+ + C
Sbjct: 205 VKSLQC 210


>ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721845|gb|EOY13742.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 192

 Score =  127 bits (319), Expect = 4e-27
 Identities = 66/188 (35%), Positives = 117/188 (62%), Gaps = 3/188 (1%)
 Frame = +2

Query: 71  GSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXXKRPNIKISSVAIENLN-STSAT 247
           G Q    K+N K   ++V  ++ +T+            + P +++  V +ENL  S+S++
Sbjct: 4   GDQTSRGKRNIKCWAIVVAGVIAKTIIILLFVLIVMRIRNPKVRLGGVTVENLRASSSSS 63

Query: 248 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 427
           + SF+ +L  +V+VKN N+G +KF N  +T+SY G+ VG A I E  A+ RST+  +VTI
Sbjct: 64  SPSFSTKLNAQVSVKNTNFGHFKFKNSTLTISYNGSPVGKATIVEGLARARSTKKFNVTI 123

Query: 428 DVTS-DRLSRNSN-LASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINIRS 601
            V+S +++SRNS+ L+SD++ G +  +++ KL G++ +  + KK KS +MNCT+ +N   
Sbjct: 124 LVSSNNKISRNSDQLSSDIESGTINLSSHAKLEGKIHLFKIFKKKKSAEMNCTMDVNTSL 183

Query: 602 QTVQQIIC 625
           + +Q++ C
Sbjct: 184 KQIQKLTC 191


Top