BLASTX nr result

ID: Paeonia25_contig00008665 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00008665
         (978 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   197   7e-48
gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg [Mimulus...   176   2e-41
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   176   2e-41
ref|XP_002513626.1| conserved hypothetical protein [Ricinus comm...   175   3e-41
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...   169   2e-39
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   167   5e-39
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   167   5e-39
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     167   8e-39
gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus...   166   1e-38
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   164   5e-38
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   162   2e-37
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...   161   4e-37
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   156   1e-35
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   156   1e-35
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   155   2e-35
ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prun...   155   2e-35
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   152   2e-34
gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus...   151   3e-34
gb|EXC05941.1| hypothetical protein L484_014209 [Morus notabilis]     150   6e-34
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   150   8e-34

>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  197 bits (500), Expect = 7e-48
 Identities = 107/217 (49%), Positives = 140/217 (64%), Gaps = 1/217 (0%)
 Frame = -2

Query: 845 MADKDQAARPFAQA-NGHVRSDQESGGGLTKEQKKKRNMKRLAYFAXXXXXXXXXXXXFS 669
           M  K Q+  P   A NGH RSD+ES    +KE KKK+ MK L Y              F+
Sbjct: 1   MEAKSQSPYPLVPAANGHERSDEESVAAHSKELKKKKRMKCLLYIVLFAVFQTGIILLFA 60

Query: 668 LTVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVNFA 489
           LTVM+++ PKFRV S +   T  VG   +PSF +++N QF VKNTNFG +K+E   V FA
Sbjct: 61  LTVMRIRNPKFRVRSGSFT-TFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFA 119

Query: 488 YRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHSTL 309
           YRG PVG+  I K++A+  STKK+  +V  +LSS  L +  E   DIS G+L L+S S L
Sbjct: 120 YRGTPVGRATIQKARARARSTKKVDVVV--ELSSNGLPNTNELGRDISAGVLTLTSSSKL 177

Query: 308 NGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
           +GK+HLM ++KKKKSTQMNCTMDV I+TR ++N+ CK
Sbjct: 178 DGKIHLMKVIKKKKSTQMNCTMDVAIDTRTVRNIICK 214


>gb|EYU25166.1| hypothetical protein MIMGU_mgv1a014071mg [Mimulus guttatus]
          Length = 202

 Score =  176 bits (445), Expect = 2e-41
 Identities = 94/205 (45%), Positives = 132/205 (64%)
 Frame = -2

Query: 812 AQANGHVRSDQESGGGLTKEQKKKRNMKRLAYFAXXXXXXXXXXXXFSLTVMKVKRPKFR 633
           A ANGH RSD E+GG  T+ +KK R  K   Y A            FSLTVMK++ PKFR
Sbjct: 2   APANGHGRSDAEAGGAATEPRKKNRT-KCFLYIALFVIFQIGVITIFSLTVMKIRTPKFR 60

Query: 632 VGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVNFAYRGIPVGQVLIV 453
           + S+ L      G   +PSF   +NA+F VKN NFG+YK+ NT+V+F YRG PVGQVL+ 
Sbjct: 61  IRSAHLTN-FNAGTPASPSFSATVNAEFTVKNANFGRYKYRNTTVDFFYRGTPVGQVLVR 119

Query: 452 KSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHSTLNGKVHLMLIMKK 273
            S+A + STKK    VA++LS T+  +  +  SD++ G++ +SS + + G+V L+ +MKK
Sbjct: 120 DSRAGWRSTKKFN--VAVNLSLTNAQANPQLASDLNAGVVQISSQARMRGRVELIFVMKK 177

Query: 272 KKSTQMNCTMDVVINTRAIQNLSCK 198
            KST MNCTM++V  T+ ++N+ CK
Sbjct: 178 NKSTDMNCTMEIVTATQQLRNILCK 202


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  176 bits (445), Expect = 2e-41
 Identities = 97/217 (44%), Positives = 136/217 (62%), Gaps = 1/217 (0%)
 Frame = -2

Query: 845 MADKDQAARPFAQANGHVRSDQESGGGLTKEQKKKRNMKRLAYFAXXXXXXXXXXXXFSL 666
           MA+KDQ   P A ANGH RSD+ES    +KE K+K+ +K   Y A            F+L
Sbjct: 1   MAEKDQQVHPLAPANGHPRSDEESASLQSKELKRKKRIKYAVYIAAFAVFQTVVILIFAL 60

Query: 665 TVMKVKRPKFRVGSSTLVQTLEVGNA-TNPSFKMELNAQFRVKNTNFGQYKFENTSVNFA 489
           TVM+VK PK R+G  T V+T+E  N     SF +    Q  VKNTNFG YKF+N +++F 
Sbjct: 61  TVMRVKNPKVRIGKVT-VETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFL 119

Query: 488 YRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHSTL 309
           Y G+ VG+ +I K++A+  STKK+   V ++ SS   S+ T   S++S  +L L+S + L
Sbjct: 120 YDGVMVGEAIIPKARARARSTKKLDVTVEVN-SSALTSTTTGLGSELSSSVLTLNSQAKL 178

Query: 308 NGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
            GKV LM +MKKKKS +MNCT+   ++TR++Q+L CK
Sbjct: 179 KGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215


>ref|XP_002513626.1| conserved hypothetical protein [Ricinus communis]
           gi|223547534|gb|EEF49029.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 217

 Score =  175 bits (443), Expect = 3e-41
 Identities = 101/220 (45%), Positives = 138/220 (62%), Gaps = 4/220 (1%)
 Frame = -2

Query: 845 MADKDQAARPFAQANGHVRSDQESGGG---LTKEQKKKRNMKRLAYFAXXXXXXXXXXXX 675
           MA+K+QA  P   A+G  RSD+ESG      TKE +KK+ MK +A+              
Sbjct: 1   MAEKEQAPTPLV-ADGQTRSDEESGTAGTAQTKELRKKKRMKCIAFVVAFTIFQTGIILL 59

Query: 674 FSLTVMKVKRPKFRVGSSTLVQTLEVG-NATNPSFKMELNAQFRVKNTNFGQYKFENTSV 498
           F  TV++ K PKFRV S++   T  VG +A  PSF + +N QF VKNTNFG +K+E ++V
Sbjct: 60  FVFTVLRFKDPKFRVRSASFDDTFHVGTDAAAPSFNLTMNTQFGVKNTNFGHFKYETSTV 119

Query: 497 NFAYRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSH 318
            F YRG  VG V + K++A+  ST+K  A+V   L +  L    E  SDIS G +PLSS 
Sbjct: 120 TFEYRGTVVGLVNVDKARARARSTRKFDAIVV--LRTDRLPDGFELSSDISSGKIPLSSS 177

Query: 317 STLNGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
           S L+G++HLM ++KKKKS +MNCTM+V I TR +Q++ CK
Sbjct: 178 SRLDGEIHLMKVIKKKKSAEMNCTMNVDIQTRTLQDIVCK 217


>gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus]
          Length = 213

 Score =  169 bits (428), Expect = 2e-39
 Identities = 92/218 (42%), Positives = 134/218 (61%), Gaps = 2/218 (0%)
 Frame = -2

Query: 845 MADKDQAARPFAQANGHVRSDQESGGGL--TKEQKKKRNMKRLAYFAXXXXXXXXXXXXF 672
           MA+K+    P+A  NGH RSD E+G      +EQ+KK+  K   Y A            F
Sbjct: 1   MAEKEHQPLPYA--NGHGRSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIF 58

Query: 671 SLTVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVNF 492
           S+TVMK++ PKFR+ S+ L  T   G   +PSF   +NA+F VKN NFG+YK+ NT+V F
Sbjct: 59  SVTVMKIRTPKFRIRSAHLT-TFHAGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGF 117

Query: 491 AYRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHST 312
            Y+G PVGQV +  S+A + STKK +  V +DL+  +     +  SD++ G++ ++S + 
Sbjct: 118 FYKGTPVGQVFVRDSRAGWRSTKKFR--VVVDLNLANAQGNPQLASDLNAGVVQITSQAR 175

Query: 311 LNGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
           + G+V L+ +MKK KST MNC M++V  T+ I+NL CK
Sbjct: 176 MAGRVELIFVMKKNKSTDMNCNMEIVTATQQIRNLVCK 213


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  167 bits (424), Expect = 5e-39
 Identities = 88/187 (47%), Positives = 125/187 (66%)
 Frame = -2

Query: 758 KEQKKKRNMKRLAYFAXXXXXXXXXXXXFSLTVMKVKRPKFRVGSSTLVQTLEVGNATNP 579
           K + K+ N K LAY A            F+LTVM++K PK R G+ T V+    GN+++P
Sbjct: 2   KGEGKRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVT-VENFSTGNSSSP 60

Query: 578 SFKMELNAQFRVKNTNFGQYKFENTSVNFAYRGIPVGQVLIVKSKAKFLSTKKIKAMVAM 399
            F M L AQ  VKNTNFG +K+EN+S+   Y G+PVG+  IVK++A+   TKK    V +
Sbjct: 61  FFDMRLMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFD--VTI 118

Query: 398 DLSSTDLSSATEWESDISKGILPLSSHSTLNGKVHLMLIMKKKKSTQMNCTMDVVINTRA 219
           D+SS+ LS+ +   +DI+ G+LPLSS + L+GKVHLM ++KKKKS++M+CTM + I TR 
Sbjct: 119 DISSSKLSTNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRT 178

Query: 218 IQNLSCK 198
           +Q+L CK
Sbjct: 179 VQDLKCK 185


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  167 bits (424), Expect = 5e-39
 Identities = 87/186 (46%), Positives = 121/186 (65%)
 Frame = -2

Query: 755 EQKKKRNMKRLAYFAXXXXXXXXXXXXFSLTVMKVKRPKFRVGSSTLVQTLEVGNATNPS 576
           E K+K+ MK  AY A            FSLTVM++K PKFRV S T+           PS
Sbjct: 18  ELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNPPS 77

Query: 575 FKMELNAQFRVKNTNFGQYKFENTSVNFAYRGIPVGQVLIVKSKAKFLSTKKIKAMVAMD 396
           F M+ NA+  VKNTNFG +KF+NT+++F Y G+ VG+  + K +AK  STKK+   V +D
Sbjct: 78  FNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMN--VTVD 135

Query: 395 LSSTDLSSATEWESDISKGILPLSSHSTLNGKVHLMLIMKKKKSTQMNCTMDVVINTRAI 216
           L+S ++ + +   SDIS G L L++H+ L+GKVHLM ++KKKKS QMNCTM V + +RAI
Sbjct: 136 LNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAI 195

Query: 215 QNLSCK 198
           Q++ C+
Sbjct: 196 QDIKCQ 201


>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score =  167 bits (422), Expect = 8e-39
 Identities = 95/215 (44%), Positives = 127/215 (59%)
 Frame = -2

Query: 845 MADKDQAARPFAQANGHVRSDQESGGGLTKEQKKKRNMKRLAYFAXXXXXXXXXXXXFSL 666
           MA++ Q   P A ANGH RSD+ES     KE K+++ +K   Y              F L
Sbjct: 1   MAERYQQVYPLAPANGHPRSDEESSNLDAKELKRRKRIKLAIYAFIFTASQIIVTLVFVL 60

Query: 665 TVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVNFAY 486
            VM+VK PK R+      QT+E  + + PSF +    Q RVKNTN+G YKF+NT+  FAY
Sbjct: 61  VVMRVKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTTAAFAY 120

Query: 485 RGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHSTLN 306
            G  VGQV+I K KA   STKK+   V++ LSS+ L + T   S++S GIL L   + + 
Sbjct: 121 EGETVGQVVIPKGKAGMRSTKKVP--VSVSLSSSQLKNNTNLGSELSGGILTLRCTAKMT 178

Query: 305 GKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSC 201
           GKV LMLIMKKKKS  MNCT+++ +  + + NL C
Sbjct: 179 GKVKLMLIMKKKKSANMNCTINIHVKEKTV-NLKC 212


>gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus]
          Length = 214

 Score =  166 bits (420), Expect = 1e-38
 Identities = 87/217 (40%), Positives = 136/217 (62%), Gaps = 1/217 (0%)
 Frame = -2

Query: 845 MADKDQA-ARPFAQANGHVRSDQESGGGLTKEQKKKRNMKRLAYFAXXXXXXXXXXXXFS 669
           M +K+Q  + P A AN H RSD E+GG    E  K++  + L Y              FS
Sbjct: 1   MGEKEQQLSYPMAPANDHGRSDTEAGGAAASELHKRKRTQCLIYIGLLAIIQIAVVIVFS 60

Query: 668 LTVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVNFA 489
           LTVMK++ P+FR+ S+ L      G   +P+F  +LNA+F VKN NFG+YK+ +T+V+F 
Sbjct: 61  LTVMKIRNPRFRIRSAHLTN-FNAGTPASPAFTGKLNAEFSVKNANFGRYKYMDTTVDFV 119

Query: 488 YRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHSTL 309
           YRG  VG+V + +S+A + +TKK    VA+DLS  +  +  +  SD++ G++P+SS + +
Sbjct: 120 YRGTRVGEVFVRESRAGWRTTKKFN--VAVDLSLANARANPQLASDLNAGVVPISSEARM 177

Query: 308 NGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
           +G V L+ ++KK +ST +NCTM++V  T+ I+N+ CK
Sbjct: 178 SGSVELLFVLKKNRSTGLNCTMEIVTATQQIRNILCK 214


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score =  164 bits (415), Expect = 5e-38
 Identities = 91/224 (40%), Positives = 141/224 (62%), Gaps = 8/224 (3%)
 Frame = -2

Query: 845 MADKDQAARP-----FAQANGHVRSDQESGGGL---TKEQKKKRNMKRLAYFAXXXXXXX 690
           MA+  +AA          A  ++RSDQE+        +E + K+ M+ L Y +       
Sbjct: 1   MAENKEAAATSPYPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQV 60

Query: 689 XXXXXFSLTVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFE 510
                F+LTVMK+K PKFRV ++++    EVG+A+NPSF +E++  F VKNTNFG +++E
Sbjct: 61  VVITVFALTVMKIKSPKFRVRTASITG-FEVGSASNPSFNLEMDVHFGVKNTNFGHFEYE 119

Query: 509 NTSVNFAYRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILP 330
           +  V F YR + +GQ  + + + +  ST+K+    ++DL+S  L + +   SDIS GI+P
Sbjct: 120 DGIVVFTYRDVRIGQTNVEEERVRARSTRKVDVS-SVDLTSRGLPANSRLGSDISTGIIP 178

Query: 329 LSSHSTLNGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
           ++  S L+GK+HLM I+KKKKS QMNCTM+VV+ T+++QN+ CK
Sbjct: 179 ITISSKLDGKIHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVCK 222


>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  162 bits (410), Expect = 2e-37
 Identities = 92/216 (42%), Positives = 129/216 (59%)
 Frame = -2

Query: 845 MADKDQAARPFAQANGHVRSDQESGGGLTKEQKKKRNMKRLAYFAXXXXXXXXXXXXFSL 666
           MA++   + P A +NG+ RSD ES      E K+K+ +K  AY              F L
Sbjct: 1   MAERTHQSYPLAPSNGYTRSDGESLS--EDELKRKKRIKCFAYIGIFIVFQIAVMTVFGL 58

Query: 665 TVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVNFAY 486
           T+MKVK PK R+G+STL       + T PSF    N Q RVKNTN+G YKF+   V F Y
Sbjct: 59  TIMKVKTPKVRLGTSTLTDF--TSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMY 116

Query: 485 RGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHSTLN 306
           +G+PVG V++ K KA    TKKI   V ++ ++   SS+T   +++S G+L L+S + L 
Sbjct: 117 QGMPVGTVVVPKGKAGMRGTKKINVNVRLNTAALPSSSST-LSTELSGGVLTLTSEAKLT 175

Query: 305 GKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
           GKV LMLIMKKKKS  MNCT+ + ++ + +++L CK
Sbjct: 176 GKVELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508721844|gb|EOY13741.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 259

 Score =  161 bits (407), Expect = 4e-37
 Identities = 86/187 (45%), Positives = 122/187 (65%)
 Frame = -2

Query: 761 TKEQKKKRNMKRLAYFAXXXXXXXXXXXXFSLTVMKVKRPKFRVGSSTLVQTLEVGNATN 582
           +KE K+K+ MK LAY A            F+LTVM++K PKFR+  S LV  L   N++ 
Sbjct: 13  SKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRI-RSVLVDDLTFNNSS- 70

Query: 581 PSFKMELNAQFRVKNTNFGQYKFENTSVNFAYRGIPVGQVLIVKSKAKFLSTKKIKAMVA 402
           PSF M+  AQ  VKNTNFG YKFEN++V FAY+G  VG+ L+ K +A+  +    K  V 
Sbjct: 71  PSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVT 130

Query: 401 MDLSSTDLSSATEWESDISKGILPLSSHSTLNGKVHLMLIMKKKKSTQMNCTMDVVINTR 222
           MDL+S  +++ ++  SD++ G L L+S S LNGKVHLM ++KKKKS +MNCTM V +  +
Sbjct: 131 MDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQK 190

Query: 221 AIQNLSC 201
            ++++ C
Sbjct: 191 LVRDIKC 197


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  156 bits (395), Expect = 1e-35
 Identities = 98/218 (44%), Positives = 131/218 (60%), Gaps = 2/218 (0%)
 Frame = -2

Query: 845 MADKDQAARPFAQANGHVRSDQESGGGLTKEQKKKRNMKR--LAYFAXXXXXXXXXXXXF 672
           MA+K   A P A ANG+ RSD ES   L  E + KR  +R    Y              F
Sbjct: 1   MAEKTNQAYPLAPANGYTRSDGES---LVSEDELKRQKRRKLFMYIGIFIVVQIIVMTVF 57

Query: 671 SLTVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVNF 492
            LTVMKVK PK R+G    VQ+L    AT PSF      Q RVKNTN+G YKF+ ++  F
Sbjct: 58  GLTVMKVKTPKVRLGGIN-VQSLNSVPAT-PSFDTSFTTQIRVKNTNWGPYKFDASTATF 115

Query: 491 AYRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHST 312
            Y+G+ VGQV I KSKA+  STKKI   V++ L++  L S++   ++++ GIL L+S + 
Sbjct: 116 MYQGVAVGQVSIPKSKARMRSTKKIS--VSVILNTNALPSSSTIGTELNSGILTLTSQAK 173

Query: 311 LNGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
           L GKV LMLIMKKKKS  M+CT+   ++T+ +++L CK
Sbjct: 174 LTGKVELMLIMKKKKSATMDCTIAFDLSTKTVKSLQCK 211


>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  156 bits (394), Expect = 1e-35
 Identities = 92/219 (42%), Positives = 127/219 (57%), Gaps = 3/219 (1%)
 Frame = -2

Query: 845 MADKDQAAR---PFAQANGHVRSDQESGGGLTKEQKKKRNMKRLAYFAXXXXXXXXXXXX 675
           MA+K Q      P A  NG+ RSD ES      E K+K+ +K  AY              
Sbjct: 1   MAEKSQKTHQTYPLASENGYTRSDGESLS--EDELKRKKRIKCFAYIGIFIVFQMAIGAV 58

Query: 674 FSLTVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVN 495
           F LTV+KVK PK R+G+STL        ++  SF    N Q RVKNTN+G YKF+   V 
Sbjct: 59  FGLTVLKVKTPKVRLGTSTLSDV----TSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVT 114

Query: 494 FAYRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHS 315
           F Y+G PVG V++ K KA    TKKI   V+++ ++  L S++   S++S G+L L+S +
Sbjct: 115 FMYQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAA--LPSSSTLSSELSGGVLTLTSEA 172

Query: 314 TLNGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
            L GKV LMLIMKKKKS  MNCT+ + ++ + +++L CK
Sbjct: 173 KLTGKVELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  155 bits (392), Expect = 2e-35
 Identities = 78/184 (42%), Positives = 118/184 (64%)
 Frame = -2

Query: 749 KKKRNMKRLAYFAXXXXXXXXXXXXFSLTVMKVKRPKFRVGSSTLVQTLEVGNATNPSFK 570
           ++KRN+K LAY              F + VM+++ PK R+G  T+       ++++PSF 
Sbjct: 10  RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69

Query: 569 MELNAQFRVKNTNFGQYKFENTSVNFAYRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLS 390
           M LNAQ  VKNTNFG +KF+N+++  +YRG PVG+  IVK++A+  ST K+   V + +S
Sbjct: 70  MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLN--VTVSVS 127

Query: 389 STDLSSATEWESDISKGILPLSSHSTLNGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQN 210
           S  +S  +   SD+  G + LSSH+ L+GK+HL  + KKKKS +MNCTM+V  +++ IQN
Sbjct: 128 SDKMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQN 187

Query: 209 LSCK 198
           L C+
Sbjct: 188 LMCQ 191


>ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prunus persica]
           gi|462406447|gb|EMJ11911.1| hypothetical protein
           PRUPE_ppa022983mg [Prunus persica]
          Length = 209

 Score =  155 bits (392), Expect = 2e-35
 Identities = 94/216 (43%), Positives = 129/216 (59%)
 Frame = -2

Query: 845 MADKDQAARPFAQANGHVRSDQESGGGLTKEQKKKRNMKRLAYFAXXXXXXXXXXXXFSL 666
           MADK     P    +GH R D+ES    ++E K+++ +K   Y              F L
Sbjct: 1   MADKYNHREP---VHGHPRRDEESTALQSEELKRQKRIKMYKYIVIFIVVQLIVLPVFGL 57

Query: 665 TVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVNFAY 486
           TVMKVK PKFR+G+   VQ L    +T PSF+     Q RVKNTN+G YKF+  +V F Y
Sbjct: 58  TVMKVKTPKFRLGNIK-VQNLSSVPST-PSFEASFATQIRVKNTNWGPYKFDAGTVTFMY 115

Query: 485 RGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHSTLN 306
           +G+ VGQV++ KSKAK  STKKI   V + L+S  L S++   +++  G+L LSS   L 
Sbjct: 116 KGVTVGQVVVPKSKAKMRSTKKID--VTVSLNSYGLPSSSNLGTELKSGVLTLSSKGKLT 173

Query: 305 GKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
           GKV LML+MKK+KS  M+CTM   ++T+ ++ L CK
Sbjct: 174 GKVVLMLMMKKRKSATMDCTMTFDLSTKTLKTLQCK 209


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  152 bits (385), Expect = 2e-34
 Identities = 94/217 (43%), Positives = 131/217 (60%), Gaps = 1/217 (0%)
 Frame = -2

Query: 845 MADKDQAARPFAQANGHVRSDQESGGGLTKEQKKKRNMKRL-AYFAXXXXXXXXXXXXFS 669
           MA+K   A P A ANG+ RSD ES   ++K++ K+R   RL  Y              F 
Sbjct: 1   MAEKTHQAYPLAPANGYTRSDGESL--VSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFG 58

Query: 668 LTVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVNFA 489
           LTVMKVK PK R+G    VQ      AT PSF      Q RVKNTN+G YKF+ ++V F 
Sbjct: 59  LTVMKVKTPKVRLGEIN-VQDFNSVPAT-PSFDTTFTTQIRVKNTNWGPYKFDASTVTFM 116

Query: 488 YRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSHSTL 309
           Y+G+ VGQV + K KA   STKK+   V + L++  L S++   S+++ G+L L+S + L
Sbjct: 117 YQGVAVGQVTVPKGKAGMRSTKKMN--VEVSLNANGLPSSSNLGSELNSGVLTLNSQAKL 174

Query: 308 NGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
           +GKV LMLIMKKKKS+ M+C +   ++T+ +++L CK
Sbjct: 175 SGKVELMLIMKKKKSSTMDCMIGFDLSTKTVKSLQCK 211


>gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus guttatus]
          Length = 214

 Score =  151 bits (382), Expect = 3e-34
 Identities = 87/220 (39%), Positives = 130/220 (59%), Gaps = 4/220 (1%)
 Frame = -2

Query: 845 MADKDQAARPFAQANGHVRSDQESGGGLTKEQKKKRNMKRLAYFAXXXXXXXXXXXXFSL 666
           MA+ +  A   A A+ + R D+E      K +K+K+ +K   Y A            F L
Sbjct: 1   MAENNHQAGEKAYASPYGRVDEEVASVAQKNEKRKKRVKCFTYVAVFIVIQSVIFMIFGL 60

Query: 665 TVMKVKRPKFRVGSSTL----VQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSV 498
           T+MKV+ PKF V S+T     V TL+    TNPSF + + A   V+N NFGQYK++N++V
Sbjct: 61  TIMKVRTPKFHVRSATFGAFEVSTLD----TNPSFNINMIADLSVRNRNFGQYKYQNSTV 116

Query: 497 NFAYRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSH 318
            F +RG  VG+  IV+S+A   ST++  A V  DLSS  + +        +  ++PL+S 
Sbjct: 117 EFFFRGTKVGEARIVRSRANARSTRRFLATV--DLSSAGVPTEVLANEFRTHALIPLTSR 174

Query: 317 STLNGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
           STL GKV +M +MKK KST MNCTM+++I+++ + N+SC+
Sbjct: 175 STLRGKVEIMKLMKKNKSTNMNCTMEIMISSKQLGNISCR 214


>gb|EXC05941.1| hypothetical protein L484_014209 [Morus notabilis]
          Length = 220

 Score =  150 bits (380), Expect = 6e-34
 Identities = 78/213 (36%), Positives = 130/213 (61%), Gaps = 2/213 (0%)
 Frame = -2

Query: 830 QAARPFAQANGHVRSDQESGGGLTKEQKKKRNMKRLAYFAXXXXXXXXXXXXFSLTVMKV 651
           Q  +P A  + ++RSD E      +E  +K+   RL +              +++ V + 
Sbjct: 9   QPTQPLAPPHAYIRSDMEMESLSAQEHIRKKRRNRLLFVTSFAVTLVILIIVYAIVVTRY 68

Query: 650 KRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSVNFAYRGIPV 471
           K PKFR+ S++   + +VGN+T+PSF   +N+QF ++N NFG+YK+E+ +V F YRG+ V
Sbjct: 69  KTPKFRLRSASFT-SFQVGNSTDPSFSFVMNSQFTIRNRNFGRYKYEDATVVFEYRGLAV 127

Query: 470 GQVLIVKSKAKFLSTKKIKAMVAMDLSST--DLSSATEWESDISKGILPLSSHSTLNGKV 297
           GQ  I  ++ +  +TKK+ A V +D SS   D  +  +   DI +G+L L+S S L GKV
Sbjct: 128 GQAYIDDARVRPRTTKKVNATVVLDSSSLVGDSGAFDQLGKDIGEGVLVLNSSSELKGKV 187

Query: 296 HLMLIMKKKKSTQMNCTMDVVINTRAIQNLSCK 198
            ++ +++K K +++NCTM+VVI +R++QNL C+
Sbjct: 188 RVLKVIRKTKYSRLNCTMNVVIASRSVQNLICQ 220


>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  150 bits (379), Expect = 8e-34
 Identities = 91/219 (41%), Positives = 134/219 (61%), Gaps = 4/219 (1%)
 Frame = -2

Query: 845 MADKDQAARPFAQ-ANGHV--RSDQESGGGLTK-EQKKKRNMKRLAYFAXXXXXXXXXXX 678
           MA+++Q A PFA  ANG    RSD ES    +  E +KK+ +K L Y A           
Sbjct: 1   MAERNQEAYPFAPYANGQAMARSDAESSRAHSDHELRKKKRIKCLIYIAVFAVFQIIVIT 60

Query: 677 XFSLTVMKVKRPKFRVGSSTLVQTLEVGNATNPSFKMELNAQFRVKNTNFGQYKFENTSV 498
            F+LTVMK+K PKFR+ S T VQ L   N+ NPS  M   A+  VKN NFG+YK++ TS+
Sbjct: 61  VFALTVMKIKSPKFRIKSIT-VQDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQTSI 119

Query: 497 NFAYRGIPVGQVLIVKSKAKFLSTKKIKAMVAMDLSSTDLSSATEWESDISKGILPLSSH 318
           +F Y G  VG  ++ K+ A+  +T+K     A+   +++L+      SDIS G + LS++
Sbjct: 120 SFIYEGTQVGDAVVPKATARTKATRKEIVSGAVKTVNSNLA------SDISAGSVTLSTY 173

Query: 317 STLNGKVHLMLIMKKKKSTQMNCTMDVVINTRAIQNLSC 201
           S +NGKV+LM ++KKKKS +M CTM V ++++ +Q++ C
Sbjct: 174 SKINGKVYLMNMIKKKKSAEMKCTMVVHLSSKQVQDIKC 212

