BLASTX nr result

ID: Paeonia23_contig00010586 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00010586
         (871 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241...   187   6e-45
ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prun...   172   1e-40
ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-r...   167   6e-39
ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620...   166   1e-38
ref|XP_002509872.1| conserved hypothetical protein [Ricinus comm...   166   1e-38
ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citr...   164   3e-38
gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]     156   1e-35
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   151   3e-34
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   150   5e-34
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   150   6e-34
ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297...   149   1e-33
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   148   3e-33
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   145   2e-32
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   141   4e-31
ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578...   140   5e-31
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   140   8e-31
ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, part...   139   2e-30
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]   138   2e-30
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...   138   3e-30
gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus...   136   9e-30

>ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241699 [Vitis vinifera]
          Length = 213

 Score =  187 bits (474), Expect = 6e-45
 Identities = 90/213 (42%), Positives = 140/213 (65%)
 Frame = +3

Query: 78  MGEENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFA 257
           M E+NQ  PLAPA ++G+SDEE    KP AS +   ++SKC VY+L G+V   +I L FA
Sbjct: 1   MPEDNQFQPLAPARLHGKSDEEFGVFKPRAS-KPPRRSSKCPVYVLAGLVTLAAIALVFA 59

Query: 258 IIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLY 437
           +  L+V+ PD ++ S+ ++NL +G         T+   + V+N NFG F  +N  A+VLY
Sbjct: 60  LAVLRVEAPDVELKSVAVKNLTHGTSPSPSFNVTLTAEVSVQNKNFGAFNFENGTATVLY 119

Query: 438 RDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGK 617
              ++G+   S+  V++++TKRM+ T D++S+ L  DKN SS+I SG + + +YA ++GK
Sbjct: 120 EGMVVGDEEFSKAHVESRKTKRMNVTLDVRSDRLWNDKNLSSDISSGSVNLTTYAQVTGK 179

Query: 618 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           V +M ++++R T  MNC+M L L SSSI+DL+C
Sbjct: 180 VRVMKVVRRRTTARMNCSMTLNLTSSSIQDLVC 212


>ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica]
           gi|462406396|gb|EMJ11860.1| hypothetical protein
           PRUPE_ppa022176mg [Prunus persica]
          Length = 213

 Score =  172 bits (437), Expect = 1e-40
 Identities = 89/213 (41%), Positives = 139/213 (65%), Gaps = 2/213 (0%)
 Frame = +3

Query: 84  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 263
           +E+Q++PLAP+ ++ RSDEE     P   + R E+++KCFVY+   IV+Q+  IL FA++
Sbjct: 4   QESQVWPLAPSRLHRRSDEE----NPTFRAIRRERSNKCFVYVFAAIVLQSIFILVFALV 59

Query: 264 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRD 443
            L+VK P   +SS+++++L +         AT+ T + +KN NFG ++ + + AS+ Y  
Sbjct: 60  VLRVKSPGFNLSSVSVKSLKHTTSPTSSLNATLVTELAIKNKNFGEYKFEGSSASLWYGG 119

Query: 444 TILGEANISETRVKAQQTKRMDCTFDLKSNGL--SGDKNFSSEIDSGILKIRSYASLSGK 617
             +GEA I + RVKA+ T+R+  + D++SN L       F  E++SG LKI SYA L+GK
Sbjct: 120 FKVGEAKIGKGRVKARGTRRVSLSIDVRSNRLPQEAKNGFEGEMNSGYLKISSYAKLTGK 179

Query: 618 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           V+LM I+KKRKT   NCTM ++L S +++DL C
Sbjct: 180 VNLMKIMKKRKTIDTNCTMVVVLKSRTVKDLFC 212


>ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777616|gb|EOY24872.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 213

 Score =  167 bits (422), Expect = 6e-39
 Identities = 88/213 (41%), Positives = 129/213 (60%)
 Frame = +3

Query: 78  MGEENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFA 257
           M E+ Q  PLAP   Y RSD E    KP AS R+E K+SKC VY+L G+VIQ +++L FA
Sbjct: 1   MQEDPQAKPLAPVEYYPRSDMEFGGIKPTASQRKE-KSSKCLVYVLVGMVIQGAVLLIFA 59

Query: 258 IIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLY 437
            I L+ + PD ++ S+T+ NL YGN        T+ T + V+N+NFG F+ +NT  +V  
Sbjct: 60  SIVLRARTPDVEIVSVTVRNLKYGNSSAPSFNLTLVTEVTVENSNFGDFKFENTTGTVWC 119

Query: 438 RDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGK 617
              ++G+  I   R +A+ T+R++ + D+ S  L   KN S  I SG+L++ S+  LSGK
Sbjct: 120 GSVVVGKMKIPTGRAQARATERLNVSVDVSSLPLPDTKNVSCNISSGLLELNSHVKLSGK 179

Query: 618 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           V +MN +K+R+   MNC M L L   + +D  C
Sbjct: 180 VSIMNFMKRRRHPEMNCFMTLNLTGQTKQDFPC 212


>ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620163 [Citrus sinensis]
          Length = 214

 Score =  166 bits (420), Expect = 1e-38
 Identities = 88/216 (40%), Positives = 135/216 (62%), Gaps = 3/216 (1%)
 Frame = +3

Query: 78  MGEENQLYPLAPA-NIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFF 254
           M EEN  +PLAP  N Y RSD+E A     A    + K+SKC VY+L  IV  ++ +L  
Sbjct: 1   MAEENPKFPLAPPRNEYPRSDQEYAP----AVIESQRKSSKCLVYVLVTIVTVSAALLIS 56

Query: 255 AIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVL 434
           A IFL+   P+ ++ S+T++NL++GN        T+ T + + N N+G FE +N   SV 
Sbjct: 57  ASIFLRPNTPEVQLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVF 116

Query: 435 YRDTILGEANISETRVKAQQTKRMDCT--FDLKSNGLSGDKNFSSEIDSGILKIRSYASL 608
           Y    +G+  I + RV+A++ KR++ T   D++SNG   ++N  S+I+SGI+K+ SYA L
Sbjct: 117 YGSVTVGDVKIRDGRVEAREVKRINVTVDVDVRSNGNLDNQNLRSDINSGIVKLNSYAKL 176

Query: 609 SGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
            G V L N++KK KT  ++C+MNL+L   ++EDL+C
Sbjct: 177 HGNVSLFNVLKKTKTPELDCSMNLVLARRAVEDLVC 212


>ref|XP_002509872.1| conserved hypothetical protein [Ricinus communis]
           gi|223549771|gb|EEF51259.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 221

 Score =  166 bits (420), Expect = 1e-38
 Identities = 90/222 (40%), Positives = 143/222 (64%), Gaps = 9/222 (4%)
 Frame = +3

Query: 78  MGEENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFA 257
           M E+NQ+ PLAPA    RSDEE A  KP  + R +E++SKC VY+L GIVI +++IL FA
Sbjct: 1   MVEDNQIVPLAPAETNPRSDEEFAAVKP--NLRLQERSSKCLVYVLAGIVILSAVILVFA 58

Query: 258 IIFLKVKLPDSKMSSITIENLNYG--------NXXXXXXXATMNTIIKVKNANFGRFELQ 413
           ++ L+   P++++S + +++LNY         N        T+ + +K++N+NFG F+  
Sbjct: 59  LVVLRPVNPNAELSFVRLKDLNYAAGSGGNGNNVSLPAFNMTLESELKIENSNFGEFKYD 118

Query: 414 NTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNG-LSGDKNFSSEIDSGILKI 590
           NT A V Y    +GEA + E RV A+ T RM+   +++S+  +    + +S+I+SGILK+
Sbjct: 119 NTSARVFYGGMAVGEAILREGRVSARDTLRMNVKVEVRSHKYIYNGTDLTSDINSGILKL 178

Query: 591 RSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
            S+A  SG+V+L+ I KKR++  M+C+ +L L S SI+DL+C
Sbjct: 179 NSHAKFSGRVNLLQIAKKRRSASMDCSFSLDLRSRSIQDLVC 220


>ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citrus clementina]
           gi|557541714|gb|ESR52692.1| hypothetical protein
           CICLE_v10023929mg [Citrus clementina]
          Length = 214

 Score =  164 bits (416), Expect = 3e-38
 Identities = 88/216 (40%), Positives = 135/216 (62%), Gaps = 3/216 (1%)
 Frame = +3

Query: 78  MGEENQLYPLAPA-NIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFF 254
           M EEN   PLAP  N Y RSD+E A     A    + K+SKC VY+L  IV  ++ +L  
Sbjct: 1   MAEENPKIPLAPPRNEYPRSDQEYAP----AVIESQRKSSKCLVYVLVTIVTVSAALLIS 56

Query: 255 AIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVL 434
           A IFL+   P+ ++ S+T++NL++GN        T+ T + + N N+G FE +N   SV 
Sbjct: 57  ASIFLRPNTPEVQLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVF 116

Query: 435 YRDTILGEANISETRVKAQQTKRMDCT--FDLKSNGLSGDKNFSSEIDSGILKIRSYASL 608
           Y    +G+  I + RV+A++ KR++ T   D++SNG   ++N SS+ +SGI+K+ SYA L
Sbjct: 117 YGSVTVGDVKIRDGRVEAREVKRINVTVDVDVRSNGNLDNQNLSSDRNSGIVKLNSYAKL 176

Query: 609 SGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
            G V+L N++KK KT  ++C+MNL+L   ++EDL+C
Sbjct: 177 HGNVNLFNVLKKTKTPELDCSMNLVLARRAVEDLVC 212


>gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]
          Length = 213

 Score =  156 bits (394), Expect = 1e-35
 Identities = 81/213 (38%), Positives = 128/213 (60%), Gaps = 2/213 (0%)
 Frame = +3

Query: 84  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 263
           +E+Q +PLAP  ++ RSDEE     P   + R+E+ +KCFVYI  GIVI  +I+L FA+I
Sbjct: 4   QESQSWPLAPMRVHQRSDEE----NPAFKALRKERTNKCFVYIFAGIVILGAILLIFALI 59

Query: 264 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFEL-QNTIASVLYR 440
            L+ K P+ K+ S+T+++L+Y         AT+   + +KN NFG +    N  A  LY 
Sbjct: 60  VLRSKSPEIKLKSVTVKSLDYSTSPWPSLNATLIATVAIKNPNFGPYRFGSNNSAVFLYG 119

Query: 441 DTILGEANISETRVKAQQTKRMDCTFDLKSNGL-SGDKNFSSEIDSGILKIRSYASLSGK 617
              LGE  I + +  A+ TKR++ T +++++ L  G  N   ++ SG++ + SY   +G+
Sbjct: 120 GGKLGEQRIRQGKATAKATKRVNVTVEIRTSRLPQGSNNLGGDLSSGMVNLSSYCKFTGR 179

Query: 618 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           VHL+ I + RKT  MNC M L+L +  I++L C
Sbjct: 180 VHLIKIFENRKTAEMNCAMTLVLKTKMIKNLRC 212


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  151 bits (382), Expect = 3e-34
 Identities = 76/207 (36%), Positives = 121/207 (58%), Gaps = 1/207 (0%)
 Frame = +3

Query: 99  YPLAPA-NIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKV 275
           YPL PA N + RSDEE      ++   +++K  KC +YI+   V QT IIL FA+  +++
Sbjct: 9   YPLVPAANGHERSDEESVAA--HSKELKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRI 66

Query: 276 KLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRDTILG 455
           + P  ++ S +    N G          MNT   VKN NFG F+ +  + +  YR T +G
Sbjct: 67  RNPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVG 126

Query: 456 EANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNI 635
            A I + R +A+ TK++D   +L SNGL        +I +G+L + S + L GK+HLM +
Sbjct: 127 RATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKV 186

Query: 636 IKKRKTTVMNCTMNLILNSSSIEDLIC 716
           IKK+K+T MNCTM++ +++ ++ ++IC
Sbjct: 187 IKKKKSTQMNCTMDVAIDTRTVRNIIC 213


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  150 bits (380), Expect = 5e-34
 Identities = 70/179 (39%), Positives = 113/179 (63%)
 Frame = +3

Query: 180 EEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXAT 359
           +  N+KC  Y+   +V QT+IIL FA+  +++K P  +  ++T+EN + GN         
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 360 MNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGL 539
           +   + VKN NFG F+ +N+   +LY    +GEA I + R +A+QTK+ D T D+ S+ L
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 540 SGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           S + N  ++I SG+L + S A LSGKVHLM +IKK+K++ M+CTM + + + +++DL C
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKC 184


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  150 bits (379), Expect = 6e-34
 Identities = 71/177 (40%), Positives = 112/177 (63%)
 Frame = +3

Query: 186 KNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMN 365
           +N KC+ YI+ G+V QT IIL FA+  +++K P +++ S+T+++LNY           + 
Sbjct: 11  QNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPHFNMRLI 70

Query: 366 TIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSG 545
             I VKN NFG F   NT A+V +   ++G+  I ++R +A++TKRM+ T D+ S+ +S 
Sbjct: 71  MEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSD 130

Query: 546 DKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           +    +++ SG L +   A L GKV LM ++KKRKT  MNCTM + LNS +++DL C
Sbjct: 131 EDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDC 187


>ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297644 [Fragaria vesca
           subsp. vesca]
          Length = 210

 Score =  149 bits (377), Expect = 1e-33
 Identities = 80/212 (37%), Positives = 129/212 (60%), Gaps = 1/212 (0%)
 Frame = +3

Query: 84  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 263
           +E+Q++PLAP  ++ RS+E      P   + R E+++KCFVY+ +GIV     +L FA++
Sbjct: 4   QESQIWPLAPGKLHQRSEEN-----PTFKAIRRERSNKCFVYVFSGIVFFCVTVLVFALL 58

Query: 264 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRD 443
            L+VK P+ ++ S+T+++L Y +        +++  + VKN NFG +E   T  S LY  
Sbjct: 59  VLRVKSPEIRLRSVTVKSLKYTSSPPSFN-VSLSGQMSVKNPNFGDYEFVPTTVSFLYSR 117

Query: 444 TILGEANISETRVKAQQTKRMDCTFDLKSNGL-SGDKNFSSEIDSGILKIRSYASLSGKV 620
             +G   +++   K ++T+R+    DL+SN L  G     S+I+SG+LK+     +SGKV
Sbjct: 118 GAVGSTKVAKGLAKVKKTERLSFGVDLRSNKLPEGANTLKSDINSGMLKLTGTGKVSGKV 177

Query: 621 HLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
            L  II KRKT  M+CTM L+L S +I+DL+C
Sbjct: 178 TLWKIINKRKTGKMDCTMTLVLKSKTIKDLVC 209


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score =  148 bits (373), Expect = 3e-33
 Identities = 79/220 (35%), Positives = 122/220 (55%), Gaps = 3/220 (1%)
 Frame = +3

Query: 66  SETKMGEENQLYPLAP-ANIYGRSDEEVATQKP-YASSRREEKNSKCFVYILTGIVIQTS 239
           +E K       YPL P A  Y RSD+E A   P  A   R +K  +C +Y+    V Q  
Sbjct: 2   AENKEAAATSPYPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVV 61

Query: 240 IILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNT 419
           +I  FA+  +K+K P  ++ + +I     G+         M+    VKN NFG FE ++ 
Sbjct: 62  VITVFALTVMKIKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFGVKNTNFGHFEYEDG 121

Query: 420 IASVLYRDTILGEANISETRVKAQQTKRMD-CTFDLKSNGLSGDKNFSSEIDSGILKIRS 596
           I    YRD  +G+ N+ E RV+A+ T+++D  + DL S GL  +    S+I +GI+ I  
Sbjct: 122 IVVFTYRDVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITI 181

Query: 597 YASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
            + L GK+HLM IIKK+K+  MNCTM ++L + S+++++C
Sbjct: 182 SSKLDGKIHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVC 221


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  145 bits (366), Expect = 2e-32
 Identities = 65/184 (35%), Positives = 115/184 (62%), Gaps = 1/184 (0%)
 Frame = +3

Query: 168 SSRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXX 347
           ++ R ++N KC  YI+ G++ QT IIL F ++ ++++ P  ++  +T+ENLN  +     
Sbjct: 7   TTSRRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSP 66

Query: 348 XXA-TMNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDL 524
             +  +N  + VKN NFG F+ QN+  ++ YR T +GEA I + R +A+ T +++ T  +
Sbjct: 67  SFSMNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSV 126

Query: 525 KSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIE 704
            S+ +S +   SS++ SG + + S+A L GK+HL  + KK+K+  MNCTM +  +S  I+
Sbjct: 127 SSDKMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQ 186

Query: 705 DLIC 716
           +L+C
Sbjct: 187 NLMC 190


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  141 bits (355), Expect = 4e-31
 Identities = 72/185 (38%), Positives = 110/185 (59%), Gaps = 1/185 (0%)
 Frame = +3

Query: 165 ASSRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNY-GNXXX 341
           A+  + +K  K F Y    +V QT +IL F++  +++K P  ++ SIT+E++ Y      
Sbjct: 16  AAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNP 75

Query: 342 XXXXATMNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFD 521
                  N  + VKN NFG F+  NT  S  Y    +GEA +++ R KA+ TK+M+ T D
Sbjct: 76  PSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVD 135

Query: 522 LKSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSI 701
           L SN +  + N +S+I SG L + ++  LSGKVHLM +IKK+K+  MNCTM + L S +I
Sbjct: 136 LNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAI 195

Query: 702 EDLIC 716
           +D+ C
Sbjct: 196 QDIKC 200


>ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578735 [Solanum tuberosum]
          Length = 223

 Score =  140 bits (354), Expect = 5e-31
 Identities = 81/222 (36%), Positives = 129/222 (58%), Gaps = 9/222 (4%)
 Frame = +3

Query: 78  MGEENQLYPLAPANIYGRSDEEVATQKPYA-----SSRREEKNSKCFVYILTGIVIQTSI 242
           M +++ + PLAP   Y +SD+ +   K        ++ +  K+ KCFVY L+ IVI + I
Sbjct: 1   MAQDSHIIPLAPPRAYPKSDQGLNLSKSINYYNNHNNNKNRKSGKCFVYFLSTIVILSII 60

Query: 243 ILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXX-ATMNTIIKVKNANFGRFELQNT 419
           +L F+++F + K P  ++  I ++NL + N          M   I V N NFG+   Q++
Sbjct: 61  MLIFSMVFFRFKSPSFELDHINVQNLRFSNSTNSSSFNMNMGGEIIVDNDNFGQINYQDS 120

Query: 420 IASV-LYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDK--NFSSEIDSGILKI 590
             SV LY +  +G AN++  RV+A+++KR+  +  L++N        N SS+I+S +LK+
Sbjct: 121 SMSVFLYDNVTIGIANVNVGRVEARKSKRIGISLQLRTNYQLNYSYGNLSSDINSRMLKL 180

Query: 591 RSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
            S+    GKV  M II K KT++MNCTMNL L S +I+DL+C
Sbjct: 181 TSFGEFRGKVKAMKIISKHKTSIMNCTMNLNLTSQAIQDLLC 222


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  140 bits (352), Expect = 8e-31
 Identities = 80/216 (37%), Positives = 126/216 (58%), Gaps = 5/216 (2%)
 Frame = +3

Query: 84  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 263
           ++ Q++PLAPAN + RSDEE A+ +  +   + +K  K  VYI    V QT +IL FA+ 
Sbjct: 4   KDQQVHPLAPANGHPRSDEESASLQ--SKELKRKKRIKYAVYIAAFAVFQTVVILIFALT 61

Query: 264 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMN----TIIKVKNANFGRFELQNTIASV 431
            ++VK P  ++  +T+E +   N       A+ N    T + VKN NFG ++  N   S 
Sbjct: 62  VMRVKNPKVRIGKVTVETMETSNTEAA---ASFNLRFITQVTVKNTNFGHYKFDNATMSF 118

Query: 432 LYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGL-SGDKNFSSEIDSGILKIRSYASL 608
           LY   ++GEA I + R +A+ TK++D T ++ S+ L S      SE+ S +L + S A L
Sbjct: 119 LYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKL 178

Query: 609 SGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
            GKV LM ++KK+K+  MNCT+   +++ S++DL C
Sbjct: 179 KGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKC 214


>ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, partial [Populus
           trichocarpa] gi|550346930|gb|ERP65340.1| hypothetical
           protein POPTR_0001s09980g, partial [Populus trichocarpa]
          Length = 173

 Score =  139 bits (349), Expect = 2e-30
 Identities = 81/173 (46%), Positives = 114/173 (65%), Gaps = 1/173 (0%)
 Frame = +3

Query: 201 FVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKV 380
           F   L  IVI ++IIL FAII +K + P  K+SS+ +E+L+YGN        T+   + V
Sbjct: 3   FFNSLALIVILSAIILVFAII-VKPRTPRVKLSSVAVEHLSYGNNPIPSFNMTLAAEVSV 61

Query: 381 KNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNG-LSGDKNF 557
           KN+NF RF+ +NT +S LY+  ++GEA +   RV A++T+RM+    + S G LS  KN 
Sbjct: 62  KNSNFVRFKFENTSSSALYKGMVVGEAKLRSGRVGARKTRRMNIVVKIGSPGSLSEAKNL 121

Query: 558 SSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           SS+I+SG+LK+ SYA+L G V L  I+K R T VM+C MNL L+S SI+DL C
Sbjct: 122 SSDINSGMLKMNSYATLKGDVRLFGIVKNR-TAVMSCGMNLNLSSRSIQDLEC 173


>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score =  138 bits (348), Expect = 2e-30
 Identities = 68/182 (37%), Positives = 111/182 (60%)
 Frame = +3

Query: 171 SRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXX 350
           S R +K+ KC  Y+   +V QT IIL F ++ LK++ P  +++SI++EN ++        
Sbjct: 7   SVRRKKSLKCLAYVAAFVVFQTGIILLFVLLVLKIRDPKVRIASISVENQHFSTNSFSMD 66

Query: 351 XATMNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKS 530
              +   + VKN NFG F+  N+ A++ Y  T +GEA I + R +++ TKR + T  + S
Sbjct: 67  ---LKARVTVKNTNFGHFKFDNSTATISYFGTAVGEATILKARARSRSTKRFNITVPISS 123

Query: 531 NGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDL 710
           + ++  +    +++SG+L + S A LSGK+HL  I KK+K+  M+CTM L  N+SSIE+L
Sbjct: 124 SKVNNHRQLRRDLNSGVLNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENL 183

Query: 711 IC 716
            C
Sbjct: 184 SC 185


>gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus]
          Length = 213

 Score =  138 bits (347), Expect = 3e-30
 Identities = 65/210 (30%), Positives = 117/210 (55%)
 Frame = +3

Query: 87  ENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAIIF 266
           E +  PL  AN +GRSD E       A  +R++K +KCF+YI   ++ Q  +I  F++  
Sbjct: 3   EKEHQPLPYANGHGRSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTV 62

Query: 267 LKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRDT 446
           +K++ P  ++ S  +   + G         T+N    VKNANFGR++ +NT     Y+ T
Sbjct: 63  MKIRTPKFRIRSAHLTTFHAGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGT 122

Query: 447 ILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHL 626
            +G+  + ++R   + TK+     DL      G+   +S++++G+++I S A ++G+V L
Sbjct: 123 PVGQVFVRDSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNAGVVQITSQARMAGRVEL 182

Query: 627 MNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           + ++KK K+T MNC M ++  +  I +L+C
Sbjct: 183 IFVMKKNKSTDMNCNMEIVTATQQIRNLVC 212


>gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus]
          Length = 214

 Score =  136 bits (343), Expect = 9e-30
 Identities = 69/215 (32%), Positives = 121/215 (56%), Gaps = 2/215 (0%)
 Frame = +3

Query: 78  MGEENQL--YPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILF 251
           MGE+ Q   YP+APAN +GRSD E       AS   + K ++C +YI    +IQ ++++ 
Sbjct: 1   MGEKEQQLSYPMAPANDHGRSDTEAGGAA--ASELHKRKRTQCLIYIGLLAIIQIAVVIV 58

Query: 252 FAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASV 431
           F++  +K++ P  ++ S  + N N G          +N    VKNANFGR++  +T    
Sbjct: 59  FSLTVMKIRNPRFRIRSAHLTNFNAGTPASPAFTGKLNAEFSVKNANFGRYKYMDTTVDF 118

Query: 432 LYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLS 611
           +YR T +GE  + E+R   + TK+ +   DL       +   +S++++G++ I S A +S
Sbjct: 119 VYRGTRVGEVFVRESRAGWRTTKKFNVAVDLSLANARANPQLASDLNAGVVPISSEARMS 178

Query: 612 GKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           G V L+ ++KK ++T +NCTM ++  +  I +++C
Sbjct: 179 GSVELLFVLKKNRSTGLNCTMEIVTATQQIRNILC 213


Top