BLASTX nr result

ID: Paeonia22_contig00006019 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00006019
         (853 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241...   187   6e-45
ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prun...   172   1e-40
ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-r...   167   6e-39
ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620...   166   1e-38
ref|XP_002509872.1| conserved hypothetical protein [Ricinus comm...   166   1e-38
ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citr...   164   3e-38
gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]     156   1e-35
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...   151   3e-34
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...   150   5e-34
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...   150   6e-34
ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297...   149   1e-33
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   148   3e-33
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...   145   2e-32
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...   141   4e-31
ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578...   140   5e-31
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...   140   8e-31
ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, part...   139   2e-30
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]   138   2e-30
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...   138   3e-30
gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus...   136   9e-30

>ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241699 [Vitis vinifera]
          Length = 213

 Score =  187 bits (474), Expect = 6e-45
 Identities = 90/213 (42%), Positives = 140/213 (65%)
 Frame = +1

Query: 37  MGEENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFA 216
           M E+NQ  PLAPA ++G+SDEE    KP AS +   ++SKC VY+L G+V   +I L FA
Sbjct: 1   MPEDNQFQPLAPARLHGKSDEEFGVFKPRAS-KPPRRSSKCPVYVLAGLVTLAAIALVFA 59

Query: 217 IIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLY 396
           +  L+V+ PD ++ S+ ++NL +G         T+   + V+N NFG F  +N  A+VLY
Sbjct: 60  LAVLRVEAPDVELKSVAVKNLTHGTSPSPSFNVTLTAEVSVQNKNFGAFNFENGTATVLY 119

Query: 397 RDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGK 576
              ++G+   S+  V++++TKRM+ T D++S+ L  DKN SS+I SG + + +YA ++GK
Sbjct: 120 EGMVVGDEEFSKAHVESRKTKRMNVTLDVRSDRLWNDKNLSSDISSGSVNLTTYAQVTGK 179

Query: 577 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
           V +M ++++R T  MNC+M L L SSSI+DL+C
Sbjct: 180 VRVMKVVRRRTTARMNCSMTLNLTSSSIQDLVC 212


>ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica]
           gi|462406396|gb|EMJ11860.1| hypothetical protein
           PRUPE_ppa022176mg [Prunus persica]
          Length = 213

 Score =  172 bits (437), Expect = 1e-40
 Identities = 89/213 (41%), Positives = 139/213 (65%), Gaps = 2/213 (0%)
 Frame = +1

Query: 43  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 222
           +E+Q++PLAP+ ++ RSDEE     P   + R E+++KCFVY+   IV+Q+  IL FA++
Sbjct: 4   QESQVWPLAPSRLHRRSDEE----NPTFRAIRRERSNKCFVYVFAAIVLQSIFILVFALV 59

Query: 223 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRD 402
            L+VK P   +SS+++++L +         AT+ T + +KN NFG ++ + + AS+ Y  
Sbjct: 60  VLRVKSPGFNLSSVSVKSLKHTTSPTSSLNATLVTELAIKNKNFGEYKFEGSSASLWYGG 119

Query: 403 TILGEANISETRVKAQQTKRMDCTFDLKSNGL--SGDKNFSSEIDSGILKIRSYASLSGK 576
             +GEA I + RVKA+ T+R+  + D++SN L       F  E++SG LKI SYA L+GK
Sbjct: 120 FKVGEAKIGKGRVKARGTRRVSLSIDVRSNRLPQEAKNGFEGEMNSGYLKISSYAKLTGK 179

Query: 577 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
           V+LM I+KKRKT   NCTM ++L S +++DL C
Sbjct: 180 VNLMKIMKKRKTIDTNCTMVVVLKSRTVKDLFC 212


>ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777616|gb|EOY24872.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 213

 Score =  167 bits (422), Expect = 6e-39
 Identities = 88/213 (41%), Positives = 129/213 (60%)
 Frame = +1

Query: 37  MGEENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFA 216
           M E+ Q  PLAP   Y RSD E    KP AS R+E K+SKC VY+L G+VIQ +++L FA
Sbjct: 1   MQEDPQAKPLAPVEYYPRSDMEFGGIKPTASQRKE-KSSKCLVYVLVGMVIQGAVLLIFA 59

Query: 217 IIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLY 396
            I L+ + PD ++ S+T+ NL YGN        T+ T + V+N+NFG F+ +NT  +V  
Sbjct: 60  SIVLRARTPDVEIVSVTVRNLKYGNSSAPSFNLTLVTEVTVENSNFGDFKFENTTGTVWC 119

Query: 397 RDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGK 576
              ++G+  I   R +A+ T+R++ + D+ S  L   KN S  I SG+L++ S+  LSGK
Sbjct: 120 GSVVVGKMKIPTGRAQARATERLNVSVDVSSLPLPDTKNVSCNISSGLLELNSHVKLSGK 179

Query: 577 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
           V +MN +K+R+   MNC M L L   + +D  C
Sbjct: 180 VSIMNFMKRRRHPEMNCFMTLNLTGQTKQDFPC 212


>ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620163 [Citrus sinensis]
          Length = 214

 Score =  166 bits (420), Expect = 1e-38
 Identities = 88/216 (40%), Positives = 135/216 (62%), Gaps = 3/216 (1%)
 Frame = +1

Query: 37  MGEENQLYPLAPA-NIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFF 213
           M EEN  +PLAP  N Y RSD+E A     A    + K+SKC VY+L  IV  ++ +L  
Sbjct: 1   MAEENPKFPLAPPRNEYPRSDQEYAP----AVIESQRKSSKCLVYVLVTIVTVSAALLIS 56

Query: 214 AIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVL 393
           A IFL+   P+ ++ S+T++NL++GN        T+ T + + N N+G FE +N   SV 
Sbjct: 57  ASIFLRPNTPEVQLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVF 116

Query: 394 YRDTILGEANISETRVKAQQTKRMDCT--FDLKSNGLSGDKNFSSEIDSGILKIRSYASL 567
           Y    +G+  I + RV+A++ KR++ T   D++SNG   ++N  S+I+SGI+K+ SYA L
Sbjct: 117 YGSVTVGDVKIRDGRVEAREVKRINVTVDVDVRSNGNLDNQNLRSDINSGIVKLNSYAKL 176

Query: 568 SGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
            G V L N++KK KT  ++C+MNL+L   ++EDL+C
Sbjct: 177 HGNVSLFNVLKKTKTPELDCSMNLVLARRAVEDLVC 212


>ref|XP_002509872.1| conserved hypothetical protein [Ricinus communis]
           gi|223549771|gb|EEF51259.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 221

 Score =  166 bits (420), Expect = 1e-38
 Identities = 90/222 (40%), Positives = 143/222 (64%), Gaps = 9/222 (4%)
 Frame = +1

Query: 37  MGEENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFA 216
           M E+NQ+ PLAPA    RSDEE A  KP  + R +E++SKC VY+L GIVI +++IL FA
Sbjct: 1   MVEDNQIVPLAPAETNPRSDEEFAAVKP--NLRLQERSSKCLVYVLAGIVILSAVILVFA 58

Query: 217 IIFLKVKLPDSKMSSITIENLNYG--------NXXXXXXXATMNTIIKVKNANFGRFELQ 372
           ++ L+   P++++S + +++LNY         N        T+ + +K++N+NFG F+  
Sbjct: 59  LVVLRPVNPNAELSFVRLKDLNYAAGSGGNGNNVSLPAFNMTLESELKIENSNFGEFKYD 118

Query: 373 NTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNG-LSGDKNFSSEIDSGILKI 549
           NT A V Y    +GEA + E RV A+ T RM+   +++S+  +    + +S+I+SGILK+
Sbjct: 119 NTSARVFYGGMAVGEAILREGRVSARDTLRMNVKVEVRSHKYIYNGTDLTSDINSGILKL 178

Query: 550 RSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
            S+A  SG+V+L+ I KKR++  M+C+ +L L S SI+DL+C
Sbjct: 179 NSHAKFSGRVNLLQIAKKRRSASMDCSFSLDLRSRSIQDLVC 220


>ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citrus clementina]
           gi|557541714|gb|ESR52692.1| hypothetical protein
           CICLE_v10023929mg [Citrus clementina]
          Length = 214

 Score =  164 bits (416), Expect = 3e-38
 Identities = 88/216 (40%), Positives = 135/216 (62%), Gaps = 3/216 (1%)
 Frame = +1

Query: 37  MGEENQLYPLAPA-NIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFF 213
           M EEN   PLAP  N Y RSD+E A     A    + K+SKC VY+L  IV  ++ +L  
Sbjct: 1   MAEENPKIPLAPPRNEYPRSDQEYAP----AVIESQRKSSKCLVYVLVTIVTVSAALLIS 56

Query: 214 AIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVL 393
           A IFL+   P+ ++ S+T++NL++GN        T+ T + + N N+G FE +N   SV 
Sbjct: 57  ASIFLRPNTPEVQLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVF 116

Query: 394 YRDTILGEANISETRVKAQQTKRMDCT--FDLKSNGLSGDKNFSSEIDSGILKIRSYASL 567
           Y    +G+  I + RV+A++ KR++ T   D++SNG   ++N SS+ +SGI+K+ SYA L
Sbjct: 117 YGSVTVGDVKIRDGRVEAREVKRINVTVDVDVRSNGNLDNQNLSSDRNSGIVKLNSYAKL 176

Query: 568 SGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
            G V+L N++KK KT  ++C+MNL+L   ++EDL+C
Sbjct: 177 HGNVNLFNVLKKTKTPELDCSMNLVLARRAVEDLVC 212


>gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]
          Length = 213

 Score =  156 bits (394), Expect = 1e-35
 Identities = 81/213 (38%), Positives = 128/213 (60%), Gaps = 2/213 (0%)
 Frame = +1

Query: 43  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 222
           +E+Q +PLAP  ++ RSDEE     P   + R+E+ +KCFVYI  GIVI  +I+L FA+I
Sbjct: 4   QESQSWPLAPMRVHQRSDEE----NPAFKALRKERTNKCFVYIFAGIVILGAILLIFALI 59

Query: 223 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFEL-QNTIASVLYR 399
            L+ K P+ K+ S+T+++L+Y         AT+   + +KN NFG +    N  A  LY 
Sbjct: 60  VLRSKSPEIKLKSVTVKSLDYSTSPWPSLNATLIATVAIKNPNFGPYRFGSNNSAVFLYG 119

Query: 400 DTILGEANISETRVKAQQTKRMDCTFDLKSNGL-SGDKNFSSEIDSGILKIRSYASLSGK 576
              LGE  I + +  A+ TKR++ T +++++ L  G  N   ++ SG++ + SY   +G+
Sbjct: 120 GGKLGEQRIRQGKATAKATKRVNVTVEIRTSRLPQGSNNLGGDLSSGMVNLSSYCKFTGR 179

Query: 577 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
           VHL+ I + RKT  MNC M L+L +  I++L C
Sbjct: 180 VHLIKIFENRKTAEMNCAMTLVLKTKMIKNLRC 212


>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
           gi|508776114|gb|EOY23370.1| Uncharacterized protein
           TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  151 bits (382), Expect = 3e-34
 Identities = 76/207 (36%), Positives = 121/207 (58%), Gaps = 1/207 (0%)
 Frame = +1

Query: 58  YPLAPA-NIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKV 234
           YPL PA N + RSDEE      ++   +++K  KC +YI+   V QT IIL FA+  +++
Sbjct: 9   YPLVPAANGHERSDEESVAA--HSKELKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRI 66

Query: 235 KLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRDTILG 414
           + P  ++ S +    N G          MNT   VKN NFG F+ +  + +  YR T +G
Sbjct: 67  RNPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVG 126

Query: 415 EANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNI 594
            A I + R +A+ TK++D   +L SNGL        +I +G+L + S + L GK+HLM +
Sbjct: 127 RATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKV 186

Query: 595 IKKRKTTVMNCTMNLILNSSSIEDLIC 675
           IKK+K+T MNCTM++ +++ ++ ++IC
Sbjct: 187 IKKKKSTQMNCTMDVAIDTRTVRNIIC 213


>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late
           embryogenesis abundant hydroxyproline-rich glycofamily
           protein [Theobroma cacao]
          Length = 185

 Score =  150 bits (380), Expect = 5e-34
 Identities = 70/179 (39%), Positives = 113/179 (63%)
 Frame = +1

Query: 139 EEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXAT 318
           +  N+KC  Y+   +V QT+IIL FA+  +++K P  +  ++T+EN + GN         
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 319 MNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGL 498
           +   + VKN NFG F+ +N+   +LY    +GEA I + R +A+QTK+ D T D+ S+ L
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 499 SGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
           S + N  ++I SG+L + S A LSGKVHLM +IKK+K++ M+CTM + + + +++DL C
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKC 184


>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777614|gb|EOY24870.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 188

 Score =  150 bits (379), Expect = 6e-34
 Identities = 71/177 (40%), Positives = 112/177 (63%)
 Frame = +1

Query: 145 KNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMN 324
           +N KC+ YI+ G+V QT IIL FA+  +++K P +++ S+T+++LNY           + 
Sbjct: 11  QNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPHFNMRLI 70

Query: 325 TIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSG 504
             I VKN NFG F   NT A+V +   ++G+  I ++R +A++TKRM+ T D+ S+ +S 
Sbjct: 71  MEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSD 130

Query: 505 DKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
           +    +++ SG L +   A L GKV LM ++KKRKT  MNCTM + LNS +++DL C
Sbjct: 131 EDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDC 187


>ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297644 [Fragaria vesca
           subsp. vesca]
          Length = 210

 Score =  149 bits (377), Expect = 1e-33
 Identities = 80/212 (37%), Positives = 129/212 (60%), Gaps = 1/212 (0%)
 Frame = +1

Query: 43  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 222
           +E+Q++PLAP  ++ RS+E      P   + R E+++KCFVY+ +GIV     +L FA++
Sbjct: 4   QESQIWPLAPGKLHQRSEEN-----PTFKAIRRERSNKCFVYVFSGIVFFCVTVLVFALL 58

Query: 223 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRD 402
            L+VK P+ ++ S+T+++L Y +        +++  + VKN NFG +E   T  S LY  
Sbjct: 59  VLRVKSPEIRLRSVTVKSLKYTSSPPSFN-VSLSGQMSVKNPNFGDYEFVPTTVSFLYSR 117

Query: 403 TILGEANISETRVKAQQTKRMDCTFDLKSNGL-SGDKNFSSEIDSGILKIRSYASLSGKV 579
             +G   +++   K ++T+R+    DL+SN L  G     S+I+SG+LK+     +SGKV
Sbjct: 118 GAVGSTKVAKGLAKVKKTERLSFGVDLRSNKLPEGANTLKSDINSGMLKLTGTGKVSGKV 177

Query: 580 HLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
            L  II KRKT  M+CTM L+L S +I+DL+C
Sbjct: 178 TLWKIINKRKTGKMDCTMTLVLKSKTIKDLVC 209


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score =  148 bits (373), Expect = 3e-33
 Identities = 79/220 (35%), Positives = 122/220 (55%), Gaps = 3/220 (1%)
 Frame = +1

Query: 25  SETKMGEENQLYPLAP-ANIYGRSDEEVATQKP-YASSRREEKNSKCFVYILTGIVIQTS 198
           +E K       YPL P A  Y RSD+E A   P  A   R +K  +C +Y+    V Q  
Sbjct: 2   AENKEAAATSPYPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVV 61

Query: 199 IILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNT 378
           +I  FA+  +K+K P  ++ + +I     G+         M+    VKN NFG FE ++ 
Sbjct: 62  VITVFALTVMKIKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFGVKNTNFGHFEYEDG 121

Query: 379 IASVLYRDTILGEANISETRVKAQQTKRMD-CTFDLKSNGLSGDKNFSSEIDSGILKIRS 555
           I    YRD  +G+ N+ E RV+A+ T+++D  + DL S GL  +    S+I +GI+ I  
Sbjct: 122 IVVFTYRDVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITI 181

Query: 556 YASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
            + L GK+HLM IIKK+K+  MNCTM ++L + S+++++C
Sbjct: 182 SSKLDGKIHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVC 221


>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776108|gb|EOY23364.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 191

 Score =  145 bits (366), Expect = 2e-32
 Identities = 65/184 (35%), Positives = 115/184 (62%), Gaps = 1/184 (0%)
 Frame = +1

Query: 127 SSRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXX 306
           ++ R ++N KC  YI+ G++ QT IIL F ++ ++++ P  ++  +T+ENLN  +     
Sbjct: 7   TTSRRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSP 66

Query: 307 XXA-TMNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDL 483
             +  +N  + VKN NFG F+ QN+  ++ YR T +GEA I + R +A+ T +++ T  +
Sbjct: 67  SFSMNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSV 126

Query: 484 KSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIE 663
            S+ +S +   SS++ SG + + S+A L GK+HL  + KK+K+  MNCTM +  +S  I+
Sbjct: 127 SSDKMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQ 186

Query: 664 DLIC 675
           +L+C
Sbjct: 187 NLMC 190


>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508776113|gb|EOY23369.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 201

 Score =  141 bits (355), Expect = 4e-31
 Identities = 72/185 (38%), Positives = 110/185 (59%), Gaps = 1/185 (0%)
 Frame = +1

Query: 124 ASSRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNY-GNXXX 300
           A+  + +K  K F Y    +V QT +IL F++  +++K P  ++ SIT+E++ Y      
Sbjct: 16  AAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNP 75

Query: 301 XXXXATMNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFD 480
                  N  + VKN NFG F+  NT  S  Y    +GEA +++ R KA+ TK+M+ T D
Sbjct: 76  PSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVD 135

Query: 481 LKSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSI 660
           L SN +  + N +S+I SG L + ++  LSGKVHLM +IKK+K+  MNCTM + L S +I
Sbjct: 136 LNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAI 195

Query: 661 EDLIC 675
           +D+ C
Sbjct: 196 QDIKC 200


>ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578735 [Solanum tuberosum]
          Length = 223

 Score =  140 bits (354), Expect = 5e-31
 Identities = 81/222 (36%), Positives = 129/222 (58%), Gaps = 9/222 (4%)
 Frame = +1

Query: 37  MGEENQLYPLAPANIYGRSDEEVATQKPYA-----SSRREEKNSKCFVYILTGIVIQTSI 201
           M +++ + PLAP   Y +SD+ +   K        ++ +  K+ KCFVY L+ IVI + I
Sbjct: 1   MAQDSHIIPLAPPRAYPKSDQGLNLSKSINYYNNHNNNKNRKSGKCFVYFLSTIVILSII 60

Query: 202 ILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXX-ATMNTIIKVKNANFGRFELQNT 378
           +L F+++F + K P  ++  I ++NL + N          M   I V N NFG+   Q++
Sbjct: 61  MLIFSMVFFRFKSPSFELDHINVQNLRFSNSTNSSSFNMNMGGEIIVDNDNFGQINYQDS 120

Query: 379 IASV-LYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDK--NFSSEIDSGILKI 549
             SV LY +  +G AN++  RV+A+++KR+  +  L++N        N SS+I+S +LK+
Sbjct: 121 SMSVFLYDNVTIGIANVNVGRVEARKSKRIGISLQLRTNYQLNYSYGNLSSDINSRMLKL 180

Query: 550 RSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
            S+    GKV  M II K KT++MNCTMNL L S +I+DL+C
Sbjct: 181 TSFGEFRGKVKAMKIISKHKTSIMNCTMNLNLTSQAIQDLLC 222


>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
           gi|508777615|gb|EOY24871.1| Late embryogenesis abundant
           hydroxyproline-rich glycofamily protein, putative
           [Theobroma cacao]
          Length = 215

 Score =  140 bits (352), Expect = 8e-31
 Identities = 80/216 (37%), Positives = 126/216 (58%), Gaps = 5/216 (2%)
 Frame = +1

Query: 43  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 222
           ++ Q++PLAPAN + RSDEE A+ +  +   + +K  K  VYI    V QT +IL FA+ 
Sbjct: 4   KDQQVHPLAPANGHPRSDEESASLQ--SKELKRKKRIKYAVYIAAFAVFQTVVILIFALT 61

Query: 223 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMN----TIIKVKNANFGRFELQNTIASV 390
            ++VK P  ++  +T+E +   N       A+ N    T + VKN NFG ++  N   S 
Sbjct: 62  VMRVKNPKVRIGKVTVETMETSNTEAA---ASFNLRFITQVTVKNTNFGHYKFDNATMSF 118

Query: 391 LYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGL-SGDKNFSSEIDSGILKIRSYASL 567
           LY   ++GEA I + R +A+ TK++D T ++ S+ L S      SE+ S +L + S A L
Sbjct: 119 LYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKL 178

Query: 568 SGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
            GKV LM ++KK+K+  MNCT+   +++ S++DL C
Sbjct: 179 KGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKC 214


>ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, partial [Populus
           trichocarpa] gi|550346930|gb|ERP65340.1| hypothetical
           protein POPTR_0001s09980g, partial [Populus trichocarpa]
          Length = 173

 Score =  139 bits (349), Expect = 2e-30
 Identities = 81/173 (46%), Positives = 114/173 (65%), Gaps = 1/173 (0%)
 Frame = +1

Query: 160 FVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKV 339
           F   L  IVI ++IIL FAII +K + P  K+SS+ +E+L+YGN        T+   + V
Sbjct: 3   FFNSLALIVILSAIILVFAII-VKPRTPRVKLSSVAVEHLSYGNNPIPSFNMTLAAEVSV 61

Query: 340 KNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNG-LSGDKNF 516
           KN+NF RF+ +NT +S LY+  ++GEA +   RV A++T+RM+    + S G LS  KN 
Sbjct: 62  KNSNFVRFKFENTSSSALYKGMVVGEAKLRSGRVGARKTRRMNIVVKIGSPGSLSEAKNL 121

Query: 517 SSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
           SS+I+SG+LK+ SYA+L G V L  I+K R T VM+C MNL L+S SI+DL C
Sbjct: 122 SSDINSGMLKMNSYATLKGDVRLFGIVKNR-TAVMSCGMNLNLSSRSIQDLEC 173


>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score =  138 bits (348), Expect = 2e-30
 Identities = 68/182 (37%), Positives = 111/182 (60%)
 Frame = +1

Query: 130 SRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXX 309
           S R +K+ KC  Y+   +V QT IIL F ++ LK++ P  +++SI++EN ++        
Sbjct: 7   SVRRKKSLKCLAYVAAFVVFQTGIILLFVLLVLKIRDPKVRIASISVENQHFSTNSFSMD 66

Query: 310 XATMNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKS 489
              +   + VKN NFG F+  N+ A++ Y  T +GEA I + R +++ TKR + T  + S
Sbjct: 67  ---LKARVTVKNTNFGHFKFDNSTATISYFGTAVGEATILKARARSRSTKRFNITVPISS 123

Query: 490 NGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDL 669
           + ++  +    +++SG+L + S A LSGK+HL  I KK+K+  M+CTM L  N+SSIE+L
Sbjct: 124 SKVNNHRQLRRDLNSGVLNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENL 183

Query: 670 IC 675
            C
Sbjct: 184 SC 185


>gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus]
          Length = 213

 Score =  138 bits (347), Expect = 3e-30
 Identities = 65/210 (30%), Positives = 117/210 (55%)
 Frame = +1

Query: 46  ENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAIIF 225
           E +  PL  AN +GRSD E       A  +R++K +KCF+YI   ++ Q  +I  F++  
Sbjct: 3   EKEHQPLPYANGHGRSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTV 62

Query: 226 LKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRDT 405
           +K++ P  ++ S  +   + G         T+N    VKNANFGR++ +NT     Y+ T
Sbjct: 63  MKIRTPKFRIRSAHLTTFHAGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGT 122

Query: 406 ILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHL 585
            +G+  + ++R   + TK+     DL      G+   +S++++G+++I S A ++G+V L
Sbjct: 123 PVGQVFVRDSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNAGVVQITSQARMAGRVEL 182

Query: 586 MNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
           + ++KK K+T MNC M ++  +  I +L+C
Sbjct: 183 IFVMKKNKSTDMNCNMEIVTATQQIRNLVC 212


>gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus]
          Length = 214

 Score =  136 bits (343), Expect = 9e-30
 Identities = 69/215 (32%), Positives = 121/215 (56%), Gaps = 2/215 (0%)
 Frame = +1

Query: 37  MGEENQL--YPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILF 210
           MGE+ Q   YP+APAN +GRSD E       AS   + K ++C +YI    +IQ ++++ 
Sbjct: 1   MGEKEQQLSYPMAPANDHGRSDTEAGGAA--ASELHKRKRTQCLIYIGLLAIIQIAVVIV 58

Query: 211 FAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASV 390
           F++  +K++ P  ++ S  + N N G          +N    VKNANFGR++  +T    
Sbjct: 59  FSLTVMKIRNPRFRIRSAHLTNFNAGTPASPAFTGKLNAEFSVKNANFGRYKYMDTTVDF 118

Query: 391 LYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLS 570
           +YR T +GE  + E+R   + TK+ +   DL       +   +S++++G++ I S A +S
Sbjct: 119 VYRGTRVGEVFVRESRAGWRTTKKFNVAVDLSLANARANPQLASDLNAGVVPISSEARMS 178

Query: 571 GKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 675
           G V L+ ++KK ++T +NCTM ++  +  I +++C
Sbjct: 179 GSVELLFVLKKNRSTGLNCTMEIVTATQQIRNILC 213

