BLASTX nr result
ID: Paeonia23_contig00010586
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00010586
         (871 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241...  187  6e-45
ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prun...  172  1e-40
ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-r...  167  6e-39
ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620...  166  1e-38
ref|XP_002509872.1| conserved hypothetical protein [Ricinus comm...  166  1e-38
ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citr...  164  3e-38
gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]    156  1e-35
ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom...  151  3e-34
ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r...  150  5e-34
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...  150  6e-34
ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297...  149  1e-33
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...  148  3e-33
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...  145  2e-32
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...  141  4e-31
ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578...  140  5e-31
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...  140  8e-31
ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, part...  139  2e-30
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]  138  2e-30
gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus...  138  3e-30
gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus...  136  9e-30

>ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241699 [Vitis vinifera]
          Length = 213

 Score = 187 bits (474), Expect = 6e-45
 Identities = 90/213 (42%), Positives = 140/213 (65%)
 Frame = +3

Query: 78  MGEENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFA 257
           M E+NQ PLAPA ++G+SDEE KP AS + ++SKC VY+L G+V +I L FA
Sbjct: 1   MPEDNQFQPLAPARLHGKSDEEFGVFKPRAS-KPPRRSSKCPVYVLAGLVTLAAIALVFA 59

Query: 258 IIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLY 437
           + L+V+ PD ++ S+ ++NL +G T+ + V+N NFG F +N A+VLY
Sbjct: 60  LAVLRVEAPDVELKSVAVKNLTHGTSPSPSFNVTLTAEVSVQNKNFGAFNFENGTATVLY 119

Query: 438 RDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGK 617
           ++G+ S+ V++++TKRM+ T D++S+ L DKN SS+I SG + + +YA ++GK
Sbjct: 120 EGMVVGDEEFSKAHVESRKTKRMNVTLDVRSDRLWNDKNLSSDISSGSVNLTTYAQVTGK 179

Query: 618 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           V +M ++++R T MNC+M L L SSSI+DL+C
Sbjct: 180 VRVMKVVRRRTTARMNCSMTLNLTSSSIQDLVC 212

>ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica]
 gi|462406396|gb|EMJ11860.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica]
          Length = 213

 Score = 172 bits (437), Expect = 1e-40
 Identities = 89/213 (41%), Positives = 139/213 (65%), Gaps = 2/213 (0%)
 Frame = +3

Query: 84  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 263
           +E+Q++PLAP+ ++ RSDEE P + R E+++KCFVY+ IV+Q+ IL FA++
Sbjct: 4   QESQVWPLAPSRLHRRSDEE----NPTFRAIRRERSNKCFVYVFAAIVLQSIFILVFALV 59

Query: 264 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRD 443
           L+VK P +SS+++++L + AT+ T + +KN NFG ++ + + AS+ Y
Sbjct: 60  VLRVKSPGFNLSSVSVKSLKHTTSPTSSLNATLVTELAIKNKNFGEYKFEGSSASLWYGG 119

Query: 444 TILGEANISETRVKAQQTKRMDCTFDLKSNGL--SGDKNFSSEIDSGILKIRSYASLSGK 617
           +GEA I + RVKA+ T+R+ + D++SN L F E++SG LKI SYA L+GK
Sbjct: 120 FKVGEAKIGKGRVKARGTRRVSLSIDVRSNRLPQEAKNGFEGEMNSGYLKISSYAKLTGK 179

Query: 618 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           V+LM I+KKRKT NCTM ++L S +++DL C
Sbjct: 180 MKIMKKRKTIDTNCTMVVVLKSRTVKDLFC 212

>ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
 gi|508777616|gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
          Length = 213

 Score = 167 bits (422), Expect = 6e-39
 Identities = 88/213 (41%), Positives = 129/213 (60%)
 Frame = +3

Query: 78  MGEENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFA 257
           M E+ Q PLAP Y RSD E KP AS R+E K+SKC VY+L G+VIQ +++L FA
Sbjct: 1   MQEDPQAKPLAPVEYYPRSDMEFGGIKPTASQRKE-KSSKCLVYVLVGMVIQGAVLLIFA 59

Query: 258 IIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLY 437
           I L+ + PD ++ S+T+ NL YGN T+ T + V+N+NFG F+ +NT +V
Sbjct: 60  SIVLRARTPDVEIVSVTVRNLKYGNSSAPSFNLTLVTEVTVENSNFGDFKFENTTGTVWC 119

Query: 438 RDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGK 617
           ++G+ I R +A+ T+R++ + D+ S L KN S I SG+L++ S+ LSGK
Sbjct: 120 GSVVVGKMKIPTGRAQARATERLNVSVDVSSLPLPDTKNVSCNISSGLLELNSHVKLSGK 179

Query: 618 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           V +MN +K+R+ MNC M L L + +D C
Sbjct: 180 VSIMNFMKRRRHPEMNCFMTLNLTGQTKQDFPC 212

>ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620163 [Citrus sinensis]
          Length = 214

 Score = 166 bits (420), Expect = 1e-38
 Identities = 88/216 (40%), Positives = 135/216 (62%), Gaps = 3/216 (1%)
 Frame = +3

Query: 78  MGEENQLYPLAPA-NIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFF 254
           M EEN +PLAP N Y RSD+E A A + K+SKC VY+L IV ++ +L
Sbjct: 1   MAEENPKFPLAPPRNEYPRSDQEYAP----AVIESQRKSSKCLVYVLVTIVTVSAALLIS 56

Query: 255 AIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVL 434
           A IFL+ P+ ++ S+T++NL++GN T+ T + + N N+G FE +N SV
Sbjct: 57  ASIFLRPNTPEVQLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVF 116

Query: 435 YRDTILGEANISETRVKAQQTKRMDCT--FDLKSNGLSGDKNFSSEIDSGILKIRSYASL 608
           Y +G+ I + RV+A++ KR++ T D++SNG ++N S+I+SGI+K+ SYA L
Sbjct: 117 YGSVTVGDVKIRDGRVEAREVKRINVTVDVDVRSNGNLDNQNLRSDINSGIVKLNSYAKL 176

Query: 609 SGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           G V L N++KK KT ++C+MNL+L ++EDL+C
Sbjct: 177 HGNVSLFNVLKKTKTPELDCSMNLVLARRAVEDLVC 212

>ref|XP_002509872.1| conserved hypothetical protein [Ricinus communis]
 gi|223549771|gb|EEF51259.1| conserved hypothetical protein [Ricinus communis]
          Length = 221

 Score = 166 bits (420), Expect = 1e-38
 Identities = 90/222 (40%), Positives = 143/222 (64%), Gaps = 9/222 (4%)
 Frame = +3

Query: 78  MGEENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFA 257
           M E+NQ+ PLAPA RSDEE A KP + R +E++SKC VY+L GIVI +++IL FA
Sbjct: 1   MVEDNQIVPLAPAETNPRSDEEFAAVKP--NLRLQERSSKCLVYVLAGIVILSAVILVFA 58

Query: 258 IIFLKVKLPDSKMSSITIENLNYG--------NXXXXXXXATMNTIIKVKNANFGRFELQ 413
           ++ L+ P++++S + +++LNY N T+ + +K++N+NFG F+
Sbjct: 59  LVVLRPVNPNAELSFVRLKDLNYAAGSGGNGNNVSLPAFNMTLESELKIENSNFGEFKYD 118

Query: 414 NTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNG-LSGDKNFSSEIDSGILKI 590
           NT A V Y +GEA + E RV A+ T RM+ +++S+ + + +S+I+SGILK+
Sbjct: 119 NTSARVFYGGMAVGEAILREGRVSARDTLRMNVKVEVRSHKYIYNGTDLTSDINSGILKL 178

Query: 591 RSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           S+A SG+V+L+ I KKR++ M+C+ +L L S SI+DL+C
Sbjct: 179 NSHAKFSGRVNLLQIAKKRRSASMDCSFSLDLRSRSIQDLVC 220

>ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citrus clementina]
 gi|557541714|gb|ESR52692.1| hypothetical protein CICLE_v10023929mg [Citrus clementina]
          Length = 214

 Score = 164 bits (416), Expect = 3e-38
 Identities = 88/216 (40%), Positives = 135/216 (62%), Gaps = 3/216 (1%)
 Frame = +3

Query: 78  MGEENQLYPLAPA-NIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFF 254
           M EEN PLAP N Y RSD+E A A + K+SKC VY+L IV ++ +L
Sbjct: 1   MAEENPKIPLAPPRNEYPRSDQEYAP----AVIESQRKSSKCLVYVLVTIVTVSAALLIS 56

Query: 255 AIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVL 434
           A IFL+ P+ ++ S+T++NL++GN T+ T + + N N+G FE +N SV
Sbjct: 57  ASIFLRPNTPEVQLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVF 116

Query: 435 YRDTILGEANISETRVKAQQTKRMDCT--FDLKSNGLSGDKNFSSEIDSGILKIRSYASL 608
           Y +G+ I + RV+A++ KR++ T D++SNG ++N SS+ +SGI+K+ SYA L
Sbjct: 117 YGSVTVGDVKIRDGRVEAREVKRINVTVDVDVRSNGNLDNQNLSSDRNSGIVKLNSYAKL 176

Query: 609 SGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           G V+L N++KK KT ++C+MNL+L ++EDL+C
Sbjct: 177 HGNVNLFNVLKKTKTPELDCSMNLVLARRAVEDLVC 212

>gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]
          Length = 213

 Score = 156 bits (394), Expect = 1e-35
 Identities = 81/213 (38%), Positives = 128/213 (60%), Gaps = 2/213 (0%)
 Frame = +3

Query: 84  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 263
           +E+Q +PLAP ++ RSDEE P + R+E+ +KCFVYI GIVI +I+L FA+I
Sbjct: 4   QESQSWPLAPMRVHQRSDEE----NPAFKALRKERTNKCFVYIFAGIVILGAILLIFALI 59

Query: 264 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFEL-QNTIASVLYR 440
           L+ K P+ K+ S+T+++L+Y AT+ + +KN NFG + N A LY
Sbjct: 60  VLRSKSPEIKLKSVTVKSLDYSTSPWPSLNATLIATVAIKNPNFGPYRFGSNNSAVFLYG 119

Query: 441 DTILGEANISETRVKAQQTKRMDCTFDLKSNGL-SGDKNFSSEIDSGILKIRSYASLSGK 617
           LGE I + + A+ TKR++ T +++++ L G N ++ SG++ + SY +G+
Sbjct: 120 GGKLGEQRIRQGKATAKATKRVNVTVEIRTSRLPQGSNNLGGDLSSGMVNLSSYCKFTGR 179

Query: 618 VHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           VHL+ I + RKT MNC M L+L + I++L C
Sbjct: 180 VHLIKIFENRKTAEMNCAMTLVLKTKMIKNLRC 212

>ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
 gi|508776114|gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
          Length = 214

 Score = 151 bits (382), Expect = 3e-34
 Identities = 76/207 (36%), Positives = 121/207 (58%), Gaps = 1/207 (0%)
 Frame = +3

Query: 99  YPLAPA-NIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKV 275
           YPL PA N + RSDEE ++ +++K KC +YI+ V QT IIL FA+ +++
Sbjct: 9   YPLVPAANGHERSDEESVAA--HSKELKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRI 66

Query: 276 KLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRDTILG 455
           + P ++ S + N G MNT VKN NFG F+ + + + YR T +G
Sbjct: 67  RNPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVG 126

Query: 456 EANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNI 635
           A I + R +A+ TK++D +L SNGL +I +G+L + S + L GK+HLM +
Sbjct: 127 RATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKV 186

Query: 636 IKKRKTTVMNCTMNLILNSSSIEDLIC 716
           IKK+K+T MNCTM++ +++ ++ ++IC
Sbjct: 187 IKKKKSTQMNCTMDVAIDTRTVRNIIC 213

>ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao]
 gi|508777612|gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao]
          Length = 185

 Score = 150 bits (380), Expect = 5e-34
 Identities = 70/179 (39%), Positives = 113/179 (63%)
 Frame = +3

Query: 180 EEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXAT 359
           + N+KC Y+ +V QT+IIL FA+ +++K P + ++T+EN + GN
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65

Query: 360 MNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGL 539
           + + VKN NFG F+ +N+ +LY +GEA I + R +A+QTK+ D T D+ S+ L
Sbjct: 66  LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125

Query: 540 SGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           S + N ++I SG+L + S A LSGKVHLM +IKK+K++ M+CTM + + + +++DL C
Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKC 184

>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
 gi|508777614|gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
          Length = 188

 Score = 150 bits (379), Expect = 6e-34
 Identities = 71/177 (40%), Positives = 112/177 (63%)
 Frame = +3

Query: 186 KNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMN 365
           +N KC+ YI+ G+V QT IIL FA+ +++K P +++ S+T+++LNY +
Sbjct: 11  QNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPHFNMRLI 70

Query: 366 TIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSG 545
           I VKN NFG F NT A+V + ++G+ I ++R +A++TKRM+ T D+ S+ +S
Sbjct: 71  MEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSD 130

Query: 546 DKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           + +++ SG L + A L GKV LM ++KKRKT MNCTM + LNS +++DL C
Sbjct: 131 EDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDC 187

>ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297644 [Fragaria vesca subsp. vesca]
          Length = 210

 Score = 149 bits (377), Expect = 1e-33
 Identities = 80/212 (37%), Positives = 129/212 (60%), Gaps = 1/212 (0%)
 Frame = +3

Query: 84  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 263
           +E+Q++PLAP ++ RS+E P + R E+++KCFVY+ +GIV +L FA++
Sbjct: 4   QESQIWPLAPGKLHQRSEEN-----PTFKAIRRERSNKCFVYVFSGIVFFCVTVLVFALL 58

Query: 264 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRD 443
           L+VK P+ ++ S+T+++L Y + +++ + VKN NFG +E T S LY
Sbjct: 59  VLRVKSPEIRLRSVTVKSLKYTSSPPSFN-VSLSGQMSVKNPNFGDYEFVPTTVSFLYSR 117

Query: 444 TILGEANISETRVKAQQTKRMDCTFDLKSNGL-SGDKNFSSEIDSGILKIRSYASLSGKV 620
           +G +++ K ++T+R+ DL+SN L G S+I+SG+LK+ +SGKV
Sbjct: 118 GAVGSTKVAKGLAKVKKTERLSFGVDLRSNKLPEGANTLKSDINSGMLKLTGTGKVSGKV 177

Query: 621 HLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           L II KRKT M+CTM L+L S +I+DL+C
Sbjct: 178 TLWKIINKRKTGKMDCTMTLVLKSKTIKDLVC 209

>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca subsp. vesca]
          Length = 222

 Score = 148 bits (373), Expect = 3e-33
 Identities = 79/220 (35%), Positives = 122/220 (55%), Gaps = 3/220 (1%)
 Frame = +3

Query: 66  SETKMGEENQLYPLAP-ANIYGRSDEEVATQKP-YASSRREEKNSKCFVYILTGIVIQTS 239
           +E K YPL P A Y RSD+E A P A R +K +C +Y+ V Q
Sbjct: 2   AENKEAAATSPYPLMPSAPSYMRSDQEAAASAPPSAEELRHKKRMRCLLYVSIFAVFQVV 61

Query: 240 IILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNT 419
           +I FA+ +K+K P ++ + +I G+ M+ VKN NFG FE ++
Sbjct: 62  VITVFALTVMKIKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFGVKNTNFGHFEYEDG 121

Query: 420 IASVLYRDTILGEANISETRVKAQQTKRMD-CTFDLKSNGLSGDKNFSSEIDSGILKIRS 596
           I YRD +G+ N+ E RV+A+ T+++D + DL S GL + S+I +GI+ I
Sbjct: 122 IVVFTYRDVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPITI 181

Query: 597 YASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           + L GK+HLM IIKK+K+ MNCTM ++L + S+++++C
Sbjct: 182 SSKLDGKIHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVC 221

>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
 gi|508776108|gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
          Length = 191

 Score = 145 bits (366), Expect = 2e-32
 Identities = 65/184 (35%), Positives = 115/184 (62%), Gaps = 1/184 (0%)
 Frame = +3

Query: 168 SSRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXX 347
           ++ R ++N KC YI+ G++ QT IIL F ++ ++++ P ++ +T+ENLN +
Sbjct: 7   TTSRRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSP 66

Query: 348 XXA-TMNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDL 524
           + +N + VKN NFG F+ QN+ ++ YR T +GEA I + R +A+ T +++ T +
Sbjct: 67  SFSMNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSV 126

Query: 525 KSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIE 704
           S+ +S + SS++ SG + + S+A L GK+HL + KK+K+ MNCTM + +S I+
Sbjct: 127 SSDKMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQ 186

Query: 705 DLIC 716
           +L+C
Sbjct: 187 NLMC 190

>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
 gi|508776113|gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
          Length = 201

 Score = 141 bits (355), Expect = 4e-31
 Identities = 72/185 (38%), Positives = 110/185 (59%), Gaps = 1/185 (0%)
 Frame = +3

Query: 165 ASSRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNY-GNXXX 341
           A+ + +K K F Y +V QT +IL F++ +++K P ++ SIT+E++ Y
Sbjct: 16  AAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNP 75

Query: 342 XXXXATMNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFD 521
           N + VKN NFG F+ NT S Y +GEA +++ R KA+ TK+M+ T D
Sbjct: 76  PSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVD 135

Query: 522 LKSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSI 701
           L SN + + N +S+I SG L + ++ LSGKVHLM +IKK+K+ MNCTM + L S +I
Sbjct: 136 LNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAI 195

Query: 702 EDLIC 716
           +D+ C
Sbjct: 196 QDIKC 200

>ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578735 [Solanum tuberosum]
          Length = 223

 Score = 140 bits (354), Expect = 5e-31
 Identities = 81/222 (36%), Positives = 129/222 (58%), Gaps = 9/222 (4%)
 Frame = +3

Query: 78  MGEENQLYPLAPANIYGRSDEEVATQKPYA-----SSRREEKNSKCFVYILTGIVIQTSI 242
           M +++ + PLAP Y +SD+ + K ++ + K+ KCFVY L+ IVI + I
Sbjct: 1   MAQDSHIIPLAPPRAYPKSDQGLNLSKSINYYNNHNNNKNRKSGKCFVYFLSTIVILSII 60

Query: 243 ILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXX-ATMNTIIKVKNANFGRFELQNT 419
           +L F+++F + K P ++ I ++NL + N M I V N NFG+ Q++
Sbjct: 61  MLIFSMVFFRFKSPSFELDHINVQNLRFSNSTNSSSFNMNMGGEIIVDNDNFGQINYQDS 120

Query: 420 IASV-LYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDK--NFSSEIDSGILKI 590
           SV LY + +G AN++ RV+A+++KR+ + L++N N SS+I+S +LK+
Sbjct: 121 SMSVFLYDNVTIGIANVNVGRVEARKSKRIGISLQLRTNYQLNYSYGNLSSDINSRMLKL 180

Query: 591 RSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           S+ GKV M II K KT++MNCTMNL L S +I+DL+C
Sbjct: 181 TSFGEFRGKVKAMKIISKHKTSIMNCTMNLNLTSQAIQDLLC 222

>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
 gi|508777615|gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
          Length = 215

 Score = 140 bits (352), Expect = 8e-31
 Identities = 80/216 (37%), Positives = 126/216 (58%), Gaps = 5/216 (2%)
 Frame = +3

Query: 84  EENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAII 263
           ++ Q++PLAPAN + RSDEE A+ + + + +K K VYI V QT +IL FA+
Sbjct: 4   KDQQVHPLAPANGHPRSDEESASLQ--SKELKRKKRIKYAVYIAAFAVFQTVVILIFALT 61

Query: 264 FLKVKLPDSKMSSITIENLNYGNXXXXXXXATMN----TIIKVKNANFGRFELQNTIASV 431
           ++VK P ++ +T+E + N A+ N T + VKN NFG ++ N S
Sbjct: 62  VMRVKNPKVRIGKVTVETMETSNTEAA---ASFNLRFITQVTVKNTNFGHYKFDNATMSF 118

Query: 432 LYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGL-SGDKNFSSEIDSGILKIRSYASL 608
           LY ++GEA I + R +A+ TK++D T ++ S+ L S SE+ S +L + S A L
Sbjct: 119 LYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKL 178

Query: 609 SGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           GKV LM ++KK+K+ MNCT+ +++ S++DL C
Sbjct: 179 KGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKC 214

>ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, partial [Populus trichocarpa]
 gi|550346930|gb|ERP65340.1| hypothetical protein POPTR_0001s09980g, partial [Populus trichocarpa]
          Length = 173

 Score = 139 bits (349), Expect = 2e-30
 Identities = 81/173 (46%), Positives = 114/173 (65%), Gaps = 1/173 (0%)
 Frame = +3

Query: 201 FVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKV 380
           F L IVI ++IIL FAII +K + P K+SS+ +E+L+YGN T+ + V
Sbjct: 3   FFNSLALIVILSAIILVFAII-VKPRTPRVKLSSVAVEHLSYGNNPIPSFNMTLAAEVSV 61

Query: 381 KNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKSNG-LSGDKNF 557
           KN+NF RF+ +NT +S LY+ ++GEA + RV A++T+RM+ + S G LS KN
Sbjct: 62  KNSNFVRFKFENTSSSALYKGMVVGEAKLRSGRVGARKTRRMNIVVKIGSPGSLSEAKNL 121

Query: 558 SSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           SS+I+SG+LK+ SYA+L G V L I+K R T VM+C MNL L+S SI+DL C
Sbjct: 122 SSDINSGMLKMNSYATLKGDVRLFGIVKNR-TAVMSCGMNLNLSSRSIQDLEC 173

>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score = 138 bits (348), Expect = 2e-30
 Identities = 68/182 (37%), Positives = 111/182 (60%)
 Frame = +3

Query: 171 SRREEKNSKCFVYILTGIVIQTSIILFFAIIFLKVKLPDSKMSSITIENLNYGNXXXXXX 350
           S R +K+ KC Y+ +V QT IIL F ++ LK++ P +++SI++EN ++
Sbjct: 7   SVRRKKSLKCLAYVAAFVVFQTGIILLFVLLVLKIRDPKVRIASISVENQHFSTNSFSMD 66

Query: 351 XATMNTIIKVKNANFGRFELQNTIASVLYRDTILGEANISETRVKAQQTKRMDCTFDLKS 530
           + + VKN NFG F+ N+ A++ Y T +GEA I + R +++ TKR + T + S
Sbjct: 67  ---LKARVTVKNTNFGHFKFDNSTATISYFGTAVGEATILKARARSRSTKRFNITVPISS 123

Query: 531 NGLSGDKNFSSEIDSGILKIRSYASLSGKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDL 710
           + ++ + +++SG+L + S A LSGK+HL I KK+K+ M+CTM L N+SSIE+L
Sbjct: 124 SKVNNHRQLRRDLNSGVLNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENL 183

Query: 711 IC 716
           C
Sbjct: 184 SC 185

>gb|EYU25165.1| hypothetical protein MIMGU_mgv1a013680mg [Mimulus guttatus]
          Length = 213

 Score = 138 bits (347), Expect = 3e-30
 Identities = 65/210 (30%), Positives = 117/210 (55%)
 Frame = +3

Query: 87  ENQLYPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILFFAIIF 266
           E + PL AN +GRSD E A +R++K +KCF+YI ++ Q +I F++
Sbjct: 3   EKEHQPLPYANGHGRSDAEAGAAAHDAREQRKKKRTKCFIYIALFVIFQLGVIAIFSVTV 62

Query: 267 LKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASVLYRDT 446
           +K++ P ++ S + + G T+N VKNANFGR++ +NT Y+ T
Sbjct: 63  MKIRTPKFRIRSAHLTTFHAGTPGSPSFSGTVNAEFSVKNANFGRYKYRNTTVGFFYKGT 122

Query: 447 ILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLSGKVHL 626
           +G+ + ++R + TK+ DL G+ +S++++G+++I S A ++G+V L
Sbjct: 123 PVGQVFVRDSRAGWRSTKKFRVVVDLNLANAQGNPQLASDLNAGVVQITSQARMAGRVEL 182

Query: 627 MNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           + ++KK K+T MNC M ++ + I +L+C
Sbjct: 183 IFVMKKNKSTDMNCNMEIVTATQQIRNLVC 212

>gb|EYU25167.1| hypothetical protein MIMGU_mgv1a013636mg [Mimulus guttatus]
          Length = 214

 Score = 136 bits (343), Expect = 9e-30
 Identities = 69/215 (32%), Positives = 121/215 (56%), Gaps = 2/215 (0%)
 Frame = +3

Query: 78  MGEENQL--YPLAPANIYGRSDEEVATQKPYASSRREEKNSKCFVYILTGIVIQTSIILF 251
           MGE+ Q YP+APAN +GRSD E AS + K ++C +YI +IQ ++++
Sbjct: 1   MGEKEQQLSYPMAPANDHGRSDTEAGGAA--ASELHKRKRTQCLIYIGLLAIIQIAVVIV 58

Query: 252 FAIIFLKVKLPDSKMSSITIENLNYGNXXXXXXXATMNTIIKVKNANFGRFELQNTIASV 431
           F++ +K++ P ++ S + N N G +N VKNANFGR++ +T
Sbjct: 59  FSLTVMKIRNPRFRIRSAHLTNFNAGTPASPAFTGKLNAEFSVKNANFGRYKYMDTTVDF 118

Query: 432 LYRDTILGEANISETRVKAQQTKRMDCTFDLKSNGLSGDKNFSSEIDSGILKIRSYASLS 611
           +YR T +GE + E+R + TK+ + DL + +S++++G++ I S A +S
Sbjct: 119 VYRGTRVGEVFVRESRAGWRTTKKFNVAVDLSLANARANPQLASDLNAGVVPISSEARMS 178

Query: 612 GKVHLMNIIKKRKTTVMNCTMNLILNSSSIEDLIC 716
           G V L+ ++KK ++T +NCTM ++ + I +++C
Sbjct: 179 GSVELLFVLKKNRSTGLNCTMEIVTATQQIRNILC 213