BLASTX nr result
ID: Akebia23_contig00014798
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00014798 (1856 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247... 377 e-102
ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu... 372 e-100
ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family prot... 341 6e-91
ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [... 332 5e-88
ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215... 330 1e-87
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm... 312 4e-82
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1... 303 2e-79
gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus... 299 3e-78
ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phas... 297 1e-77
ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [A... 294 1e-76
ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citr... 293 2e-76
ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253... 291 5e-76
emb|CBI17195.3| unnamed protein product [Vitis vinifera] 291 5e-76
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr... 288 6e-75
ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prun... 276 2e-71
gb|EXB62642.1| hypothetical protein L484_023937 [Morus notabilis] 263 2e-67
ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300... 260 1e-66
tpg|DAA51855.1| TPA: hypothetical protein ZEAMMB73_029894 [Zea m... 256 2e-65
ref|XP_002466313.1| hypothetical protein SORBIDRAFT_01g005470 [S...
255 5e-65 ref|XP_004965630.1| PREDICTED: CCAAT/enhancer-binding protein al... 252 4e-64 >ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED: uncharacterized protein LOC101247662 isoform 2 [Solanum lycopersicum] Length = 473 Score = 377 bits (969), Expect = e-102 Identities = 218/434 (50%), Positives = 264/434 (60%), Gaps = 5/434 (1%) Frame = +2 Query: 179 DPPNPQFSPQTPDNEDAHVEPDEPLFPSGLGHGRG--KPIPSNPVLPSFSSWISSTKSVV 352 D PN FSP +ED+ E P PSG GHGRG KP+PS+P++PSF S++ + + Sbjct: 43 DSPNFGFSPGKSASEDSKPESSTPATPSGTGHGRGRGKPLPSSPIVPSFHSFVDNPNTPA 102 Query: 353 GRGRITQHQQQPPNSRSEESQTVQP-KKPIFFCREDSLESTQKPQFDDSDRNPEEGIK-P 526 GRGR PP ++ Q QP +KPIFF +E+ E+T + P + P Sbjct: 103 GRGRGGIGPFSPPPQPQQQQQ--QPLRKPIFFAKEE--ETTDSNSSSSNAPKPRDDSNLP 158 Query: 527 LSLTSVLPGAGRGKGVKFVDS-EEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSRE 703 S+ SVL GAGRGK ++ S EKP EENRH ++ S +LSRE Sbjct: 159 SSVISVLTGAGRGKPLQTASSVSEKPKEENRHLRPRQQKVADSG--ERASSPPPQRLSRE 216 Query: 704 DAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXD 883 DAVK AV ILS D Sbjct: 217 DAVKKAVGILSRSDDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDEERGD 276 Query: 884 SEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNN 1063 + +G YLGD+ADGE+LA ++G E+M+ L EGFEE+S VLPSPMDDAYL+ALHTN Sbjct: 277 G--NLESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLPSPMDDAYLEALHTNM 334 Query: 1064 LIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDK 1243 +IE EPEYLMGDFE+N L+DALEKMKPFLMAYEGIKDQEEWEE++KE M+ Sbjct: 335 MIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKETMET 394 Query: 1244 LPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGF 1423 +P MKE++D YSGPD VTA QQQQELERVAKTLP+ AP SVKRFT+RAVLSLQSN GWGF Sbjct: 395 VPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGWGF 454 Query: 1424 DKKCQFMDKLVWEV 1465 DKKCQFMDK+V EV Sbjct: 455 DKKCQFMDKVVMEV 468 >ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum] Length = 480 Score = 372 
bits (954), Expect = e-100 Identities = 214/435 (49%), Positives = 259/435 (59%), Gaps = 7/435 (1%) Frame = +2 Query: 179 DPPNPQFSPQTPDNEDAHVEPDEPLFPSGLGHGRG--KPIPSNPVLPSFSSWISSTKSVV 352 D PN FSP +ED+ E P PSG GHGRG KP+PS+P++PSF S + + Sbjct: 43 DFPNFGFSPGKSASEDSKPESSTPTTPSGTGHGRGRGKPLPSSPIVPSFYSVVDNPNPPA 102 Query: 353 GRGRITQHQ-QQPPNSRSEESQTVQP-KKPIFFCREDSLESTQKPQFDDSDRNPEEGIKP 526 GRGR PP + ++ Q QP +KPIFF +E+ + D + + Sbjct: 103 GRGRGGIGPFSPPPQPQQQQQQQQQPLRKPIFFAKEEETADSNSSSSDAPTPRDDSNLSS 162 Query: 527 LSLTSVLPGAGRGKGVKFVDS-EEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSRE 703 S+ SVL GAGRGK ++ EKP EENRH ++ S +LSRE Sbjct: 163 -SVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQKVADSG--ERASSPPPQRLSRE 219 Query: 704 DAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXD 883 DAVK AV ILS + Sbjct: 220 DAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDE 279 Query: 884 SEDDFS--TGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHT 1057 D S +G YLGD+ADGE+LAQ++G E M+ L EGFEE+S VLPSPMDDAY++ALHT Sbjct: 280 ERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMSARVLPSPMDDAYIEALHT 339 Query: 1058 NNLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVM 1237 N +IE EPEYLMGDFE+N L+DALEKMKPFLMAYEGIKDQEEWEE++KE M Sbjct: 340 NMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKETM 399 Query: 1238 DKLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGW 1417 + +P MKE++D YSGPD VTA QQQQELERVAKTLP+ AP SVKRFT+RAVLSLQSN GW Sbjct: 400 ETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGW 459 Query: 1418 GFDKKCQFMDKLVWE 1462 GFDKKCQFMDK+V E Sbjct: 460 GFDKKCQFMDKVVME 474 >ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508784903|gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 474 Score = 341 bits (875), Expect = 6e-91 Identities = 200/434 (46%), Positives = 254/434 (58%), Gaps = 7/434 (1%) Frame = +2 Query: 185 PNPQFSPQTPDNEDAHVEPDEPLFPSGLGHGRGK--PIPSNPVLPSFSSWISSTKSVVGR 358 P 
P S N D+ P P+G+GHGRG+ P+ S+P+ FSS++S T S GR Sbjct: 60 PPPGKSGSGDSNRDSAESP-----PAGVGHGRGRGGPLSSDPIPHPFSSFVSQTGS--GR 112 Query: 359 GRITQHQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSL- 535 GR+T PP Q K+PIF ++D E+ + E I P ++ Sbjct: 113 GRVTSESVPPP-----PPPPAQAKQPIFIKKKDEDETESSAKAAAEPIQSSEPIFPPNIL 167 Query: 536 -TSVLPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAV 712 SVL GAGRGK VK + + EENRH + + S+++S+E+A Sbjct: 168 PVSVLSGAGRGKPVKQPEPASRRQEENRHI------------RVAQQQSPSAQMSQEEAT 215 Query: 713 KNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX---D 883 K A+ ILS D Sbjct: 216 KKAMGILSRRSESGESGMVGRGGRASMGMGGGRGRGRGRGRGMGRGRGRRQGEDTRIVKD 275 Query: 884 SEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNN 1063 S + + GLYLGDNADGE+ AQ IG +NM+KLVEGFEE+ + VLPSPMDDAYLDALHTN Sbjct: 276 SGEGSADGLYLGDNADGEKFAQTIGADNMNKLVEGFEEMGSRVLPSPMDDAYLDALHTNC 335 Query: 1064 LIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDK 1243 IE+EPEYLM +F TN L+DALEKMKPFLMAYEGI+ QEEWEE++KE M++ Sbjct: 336 SIEFEPEYLMEEFGTNPDIDEKPPMPLRDALEKMKPFLMAYEGIQSQEEWEEVIKETMER 395 Query: 1244 LPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGF 1423 +P ++E++D YSGPD VTA +QQ+ELERVAKT+P++AP SVK+F +RAVLSLQSN GWGF Sbjct: 396 VPLLQEIVDYYSGPDRVTAKKQQEELERVAKTIPERAPSSVKQFANRAVLSLQSNPGWGF 455 Query: 1424 DKKCQFMDKLVWEV 1465 DKKCQFMDKLVWEV Sbjct: 456 DKKCQFMDKLVWEV 469 >ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [Glycine max] gi|571476117|ref|XP_006586864.1| PREDICTED: la-related protein 1 isoform X2 [Glycine max] Length = 481 Score = 332 bits (850), Expect = 5e-88 Identities = 201/439 (45%), Positives = 260/439 (59%), Gaps = 13/439 (2%) Frame = +2 Query: 188 NPQFSPQTPDNEDAHVEPDEPLFP--SGLGHGRGKPIPSNPVLPSFSSWISS-TKSVVGR 358 N + +P P++ ++ + EP P SGLGHGRGKP+P + LPSFSS+ISS + GR Sbjct: 52 NNERAPVEPNSSESKSDTTEPPIPPGSGLGHGRGKPMPPSG-LPSFSSFISSINQPPAGR 110 Query: 359 GRIT----QHQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPE---EG 517 GR T QH QPP+S PKKPIFF REDS+ T F R+ + + Sbjct: 
111 GRGTAPHPQHDLQPPDSG--------PKKPIFFKREDSVSPTASNDFLPPKRSVDHAHDN 162 Query: 518 IKPLSLTSVLPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLS 697 P S+ VL G GRGK +K D E + EENRH + + S S Sbjct: 163 KLPGSIPGVLSGLGRGKSMKQPDLETQVTEENRHLRTRQAPGAA---SSETVPKRSPIPS 219 Query: 698 REDAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 877 +EDA +NA++ILS Sbjct: 220 QEDATRNALKILSHGKDDGSDTGRGREYGGRGGLDRGRGRGRGRGRGRGMGRGRFVERDV 279 Query: 878 XDS---EDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDA 1048 + DD++TGLY GD+ADGE+LA+++G E M++L EGFEE+++ VLPSP++D +LDA Sbjct: 280 DEKVMDTDDYATGLYAGDDADGEKLARKVGPEIMNQLTEGFEEMTSRVLPSPLEDEFLDA 339 Query: 1049 LHTNNLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMK 1228 L N IE+EPEYL+ +F+ N L+DALEK KPFLM+YEGI+ QEEWEEIM+ Sbjct: 340 LDINYAIEFEPEYLV-EFD-NPDIDEKEPISLRDALEKAKPFLMSYEGIQSQEEWEEIME 397 Query: 1229 EVMDKLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSN 1408 E M ++P +K++ID YSGPD VTA +QQ+ELERVAKTLP P SVK+FT+RAV+SLQSN Sbjct: 398 ETMARVPLLKKIIDHYSGPDRVTAKKQQEELERVAKTLPGSVPSSVKQFTNRAVISLQSN 457 Query: 1409 AGWGFDKKCQFMDKLVWEV 1465 GWGFDKKC FMDKLVWEV Sbjct: 458 PGWGFDKKCHFMDKLVWEV 476 >ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215545 [Cucumis sativus] gi|449502143|ref|XP_004161555.1| PREDICTED: uncharacterized protein LOC101224016 [Cucumis sativus] Length = 478 Score = 330 bits (847), Expect = 1e-87 Identities = 193/430 (44%), Positives = 241/430 (56%), Gaps = 5/430 (1%) Frame = +2 Query: 182 PPNP-QFSPQTPDNEDAHVEPDEPLFPS---GLGHGRGKPIPSNPVLPSFSSWISSTK-S 346 P P F+P P+ E ++ EP+ GLGHGRGKP PS+P+ PSFSS+ S + S Sbjct: 47 PSGPFDFTPPVPNQEHSNASKQEPIDSRPTPGLGHGRGKPTPSSPLRPSFSSFSPSVRPS 106 Query: 347 VVGRGRITQHQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKP 526 VGRGR P+ RS +PKKP+FF + ++ +S R E P Sbjct: 107 SVGRGR----GDASPSIRSPPEPDSEPKKPVFFSKNNAGDSAASTSLGGLHRVSGERNLP 162 Query: 527 LSLTSVLPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSRED 706 SL S G GRGK +K E++P +ENRH + + ++ R + Sbjct: 163 
ESLHSEFSGVGRGKPMKQPVPEDQPKQENRHLRPRQEGDGPGAGERGRGRGFEPRIGRGE 222 Query: 707 AVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDS 886 +N R++S D Sbjct: 223 PWRNTNRMVSKDGPDGEVGGGRGTSGYRGRGARGPYRRGARGSFRTGERRERRSGH--DK 280 Query: 887 EDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNL 1066 ED ++ GLYLG+N DGERLA+RIG ENM+KLVEGFEE+S VLPSP+ D YLD + TN + Sbjct: 281 EDGYAAGLYLGNNEDGERLAKRIGTENMNKLVEGFEEMSGRVLPSPLVDQYLDGMDTNFM 340 Query: 1067 IEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKL 1246 IE EPEYLMGDFE N L+DALEKMKPFLMAYE I+ EEWEEI++E M + Sbjct: 341 IECEPEYLMGDFENNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIVEETMQSV 400 Query: 1247 PHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFD 1426 P +KE++D Y GPD VTA +QQ ELERVAKTLP AP SVK+FT+R VLSLQSN GWGFD Sbjct: 401 PLLKEIVDAYGGPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRVVLSLQSNPGWGFD 460 Query: 1427 KKCQFMDKLV 1456 KK Q MDKLV Sbjct: 461 KKWQLMDKLV 470 >ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis] gi|223537066|gb|EEF38701.1| conserved hypothetical protein [Ricinus communis] Length = 436 Score = 312 bits (799), Expect = 4e-82 Identities = 185/424 (43%), Positives = 237/424 (55%), Gaps = 3/424 (0%) Frame = +2 Query: 203 PQTPDNEDAHVEPDEPLFP---SGLGHGRGKPIPSNPVLPSFSSWISSTKSVVGRGRITQ 373 P++ D H P P P +G+GHG G NP+LP+FSS++SS +GRGR Sbjct: 62 PESSDVAKPHYPPPPPPPPPPRNGVGHGHGG---GNPILPAFSSFVSS----IGRGRAIT 114 Query: 374 HQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPG 553 + P+ + ESQ+ + + P ++ S L G Sbjct: 115 DPEPGPSRQPTESQS-------------------------------DSVLPSTIHSSLSG 143 Query: 554 AGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKNAVRIL 733 GRG+ K V + EENRH ++ R+ K+SRE+AVK AV IL Sbjct: 144 FGRGEPDKPVVPTPQVKEENRHIRDRSRAKPKT---EEAEVRAKPKISREEAVKRAVSIL 200 Query: 734 SXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDFSTGLY 913 S D ++ F +GL+ Sbjct: 201 SQGDTGEGMGRGRGGGRGRGRGRGRGRLEQRGRMMD-------------DVDEGFGSGLF 247 Query: 914 
LGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPEYLM 1093 LGDNADGE+LA +IGVENM+KLVEG+EE+S VLPSPM+DAYLDALHTN +IE+EPEYLM Sbjct: 248 LGDNADGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDALHTNYMIEFEPEYLM 307 Query: 1094 GDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKELIDI 1273 G+F+ N L+D LEK+KPF+MAYEGI+ QEEWE ++E M +P KE++D Sbjct: 308 GEFDQNPDIDEKPPMPLRDVLEKVKPFIMAYEGIQSQEEWEAAVEETMKNVPLFKEIVDY 367 Query: 1274 YSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQFMDKL 1453 YSGPD +TA +Q++ELERVA T+P AP SVKRF DRAVLSLQSN GWGFDKKCQFMDKL Sbjct: 368 YSGPDRITAKKQEEELERVANTIPASAPASVKRFADRAVLSLQSNPGWGFDKKCQFMDKL 427 Query: 1454 VWEV 1465 V EV Sbjct: 428 VREV 431 >ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis] Length = 407 Score = 303 bits (775), Expect = 2e-79 Identities = 183/416 (43%), Positives = 232/416 (55%), Gaps = 19/416 (4%) Frame = +2 Query: 275 GRGKPIPSNPVLPS-----FSSWISSTKSVVGRGRITQHQQQPPNSRSEESQTVQPKKPI 439 GR P+N ++P+ + + + GRGR++ P S ++Q +P+ Sbjct: 6 GRRISNPNNFIIPNNFFLLYGQGGCTVQQGAGRGRVS-FASDPNESPRPDAQPAKPRT-- 62 Query: 440 FFC--REDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPGAGRGKGVKFVDSEEK----- 598 C E + +STQ P E P S+ S LPGAGRGK +++ Sbjct: 63 --CTPNESATDSTQ----------PSEPNLPSSIISTLPGAGRGKTAVTQQQQQQQQHQR 110 Query: 599 ------PIEENRHTXXXXXXXXXXXXKDQKSDRSSS-KLSREDAVKNAVRILSXXXXXXX 757 P EENRH S+ KLS+EDAVK A+++LS Sbjct: 111 QQPGPPPQEENRHIRARLQPQPRPEKAPAAETGSAQPKLSKEDAVKMAMKVLSRGEEGEG 170 Query: 758 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDFSTGLYLGDNADGE 937 D ED GLYLGDNADGE Sbjct: 171 EGISAGGPGRGRGMGRGRGRGRGRGQGRGRMRRQEME----DDEDGRFGGLYLGDNADGE 226 Query: 938 RLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPEYLMGDFETNXX 1117 +LA+++G E M+ LVEGFEE+S VLPSPM+DAY+DALHTN +IE+EPEYLM +F TN Sbjct: 227 KLAEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPD 286 Query: 1118 XXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKELIDIYSGPDTVT 1297 L+DALEKMKPFLMAYEGI+ QEEWEE + EVM+++P +KE++D YSGPD VT Sbjct: 287 
IDEKPPIPLRDALEKMKPFLMAYEGIQSQEEWEEAVNEVMERVPLLKEIVDHYSGPDRVT 346 Query: 1298 AMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQFMDKLVWEV 1465 A QQ +ELERVAKT+P+ AP S+KRF +RAVLSLQSN GWGFDKKCQFMDKL WEV Sbjct: 347 AKQQGEELERVAKTIPESAPASIKRFANRAVLSLQSNPGWGFDKKCQFMDKLAWEV 402 >gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus guttatus] Length = 493 Score = 299 bits (765), Expect = 3e-78 Identities = 189/440 (42%), Positives = 237/440 (53%), Gaps = 20/440 (4%) Frame = +2 Query: 206 QTPDNEDAHVEPDEPLFPSGLGHGRGKPIPSNPVLPSFSSWISSTK-SVVGRGR----IT 370 QT N VE P + G G GRG P+PS+PVLPSFSS+++ +K VGRGR Sbjct: 54 QTDKNSKTEVETPPPSY--GHGRGRGTPLPSSPVLPSFSSFLNESKPPPVGRGRGVAIPA 111 Query: 371 QHQQQPPNSRSEESQTVQP------KKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLS 532 PP R ES + +P K P F ++ E Q + + +E + Sbjct: 112 SPTPPPPPPRVSESPSEKPPPKPNVKLPFLFVKD---EEEQADAAESEVPSAQETLLRSD 168 Query: 533 LTSVLPGAGRGKGVK--FVDSEEKPIEENRH-TXXXXXXXXXXXXKDQKSDRSSSKLSRE 703 + SVL GAGRGK K EKP ENRH + + +LS+E Sbjct: 169 IVSVLSGAGRGKPGKPPTAAQPEKPQSENRHIRQRPPQGKPPVAVSSDGAAPPAVQLSKE 228 Query: 704 DAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXD 883 + VK A ILS Sbjct: 229 EMVKKAKEILSKGDEDGGVSRPEVRDNRDNRDNRGGGRGGRGERGRGRGRGRGRGRGRGR 288 Query: 884 SED------DFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLD 1045 +D D S L++GD AD E++AQ++G + M +L EG +E+S+ VLPSP DDAY+D Sbjct: 289 GDDRYEESDDESDALFIGDPADEEKVAQKLGPDVMAQLAEGIDEMSSRVLPSPFDDAYMD 348 Query: 1046 ALHTNNLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIM 1225 A TN IE EPEYLM +F TN L+DALEKMKPFLM YEGIKDQEEWE+I+ Sbjct: 349 AFETNLRIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMVYEGIKDQEEWEKII 408 Query: 1226 KEVMDKLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQS 1405 +E M +P +KE++D YSGPD VTA QQ +ELERVAKTLP AP SVKRFT+RA+LSLQS Sbjct: 409 EETMKDVPLIKEIVDHYSGPDRVTAKQQNEELERVAKTLPASAPASVKRFTERALLSLQS 468 Query: 1406 NAGWGFDKKCQFMDKLVWEV 1465 N GWGFDKKCQFMDK++ EV Sbjct: 469 NPGWGFDKKCQFMDKVIMEV 488 >ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g 
[Phaseolus vulgaris] gi|561020640|gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris] Length = 532 Score = 297 bits (760), Expect = 1e-77 Identities = 202/487 (41%), Positives = 252/487 (51%), Gaps = 59/487 (12%) Frame = +2 Query: 182 PPNPQFSPQTPDNEDAHVEPDEPLFPSGLGHGRGKPIPSNPVLPSFSSWISST------- 340 P P S D ++ + P SG GHGRGKP+P + LPSFSS++SS Sbjct: 57 PGKPNSSEPKSDTTESPIPPG-----SGHGHGRGKPMPPSG-LPSFSSFLSSINQPPAGR 110 Query: 341 ------------------------------KSVVGRGRIT--QHQQ-------------- 382 +S GRGR T +HQ Sbjct: 111 GRPTVPHHQNDLQSPAGRGRPTVPHHQNDLQSPAGRGRPTVPRHQNDLQSPAGRGRATVP 170 Query: 383 QPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIK-PLSLTSVLPGAG 559 QPPN PKKPIFF RED T + DD + E+ K P ++ VL G G Sbjct: 171 QPPNDLGPPDSG--PKKPIFFKREDIASPTTR---DDFPIDVEQANKLPGNIIEVLSGLG 225 Query: 560 RGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKNAVRILSX 739 RGK +K D E + EENRH D +R SR+DAV+NA LS Sbjct: 226 RGKPMKQSDPETRVTEENRHLRAPRARGAAA--SDTLYERQPIP-SRDDAVRNARNFLSQ 282 Query: 740 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-----DSEDDFST 904 D+E Sbjct: 283 GEDDVGGTGRGRGFRERGGLGRGRGRGRGRGRGTGRGGFRGRDMDERRGRFMDAEASDDI 342 Query: 905 GLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPE 1084 G Y+GD+ADGE+LA+++G E M++L EGFEE++ VLPSP++D YLDAL N IE+EPE Sbjct: 343 GPYVGDDADGEKLAKKVGPEIMNQLTEGFEEMAGRVLPSPLEDEYLDALDINYAIEFEPE 402 Query: 1085 YLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKEL 1264 YL+ +F+ N L+DALEKMKPFLMAYEGI+ QEEWEEIM+E M ++P +KE+ Sbjct: 403 YLV-EFD-NPDIDEKEPIPLRDALEKMKPFLMAYEGIQSQEEWEEIMEETMAQVPLLKEI 460 Query: 1265 IDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQFM 1444 +D YSGPD VTA +QQ+ELERVAKTLP+ AP SVK+FT+RAV+SLQSN GWGFDKKC FM Sbjct: 461 VDHYSGPDRVTAKKQQEELERVAKTLPESAPSSVKQFTNRAVVSLQSNPGWGFDKKCHFM 520 Query: 1445 DKLVWEV 1465 DKLVWEV Sbjct: 521 DKLVWEV 527 >ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [Amborella trichopoda] gi|548839984|gb|ERN00220.1| hypothetical protein AMTR_s00111p00111440 [Amborella 
trichopoda] Length = 447 Score = 294 bits (752), Expect = 1e-76 Identities = 173/385 (44%), Positives = 220/385 (57%), Gaps = 2/385 (0%) Frame = +2 Query: 254 FPS-GLGHGRGKPIPSNPVLPSFSSWISSTKSVVGRGRITQHQQQPPNSRSEESQTVQPK 430 FPS G+GHGRG+PI + P+LPSF+ W+S GRGR + P S Q + Sbjct: 68 FPSPGIGHGRGQPIQTTPILPSFAPWMSGPVPGTGRGRPSS-PLPPQLDHSPNQQEPPSR 126 Query: 431 KPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSV-LPGAGRGKGVKFVDSEEKPIE 607 KPIFF R D +E T + + + P E P S++ + G GRGK + S E Sbjct: 127 KPIFFKR-DEIEGTDEGRVQAQNLPPTESPLPRSISPAPIEGFGRGKPTSPLLSHGIEEE 185 Query: 608 ENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKNAVRILSXXXXXXXXXXXXXXXXX 787 ENRH + + KLS E+AV+NA ILS Sbjct: 186 ENRHIRRRSPPPERAGQASRGRASNERKLSSEEAVRNAKDILSRGEGRGGRGLRGGRGLR 245 Query: 788 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDFSTGLYLGDNADGERLAQRIGVEN 967 D +D S GLYLGD+ADGE+L +R+G EN Sbjct: 246 GGRGRGGVWAGRGRQGRGARYQ---------DRREDDSVGLYLGDDADGEKLVKRLGEEN 296 Query: 968 MDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPEYLMGDFETNXXXXXXXXXXLQ 1147 ++++ E F+E+S VLPSPM++AYLDALHTN LIE+EPEY M +F TN L Sbjct: 297 VNQIFEAFDEMSGRVLPSPMEEAYLDALHTNCLIEFEPEYHMEEFGTNPDIDEKPPIPLC 356 Query: 1148 DALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKELIDIYSGPDTVTAMQQQQELER 1327 DALEK+KPF+M YEGI++QEEWEE++KE MDK+P++KEL+DIYSGPD VTA QQQQELER Sbjct: 357 DALEKIKPFIMTYEGIQNQEEWEEVVKETMDKVPYLKELVDIYSGPDRVTARQQQQELER 416 Query: 1328 VAKTLPDKAPKSVKRFTDRAVLSLQ 1402 VA TLP+ P SVK FT+RAVLSLQ Sbjct: 417 VASTLPENVPSSVKNFTNRAVLSLQ 441 >ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citrus clementina] gi|557544515|gb|ESR55493.1| hypothetical protein CICLE_v10019766mg [Citrus clementina] Length = 511 Score = 293 bits (750), Expect = 2e-76 Identities = 181/417 (43%), Positives = 234/417 (56%), Gaps = 13/417 (3%) Frame = +2 Query: 191 PQFSPQTPDNEDAHVEPDEPLFP-SGLGHGRGKPIPS-NPVLPSFSSWISSTKSVVGRGR 364 P +P P +E P +P P SG GHGRG+P + +P + SFSS++++ KS GRGR Sbjct: 101 PSKAPGQPASESKPDSPPQPQAPPSGSGHGRGQPSAAPSPSISSFSSFLTAVKSGAGRGR 160 Query: 365 
ITQHQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSV 544 ++ P S ++Q +P+ F E + +STQ P E P S+ S Sbjct: 161 VS-FASDPNESPRPDAQPAKPRT--FTPNESATDSTQ----------PSEPNLPSSIIST 207 Query: 545 LPGAGRGKGVKFVDSEEK----------PIEENRHTXXXXXXXXXXXXKDQKSDRSSS-K 691 LPGAGRGK V +++ P EENRH S+ K Sbjct: 208 LPGAGRGKTVVTQQQQQQQHQRQQPGPPPQEENRHIRARLQPQPRPEKAPAAETGSAQPK 267 Query: 692 LSREDAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 871 LS+EDAVK A++ILS Sbjct: 268 LSKEDAVKMAMKILSRGEEGEGEGISAGGPGRGRGMGRGGGRGRGRGQGRGRMRRQEME- 326 Query: 872 XXXDSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDAL 1051 D ED GLYLGDNADGE+LA+++G E M+ LVEGFEE+S VLPSPM+DAY+DAL Sbjct: 327 ---DDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYIDAL 383 Query: 1052 HTNNLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKE 1231 HTN +IE+EPEYLM +F TN L+DALEKMKPFLMAYEGI+ Q+EWEE + E Sbjct: 384 HTNCMIEFEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQKEWEEAVNE 443 Query: 1232 VMDKLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQ 1402 VM+++P +KE++D YSGPD VTA QQ +ELERVAKT+P+ AP S+KRF + AVLSLQ Sbjct: 444 VMERVPLLKEIVDHYSGPDRVTAKQQGEELERVAKTIPESAPASIKRFANHAVLSLQ 500 >ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera] Length = 482 Score = 291 bits (746), Expect = 5e-76 Identities = 138/195 (70%), Positives = 162/195 (83%) Frame = +2 Query: 881 DSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTN 1060 D++DD+ GLYLGDNAD E+L+ +IG+E M KL E FEE+S VLPSP++DAYLDALHTN Sbjct: 283 DAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTN 342 Query: 1061 NLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMD 1240 LIE+EPEYLM +F TN L+DALEKMKPFLM YEGI+ QEEWEE+MKE M+ Sbjct: 343 CLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETME 402 Query: 1241 KLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWG 1420 +P++KEL+D YSGPD VTA +QQ+ELERVAKTLP+ AP SVKRFTDRA+LSLQSN GWG Sbjct: 403 NVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWG 462 Query: 1421 
FDKKCQFMDKLVWEV 1465 FDKKCQFMDKLVWEV Sbjct: 463 FDKKCQFMDKLVWEV 477 Score = 114 bits (285), Expect = 2e-22 Identities = 77/179 (43%), Positives = 96/179 (53%), Gaps = 2/179 (1%) Frame = +2 Query: 206 QTPDNEDAHVEPDEPLFPSGLGHGRGKPI--PSNPVLPSFSSWISSTKSVVGRGRITQHQ 379 +T D + E E FP GLGHGRGKP PS P LPSFSS+ +ST GRGR+T H Sbjct: 54 KTEPTADPNSESSESPFPLGLGHGRGKPPSQPSAPTLPSFSSF-ASTGIGRGRGRLTAH- 111 Query: 380 QQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPGAG 559 P +S ++S PKKPIFF +ED+ +S KPQ PEE P+S+ S L G G Sbjct: 112 --PTDSVPQQSPDFAPKKPIFFSKEDAADSAPKPQSQLGTTPPEENNLPVSILSALSG-G 168 Query: 560 RGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKNAVRILS 736 G+G + P EENRH + + +LSRE+AVK AV ILS Sbjct: 169 AGRGQPLKQTPAPPKEENRH-LRQPRQPVFRSPQQPVAGPPQPRLSREEAVKKAVGILS 226 >emb|CBI17195.3| unnamed protein product [Vitis vinifera] Length = 209 Score = 291 bits (746), Expect = 5e-76 Identities = 138/195 (70%), Positives = 162/195 (83%) Frame = +2 Query: 881 DSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTN 1060 D++DD+ GLYLGDNAD E+L+ +IG+E M KL E FEE+S VLPSP++DAYLDALHTN Sbjct: 10 DAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTN 69 Query: 1061 NLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMD 1240 LIE+EPEYLM +F TN L+DALEKMKPFLM YEGI+ QEEWEE+MKE M+ Sbjct: 70 CLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETME 129 Query: 1241 KLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWG 1420 +P++KEL+D YSGPD VTA +QQ+ELERVAKTLP+ AP SVKRFTDRA+LSLQSN GWG Sbjct: 130 NVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWG 189 Query: 1421 FDKKCQFMDKLVWEV 1465 FDKKCQFMDKLVWEV Sbjct: 190 FDKKCQFMDKLVWEV 204 >ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550322664|gb|EEF06007.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 466 Score = 288 bits (737), Expect = 6e-75 Identities = 182/434 (41%), Positives = 234/434 (53%), Gaps = 12/434 (2%) Frame = +2 Query: 200 
SPQTPDNEDAHVEPDEPLFPSGLGHGRGKPIPSNPVLPSFSSWISSTKSV---VGRGRIT 370 +P PD +++ E E PSGLGHGRGKP+ + P+LP+FS++ISS K+ GRGR T Sbjct: 61 APGKPDLDESKTESSESQ-PSGLGHGRGKPVGTGPILPAFSTFISSVKNSQPGAGRGRGT 119 Query: 371 QHQQQPPNSRSEES--QTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSV 544 +P SRS ES ++ PKK E P S+ S Sbjct: 120 T---EPGPSRSTESRPESEPPKKA-------------------------EANLPPSILSG 151 Query: 545 LPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDR------SSSKLSRED 706 L GAGRGK VK E EENRH + QK+ +++K+ R++ Sbjct: 152 LGGAGRGKPVKQEVPIEPAKEENRHLRARSQPRSQPRTRQQKTPDGDDAVPATTKMGRQE 211 Query: 707 AVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDS 886 AVK A+ +LS D Sbjct: 212 AVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGARGGGRGRGRGRRGYG----DK 267 Query: 887 EDDFSTGLYL-GDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNN 1063 E ++ +G+ L G D E+ AQ +GVE M+ LVE FEE+S VLP P++D Y+DA TN Sbjct: 268 EVEYGSGMSLEGHEEDEEKFAQSVGVETMNTLVEAFEEMSGRVLPCPIEDEYVDAFDTNC 327 Query: 1064 LIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDK 1243 E+EPEYLMG+F+ N L+DALEK+KPF+MAY GIK EEWEEI++E M Sbjct: 328 SFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEKVKPFMMAYMGIKTHEEWEEIVEETMKD 387 Query: 1244 LPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGF 1423 P MK+++D YSGPD V+ +Q++ELERVAKT+P AP SVK F DRAVLSLQSN GWGF Sbjct: 388 APLMKKIVDSYSGPDRVSGKKQKEELERVAKTIPASAPDSVKSFADRAVLSLQSNPGWGF 447 Query: 1424 DKKCQFMDKLVWEV 1465 DKKC FMDKL EV Sbjct: 448 DKKCMFMDKLAKEV 461 >ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica] gi|462409156|gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica] Length = 428 Score = 276 bits (706), Expect = 2e-71 Identities = 132/195 (67%), Positives = 162/195 (83%) Frame = +2 Query: 881 DSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTN 1060 DS+ +++GLYLGDNADGE+LA+++G E M+KLVE FEE+S+ VLPSP+DDAY+DA+HTN Sbjct: 229 DSDGSYASGLYLGDNADGEKLAKKLGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAMHTN 288 Query: 1061 NLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMD 1240 +IE 
EPEYLMG+F N L+DALEKMKPFLMAYE I+ QEEWEE++ E M+ Sbjct: 289 FMIECEPEYLMGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIESQEEWEEVVNETME 348 Query: 1241 KLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWG 1420 ++P +KE++D YSGPD VTA +QQ+ELERVAKTLP K P SVKRFTDRAVLSLQSN GWG Sbjct: 349 RVPLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSNPGWG 408 Query: 1421 FDKKCQFMDKLVWEV 1465 FD+KCQFMDKLV +V Sbjct: 409 FDRKCQFMDKLVAKV 423 Score = 71.6 bits (174), Expect = 1e-09 Identities = 54/147 (36%), Positives = 69/147 (46%), Gaps = 4/147 (2%) Frame = +2 Query: 191 PQFSPQTPDNEDAHVEPDEPLFPSGLGHGRGKPIPSNPVLPSFSSWISSTKSVVGRGRIT 370 P P PD++D +PD P GLGHGRGKP LP+FSS++S+ K G GR Sbjct: 60 PPRVPGQPDSDDP--KPDPPPSAPGLGHGRGKP------LPTFSSFVSAIKPNSGTGRGQ 111 Query: 371 QHQ-QQPPNSR---SEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLT 538 Q Q P SR + ++ +P KPIFF R D + Sbjct: 112 PSQVQSIPESRDPVAPDAGPSKPIKPIFFVRGDGSD------------------------ 147 Query: 539 SVLPGAGRGKGVKFVDSEEKPIEENRH 619 LPG+GRGK + F E + EENRH Sbjct: 148 PALPGSGRGKPMNFTRPEVQVKEENRH 174 >gb|EXB62642.1| hypothetical protein L484_023937 [Morus notabilis] Length = 442 Score = 263 bits (673), Expect = 2e-67 Identities = 171/440 (38%), Positives = 229/440 (52%), Gaps = 15/440 (3%) Frame = +2 Query: 182 PPNPQFSPQTPDNEDAHVEPDEPLFPS---------GLGHGRGKPIPS-NPVLPSFSSWI 331 PP FS TP + PL P G G GRG+P+P +P++PSFSS I Sbjct: 42 PPRSDFS--TPPRAPGQPPDEAPLTPQEASPLSHDHGRGRGRGQPLPPVSPIIPSFSSSI 99 Query: 332 SSTKSVVGRGRITQHQQQPPNSRSEESQTVQPKKPIFFCRE-DSLESTQKPQFDDSDRNP 508 SS GRGR +TV P P +E +E DD P Sbjct: 100 SSG---AGRGR-----------GGSSFKTVLPPPPPLPQQEIQDMEEAPPVAVDDGGSMP 145 Query: 509 EEGIKPLSLTSVLPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDR--- 679 E S+ S+LPG GRG+ K + ++ NRH + + + Sbjct: 146 E------SIASLLPGVGRGQPEKQPEIQQ---HVNRHVQRRWAPESAVVKESKPKEAVAA 196 Query: 680 -SSSKLSREDAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 856 ++ K+S+E+A+K+A+ + S Sbjct: 197 SAAPKMSQEEALKHAMEVFSRNEANGGRGRGRGRGRGRGRGRGRFVK------------- 243 Query: 857 
XXXXXXXXDSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDA 1036 + ED+ T L +GD+ADGERLAQR+G E L E FEE+ ++P+ +D+ Sbjct: 244 --------EEEDEKDTWLNVGDDADGERLAQRLGPEKTSVLTEAFEEMGEKLIPA-IDEM 294 Query: 1037 YLDALHTNNLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWE 1216 +LDAL N +E+EPE+LMGD E+N L+DALEK KPFLMAYE I+ QEEWE Sbjct: 295 HLDALDMNFKLEFEPEFLMGDLESNPDIDEKPPIPLRDALEKAKPFLMAYENIESQEEWE 354 Query: 1217 EIMKEVMDKLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLS 1396 EIMKE M+++P +KE++D YSGP+ VT +Q QEL+RV KTLP AP SVK+FT+RAVLS Sbjct: 355 EIMKETMERVPLLKEIVDHYSGPNRVTVKKQHQELDRVTKTLPASAPNSVKQFTERAVLS 414 Query: 1397 LQSNAGWGFDKKCQFMDKLV 1456 LQ+N GWGF +KCQFMDKLV Sbjct: 415 LQNNPGWGFHRKCQFMDKLV 434 >ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300131 [Fragaria vesca subsp. vesca] Length = 464 Score = 260 bits (665), Expect = 1e-66 Identities = 124/195 (63%), Positives = 156/195 (80%) Frame = +2 Query: 881 DSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTN 1060 D + ++GLYLGDNADGE+LA+++G E M++L E FE++ST+VLPSP+DDAY+DAL TN Sbjct: 265 DEDGGIASGLYLGDNADGEKLAEKLGPEVMNQLTEAFEDMSTHVLPSPLDDAYVDALDTN 324 Query: 1061 NLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMD 1240 IE+EPEYLMG+F N L+DALEKMKPFLMAYEGI+ QEEWEE +KE M+ Sbjct: 325 CKIEFEPEYLMGEFNQNPDIDEEPPIPLRDALEKMKPFLMAYEGIQSQEEWEEAIKETME 384 Query: 1241 KLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWG 1420 ++P +K+++D YSGPD VTA +Q++ELERVAKTLP P SVK+FTDRAVLSLQ N GWG Sbjct: 385 RVPLLKKIVDHYSGPDRVTAKKQREELERVAKTLPANVPDSVKQFTDRAVLSLQGNPGWG 444 Query: 1421 FDKKCQFMDKLVWEV 1465 F +KCQFMDKL +V Sbjct: 445 FHRKCQFMDKLTQKV 459 Score = 65.1 bits (157), Expect = 1e-07 Identities = 58/174 (33%), Positives = 82/174 (47%), Gaps = 2/174 (1%) Frame = +2 Query: 218 NEDAHVEPDEP--LFPSGLGHGRGKPIPSNPVLPSFSSWISSTKSVVGRGRITQHQQQPP 391 N A PD+P + +G GHGRGKP+P P P F S I + GRG H + P Sbjct: 55 NHLAEQFPDQPDSVSSTGAGHGRGKPLPQPP--PPFGSGI-RPGAPAGRGH-PGHVRSPG 110 Query: 392 
NSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPGAGRGKG 571 SR E + PKKP+FF RED+ E+ +P ++ +VL GRGK Sbjct: 111 ESR-EGDDSGLPKKPVFFRREDAAEN-----------------RPEAILTVLGVTGRGKP 152 Query: 572 VKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKNAVRIL 733 V + + +EE+R + + + K SRE+AVK+A+ IL Sbjct: 153 VS--GAAVQSVEEDRRIGAPV---------EPRREPRKPKSSREEAVKHAMGIL 195 >tpg|DAA51855.1| TPA: hypothetical protein ZEAMMB73_029894 [Zea mays] Length = 455 Score = 256 bits (655), Expect = 2e-65 Identities = 163/419 (38%), Positives = 218/419 (52%), Gaps = 2/419 (0%) Frame = +2 Query: 200 SPQTPDNEDAHVEPDEPLFPSGLGHGRGKP-IPSNPVLPSFSSWISSTKSVVGRGRITQH 376 +P P + D +P P +G GRG+P +PS+P +PSF+ + S VGRGR Sbjct: 53 APGRPISNDDDADPFSATAP--VGRGRGEPEVPSSPGIPSFAVF-----SGVGRGRGRGS 105 Query: 377 QQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPGA 556 PP + S+ QP F + P D S P PL + GA Sbjct: 106 PLPPPPPPEDASK--QPTFTKGFDNAPQRSYPEPPSLDASSSAP-----PLPRPLPISGA 158 Query: 557 GRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSS-KLSREDAVKNAVRIL 733 GRG S +KP EENR +++ + KLS ++AV+ AV +L Sbjct: 159 GRGVPWTQQPSPDKPPEENRFIRRREAVKQSAAEPPKQAPGAQQPKLSPQEAVRRAVELL 218 Query: 734 SXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDFSTGLY 913 D +D Y Sbjct: 219 GGGGRSGEDGGGRGGGGRLSRGRGRGTGRGRRPGRGDRS----------DDVEDVWQASY 268 Query: 914 LGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPEYLM 1093 LGD ADG+RL Q++G + M L + F E + N LP PM+DAYL+A HTNN+IE+EPEY + Sbjct: 269 LGDKADGDRLEQQLGEDKMKILEQAFMEAADNALPHPMEDAYLEACHTNNMIEFEPEYHV 328 Query: 1094 GDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKELIDI 1273 N L++ L+K+KPF++AYEGI++QEEWEE +K+VM + PHMKELID+ Sbjct: 329 NF--GNPDIDEKPPMSLEEMLQKVKPFVVAYEGIQNQEEWEEAVKDVMARAPHMKELIDM 386 Query: 1274 YSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQFMDK 1450 YSGPD VTA QQ++EL+RVA TLP+ P SVKRFTD+ +LSL++N GWGFDKKCQFMDK Sbjct: 387 YSGPDVVTAKQQEEELQRVANTLPESIPSSVKRFTDKTLLSLKNNPGWGFDKKCQFMDK 445 >ref|XP_002466313.1| hypothetical protein SORBIDRAFT_01g005470 [Sorghum bicolor] 
gi|241920167|gb|EER93311.1| hypothetical protein SORBIDRAFT_01g005470 [Sorghum bicolor] Length = 458 Score = 255 bits (651), Expect = 5e-65 Identities = 159/419 (37%), Positives = 225/419 (53%), Gaps = 2/419 (0%) Frame = +2 Query: 200 SPQTPDNEDAHVEPDEPLFPSGLGHGRGKP-IPSNPVLPSFSSWISSTKSVVGRGRITQH 376 +P P ++D +P + +G GRG+P +PS+P +PSF+ + S VGRGR + Sbjct: 57 APGRPISDDDGADPFSAT--ASVGRGRGEPAVPSSPSIPSFAVF-----SGVGRGRGSPL 109 Query: 377 QQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPGA 556 PP + PK+P R D+ + Q+P + + PL T GA Sbjct: 110 PPPPPPEDA-------PKQPTLTKRFDN--APQRPDPEPPSLDASSSAPPLPRTLPFSGA 160 Query: 557 GRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSS-KLSREDAVKNAVRIL 733 GRG + +KP EENR +++ + KLS+++AV AV +L Sbjct: 161 GRGVPRMQQPAPDKPQEENRFIRRREAAKQAAAVPAKQAPAAQQPKLSQQEAVDRAVELL 220 Query: 734 SXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDFSTGLY 913 D EDD +Y Sbjct: 221 GGGDRSGEDGGGRGGRGRGFRGRGPGRGRFRGRGRSDDRSV--------DVEDD-RQAIY 271 Query: 914 LGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPEYLM 1093 LGDNADG+RL +R+G + M+ L + F E + N LP P++D YL+A HTN++IE+EPEY + Sbjct: 272 LGDNADGDRLEKRLGKDKMEILEQAFMEAADNALPDPVEDGYLEAFHTNSMIEFEPEYHV 331 Query: 1094 GDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKELIDI 1273 N L++ L+K+KPF++A+EGI++QEEWEE +K+VM + PHMKELID+ Sbjct: 332 NF--GNPDIDEKPPMSLEEMLQKVKPFIVAFEGIQNQEEWEESVKDVMARAPHMKELIDM 389 Query: 1274 YSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQFMDK 1450 SGPD VTA QQ++EL+RVA TLP+ P SVKRFTD+ +LSL++N GWGFDKKCQFMDK Sbjct: 390 CSGPDVVTAKQQEEELQRVANTLPESIPSSVKRFTDKTLLSLKNNPGWGFDKKCQFMDK 448 >ref|XP_004965630.1| PREDICTED: CCAAT/enhancer-binding protein alpha-like [Setaria italica] Length = 457 Score = 252 bits (644), Expect = 4e-64 Identities = 159/424 (37%), Positives = 212/424 (50%), Gaps = 1/424 (0%) Frame = +2 Query: 182 PPNPQFSPQTPDNEDAHVEPDEPLFPSGLGHGRGKPI-PSNPVLPSFSSWISSTKSVVGR 358 P P +P ++D +P P+G GRG+P PS+ +PSF++ S VGR Sbjct: 49 PSGPPRAPGRTISDDDGADPFSAAAPAG--RGRGEPAAPSSATIPSFAA-----SSGVGR 
101 Query: 359 GRITQHQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLT 538 GR + PP + PK+P R D + P+ S PL Sbjct: 102 GRGSPLPPPPPPEDA-------PKQPTLTKRFDDAPPRRDPE-PPSPEASSSSAPPLPRA 153 Query: 539 SVLPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKN 718 GAGRG +KP EENR KLS EDAVK Sbjct: 154 LPFTGAGRGVPRMQQPPVDKPPEENRFIRRREAAKQAAVGPTSAPGPQQPKLSGEDAVKR 213 Query: 719 AVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDF 898 A+ +L D Sbjct: 214 ALELLGGGGGGRGGGRGDEDGGGRGGRGRGFRGRGRGRGRTRDDRRSVDL--------DD 265 Query: 899 STGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYE 1078 +YLGDNADGE+L +++G + M L + F E + N LP PM++AY +A HTNN+IE+E Sbjct: 266 RQAIYLGDNADGEKLEKKLGEDKMKILEQAFMEAADNALPHPMENAYQEACHTNNMIEFE 325 Query: 1079 PEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMK 1258 P+Y + N L++ L+K+KPF++AYEGI++QEEWEE +K+VM + PHMK Sbjct: 326 PQYHVNF--ANPDIDEKPQMSLEEMLQKVKPFIVAYEGIQNQEEWEEAVKDVMARAPHMK 383 Query: 1259 ELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQ 1438 ELID+YSGPD VTA QQ++EL+RVA TLP+ P SVKRFTD+ +LSL++N GWGFDKKCQ Sbjct: 384 ELIDMYSGPDVVTAKQQEEELQRVANTLPENIPSSVKRFTDKTLLSLKNNPGWGFDKKCQ 443 Query: 1439 FMDK 1450 FMDK Sbjct: 444 FMDK 447