BLASTX nr result

ID: Akebia23_contig00014798 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00014798
         (1856 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...   377   e-102
ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...   372   e-100
ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family prot...   341   6e-91
ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [...   332   5e-88
ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215...   330   1e-87
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...   312   4e-82
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...   303   2e-79
gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus...   299   3e-78
ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phas...   297   1e-77
ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [A...   294   1e-76
ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citr...   293   2e-76
ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253...   291   5e-76
emb|CBI17195.3| unnamed protein product [Vitis vinifera]              291   5e-76
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr...   288   6e-75
ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prun...   276   2e-71
gb|EXB62642.1| hypothetical protein L484_023937 [Morus notabilis]     263   2e-67
ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300...   260   1e-66
tpg|DAA51855.1| TPA: hypothetical protein ZEAMMB73_029894 [Zea m...   256   2e-65
ref|XP_002466313.1| hypothetical protein SORBIDRAFT_01g005470 [S...   255   5e-65
ref|XP_004965630.1| PREDICTED: CCAAT/enhancer-binding protein al...   252   4e-64

>ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum
            lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED:
            uncharacterized protein LOC101247662 isoform 2 [Solanum
            lycopersicum]
          Length = 473

 Score =  377 bits (969), Expect = e-102
 Identities = 218/434 (50%), Positives = 264/434 (60%), Gaps = 5/434 (1%)
 Frame = +2

Query: 179  DPPNPQFSPQTPDNEDAHVEPDEPLFPSGLGHGRG--KPIPSNPVLPSFSSWISSTKSVV 352
            D PN  FSP    +ED+  E   P  PSG GHGRG  KP+PS+P++PSF S++ +  +  
Sbjct: 43   DSPNFGFSPGKSASEDSKPESSTPATPSGTGHGRGRGKPLPSSPIVPSFHSFVDNPNTPA 102

Query: 353  GRGRITQHQQQPPNSRSEESQTVQP-KKPIFFCREDSLESTQKPQFDDSDRNPEEGIK-P 526
            GRGR       PP    ++ Q  QP +KPIFF +E+  E+T       +   P +    P
Sbjct: 103  GRGRGGIGPFSPPPQPQQQQQ--QPLRKPIFFAKEE--ETTDSNSSSSNAPKPRDDSNLP 158

Query: 527  LSLTSVLPGAGRGKGVKFVDS-EEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSRE 703
             S+ SVL GAGRGK ++   S  EKP EENRH              ++ S     +LSRE
Sbjct: 159  SSVISVLTGAGRGKPLQTASSVSEKPKEENRHLRPRQQKVADSG--ERASSPPPQRLSRE 216

Query: 704  DAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXD 883
            DAVK AV ILS                                                D
Sbjct: 217  DAVKKAVGILSRSDDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDEERGD 276

Query: 884  SEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNN 1063
               +  +G YLGD+ADGE+LA ++G E+M+ L EGFEE+S  VLPSPMDDAYL+ALHTN 
Sbjct: 277  G--NLESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLPSPMDDAYLEALHTNM 334

Query: 1064 LIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDK 1243
            +IE EPEYLMGDFE+N          L+DALEKMKPFLMAYEGIKDQEEWEE++KE M+ 
Sbjct: 335  MIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKETMET 394

Query: 1244 LPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGF 1423
            +P MKE++D YSGPD VTA QQQQELERVAKTLP+ AP SVKRFT+RAVLSLQSN GWGF
Sbjct: 395  VPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGWGF 454

Query: 1424 DKKCQFMDKLVWEV 1465
            DKKCQFMDK+V EV
Sbjct: 455  DKKCQFMDKVVMEV 468


>ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum]
          Length = 480

 Score =  372 bits (954), Expect = e-100
 Identities = 214/435 (49%), Positives = 259/435 (59%), Gaps = 7/435 (1%)
 Frame = +2

Query: 179  DPPNPQFSPQTPDNEDAHVEPDEPLFPSGLGHGRG--KPIPSNPVLPSFSSWISSTKSVV 352
            D PN  FSP    +ED+  E   P  PSG GHGRG  KP+PS+P++PSF S + +     
Sbjct: 43   DFPNFGFSPGKSASEDSKPESSTPTTPSGTGHGRGRGKPLPSSPIVPSFYSVVDNPNPPA 102

Query: 353  GRGRITQHQ-QQPPNSRSEESQTVQP-KKPIFFCREDSLESTQKPQFDDSDRNPEEGIKP 526
            GRGR        PP  + ++ Q  QP +KPIFF +E+    +     D      +  +  
Sbjct: 103  GRGRGGIGPFSPPPQPQQQQQQQQQPLRKPIFFAKEEETADSNSSSSDAPTPRDDSNLSS 162

Query: 527  LSLTSVLPGAGRGKGVKFVDS-EEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSRE 703
             S+ SVL GAGRGK ++      EKP EENRH              ++ S     +LSRE
Sbjct: 163  -SVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQKVADSG--ERASSPPPQRLSRE 219

Query: 704  DAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXD 883
            DAVK AV ILS                                                +
Sbjct: 220  DAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDE 279

Query: 884  SEDDFS--TGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHT 1057
               D S  +G YLGD+ADGE+LAQ++G E M+ L EGFEE+S  VLPSPMDDAY++ALHT
Sbjct: 280  ERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMSARVLPSPMDDAYIEALHT 339

Query: 1058 NNLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVM 1237
            N +IE EPEYLMGDFE+N          L+DALEKMKPFLMAYEGIKDQEEWEE++KE M
Sbjct: 340  NMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKETM 399

Query: 1238 DKLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGW 1417
            + +P MKE++D YSGPD VTA QQQQELERVAKTLP+ AP SVKRFT+RAVLSLQSN GW
Sbjct: 400  ETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGW 459

Query: 1418 GFDKKCQFMDKLVWE 1462
            GFDKKCQFMDK+V E
Sbjct: 460  GFDKKCQFMDKVVME 474


>ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|508784903|gb|EOY32159.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 474

 Score =  341 bits (875), Expect = 6e-91
 Identities = 200/434 (46%), Positives = 254/434 (58%), Gaps = 7/434 (1%)
 Frame = +2

Query: 185  PNPQFSPQTPDNEDAHVEPDEPLFPSGLGHGRGK--PIPSNPVLPSFSSWISSTKSVVGR 358
            P P  S     N D+   P     P+G+GHGRG+  P+ S+P+   FSS++S T S  GR
Sbjct: 60   PPPGKSGSGDSNRDSAESP-----PAGVGHGRGRGGPLSSDPIPHPFSSFVSQTGS--GR 112

Query: 359  GRITQHQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSL- 535
            GR+T     PP          Q K+PIF  ++D  E+    +         E I P ++ 
Sbjct: 113  GRVTSESVPPP-----PPPPAQAKQPIFIKKKDEDETESSAKAAAEPIQSSEPIFPPNIL 167

Query: 536  -TSVLPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAV 712
              SVL GAGRGK VK  +   +  EENRH             +  +    S+++S+E+A 
Sbjct: 168  PVSVLSGAGRGKPVKQPEPASRRQEENRHI------------RVAQQQSPSAQMSQEEAT 215

Query: 713  KNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX---D 883
            K A+ ILS                                                   D
Sbjct: 216  KKAMGILSRRSESGESGMVGRGGRASMGMGGGRGRGRGRGRGMGRGRGRRQGEDTRIVKD 275

Query: 884  SEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNN 1063
            S +  + GLYLGDNADGE+ AQ IG +NM+KLVEGFEE+ + VLPSPMDDAYLDALHTN 
Sbjct: 276  SGEGSADGLYLGDNADGEKFAQTIGADNMNKLVEGFEEMGSRVLPSPMDDAYLDALHTNC 335

Query: 1064 LIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDK 1243
             IE+EPEYLM +F TN          L+DALEKMKPFLMAYEGI+ QEEWEE++KE M++
Sbjct: 336  SIEFEPEYLMEEFGTNPDIDEKPPMPLRDALEKMKPFLMAYEGIQSQEEWEEVIKETMER 395

Query: 1244 LPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGF 1423
            +P ++E++D YSGPD VTA +QQ+ELERVAKT+P++AP SVK+F +RAVLSLQSN GWGF
Sbjct: 396  VPLLQEIVDYYSGPDRVTAKKQQEELERVAKTIPERAPSSVKQFANRAVLSLQSNPGWGF 455

Query: 1424 DKKCQFMDKLVWEV 1465
            DKKCQFMDKLVWEV
Sbjct: 456  DKKCQFMDKLVWEV 469


>ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [Glycine max]
            gi|571476117|ref|XP_006586864.1| PREDICTED: la-related
            protein 1 isoform X2 [Glycine max]
          Length = 481

 Score =  332 bits (850), Expect = 5e-88
 Identities = 201/439 (45%), Positives = 260/439 (59%), Gaps = 13/439 (2%)
 Frame = +2

Query: 188  NPQFSPQTPDNEDAHVEPDEPLFP--SGLGHGRGKPIPSNPVLPSFSSWISS-TKSVVGR 358
            N + +P  P++ ++  +  EP  P  SGLGHGRGKP+P +  LPSFSS+ISS  +   GR
Sbjct: 52   NNERAPVEPNSSESKSDTTEPPIPPGSGLGHGRGKPMPPSG-LPSFSSFISSINQPPAGR 110

Query: 359  GRIT----QHQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPE---EG 517
            GR T    QH  QPP+S         PKKPIFF REDS+  T    F    R+ +   + 
Sbjct: 111  GRGTAPHPQHDLQPPDSG--------PKKPIFFKREDSVSPTASNDFLPPKRSVDHAHDN 162

Query: 518  IKPLSLTSVLPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLS 697
              P S+  VL G GRGK +K  D E +  EENRH               +   + S   S
Sbjct: 163  KLPGSIPGVLSGLGRGKSMKQPDLETQVTEENRHLRTRQAPGAA---SSETVPKRSPIPS 219

Query: 698  REDAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 877
            +EDA +NA++ILS                                               
Sbjct: 220  QEDATRNALKILSHGKDDGSDTGRGREYGGRGGLDRGRGRGRGRGRGRGMGRGRFVERDV 279

Query: 878  XDS---EDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDA 1048
             +     DD++TGLY GD+ADGE+LA+++G E M++L EGFEE+++ VLPSP++D +LDA
Sbjct: 280  DEKVMDTDDYATGLYAGDDADGEKLARKVGPEIMNQLTEGFEEMTSRVLPSPLEDEFLDA 339

Query: 1049 LHTNNLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMK 1228
            L  N  IE+EPEYL+ +F+ N          L+DALEK KPFLM+YEGI+ QEEWEEIM+
Sbjct: 340  LDINYAIEFEPEYLV-EFD-NPDIDEKEPISLRDALEKAKPFLMSYEGIQSQEEWEEIME 397

Query: 1229 EVMDKLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSN 1408
            E M ++P +K++ID YSGPD VTA +QQ+ELERVAKTLP   P SVK+FT+RAV+SLQSN
Sbjct: 398  ETMARVPLLKKIIDHYSGPDRVTAKKQQEELERVAKTLPGSVPSSVKQFTNRAVISLQSN 457

Query: 1409 AGWGFDKKCQFMDKLVWEV 1465
             GWGFDKKC FMDKLVWEV
Sbjct: 458  PGWGFDKKCHFMDKLVWEV 476


>ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215545 [Cucumis sativus]
            gi|449502143|ref|XP_004161555.1| PREDICTED:
            uncharacterized protein LOC101224016 [Cucumis sativus]
          Length = 478

 Score =  330 bits (847), Expect = 1e-87
 Identities = 193/430 (44%), Positives = 241/430 (56%), Gaps = 5/430 (1%)
 Frame = +2

Query: 182  PPNP-QFSPQTPDNEDAHVEPDEPLFPS---GLGHGRGKPIPSNPVLPSFSSWISSTK-S 346
            P  P  F+P  P+ E ++    EP+      GLGHGRGKP PS+P+ PSFSS+  S + S
Sbjct: 47   PSGPFDFTPPVPNQEHSNASKQEPIDSRPTPGLGHGRGKPTPSSPLRPSFSSFSPSVRPS 106

Query: 347  VVGRGRITQHQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKP 526
             VGRGR        P+ RS      +PKKP+FF + ++ +S          R   E   P
Sbjct: 107  SVGRGR----GDASPSIRSPPEPDSEPKKPVFFSKNNAGDSAASTSLGGLHRVSGERNLP 162

Query: 527  LSLTSVLPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSRED 706
             SL S   G GRGK +K    E++P +ENRH             +  +      ++ R +
Sbjct: 163  ESLHSEFSGVGRGKPMKQPVPEDQPKQENRHLRPRQEGDGPGAGERGRGRGFEPRIGRGE 222

Query: 707  AVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDS 886
              +N  R++S                                                D 
Sbjct: 223  PWRNTNRMVSKDGPDGEVGGGRGTSGYRGRGARGPYRRGARGSFRTGERRERRSGH--DK 280

Query: 887  EDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNL 1066
            ED ++ GLYLG+N DGERLA+RIG ENM+KLVEGFEE+S  VLPSP+ D YLD + TN +
Sbjct: 281  EDGYAAGLYLGNNEDGERLAKRIGTENMNKLVEGFEEMSGRVLPSPLVDQYLDGMDTNFM 340

Query: 1067 IEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKL 1246
            IE EPEYLMGDFE N          L+DALEKMKPFLMAYE I+  EEWEEI++E M  +
Sbjct: 341  IECEPEYLMGDFENNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIVEETMQSV 400

Query: 1247 PHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFD 1426
            P +KE++D Y GPD VTA +QQ ELERVAKTLP  AP SVK+FT+R VLSLQSN GWGFD
Sbjct: 401  PLLKEIVDAYGGPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRVVLSLQSNPGWGFD 460

Query: 1427 KKCQFMDKLV 1456
            KK Q MDKLV
Sbjct: 461  KKWQLMDKLV 470


>ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis]
            gi|223537066|gb|EEF38701.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 436

 Score =  312 bits (799), Expect = 4e-82
 Identities = 185/424 (43%), Positives = 237/424 (55%), Gaps = 3/424 (0%)
 Frame = +2

Query: 203  PQTPDNEDAHVEPDEPLFP---SGLGHGRGKPIPSNPVLPSFSSWISSTKSVVGRGRITQ 373
            P++ D    H  P  P  P   +G+GHG G     NP+LP+FSS++SS    +GRGR   
Sbjct: 62   PESSDVAKPHYPPPPPPPPPPRNGVGHGHGG---GNPILPAFSSFVSS----IGRGRAIT 114

Query: 374  HQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPG 553
              +  P+ +  ESQ+                               + + P ++ S L G
Sbjct: 115  DPEPGPSRQPTESQS-------------------------------DSVLPSTIHSSLSG 143

Query: 554  AGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKNAVRIL 733
             GRG+  K V    +  EENRH              ++   R+  K+SRE+AVK AV IL
Sbjct: 144  FGRGEPDKPVVPTPQVKEENRHIRDRSRAKPKT---EEAEVRAKPKISREEAVKRAVSIL 200

Query: 734  SXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDFSTGLY 913
            S                                                D ++ F +GL+
Sbjct: 201  SQGDTGEGMGRGRGGGRGRGRGRGRGRLEQRGRMMD-------------DVDEGFGSGLF 247

Query: 914  LGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPEYLM 1093
            LGDNADGE+LA +IGVENM+KLVEG+EE+S  VLPSPM+DAYLDALHTN +IE+EPEYLM
Sbjct: 248  LGDNADGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDALHTNYMIEFEPEYLM 307

Query: 1094 GDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKELIDI 1273
            G+F+ N          L+D LEK+KPF+MAYEGI+ QEEWE  ++E M  +P  KE++D 
Sbjct: 308  GEFDQNPDIDEKPPMPLRDVLEKVKPFIMAYEGIQSQEEWEAAVEETMKNVPLFKEIVDY 367

Query: 1274 YSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQFMDKL 1453
            YSGPD +TA +Q++ELERVA T+P  AP SVKRF DRAVLSLQSN GWGFDKKCQFMDKL
Sbjct: 368  YSGPDRITAKKQEEELERVANTIPASAPASVKRFADRAVLSLQSNPGWGFDKKCQFMDKL 427

Query: 1454 VWEV 1465
            V EV
Sbjct: 428  VREV 431


>ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis]
          Length = 407

 Score =  303 bits (775), Expect = 2e-79
 Identities = 183/416 (43%), Positives = 232/416 (55%), Gaps = 19/416 (4%)
 Frame = +2

Query: 275  GRGKPIPSNPVLPS-----FSSWISSTKSVVGRGRITQHQQQPPNSRSEESQTVQPKKPI 439
            GR    P+N ++P+     +     + +   GRGR++     P  S   ++Q  +P+   
Sbjct: 6    GRRISNPNNFIIPNNFFLLYGQGGCTVQQGAGRGRVS-FASDPNESPRPDAQPAKPRT-- 62

Query: 440  FFC--REDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPGAGRGKGVKFVDSEEK----- 598
              C   E + +STQ          P E   P S+ S LPGAGRGK       +++     
Sbjct: 63   --CTPNESATDSTQ----------PSEPNLPSSIISTLPGAGRGKTAVTQQQQQQQQHQR 110

Query: 599  ------PIEENRHTXXXXXXXXXXXXKDQKSDRSSS-KLSREDAVKNAVRILSXXXXXXX 757
                  P EENRH                    S+  KLS+EDAVK A+++LS       
Sbjct: 111  QQPGPPPQEENRHIRARLQPQPRPEKAPAAETGSAQPKLSKEDAVKMAMKVLSRGEEGEG 170

Query: 758  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDFSTGLYLGDNADGE 937
                                                     D ED    GLYLGDNADGE
Sbjct: 171  EGISAGGPGRGRGMGRGRGRGRGRGQGRGRMRRQEME----DDEDGRFGGLYLGDNADGE 226

Query: 938  RLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPEYLMGDFETNXX 1117
            +LA+++G E M+ LVEGFEE+S  VLPSPM+DAY+DALHTN +IE+EPEYLM +F TN  
Sbjct: 227  KLAEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPD 286

Query: 1118 XXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKELIDIYSGPDTVT 1297
                    L+DALEKMKPFLMAYEGI+ QEEWEE + EVM+++P +KE++D YSGPD VT
Sbjct: 287  IDEKPPIPLRDALEKMKPFLMAYEGIQSQEEWEEAVNEVMERVPLLKEIVDHYSGPDRVT 346

Query: 1298 AMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQFMDKLVWEV 1465
            A QQ +ELERVAKT+P+ AP S+KRF +RAVLSLQSN GWGFDKKCQFMDKL WEV
Sbjct: 347  AKQQGEELERVAKTIPESAPASIKRFANRAVLSLQSNPGWGFDKKCQFMDKLAWEV 402


>gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus guttatus]
          Length = 493

 Score =  299 bits (765), Expect = 3e-78
 Identities = 189/440 (42%), Positives = 237/440 (53%), Gaps = 20/440 (4%)
 Frame = +2

Query: 206  QTPDNEDAHVEPDEPLFPSGLGHGRGKPIPSNPVLPSFSSWISSTK-SVVGRGR----IT 370
            QT  N    VE   P +  G G GRG P+PS+PVLPSFSS+++ +K   VGRGR      
Sbjct: 54   QTDKNSKTEVETPPPSY--GHGRGRGTPLPSSPVLPSFSSFLNESKPPPVGRGRGVAIPA 111

Query: 371  QHQQQPPNSRSEESQTVQP------KKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLS 532
                 PP  R  ES + +P      K P  F ++   E  Q    +    + +E +    
Sbjct: 112  SPTPPPPPPRVSESPSEKPPPKPNVKLPFLFVKD---EEEQADAAESEVPSAQETLLRSD 168

Query: 533  LTSVLPGAGRGKGVK--FVDSEEKPIEENRH-TXXXXXXXXXXXXKDQKSDRSSSKLSRE 703
            + SVL GAGRGK  K       EKP  ENRH                  +   + +LS+E
Sbjct: 169  IVSVLSGAGRGKPGKPPTAAQPEKPQSENRHIRQRPPQGKPPVAVSSDGAAPPAVQLSKE 228

Query: 704  DAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXD 883
            + VK A  ILS                                                 
Sbjct: 229  EMVKKAKEILSKGDEDGGVSRPEVRDNRDNRDNRGGGRGGRGERGRGRGRGRGRGRGRGR 288

Query: 884  SED------DFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLD 1045
             +D      D S  L++GD AD E++AQ++G + M +L EG +E+S+ VLPSP DDAY+D
Sbjct: 289  GDDRYEESDDESDALFIGDPADEEKVAQKLGPDVMAQLAEGIDEMSSRVLPSPFDDAYMD 348

Query: 1046 ALHTNNLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIM 1225
            A  TN  IE EPEYLM +F TN          L+DALEKMKPFLM YEGIKDQEEWE+I+
Sbjct: 349  AFETNLRIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMVYEGIKDQEEWEKII 408

Query: 1226 KEVMDKLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQS 1405
            +E M  +P +KE++D YSGPD VTA QQ +ELERVAKTLP  AP SVKRFT+RA+LSLQS
Sbjct: 409  EETMKDVPLIKEIVDHYSGPDRVTAKQQNEELERVAKTLPASAPASVKRFTERALLSLQS 468

Query: 1406 NAGWGFDKKCQFMDKLVWEV 1465
            N GWGFDKKCQFMDK++ EV
Sbjct: 469  NPGWGFDKKCQFMDKVIMEV 488


>ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris]
            gi|561020640|gb|ESW19411.1| hypothetical protein
            PHAVU_006G122700g [Phaseolus vulgaris]
          Length = 532

 Score =  297 bits (760), Expect = 1e-77
 Identities = 202/487 (41%), Positives = 252/487 (51%), Gaps = 59/487 (12%)
 Frame = +2

Query: 182  PPNPQFSPQTPDNEDAHVEPDEPLFPSGLGHGRGKPIPSNPVLPSFSSWISST------- 340
            P  P  S    D  ++ + P      SG GHGRGKP+P +  LPSFSS++SS        
Sbjct: 57   PGKPNSSEPKSDTTESPIPPG-----SGHGHGRGKPMPPSG-LPSFSSFLSSINQPPAGR 110

Query: 341  ------------------------------KSVVGRGRIT--QHQQ-------------- 382
                                          +S  GRGR T  +HQ               
Sbjct: 111  GRPTVPHHQNDLQSPAGRGRPTVPHHQNDLQSPAGRGRPTVPRHQNDLQSPAGRGRATVP 170

Query: 383  QPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIK-PLSLTSVLPGAG 559
            QPPN          PKKPIFF RED    T +   DD   + E+  K P ++  VL G G
Sbjct: 171  QPPNDLGPPDSG--PKKPIFFKREDIASPTTR---DDFPIDVEQANKLPGNIIEVLSGLG 225

Query: 560  RGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKNAVRILSX 739
            RGK +K  D E +  EENRH              D   +R     SR+DAV+NA   LS 
Sbjct: 226  RGKPMKQSDPETRVTEENRHLRAPRARGAAA--SDTLYERQPIP-SRDDAVRNARNFLSQ 282

Query: 740  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-----DSEDDFST 904
                                                                D+E     
Sbjct: 283  GEDDVGGTGRGRGFRERGGLGRGRGRGRGRGRGTGRGGFRGRDMDERRGRFMDAEASDDI 342

Query: 905  GLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPE 1084
            G Y+GD+ADGE+LA+++G E M++L EGFEE++  VLPSP++D YLDAL  N  IE+EPE
Sbjct: 343  GPYVGDDADGEKLAKKVGPEIMNQLTEGFEEMAGRVLPSPLEDEYLDALDINYAIEFEPE 402

Query: 1085 YLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKEL 1264
            YL+ +F+ N          L+DALEKMKPFLMAYEGI+ QEEWEEIM+E M ++P +KE+
Sbjct: 403  YLV-EFD-NPDIDEKEPIPLRDALEKMKPFLMAYEGIQSQEEWEEIMEETMAQVPLLKEI 460

Query: 1265 IDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQFM 1444
            +D YSGPD VTA +QQ+ELERVAKTLP+ AP SVK+FT+RAV+SLQSN GWGFDKKC FM
Sbjct: 461  VDHYSGPDRVTAKKQQEELERVAKTLPESAPSSVKQFTNRAVVSLQSNPGWGFDKKCHFM 520

Query: 1445 DKLVWEV 1465
            DKLVWEV
Sbjct: 521  DKLVWEV 527


>ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [Amborella trichopoda]
            gi|548839984|gb|ERN00220.1| hypothetical protein
            AMTR_s00111p00111440 [Amborella trichopoda]
          Length = 447

 Score =  294 bits (752), Expect = 1e-76
 Identities = 173/385 (44%), Positives = 220/385 (57%), Gaps = 2/385 (0%)
 Frame = +2

Query: 254  FPS-GLGHGRGKPIPSNPVLPSFSSWISSTKSVVGRGRITQHQQQPPNSRSEESQTVQPK 430
            FPS G+GHGRG+PI + P+LPSF+ W+S      GRGR +     P    S   Q    +
Sbjct: 68   FPSPGIGHGRGQPIQTTPILPSFAPWMSGPVPGTGRGRPSS-PLPPQLDHSPNQQEPPSR 126

Query: 431  KPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSV-LPGAGRGKGVKFVDSEEKPIE 607
            KPIFF R D +E T + +    +  P E   P S++   + G GRGK    + S     E
Sbjct: 127  KPIFFKR-DEIEGTDEGRVQAQNLPPTESPLPRSISPAPIEGFGRGKPTSPLLSHGIEEE 185

Query: 608  ENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKNAVRILSXXXXXXXXXXXXXXXXX 787
            ENRH               +    +  KLS E+AV+NA  ILS                 
Sbjct: 186  ENRHIRRRSPPPERAGQASRGRASNERKLSSEEAVRNAKDILSRGEGRGGRGLRGGRGLR 245

Query: 788  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDFSTGLYLGDNADGERLAQRIGVEN 967
                                           D  +D S GLYLGD+ADGE+L +R+G EN
Sbjct: 246  GGRGRGGVWAGRGRQGRGARYQ---------DRREDDSVGLYLGDDADGEKLVKRLGEEN 296

Query: 968  MDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPEYLMGDFETNXXXXXXXXXXLQ 1147
            ++++ E F+E+S  VLPSPM++AYLDALHTN LIE+EPEY M +F TN          L 
Sbjct: 297  VNQIFEAFDEMSGRVLPSPMEEAYLDALHTNCLIEFEPEYHMEEFGTNPDIDEKPPIPLC 356

Query: 1148 DALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKELIDIYSGPDTVTAMQQQQELER 1327
            DALEK+KPF+M YEGI++QEEWEE++KE MDK+P++KEL+DIYSGPD VTA QQQQELER
Sbjct: 357  DALEKIKPFIMTYEGIQNQEEWEEVVKETMDKVPYLKELVDIYSGPDRVTARQQQQELER 416

Query: 1328 VAKTLPDKAPKSVKRFTDRAVLSLQ 1402
            VA TLP+  P SVK FT+RAVLSLQ
Sbjct: 417  VASTLPENVPSSVKNFTNRAVLSLQ 441


>ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citrus clementina]
            gi|557544515|gb|ESR55493.1| hypothetical protein
            CICLE_v10019766mg [Citrus clementina]
          Length = 511

 Score =  293 bits (750), Expect = 2e-76
 Identities = 181/417 (43%), Positives = 234/417 (56%), Gaps = 13/417 (3%)
 Frame = +2

Query: 191  PQFSPQTPDNEDAHVEPDEPLFP-SGLGHGRGKPIPS-NPVLPSFSSWISSTKSVVGRGR 364
            P  +P  P +E     P +P  P SG GHGRG+P  + +P + SFSS++++ KS  GRGR
Sbjct: 101  PSKAPGQPASESKPDSPPQPQAPPSGSGHGRGQPSAAPSPSISSFSSFLTAVKSGAGRGR 160

Query: 365  ITQHQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSV 544
            ++     P  S   ++Q  +P+   F   E + +STQ          P E   P S+ S 
Sbjct: 161  VS-FASDPNESPRPDAQPAKPRT--FTPNESATDSTQ----------PSEPNLPSSIIST 207

Query: 545  LPGAGRGKGVKFVDSEEK----------PIEENRHTXXXXXXXXXXXXKDQKSDRSSS-K 691
            LPGAGRGK V     +++          P EENRH                    S+  K
Sbjct: 208  LPGAGRGKTVVTQQQQQQQHQRQQPGPPPQEENRHIRARLQPQPRPEKAPAAETGSAQPK 267

Query: 692  LSREDAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 871
            LS+EDAVK A++ILS                                             
Sbjct: 268  LSKEDAVKMAMKILSRGEEGEGEGISAGGPGRGRGMGRGGGRGRGRGQGRGRMRRQEME- 326

Query: 872  XXXDSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDAL 1051
               D ED    GLYLGDNADGE+LA+++G E M+ LVEGFEE+S  VLPSPM+DAY+DAL
Sbjct: 327  ---DDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYIDAL 383

Query: 1052 HTNNLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKE 1231
            HTN +IE+EPEYLM +F TN          L+DALEKMKPFLMAYEGI+ Q+EWEE + E
Sbjct: 384  HTNCMIEFEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQKEWEEAVNE 443

Query: 1232 VMDKLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQ 1402
            VM+++P +KE++D YSGPD VTA QQ +ELERVAKT+P+ AP S+KRF + AVLSLQ
Sbjct: 444  VMERVPLLKEIVDHYSGPDRVTAKQQGEELERVAKTIPESAPASIKRFANHAVLSLQ 500


>ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera]
          Length = 482

 Score =  291 bits (746), Expect = 5e-76
 Identities = 138/195 (70%), Positives = 162/195 (83%)
 Frame = +2

Query: 881  DSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTN 1060
            D++DD+  GLYLGDNAD E+L+ +IG+E M KL E FEE+S  VLPSP++DAYLDALHTN
Sbjct: 283  DAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTN 342

Query: 1061 NLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMD 1240
             LIE+EPEYLM +F TN          L+DALEKMKPFLM YEGI+ QEEWEE+MKE M+
Sbjct: 343  CLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETME 402

Query: 1241 KLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWG 1420
             +P++KEL+D YSGPD VTA +QQ+ELERVAKTLP+ AP SVKRFTDRA+LSLQSN GWG
Sbjct: 403  NVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWG 462

Query: 1421 FDKKCQFMDKLVWEV 1465
            FDKKCQFMDKLVWEV
Sbjct: 463  FDKKCQFMDKLVWEV 477



 Score =  114 bits (285), Expect = 2e-22
 Identities = 77/179 (43%), Positives = 96/179 (53%), Gaps = 2/179 (1%)
 Frame = +2

Query: 206 QTPDNEDAHVEPDEPLFPSGLGHGRGKPI--PSNPVLPSFSSWISSTKSVVGRGRITQHQ 379
           +T    D + E  E  FP GLGHGRGKP   PS P LPSFSS+ +ST    GRGR+T H 
Sbjct: 54  KTEPTADPNSESSESPFPLGLGHGRGKPPSQPSAPTLPSFSSF-ASTGIGRGRGRLTAH- 111

Query: 380 QQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPGAG 559
             P +S  ++S    PKKPIFF +ED+ +S  KPQ       PEE   P+S+ S L G G
Sbjct: 112 --PTDSVPQQSPDFAPKKPIFFSKEDAADSAPKPQSQLGTTPPEENNLPVSILSALSG-G 168

Query: 560 RGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKNAVRILS 736
            G+G     +   P EENRH             +   +     +LSRE+AVK AV ILS
Sbjct: 169 AGRGQPLKQTPAPPKEENRH-LRQPRQPVFRSPQQPVAGPPQPRLSREEAVKKAVGILS 226


>emb|CBI17195.3| unnamed protein product [Vitis vinifera]
          Length = 209

 Score =  291 bits (746), Expect = 5e-76
 Identities = 138/195 (70%), Positives = 162/195 (83%)
 Frame = +2

Query: 881  DSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTN 1060
            D++DD+  GLYLGDNAD E+L+ +IG+E M KL E FEE+S  VLPSP++DAYLDALHTN
Sbjct: 10   DAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTN 69

Query: 1061 NLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMD 1240
             LIE+EPEYLM +F TN          L+DALEKMKPFLM YEGI+ QEEWEE+MKE M+
Sbjct: 70   CLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETME 129

Query: 1241 KLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWG 1420
             +P++KEL+D YSGPD VTA +QQ+ELERVAKTLP+ AP SVKRFTDRA+LSLQSN GWG
Sbjct: 130  NVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWG 189

Query: 1421 FDKKCQFMDKLVWEV 1465
            FDKKCQFMDKLVWEV
Sbjct: 190  FDKKCQFMDKLVWEV 204


>ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550322664|gb|EEF06007.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 466

 Score =  288 bits (737), Expect = 6e-75
 Identities = 182/434 (41%), Positives = 234/434 (53%), Gaps = 12/434 (2%)
 Frame = +2

Query: 200  SPQTPDNEDAHVEPDEPLFPSGLGHGRGKPIPSNPVLPSFSSWISSTKSV---VGRGRIT 370
            +P  PD +++  E  E   PSGLGHGRGKP+ + P+LP+FS++ISS K+     GRGR T
Sbjct: 61   APGKPDLDESKTESSESQ-PSGLGHGRGKPVGTGPILPAFSTFISSVKNSQPGAGRGRGT 119

Query: 371  QHQQQPPNSRSEES--QTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSV 544
                +P  SRS ES  ++  PKK                          E   P S+ S 
Sbjct: 120  T---EPGPSRSTESRPESEPPKKA-------------------------EANLPPSILSG 151

Query: 545  LPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDR------SSSKLSRED 706
            L GAGRGK VK     E   EENRH             + QK+        +++K+ R++
Sbjct: 152  LGGAGRGKPVKQEVPIEPAKEENRHLRARSQPRSQPRTRQQKTPDGDDAVPATTKMGRQE 211

Query: 707  AVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDS 886
            AVK A+ +LS                                                D 
Sbjct: 212  AVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGARGGGRGRGRGRRGYG----DK 267

Query: 887  EDDFSTGLYL-GDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNN 1063
            E ++ +G+ L G   D E+ AQ +GVE M+ LVE FEE+S  VLP P++D Y+DA  TN 
Sbjct: 268  EVEYGSGMSLEGHEEDEEKFAQSVGVETMNTLVEAFEEMSGRVLPCPIEDEYVDAFDTNC 327

Query: 1064 LIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDK 1243
              E+EPEYLMG+F+ N          L+DALEK+KPF+MAY GIK  EEWEEI++E M  
Sbjct: 328  SFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEKVKPFMMAYMGIKTHEEWEEIVEETMKD 387

Query: 1244 LPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGF 1423
             P MK+++D YSGPD V+  +Q++ELERVAKT+P  AP SVK F DRAVLSLQSN GWGF
Sbjct: 388  APLMKKIVDSYSGPDRVSGKKQKEELERVAKTIPASAPDSVKSFADRAVLSLQSNPGWGF 447

Query: 1424 DKKCQFMDKLVWEV 1465
            DKKC FMDKL  EV
Sbjct: 448  DKKCMFMDKLAKEV 461


>ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica]
            gi|462409156|gb|EMJ14490.1| hypothetical protein
            PRUPE_ppa006080mg [Prunus persica]
          Length = 428

 Score =  276 bits (706), Expect = 2e-71
 Identities = 132/195 (67%), Positives = 162/195 (83%)
 Frame = +2

Query: 881  DSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTN 1060
            DS+  +++GLYLGDNADGE+LA+++G E M+KLVE FEE+S+ VLPSP+DDAY+DA+HTN
Sbjct: 229  DSDGSYASGLYLGDNADGEKLAKKLGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAMHTN 288

Query: 1061 NLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMD 1240
             +IE EPEYLMG+F  N          L+DALEKMKPFLMAYE I+ QEEWEE++ E M+
Sbjct: 289  FMIECEPEYLMGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIESQEEWEEVVNETME 348

Query: 1241 KLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWG 1420
            ++P +KE++D YSGPD VTA +QQ+ELERVAKTLP K P SVKRFTDRAVLSLQSN GWG
Sbjct: 349  RVPLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSNPGWG 408

Query: 1421 FDKKCQFMDKLVWEV 1465
            FD+KCQFMDKLV +V
Sbjct: 409  FDRKCQFMDKLVAKV 423



 Score = 71.6 bits (174), Expect = 1e-09
 Identities = 54/147 (36%), Positives = 69/147 (46%), Gaps = 4/147 (2%)
 Frame = +2

Query: 191 PQFSPQTPDNEDAHVEPDEPLFPSGLGHGRGKPIPSNPVLPSFSSWISSTKSVVGRGRIT 370
           P   P  PD++D   +PD P    GLGHGRGKP      LP+FSS++S+ K   G GR  
Sbjct: 60  PPRVPGQPDSDDP--KPDPPPSAPGLGHGRGKP------LPTFSSFVSAIKPNSGTGRGQ 111

Query: 371 QHQ-QQPPNSR---SEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLT 538
             Q Q  P SR   + ++   +P KPIFF R D  +                        
Sbjct: 112 PSQVQSIPESRDPVAPDAGPSKPIKPIFFVRGDGSD------------------------ 147

Query: 539 SVLPGAGRGKGVKFVDSEEKPIEENRH 619
             LPG+GRGK + F   E +  EENRH
Sbjct: 148 PALPGSGRGKPMNFTRPEVQVKEENRH 174


>gb|EXB62642.1| hypothetical protein L484_023937 [Morus notabilis]
          Length = 442

 Score =  263 bits (673), Expect = 2e-67
 Identities = 171/440 (38%), Positives = 229/440 (52%), Gaps = 15/440 (3%)
 Frame = +2

Query: 182  PPNPQFSPQTPDNEDAHVEPDEPLFPS---------GLGHGRGKPIPS-NPVLPSFSSWI 331
            PP   FS  TP         + PL P          G G GRG+P+P  +P++PSFSS I
Sbjct: 42   PPRSDFS--TPPRAPGQPPDEAPLTPQEASPLSHDHGRGRGRGQPLPPVSPIIPSFSSSI 99

Query: 332  SSTKSVVGRGRITQHQQQPPNSRSEESQTVQPKKPIFFCRE-DSLESTQKPQFDDSDRNP 508
            SS     GRGR                +TV P  P    +E   +E       DD    P
Sbjct: 100  SSG---AGRGR-----------GGSSFKTVLPPPPPLPQQEIQDMEEAPPVAVDDGGSMP 145

Query: 509  EEGIKPLSLTSVLPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDR--- 679
            E      S+ S+LPG GRG+  K  + ++     NRH             + +  +    
Sbjct: 146  E------SIASLLPGVGRGQPEKQPEIQQ---HVNRHVQRRWAPESAVVKESKPKEAVAA 196

Query: 680  -SSSKLSREDAVKNAVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 856
             ++ K+S+E+A+K+A+ + S                                        
Sbjct: 197  SAAPKMSQEEALKHAMEVFSRNEANGGRGRGRGRGRGRGRGRGRFVK------------- 243

Query: 857  XXXXXXXXDSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDA 1036
                    + ED+  T L +GD+ADGERLAQR+G E    L E FEE+   ++P+ +D+ 
Sbjct: 244  --------EEEDEKDTWLNVGDDADGERLAQRLGPEKTSVLTEAFEEMGEKLIPA-IDEM 294

Query: 1037 YLDALHTNNLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWE 1216
            +LDAL  N  +E+EPE+LMGD E+N          L+DALEK KPFLMAYE I+ QEEWE
Sbjct: 295  HLDALDMNFKLEFEPEFLMGDLESNPDIDEKPPIPLRDALEKAKPFLMAYENIESQEEWE 354

Query: 1217 EIMKEVMDKLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLS 1396
            EIMKE M+++P +KE++D YSGP+ VT  +Q QEL+RV KTLP  AP SVK+FT+RAVLS
Sbjct: 355  EIMKETMERVPLLKEIVDHYSGPNRVTVKKQHQELDRVTKTLPASAPNSVKQFTERAVLS 414

Query: 1397 LQSNAGWGFDKKCQFMDKLV 1456
            LQ+N GWGF +KCQFMDKLV
Sbjct: 415  LQNNPGWGFHRKCQFMDKLV 434


>ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300131 [Fragaria vesca
            subsp. vesca]
          Length = 464

 Score =  260 bits (665), Expect = 1e-66
 Identities = 124/195 (63%), Positives = 156/195 (80%)
 Frame = +2

Query: 881  DSEDDFSTGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTN 1060
            D +   ++GLYLGDNADGE+LA+++G E M++L E FE++ST+VLPSP+DDAY+DAL TN
Sbjct: 265  DEDGGIASGLYLGDNADGEKLAEKLGPEVMNQLTEAFEDMSTHVLPSPLDDAYVDALDTN 324

Query: 1061 NLIEYEPEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMD 1240
              IE+EPEYLMG+F  N          L+DALEKMKPFLMAYEGI+ QEEWEE +KE M+
Sbjct: 325  CKIEFEPEYLMGEFNQNPDIDEEPPIPLRDALEKMKPFLMAYEGIQSQEEWEEAIKETME 384

Query: 1241 KLPHMKELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWG 1420
            ++P +K+++D YSGPD VTA +Q++ELERVAKTLP   P SVK+FTDRAVLSLQ N GWG
Sbjct: 385  RVPLLKKIVDHYSGPDRVTAKKQREELERVAKTLPANVPDSVKQFTDRAVLSLQGNPGWG 444

Query: 1421 FDKKCQFMDKLVWEV 1465
            F +KCQFMDKL  +V
Sbjct: 445  FHRKCQFMDKLTQKV 459



 Score = 65.1 bits (157), Expect = 1e-07
 Identities = 58/174 (33%), Positives = 82/174 (47%), Gaps = 2/174 (1%)
 Frame = +2

Query: 218 NEDAHVEPDEP--LFPSGLGHGRGKPIPSNPVLPSFSSWISSTKSVVGRGRITQHQQQPP 391
           N  A   PD+P  +  +G GHGRGKP+P  P  P F S I    +  GRG    H + P 
Sbjct: 55  NHLAEQFPDQPDSVSSTGAGHGRGKPLPQPP--PPFGSGI-RPGAPAGRGH-PGHVRSPG 110

Query: 392 NSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPGAGRGKG 571
            SR E   +  PKKP+FF RED+ E+                 +P ++ +VL   GRGK 
Sbjct: 111 ESR-EGDDSGLPKKPVFFRREDAAEN-----------------RPEAILTVLGVTGRGKP 152

Query: 572 VKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKNAVRIL 733
           V    +  + +EE+R               + + +    K SRE+AVK+A+ IL
Sbjct: 153 VS--GAAVQSVEEDRRIGAPV---------EPRREPRKPKSSREEAVKHAMGIL 195


>tpg|DAA51855.1| TPA: hypothetical protein ZEAMMB73_029894 [Zea mays]
          Length = 455

 Score =  256 bits (655), Expect = 2e-65
 Identities = 163/419 (38%), Positives = 218/419 (52%), Gaps = 2/419 (0%)
 Frame = +2

Query: 200  SPQTPDNEDAHVEPDEPLFPSGLGHGRGKP-IPSNPVLPSFSSWISSTKSVVGRGRITQH 376
            +P  P + D   +P     P  +G GRG+P +PS+P +PSF+ +     S VGRGR    
Sbjct: 53   APGRPISNDDDADPFSATAP--VGRGRGEPEVPSSPGIPSFAVF-----SGVGRGRGRGS 105

Query: 377  QQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPGA 556
               PP    + S+  QP     F         + P  D S   P     PL     + GA
Sbjct: 106  PLPPPPPPEDASK--QPTFTKGFDNAPQRSYPEPPSLDASSSAP-----PLPRPLPISGA 158

Query: 557  GRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSS-KLSREDAVKNAVRIL 733
            GRG       S +KP EENR                +++  +   KLS ++AV+ AV +L
Sbjct: 159  GRGVPWTQQPSPDKPPEENRFIRRREAVKQSAAEPPKQAPGAQQPKLSPQEAVRRAVELL 218

Query: 734  SXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDFSTGLY 913
                                                             D  +D     Y
Sbjct: 219  GGGGRSGEDGGGRGGGGRLSRGRGRGTGRGRRPGRGDRS----------DDVEDVWQASY 268

Query: 914  LGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPEYLM 1093
            LGD ADG+RL Q++G + M  L + F E + N LP PM+DAYL+A HTNN+IE+EPEY +
Sbjct: 269  LGDKADGDRLEQQLGEDKMKILEQAFMEAADNALPHPMEDAYLEACHTNNMIEFEPEYHV 328

Query: 1094 GDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKELIDI 1273
                 N          L++ L+K+KPF++AYEGI++QEEWEE +K+VM + PHMKELID+
Sbjct: 329  NF--GNPDIDEKPPMSLEEMLQKVKPFVVAYEGIQNQEEWEEAVKDVMARAPHMKELIDM 386

Query: 1274 YSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQFMDK 1450
            YSGPD VTA QQ++EL+RVA TLP+  P SVKRFTD+ +LSL++N GWGFDKKCQFMDK
Sbjct: 387  YSGPDVVTAKQQEEELQRVANTLPESIPSSVKRFTDKTLLSLKNNPGWGFDKKCQFMDK 445


>ref|XP_002466313.1| hypothetical protein SORBIDRAFT_01g005470 [Sorghum bicolor]
            gi|241920167|gb|EER93311.1| hypothetical protein
            SORBIDRAFT_01g005470 [Sorghum bicolor]
          Length = 458

 Score =  255 bits (651), Expect = 5e-65
 Identities = 159/419 (37%), Positives = 225/419 (53%), Gaps = 2/419 (0%)
 Frame = +2

Query: 200  SPQTPDNEDAHVEPDEPLFPSGLGHGRGKP-IPSNPVLPSFSSWISSTKSVVGRGRITQH 376
            +P  P ++D   +P      + +G GRG+P +PS+P +PSF+ +     S VGRGR +  
Sbjct: 57   APGRPISDDDGADPFSAT--ASVGRGRGEPAVPSSPSIPSFAVF-----SGVGRGRGSPL 109

Query: 377  QQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLTSVLPGA 556
               PP   +       PK+P    R D+  + Q+P  +    +      PL  T    GA
Sbjct: 110  PPPPPPEDA-------PKQPTLTKRFDN--APQRPDPEPPSLDASSSAPPLPRTLPFSGA 160

Query: 557  GRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSS-KLSREDAVKNAVRIL 733
            GRG       + +KP EENR                +++  +   KLS+++AV  AV +L
Sbjct: 161  GRGVPRMQQPAPDKPQEENRFIRRREAAKQAAAVPAKQAPAAQQPKLSQQEAVDRAVELL 220

Query: 734  SXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDFSTGLY 913
                                                             D EDD    +Y
Sbjct: 221  GGGDRSGEDGGGRGGRGRGFRGRGPGRGRFRGRGRSDDRSV--------DVEDD-RQAIY 271

Query: 914  LGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYEPEYLM 1093
            LGDNADG+RL +R+G + M+ L + F E + N LP P++D YL+A HTN++IE+EPEY +
Sbjct: 272  LGDNADGDRLEKRLGKDKMEILEQAFMEAADNALPDPVEDGYLEAFHTNSMIEFEPEYHV 331

Query: 1094 GDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMKELIDI 1273
                 N          L++ L+K+KPF++A+EGI++QEEWEE +K+VM + PHMKELID+
Sbjct: 332  NF--GNPDIDEKPPMSLEEMLQKVKPFIVAFEGIQNQEEWEESVKDVMARAPHMKELIDM 389

Query: 1274 YSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQFMDK 1450
             SGPD VTA QQ++EL+RVA TLP+  P SVKRFTD+ +LSL++N GWGFDKKCQFMDK
Sbjct: 390  CSGPDVVTAKQQEEELQRVANTLPESIPSSVKRFTDKTLLSLKNNPGWGFDKKCQFMDK 448


>ref|XP_004965630.1| PREDICTED: CCAAT/enhancer-binding protein alpha-like [Setaria
            italica]
          Length = 457

 Score =  252 bits (644), Expect = 4e-64
 Identities = 159/424 (37%), Positives = 212/424 (50%), Gaps = 1/424 (0%)
 Frame = +2

Query: 182  PPNPQFSPQTPDNEDAHVEPDEPLFPSGLGHGRGKPI-PSNPVLPSFSSWISSTKSVVGR 358
            P  P  +P    ++D   +P     P+G   GRG+P  PS+  +PSF++      S VGR
Sbjct: 49   PSGPPRAPGRTISDDDGADPFSAAAPAG--RGRGEPAAPSSATIPSFAA-----SSGVGR 101

Query: 359  GRITQHQQQPPNSRSEESQTVQPKKPIFFCREDSLESTQKPQFDDSDRNPEEGIKPLSLT 538
            GR +     PP   +       PK+P    R D     + P+   S         PL   
Sbjct: 102  GRGSPLPPPPPPEDA-------PKQPTLTKRFDDAPPRRDPE-PPSPEASSSSAPPLPRA 153

Query: 539  SVLPGAGRGKGVKFVDSEEKPIEENRHTXXXXXXXXXXXXKDQKSDRSSSKLSREDAVKN 718
                GAGRG         +KP EENR                        KLS EDAVK 
Sbjct: 154  LPFTGAGRGVPRMQQPPVDKPPEENRFIRRREAAKQAAVGPTSAPGPQQPKLSGEDAVKR 213

Query: 719  AVRILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSEDDF 898
            A+ +L                                                     D 
Sbjct: 214  ALELLGGGGGGRGGGRGDEDGGGRGGRGRGFRGRGRGRGRTRDDRRSVDL--------DD 265

Query: 899  STGLYLGDNADGERLAQRIGVENMDKLVEGFEEISTNVLPSPMDDAYLDALHTNNLIEYE 1078
               +YLGDNADGE+L +++G + M  L + F E + N LP PM++AY +A HTNN+IE+E
Sbjct: 266  RQAIYLGDNADGEKLEKKLGEDKMKILEQAFMEAADNALPHPMENAYQEACHTNNMIEFE 325

Query: 1079 PEYLMGDFETNXXXXXXXXXXLQDALEKMKPFLMAYEGIKDQEEWEEIMKEVMDKLPHMK 1258
            P+Y +     N          L++ L+K+KPF++AYEGI++QEEWEE +K+VM + PHMK
Sbjct: 326  PQYHVNF--ANPDIDEKPQMSLEEMLQKVKPFIVAYEGIQNQEEWEEAVKDVMARAPHMK 383

Query: 1259 ELIDIYSGPDTVTAMQQQQELERVAKTLPDKAPKSVKRFTDRAVLSLQSNAGWGFDKKCQ 1438
            ELID+YSGPD VTA QQ++EL+RVA TLP+  P SVKRFTD+ +LSL++N GWGFDKKCQ
Sbjct: 384  ELIDMYSGPDVVTAKQQEEELQRVANTLPENIPSSVKRFTDKTLLSLKNNPGWGFDKKCQ 443

Query: 1439 FMDK 1450
            FMDK
Sbjct: 444  FMDK 447


Top