BLASTX nr result

ID: Rehmannia23_contig00014013 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00014013
         (1303 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...   354   4e-95
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...   353   9e-95
gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, ...   340   6e-91
gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]       331   5e-88
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...   329   1e-87
ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253...   328   4e-87
gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus pe...   317   7e-84
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...   311   5e-82
emb|CBI17195.3| unnamed protein product [Vitis vinifera]              307   5e-81
ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300...   300   9e-79
ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [...   300   1e-78
ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507...   293   1e-76
ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Caps...   293   1e-76
gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus...   292   2e-76
ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp....   290   9e-76
gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot...   289   2e-75
ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein...   289   2e-75
gb|AAM65660.1| Contains similarity to RNA-binding protein from A...   289   2e-75
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr...   288   3e-75
ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutr...   286   1e-74

>ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum]
          Length = 480

 Score =  354 bits (909), Expect = 4e-95
 Identities = 197/347 (56%), Positives = 232/347 (66%), Gaps = 9/347 (2%)
 Frame = -1

Query: 1303 DEAQYNAVESEIPAIQEKP-LPNDVINVISGVGRGKPMKSPAPQSEKPKAENRHIRQRQQ 1127
            + A  N+  S+ P  ++   L + VI+V++G GRGKP+++ +P SEKPK ENRH+R RQQ
Sbjct: 140  ETADSNSSSSDAPTPRDDSNLSSSVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQ 199

Query: 1126 XXXXXXXXXXXXXXXXXSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXX 947
                              P ++LS+E+ VKKA  ILS                       
Sbjct: 200  KVADSGERASSP------PPQRLSREDAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRG 253

Query: 946  XXXXXXXXXXXXXXGDD------RYEESDDEA--SGLYLGDPADEEKMAQKLGPEVMNKL 791
                                   R EE  D +  SG YLGD AD EK+AQKLGPE MN L
Sbjct: 254  RGGRGAVRGRGGRGRGRGRGRGRRDEERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTL 313

Query: 790  AEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALE 611
            AEGFEEMS+RVLPSP+DDAY++A HTN+MIECEPEYLM +F +NPDIDE PPIPLRDALE
Sbjct: 314  AEGFEEMSARVLPSPMDDAYIEALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALE 373

Query: 610  KMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKT 431
            KMKPFLMAYEGI+ Q        ETM+ VPL+KEIVD+YSGPDRVTAKQQQ+ELERVAKT
Sbjct: 374  KMKPFLMAYEGIKDQEEWEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKT 433

Query: 430  LPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
            LP SAP+ VKRFTERAVLSLQSNPGWGFDKKCQFMDK+V+E SQ YK
Sbjct: 434  LPESAPNSVKRFTERAVLSLQSNPGWGFDKKCQFMDKVVMEASQHYK 480


>ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum
            lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED:
            uncharacterized protein LOC101247662 isoform 2 [Solanum
            lycopersicum]
          Length = 473

 Score =  353 bits (906), Expect = 9e-95
 Identities = 194/323 (60%), Positives = 222/323 (68%), Gaps = 4/323 (1%)
 Frame = -1

Query: 1246 LPNDVINVISGVGRGKPMKSPAPQSEKPKAENRHIRQRQQXXXXXXXXXXXXXXXXXSPR 1067
            LP+ VI+V++G GRGKP+++ +  SEKPK ENRH+R RQQ                  P 
Sbjct: 157  LPSSVISVLTGAGRGKPLQTASSVSEKPKEENRHLRPRQQKVADSGERASSP------PP 210

Query: 1066 EQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDD--R 893
            ++LS+E+ VKKA  ILS                                          R
Sbjct: 211  QRLSREDAVKKAVGILSRSDDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRR 270

Query: 892  YEESDDE--ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAY 719
             EE  D    SG YLGD AD EK+A KLGPE MN LAEGFEEMS+RVLPSP+DDAYL+A 
Sbjct: 271  DEERGDGNLESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLPSPMDDAYLEAL 330

Query: 718  HTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXE 539
            HTN+MIECEPEYLM +F +NPDIDE PPIPLRDALEKMKPFLMAYEGI+ Q        E
Sbjct: 331  HTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKE 390

Query: 538  TMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNP 359
            TM+ VPL+KEIVD+YSGPDRVTAKQQQ+ELERVAKTLP SAP+ VKRFTERAVLSLQSNP
Sbjct: 391  TMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNP 450

Query: 358  GWGFDKKCQFMDKLVIEVSQQYK 290
            GWGFDKKCQFMDK+V+EVSQ YK
Sbjct: 451  GWGFDKKCQFMDKVVMEVSQHYK 473


>gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao]
          Length = 474

 Score =  340 bits (873), Expect = 6e-91
 Identities = 196/350 (56%), Positives = 222/350 (63%), Gaps = 12/350 (3%)
 Frame = -1

Query: 1303 DEAQYNAVESEIPAIQEKPL--PNDV-INVISGVGRGKPMKSPAPQSEKPKAENRHIRQR 1133
            DE + +A  +  P    +P+  PN + ++V+SG GRGKP+K P P S + + ENRHIR  
Sbjct: 142  DETESSAKAAAEPIQSSEPIFPPNILPVSVLSGAGRGKPVKQPEPASRRQE-ENRHIRVA 200

Query: 1132 QQXXXXXXXXXXXXXXXXXSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXX 953
            QQ                  P  Q+SQEE  KKA  ILS                     
Sbjct: 201  QQQS----------------PSAQMSQEEATKKAMGILSRRSESGESGMVGRGGRASMGM 244

Query: 952  XXXXXXXXXXXXXXXXGDDRYEESDDE---------ASGLYLGDPADEEKMAQKLGPEVM 800
                            G  R +  D           A GLYLGD AD EK AQ +G + M
Sbjct: 245  GGGRGRGRGRGRGMGRGRGRRQGEDTRIVKDSGEGSADGLYLGDNADGEKFAQTIGADNM 304

Query: 799  NKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRD 620
            NKL EGFEEM SRVLPSP+DDAYLDA HTN  IE EPEYLMEEFGTNPDIDEKPP+PLRD
Sbjct: 305  NKLVEGFEEMGSRVLPSPMDDAYLDALHTNCSIEFEPEYLMEEFGTNPDIDEKPPMPLRD 364

Query: 619  ALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERV 440
            ALEKMKPFLMAYEGIQSQ        ETM++VPL++EIVD+YSGPDRVTAK+QQEELERV
Sbjct: 365  ALEKMKPFLMAYEGIQSQEEWEEVIKETMERVPLLQEIVDYYSGPDRVTAKKQQEELERV 424

Query: 439  AKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
            AKT+P  AP  VK+F  RAVLSLQSNPGWGFDKKCQFMDKLV EVSQQYK
Sbjct: 425  AKTIPERAPSSVKQFANRAVLSLQSNPGWGFDKKCQFMDKLVWEVSQQYK 474


>gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]
          Length = 426

 Score =  331 bits (848), Expect = 5e-88
 Identities = 180/317 (56%), Positives = 214/317 (67%)
 Frame = -1

Query: 1240 NDVINVISGVGRGKPMKSPAPQSEKPKAENRHIRQRQQXXXXXXXXXXXXXXXXXSPREQ 1061
            +D++  +SG+GRG P K P PQ+ KP   NRHIRQ Q                   P +Q
Sbjct: 123  DDILTNLSGMGRGTPGKPP-PQTLKPTPINRHIRQPQPRPSTALS-----------PDQQ 170

Query: 1060 LSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEES 881
            LS+EEK+KKA EILS                                       D   ES
Sbjct: 171  LSKEEKLKKAVEILSRGDPDRGPIRSPTGRGRGRGRGRGGRGGRFSGRGRGREADAAIES 230

Query: 880  DDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMI 701
            D+E  G++ GDPADE+K+A+KLG EVMNK+ EG EEMSSRVLPS +DDAY+DAYHTNL++
Sbjct: 231  DEELPGMF-GDPADEQKVAEKLGVEVMNKITEGMEEMSSRVLPSLIDDAYVDAYHTNLLL 289

Query: 700  ECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVP 521
            ECEPEY ME+FGTNPDID+KPPIPLR+A EKMKPFLM + GI++Q        ETM+ VP
Sbjct: 290  ECEPEYFMEDFGTNPDIDDKPPIPLREAFEKMKPFLMQHIGIETQEEWEQIIEETMESVP 349

Query: 520  LIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDK 341
              K+I+DHY+GPDRVTA QQ  ELERVA TLPA+AP  VKRFTERAVLSL+SNPGWGF K
Sbjct: 350  RWKKIIDHYAGPDRVTALQQIGELERVAGTLPATAPASVKRFTERAVLSLKSNPGWGFKK 409

Query: 340  KCQFMDKLVIEVSQQYK 290
            KCQFMDK+V+EVSQQYK
Sbjct: 410  KCQFMDKVVMEVSQQYK 426


>ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis]
          Length = 407

 Score =  329 bits (844), Expect = 1e-87
 Identities = 186/345 (53%), Positives = 217/345 (62%), Gaps = 12/345 (3%)
 Frame = -1

Query: 1288 NAVESEIPAIQEKPLPNDVINVISGVGRGKPMKSPAPQSEK----------PKAENRHIR 1139
            +A +S  P+  E  LP+ +I+ + G GRGK   +   Q ++          P+ ENRHIR
Sbjct: 68   SATDSTQPS--EPNLPSSIISTLPGAGRGKTAVTQQQQQQQQHQRQQPGPPPQEENRHIR 125

Query: 1138 QRQQXXXXXXXXXXXXXXXXXSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXX 959
             R Q                   + +LS+E+ VK A ++LS                   
Sbjct: 126  ARLQPQPRPEKAPAAETGSA---QPKLSKEDAVKMAMKVLSRGEEGEGEGISAGGPGRGR 182

Query: 958  XXXXXXXXXXXXXXXXXXGDDRYEESDDEA--SGLYLGDPADEEKMAQKLGPEVMNKLAE 785
                                 +  E D++    GLYLGD AD EK+A+K+G E MN L E
Sbjct: 183  GMGRGRGRGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVE 242

Query: 784  GFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKM 605
            GFEEMS RVLPSP++DAY+DA HTN MIE EPEYLMEEFGTNPDIDEKPPIPLRDALEKM
Sbjct: 243  GFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPDIDEKPPIPLRDALEKM 302

Query: 604  KPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLP 425
            KPFLMAYEGIQSQ        E M++VPL+KEIVDHYSGPDRVTAKQQ EELERVAKT+P
Sbjct: 303  KPFLMAYEGIQSQEEWEEAVNEVMERVPLLKEIVDHYSGPDRVTAKQQGEELERVAKTIP 362

Query: 424  ASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
             SAP  +KRF  RAVLSLQSNPGWGFDKKCQFMDKL  EVSQQYK
Sbjct: 363  ESAPASIKRFANRAVLSLQSNPGWGFDKKCQFMDKLAWEVSQQYK 407


>ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera]
          Length = 482

 Score =  328 bits (840), Expect = 4e-87
 Identities = 187/339 (55%), Positives = 218/339 (64%), Gaps = 16/339 (4%)
 Frame = -1

Query: 1258 QEKPLPNDVINVISG-VGRGKPMK-SPAPQSEKPKAENRHIRQRQQXXXXXXXXXXXXXX 1085
            +E  LP  +++ +SG  GRG+P+K +PAP    PK ENRH+RQ +Q              
Sbjct: 153  EENNLPVSILSALSGGAGRGQPLKQTPAP----PKEENRHLRQPRQPVFRSPQQPVAGP- 207

Query: 1084 XXXSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 905
                P+ +LS+EE VKKA  ILS                                     
Sbjct: 208  ----PQPRLSREEAVKKAVGILSRGGDGGGDGDGDEGGRGRGFRGRGRGRGRGAQGWMGR 263

Query: 904  GDDRY--------------EESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMS 767
            G  R               +  DD  +GLYLGD AD EK++ K+G E M+KL E FEEMS
Sbjct: 264  GRGRGRGRGRMGDRRGRGGDAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMS 323

Query: 766  SRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMA 587
             RVLPSP++DAYLDA HTN +IE EPEYLMEEFGTNPDIDE PPIPLRDALEKMKPFLM 
Sbjct: 324  GRVLPSPIEDAYLDALHTNCLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQ 383

Query: 586  YEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDP 407
            YEGIQSQ        ETM+ VP +KE+VD+YSGPDRVTAK+QQEELERVAKTLP +AP+ 
Sbjct: 384  YEGIQSQEEWEEVMKETMENVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNS 443

Query: 406  VKRFTERAVLSLQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
            VKRFT+RA+LSLQSNPGWGFDKKCQFMDKLV EVSQ YK
Sbjct: 444  VKRFTDRAILSLQSNPGWGFDKKCQFMDKLVWEVSQHYK 482


>gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica]
          Length = 428

 Score =  317 bits (812), Expect = 7e-84
 Identities = 159/202 (78%), Positives = 173/202 (85%), Gaps = 1/202 (0%)
 Frame = -1

Query: 895 RYEESDDE-ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAY 719
           R ++SD   ASGLYLGD AD EK+A+KLGPE+MNKL E FEEMSS VLPSPLDDAY+DA 
Sbjct: 226 RGKDSDGSYASGLYLGDNADGEKLAKKLGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAM 285

Query: 718 HTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXE 539
           HTN MIECEPEYLM EF  NPDIDEKPPI LRDALEKMKPFLMAYE I+SQ        E
Sbjct: 286 HTNFMIECEPEYLMGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIESQEEWEEVVNE 345

Query: 538 TMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNP 359
           TM++VPL+KEIVDHYSGPDRVTAK+QQEELERVAKTLPA  PD VKRFT+RAVLSLQSNP
Sbjct: 346 TMERVPLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSNP 405

Query: 358 GWGFDKKCQFMDKLVIEVSQQY 293
           GWGFD+KCQFMDKLV +VSQ Y
Sbjct: 406 GWGFDRKCQFMDKLVAKVSQHY 427


>ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis]
            gi|223537066|gb|EEF38701.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 436

 Score =  311 bits (796), Expect = 5e-82
 Identities = 176/320 (55%), Positives = 206/320 (64%), Gaps = 2/320 (0%)
 Frame = -1

Query: 1246 LPNDVINVISGVGRGKPMKSPAPQSEKPKAENRHIRQRQQXXXXXXXXXXXXXXXXXSPR 1067
            LP+ + + +SG GRG+P K   P + + K ENRHIR R +                   +
Sbjct: 133  LPSTIHSSLSGFGRGEPDKPVVP-TPQVKEENRHIRDRSRAKPKTEEAEVRA-------K 184

Query: 1066 EQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYE 887
             ++S+EE VK+A  ILS                                        R  
Sbjct: 185  PKISREEAVKRAVSILSQGDTGEGMGRGRGGGRGRGRGRGRGRLEQRG---------RMM 235

Query: 886  ESDDEA--SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHT 713
            +  DE   SGL+LGD AD EK+A K+G E MNKL EG+EEMS RVLPSP++DAYLDA HT
Sbjct: 236  DDVDEGFGSGLFLGDNADGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDALHT 295

Query: 712  NLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETM 533
            N MIE EPEYLM EF  NPDIDEKPP+PLRD LEK+KPF+MAYEGIQSQ        ETM
Sbjct: 296  NYMIEFEPEYLMGEFDQNPDIDEKPPMPLRDVLEKVKPFIMAYEGIQSQEEWEAAVEETM 355

Query: 532  KKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGW 353
            K VPL KEIVD+YSGPDR+TAK+Q+EELERVA T+PASAP  VKRF +RAVLSLQSNPGW
Sbjct: 356  KNVPLFKEIVDYYSGPDRITAKKQEEELERVANTIPASAPASVKRFADRAVLSLQSNPGW 415

Query: 352  GFDKKCQFMDKLVIEVSQQY 293
            GFDKKCQFMDKLV EV+Q Y
Sbjct: 416  GFDKKCQFMDKLVREVNQCY 435


>emb|CBI17195.3| unnamed protein product [Vitis vinifera]
          Length = 209

 Score =  307 bits (787), Expect = 5e-81
 Identities = 151/200 (75%), Positives = 169/200 (84%)
 Frame = -1

Query: 889 EESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTN 710
           +  DD  +GLYLGD AD EK++ K+G E M+KL E FEEMS RVLPSP++DAYLDA HTN
Sbjct: 10  DAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTN 69

Query: 709 LMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMK 530
            +IE EPEYLMEEFGTNPDIDE PPIPLRDALEKMKPFLM YEGIQSQ        ETM+
Sbjct: 70  CLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETME 129

Query: 529 KVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWG 350
            VP +KE+VD+YSGPDRVTAK+QQEELERVAKTLP +AP+ VKRFT+RA+LSLQSNPGWG
Sbjct: 130 NVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWG 189

Query: 349 FDKKCQFMDKLVIEVSQQYK 290
           FDKKCQFMDKLV EVSQ YK
Sbjct: 190 FDKKCQFMDKLVWEVSQHYK 209


>ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300131 [Fragaria vesca
           subsp. vesca]
          Length = 464

 Score =  300 bits (768), Expect = 9e-79
 Identities = 150/206 (72%), Positives = 171/206 (83%), Gaps = 3/206 (1%)
 Frame = -1

Query: 898 DRYEESDDE---ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYL 728
           DR    D++   ASGLYLGD AD EK+A+KLGPEVMN+L E FE+MS+ VLPSPLDDAY+
Sbjct: 259 DRRRRGDEDGGIASGLYLGDNADGEKLAEKLGPEVMNQLTEAFEDMSTHVLPSPLDDAYV 318

Query: 727 DAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXX 548
           DA  TN  IE EPEYLM EF  NPDIDE+PPIPLRDALEKMKPFLMAYEGIQSQ      
Sbjct: 319 DALDTNCKIEFEPEYLMGEFNQNPDIDEEPPIPLRDALEKMKPFLMAYEGIQSQEEWEEA 378

Query: 547 XXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQ 368
             ETM++VPL+K+IVDHYSGPDRVTAK+Q+EELERVAKTLPA+ PD VK+FT+RAVLSLQ
Sbjct: 379 IKETMERVPLLKKIVDHYSGPDRVTAKKQREELERVAKTLPANVPDSVKQFTDRAVLSLQ 438

Query: 367 SNPGWGFDKKCQFMDKLVIEVSQQYK 290
            NPGWGF +KCQFMDKL  +VS+ YK
Sbjct: 439 GNPGWGFHRKCQFMDKLTQKVSKHYK 464


>ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [Glycine max]
            gi|571476117|ref|XP_006586864.1| PREDICTED: la-related
            protein 1 isoform X2 [Glycine max]
          Length = 481

 Score =  300 bits (767), Expect = 1e-78
 Identities = 171/331 (51%), Positives = 205/331 (61%), Gaps = 9/331 (2%)
 Frame = -1

Query: 1255 EKPLPNDVINVISGVGRGKPMKSPAPQSEKPKAENRHIRQRQQXXXXXXXXXXXXXXXXX 1076
            +  LP  +  V+SG+GRGK MK P  +++  + ENRH+R RQ                  
Sbjct: 161  DNKLPGSIPGVLSGLGRGKSMKQPDLETQVTE-ENRHLRTRQAPGAASSETVPKRSPIP- 218

Query: 1075 SPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG-- 902
                  SQE+  + A +ILS                                     G  
Sbjct: 219  ------SQEDATRNALKILSHGKDDGSDTGRGREYGGRGGLDRGRGRGRGRGRGRGMGRG 272

Query: 901  -------DDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPL 743
                   D++  ++DD A+GLY GD AD EK+A+K+GPE+MN+L EGFEEM+SRVLPSPL
Sbjct: 273  RFVERDVDEKVMDTDDYATGLYAGDDADGEKLARKVGPEIMNQLTEGFEEMTSRVLPSPL 332

Query: 742  DDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQX 563
            +D +LDA   N  IE EPEYL+E    NPDIDEK PI LRDALEK KPFLM+YEGIQSQ 
Sbjct: 333  EDEFLDALDINYAIEFEPEYLVEF--DNPDIDEKEPISLRDALEKAKPFLMSYEGIQSQE 390

Query: 562  XXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERA 383
                   ETM +VPL+K+I+DHYSGPDRVTAK+QQEELERVAKTLP S P  VK+FT RA
Sbjct: 391  EWEEIMEETMARVPLLKKIIDHYSGPDRVTAKKQQEELERVAKTLPGSVPSSVKQFTNRA 450

Query: 382  VLSLQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
            V+SLQSNPGWGFDKKC FMDKLV EVSQ YK
Sbjct: 451  VISLQSNPGWGFDKKCHFMDKLVWEVSQHYK 481


>ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum]
          Length = 504

 Score =  293 bits (750), Expect = 1e-76
 Identities = 149/208 (71%), Positives = 170/208 (81%), Gaps = 4/208 (1%)
 Frame = -1

Query: 901 DDRYEESDDEA----SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDA 734
           DDR+ +  D A    SGL+LGD  D EK+A+K+GPEVMN+  EGFEEM SRVLPSPL+D 
Sbjct: 298 DDRFGQIQDNARSNASGLFLGDDVDGEKLAKKVGPEVMNQFTEGFEEMISRVLPSPLEDE 357

Query: 733 YLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXX 554
           Y++A+  N  IE EPEY+ME F +NPDIDEK PIPLRDALEKMKPFLM YEGIQSQ    
Sbjct: 358 YVEAFDINCAIEFEPEYIME-FDSNPDIDEKEPIPLRDALEKMKPFLMNYEGIQSQEEWE 416

Query: 553 XXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLS 374
               ETM++VPL+K+IVDHYSGPDRVTAK+QQEELERVAKTLPASAP  V +FT RAV+S
Sbjct: 417 AIMEETMERVPLLKKIVDHYSGPDRVTAKKQQEELERVAKTLPASAPSSVVQFTNRAVMS 476

Query: 373 LQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
           LQSNPGWGFDKKCQFMDKLV EVSQ +K
Sbjct: 477 LQSNPGWGFDKKCQFMDKLVFEVSQHHK 504


>ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Capsella rubella]
            gi|482575944|gb|EOA40131.1| hypothetical protein
            CARUB_v10008838mg [Capsella rubella]
          Length = 525

 Score =  293 bits (749), Expect = 1e-76
 Identities = 167/345 (48%), Positives = 206/345 (59%), Gaps = 14/345 (4%)
 Frame = -1

Query: 1282 VESEIPAIQEKP----LPNDVINVI-------SGVGRGKPMKSPAPQSEKPKAENRHIRQ 1136
            V S  PA + K     LP++V N +       SG GRGKP+   AP   +   ENRHIR+
Sbjct: 187  VTSSPPAPESKSGQTDLPDNVFNALGSEIPHSSGAGRGKPLVESAPIQRE---ENRHIRR 243

Query: 1135 RQ---QXXXXXXXXXXXXXXXXXSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXX 965
                 Q                 +PR +LS EE  ++A+  LS                 
Sbjct: 244  PPPPPQQQRSQPQQKRAQTPRDETPRPRLSAEEAGRRARSELSRGEAEGSGVRGRGGRGR 303

Query: 964  XXXXXXXXXXXXXXXXXXXXGDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAE 785
                                 D + EE + EA  ++ GD AD EK A K+GPE+M  LAE
Sbjct: 304  GRGARGRGRGRGGEGWRD---DKKEEEGEQEAMSVFAGDSADGEKFANKMGPELMKTLAE 360

Query: 784  GFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKM 605
            GFEE+  + LPS   DA +DAY TNLMIECEPEY+M +FG+NPDIDEKPP+ LR+ LEK+
Sbjct: 361  GFEEVCEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKV 420

Query: 604  KPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLP 425
            KPF++AYEGI+ Q        E M + PL+KEIVDHYSGPDRVTAK+Q EEL+R+A TLP
Sbjct: 421  KPFIVAYEGIKDQEEWEEAINEAMAQAPLMKEIVDHYSGPDRVTAKKQNEELDRIATTLP 480

Query: 424  ASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
             SAPD VKRF +RA L+L+SNPGWGFDKK QFMDKLV+EVSQ YK
Sbjct: 481  KSAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKLVLEVSQSYK 525


>gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris]
          Length = 532

 Score =  292 bits (747), Expect = 2e-76
 Identities = 174/334 (52%), Positives = 201/334 (60%), Gaps = 11/334 (3%)
 Frame = -1

Query: 1258 QEKPLPNDVINVISGVGRGKPMKSPAPQSEKPKAENRHIRQRQQXXXXXXXXXXXXXXXX 1079
            Q   LP ++I V+SG+GRGKPMK    QS+         R  +                 
Sbjct: 209  QANKLPGNIIEVLSGLGRGKPMK----QSDPETRVTEENRHLRAPRARGAAASDTLYERQ 264

Query: 1078 XSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGD 899
              P    S+++ V+ A+  LS                                     G 
Sbjct: 265  PIP----SRDDAVRNARNFLSQGEDDVGGTGRGRGFRERGGLGRGRGRGRGRGRGTGRGG 320

Query: 898  DRYEESDD--------EAS---GLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLP 752
             R  + D+        EAS   G Y+GD AD EK+A+K+GPE+MN+L EGFEEM+ RVLP
Sbjct: 321  FRGRDMDERRGRFMDAEASDDIGPYVGDDADGEKLAKKVGPEIMNQLTEGFEEMAGRVLP 380

Query: 751  SPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQ 572
            SPL+D YLDA   N  IE EPEYL+E    NPDIDEK PIPLRDALEKMKPFLMAYEGIQ
Sbjct: 381  SPLEDEYLDALDINYAIEFEPEYLVEF--DNPDIDEKEPIPLRDALEKMKPFLMAYEGIQ 438

Query: 571  SQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFT 392
            SQ        ETM +VPL+KEIVDHYSGPDRVTAK+QQEELERVAKTLP SAP  VK+FT
Sbjct: 439  SQEEWEEIMEETMAQVPLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPESAPSSVKQFT 498

Query: 391  ERAVLSLQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
             RAV+SLQSNPGWGFDKKC FMDKLV EVSQ YK
Sbjct: 499  NRAVVSLQSNPGWGFDKKCHFMDKLVWEVSQHYK 532


>ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297340299|gb|EFH70716.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 769

 Score =  290 bits (742), Expect = 9e-76
 Identities = 159/333 (47%), Positives = 202/333 (60%), Gaps = 15/333 (4%)
 Frame = -1

Query: 1243 PNDVINVIS-------GVGRGKPMKSPAPQSEKPKAENRHIR--------QRQQXXXXXX 1109
            P+++ N +        G GRGKP+   AP  ++   +NR IR        Q+QQ      
Sbjct: 443  PDNIFNALGSEFSHPIGAGRGKPLVESAPIQQE---DNRQIRRPQPPPPPQQQQQQRAQP 499

Query: 1108 XXXXXXXXXXXSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 929
                       +P+ QLS+EE  ++A+  LS                             
Sbjct: 500  QQKRAPTVKDEAPKPQLSREEAGRRARSELSRGEAEGGGVRGRGGRGRGRGARGRGRGRG 559

Query: 928  XXXXXXXXGDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPS 749
                     D + EE + EA  ++ GD AD EK AQK+GPE+M  LAEGFEE+  + LPS
Sbjct: 560  GDGWRD---DKKEEEGEQEAMSIFAGDSADGEKFAQKMGPELMKTLAEGFEEVCEKALPS 616

Query: 748  PLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQS 569
               DA +DAY TNLMIECEPEY+M +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ 
Sbjct: 617  TTHDAIIDAYDTNLMIECEPEYIMADFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKD 676

Query: 568  QXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTE 389
            Q        E M + PL+KEIVDHYSGPDRVTAK+Q EEL+ +A T+PASAPD VKRF +
Sbjct: 677  QEEWEEAVNEAMAQAPLMKEIVDHYSGPDRVTAKKQNEELDSIATTIPASAPDSVKRFAD 736

Query: 388  RAVLSLQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
            RA L+L+SNPGWGFDKK QFMDKLV+EVSQ YK
Sbjct: 737  RAALTLKSNPGWGFDKKYQFMDKLVLEVSQSYK 769


>gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain.
            ESTs gb|H37317, gb|F14415, gb|AA651290 come from this
            gene [Arabidopsis thaliana]
          Length = 829

 Score =  289 bits (739), Expect = 2e-75
 Identities = 156/325 (48%), Positives = 197/325 (60%), Gaps = 7/325 (2%)
 Frame = -1

Query: 1243 PNDVINVI-------SGVGRGKPMKSPAPQSEKPKAENRHIRQRQQXXXXXXXXXXXXXX 1085
            P+++ N +       SG GRGKP+   AP  ++   + R      Q              
Sbjct: 506  PDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQIRRPPPPPQQQRVQPQQKRAPTV 565

Query: 1084 XXXSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 905
               +P+ QLS EE  ++A+  LS                                     
Sbjct: 566  KDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRD 625

Query: 904  GDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLD 725
             D + EE + EA  ++ GD AD EK A+K+GPE+M  LAEGFEE+  + LPS   DA +D
Sbjct: 626  -DKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIID 684

Query: 724  AYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXX 545
            AY TNLMIECEPEY+M +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q       
Sbjct: 685  AYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAI 744

Query: 544  XETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQS 365
             E M + PL+KEIVDHYSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+S
Sbjct: 745  NEAMTQAPLMKEIVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKS 804

Query: 364  NPGWGFDKKCQFMDKLVIEVSQQYK 290
            NPGWGFDKK QFMDKLV+EVSQ YK
Sbjct: 805  NPGWGFDKKYQFMDKLVLEVSQSYK 829


>ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown
            protein; 43598-45751 [Arabidopsis thaliana]
            gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8
            [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1|
            At1g53640/F22G10.8 [Arabidopsis thaliana]
            gi|110740318|dbj|BAF02054.1| hypothetical protein
            [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis thaliana]
          Length = 523

 Score =  289 bits (739), Expect = 2e-75
 Identities = 156/325 (48%), Positives = 197/325 (60%), Gaps = 7/325 (2%)
 Frame = -1

Query: 1243 PNDVINVI-------SGVGRGKPMKSPAPQSEKPKAENRHIRQRQQXXXXXXXXXXXXXX 1085
            P+++ N +       SG GRGKP+   AP  ++   + R      Q              
Sbjct: 200  PDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQIRRPPPPPQQQRVQPQQKRAPTV 259

Query: 1084 XXXSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 905
               +P+ QLS EE  ++A+  LS                                     
Sbjct: 260  KDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRD 319

Query: 904  GDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLD 725
             D + EE + EA  ++ GD AD EK A+K+GPE+M  LAEGFEE+  + LPS   DA +D
Sbjct: 320  -DKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIID 378

Query: 724  AYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXX 545
            AY TNLMIECEPEY+M +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q       
Sbjct: 379  AYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAI 438

Query: 544  XETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQS 365
             E M + PL+KEIVDHYSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+S
Sbjct: 439  NEAMTQAPLMKEIVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKS 498

Query: 364  NPGWGFDKKCQFMDKLVIEVSQQYK 290
            NPGWGFDKK QFMDKLV+EVSQ YK
Sbjct: 499  NPGWGFDKKYQFMDKLVLEVSQSYK 523


>gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain
            [Arabidopsis thaliana]
          Length = 523

 Score =  289 bits (739), Expect = 2e-75
 Identities = 156/325 (48%), Positives = 197/325 (60%), Gaps = 7/325 (2%)
 Frame = -1

Query: 1243 PNDVINVI-------SGVGRGKPMKSPAPQSEKPKAENRHIRQRQQXXXXXXXXXXXXXX 1085
            P+++ N +       SG GRGKP+   AP  ++   + R      Q              
Sbjct: 200  PDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQIRRPPPPPQQQRVQPQQKRAPTV 259

Query: 1084 XXXSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 905
               +P+ QLS EE  ++A+  LS                                     
Sbjct: 260  KDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRD 319

Query: 904  GDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLD 725
             D + EE + EA  ++ GD AD EK A+K+GPE+M  LAEGFEE+  + LPS   DA +D
Sbjct: 320  -DKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIID 378

Query: 724  AYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXX 545
            AY TNLMIECEPEY+M +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q       
Sbjct: 379  AYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAI 438

Query: 544  XETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQS 365
             E M + PL+KEIVDHYSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+S
Sbjct: 439  NEAMTQAPLMKEIVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKS 498

Query: 364  NPGWGFDKKCQFMDKLVIEVSQQYK 290
            NPGWGFDKK QFMDKLV+EVSQ YK
Sbjct: 499  NPGWGFDKKYQFMDKLVLEVSQSYK 523


>ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550322664|gb|EEF06007.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 466

 Score =  288 bits (738), Expect = 3e-75
 Identities = 167/335 (49%), Positives = 199/335 (59%), Gaps = 5/335 (1%)
 Frame = -1

Query: 1279 ESEIPAIQEKPLPNDVINVISGVGRGKPMKSPAPQSEKPKAENRHIRQRQQXXXXXXXXX 1100
            ESE P   E  LP  +++ + G GRGKP+K   P  E  K ENRH+R R Q         
Sbjct: 133  ESEPPKKAEANLPPSILSGLGGAGRGKPVKQEVP-IEPAKEENRHLRARSQPRSQPRTRQ 191

Query: 1099 XXXXXXXXS--PREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 926
                    +     ++ ++E VKKA E+LS                              
Sbjct: 192  QKTPDGDDAVPATTKMGRQEAVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGAR 251

Query: 925  XXXXXXXGDDR-YEESDDE-ASGLYL-GDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVL 755
                      R Y + + E  SG+ L G   DEEK AQ +G E MN L E FEEMS RVL
Sbjct: 252  GGGRGRGRGRRGYGDKEVEYGSGMSLEGHEEDEEKFAQSVGVETMNTLVEAFEEMSGRVL 311

Query: 754  PSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGI 575
            P P++D Y+DA+ TN   E EPEYLM EF  NPDIDEKPP+PLRDALEK+KPF+MAY GI
Sbjct: 312  PCPIEDEYVDAFDTNCSFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEKVKPFMMAYMGI 371

Query: 574  QSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRF 395
            ++         ETMK  PL+K+IVD YSGPDRV+ K+Q+EELERVAKT+PASAPD VK F
Sbjct: 372  KTHEEWEEIVEETMKDAPLMKKIVDSYSGPDRVSGKKQKEELERVAKTIPASAPDSVKSF 431

Query: 394  TERAVLSLQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
             +RAVLSLQSNPGWGFDKKC FMDKL  EVSQ YK
Sbjct: 432  ADRAVLSLQSNPGWGFDKKCMFMDKLAKEVSQHYK 466


>ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum]
            gi|557089350|gb|ESQ30058.1| hypothetical protein
            EUTSA_v10011382mg [Eutrema salsugineum]
          Length = 531

 Score =  286 bits (732), Expect = 1e-74
 Identities = 161/328 (49%), Positives = 196/328 (59%), Gaps = 10/328 (3%)
 Frame = -1

Query: 1243 PNDVINVISGVGRGKPMKSPAPQSEKPKAENRHIR----------QRQQXXXXXXXXXXX 1094
            PN  I   SG GRGKP    AP  ++   ENRHIR          Q++            
Sbjct: 209  PNQRIVPGSGAGRGKPFVESAPLQQE---ENRHIRRPQPPPPQQQQQRSQPQPQHQQKRV 265

Query: 1093 XXXXXXSPREQLSQEEKVKKAKEILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 914
                  +PR +LS EE  ++A+  LS                                  
Sbjct: 266  QPPKDEAPRPKLSIEEAGRRARSQLSRGEAEGGGLRGRGGGRGRGRGARGRGRGRGGEGW 325

Query: 913  XXXGDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDA 734
                 +  EE++ EA   ++GD AD EK A K+GPE+M  LA+G+E++  R LPS  +DA
Sbjct: 326  RDVKME--EEAEQEAISTFVGDSADGEKFANKMGPEIMKMLADGYEDICERALPSTANDA 383

Query: 733  YLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXX 554
             LDAY TNLMIECEPEYLM  FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q    
Sbjct: 384  VLDAYETNLMIECEPEYLMPAFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWE 443

Query: 553  XXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLS 374
                E M + PLIKEIVDHYSGPDRVTAK+Q EEL+R+A T+P SAPD VKRF +RA LS
Sbjct: 444  EAIDEVMAQAPLIKEIVDHYSGPDRVTAKKQNEELDRIATTVPKSAPDSVKRFADRAALS 503

Query: 373  LQSNPGWGFDKKCQFMDKLVIEVSQQYK 290
            L+SNPGWGFDKK QFMDKLV EVSQ YK
Sbjct: 504  LKSNPGWGFDKKYQFMDKLVAEVSQSYK 531

