BLASTX nr result

ID: Mentha22_contig00018599 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00018599
         (615 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus...   141   2e-31
gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]       122   8e-26
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...   109   6e-22
ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...   108   1e-21
ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citr...   107   4e-21
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...   106   5e-21
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...   105   8e-21
ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [A...   104   2e-20
gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot...   104   2e-20
ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein...   104   2e-20
gb|AAM65660.1| Contains similarity to RNA-binding protein from A...   104   2e-20
ref|XP_007014541.1| Hydroxyproline-rich glycoprotein family prot...   103   3e-20
ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family prot...   103   3e-20
ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Caps...   102   1e-19
ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp....    99   1e-18
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr...    97   4e-18
ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prun...    95   2e-17
ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutr...    94   4e-17
ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507...    88   2e-15
ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phas...    87   3e-15

>gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus guttatus]
          Length = 493

 Score =  141 bits (355), Expect = 2e-31
 Identities = 93/251 (37%), Positives = 128/251 (50%), Gaps = 47/251 (18%)
 Frame = +3

Query: 3   PPKPDVNEPFRF---GEAQSGWTESESPPPKDKALPTGILGILSGAGRGKP-TIPSAPHP 170
           PPKP+V  PF F    E Q+   ESE P  ++  L + I+ +LSGAGRGKP   P+A  P
Sbjct: 131 PPKPNVKLPFLFVKDEEEQADAAESEVPSAQETLLRSDIVSVLSGAGRGKPGKPPTAAQP 190

Query: 171 EKT---------RQTDGREP-----NKGTPVREQLSQEEKVRKAKEILS---KXXXXXXX 299
           EK          R   G+ P     +   P   QLS+EE V+KAKEILS   +       
Sbjct: 191 EKPQSENRHIRQRPPQGKPPVAVSSDGAAPPAVQLSKEEMVKKAKEILSKGDEDGGVSRP 250

Query: 300 XXXXXXXXXXXXXXXXXXXXDQSRGRFSKDGA--------------------------AD 401
                               ++ RGR    G                           AD
Sbjct: 251 EVRDNRDNRDNRGGGRGGRGERGRGRGRGRGRGRGRGRGDDRYEESDDESDALFIGDPAD 310

Query: 402 REKLAKKLGPEIMSKVVEGLEEMASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTN 581
            EK+A+KLGP++M+++ EG++EM+S+ +P P  +A ++AF+T++ +EC PEY MEEFGTN
Sbjct: 311 EEKVAQKLGPDVMAQLAEGIDEMSSRVLPSPFDDAYMDAFETNLRIECEPEYLMEEFGTN 370

Query: 582 PDIDEKAPMPL 614
           PDIDEK P+PL
Sbjct: 371 PDIDEKPPIPL 381


>gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]
          Length = 426

 Score =  122 bits (306), Expect = 8e-26
 Identities = 79/202 (39%), Positives = 107/202 (52%), Gaps = 22/202 (10%)
 Frame = +3

Query: 75  PPPKDKALPTGILGILSGAGRGKP------TIPSAPHPEKTRQTDGREPNKGTPVREQLS 236
           PPP+D A    IL  LSG GRG P      T+   P     RQ   R P+      +QLS
Sbjct: 114 PPPRDTAALDDILTNLSGMGRGTPGKPPPQTLKPTPINRHIRQPQPR-PSTALSPDQQLS 172

Query: 237 QEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXXDQS-RGRFSKDGAA----- 398
           +EEK++KA EILS+                             S RGR  +  AA     
Sbjct: 173 KEEKLKKAVEILSRGDPDRGPIRSPTGRGRGRGRGRGGRGGRFSGRGRGREADAAIESDE 232

Query: 399 ----------DREKLAKKLGPEIMSKVVEGLEEMASKAVPDPHKEALVNAFQTDIMLECL 548
                     D +K+A+KLG E+M+K+ EG+EEM+S+ +P    +A V+A+ T+++LEC 
Sbjct: 233 ELPGMFGDPADEQKVAEKLGVEVMNKITEGMEEMSSRVLPSLIDDAYVDAYHTNLLLECE 292

Query: 549 PEYFMEEFGTNPDIDEKAPMPL 614
           PEYFME+FGTNPDID+K P+PL
Sbjct: 293 PEYFMEDFGTNPDIDDKPPIPL 314


>ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum
           lycopersicum] gi|460368563|ref|XP_004230135.1|
           PREDICTED: uncharacterized protein LOC101247662 isoform
           2 [Solanum lycopersicum]
          Length = 473

 Score =  109 bits (273), Expect = 6e-22
 Identities = 76/247 (30%), Positives = 117/247 (47%), Gaps = 43/247 (17%)
 Frame = +3

Query: 3   PPKPD------VNEPFRFGEAQ----SGWTESESPPPKDKA-LPTGILGILSGAGRGKPT 149
           PP+P       + +P  F + +    S  + S +P P+D + LP+ ++ +L+GAGRGKP 
Sbjct: 115 PPQPQQQQQQPLRKPIFFAKEEETTDSNSSSSNAPKPRDDSNLPSSVISVLTGAGRGKPL 174

Query: 150 IPSAPHPEKTRQTDGR-EPNK----------GTPVREQLSQEEKVRKAKEILSKXXXXXX 296
             ++   EK ++ +    P +           +P  ++LS+E+ V+KA  ILS+      
Sbjct: 175 QTASSVSEKPKEENRHLRPRQQKVADSGERASSPPPQRLSREDAVKKAVGILSRSDDGDV 234

Query: 297 XXXXXXXXXXXXXXXXXXXXXDQSRGR---------------------FSKDGAADREKL 413
                                   RGR                     F     AD EKL
Sbjct: 235 GGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDEERGDGNLESGFYLGDDADGEKL 294

Query: 414 AKKLGPEIMSKVVEGLEEMASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDID 593
           A KLGPE M+ + EG EEM+++ +P P  +A + A  T++M+EC PEY M +F +NPDID
Sbjct: 295 AAKLGPESMNTLAEGFEEMSARVLPSPMDDAYLEALHTNMMIECEPEYLMGDFESNPDID 354

Query: 594 EKAPMPL 614
           E  P+PL
Sbjct: 355 ETPPIPL 361


>ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum]
          Length = 480

 Score =  108 bits (270), Expect = 1e-21
 Identities = 72/227 (31%), Positives = 112/227 (49%), Gaps = 37/227 (16%)
 Frame = +3

Query: 45  AQSGWTESESPPPKDKA-LPTGILGILSGAGRGKPTIPSAPHPEKTRQTDGR-EPNK--- 209
           A S  + S++P P+D + L + ++ +L+GAGRGKP   ++P  EK ++ +    P +   
Sbjct: 142 ADSNSSSSDAPTPRDDSNLSSSVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQKV 201

Query: 210 -------GTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXX--- 359
                   +P  ++LS+E+ V+KA  ILS+                              
Sbjct: 202 ADSGERASSPPPQRLSREDAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVR 261

Query: 360 ----------------DQSRGRFSKDGA------ADREKLAKKLGPEIMSKVVEGLEEMA 473
                           D+ RG  S +        AD EKLA+KLGPE M+ + EG EEM+
Sbjct: 262 GRGGRGRGRGRGRGRRDEERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMS 321

Query: 474 SKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPL 614
           ++ +P P  +A + A  T++M+EC PEY M +F +NPDIDE  P+PL
Sbjct: 322 ARVLPSPMDDAYIEALHTNMMIECEPEYLMGDFESNPDIDETPPIPL 368


>ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citrus clementina]
           gi|557544515|gb|ESR55493.1| hypothetical protein
           CICLE_v10019766mg [Citrus clementina]
          Length = 511

 Score =  107 bits (266), Expect = 4e-21
 Identities = 80/247 (32%), Positives = 113/247 (45%), Gaps = 44/247 (17%)
 Frame = +3

Query: 6   PKPDVN--EPFRFGEAQSGWTESESPPPKDKALPTGILGILSGAGRGKPTI--------- 152
           P+PD    +P  F   +S    ++S  P +  LP+ I+  L GAGRGK  +         
Sbjct: 171 PRPDAQPAKPRTFTPNESA---TDSTQPSEPNLPSSIISTLPGAGRGKTVVTQQQQQQQH 227

Query: 153 ----PSAPHPEKTRQTDGREPNKGTP----------VREQLSQEEKVRKAKEILSKXXXX 290
               P  P  E+ R    R   +  P           + +LS+E+ V+ A +ILS+    
Sbjct: 228 QRQQPGPPPQEENRHIRARLQPQPRPEKAPAAETGSAQPKLSKEDAVKMAMKILSRGEEG 287

Query: 291 XXXXXXXXXXXXXXXXXXXXXXX---DQSRGRFSK-------DGA---------ADREKL 413
                                      Q RGR  +       DG          AD EKL
Sbjct: 288 EGEGISAGGPGRGRGMGRGGGRGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKL 347

Query: 414 AKKLGPEIMSKVVEGLEEMASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDID 593
           A+K+G E M+ +VEG EEM+ + +P P ++A ++A  T+ M+E  PEY MEEFGTNPDID
Sbjct: 348 AEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPDID 407

Query: 594 EKAPMPL 614
           EK P+PL
Sbjct: 408 EKPPIPL 414


>ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis]
          Length = 407

 Score =  106 bits (265), Expect = 5e-21
 Identities = 77/246 (31%), Positives = 112/246 (45%), Gaps = 43/246 (17%)
 Frame = +3

Query: 6   PKPDVNEPFRFGEAQSGWTESESPPPKDKALPTGILGILSGAGRGKPTI----------- 152
           P+PD  +P +        + ++S  P +  LP+ I+  L GAGRGK  +           
Sbjct: 51  PRPDA-QPAKPRTCTPNESATDSTQPSEPNLPSSIISTLPGAGRGKTAVTQQQQQQQQHQ 109

Query: 153 ---PSAPHPEKTRQTDGREPNKGTP----------VREQLSQEEKVRKAKEILSKXXXXX 293
              P  P  E+ R    R   +  P           + +LS+E+ V+ A ++LS+     
Sbjct: 110 RQQPGPPPQEENRHIRARLQPQPRPEKAPAAETGSAQPKLSKEDAVKMAMKVLSRGEEGE 169

Query: 294 XXXXXXXXXXXXXXXXXXXXXX---DQSRGRFSK-------DGA---------ADREKLA 416
                                     Q RGR  +       DG          AD EKLA
Sbjct: 170 GEGISAGGPGRGRGMGRGRGRGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLA 229

Query: 417 KKLGPEIMSKVVEGLEEMASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDE 596
           +K+G E M+ +VEG EEM+ + +P P ++A ++A  T+ M+E  PEY MEEFGTNPDIDE
Sbjct: 230 EKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPDIDE 289

Query: 597 KAPMPL 614
           K P+PL
Sbjct: 290 KPPIPL 295


>ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis]
           gi|223537066|gb|EEF38701.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 436

 Score =  105 bits (263), Expect = 8e-21
 Identities = 75/205 (36%), Positives = 101/205 (49%), Gaps = 20/205 (9%)
 Frame = +3

Query: 60  TESESPPPKDKALPTGILGILSGAGRGKPTIPSAPHPE-KTRQTDGREPNKGTPVREQ-- 230
           TES+S    D  LP+ I   LSG GRG+P  P  P P+ K      R+ ++  P  E+  
Sbjct: 125 TESQS----DSVLPSTIHSSLSGFGRGEPDKPVVPTPQVKEENRHIRDRSRAKPKTEEAE 180

Query: 231 ------LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXXDQSR------- 371
                 +S+EE V++A  ILS+                            + R       
Sbjct: 181 VRAKPKISREEAVKRAVSILSQGDTGEGMGRGRGGGRGRGRGRGRGRLEQRGRMMDDVDE 240

Query: 372 ----GRFSKDGAADREKLAKKLGPEIMSKVVEGLEEMASKAVPDPHKEALVNAFQTDIML 539
               G F  D A D EKLA K+G E M+K+VEG EEM+ + +P P ++A ++A  T+ M+
Sbjct: 241 GFGSGLFLGDNA-DGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDALHTNYMI 299

Query: 540 ECLPEYFMEEFGTNPDIDEKAPMPL 614
           E  PEY M EF  NPDIDEK PMPL
Sbjct: 300 EFEPEYLMGEFDQNPDIDEKPPMPL 324


>ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [Amborella trichopoda]
           gi|548839984|gb|ERN00220.1| hypothetical protein
           AMTR_s00111p00111440 [Amborella trichopoda]
          Length = 447

 Score =  104 bits (260), Expect = 2e-20
 Identities = 72/214 (33%), Positives = 103/214 (48%), Gaps = 27/214 (12%)
 Frame = +3

Query: 54  GWTESESPPPKDKALPTGILGI-LSGAGRGKPTIPSAPH---PEKTRQTDGREP------ 203
           G  ++++ PP +  LP  I    + G GRGKPT P   H    E+ R    R P      
Sbjct: 142 GRVQAQNLPPTESPLPRSISPAPIEGFGRGKPTSPLLSHGIEEEENRHIRRRSPPPERAG 201

Query: 204 --NKGTPVREQ-LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXXDQSRG 374
             ++G    E+ LS EE VR AK+ILS+                              +G
Sbjct: 202 QASRGRASNERKLSSEEAVRNAKDILSRGEGRGGRGLRGGRGLRGGRGRGGVWAGRGRQG 261

Query: 375 RFSK--------------DGAADREKLAKKLGPEIMSKVVEGLEEMASKAVPDPHKEALV 512
           R ++                 AD EKL K+LG E ++++ E  +EM+ + +P P +EA +
Sbjct: 262 RGARYQDRREDDSVGLYLGDDADGEKLVKRLGEENVNQIFEAFDEMSGRVLPSPMEEAYL 321

Query: 513 NAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPL 614
           +A  T+ ++E  PEY MEEFGTNPDIDEK P+PL
Sbjct: 322 DALHTNCLIEFEPEYHMEEFGTNPDIDEKPPIPL 355


>gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain.
            ESTs gb|H37317, gb|F14415, gb|AA651290 come from this
            gene [Arabidopsis thaliana]
          Length = 829

 Score =  104 bits (259), Expect = 2e-20
 Identities = 74/228 (32%), Positives = 101/228 (44%), Gaps = 45/228 (19%)
 Frame = +3

Query: 66   SESPPPKDKA----LPTGILGIL-------SGAGRGKPTIPSAPHPEKTRQTDGREP--- 203
            S  PPP+ K      P  I   L       SGAGRGKP + SAP  ++  +   R P   
Sbjct: 491  SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQIRRPPPPP 550

Query: 204  ---------------NKGTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXX 338
                             GTP + QLS EE  R+A+  LS+                    
Sbjct: 551  QQQRVQPQQKRAPTVKDGTP-KPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRGR 609

Query: 339  XXXXXXX----------------DQSRGRFSKDGAADREKLAKKLGPEIMSKVVEGLEEM 470
                                   +Q   R     +AD EK A+K+GPE+M  + EG EE+
Sbjct: 610  GARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEI 669

Query: 471  ASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPL 614
              KA+P    +A+++A+ T++M+EC PEY M +FG+NPDIDEK PM L
Sbjct: 670  CEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSL 717


>ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
           thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown
           protein; 43598-45751 [Arabidopsis thaliana]
           gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8
           [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1|
           At1g53640/F22G10.8 [Arabidopsis thaliana]
           gi|110740318|dbj|BAF02054.1| hypothetical protein
           [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis thaliana]
          Length = 523

 Score =  104 bits (259), Expect = 2e-20
 Identities = 74/228 (32%), Positives = 101/228 (44%), Gaps = 45/228 (19%)
 Frame = +3

Query: 66  SESPPPKDKA----LPTGILGIL-------SGAGRGKPTIPSAPHPEKTRQTDGREP--- 203
           S  PPP+ K      P  I   L       SGAGRGKP + SAP  ++  +   R P   
Sbjct: 185 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQIRRPPPPP 244

Query: 204 ---------------NKGTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXX 338
                            GTP + QLS EE  R+A+  LS+                    
Sbjct: 245 QQQRVQPQQKRAPTVKDGTP-KPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRGR 303

Query: 339 XXXXXXX----------------DQSRGRFSKDGAADREKLAKKLGPEIMSKVVEGLEEM 470
                                  +Q   R     +AD EK A+K+GPE+M  + EG EE+
Sbjct: 304 GARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEI 363

Query: 471 ASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPL 614
             KA+P    +A+++A+ T++M+EC PEY M +FG+NPDIDEK PM L
Sbjct: 364 CEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSL 411


>gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis
           thaliana gi|2129727 and contains RNA recognition
           PF|00076 domain [Arabidopsis thaliana]
          Length = 523

 Score =  104 bits (259), Expect = 2e-20
 Identities = 74/228 (32%), Positives = 101/228 (44%), Gaps = 45/228 (19%)
 Frame = +3

Query: 66  SESPPPKDKA----LPTGILGIL-------SGAGRGKPTIPSAPHPEKTRQTDGREP--- 203
           S  PPP+ K      P  I   L       SGAGRGKP + SAP  ++  +   R P   
Sbjct: 185 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQIRRPPPPP 244

Query: 204 ---------------NKGTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXX 338
                            GTP + QLS EE  R+A+  LS+                    
Sbjct: 245 QQQRVQPQQKRAPTVKDGTP-KPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRGR 303

Query: 339 XXXXXXX----------------DQSRGRFSKDGAADREKLAKKLGPEIMSKVVEGLEEM 470
                                  +Q   R     +AD EK A+K+GPE+M  + EG EE+
Sbjct: 304 GARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEEI 363

Query: 471 ASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPL 614
             KA+P    +A+++A+ T++M+EC PEY M +FG+NPDIDEK PM L
Sbjct: 364 CEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSL 411


>ref|XP_007014541.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
           [Theobroma cacao] gi|508784904|gb|EOY32160.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative isoform 2 [Theobroma cacao]
          Length = 403

 Score =  103 bits (258), Expect = 3e-20
 Identities = 80/241 (33%), Positives = 103/241 (42%), Gaps = 37/241 (15%)
 Frame = +3

Query: 3   PPKPDVNEPFRFGEAQSGWTES------ESPPPKDKALPTGIL--GILSGAGRGKPTIPS 158
           PP     +P    +     TES      E     +   P  IL   +LSGAGRGKP    
Sbjct: 125 PPPAQAKQPIFIKKKDEDETESSAKAAAEPIQSSEPIFPPNILPVSVLSGAGRGKPV--K 182

Query: 159 APHPEKTRQTDGRE---PNKGTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXX 329
            P P   RQ + R      + +P   Q+SQEE  +KA  ILS+                 
Sbjct: 183 QPEPASRRQEENRHIRVAQQQSP-SAQMSQEEATKKAMGILSRRSESGESGMVGRGGRAS 241

Query: 330 XXXXXXXXXX-----DQSRGRFSKDGA---------------------ADREKLAKKLGP 431
                             RGR  + G                      AD EK A+ +G 
Sbjct: 242 MGMGGGRGRGRGRGRGMGRGRGRRQGEDTRIVKDSGEGSADGLYLGDNADGEKFAQTIGA 301

Query: 432 EIMSKVVEGLEEMASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMP 611
           + M+K+VEG EEM S+ +P P  +A ++A  T+  +E  PEY MEEFGTNPDIDEK PMP
Sbjct: 302 DNMNKLVEGFEEMGSRVLPSPMDDAYLDALHTNCSIEFEPEYLMEEFGTNPDIDEKPPMP 361

Query: 612 L 614
           L
Sbjct: 362 L 362


>ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
           [Theobroma cacao] gi|508784903|gb|EOY32159.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative isoform 1 [Theobroma cacao]
          Length = 474

 Score =  103 bits (258), Expect = 3e-20
 Identities = 80/241 (33%), Positives = 103/241 (42%), Gaps = 37/241 (15%)
 Frame = +3

Query: 3   PPKPDVNEPFRFGEAQSGWTES------ESPPPKDKALPTGIL--GILSGAGRGKPTIPS 158
           PP     +P    +     TES      E     +   P  IL   +LSGAGRGKP    
Sbjct: 125 PPPAQAKQPIFIKKKDEDETESSAKAAAEPIQSSEPIFPPNILPVSVLSGAGRGKPV--K 182

Query: 159 APHPEKTRQTDGRE---PNKGTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXX 329
            P P   RQ + R      + +P   Q+SQEE  +KA  ILS+                 
Sbjct: 183 QPEPASRRQEENRHIRVAQQQSP-SAQMSQEEATKKAMGILSRRSESGESGMVGRGGRAS 241

Query: 330 XXXXXXXXXX-----DQSRGRFSKDGA---------------------ADREKLAKKLGP 431
                             RGR  + G                      AD EK A+ +G 
Sbjct: 242 MGMGGGRGRGRGRGRGMGRGRGRRQGEDTRIVKDSGEGSADGLYLGDNADGEKFAQTIGA 301

Query: 432 EIMSKVVEGLEEMASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMP 611
           + M+K+VEG EEM S+ +P P  +A ++A  T+  +E  PEY MEEFGTNPDIDEK PMP
Sbjct: 302 DNMNKLVEGFEEMGSRVLPSPMDDAYLDALHTNCSIEFEPEYLMEEFGTNPDIDEKPPMP 361

Query: 612 L 614
           L
Sbjct: 362 L 362


>ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Capsella rubella]
           gi|482575944|gb|EOA40131.1| hypothetical protein
           CARUB_v10008838mg [Capsella rubella]
          Length = 525

 Score =  102 bits (253), Expect = 1e-19
 Identities = 74/226 (32%), Positives = 101/226 (44%), Gaps = 43/226 (19%)
 Frame = +3

Query: 66  SESPPPKDKA----LPTGILGIL-------SGAGRGKPTIPSAP--------------HP 170
           S  P P+ K+    LP  +   L       SGAGRGKP + SAP               P
Sbjct: 189 SSPPAPESKSGQTDLPDNVFNALGSEIPHSSGAGRGKPLVESAPIQREENRHIRRPPPPP 248

Query: 171 EKTR----QTDGREPNKGTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXX 338
           ++ R    Q   + P   TP R +LS EE  R+A+  LS+                    
Sbjct: 249 QQQRSQPQQKRAQTPRDETP-RPRLSAEEAGRRARSELSRGEAEGSGVRGRGGRGRGRGA 307

Query: 339 XXXXXXXDQSRGRFSKD--------------GAADREKLAKKLGPEIMSKVVEGLEEMAS 476
                       R  K                +AD EK A K+GPE+M  + EG EE+  
Sbjct: 308 RGRGRGRGGEGWRDDKKEEEGEQEAMSVFAGDSADGEKFANKMGPELMKTLAEGFEEVCE 367

Query: 477 KAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPL 614
           KA+P    +A+++A+ T++M+EC PEY M +FG+NPDIDEK PM L
Sbjct: 368 KALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSL 413


>ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297340299|gb|EFH70716.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 769

 Score = 98.6 bits (244), Expect = 1e-18
 Identities = 64/199 (32%), Positives = 91/199 (45%), Gaps = 36/199 (18%)
 Frame = +3

Query: 126  GAGRGKPTIPSAP----------------HPEKTRQTDGREPNKGTPV------REQLSQ 239
            GAGRGKP + SAP                 P++ +Q   +   K  P       + QLS+
Sbjct: 459  GAGRGKPLVESAPIQQEDNRQIRRPQPPPPPQQQQQQRAQPQQKRAPTVKDEAPKPQLSR 518

Query: 240  EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXXDQSRGRFSKD---------- 389
            EE  R+A+  LS+                                R  K           
Sbjct: 519  EEAGRRARSELSRGEAEGGGVRGRGGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMS 578

Query: 390  ----GAADREKLAKKLGPEIMSKVVEGLEEMASKAVPDPHKEALVNAFQTDIMLECLPEY 557
                 +AD EK A+K+GPE+M  + EG EE+  KA+P    +A+++A+ T++M+EC PEY
Sbjct: 579  IFAGDSADGEKFAQKMGPELMKTLAEGFEEVCEKALPSTTHDAIIDAYDTNLMIECEPEY 638

Query: 558  FMEEFGTNPDIDEKAPMPL 614
             M +FG+NPDIDEK PM L
Sbjct: 639  IMADFGSNPDIDEKPPMSL 657


>ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
           gi|550322664|gb|EEF06007.2| hydroxyproline-rich
           glycoprotein [Populus trichocarpa]
          Length = 466

 Score = 97.1 bits (240), Expect = 4e-18
 Identities = 73/232 (31%), Positives = 101/232 (43%), Gaps = 40/232 (17%)
 Frame = +3

Query: 39  GEAQSGWTESESPPPK--DKALPTGILGILSGAGRGKPT---IPSAPHPE---------- 173
           G ++S  +  ES PPK  +  LP  IL  L GAGRGKP    +P  P  E          
Sbjct: 123 GPSRSTESRPESEPPKKAEANLPPSILSGLGGAGRGKPVKQEVPIEPAKEENRHLRARSQ 182

Query: 174 -----KTRQTDGREPNKGTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXX 338
                +TRQ    + +   P   ++ ++E V+KA E+LS+                    
Sbjct: 183 PRSQPRTRQQKTPDGDDAVPATTKMGRQEAVKKAMELLSRGGGEGEVGGRGGGRGSFVPG 242

Query: 339 XXXXXXXDQSRGR-------------------FSKDG-AADREKLAKKLGPEIMSKVVEG 458
                   +  GR                    S +G   D EK A+ +G E M+ +VE 
Sbjct: 243 RGGGRGGARGGGRGRGRGRRGYGDKEVEYGSGMSLEGHEEDEEKFAQSVGVETMNTLVEA 302

Query: 459 LEEMASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPL 614
            EEM+ + +P P ++  V+AF T+   E  PEY M EF  NPDIDEK PMPL
Sbjct: 303 FEEMSGRVLPCPIEDEYVDAFDTNCSFEFEPEYLMGEFDKNPDIDEKPPMPL 354


>ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica]
           gi|462409156|gb|EMJ14490.1| hypothetical protein
           PRUPE_ppa006080mg [Prunus persica]
          Length = 428

 Score = 94.7 bits (234), Expect = 2e-17
 Identities = 63/169 (37%), Positives = 82/169 (48%), Gaps = 4/169 (2%)
 Frame = +3

Query: 120 LSGAGRGKP---TIPSAPHPEKTRQTDGR-EPNKGTPVREQLSQEEKVRKAKEILSKXXX 287
           L G+GRGKP   T P     E+ R    R EP+   P         + R  + +  +   
Sbjct: 150 LPGSGRGKPMNFTRPEVQVKEENRHIQARPEPDPNQPRTRPRGPNGRGR-GRGMRGRGRG 208

Query: 288 XXXXXXXXXXXXXXXXXXXXXXXXDQSRGRFSKDGAADREKLAKKLGPEIMSKVVEGLEE 467
                                     + G +  D A D EKLAKKLGPEIM+K+VE  EE
Sbjct: 209 RGRGRGDFRMSERGDRRRGKDSDGSYASGLYLGDNA-DGEKLAKKLGPEIMNKLVERFEE 267

Query: 468 MASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPL 614
           M+S+ +P P  +A V+A  T+ M+EC PEY M EF  NPDIDEK P+ L
Sbjct: 268 MSSEVLPSPLDDAYVDAMHTNFMIECEPEYLMGEFNKNPDIDEKPPISL 316


>ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum]
           gi|557089350|gb|ESQ30058.1| hypothetical protein
           EUTSA_v10011382mg [Eutrema salsugineum]
          Length = 531

 Score = 93.6 bits (231), Expect = 4e-17
 Identities = 68/223 (30%), Positives = 95/223 (42%), Gaps = 40/223 (17%)
 Frame = +3

Query: 66  SESPPPKDKALPTGILGILSGAGRGKPTIPSAP------------------------HPE 173
           SE   P  + +P       SGAGRGKP + SAP                         P+
Sbjct: 204 SEFSQPNQRIVPG------SGAGRGKPFVESAPLQQEENRHIRRPQPPPPQQQQQRSQPQ 257

Query: 174 KTRQTDGREPNKGTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXX 353
              Q    +P K    R +LS EE  R+A+  LS+                         
Sbjct: 258 PQHQQKRVQPPKDEAPRPKLSIEEAGRRARSQLSRGEAEGGGLRGRGGGRGRGRGARGRG 317

Query: 354 XXDQSRG----------------RFSKDGAADREKLAKKLGPEIMSKVVEGLEEMASKAV 485
                 G                 F  D +AD EK A K+GPEIM  + +G E++  +A+
Sbjct: 318 RGRGGEGWRDVKMEEEAEQEAISTFVGD-SADGEKFANKMGPEIMKMLADGYEDICERAL 376

Query: 486 PDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPL 614
           P    +A+++A++T++M+EC PEY M  FG+NPDIDEK PM L
Sbjct: 377 PSTANDAVLDAYETNLMIECEPEYLMPAFGSNPDIDEKPPMSL 419


>ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum]
          Length = 504

 Score = 88.2 bits (217), Expect = 2e-15
 Identities = 65/212 (30%), Positives = 90/212 (42%), Gaps = 31/212 (14%)
 Frame = +3

Query: 72  SPPPKDKALPTGILGILSGAGRGKPTIPSAPHP---EKTRQTDGREPNKGTPVREQLSQE 242
           S    D      +L +LSGAGRGKP  P+       E+ R    R  +     +  L+ +
Sbjct: 182 SDQESDNRFSMSVLKVLSGAGRGKPIEPAVSETQVVEENRHVRNRRASDVPMRQPMLTGD 241

Query: 243 EKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXX-----DQSRGRFSKDGAADR- 404
             ++ A++ LSK                                 + RG F   G  DR 
Sbjct: 242 GALQNARKYLSKFDGDGSGSGRGGEPRERGAFGRGRGRGRGRGRGRGRGGFRGTGGDDRF 301

Query: 405 ----------------------EKLAKKLGPEIMSKVVEGLEEMASKAVPDPHKEALVNA 518
                                 EKLAKK+GPE+M++  EG EEM S+ +P P ++  V A
Sbjct: 302 GQIQDNARSNASGLFLGDDVDGEKLAKKVGPEVMNQFTEGFEEMISRVLPSPLEDEYVEA 361

Query: 519 FQTDIMLECLPEYFMEEFGTNPDIDEKAPMPL 614
           F  +  +E  PEY M EF +NPDIDEK P+PL
Sbjct: 362 FDINCAIEFEPEYIM-EFDSNPDIDEKEPIPL 392


>ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris]
           gi|561020640|gb|ESW19411.1| hypothetical protein
           PHAVU_006G122700g [Phaseolus vulgaris]
          Length = 532

 Score = 87.4 bits (215), Expect = 3e-15
 Identities = 75/245 (30%), Positives = 106/245 (43%), Gaps = 41/245 (16%)
 Frame = +3

Query: 3   PPKPDVNEP--FRFGEAQSGWTESESPPPKDKA--LPTGILGILSGAGRGKPTIPSAPHP 170
           PP     +P  F+  +  S  T  + P   ++A  LP  I+ +LSG GRGKP   S P  
Sbjct: 178 PPDSGPKKPIFFKREDIASPTTRDDFPIDVEQANKLPGNIIEVLSGLGRGKPMKQSDPET 237

Query: 171 EKTRQTDG-REPN-KGTPVREQL-------SQEEKVRKAKEILS---------------- 275
             T +    R P  +G    + L       S+++ VR A+  LS                
Sbjct: 238 RVTEENRHLRAPRARGAAASDTLYERQPIPSRDDAVRNARNFLSQGEDDVGGTGRGRGFR 297

Query: 276 -KXXXXXXXXXXXXXXXXXXXXXXXXXXXDQSRGRFSKDGA-----------ADREKLAK 419
            +                           D+ RGRF    A           AD EKLAK
Sbjct: 298 ERGGLGRGRGRGRGRGRGTGRGGFRGRDMDERRGRFMDAEASDDIGPYVGDDADGEKLAK 357

Query: 420 KLGPEIMSKVVEGLEEMASKAVPDPHKEALVNAFQTDIMLECLPEYFMEEFGTNPDIDEK 599
           K+GPEIM+++ EG EEMA + +P P ++  ++A   +  +E  PEY +E    NPDIDEK
Sbjct: 358 KVGPEIMNQLTEGFEEMAGRVLPSPLEDEYLDALDINYAIEFEPEYLVE--FDNPDIDEK 415

Query: 600 APMPL 614
            P+PL
Sbjct: 416 EPIPL 420


Top