BLASTX nr result

ID: Mentha24_contig00015097

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00015097
         (550 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus...   113   4e-23
gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]        99   5e-19
ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...    96   6e-18
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...    94   2e-17
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...    94   2e-17
gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot...    83   4e-14
ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein...    83   4e-14
gb|AAM65660.1| Contains similarity to RNA-binding protein from A...    83   4e-14
ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Caps...    82   9e-14
ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [A...    82   1e-13
ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp....    81   2e-13
dbj|BAD43943.1| unknown protein [Arabidopsis thaliana] gi|519705...    80   3e-13
ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citr...    80   4e-13
ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutr...    80   4e-13
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...    79   7e-13
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr...    78   1e-12
ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prun...    77   4e-12
ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phas...    75   1e-11
ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507...    74   2e-11
ref|XP_007014541.1| Hydroxyproline-rich glycoprotein family prot...    72   7e-11

>gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus guttatus]
          Length = 493

 Score =  113 bits (282), Expect = 4e-23
 Identities = 77/216 (35%), Positives = 106/216 (49%), Gaps = 37/216 (17%)
 Frame = -1

Query: 538 EAQSGWTESETPPPKDKALPTGILGVLSGAGRGKPTK-PSVPHPEK----TRQTGGREPS 374
           E Q+   ESE P  ++  L + I+ VLSGAGRGKP K P+   PEK     R    R P 
Sbjct: 147 EEQADAAESEVPSAQETLLRSDIVSVLSGAGRGKPGKPPTAAQPEKPQSENRHIRQRPPQ 206

Query: 373 QSP----NKDTAV--REQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXY 212
             P    + D A     QLS+EE V+KAKEILSK                          
Sbjct: 207 GKPPVAVSSDGAAPPAVQLSKEEMVKKAKEILSKGDEDGGVSRPEVRDNRDNRDNRGGGR 266

Query: 211 EERGDQSRGRFSGHG--------------------------AADREKLAKRLGPEIMSKV 110
             RG++ RGR  G G                           AD EK+A++LGP++M+++
Sbjct: 267 GGRGERGRGRGRGRGRGRGRGRGDDRYEESDDESDALFIGDPADEEKVAQKLGPDVMAQL 326

Query: 109 VEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 2
            EG++EM+SR +P P  +A +DAF+T++ +EC PEY
Sbjct: 327 AEGIDEMSSRVLPSPFDDAYMDAFETNLRIECEPEY 362


>gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]
          Length = 426

 Score = 99.4 bits (246), Expect = 5e-19
 Identities = 70/194 (36%), Positives = 96/194 (49%), Gaps = 26/194 (13%)
 Frame = -1

Query: 505 PPPKDKALPTGILGVLSGAGRGKPTKPSVPHPEKTRQTGG----REPSQSPNKDTAVREQ 338
           PPP+D A    IL  LSG GRG P KP    P+  + T      R+P   P+   +  +Q
Sbjct: 114 PPPRDTAALDDILTNLSGMGRGTPGKPP---PQTLKPTPINRHIRQPQPRPSTALSPDQQ 170

Query: 337 LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQSRG-RFSGHGA- 164
           LS+EEK++KA EILS+                            RG   RG RFSG G  
Sbjct: 171 LSKEEKLKKAVEILSRGDPDRGPIRSPTGRGRGRG---------RGRGGRGGRFSGRGRG 221

Query: 163 --------------------ADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVD 44
                               AD +K+A++LG E+M+K+ EG+EEM+SR +P    +A VD
Sbjct: 222 READAAIESDEELPGMFGDPADEQKVAEKLGVEVMNKITEGMEEMSSRVLPSLIDDAYVD 281

Query: 43  AFQTDIMLECLPEY 2
           A+ T+++LEC PEY
Sbjct: 282 AYHTNLLLECEPEY 295


>ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum]
          Length = 480

 Score = 95.9 bits (237), Expect = 6e-18
 Identities = 67/208 (32%), Positives = 100/208 (48%), Gaps = 30/208 (14%)
 Frame = -1

Query: 535 AQSGWTESETPPPKDKA-LPTGILGVLSGAGRGKPTKPSVPHPEKTRQTGGR-EPSQSPN 362
           A S  + S+ P P+D + L + ++ VL+GAGRGKP + + P  EK ++      P Q   
Sbjct: 142 ADSNSSSSDAPTPRDDSNLSSSVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQKV 201

Query: 361 KDTAVR------EQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXY---E 209
            D+  R      ++LS+E+ V+KA  ILS+                              
Sbjct: 202 ADSGERASSPPPQRLSREDAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVR 261

Query: 208 ERGDQSRGRFSGHGA-------------------ADREKLAKRLGPEIMSKVVEGLEEMA 86
            RG + RGR  G G                    AD EKLA++LGPE M+ + EG EEM+
Sbjct: 262 GRGGRGRGRGRGRGRRDEERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMS 321

Query: 85  SRAVPDPHKEALVDAFQTDIMLECLPEY 2
           +R +P P  +A ++A  T++M+EC PEY
Sbjct: 322 ARVLPSPMDDAYIEALHTNMMIECEPEY 349


>ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum
           lycopersicum] gi|460368563|ref|XP_004230135.1|
           PREDICTED: uncharacterized protein LOC101247662 isoform
           2 [Solanum lycopersicum]
          Length = 473

 Score = 94.0 bits (232), Expect = 2e-17
 Identities = 66/203 (32%), Positives = 97/203 (47%), Gaps = 27/203 (13%)
 Frame = -1

Query: 529 SGWTESETPPPKDKA-LPTGILGVLSGAGRGKPTKPSVPHPEKTRQTGGR-EPSQSPNKD 356
           S  + S  P P+D + LP+ ++ VL+GAGRGKP + +    EK ++      P Q    D
Sbjct: 141 SNSSSSNAPKPRDDSNLPSSVISVLTGAGRGKPLQTASSVSEKPKEENRHLRPRQQKVAD 200

Query: 355 TAVR------EQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQ 194
           +  R      ++LS+E+ V+KA  ILS+                            RG +
Sbjct: 201 SGERASSPPPQRLSREDAVKKAVGILSR-SDDGDVGGGRGMGGGFRGRGGRGAVRGRGGR 259

Query: 193 SRGRFSGHGA-------------------ADREKLAKRLGPEIMSKVVEGLEEMASRAVP 71
            RGR  G G                    AD EKLA +LGPE M+ + EG EEM++R +P
Sbjct: 260 GRGRGRGRGRRDEERGDGNLESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLP 319

Query: 70  DPHKEALVDAFQTDIMLECLPEY 2
            P  +A ++A  T++M+EC PEY
Sbjct: 320 SPMDDAYLEALHTNMMIECEPEY 342


>ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis]
           gi|223537066|gb|EEF38701.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 436

 Score = 94.0 bits (232), Expect = 2e-17
 Identities = 64/191 (33%), Positives = 93/191 (48%), Gaps = 12/191 (6%)
 Frame = -1

Query: 538 EAQSGWTESETPPPKDKALPTGILGVLSGAGRGKPTKPSVPHP---EKTRQTGGREPSQS 368
           + + G +   T    D  LP+ I   LSG GRG+P KP VP P   E+ R    R  ++ 
Sbjct: 115 DPEPGPSRQPTESQSDSVLPSTIHSSLSGFGRGEPDKPVVPTPQVKEENRHIRDRSRAKP 174

Query: 367 PNKDTAVREQ--LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQ 194
             ++  VR +  +S+EE V++A  ILS+                          E+RG  
Sbjct: 175 KTEEAEVRAKPKISREEAVKRAVSILSQGDTGEGMGRGRGGGRGRGRGRGRGRLEQRGRM 234

Query: 193 SRGRFSGHGA-------ADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQ 35
                 G G+       AD EKLA ++G E M+K+VEG EEM+ R +P P ++A +DA  
Sbjct: 235 MDDVDEGFGSGLFLGDNADGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDALH 294

Query: 34  TDIMLECLPEY 2
           T+ M+E  PEY
Sbjct: 295 TNYMIEFEPEY 305


>gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain.
            ESTs gb|H37317, gb|F14415, gb|AA651290 come from this
            gene [Arabidopsis thaliana]
          Length = 829

 Score = 83.2 bits (204), Expect = 4e-14
 Identities = 65/210 (30%), Positives = 87/210 (41%), Gaps = 39/210 (18%)
 Frame = -1

Query: 514  SETPPPKDKA----LPTGILGVL-------SGAGRGKPTKPSVP-HPEKTRQTGGREPSQ 371
            S  PPP+ K      P  I   L       SGAGRGKP   S P   E  RQ   R P  
Sbjct: 491  SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQI--RRPPP 548

Query: 370  SPN--------------KDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXX 233
             P               KD   + QLS EE  R+A+  LS+                   
Sbjct: 549  PPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRG 608

Query: 232  XXXXXXY-------------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEE 92
                                EE G+Q   R     +AD EK A+++GPE+M  + EG EE
Sbjct: 609  RGARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEE 668

Query: 91   MASRAVPDPHKEALVDAFQTDIMLECLPEY 2
            +  +A+P    +A++DA+ T++M+EC PEY
Sbjct: 669  ICEKALPSTTHDAIIDAYDTNLMIECEPEY 698


>ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
           thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown
           protein; 43598-45751 [Arabidopsis thaliana]
           gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8
           [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1|
           At1g53640/F22G10.8 [Arabidopsis thaliana]
           gi|110740318|dbj|BAF02054.1| hypothetical protein
           [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis thaliana]
          Length = 523

 Score = 83.2 bits (204), Expect = 4e-14
 Identities = 65/210 (30%), Positives = 87/210 (41%), Gaps = 39/210 (18%)
 Frame = -1

Query: 514 SETPPPKDKA----LPTGILGVL-------SGAGRGKPTKPSVP-HPEKTRQTGGREPSQ 371
           S  PPP+ K      P  I   L       SGAGRGKP   S P   E  RQ   R P  
Sbjct: 185 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQI--RRPPP 242

Query: 370 SPN--------------KDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXX 233
            P               KD   + QLS EE  R+A+  LS+                   
Sbjct: 243 PPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRG 302

Query: 232 XXXXXXY-------------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEE 92
                               EE G+Q   R     +AD EK A+++GPE+M  + EG EE
Sbjct: 303 RGARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEE 362

Query: 91  MASRAVPDPHKEALVDAFQTDIMLECLPEY 2
           +  +A+P    +A++DA+ T++M+EC PEY
Sbjct: 363 ICEKALPSTTHDAIIDAYDTNLMIECEPEY 392


>gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis
           thaliana gi|2129727 and contains RNA recognition
           PF|00076 domain [Arabidopsis thaliana]
          Length = 523

 Score = 83.2 bits (204), Expect = 4e-14
 Identities = 65/210 (30%), Positives = 87/210 (41%), Gaps = 39/210 (18%)
 Frame = -1

Query: 514 SETPPPKDKA----LPTGILGVL-------SGAGRGKPTKPSVP-HPEKTRQTGGREPSQ 371
           S  PPP+ K      P  I   L       SGAGRGKP   S P   E  RQ   R P  
Sbjct: 185 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQI--RRPPP 242

Query: 370 SPN--------------KDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXX 233
            P               KD   + QLS EE  R+A+  LS+                   
Sbjct: 243 PPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRG 302

Query: 232 XXXXXXY-------------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEE 92
                               EE G+Q   R     +AD EK A+++GPE+M  + EG EE
Sbjct: 303 RGARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEE 362

Query: 91  MASRAVPDPHKEALVDAFQTDIMLECLPEY 2
           +  +A+P    +A++DA+ T++M+EC PEY
Sbjct: 363 ICEKALPSTTHDAIIDAYDTNLMIECEPEY 392


>ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Capsella rubella]
           gi|482575944|gb|EOA40131.1| hypothetical protein
           CARUB_v10008838mg [Capsella rubella]
          Length = 525

 Score = 82.0 bits (201), Expect = 9e-14
 Identities = 60/207 (28%), Positives = 90/207 (43%), Gaps = 36/207 (17%)
 Frame = -1

Query: 514 SETPPPKDKA----LPTGILGVL-------SGAGRGKPTKPSVP--------------HP 410
           S  P P+ K+    LP  +   L       SGAGRGKP   S P               P
Sbjct: 189 SSPPAPESKSGQTDLPDNVFNALGSEIPHSSGAGRGKPLVESAPIQREENRHIRRPPPPP 248

Query: 409 EKTRQTGGREPSQSPNKDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXX 230
           ++ R    ++ +Q+P  +T  R +LS EE  R+A+  LS+                    
Sbjct: 249 QQQRSQPQQKRAQTPRDETP-RPRLSAEEAGRRARSELSRGEAEGSGVRGRGGRGRGRGA 307

Query: 229 XXXXXY-----------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEEMAS 83
                            EE G+Q         +AD EK A ++GPE+M  + EG EE+  
Sbjct: 308 RGRGRGRGGEGWRDDKKEEEGEQEAMSVFAGDSADGEKFANKMGPELMKTLAEGFEEVCE 367

Query: 82  RAVPDPHKEALVDAFQTDIMLECLPEY 2
           +A+P    +A++DA+ T++M+EC PEY
Sbjct: 368 KALPSTTHDAIIDAYDTNLMIECEPEY 394


>ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [Amborella trichopoda]
           gi|548839984|gb|ERN00220.1| hypothetical protein
           AMTR_s00111p00111440 [Amborella trichopoda]
          Length = 447

 Score = 81.6 bits (200), Expect = 1e-13
 Identities = 61/196 (31%), Positives = 87/196 (44%), Gaps = 21/196 (10%)
 Frame = -1

Query: 526 GWTESETPPPKDKALPTGILGV-LSGAGRGKPTKPSVPH---PEKTRQTGGREP-----S 374
           G  +++  PP +  LP  I    + G GRGKPT P + H    E+ R    R P      
Sbjct: 142 GRVQAQNLPPTESPLPRSISPAPIEGFGRGKPTSPLLSHGIEEEENRHIRRRSPPPERAG 201

Query: 373 QSPNKDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXY------ 212
           Q+     +   +LS EE VR AK+ILS+                                
Sbjct: 202 QASRGRASNERKLSSEEAVRNAKDILSRGEGRGGRGLRGGRGLRGGRGRGGVWAGRGRQG 261

Query: 211 ------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEAL 50
                 + R D S G + G  A D EKL KRLG E ++++ E  +EM+ R +P P +EA 
Sbjct: 262 RGARYQDRREDDSVGLYLGDDA-DGEKLVKRLGEENVNQIFEAFDEMSGRVLPSPMEEAY 320

Query: 49  VDAFQTDIMLECLPEY 2
           +DA  T+ ++E  PEY
Sbjct: 321 LDALHTNCLIEFEPEY 336


>ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297340299|gb|EFH70716.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 769

 Score = 81.3 bits (199), Expect = 2e-13
 Identities = 55/180 (30%), Positives = 82/180 (45%), Gaps = 29/180 (16%)
 Frame = -1

Query: 454 GAGRGKPT---------------KPSVPHPEKTRQTGGREPSQ--SPN-KDTAVREQLSQ 329
           GAGRGKP                +P  P P + +Q    +P Q  +P  KD A + QLS+
Sbjct: 459 GAGRGKPLVESAPIQQEDNRQIRRPQPPPPPQQQQQQRAQPQQKRAPTVKDEAPKPQLSR 518

Query: 328 EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXY-----------EERGDQSRGR 182
           EE  R+A+  LS+                                     EE G+Q    
Sbjct: 519 EEAGRRARSELSRGEAEGGGVRGRGGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMS 578

Query: 181 FSGHGAADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 2
                +AD EK A+++GPE+M  + EG EE+  +A+P    +A++DA+ T++M+EC PEY
Sbjct: 579 IFAGDSADGEKFAQKMGPELMKTLAEGFEEVCEKALPSTTHDAIIDAYDTNLMIECEPEY 638


>dbj|BAD43943.1| unknown protein [Arabidopsis thaliana] gi|51970532|dbj|BAD43958.1|
           unknown protein [Arabidopsis thaliana]
          Length = 417

 Score = 80.5 bits (197), Expect = 3e-13
 Identities = 64/209 (30%), Positives = 86/209 (41%), Gaps = 39/209 (18%)
 Frame = -1

Query: 514 SETPPPKDKA----LPTGILGVL-------SGAGRGKPTKPSVP-HPEKTRQTGGREPSQ 371
           S  PPP+ K      P  I   L       SGAGRGKP   S P   E  RQ   R P  
Sbjct: 185 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQI--RRPPP 242

Query: 370 SPN--------------KDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXX 233
            P               KD   + QLS EE  R+A+  LS+                   
Sbjct: 243 PPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRG 302

Query: 232 XXXXXXY-------------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEE 92
                               EE G+Q   R     +AD EK A+++GPE+M  + EG EE
Sbjct: 303 RGARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEE 362

Query: 91  MASRAVPDPHKEALVDAFQTDIMLECLPE 5
           +  +A+P    +A++DA+ T++M+EC PE
Sbjct: 363 ICEKALPSTTHDAIIDAYDTNLMIECEPE 391


>ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citrus clementina]
           gi|557544515|gb|ESR55493.1| hypothetical protein
           CICLE_v10019766mg [Citrus clementina]
          Length = 511

 Score = 79.7 bits (195), Expect = 4e-13
 Identities = 60/206 (29%), Positives = 92/206 (44%), Gaps = 35/206 (16%)
 Frame = -1

Query: 514 SETPPPKDKALPTGILGVLSGAGRGKPT-------------KPSVPHPEKTRQTGGR--- 383
           +++  P +  LP+ I+  L GAGRGK               +P  P  E+ R    R   
Sbjct: 190 TDSTQPSEPNLPSSIISTLPGAGRGKTVVTQQQQQQQHQRQQPGPPPQEENRHIRARLQP 249

Query: 382 --EPSQSPNKDT-AVREQLSQEEKVRKAKEILSK-------------XXXXXXXXXXXXX 251
              P ++P  +T + + +LS+E+ V+ A +ILS+                          
Sbjct: 250 QPRPEKAPAAETGSAQPKLSKEDAVKMAMKILSRGEEGEGEGISAGGPGRGRGMGRGGGR 309

Query: 250 XXXXXXXXXXXXYEERGDQSRGRFSG---HGAADREKLAKRLGPEIMSKVVEGLEEMASR 80
                        +E  D   GRF G      AD EKLA+++G E M+ +VEG EEM+ R
Sbjct: 310 GRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVEGFEEMSGR 369

Query: 79  AVPDPHKEALVDAFQTDIMLECLPEY 2
            +P P ++A +DA  T+ M+E  PEY
Sbjct: 370 VLPSPMEDAYIDALHTNCMIEFEPEY 395


>ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum]
           gi|557089350|gb|ESQ30058.1| hypothetical protein
           EUTSA_v10011382mg [Eutrema salsugineum]
          Length = 531

 Score = 79.7 bits (195), Expect = 4e-13
 Identities = 62/215 (28%), Positives = 90/215 (41%), Gaps = 44/215 (20%)
 Frame = -1

Query: 514 SETPPPKDKALPTGILGVLSGAGRGKP---------------TKPSVPHPEKTRQTGGRE 380
           SE   P  + +P       SGAGRGKP                +P  P P++ +Q    +
Sbjct: 204 SEFSQPNQRIVPG------SGAGRGKPFVESAPLQQEENRHIRRPQPPPPQQQQQRSQPQ 257

Query: 379 PSQS-----PNKDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXX 215
           P        P KD A R +LS EE  R+A+  LS+                         
Sbjct: 258 PQHQQKRVQPPKDEAPRPKLSIEEAGRRARSQLSRGEAEGGGLRGRGGGRG--------- 308

Query: 214 YEERGDQSRGRFSGHG------------------------AADREKLAKRLGPEIMSKVV 107
              RG  +RGR  G G                        +AD EK A ++GPEIM  + 
Sbjct: 309 ---RGRGARGRGRGRGGEGWRDVKMEEEAEQEAISTFVGDSADGEKFANKMGPEIMKMLA 365

Query: 106 EGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 2
           +G E++  RA+P    +A++DA++T++M+EC PEY
Sbjct: 366 DGYEDICERALPSTANDAVLDAYETNLMIECEPEY 400


>ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis]
          Length = 407

 Score = 79.0 bits (193), Expect = 7e-13
 Identities = 59/207 (28%), Positives = 92/207 (44%), Gaps = 36/207 (17%)
 Frame = -1

Query: 514 SETPPPKDKALPTGILGVLSGAGRGKPT--------------KPSVPHPEKTRQTGGR-- 383
           +++  P +  LP+ I+  L GAGRGK                +P  P  E+ R    R  
Sbjct: 70  TDSTQPSEPNLPSSIISTLPGAGRGKTAVTQQQQQQQQHQRQQPGPPPQEENRHIRARLQ 129

Query: 382 ---EPSQSPNKDT-AVREQLSQEEKVRKAKEILSK-------------XXXXXXXXXXXX 254
               P ++P  +T + + +LS+E+ V+ A ++LS+                         
Sbjct: 130 PQPRPEKAPAAETGSAQPKLSKEDAVKMAMKVLSRGEEGEGEGISAGGPGRGRGMGRGRG 189

Query: 253 XXXXXXXXXXXXXYEERGDQSRGRFSG---HGAADREKLAKRLGPEIMSKVVEGLEEMAS 83
                         +E  D   GRF G      AD EKLA+++G E M+ +VEG EEM+ 
Sbjct: 190 RGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVEGFEEMSG 249

Query: 82  RAVPDPHKEALVDAFQTDIMLECLPEY 2
           R +P P ++A +DA  T+ M+E  PEY
Sbjct: 250 RVLPSPMEDAYIDALHTNCMIEFEPEY 276


>ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
           gi|550322664|gb|EEF06007.2| hydroxyproline-rich
           glycoprotein [Populus trichocarpa]
          Length = 466

 Score = 78.2 bits (191), Expect = 1e-12
 Identities = 66/204 (32%), Positives = 86/204 (42%), Gaps = 32/204 (15%)
 Frame = -1

Query: 517 ESETPPPKDKALPTGILGVLSGAGRGKPTKPSVP-HPEKTRQTGGREPSQ---------- 371
           ESE P   +  LP  IL  L GAGRGKP K  VP  P K      R  SQ          
Sbjct: 133 ESEPPKKAEANLPPSILSGLGGAGRGKPVKQEVPIEPAKEENRHLRARSQPRSQPRTRQQ 192

Query: 370 -SPNKDTAV--REQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEER- 203
            +P+ D AV    ++ ++E V+KA E+LS+                            R 
Sbjct: 193 KTPDGDDAVPATTKMGRQEAVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGARG 252

Query: 202 GDQSRGR-----------------FSGHGAADREKLAKRLGPEIMSKVVEGLEEMASRAV 74
           G + RGR                   GH   D EK A+ +G E M+ +VE  EEM+ R +
Sbjct: 253 GGRGRGRGRRGYGDKEVEYGSGMSLEGH-EEDEEKFAQSVGVETMNTLVEAFEEMSGRVL 311

Query: 73  PDPHKEALVDAFQTDIMLECLPEY 2
           P P ++  VDAF T+   E  PEY
Sbjct: 312 PCPIEDEYVDAFDTNCSFEFEPEY 335


>ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica]
           gi|462409156|gb|EMJ14490.1| hypothetical protein
           PRUPE_ppa006080mg [Prunus persica]
          Length = 428

 Score = 76.6 bits (187), Expect = 4e-12
 Identities = 57/165 (34%), Positives = 74/165 (44%), Gaps = 12/165 (7%)
 Frame = -1

Query: 460 LSGAGRGKP---TKPSVPHPEKTRQTGGREPSQSPNKDTAVREQLSQEEKVRKAKEILSK 290
           L G+GRGKP   T+P V   E+ R    R P   PN+        +   + R  +     
Sbjct: 150 LPGSGRGKPMNFTRPEVQVKEENRHIQAR-PEPDPNQPRTRPRGPNGRGRGRGMR----- 203

Query: 289 XXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQSRGRFSGHGAA---------DREKLAKR 137
                                      ERGD+ RG+ S    A         D EKLAK+
Sbjct: 204 -----------GRGRGRGRGRGDFRMSERGDRRRGKDSDGSYASGLYLGDNADGEKLAKK 252

Query: 136 LGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 2
           LGPEIM+K+VE  EEM+S  +P P  +A VDA  T+ M+EC PEY
Sbjct: 253 LGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAMHTNFMIECEPEY 297


>ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris]
           gi|561020640|gb|ESW19411.1| hypothetical protein
           PHAVU_006G122700g [Phaseolus vulgaris]
          Length = 532

 Score = 74.7 bits (182), Expect = 1e-11
 Identities = 66/215 (30%), Positives = 91/215 (42%), Gaps = 32/215 (14%)
 Frame = -1

Query: 550 FRFGEAQSGWTESETPPPKDKA--LPTGILGVLSGAGRGKPTKPSVPHP---EKTRQTGG 386
           F+  +  S  T  + P   ++A  LP  I+ VLSG GRGKP K S P     E+ R    
Sbjct: 189 FKREDIASPTTRDDFPIDVEQANKLPGNIIEVLSGLGRGKPMKQSDPETRVTEENRHLRA 248

Query: 385 REPSQSPNKDTAVREQL--SQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXY 212
                +   DT    Q   S+++ VR A+  LS+                          
Sbjct: 249 PRARGAAASDTLYERQPIPSRDDAVRNARNFLSQGEDDVGGTGRGRGFRERGGLGRGRGR 308

Query: 211 EE-----------RG---DQSRGRFSGHGA-----------ADREKLAKRLGPEIMSKVV 107
                        RG   D+ RGRF    A           AD EKLAK++GPEIM+++ 
Sbjct: 309 GRGRGRGTGRGGFRGRDMDERRGRFMDAEASDDIGPYVGDDADGEKLAKKVGPEIMNQLT 368

Query: 106 EGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 2
           EG EEMA R +P P ++  +DA   +  +E  PEY
Sbjct: 369 EGFEEMAGRVLPSPLEDEYLDALDINYAIEFEPEY 403


>ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum]
          Length = 504

 Score = 74.3 bits (181), Expect = 2e-11
 Identities = 57/192 (29%), Positives = 81/192 (42%), Gaps = 28/192 (14%)
 Frame = -1

Query: 493 DKALPTGILGVLSGAGRGKPTKPSVPHP---EKTRQTGGREPSQSPNKDTAVREQLSQEE 323
           D      +L VLSGAGRGKP +P+V      E+ R    R  S  P +    +  L+ + 
Sbjct: 187 DNRFSMSVLKVLSGAGRGKPIEPAVSETQVVEENRHVRNRRASDVPMR----QPMLTGDG 242

Query: 322 KVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQSRGR--FSGHGAADR-- 155
            ++ A++ LSK                               + RGR  F G G  DR  
Sbjct: 243 ALQNARKYLSKFDGDGSGSGRGGEPRERGAFGRGRGRGRGRGRGRGRGGFRGTGGDDRFG 302

Query: 154 ---------------------EKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAF 38
                                EKLAK++GPE+M++  EG EEM SR +P P ++  V+AF
Sbjct: 303 QIQDNARSNASGLFLGDDVDGEKLAKKVGPEVMNQFTEGFEEMISRVLPSPLEDEYVEAF 362

Query: 37  QTDIMLECLPEY 2
             +  +E  PEY
Sbjct: 363 DINCAIEFEPEY 374


>ref|XP_007014541.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
           [Theobroma cacao] gi|508784904|gb|EOY32160.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative isoform 2 [Theobroma cacao]
          Length = 403

 Score = 72.4 bits (176), Expect = 7e-11
 Identities = 60/183 (32%), Positives = 77/183 (42%), Gaps = 27/183 (14%)
 Frame = -1

Query: 469 LGVLSGAGRGKPTKPSVPHPEKTRQTGGRE----PSQSPNKDTAVREQLSQEEKVRKAKE 302
           + VLSGAGRGKP K   P P   RQ   R       QSP+       Q+SQEE  +KA  
Sbjct: 169 VSVLSGAGRGKPVKQ--PEPASRRQEENRHIRVAQQQSPSA------QMSQEEATKKAMG 220

Query: 301 ILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQSRGRF--------------SGHGA 164
           ILS+                               +  GR               SG G+
Sbjct: 221 ILSRRSESGESGMVGRGGRASMGMGGGRGRGRGRGRGMGRGRGRRQGEDTRIVKDSGEGS 280

Query: 163 ADR---------EKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECL 11
           AD          EK A+ +G + M+K+VEG EEM SR +P P  +A +DA  T+  +E  
Sbjct: 281 ADGLYLGDNADGEKFAQTIGADNMNKLVEGFEEMGSRVLPSPMDDAYLDALHTNCSIEFE 340

Query: 10  PEY 2
           PEY
Sbjct: 341 PEY 343

