BLASTX nr result

ID: Mentha26_contig00024134 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00024134
         (838 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU28645.1| hypothetical protein MIMGU_mgv1a007091mg [Mimulus...   243   5e-62
ref|XP_006361922.1| PREDICTED: uncharacterized protein At2g33490...   189   1e-45
ref|XP_004228436.1| PREDICTED: uncharacterized protein At2g33490...   185   2e-44
ref|XP_002512634.1| conserved hypothetical protein [Ricinus comm...   182   1e-43
ref|XP_003620964.1| hypothetical protein MTR_7g005390 [Medicago ...   170   5e-40
emb|CBI21307.3| unnamed protein product [Vitis vinifera]              163   7e-38
ref|XP_007043861.1| Hydroxyproline-rich glycoprotein family prot...   162   1e-37
emb|CBI38174.3| unnamed protein product [Vitis vinifera]              160   4e-37
ref|XP_004310118.1| PREDICTED: uncharacterized protein At2g33490...   159   2e-36
ref|XP_007139091.1| hypothetical protein PHAVU_008G000700g [Phas...   158   3e-36
ref|XP_007031960.1| Hydroxyproline-rich glycoprotein family prot...   157   4e-36
ref|XP_002263726.1| PREDICTED: uncharacterized protein At2g33490...   155   1e-35
ref|XP_006483028.1| PREDICTED: uncharacterized protein At2g33490...   155   2e-35
ref|XP_006438830.1| hypothetical protein CICLE_v10030944mg [Citr...   155   2e-35
ref|XP_004140551.1| PREDICTED: uncharacterized protein At2g33490...   155   2e-35
ref|XP_002512882.1| conserved hypothetical protein [Ricinus comm...   152   1e-34
ref|XP_006589804.1| PREDICTED: uncharacterized protein At2g33490...   151   3e-34
ref|XP_006589803.1| PREDICTED: uncharacterized protein At2g33490...   151   3e-34
gb|EPS73859.1| hydroxyproline-rich glycoprotein family protein [...   150   6e-34
ref|XP_006447002.1| hypothetical protein CICLE_v10014551mg [Citr...   149   2e-33

>gb|EYU28645.1| hypothetical protein MIMGU_mgv1a007091mg [Mimulus guttatus]
          Length = 420

 Score =  243 bits (621), Expect = 5e-62
 Identities = 136/245 (55%), Positives = 162/245 (66%), Gaps = 3/245 (1%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRAV 252
           +SFDYGQNG  +EV   +NSMELDN D   SP+ KL    ENLQ P +N  S+Q+G R +
Sbjct: 93  LSFDYGQNGPVEEVSTIKNSMELDNTDATFSPELKL----ENLQSPARNSFSFQKGARVI 148

Query: 253 SKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAPQTRLAD- 429
           SKSAPLFP+++ +S ERM  +GPS SRKFTSYVLPTPDETK P SGKLF+E PQTR    
Sbjct: 149 SKSAPLFPEKRTDSTERMTPIGPSPSRKFTSYVLPTPDETKIPSSGKLFNEVPQTRQQPV 208

Query: 430 FNLRHSSPIDQNRYER-RRENDKLSGPIILDTQSVLKESNT-STKASVLPPPLSEGLSFT 603
            NLRHSSP+DQN+YE+  + +DKLSGPIILDTQS+LKESN  +TKA+ LPPP SEGL   
Sbjct: 209 LNLRHSSPLDQNKYEKFHQTSDKLSGPIILDTQSLLKESNNPTTKATPLPPPFSEGL--- 265

Query: 604 QHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTPMPRP 783
                                              +GPI +S F  P+SGSLLRTP+PRP
Sbjct: 266 -----------------------------------TGPIATSPFPPPFSGSLLRTPLPRP 290

Query: 784 TSNPK 798
           TS PK
Sbjct: 291 TSTPK 295


>ref|XP_006361922.1| PREDICTED: uncharacterized protein At2g33490-like [Solanum
           tuberosum]
          Length = 628

 Score =  189 bits (479), Expect = 1e-45
 Identities = 118/248 (47%), Positives = 142/248 (57%), Gaps = 6/248 (2%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQ-NFLSYQRGIRA 249
           +SFDYGQN    +V  S +SMELD  DV +      G +KENL   H  N  S+ R + +
Sbjct: 273 LSFDYGQN---DQVYTSTHSMELDKVDVAVPEVASKGASKENLSKSHGGNSFSFFRDVNS 329

Query: 250 VSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAPQTRLAD 429
            +KSAPL   RK + AE   RM  SLS KF  YVLPTP E K P S K  +  PQT+   
Sbjct: 330 -TKSAPLLSGRKPDPAEGAARMTSSLSNKFQPYVLPTPVEAKIPASVKSHNIDPQTKRTS 388

Query: 430 FNLR-----HSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEGL 594
                    HSSP+DQ +YE+    DK SGPI ++TQSV  ES  +     LPPPLSEGL
Sbjct: 389 QTTSVVQKWHSSPLDQFKYEKLVAGDKFSGPITMNTQSVPTESKNNASTGWLPPPLSEGL 448

Query: 595 SFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTPM 774
           S  Q D    S AKK KRQAFSGPLTAKP    P +++  PI SS + L +S   L T  
Sbjct: 449 SSAQRD---LSNAKKVKRQAFSGPLTAKPWPKKPIVSSGSPIASSGYPLHFSLPFLHTST 505

Query: 775 PRPTSNPK 798
           P P+S PK
Sbjct: 506 PEPSSTPK 513


>ref|XP_004228436.1| PREDICTED: uncharacterized protein At2g33490-like [Solanum
           lycopersicum]
          Length = 632

 Score =  185 bits (470), Expect = 2e-44
 Identities = 117/248 (47%), Positives = 141/248 (56%), Gaps = 6/248 (2%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQ-NFLSYQRGIRA 249
           +SFDYGQN    +V  S +SMELD  DV +S     G +KENL   H  N  S+ R +  
Sbjct: 273 LSFDYGQN---DQVYTSTHSMELDKVDVAVSEVASKGASKENLSKIHGGNSFSFFRDVN- 328

Query: 250 VSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAPQTRLAD 429
           ++KSAPL   RK + AE   RM  SLS KF  YVLPTP E K P S K  +   QT+   
Sbjct: 329 ITKSAPLLSGRKPDPAEGAARMTSSLSNKFQPYVLPTPVEVKIPESVKSHNIDLQTKRTS 388

Query: 430 -----FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEGL 594
                    HSSP+DQ +YE+    DK  GPI ++TQSV  ES  +     LPPPLSEGL
Sbjct: 389 QTSGVVQKWHSSPLDQFKYEKLMAGDKFPGPITVNTQSVPTESKNNASTGWLPPPLSEGL 448

Query: 595 SFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTPM 774
           S  Q D    S AKK KRQAFSGPLTAKP    P +++  PI SS + L +S   L T  
Sbjct: 449 SSAQRD---LSNAKKVKRQAFSGPLTAKPWPKKPIVSSGSPIASSGYPLHFSLPFLHTST 505

Query: 775 PRPTSNPK 798
           P P+S PK
Sbjct: 506 PEPSSTPK 513


>ref|XP_002512634.1| conserved hypothetical protein [Ricinus communis]
           gi|223548595|gb|EEF50086.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 681

 Score =  182 bits (462), Expect = 1e-43
 Identities = 109/249 (43%), Positives = 155/249 (62%), Gaps = 7/249 (2%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRAV 252
           +SFDYGQN   Q+  AS +SMELD  DV       LG +KEN  G ++   S++  +RA 
Sbjct: 271 LSFDYGQNDHEQDASASRSSMELDLVDVTFPRVATLGVSKENRDGNYRKSFSFKGDVRAA 330

Query: 253 SKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETK---TPRSGKLFSEAPQTRL 423
           S+SAPLF +   +SAERM +M PSLSRK  +YVLPTP +TK   +  SG L S+  +T L
Sbjct: 331 SQSAPLFAEINSDSAERMKQMRPSLSRKLNTYVLPTPVDTKNSISTGSGSLVSQTLKTNL 390

Query: 424 A--DFNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEGLS 597
           +    NL HSSPID  +Y +   ++K SG ++ D +SVL+ESN +  ++ LPPPL++GL 
Sbjct: 391 SGRTQNLWHSSPIDPKKYGKLVGDEKPSGSMVKDAESVLRESNKNCASTQLPPPLADGLF 450

Query: 598 FTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTPMP 777
            ++ DP + S +KK KR AFSGP+T+       K+  + PI + A  + +SG LL  P  
Sbjct: 451 ISRFDPPVTSDSKKIKRYAFSGPITS-------KVWPNKPISAEAIGM-FSGPLLENPTV 502

Query: 778 R--PTSNPK 798
           +   +S+PK
Sbjct: 503 QLSSSSSPK 511


>ref|XP_003620964.1| hypothetical protein MTR_7g005390 [Medicago truncatula]
           gi|355495979|gb|AES77182.1| hypothetical protein
           MTR_7g005390 [Medicago truncatula]
          Length = 732

 Score =  170 bits (431), Expect = 5e-40
 Identities = 111/250 (44%), Positives = 152/250 (60%), Gaps = 8/250 (3%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRAV 252
           +SFDYG N   ++V  S NSMELD  +  L      G AKENL    +N  S++  +RA 
Sbjct: 281 LSFDYGPNEQERDVSTSRNSMELDQVEHTLPRGSPAGGAKENLDKLQRNLFSFK--VRAG 338

Query: 253 SKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAP-----QT 417
           S+SAPLF   K +S+E++ +M PSLSRKF+SYVLPTP + K+P S   F + P     QT
Sbjct: 339 SQSAPLFADNKPDSSEKLRQMRPSLSRKFSSYVLPTPVDAKSPIS--FFPDKPKPSTMQT 396

Query: 418 RLAD--FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEG 591
            L +   NL HSSP+DQ ++E +   D+ S P I +TQS L+ESN +   + LP PL +G
Sbjct: 397 NLNEPTKNLWHSSPLDQKKHE-KDIRDEHSDPTIRNTQSALRESNNNASFTRLPLPLVDG 455

Query: 592 LSFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTP 771
            +   HD N+++Y+KK KR AFSGPLT+ P  T        P+      L +SG LL T 
Sbjct: 456 PASLNHD-NVSAYSKKIKRHAFSGPLTSNPWPTR-------PVSMENIQL-FSGPLLPTR 506

Query: 772 MPR-PTSNPK 798
           +P+ P+S+PK
Sbjct: 507 IPQPPSSSPK 516


>emb|CBI21307.3| unnamed protein product [Vitis vinifera]
          Length = 643

 Score =  163 bits (413), Expect = 7e-38
 Identities = 103/248 (41%), Positives = 141/248 (56%), Gaps = 6/248 (2%)
 Frame = +1

Query: 73   MSFDYGQNGLRQEV-PASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRA 249
            +SFDY QN    EV  A+ NSMELD +D+       + T + N +  H +   + R  RA
Sbjct: 266  LSFDYRQNKRGIEVVSATRNSMELDQSDLSFPQASTVETVELNPEKNHGDLQGFSREPRA 325

Query: 250  VSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAPQTRLAD 429
             S SAP+  + K + +ER+     S +RK  +YVLP P   K+    +  +  P+TR   
Sbjct: 326  GSYSAPIIAE-KSDPSERIRTQ--SSTRKLHTYVLPIPVGAKSSTPSRTSNSVPRTRPTS 382

Query: 430  F-----NLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEGL 594
                  NL HSSP++  ++E+   +D +SG  I + QSVLKESN++  A  LPPPL+EGL
Sbjct: 383  LHGGTRNLWHSSPLEPKKHEKDSGDDHMSGSTISEAQSVLKESNSNNAAIRLPPPLAEGL 442

Query: 595  SFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTPM 774
            S  Q D    S  KK KR AFSGPLT KP  T P L++SGPI  +      SG L R P+
Sbjct: 443  SLPQLDTLNTSDTKKVKRLAFSGPLTGKPWSTKPVLSSSGPIAPAELPQLVSGLLSRVPI 502

Query: 775  PRPTSNPK 798
            P+P+S+PK
Sbjct: 503  PQPSSSPK 510


>ref|XP_007043861.1| Hydroxyproline-rich glycoprotein family protein, putative [Theobroma
            cacao] gi|508707796|gb|EOX99692.1| Hydroxyproline-rich
            glycoprotein family protein, putative [Theobroma cacao]
          Length = 654

 Score =  162 bits (411), Expect = 1e-37
 Identities = 109/258 (42%), Positives = 146/258 (56%), Gaps = 16/258 (6%)
 Frame = +1

Query: 73   MSFDYGQNGLRQE-VPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRA 249
            +SFDYGQN   Q  VP S +SMELD   +       +  AKENL+   ++  S++  +R 
Sbjct: 285  LSFDYGQNEQDQNMVPTSRHSMELDQGGLTFPQVAMVEAAKENLERTRRHSFSFRGEMRN 344

Query: 250  VSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETK--------TPRSGKLFSE 405
             S+SAPLF + K   ++    M P L+RKF SYVLPTP  TK         P+S K  S 
Sbjct: 345  SSQSAPLFAENK---SDPYGTMQPLLARKFNSYVLPTPVATKGCIGLGNPAPQSFKTSSN 401

Query: 406  APQTRLADFNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLS 585
                     NL HSSP++  +YER   ++K SG  +++ QSVLKESN +  ++ LPPPL+
Sbjct: 402  EHSN-----NLWHSSPLEHKKYERILGDEKYSGSAVMNAQSVLKESNNNASSTRLPPPLA 456

Query: 586  EGLSFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTA-----SGPIGSSAFSLPYS 750
            + + F++  P  AS +KK KRQAFSGPLT+KP  T P         SGPI  + FS P S
Sbjct: 457  DRVLFSRVSPIAASDSKKIKRQAFSGPLTSKPWPTKPVSVEHPGLFSGPILRNPFSQPPS 516

Query: 751  GSLLRTPMPRPT--SNPK 798
             S   +P   PT  S+PK
Sbjct: 517  TSPKVSPNTSPTFVSSPK 534


>emb|CBI38174.3| unnamed protein product [Vitis vinifera]
          Length = 651

 Score =  160 bits (406), Expect = 4e-37
 Identities = 103/248 (41%), Positives = 144/248 (58%), Gaps = 8/248 (3%)
 Frame = +1

Query: 73  MSFDYGQNGLRQE-VPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRA 249
           +SFDYGQ    Q+ V  + NSMELD  D+       + + KENL   + +  +     R 
Sbjct: 265 LSFDYGQIDHEQDFVSTARNSMELDQEDLTFPQVATMDSVKENLDKSYVDSFAINSESRK 324

Query: 250 VSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAPQTRLAD 429
           +S+SAPLF ++K +SA  M  M P  +RK  +YVLPTPD+TK+  S    S++P +R   
Sbjct: 325 ISQSAPLFAEKKFDSAW-MREMRPLSTRKLRTYVLPTPDDTKS--SAPTRSDSPDSRPIP 381

Query: 430 -------FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSE 588
                   NL HSSP++  +YE+   +DK+     ++ QSVLKESNT+  +S LPPPL +
Sbjct: 382 TSLSGRPHNLWHSSPLEP-KYEKILGDDKVFESTAMNAQSVLKESNTNHVSSGLPPPLVD 440

Query: 589 GLSFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRT 768
           GL F   + ++ S  KK KR AFSGPLT  P  T P L+AS P+ S      +SG LLR+
Sbjct: 441 GLPFRMINSSVVSDTKKVKRYAFSGPLTRNPWSTKPGLSASDPLVSVTQPQLFSGPLLRS 500

Query: 769 PMPRPTSN 792
            MP  +S+
Sbjct: 501 QMPHLSSS 508


>ref|XP_004310118.1| PREDICTED: uncharacterized protein At2g33490-like [Fragaria vesca
           subsp. vesca]
          Length = 628

 Score =  159 bits (401), Expect = 2e-36
 Identities = 102/243 (41%), Positives = 140/243 (57%), Gaps = 4/243 (1%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRAV 252
           +SFDYG N    +V  + +SMELD  D+     PK+ + +       +N  S++   RA 
Sbjct: 272 LSFDYGNNEREPDVSTTRSSMELDQVDITF---PKISSVEALKDRLRRNSFSFKG--RAF 326

Query: 253 SKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRS---GKLFSEAPQTRL 423
           S+SAPLFP+ KL+  E+M  + PSLSRKF SYVLPTP +T    S   G    E  QTRL
Sbjct: 327 SQSAPLFPENKLDQTEKMRHLQPSLSRKFHSYVLPTPVDTNHSVSTGPGNTVPETVQTRL 386

Query: 424 ADF-NLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEGLSF 600
           +   NL HSSP++ N  E+   + K +GP +++ QSVL+ESN +  +  LPPPL+  +  
Sbjct: 387 SGRQNLWHSSPLEPNN-EKIMGDKKPAGPTVINAQSVLRESNNNIASHRLPPPLAGRMLS 445

Query: 601 TQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTPMPR 780
           T  DP  A  +KK KR AFSGPLT+K   + P    S  +        +SG LLR P+P+
Sbjct: 446 TGLDPLAAFDSKKLKRLAFSGPLTSKHSSSKPVSVGSHQM--------FSGPLLRNPIPQ 497

Query: 781 PTS 789
           P S
Sbjct: 498 PPS 500


>ref|XP_007139091.1| hypothetical protein PHAVU_008G000700g [Phaseolus vulgaris]
            gi|593331332|ref|XP_007139092.1| hypothetical protein
            PHAVU_008G000700g [Phaseolus vulgaris]
            gi|593331334|ref|XP_007139093.1| hypothetical protein
            PHAVU_008G000700g [Phaseolus vulgaris]
            gi|561012224|gb|ESW11085.1| hypothetical protein
            PHAVU_008G000700g [Phaseolus vulgaris]
            gi|561012225|gb|ESW11086.1| hypothetical protein
            PHAVU_008G000700g [Phaseolus vulgaris]
            gi|561012226|gb|ESW11087.1| hypothetical protein
            PHAVU_008G000700g [Phaseolus vulgaris]
          Length = 636

 Score =  158 bits (399), Expect = 3e-36
 Identities = 106/254 (41%), Positives = 146/254 (57%), Gaps = 12/254 (4%)
 Frame = +1

Query: 73   MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRAV 252
            +SFDYGQ    Q+V  S NSMELD  ++          AKENL    +N  S++   R  
Sbjct: 277  LSFDYGQTEQDQDVSTSRNSMELDQVELTPPGGFTAEAAKENLDKLQRNLFSFRT--RTG 334

Query: 253  SKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAP---QTRL 423
            S+SAPLF   KL+++E++ +M PSLSRKF+SYVLPTP   K+  S    +  P   Q  L
Sbjct: 335  SQSAPLFADNKLDASEKLRQMRPSLSRKFSSYVLPTPVGAKSSISSSSNNPKPSKVQENL 394

Query: 424  AD--FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEGLS 597
            ++   NL HSSP++Q ++E +   D+ SG  +   QSVLKESNT+T ++ LP PL + L 
Sbjct: 395  SEPTKNLWHSSPLEQKKHE-KDIGDEFSGSTVRSAQSVLKESNTNTASTRLPLPLGDNL- 452

Query: 598  FTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLT-----ASGPIGSSAFSLPYSGSLL 762
                +  +++++KK KRQAFSGPLT+ P  T P L       SGP+       P S S  
Sbjct: 453  -LSSNDYISAHSKKIKRQAFSGPLTSNPGPTKPVLVDSVQLLSGPLFPGPIPKPRSSSPK 511

Query: 763  RTPMPRPT--SNPK 798
             +P   PT  S+PK
Sbjct: 512  VSPTTSPTLMSSPK 525


>ref|XP_007031960.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
           gi|508710989|gb|EOY02886.1| Hydroxyproline-rich
           glycoprotein family protein [Theobroma cacao]
          Length = 649

 Score =  157 bits (398), Expect = 4e-36
 Identities = 104/248 (41%), Positives = 144/248 (58%), Gaps = 6/248 (2%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVP-ASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRA 249
           +SFDY  N    +V  AS NSME+D          K+  A+ NL+  H + L   R  R 
Sbjct: 264 LSFDYRANEKGLDVTSASRNSMEVDEIGRSYPQTSKMENAEVNLEKSHGDILVSSREHRV 323

Query: 250 VSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAPQTRLAD 429
            S SAP+FP+RKL+ AER+ +M  S +RK  +YVLPTP+++K+  S +  S  P TR  +
Sbjct: 324 GSYSAPIFPERKLDPAERVKQMLQSSTRKSNTYVLPTPNDSKSALSSRTISPIPPTRPTN 383

Query: 430 -----FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEGL 594
                 NL HSSP++Q ++E +   D  S   I  ++SV KE N+S  ++ LPPPLSEG 
Sbjct: 384 VAGRPHNLWHSSPLEQKKHE-KDSGDGQSEFTIWKSESVFKECNSSNTSTQLPPPLSEGP 442

Query: 595 SFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTPM 774
             TQ D   +S  KK KR+A SGPLT+K + T P L+A+GPI S+      S +    P+
Sbjct: 443 VPTQLD--TSSEVKKIKRKAVSGPLTSKQLPTKP-LSATGPIPSAELPHLASAAFSHLPI 499

Query: 775 PRPTSNPK 798
           P+P S PK
Sbjct: 500 PQPLSPPK 507


>ref|XP_002263726.1| PREDICTED: uncharacterized protein At2g33490-like [Vitis vinifera]
          Length = 653

 Score =  155 bits (393), Expect = 1e-35
 Identities = 103/250 (41%), Positives = 144/250 (57%), Gaps = 10/250 (4%)
 Frame = +1

Query: 73   MSFDYGQNGLRQE-VPASENSMELDNADVMLSPDPKLGTAK--ENLQGPHQNFLSYQRGI 243
            +SFDYGQ    Q+ V  + NSMELD  D+       + + K  ENL   + +  +     
Sbjct: 265  LSFDYGQIDHEQDFVSTARNSMELDQEDLTFPQVATMDSVKLQENLDKSYVDSFAINSES 324

Query: 244  RAVSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAPQTRL 423
            R +S+SAPLF ++K +SA  M  M P  +RK  +YVLPTPD+TK+  S    S++P +R 
Sbjct: 325  RKISQSAPLFAEKKFDSAW-MREMRPLSTRKLRTYVLPTPDDTKS--SAPTRSDSPDSRP 381

Query: 424  AD-------FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPL 582
                      NL HSSP++  +YE+   +DK+     ++ QSVLKESNT+  +S LPPPL
Sbjct: 382  IPTSLSGRPHNLWHSSPLEP-KYEKILGDDKVFESTAMNAQSVLKESNTNHVSSGLPPPL 440

Query: 583  SEGLSFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLL 762
             +GL F   + ++ S  KK KR AFSGPLT  P  T P L+AS P+ S      +SG LL
Sbjct: 441  VDGLPFRMINSSVVSDTKKVKRYAFSGPLTRNPWSTKPGLSASDPLVSVTQPQLFSGPLL 500

Query: 763  RTPMPRPTSN 792
            R+ MP  +S+
Sbjct: 501  RSQMPHLSSS 510


>ref|XP_006483028.1| PREDICTED: uncharacterized protein At2g33490-like isoform X1
           [Citrus sinensis] gi|568858996|ref|XP_006483029.1|
           PREDICTED: uncharacterized protein At2g33490-like
           isoform X2 [Citrus sinensis]
          Length = 633

 Score =  155 bits (392), Expect = 2e-35
 Identities = 103/254 (40%), Positives = 141/254 (55%), Gaps = 12/254 (4%)
 Frame = +1

Query: 73  MSFDYGQNGLRQE-VPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRA 249
           +SFDY QN   Q+ V +S  SMELD  D+  S   +L T+KE L   ++  LS+ R +R 
Sbjct: 268 LSFDYRQNEQEQDAVSSSRKSMELDQPDITFSQVARLETSKETLDRNYRKSLSFSREVRF 327

Query: 250 VSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAPQTRLAD 429
            S+SAPLF   K +  +R  +M  S +RKF +YVLPTP +TK+  S    S  P      
Sbjct: 328 SSQSAPLFMDNKSDLDDRRKQMRQSSTRKFNTYVLPTPGDTKSSHSPGPGSPVPSALRPS 387

Query: 430 -----FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVL-PPPLSEG 591
                +NLRH SP+D  ++++    DK S       QS+L+ESN +T +  L PPP ++G
Sbjct: 388 LGGQTYNLRHESPLDMLKFDKSLGGDKTSKD---TAQSILRESNNNTASIQLPPPPPADG 444

Query: 592 LSFTQHDPNLASYAKKAKRQAFSGPLT----AKPMLTNPKLTASGPIGSSAFSLPYSGSL 759
              ++ DP  AS  KK KRQ+FSGPLT    AKP+L   +   SGP             L
Sbjct: 445 FLLSRLDPRGASVPKKVKRQSFSGPLTGPRPAKPVLKEHQQLLSGP-------------L 491

Query: 760 LRTPMPR-PTSNPK 798
           LR PMP+ P+S+PK
Sbjct: 492 LRNPMPQPPSSSPK 505


>ref|XP_006438830.1| hypothetical protein CICLE_v10030944mg [Citrus clementina]
           gi|557541026|gb|ESR52070.1| hypothetical protein
           CICLE_v10030944mg [Citrus clementina]
          Length = 633

 Score =  155 bits (392), Expect = 2e-35
 Identities = 103/254 (40%), Positives = 141/254 (55%), Gaps = 12/254 (4%)
 Frame = +1

Query: 73  MSFDYGQNGLRQE-VPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRA 249
           +SFDY QN   Q+ V +S  SMELD  D+  S   +L T+KE L   ++  LS+ R +R 
Sbjct: 268 LSFDYRQNEQEQDAVSSSRKSMELDQPDITFSQVTRLETSKETLDRNYRKSLSFSREVRF 327

Query: 250 VSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAPQTRLAD 429
            S+SAPLF   K +  +R  +M  S +RKF +YVLPTP +TK+  S    S  P      
Sbjct: 328 SSQSAPLFMDNKSDLDDRRKQMRQSSTRKFNTYVLPTPGDTKSSHSPGPGSPVPSALRPS 387

Query: 430 -----FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVL-PPPLSEG 591
                +NLRH SP+D  ++++    DK S       QS+L+ESN +T +  L PPP ++G
Sbjct: 388 LGGQTYNLRHESPLDMLKFDKSLGGDKTSKD---TAQSILRESNNNTASIQLPPPPPADG 444

Query: 592 LSFTQHDPNLASYAKKAKRQAFSGPLT----AKPMLTNPKLTASGPIGSSAFSLPYSGSL 759
              ++ DP  AS  KK KRQ+FSGPLT    AKP+L   +   SGP             L
Sbjct: 445 FLLSRLDPRGASVPKKVKRQSFSGPLTGPRPAKPVLKEHQQLLSGP-------------L 491

Query: 760 LRTPMPR-PTSNPK 798
           LR PMP+ P+S+PK
Sbjct: 492 LRNPMPQPPSSSPK 505


>ref|XP_004140551.1| PREDICTED: uncharacterized protein At2g33490-like [Cucumis sativus]
           gi|449528537|ref|XP_004171260.1| PREDICTED:
           uncharacterized protein At2g33490-like [Cucumis sativus]
          Length = 642

 Score =  155 bits (392), Expect = 2e-35
 Identities = 105/246 (42%), Positives = 139/246 (56%), Gaps = 4/246 (1%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRAV 252
           +SFDY QN   Q +   +NS ELD  D+       L   KENL    +N  S+  G R V
Sbjct: 272 LSFDYAQNDHDQAISTLQNS-ELDQPDLAFHHVEAL---KENLDRNRRNSFSF--GGRTV 325

Query: 253 SKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRS---GKLFSEAPQTRL 423
           S+SAPLFP +K ++AER+ +M PS +RKF +YVLPTP +TK   S   G       QT  
Sbjct: 326 SQSAPLFPDKKFDAAERVRQMRPSSTRKFHTYVLPTPADTKGSNSRVPGNPLPNTIQTIR 385

Query: 424 ADFNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEGLSFT 603
               +RHSSP++   Y++   ++  SG      QSVLKESNT+  ++ LPPPLS+GL   
Sbjct: 386 QQNLMRHSSPLEPRNYDKLVGDENASGHGATKAQSVLKESNTNASSTQLPPPLSDGL--- 442

Query: 604 QHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTPMPRP 783
                 AS AKK KR AFSGPL  KP    P      P+ ++     +SG LLR P+P+P
Sbjct: 443 PRHSLAASDAKKIKRLAFSGPLIGKPSTNKP-----APVENAQL---FSGPLLRNPIPQP 494

Query: 784 -TSNPK 798
            +S+PK
Sbjct: 495 LSSSPK 500


>ref|XP_002512882.1| conserved hypothetical protein [Ricinus communis]
           gi|223547893|gb|EEF49385.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 656

 Score =  152 bits (385), Expect = 1e-34
 Identities = 101/256 (39%), Positives = 144/256 (56%), Gaps = 17/256 (6%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEV-PASENSMELDNADVM-----------LSPDPKLGTAKENLQGPHQ 216
           +SFDY +N    +V  AS NSME+D+ D+            L+PD   G  + +L+ P  
Sbjct: 267 LSFDYRENKQGHDVISASRNSMEVDDEDLSFPQASFTENAELNPDKSQGGLQASLREP-- 324

Query: 217 NFLSYQRGIRAVSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKL 396
                    R  S SAP+FP+RK +  ER+ R+  S +RK  +YVLPTP + K+P S + 
Sbjct: 325 ---------RPGSHSAPIFPERKSDPIERI-RLMQSSARKSNTYVLPTPIDAKSPISSRT 374

Query: 397 FSEAPQTRLADF-----NLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKA 561
                 TR +DF     NL HSSP++Q ++E+   +  LS    L T+S  K+S+ ++ +
Sbjct: 375 SGSVANTRPSDFSGRTHNLWHSSPLEQKKHEKDPGDYHLSELTALKTRSAHKDSSINSTS 434

Query: 562 SVLPPPLSEGLSFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSL 741
           ++LPPPL EG+S    D   AS  KK KRQ+FSGP+T+KP  T P L+ASGPI S+    
Sbjct: 435 TLLPPPLVEGISLPHLDMYNASDNKKIKRQSFSGPITSKPWSTKPALSASGPIFSNELPQ 494

Query: 742 PYSGSLLRTPMPRPTS 789
             SG   R  +P+ TS
Sbjct: 495 QVSGVPSRVTIPQNTS 510


>ref|XP_006589804.1| PREDICTED: uncharacterized protein At2g33490-like isoform X2
           [Glycine max]
          Length = 621

 Score =  151 bits (381), Expect = 3e-34
 Identities = 101/248 (40%), Positives = 143/248 (57%), Gaps = 6/248 (2%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRAV 252
           +SFDY QN   Q+V   ENSM                  KENL    +N  S++  +R+ 
Sbjct: 268 LSFDYAQNECEQDVSTFENSM------------------KENLDRLRRNSFSFK--VRSA 307

Query: 253 SKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAP---QTRL 423
           S+SAPLF   K +S+E++ +M  +LSRKF SYVLPTP + K+  S +  ++ P   +T L
Sbjct: 308 SQSAPLFVDNKRDSSEKLRQMRQTLSRKFNSYVLPTPVDGKSSISLRSSNQVPSKIKTNL 367

Query: 424 AD--FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEGLS 597
            +   NL HSSP+++ +YE    +   SGP +   QSVLKESN++T  S LPPPL +G  
Sbjct: 368 NEPVKNLWHSSPLEKKKYENIFGDGGFSGPDVRTAQSVLKESNSNTAYSRLPPPLIDGNL 427

Query: 598 FTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTPMP 777
            + HD  + +Y+KK KR AFSGPL +    T P      P+   +  L +SG LLRT +P
Sbjct: 428 SSNHD-YITAYSKKIKRHAFSGPLVSNAWPTKP------PVSVKSVQL-FSGPLLRTSIP 479

Query: 778 R-PTSNPK 798
           + P+S+PK
Sbjct: 480 QPPSSSPK 487


>ref|XP_006589803.1| PREDICTED: uncharacterized protein At2g33490-like isoform X1
           [Glycine max]
          Length = 624

 Score =  151 bits (381), Expect = 3e-34
 Identities = 101/248 (40%), Positives = 143/248 (57%), Gaps = 6/248 (2%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIRAV 252
           +SFDY QN   Q+V   ENSM                  KENL    +N  S++  +R+ 
Sbjct: 271 LSFDYAQNECEQDVSTFENSM------------------KENLDRLRRNSFSFK--VRSA 310

Query: 253 SKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAP---QTRL 423
           S+SAPLF   K +S+E++ +M  +LSRKF SYVLPTP + K+  S +  ++ P   +T L
Sbjct: 311 SQSAPLFVDNKRDSSEKLRQMRQTLSRKFNSYVLPTPVDGKSSISLRSSNQVPSKIKTNL 370

Query: 424 AD--FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEGLS 597
            +   NL HSSP+++ +YE    +   SGP +   QSVLKESN++T  S LPPPL +G  
Sbjct: 371 NEPVKNLWHSSPLEKKKYENIFGDGGFSGPDVRTAQSVLKESNSNTAYSRLPPPLIDGNL 430

Query: 598 FTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTPMP 777
            + HD  + +Y+KK KR AFSGPL +    T P      P+   +  L +SG LLRT +P
Sbjct: 431 SSNHD-YITAYSKKIKRHAFSGPLVSNAWPTKP------PVSVKSVQL-FSGPLLRTSIP 482

Query: 778 R-PTSNPK 798
           + P+S+PK
Sbjct: 483 QPPSSSPK 490


>gb|EPS73859.1| hydroxyproline-rich glycoprotein family protein [Genlisea aurea]
          Length = 571

 Score =  150 bits (379), Expect = 6e-34
 Identities = 107/247 (43%), Positives = 138/247 (55%), Gaps = 7/247 (2%)
 Frame = +1

Query: 73  MSFDYGQNGLRQEVPASENSMELDNADVMLSPDPKLGT--AKENLQGPHQNFLSYQRGIR 246
           +SFDYG+N    ++  S++SME D++D      P      + E L  P +N  S++R  R
Sbjct: 276 LSFDYGRNQPSTDMSESKHSMEADHSDAAAYSPPYRHNRGSTEILVSPRRNSFSFRREAR 335

Query: 247 AVSKSAPLFPQRKL-ESAERMPRMG-PSLSRKFTSYVLPTPDETKTPR-SGKLFSEAPQT 417
           AVSKSAP+ P+R   +S ER+ ++G PS SR+ TSYVLPTPDET++P  SGKL+  AP  
Sbjct: 336 AVSKSAPILPERTTPDSDERVLQLGGPSPSRRSTSYVLPTPDETRSPAASGKLYKAAPNP 395

Query: 418 RLADFNLRHSSPIDQNRYERRRENDKLS-GPIILDTQSVLKESNTSTKASVLPPPLSEGL 594
                                 +N+K S G IILD + VL ES  S K S+         
Sbjct: 396 ----------------------QNEKWSAGSIILDPRFVLNESAASKKVSL--------- 424

Query: 595 SFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSG-SLLRTP 771
                DP   SY+KKAKRQAFSGPL   P  TN  L+ASGPI       P+ G SLLRTP
Sbjct: 425 -----DPIADSYSKKAKRQAFSGPLIGNPWPTNQNLSASGPI-----LRPFPGPSLLRTP 474

Query: 772 MPRPTSN 792
           +PRP S+
Sbjct: 475 LPRPISS 481


>ref|XP_006447002.1| hypothetical protein CICLE_v10014551mg [Citrus clementina]
           gi|568829044|ref|XP_006468842.1| PREDICTED:
           uncharacterized protein At2g33490-like [Citrus sinensis]
           gi|557549613|gb|ESR60242.1| hypothetical protein
           CICLE_v10014551mg [Citrus clementina]
          Length = 650

 Score =  149 bits (375), Expect = 2e-33
 Identities = 98/249 (39%), Positives = 144/249 (57%), Gaps = 7/249 (2%)
 Frame = +1

Query: 73  MSFDYGQN--GLRQEVPASENSMELDNADVMLSPDPKLGTAKENLQGPHQNFLSYQRGIR 246
           +SFDY  N  GL   V  S  SME+D+ DV       +  A+ NL      + +  R  R
Sbjct: 266 LSFDYRDNKQGL-DVVSTSRKSMEVDDVDVSFPQASTVENAEVNLDKNPGEYQASHRERR 324

Query: 247 AVSKSAPLFPQRKLESAERMPRMGPSLSRKFTSYVLPTPDETKTPRSGKLFSEAPQTRLA 426
             S SAP+FP+RK++ AER+ ++  S +R+ ++YVLPTP + K P S    S AP+TR +
Sbjct: 325 GSSFSAPIFPERKIDPAERIRQVQQSSARQPSTYVLPTPIDAKVPISS---SVAPRTRPS 381

Query: 427 D-----FNLRHSSPIDQNRYERRRENDKLSGPIILDTQSVLKESNTSTKASVLPPPLSEG 591
           +     +NL HSSP++Q + +R   +  LS   +L +QS+LKES+ S  AS  PPPL +G
Sbjct: 382 NPSGRTYNLSHSSPLEQKKEDRDYGDAHLSEHSVLKSQSLLKESD-SNNASTRPPPLRDG 440

Query: 592 LSFTQHDPNLASYAKKAKRQAFSGPLTAKPMLTNPKLTASGPIGSSAFSLPYSGSLLRTP 771
           L+  Q D   +S  KK K QA SGPL++K   + P L++SGPI  +      SG L   P
Sbjct: 441 LALPQLDTLNSSDTKKIKTQASSGPLSSKSSSSKPALSSSGPITYTELPQIVSGLLSHAP 500

Query: 772 MPRPTSNPK 798
           +P+  ++P+
Sbjct: 501 VPQTKTSPR 509

