BLASTX nr result

ID: Mentha27_contig00015644 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00015644
         (1493 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...   199   3e-48
ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-...   198   5e-48
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   192   4e-46
ref|XP_002865912.1| hydroxyproline-rich glycoprotein family prot...   190   1e-45
ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein...   189   3e-45
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   189   3e-45
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   188   5e-45
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   188   6e-45
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   188   6e-45
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   188   6e-45
ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Caps...   186   2e-44
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   186   2e-44
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   184   1e-43
ref|XP_002867602.1| hydroxyproline-rich glycoprotein family prot...   181   6e-43
ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutr...   180   2e-42
ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Caps...   179   4e-42
ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein...   177   1e-41
emb|CAA18164.1| putative protein [Arabidopsis thaliana] gi|72694...   172   5e-40
ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family prot...   163   2e-37
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   162   5e-37

>ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina]
            gi|557523850|gb|ESR35217.1| hypothetical protein
            CICLE_v10004813mg [Citrus clementina]
          Length = 500

 Score =  199 bits (505), Expect = 3e-48
 Identities = 150/501 (29%), Positives = 193/501 (38%), Gaps = 150/501 (29%)
 Frame = +1

Query: 97   MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276
            M SVHDS                  +P+ +QKRRWG  WS+YWCFGS+K SKRI H +++
Sbjct: 1    MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLV 60

Query: 277  SQETVNGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 456
             +  V G +      Q                                            
Sbjct: 61   PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLSVN 120

Query: 457  XELT---AQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFA 627
                   A +F IGPYA ETQLV+PPVFS+FTT+PS+A  TPPPE+VQ+TTPSSPEVPFA
Sbjct: 121  AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180

Query: 628  QLLSSSLAQKWRN-------------------------------------SGAPSPFYDK 696
            QLL+SSL +  RN                                     SG  SPF D+
Sbjct: 181  QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240

Query: 697  RADIDLPMVEAPKFVGYEHFMNYKW----------------------------------- 771
               +D     APK +G+EHF   KW                                   
Sbjct: 241  HPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300

Query: 772  -------------GSRLGSGALTPNGKEPPSQECNILENNQNFEVVESEN---------- 882
                         GSRLGSG+LTP+G  P S++   +  NQ  EV    N          
Sbjct: 301  LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRD-GFVRENQISEVASLANSDNGTKSDEH 359

Query: 883  --NHRVSFELRGEDIPISIMKETTKGKDLATEVALSFQTQTSVRSD-------------- 1014
              +HRVSFEL GE++   +  ++     +  E       +  +R D              
Sbjct: 360  IIDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCP 419

Query: 1015 -------------DGRD-------RTASFGSSKDFNFNNT----------------NDEV 1086
                         DG +       R+ + GS K+FNF+NT                N+ V
Sbjct: 420  EESSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENV 479

Query: 1087 AIELGPQKNWNFFPMLQSGGS 1149
              E  P  NW FFPMLQS  S
Sbjct: 480  GKESKPSNNWTFFPMLQSEAS 500


>ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis]
          Length = 500

 Score =  198 bits (504), Expect = 5e-48
 Identities = 150/501 (29%), Positives = 193/501 (38%), Gaps = 150/501 (29%)
 Frame = +1

Query: 97   MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276
            M SVHDS                  +P+ +QKRRWG  WS+YWCFGS+K SKRI H +++
Sbjct: 1    MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLL 60

Query: 277  SQETVNGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 456
             +  V G +      Q                                            
Sbjct: 61   PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPAGLLSLNSLSVN 120

Query: 457  XELT---AQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFA 627
                   A +F IGPYA ETQLV+PPVFS+FTT+PS+A  TPPPE+VQ+TTPSSPEVPFA
Sbjct: 121  AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180

Query: 628  QLLSSSLAQKWRN-------------------------------------SGAPSPFYDK 696
            QLL+SSL +  RN                                     SG  SPF D+
Sbjct: 181  QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240

Query: 697  RADIDLPMVEAPKFVGYEHFMNYKW----------------------------------- 771
               +D     APK +G+EHF   KW                                   
Sbjct: 241  HPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300

Query: 772  -------------GSRLGSGALTPNGKEPPSQECNILENNQNFEVVESEN---------- 882
                         GSRLGSG+LTP+G  P S++   +  NQ  EV    N          
Sbjct: 301  LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRD-GFVRENQISEVASLANSDNGTKSDEH 359

Query: 883  --NHRVSFELRGEDIPISIMKETTKGKDLATEVALSFQTQTSVRSD-------------- 1014
              +HRVSFEL GE++   +  ++     +  E       +  +R D              
Sbjct: 360  IIDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCP 419

Query: 1015 -------------DGRD-------RTASFGSSKDFNFNNT----------------NDEV 1086
                         DG +       R+ + GS K+FNF+NT                N+ V
Sbjct: 420  EESSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENV 479

Query: 1087 AIELGPQKNWNFFPMLQSGGS 1149
              E  P  NW FFPMLQS  S
Sbjct: 480  GKESKPSNNWTFFPMLQSEAS 500


>ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
            gi|462415503|gb|EMJ20240.1| hypothetical protein
            PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  192 bits (487), Expect = 4e-46
 Identities = 157/503 (31%), Positives = 203/503 (40%), Gaps = 154/503 (30%)
 Frame = +1

Query: 97   MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276
            MRSV+ S                  QP+ V KRRWG  WS+YWCFG +K+ KRIGH +++
Sbjct: 1    MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHKN-KRIGHAVLV 59

Query: 277  SQ--------ETVNGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 432
             +          ++  +TST                                        
Sbjct: 60   PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSAN 119

Query: 433  XXXXXXXXXELTAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSP 612
                        A IF IGPYA ETQLVSPPVFS+F T+PS+AP TPPPE+VQ+TTPSSP
Sbjct: 120  AYSPGGP-----ASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSP 174

Query: 613  EVPFAQLLSSSLAQKWR-------------------------------------NSGAPS 681
            EVPFAQLL+SSL +  R                                     NSG  S
Sbjct: 175  EVPFAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSS 234

Query: 682  PFYDKRADIDLPMVEAPKFVGYEHFMNYKW------------------------------ 771
            PF D+   ++  M EAPK  G++HF   KW                              
Sbjct: 235  PFPDRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGN 294

Query: 772  ------------------GSRLGSGALTPNGKEPPSQECNILEN-----------NQNFE 864
                              GSRLGSG LTP+G  P S++  +LEN               +
Sbjct: 295  ELGSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQ 354

Query: 865  VVESENNHRVSFELRGEDIPISIMKETT--------KGKDLATEV-----ALSFQT---- 993
             VE+  +HRVSFEL GED+   +  +            K +A+E      ALS  +    
Sbjct: 355  TVETVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHC 414

Query: 994  -----QTSVR-----SDDGRD------RTASFGSSKDFNFNNTNDEV------------- 1086
                 ++S R     S +G D      R+ + GS+KDFNF+NT  EV             
Sbjct: 415  EFSVEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWAN 474

Query: 1087 ----AIELGPQKNWNFFPMLQSG 1143
                A E  P  +W FFP+LQ G
Sbjct: 475  KNVAAKESKPCNDWTFFPILQPG 497


>ref|XP_002865912.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
            subsp. lyrata] gi|297311747|gb|EFH42171.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 437

 Score =  190 bits (483), Expect = 1e-45
 Identities = 147/439 (33%), Positives = 196/439 (44%), Gaps = 88/439 (20%)
 Frame = +1

Query: 97   MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276
            MR+V++S                  QPS VQK RWG  WS+Y CFG+ K++KRIG+ +++
Sbjct: 1    MRNVNNSVETVNAAATAIVTAESRVQPSSVQKGRWGKCWSLYSCFGTQKNNKRIGNAVLV 60

Query: 277  SQETVNGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 456
             +   +GV   T                                                
Sbjct: 61   PEPVASGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPGGQLSLTSNTFS 120

Query: 457  XELTAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPE-AVQMTTPSSPEVPFAQL 633
             +    +F +GPYA+ETQ V+PPVFS+F T+PS+AP TPPPE +V +TTPSSPEVPFAQL
Sbjct: 121  PKEPQSVFTVGPYANETQPVTPPVFSAFVTEPSTAPYTPPPESSVHITTPSSPEVPFAQL 180

Query: 634  LSSSLA-----------QKW-----------------------------RNSGAPSPFYD 693
            L+SSL            QK+                              NSG  SP+  
Sbjct: 181  LTSSLELTRRNSSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPG 240

Query: 694  KRADIDLPMVEAPKFVGYEHFMNYKWGSRLG--------------SGALTPNGKE----- 816
            K   ++  + E PKF+G+EHF   KWGSR G              SGALTPNG E     
Sbjct: 241  KSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASGALTPNGLEIISGN 300

Query: 817  -PPSQECNILENNQNFEVVESEN----------NHRVSFELRGEDIPISIMKETTKGKD- 960
              PS     L +NQ  EV    N          +HRVSFEL GED+   +  +  +  D 
Sbjct: 301  LTPSNTTWPL-HNQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHDR 359

Query: 961  ------LATEVALSFQTQTSV--RSDDGRD--------RTASFGSSKDFNFNNTNDEVAI 1092
                  + TE + S   + ++  RS D            ++S GSSK+F F+NT DE  I
Sbjct: 360  MNNNDRIETEESSSTDLRRNMEKRSADRETEQQRIQKLNSSSIGSSKEFKFDNTKDE-NI 418

Query: 1093 ELGPQKNWNFFPMLQSGGS 1149
            E     +W+FFP L+SG S
Sbjct: 419  EKVAGNSWSFFPGLRSGVS 437


>ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|10177409|dbj|BAB10540.1| unnamed protein
            product [Arabidopsis thaliana] gi|40823427|gb|AAR92282.1|
            At5g52430 [Arabidopsis thaliana]
            gi|56381929|gb|AAV85683.1| At5g52430 [Arabidopsis
            thaliana] gi|110738650|dbj|BAF01250.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332008830|gb|AED96213.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 438

 Score =  189 bits (480), Expect = 3e-45
 Identities = 141/416 (33%), Positives = 186/416 (44%), Gaps = 90/416 (21%)
 Frame = +1

Query: 172  QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXX 351
            QPS  QK RWG  WS+Y CFG+ K++KRIG+ +++ +   +GV   T             
Sbjct: 27   QPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTSGVPVVTVQNSATSTTVVLP 86

Query: 352  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPPVF 531
                                                +    +F +GPYA+ETQ V+PPVF
Sbjct: 87   FIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQSVFTVGPYANETQPVTPPVF 146

Query: 532  SSFTTQPSSAPLTPPPEA-VQMTTPSSPEVPFAQLLSSSLAQKWR--------------- 663
            S+F T+PS+AP TPPPE+ V +TTPSSPEVPFAQLL+SSL    R               
Sbjct: 147  SAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQKFSSSHY 206

Query: 664  -------------------------NSGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYK 768
                                     NSG  SP+  K   ++  + E PKF+G+EHF   K
Sbjct: 207  EFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLGFEHFTARK 266

Query: 769  WGSRLG--------------SGALTPNGKEPPSQECNILEN-------NQNFEVVESEN- 882
            WGSR G              SGALTPNG E  S   N+  N       NQ  EV    N 
Sbjct: 267  WGSRFGSGSITPVGHGSGLASGALTPNGPEIVSG--NLTPNNTTWPLQNQISEVASLANS 324

Query: 883  ---------NHRVSFELRGEDIPISIMKETTKGKD-------LATEVALSFQTQTSVRSD 1014
                     +HRVSFEL GED+   +  +  +  D       + TE + S   + ++   
Sbjct: 325  DHGSEVMVADHRVSFELTGEDVARCLASKLNRSHDRMNNNDRIETEESSSTDIRRNIEKR 384

Query: 1015 DGRDR-----------TASFGSSKDFNFNNTNDEVAIELGPQKNWNFFPMLQSGGS 1149
             G DR           ++S GSSK+F F+NT DE  IE     +W+FFP L+SG S
Sbjct: 385  SG-DRENEQHRIQKLSSSSIGSSKEFKFDNTKDE-NIEKVAGNSWSFFPGLRSGVS 438


>ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|222858882|gb|EEE96429.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 507

 Score =  189 bits (480), Expect = 3e-45
 Identities = 150/473 (31%), Positives = 194/473 (41%), Gaps = 153/473 (32%)
 Frame = +1

Query: 178  SPVQKRRWGGWWSMYWCFGSY---KHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXX 348
            S VQKRRWGG WS+YWCFGS+   K+SKRIGH +++ +  V G  +S+   Q        
Sbjct: 31   SSVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVLVPEPEVPGAVSSSTENQTQSTPILL 90

Query: 349  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELT---AQIFLIGPYADETQLVS 519
                                                       A IF IGPYA ETQLV+
Sbjct: 91   PFIAPPSSPASFLQSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPYAHETQLVT 150

Query: 520  PPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWR------------ 663
            PPVFS+FTT+PS+AP TPPPE+VQ+TTPSSPEVPFAQLL+SSL +  R            
Sbjct: 151  PPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGPNQKFSLSH 210

Query: 664  -------------------------NSGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYK 768
                                     NSG  SPF D+   ++  M EAPK +G+EHF   K
Sbjct: 211  YEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLLGFEHFSTRK 270

Query: 769  WGSRLGSGALTPNGKEP------------------------------------------- 819
            WGSRLGSG+LTP+                                               
Sbjct: 271  WGSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMGLSRLCSGTATPDGAGLRSRLGSGTL 330

Query: 820  ------PSQECNILENNQNFEV---VESEN---------NHRVSFELRGEDI-------- 921
                  P+ +   L  NQ  EV     SEN         +HRVSFEL GE++        
Sbjct: 331  TPDCFVPASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEEVARCLEIKS 390

Query: 922  ----------PISIMKE-TTKGKDLAT---------EVALSFQTQTSVRSDDG----RDR 1029
                      P   M E   +G  LA          E +     + S  +++     + R
Sbjct: 391  VASTRTFPEYPQDTMPEDPVRGDRLAMNGERCLQNGEASSEMPEKNSEETEEDHVYRKHR 450

Query: 1030 TASFGSSKDFNFNNTNDEVA-----------------IELGPQKNWNFFPMLQ 1137
            + + GS K+FNF+N+  EV+                  E  P  +W FFP+LQ
Sbjct: 451  SITLGSIKEFNFDNSKGEVSDKPAISSEWWANETIAGKEARPANSWTFFPLLQ 503


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  188 bits (478), Expect = 5e-45
 Identities = 150/469 (31%), Positives = 198/469 (42%), Gaps = 147/469 (31%)
 Frame = +1

Query: 172  QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXX 351
            QP+ VQKRRWGG WS+YWCFGS+K +KRIGH ++  +  V G   ++   Q         
Sbjct: 40   QPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTSAENQSQSTAITVP 98

Query: 352  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELT---AQIFLIGPYADETQLVSP 522
                                                      A IF IGPYA ETQLV+P
Sbjct: 99   FIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTP 158

Query: 523  PVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWR------------- 663
            P FS+FTT+PS+AP TPPPE+VQ+TTPSSPEVPFAQLL+SSL +  R             
Sbjct: 159  PAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHY 218

Query: 664  ------------------------NSGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW 771
                                    NSG  SPF D+   ++  M EAPK +G+EHF   KW
Sbjct: 219  EFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHFTTRKW 278

Query: 772  GSRLGSGALTPNG----------------------------------------------- 810
            GSRLGSG +TP+G                                               
Sbjct: 279  GSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSLTPD 338

Query: 811  -KEPPSQECNILEN--NQNFEVVESEN---------NHRVSFELRGEDI----------- 921
               P S++   LEN  ++   +  SEN         +HRVSFEL GE++           
Sbjct: 339  AVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKSLAS 398

Query: 922  --------PISIMKETTK-GKDLATEVALSFQTQTSVRSDD------------GRDRTAS 1038
                    P S+ ++  K GK L T+  L    +TS  + +             + R+ +
Sbjct: 399  CRAFSECPPDSMAEDQIKSGKMLMTDENLP-TGETSGETPEKPSGEMEEEHCYRKHRSIT 457

Query: 1039 FGSSKDFNFNNT---------------NDEVA-IELGPQKNWNFFPMLQ 1137
             GS K+FNF+N+               N+ +A  E  P  NW FFP+LQ
Sbjct: 458  LGSIKEFNFDNSKEVPDKPSINSEWWANETIAGKEARPANNWTFFPLLQ 506


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
            lycopersicum]
          Length = 470

 Score =  188 bits (477), Expect = 6e-45
 Identities = 134/344 (38%), Positives = 165/344 (47%), Gaps = 116/344 (33%)
 Frame = +1

Query: 466  TAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSS 645
            TA IF IGPYA ETQLVSPPVFS+FTT+PS+A  TPPPE V MTTP SPEVPFAQLL+SS
Sbjct: 127  TASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSS 186

Query: 646  LAQKWR------------------------------------NSGAPSPFYDKRADIDLP 717
            LA+  R                                    NSG  SPF  K   I+  
Sbjct: 187  LARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFR 246

Query: 718  MVEAPKFVGYEHFMNYKWGSR----------------------------LGSGALTPNGK 813
              E PKF+GYEHF   KWGSR                            LGSG +TPNG 
Sbjct: 247  KGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGG 306

Query: 814  EPPSQECNILEN-----------NQNFEVVESENNHRVSFELRGEDIPISIMKE------ 942
            EPPS++  +LEN           +   E+ E+  +HRVSFEL  ED+P    KE      
Sbjct: 307  EPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHS 366

Query: 943  -TTKGKDLATEVALSFQTQTSV-----------RSDDGRD------RTASFGSSKDFNFN 1068
              T   D++  +A   ++ +S+            S+ G D      R  +FGSSKDF+F+
Sbjct: 367  QPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFD 426

Query: 1069 N----------------TNDEVAI-ELGPQKNWNFFPMLQSGGS 1149
            N                T+D+ A+ E G Q NW FFP+LQ G S
Sbjct: 427  NVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470



 Score = 70.1 bits (170), Expect = 2e-09
 Identities = 27/42 (64%), Positives = 33/42 (78%)
 Frame = +1

Query: 172 QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNG 297
           QPS VQKRRWG  WS+YWCFGS+KHSKRIGH +++ +    G
Sbjct: 26  QPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPG 67


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  188 bits (477), Expect = 6e-45
 Identities = 127/325 (39%), Positives = 162/325 (49%), Gaps = 100/325 (30%)
 Frame = +1

Query: 469  AQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSL 648
            A +F IGPYA ETQLVSPPVFS+F T+PS+AP TPPPE+VQ+TTPSSPEVPFAQLL+SSL
Sbjct: 128  ASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSL 187

Query: 649  AQKWR----------------------------------NSGAPSPFYDKRADIDLPMVE 726
             +  R                                  NSG  SPF D+R     P+VE
Sbjct: 188  DRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISPISNSGTSSPFPDRR-----PIVE 242

Query: 727  APKFVGYEHFMNYKWGSRLGSGALTPNGKEPPSQECNILENNQNFEVVESEN-------- 882
            APK +G+EHF   +WGSRLGSG+LTP+G  P S++  +LE NQ  EV    N        
Sbjct: 243  APKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLE-NQISEVASLANSESGSQNG 301

Query: 883  ----NHRVSFELRGEDIPISIMK------ETTKG--KDLATE------------------ 972
                +HRVSFEL GED+ + + K      ET +   +D+  E                  
Sbjct: 302  ETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCC 361

Query: 973  ---VALSFQTQTSVRSDDGRDRTA-------SFGSSKDFNFNNTNDEVAIE--------- 1095
               V  + +  +   S +G +            GS K+FNF+NT  EV+ +         
Sbjct: 362  EFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWW 421

Query: 1096 ---------LGPQKNWNFFPMLQSG 1143
                      GPQ NW FFP+LQ G
Sbjct: 422  VNEKVVGKGTGPQTNWTFFPLLQPG 446



 Score = 66.2 bits (160), Expect = 4e-08
 Identities = 30/67 (44%), Positives = 40/67 (59%)
 Frame = +1

Query: 97  MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276
           MRSV++S                  QP+ VQKRRWG   S+YWCFGS++HSKRIGH +++
Sbjct: 1   MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60

Query: 277 SQETVNG 297
            +  V G
Sbjct: 61  PEPMVPG 67


>emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]
          Length = 385

 Score =  188 bits (477), Expect = 6e-45
 Identities = 127/325 (39%), Positives = 162/325 (49%), Gaps = 100/325 (30%)
 Frame = +1

Query: 469  AQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSL 648
            A +F IGPYA ETQLVSPPVFS+F T+PS+AP TPPPE+VQ+TTPSSPEVPFAQLL+SSL
Sbjct: 65   ASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSL 124

Query: 649  AQKWR----------------------------------NSGAPSPFYDKRADIDLPMVE 726
             +  R                                  NSG  SPF D+R     P+VE
Sbjct: 125  DRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISPISNSGTSSPFPDRR-----PIVE 179

Query: 727  APKFVGYEHFMNYKWGSRLGSGALTPNGKEPPSQECNILENNQNFEVVESEN-------- 882
            APK +G+EHF   +WGSRLGSG+LTP+G  P S++  +LE NQ  EV    N        
Sbjct: 180  APKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLE-NQISEVASLANSESGSQNG 238

Query: 883  ----NHRVSFELRGEDIPISIMK------ETTKG--KDLATE------------------ 972
                +HRVSFEL GED+ + + K      ET +   +D+  E                  
Sbjct: 239  ETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCC 298

Query: 973  ---VALSFQTQTSVRSDDGRDRTA-------SFGSSKDFNFNNTNDEVAIE--------- 1095
               V  + +  +   S +G +            GS K+FNF+NT  EV+ +         
Sbjct: 299  EFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWW 358

Query: 1096 ---------LGPQKNWNFFPMLQSG 1143
                      GPQ NW FFP+LQ G
Sbjct: 359  VNEKVVGKGTGPQTNWTFFPLLQPG 383


>ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Capsella rubella]
            gi|482549191|gb|EOA13385.1| hypothetical protein
            CARUB_v10026425mg [Capsella rubella]
          Length = 437

 Score =  186 bits (473), Expect = 2e-44
 Identities = 142/441 (32%), Positives = 191/441 (43%), Gaps = 90/441 (20%)
 Frame = +1

Query: 97   MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276
            MR+V++S                  QPS VQKRRW   WS+Y CFGS K++KRIG+ +++
Sbjct: 1    MRNVNNSVETVNAAATAIITAESRVQPSSVQKRRWAKCWSLYSCFGSQKNNKRIGNAVLV 60

Query: 277  SQETVNGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 456
             +   +GV   T                                                
Sbjct: 61   PEPVASGVPVVTVQNSATSTTVVLPFIAPPSSPASFLPSDPSSVSHSPVGPLSLTSNTFS 120

Query: 457  XELTAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLL 636
             +    +F +GPYA+ETQ V+PPVFS+F T+PS+AP TPPPE+    TPSSPEVPFAQLL
Sbjct: 121  PKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPES--SVTPSSPEVPFAQLL 178

Query: 637  SSSLAQKWR---------------------------------------NSGAPSPFYDKR 699
            +SSL    R                                       NSG  SP+  K 
Sbjct: 179  TSSLELTRRDSSGINQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKS 238

Query: 700  ADIDLPMVEAPKFVGYEHFMNYKWGSRLGSGALTP--------------------NGKEP 819
              ++  + E PKF+G+EHF   KWGSR GSG++TP                    +G   
Sbjct: 239  PMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGMASGALTPNAPEIISGNLT 298

Query: 820  PSQECNILENNQNFEVVESEN----------NHRVSFELRGEDIPISIMKETTKGKD--- 960
            PS     L+ NQ  EV    N          +HRVSFEL GED+   +  +  +  D   
Sbjct: 299  PSNTTWPLQ-NQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHDRMN 357

Query: 961  ----LATEVAL--------SFQTQTSVRSDDGRDR------TASFGSSKDFNFNNTNDEV 1086
                +ATE +         SFQ   S  + +   +      ++S GSSK+F F+NT DE 
Sbjct: 358  NNDRIATEESSSTDRGRRNSFQKIESTENRETEQQRIQKLSSSSIGSSKEFKFDNTKDE- 416

Query: 1087 AIELGPQKNWNFFPMLQSGGS 1149
             IE     +W+FFP L+SG S
Sbjct: 417  NIEKVAGNSWSFFPGLRSGVS 437


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  186 bits (472), Expect = 2e-44
 Identities = 133/344 (38%), Positives = 161/344 (46%), Gaps = 116/344 (33%)
 Frame = +1

Query: 466  TAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSS 645
            TA IF IGPYA ETQLVSPPVFS+FTT+PS+A  TPPPE V MTTP SPEVPFAQLL+SS
Sbjct: 127  TASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFAQLLTSS 186

Query: 646  LAQKWR------------------------------------NSGAPSPFYDKRADIDLP 717
            LA+  R                                    NSG  SPF  K   I+  
Sbjct: 187  LARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFR 246

Query: 718  MVEAPKFVGYEHFMNYKWGSR----------------------------LGSGALTPNGK 813
              E PKF+GYEHF   KWGSR                            LGSG +TPNG 
Sbjct: 247  KGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGG 306

Query: 814  EPPSQECNILE-----------NNQNFEVVESENNHRVSFELRGEDIPISIMKE------ 942
            EPPS++  +LE           ++   E+ E   +HRVSFEL GED+P    KE      
Sbjct: 307  EPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKEPVMSHS 366

Query: 943  -TTKGKDLATEVALSFQTQTSV-----------RSDDGRD------RTASFGSSKDFNFN 1068
              T   D++  +A   ++ +S+            S+ G D      R  +FGSSKDF+F+
Sbjct: 367  QQTLPMDVSNLLANEMKSGSSMAEEKTYGSPRKASESGEDQCHRKHRNITFGSSKDFDFD 426

Query: 1069 NTNDEV-----------------AIELGPQKNWNFFPMLQSGGS 1149
            N   EV                   E G Q NW FFP+LQ G S
Sbjct: 427  NVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470



 Score = 70.1 bits (170), Expect = 2e-09
 Identities = 27/42 (64%), Positives = 33/42 (78%)
 Frame = +1

Query: 172 QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNG 297
           QPS VQKRRWG  WS+YWCFGS+KHSKRIGH +++ +    G
Sbjct: 26  QPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPAAPG 67


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  184 bits (466), Expect = 1e-43
 Identities = 137/420 (32%), Positives = 178/420 (42%), Gaps = 98/420 (23%)
 Frame = +1

Query: 184  VQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXXXXXX 363
            VQKRRWG WWSMYWCFG  +H KRIGH +++ + T  G                      
Sbjct: 37   VQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPRAENPIQTPSIVLPFVAP 96

Query: 364  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPPVFSSFT 543
                                                 IF IGPYA ETQLVSPPVFS+FT
Sbjct: 97   PSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTSIFAIGPYAHETQLVSPPVFSTFT 156

Query: 544  TQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSS--------------------------- 642
            T+PS+AP TPPPE+V +TTPSSPEVPFAQLL                             
Sbjct: 157  TEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYPGS 216

Query: 643  ------SLAQKWRNSGAPSPFYDKRAD------IDLPMVEAPKFVGYEHFMNYKWGSRLG 786
                  S +     SG  SPF D          ++    + PK +  +      WGSRLG
Sbjct: 217  PVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSRLG 276

Query: 787  SGALTPNGKEPPSQECNILENNQNFEVV---ESEN---------NHRVSFELRGEDI--- 921
            SG++TP+G +  S +   L   Q  EVV    S N         NHRVSFEL  E++   
Sbjct: 277  SGSVTPDGAKSTSSD-GFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIRC 335

Query: 922  ----PISIMK---------ETTKGKDLATEVALSFQTQTSVRSDDG-------------- 1020
                P+++ +         E  + K+  ++V  S        S+D               
Sbjct: 336  VEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQLH 395

Query: 1021 -RDRTASFGSSKDFNFNN---------------TNDEV-AIELGPQKNWNFFPMLQSGGS 1149
             + R+ + GS K+FNF+N                N++V A E GP KNW+FFPM+Q G S
Sbjct: 396  PKQRSITLGSVKEFNFDNPDGGDSGNSIGSDWWANEKVDAKENGPTKNWSFFPMMQPGVS 455


>ref|XP_002867602.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
            subsp. lyrata] gi|297313438|gb|EFH43861.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  181 bits (460), Expect = 6e-43
 Identities = 137/412 (33%), Positives = 184/412 (44%), Gaps = 88/412 (21%)
 Frame = +1

Query: 172  QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXX 351
            QPS V K+ WG WWS+Y CFGS K++KRIGH +++ +   +G + +    Q         
Sbjct: 22   QPSSVHKK-WGSWWSLYLCFGSKKNNKRIGHAVLVPEPAASGAAVAP--VQNSSSNSTSM 78

Query: 352  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPPVF 531
                                                      F IGPYA ETQ V+PPVF
Sbjct: 79   FMPFIAPPSSPASFLPSGPPSVSHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVF 138

Query: 532  SSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWRN--------------- 666
            S+FTT+PS+AP TPPPE     +PSSPEVPFAQLL+SSL +  RN               
Sbjct: 139  SAFTTEPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLEKARRNIGGGMHHKFSAAHYE 193

Query: 667  -------------------SGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW------ 771
                               SG  SP+  K + I+  + E PKF+G+EHF   KW      
Sbjct: 194  FKSHQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGS 253

Query: 772  --------GSRLGSGALTPNGKEPPSQECNILENNQNFEVVESENN-------------- 885
                    GSRLGSGALTP+G  P   E ++L+ +Q  EV    N+              
Sbjct: 254  GSITPAGQGSRLGSGALTPDGLTP--LEGSLLD-SQITEVASLANSDHGSSRHNDEAAVV 310

Query: 886  -HRVSFELRGEDIPISIMK--------ETTKGKDLATEVALSFQTQTSVRSDDGRD-RTA 1035
             HRVSFEL GED+   +          E   G+ L        +T     S+  +  R+ 
Sbjct: 311  PHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPN---GCKTSGETESEQSQKLRSF 367

Query: 1036 SFGSSKDFNFNNTNDEVAIEL----------------GPQKNWNFFPMLQSG 1143
            S GSSK+F F+NTN+E+  ++                 P+ +W FFP+L+SG
Sbjct: 368  STGSSKEFKFDNTNEEMIEKVRSEWWANEKVAGKGDHSPRNSWTFFPVLRSG 419


>ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutrema salsugineum]
            gi|557102915|gb|ESQ43278.1| hypothetical protein
            EUTSA_v10013563mg [Eutrema salsugineum]
          Length = 440

 Score =  180 bits (456), Expect = 2e-42
 Identities = 144/441 (32%), Positives = 188/441 (42%), Gaps = 90/441 (20%)
 Frame = +1

Query: 97   MRSVHDSXXXXXXXXXXXXXXXXXXQPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVI 276
            MR+V++S                  QPS V KRRW   WS+  CFGS K++KRIG+ +++
Sbjct: 1    MRNVNNSVETVNAAATAIVTAESRVQPSSVPKRRWRNCWSLNSCFGSQKNNKRIGNAMLV 60

Query: 277  SQETV--NGVSTSTCYAQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 450
              E V   G    T                                              
Sbjct: 61   VPEPVATGGAPVVTVQNSATSSSIVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNT 120

Query: 451  XXXELTAQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPE-AVQMTTPSSPEVPFA 627
                    +F IGPYA+ETQ V+PPVFS+F T+PS+AP TPPPE +V +TTPSSPEVPFA
Sbjct: 121  FSTTEPQSVFTIGPYANETQPVTPPVFSAFITEPSTAPFTPPPESSVHITTPSSPEVPFA 180

Query: 628  QLLSSSLA----------QKW-----------------------------RNSGAPSPFY 690
            QLL+SSL           QK+                              NSG  SP+ 
Sbjct: 181  QLLTSSLELTRRNSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYP 240

Query: 691  DKRADIDLPMVEAPKFVGYEHFMNYKWGSRL----------GSGALTPNGKEPPSQECNI 840
             K   ++  + E PKF+G+EHF   KWGSR           GSGALTPNG    S+    
Sbjct: 241  GKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGALTPNGPGMVSESLTP 300

Query: 841  LENN--------QNFEVVESEN----------NHRVSFELRGEDIPISIMKETTKGKDLA 966
              NN        Q  EV    N          +HRVSFEL GED+   +  +  +  D  
Sbjct: 301  NNNNNTTWPLTSQVSEVASLANSDHGSEVVAADHRVSFELTGEDVARCLASKLNRSHDRM 360

Query: 967  T---------EVALSFQTQTSVRSDDGRDR-----------TASFGSSKDFNFNNTNDEV 1086
                        ++SFQ + +       DR           ++S GSSK+F F+NT +E 
Sbjct: 361  NNDERVETDERRSISFQKRENNVERVSGDREIEQQRIHKLSSSSIGSSKEFKFDNTKEE- 419

Query: 1087 AIELGPQKNWNFFPMLQSGGS 1149
             IE     +W+FFP L+SG S
Sbjct: 420  NIEKVAGNSWSFFPGLRSGVS 440


>ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Capsella rubella]
            gi|482552442|gb|EOA16635.1| hypothetical protein
            CARUB_v10004810mg [Capsella rubella]
          Length = 444

 Score =  179 bits (453), Expect = 4e-42
 Identities = 134/424 (31%), Positives = 180/424 (42%), Gaps = 99/424 (23%)
 Frame = +1

Query: 172  QPSP--VQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXX 345
            QPS   + K++WG WWS+YWCFGS K++KRIGH ++  +   +GV+ +            
Sbjct: 27   QPSSSLLHKKKWGSWWSLYWCFGSKKNNKRIGHAVLAPEPAASGVAVAPVQNSSSSNSTS 86

Query: 346  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPP 525
                                                        F IGPYA ETQ V+PP
Sbjct: 87   IFMPFIAPPSSPASFLPSGPPSVSHTPDPCRLRCSLLVNEPPSAFAIGPYAHETQPVTPP 146

Query: 526  VFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWRN------------- 666
            VFS+FTT+PS+AP TPPPE     +PSSPEVPFAQLL+SSL +  RN             
Sbjct: 147  VFSAFTTEPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERARRNSSGGMNHKFSAAH 201

Query: 667  ---------------------SGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW---- 771
                                 SG  SP+  K + I+  + E PKF+G+EHF   KW    
Sbjct: 202  YEFKSHQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRF 261

Query: 772  ----------GSRLGSGALTPNGKEPPSQEC---------NILENNQNFEVVESENN--- 885
                      GSRLGSGALTP+G      +          + L ++Q  EV    N+   
Sbjct: 262  GSGSITPAGQGSRLGSGALTPDGGGGMGSKIASGALTPLEDSLLDSQVSEVASLANSDHG 321

Query: 886  ------------HRVSFELRGEDIPISIMK--------ETTKGKDLATEVALSFQTQTSV 1005
                        HRVSFEL GED+   +          E   G+ L        +T    
Sbjct: 322  SSRHNDEAVVVAHRVSFELTGEDVARCLASKLNRSGSHERASGEHLRPN---GCKTSGET 378

Query: 1006 RSDDGRD-RTASFGSSKDFNFNNTNDEVAIEL----------------GPQKNWNFFPML 1134
             S+  +  R+ S GSSK+F F+NT +E   ++                 P  +W FFP+L
Sbjct: 379  ESEQSQKLRSFSLGSSKEFKFDNTEEETIEKVRSEWWANEKVAGKGDHSPANSWTFFPVL 438

Query: 1135 QSGG 1146
            +S G
Sbjct: 439  RSSG 442


>ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|26449762|dbj|BAC42004.1| unknown protein
            [Arabidopsis thaliana] gi|28951011|gb|AAO63429.1|
            At4g25620 [Arabidopsis thaliana]
            gi|332659684|gb|AEE85084.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 449

 Score =  177 bits (448), Expect = 1e-41
 Identities = 138/431 (32%), Positives = 186/431 (43%), Gaps = 107/431 (24%)
 Frame = +1

Query: 172  QPSPVQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXX 351
            QPS VQK+R G WWS+YWCFGS K++KRIGH +++ +   +G + +    Q         
Sbjct: 27   QPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAP--VQNSSSNSTSI 83

Query: 352  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPPVF 531
                                                      F IGPYA ETQ V+PPVF
Sbjct: 84   FMPFIAPPSSPASFLPSGPPSASHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVF 143

Query: 532  SSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWRN--------------- 666
            S+FTT+PS+AP TPPPE     +PSSPEVPFAQLL+SSL +  RN               
Sbjct: 144  SAFTTEPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYE 198

Query: 667  -------------------SGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW------ 771
                               SG  SP+  K + I+  + E PKF+G+EHF   KW      
Sbjct: 199  FKSCQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGS 258

Query: 772  --------GSRLGSGALTPNGKE-------PPSQECNI-------------LENNQNFEV 867
                    GSRLGSGALTP+G +       P   E  I             L ++Q  EV
Sbjct: 259  GSITPAGQGSRLGSGALTPDGSKLTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEV 318

Query: 868  VESENN---------------HRVSFELRGEDIPISIMK--------ETTKGKDLATEVA 978
                N+               HRVSFEL GED+   +          E   G+ L     
Sbjct: 319  ASLANSDHGSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCC 378

Query: 979  LSFQTQTSVRSDDGRDRTASFGSSKDFNFNNTNDEVAIEL----------------GPQK 1110
             +     S +S   + R+ S GS+K+F F++TN+E+  ++                 P+ 
Sbjct: 379  KTSGETESEQSQ--KLRSFSTGSNKEFKFDSTNEEMIEKIRSEWWANEKVAGKGDHSPRN 436

Query: 1111 NWNFFPMLQSG 1143
            +W FFP+L+SG
Sbjct: 437  SWTFFPVLRSG 447


>emb|CAA18164.1| putative protein [Arabidopsis thaliana] gi|7269412|emb|CAB81372.1|
            putative protein [Arabidopsis thaliana]
          Length = 424

 Score =  172 bits (435), Expect = 5e-40
 Identities = 133/426 (31%), Positives = 182/426 (42%), Gaps = 107/426 (25%)
 Frame = +1

Query: 187  QKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETVNGVSTSTCYAQXXXXXXXXXXXXXX 366
            QK++ G WWS+YWCFGS K++KRIGH +++ +   +G + +    Q              
Sbjct: 6    QKKKRGSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAP--VQNSSSNSTSIFMPFI 63

Query: 367  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTAQIFLIGPYADETQLVSPPVFSSFTT 546
                                                 F IGPYA ETQ V+PPVFS+FTT
Sbjct: 64   APPSSPASFLPSGPPSASHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSAFTT 123

Query: 547  QPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLAQKWRN-------------------- 666
            +PS+AP TPPPE     +PSSPEVPFAQLL+SSL +  RN                    
Sbjct: 124  EPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQ 178

Query: 667  --------------SGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW----------- 771
                          SG  SP+  K + I+  + E PKF+G+EHF   KW           
Sbjct: 179  VYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITP 238

Query: 772  ---GSRLGSGALTPNGKE-------PPSQECNI-------------LENNQNFEVVESEN 882
               GSRLGSGALTP+G +       P   E  I             L ++Q  EV    N
Sbjct: 239  AGQGSRLGSGALTPDGSKLTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLAN 298

Query: 883  N---------------HRVSFELRGEDIPISIMK--------ETTKGKDLATEVALSFQT 993
            +               HRVSFEL GED+   +          E   G+ L      +   
Sbjct: 299  SDHGSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCCKTSGE 358

Query: 994  QTSVRSDDGRDRTASFGSSKDFNFNNTNDEVAIEL----------------GPQKNWNFF 1125
              S +S   + R+ S GS+K+F F++TN+E+  ++                 P+ +W FF
Sbjct: 359  TESEQSQ--KLRSFSTGSNKEFKFDSTNEEMIEKIRSEWWANEKVAGKGDHSPRNSWTFF 416

Query: 1126 PMLQSG 1143
            P+L+SG
Sbjct: 417  PVLRSG 422


>ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508776005|gb|EOY23261.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 540

 Score =  163 bits (412), Expect = 2e-37
 Identities = 120/324 (37%), Positives = 156/324 (48%), Gaps = 99/324 (30%)
 Frame = +1

Query: 469  AQIFLIGPYADETQLVSPPVFSSFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSL 648
            A IF IGPYA ETQLV+PPVFS+ T +PS+AP TPPPE++Q+TTPSSPEVPFAQLL+SSL
Sbjct: 216  ASIFAIGPYAHETQLVTPPVFSALTPEPSTAPFTPPPESIQLTTPSSPEVPFAQLLASSL 275

Query: 649  AQKWR----NSGAPSPFYDKRADIDLPMVEAPKFVGYEHFMNYKW--------------- 771
                R    NSG  SPF D+R  ++  M EAPK +G+E+    KW               
Sbjct: 276  ESARRKAISNSGTSSPFPDRRPILEFHMGEAPKLLGFENLTTRKWCSRLGSGSLTPDGLG 335

Query: 772  -GSRLGS----------------GALTPNGKEPPSQECNILENNQNFEVV---------- 870
             GSRLGS                G+LTP+G  PPS++   L  +Q  EV           
Sbjct: 336  RGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPPSRD-GFLLGSQISEVALLTNQANGPK 394

Query: 871  --ESENNHRVSFELRGEDI----------PISIMKETTKG-------------KDL--AT 969
              E+  +HRVSFEL GED+          P   + E  K              KDL  + 
Sbjct: 395  NDETIVDHRVSFELSGEDVARCLESKSLLPSRTVSEYPKDLVAEGRIERDGIKKDLESSC 454

Query: 970  EVALSFQTQTSVRSDDG---------RDRTASFGSSKDFNFNNTNDEVA----------- 1089
            E+ +   +  +V    G         + R+ + GS K+FNF+NT  E +           
Sbjct: 455  ELFIRETSNETVEKASGKAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWA 514

Query: 1090 ------IELGPQKNWNFFPMLQSG 1143
                   E  P  +W FFPM + G
Sbjct: 515  NEKFARKEARPGNSWTFFPMFRPG 538


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  162 bits (409), Expect = 5e-37
 Identities = 133/419 (31%), Positives = 172/419 (41%), Gaps = 97/419 (23%)
 Frame = +1

Query: 184  VQKRRWGGWWSMYWCFGSYKHSKRIGHTLVISQETV--NGVSTSTCYAQXXXXXXXXXXX 357
            VQKRRWG  WS+Y CFG  KH K+IGH ++  + +   NG   S    Q           
Sbjct: 37   VQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAP 96

Query: 358  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEL-TAQIFLIGPYADETQLVSPPVFS 534
                                                  A IF IGPYA ETQLVSPPVFS
Sbjct: 97   PSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFS 156

Query: 535  SFTTQPSSAPLTPPPEAVQMTTPSSPEVPFAQLLSSSLA------------QKWR----- 663
            +FTT+PS+AP TPPPE+V +TTPSSPEVPFAQ L  SL             Q ++     
Sbjct: 157  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPFDFQSYQFHPGS 216

Query: 664  -------------NSGAPSPFYDKRADI------DLPMVEAPKFVGYEHFMNYKWGSRLG 786
                          SG  SPF D    +      +  + E PK +  +     +WGS  G
Sbjct: 217  PVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQG 276

Query: 787  SGALTPNGKEPPSQECNILENNQNFEVVESEN-----------NHRVSFELRGED----- 918
            SGALTP      S   N L + Q  +V                NHRVSFEL  ED     
Sbjct: 277  SGALTPESVRRGSP--NFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCV 334

Query: 919  ----------IPISIMKET-TKGKDLATEVALSFQTQTSVRSDDG--------------- 1020
                      +P  +   T  K +  + E   SF+ +  V S+D                
Sbjct: 335  EEKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHR 394

Query: 1021 RDRTASFGSSKDFNFNNTND----------------EVAIELGPQKNWNFFPMLQSGGS 1149
            + ++ + GS K+FNF+N ++                 +  E    KNW+FFPM+QSG S
Sbjct: 395  KQQSITLGSVKEFNFDNADEGDSRKPSSSNWWANGSVIGKEGETTKNWSFFPMVQSGVS 453

