BLASTX nr result

ID: Mentha25_contig00024423 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00024423
         (1488 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   202   4e-49
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   196   2e-47
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   187   1e-44
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   176   3e-41
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]     173   2e-40
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   170   2e-39
ref|XP_002865912.1| hydroxyproline-rich glycoprotein family prot...   165   6e-38
ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein...   162   5e-37
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   161   6e-37
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   160   1e-36
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   160   2e-36
ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Caps...   159   2e-36
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   159   4e-36
ref|XP_002867602.1| hydroxyproline-rich glycoprotein family prot...   155   6e-35
ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutr...   154   1e-34
ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Caps...   154   1e-34
ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein...   149   2e-33
emb|CAA18164.1| putative protein [Arabidopsis thaliana] gi|72694...   145   3e-32
ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494...   145   5e-32
ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phas...   144   8e-32

>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
            lycopersicum]
          Length = 470

 Score =  202 bits (513), Expect = 4e-49
 Identities = 148/447 (33%), Positives = 187/447 (41%), Gaps = 134/447 (29%)
 Frame = -1

Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030
            QP  VQKRRWG  WS+YWCFGS+KHSKRIGH + + +    G +   +  + PN ++T  
Sbjct: 26   QPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVT--ENPNHSATIV 83

Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQLV 871
            +PF                                   +       IF  GPYA ETQLV
Sbjct: 84   IPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLV 143

Query: 870  SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR----------- 724
            SPPVFS+FTT+PS+A FTPPPE V MTTP SPE PFAQLL+SSLA+  R           
Sbjct: 144  SPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLS 203

Query: 723  -------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYK 619
                                     N+   SP   K   ++ R  E P+F+GYEHF   K
Sbjct: 204  QYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRK 263

Query: 618  WGS------------------------------NSSALTPNGKEPPSQECDILEN----- 544
            WGS                               S  +TPNG EPPS++  +LEN     
Sbjct: 264  WGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEV 323

Query: 543  -----------------NHRVSFELRGEDIPTSIVK-------GTTKGKDLATEVALSFR 436
                             +HRVSFEL  ED+P+   K         T   D++  +A   R
Sbjct: 324  ASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMDVSNLLASEMR 383

Query: 435  TQTSV-----------RSNDGRDD-----RTTSFGSSKXXXXXXXXDEV----------- 337
            + +S+            S  G D+     R  +FGSSK         EV           
Sbjct: 384  SGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWW 443

Query: 336  -----GIKELGPQKNWNFFPMLQSGGS 271
                  +KE G Q NW FFP+LQ G S
Sbjct: 444  TSDKAAVKESGIQNNWTFFPVLQPGVS 470


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  196 bits (499), Expect = 2e-47
 Identities = 147/447 (32%), Positives = 185/447 (41%), Gaps = 134/447 (29%)
 Frame = -1

Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030
            QP  VQKRRWG  WS+YWCFGS+KHSKRIGH + + +    G +   +  + PN ++T  
Sbjct: 26   QPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPAAPGPAVPVT--ENPNHSATIV 83

Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQLV 871
            +PF                                   +       IF  GPYA ETQLV
Sbjct: 84   IPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSINAYSPGGTASIFAIGPYAHETQLV 143

Query: 870  SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR----------- 724
            SPPVFS+FTT+PS+A FTPPPE V MTTP SPE PFAQLL+SSLA+  R           
Sbjct: 144  SPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLS 203

Query: 723  -------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYK 619
                                     N+   SP   K   ++ R  E P+F+GYEHF   K
Sbjct: 204  QYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRK 263

Query: 618  WGSN------------------------------SSALTPNGKEPPSQECDILEN----- 544
            WGS                               S  +TPNG EPPS++  +LE      
Sbjct: 264  WGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLEYQISEV 323

Query: 543  -----------------NHRVSFELRGEDIPTSIVKGT-------TKGKDLATEVALSFR 436
                             +HRVSFEL GED+P+   K         T   D++  +A   +
Sbjct: 324  ASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKEPVMSHSQQTLPMDVSNLLANEMK 383

Query: 435  TQTSVR-----------SNDGRDD-----RTTSFGSSKXXXXXXXXDEV----------- 337
            + +S+            S  G D      R  +FGSSK         EV           
Sbjct: 384  SGSSMAEEKTYGSPRKASESGEDQCHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWW 443

Query: 336  -----GIKELGPQKNWNFFPMLQSGGS 271
                   KE G Q NW FFP+LQ G S
Sbjct: 444  TSDKAAGKESGIQNNWTFFPVLQPGVS 470


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  187 bits (474), Expect = 1e-44
 Identities = 144/453 (31%), Positives = 190/453 (41%), Gaps = 117/453 (25%)
 Frame = -1

Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105
            MRSV++S                 VQP  VQKRRWG   S+YWCFGS++HSKRIGH + +
Sbjct: 1    MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60

Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925
             +  + G     S  +  N +++  LPF                                
Sbjct: 61   PEPMVPGAVAPAS--ENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALS 118

Query: 924  XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 766
               +       +F  GPYA ETQLVSPPVFS+F T+PS+APFTPPPESVQ+TTPSSPE P
Sbjct: 119  VNAYSPSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVP 178

Query: 765  FAQLLSSSLAQKWRNT--------------------EVP--------------SPLFDKR 688
            FAQLL+SSL +  RN+                    E P              SP  D+R
Sbjct: 179  FAQLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISPISNSGTSSPFPDRR 238

Query: 687  ANVDLRLVEAPEFVGYEHFMNYKWGS--NSSALTPNGKEPPSQECDILEN---------- 544
                  +VEAP+ +G+EHF   +WGS   S +LTP+G  P S++  +LEN          
Sbjct: 239  P-----IVEAPKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLAN 293

Query: 543  ------------NHRVSFELRGEDIPTSIVKGTTKG--------KDLATE---------- 454
                        +HRVSFEL GED+   + K             +D+  E          
Sbjct: 294  SESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGI 353

Query: 453  -----------VALSFRTQTSVRSNDGRDDR------TTSFGSSKXXXXXXXXDEVGIKE 325
                       V  + +  +   S +G +++          GS K         EV  K 
Sbjct: 354  SESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKP 413

Query: 324  -----------------LGPQKNWNFFPMLQSG 277
                              GPQ NW FFP+LQ G
Sbjct: 414  NIIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPG 446


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  176 bits (445), Expect = 3e-41
 Identities = 146/483 (30%), Positives = 190/483 (39%), Gaps = 149/483 (30%)
 Frame = -1

Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105
            MRSV+DS                 VQP  VQK+RWG  W +YWCFGS K+SKRIGH + +
Sbjct: 1    MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60

Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925
             +  + G S ST+     NPT    LPF                                
Sbjct: 61   PEPVVPGASVSTA-ENVSNPTGII-LPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 118

Query: 924  XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 766
               +       IF  GPYA ETQLV+PPVFS+ TT+PS+APFTPPPESVQ+TTPSSPE P
Sbjct: 119  VNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVP 178

Query: 765  FAQLLSSSLAQKWRNTEV-------------------------------------PSPLF 697
            FAQLL+SSL +  RN+ +                                      SP  
Sbjct: 179  FAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFP 238

Query: 696  DKRANVDLRLVEAPEFVGYEHFMNYKWGS------------------------------- 610
            D+R  ++ R+ EAP+ +G+E+F   KWGS                               
Sbjct: 239  DRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLG 298

Query: 609  ---NSSALTPNGKEPPSQ----------ECDILEN------------NHRVSFELRGEDI 505
                S +LTP+G  P S+          E  +L N            +HRVSFEL GED+
Sbjct: 299  SRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDV 358

Query: 504  ------------------PTSIV-------KGTTKGKDLATEVALSFRTQTSVRSNDGRD 400
                              P  +V        G  K  + + E+ +   +  +V    G  
Sbjct: 359  APCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEA 418

Query: 399  D--------RTTSFGSSKXXXXXXXXDE----------------VGIKELGPQKNWNFFP 292
            +        R+ + GS K         E                V  KE  P  +W FFP
Sbjct: 419  EEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFP 478

Query: 291  MLQ 283
            MLQ
Sbjct: 479  MLQ 481


>gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]
          Length = 521

 Score =  173 bits (438), Expect = 2e-40
 Identities = 106/281 (37%), Positives = 134/281 (47%), Gaps = 47/281 (16%)
 Frame = -1

Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105
            MR+V++S                  QP AV KRRWG  WS+YWCFGS+K+SKRIGH + +
Sbjct: 1    MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFGSHKNSKRIGHAVLV 60

Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925
             +  + G +      Q P+  +   LPF                                
Sbjct: 61   PEPVLPGAAAPAPENQAPS--TAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 118

Query: 924  XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 766
               +       IF  GPYA ETQLVSPPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE P
Sbjct: 119  INAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 178

Query: 765  FAQLLSSSLAQKWRNTE--------------------------------------VPSPL 700
            FAQLL+SSL +  RN+                                         SP 
Sbjct: 179  FAQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPF 238

Query: 699  FDKRANVDLRLVEAPEFVGYEHFMNYKWGS--NSSALTPNG 583
             DK   +  R+ EAP  +G+EHF  +KWGS   S +LTP+G
Sbjct: 239  PDKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDG 279


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  170 bits (430), Expect = 2e-39
 Identities = 146/487 (29%), Positives = 190/487 (39%), Gaps = 153/487 (31%)
 Frame = -1

Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQ----KRRWGEWWSMYWCFGSYKHSKRIGH 1117
            MRSV+DS                 VQP  VQ    K+RWG  W +YWCFGS K+SKRIGH
Sbjct: 1    MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60

Query: 1116 TLAISQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXX 937
             + + +  + G S ST+     NPT    LPF                            
Sbjct: 61   AVLVPEPVVPGASVSTA-ENVSNPTGII-LPFIAPPSSPASFLQSDPPSATQSPAGLLSL 118

Query: 936  XXXXXXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSS 778
                   +       IF  GPYA ETQLV+PPVFS+ TT+PS+APFTPPPESVQ+TTPSS
Sbjct: 119  TSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSS 178

Query: 777  PEAPFAQLLSSSLAQKWRNTEV-------------------------------------P 709
            PE PFAQLL+SSL +  RN+ +                                      
Sbjct: 179  PEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTS 238

Query: 708  SPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS--------------------------- 610
            SP  D+R  ++ R+ EAP+ +G+E+F   KWGS                           
Sbjct: 239  SPFPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDG 298

Query: 609  -------NSSALTPNGKEPPSQ----------ECDILEN------------NHRVSFELR 517
                    S +LTP+G  P S+          E  +L N            +HRVSFEL 
Sbjct: 299  MGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELS 358

Query: 516  GEDI------------------PTSIV-------KGTTKGKDLATEVALSFRTQTSVRSN 412
            GED+                  P  +V        G  K  + + E+ +   +  +V   
Sbjct: 359  GEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKA 418

Query: 411  DGRDD--------RTTSFGSSKXXXXXXXXDE----------------VGIKELGPQKNW 304
             G  +        R+ + GS K         E                V  KE  P  +W
Sbjct: 419  SGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSW 478

Query: 303  NFFPMLQ 283
             FFPMLQ
Sbjct: 479  TFFPMLQ 485


>ref|XP_002865912.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
            subsp. lyrata] gi|297311747|gb|EFH42171.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 437

 Score =  165 bits (417), Expect = 6e-38
 Identities = 139/441 (31%), Positives = 186/441 (42%), Gaps = 103/441 (23%)
 Frame = -1

Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105
            MR+V++S                 VQP +VQK RWG+ WS+Y CFG+ K++KRIG+ + +
Sbjct: 1    MRNVNNSVETVNAAATAIVTAESRVQPSSVQKGRWGKCWSLYSCFGTQKNNKRIGNAVLV 60

Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925
             +   +GV   T   Q    ++T  LPF                                
Sbjct: 61   PEPVASGVPVVT--VQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPGGQLSLTSNT 118

Query: 924  XXPH----IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFA 760
              P     +F  GPYA+ETQ V+PPVFS+F T+PS+AP+TPPPE SV +TTPSSPE PFA
Sbjct: 119  FSPKEPQSVFTVGPYANETQPVTPPVFSAFVTEPSTAPYTPPPESSVHITTPSSPEVPFA 178

Query: 759  QLLSSSLAQKWRNTE----------------------------------------VPSPL 700
            QLL+SSL    RN+                                           SP 
Sbjct: 179  QLLTSSLELTRRNSSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 238

Query: 699  FDKRANVDLRLVEAPEFVGYEHFMNYKWGSN----------------SSALTPNGKE--- 577
              K   V+ R+ E P+F+G+EHF   KWGS                 S ALTPNG E   
Sbjct: 239  PGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASGALTPNGLEIIS 298

Query: 576  ---PPSQE--------------------CDILENNHRVSFELRGEDIPTSIVKGTTKGKD 466
                PS                       +++  +HRVSFEL GED+   +     +  D
Sbjct: 299  GNLTPSNTTWPLHNQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHD 358

Query: 465  -------LATEVALS--FRTQTSVRSNDGRDDR-------TTSFGSSKXXXXXXXXDEVG 334
                   + TE + S   R     RS D   ++       ++S GSSK        DE  
Sbjct: 359  RMNNNDRIETEESSSTDLRRNMEKRSADRETEQQRIQKLNSSSIGSSKEFKFDNTKDENI 418

Query: 333  IKELGPQKNWNFFPMLQSGGS 271
             K  G   +W+FFP L+SG S
Sbjct: 419  EKVAG--NSWSFFPGLRSGVS 437


>ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|10177409|dbj|BAB10540.1| unnamed protein
            product [Arabidopsis thaliana] gi|40823427|gb|AAR92282.1|
            At5g52430 [Arabidopsis thaliana]
            gi|56381929|gb|AAV85683.1| At5g52430 [Arabidopsis
            thaliana] gi|110738650|dbj|BAF01250.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332008830|gb|AED96213.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 438

 Score =  162 bits (409), Expect = 5e-37
 Identities = 133/416 (31%), Positives = 179/416 (43%), Gaps = 103/416 (24%)
 Frame = -1

Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030
            QP + QK RWG+ WS+Y CFG+ K++KRIG+ + + +   +GV   T   Q    ++T  
Sbjct: 27   QPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTSGVPVVT--VQNSATSTTVV 84

Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH----IFLKGPYADETQLVSPP 862
            LPF                                  P     +F  GPYA+ETQ V+PP
Sbjct: 85   LPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQSVFTVGPYANETQPVTPP 144

Query: 861  VFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFAQLLSSSLA-----------QKW--- 727
            VFS+F T+PS+AP+TPPPE SV +TTPSSPE PFAQLL+SSL            QK+   
Sbjct: 145  VFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQKFSSS 204

Query: 726  --------------------------RNTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMN 625
                                       N+   SP   K   V+ R+ E P+F+G+EHF  
Sbjct: 205  HYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLGFEHFTA 264

Query: 624  YKWGSN----------------SSALTPNGKEPPS------------------------- 568
             KWGS                 S ALTPNG E  S                         
Sbjct: 265  RKWGSRFGSGSITPVGHGSGLASGALTPNGPEIVSGNLTPNNTTWPLQNQISEVASLANS 324

Query: 567  -QECDILENNHRVSFELRGEDIPTSIVKGTTKGKD-------LATEVALS--FRTQTSVR 418
                +++  +HRVSFEL GED+   +     +  D       + TE + S   R     R
Sbjct: 325  DHGSEVMVADHRVSFELTGEDVARCLASKLNRSHDRMNNNDRIETEESSSTDIRRNIEKR 384

Query: 417  SNDGRDDR-------TTSFGSSKXXXXXXXXDEVGIKELGPQKNWNFFPMLQSGGS 271
            S D  +++       ++S GSSK        DE   K  G   +W+FFP L+SG S
Sbjct: 385  SGDRENEQHRIQKLSSSSIGSSKEFKFDNTKDENIEKVAG--NSWSFFPGLRSGVS 438


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  161 bits (408), Expect = 6e-37
 Identities = 127/421 (30%), Positives = 172/421 (40%), Gaps = 112/421 (26%)
 Frame = -1

Query: 1197 VQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTTLPFX 1018
            VQKRRWG WWSMYWCFG  +H KRIGH + + + T  G       A+ P  T +  LPF 
Sbjct: 37   VQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPR--AENPIQTPSIVLPFV 94

Query: 1017 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH----IFLKGPYADETQLVSPPVFSS 850
                                             P     IF  GPYA ETQLVSPPVFS+
Sbjct: 95   APPSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTSIFAIGPYAHETQLVSPPVFST 154

Query: 849  FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLL--------------------------- 751
            FTT+PS+APFTPPPESV +TTPSSPE PFAQLL                           
Sbjct: 155  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYP 214

Query: 750  ----------SSSLAQKWRNTEVPSPLFDKRAN--VDLRLVEAPEFVGYEHFMNYKWGS- 610
                      SS ++    ++  P   F  R +  ++ R  + P+ +  +      WGS 
Sbjct: 215  GSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSR 274

Query: 609  -NSSALTPNGKEPPSQECDILEN----------------------NHRVSFELRGEDI-- 505
              S ++TP+G +  S +  +L+                       NHRVSFEL  E++  
Sbjct: 275  LGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIR 334

Query: 504  -----PTSIVKGTT---------KGKDLATEVALSFRTQTSVRSNDGRD----------- 400
                 P ++ +  +         + K+  ++V  S        SND  +           
Sbjct: 335  CVEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQL 394

Query: 399  ---DRTTSFGSSK---------------XXXXXXXXDEVGIKELGPQKNWNFFPMLQSGG 274
                R+ + GS K                       ++V  KE GP KNW+FFPM+Q G 
Sbjct: 395  HPKQRSITLGSVKEFNFDNPDGGDSGNSIGSDWWANEKVDAKENGPTKNWSFFPMMQPGV 454

Query: 273  S 271
            S
Sbjct: 455  S 455


>ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
            gi|462415503|gb|EMJ20240.1| hypothetical protein
            PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  160 bits (406), Expect = 1e-36
 Identities = 138/502 (27%), Positives = 183/502 (36%), Gaps = 166/502 (33%)
 Frame = -1

Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105
            MRSV+ S                  QP  V KRRWG  WS+YWCFG +K+ KRIGH + +
Sbjct: 1    MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHKN-KRIGHAVLV 59

Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTL--PFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 931
             +  + G + S       N T++T +  PF                              
Sbjct: 60   PEPVVPGAAVSAI----DNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKS 115

Query: 930  XXXXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPE 772
                 +       IF  GPYA ETQLVSPPVFS+F T+PS+APFTPPPESVQ+TTPSSPE
Sbjct: 116  LSANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPE 175

Query: 771  APFAQLLSSSLAQKWR-------------------------------------NTEVPSP 703
             PFAQLL+SSL +  R                                     N+   SP
Sbjct: 176  VPFAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSP 235

Query: 702  LFDKRANVDLRLVEAPEFVGYEHFMNYKWGS----------------------------- 610
              D+   ++ R+ EAP+  G++HF   KWGS                             
Sbjct: 236  FPDRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNE 295

Query: 609  ---------------------NSSALTPNGKEPPSQECDILEN----------------- 544
                                  S  LTP+G  P S++  +LEN                 
Sbjct: 296  LGSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQT 355

Query: 543  -----NHRVSFELRGEDIPTSIVKGTTKGKDLAT----EVALSFRTQTSVRSNDG----- 406
                 +HRVSFEL GED+   +          A+     +A  + ++    S+D      
Sbjct: 356  VETVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCE 415

Query: 405  -----------------------RDDRTTSFGSSKXXXXXXXXDE--------------- 340
                                   R  R+ + GS+K         E               
Sbjct: 416  FSVEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANK 475

Query: 339  -VGIKELGPQKNWNFFPMLQSG 277
             V  KE  P  +W FFP+LQ G
Sbjct: 476  NVAAKESKPCNDWTFFPILQPG 497


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  160 bits (404), Expect = 2e-36
 Identities = 96/255 (37%), Positives = 127/255 (49%), Gaps = 46/255 (18%)
 Frame = -1

Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030
            QP  VQKRRWG  WS+YWCFGS+K +KRIGH +   +  + G   ++  A+  + ++  T
Sbjct: 40   QPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTS--AENQSQSTAIT 96

Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQLV 871
            +PF                                   +       IF  GPYA ETQLV
Sbjct: 97   VPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLV 156

Query: 870  SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR----------- 724
            +PP FS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SSL +  R           
Sbjct: 157  TPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALS 216

Query: 723  --------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNY 622
                                      N+   SP  D+   ++ R+ EAP+ +G+EHF   
Sbjct: 217  HYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHFTTR 276

Query: 621  KWGS--NSSALTPNG 583
            KWGS   S  +TP+G
Sbjct: 277  KWGSRLGSGTVTPDG 291


>ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Capsella rubella]
            gi|482549191|gb|EOA13385.1| hypothetical protein
            CARUB_v10026425mg [Capsella rubella]
          Length = 437

 Score =  159 bits (403), Expect = 2e-36
 Identities = 140/443 (31%), Positives = 185/443 (41%), Gaps = 105/443 (23%)
 Frame = -1

Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105
            MR+V++S                 VQP +VQKRRW + WS+Y CFGS K++KRIG+ + +
Sbjct: 1    MRNVNNSVETVNAAATAIITAESRVQPSSVQKRRWAKCWSLYSCFGSQKNNKRIGNAVLV 60

Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925
             +   +GV   T   Q    ++T  LPF                                
Sbjct: 61   PEPVASGVPVVT--VQNSATSTTVVLPFIAPPSSPASFLPSDPSSVSHSPVGPLSLTSNT 118

Query: 924  XXPH----IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQ 757
              P     +F  GPYA+ETQ V+PPVFS+F T+PS+AP+TPPPES    TPSSPE PFAQ
Sbjct: 119  FSPKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPES--SVTPSSPEVPFAQ 176

Query: 756  LLSSSLA----------QKW-----------------------------RNTEVPSPLFD 694
            LL+SSL           QK+                              N+   SP   
Sbjct: 177  LLTSSLELTRRDSSGINQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPG 236

Query: 693  KRANVDLRLVEAPEFVGYEHFMNYKWGSN----------------SSALTPNGKE----- 577
            K   V+ R+ E P+F+G+EHF   KWGS                 S ALTPN  E     
Sbjct: 237  KSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGMASGALTPNAPEIISGN 296

Query: 576  -PPSQECDILEN--------------------NHRVSFELRGEDIPTSIVKGTTKGKD-- 466
              PS     L+N                    +HRVSFEL GED+   +     +  D  
Sbjct: 297  LTPSNTTWPLQNQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHDRM 356

Query: 465  -----LATEVAL--------SFRTQTSVRSNDGRDDR-----TTSFGSSKXXXXXXXXDE 340
                 +ATE +         SF+   S  + +    R     ++S GSSK        DE
Sbjct: 357  NNNDRIATEESSSTDRGRRNSFQKIESTENRETEQQRIQKLSSSSIGSSKEFKFDNTKDE 416

Query: 339  VGIKELGPQKNWNFFPMLQSGGS 271
               K  G   +W+FFP L+SG S
Sbjct: 417  NIEKVAG--NSWSFFPGLRSGVS 437


>ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|222858882|gb|EEE96429.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 507

 Score =  159 bits (401), Expect = 4e-36
 Identities = 100/255 (39%), Positives = 128/255 (50%), Gaps = 50/255 (19%)
 Frame = -1

Query: 1200 AVQKRRWGEWWSMYWCFGSY---KHSKRIGHTLAISQETING-VSTSTSYAQKPNPTSTT 1033
            +VQKRRWG  WS+YWCFGS+   K+SKRIGH + + +  + G VS+ST    +  P    
Sbjct: 32   SVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVLVPEPEVPGAVSSSTENQTQSTPI--- 88

Query: 1032 TLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQL 874
             LPF                                   +       IF  GPYA ETQL
Sbjct: 89   LLPFIAPPSSPASFLQSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPYAHETQL 148

Query: 873  VSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR---------- 724
            V+PPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SSL +  R          
Sbjct: 149  VTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGPNQKFSL 208

Query: 723  ---------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMN 625
                                       N+   SP  D+   ++ R+ EAP+ +G+EHF  
Sbjct: 209  SHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLLGFEHFST 268

Query: 624  YKWGS--NSSALTPN 586
             KWGS   S +LTP+
Sbjct: 269  RKWGSRLGSGSLTPD 283


>ref|XP_002867602.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
            subsp. lyrata] gi|297313438|gb|EFH43861.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  155 bits (391), Expect = 6e-35
 Identities = 125/407 (30%), Positives = 168/407 (41%), Gaps = 96/407 (23%)
 Frame = -1

Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030
            QP +V K+ WG WWS+Y CFGS K++KRIGH + + +   +G + +       N TS   
Sbjct: 22   QPSSVHKK-WGSWWSLYLCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSMFM 80

Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSS 850
                                                 P  F  GPYA ETQ V+PPVFS+
Sbjct: 81   PFIAPPSSPASFLPSGPPSVSHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSA 140

Query: 849  FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------------- 721
            FTT+PS+APFTPPPES     PSSPE PFAQLL+SSL +  RN                 
Sbjct: 141  FTTEPSTAPFTPPPES-----PSSPEVPFAQLLTSSLEKARRNIGGGMHHKFSAAHYEFK 195

Query: 720  -----------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS------ 610
                             +   SP   K + ++ R+ E P+F+G+EHF   KWGS      
Sbjct: 196  SHQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGS 255

Query: 609  ----------NSSALTPNGKEP------PSQECDI--LENN---------------HRVS 529
                       S ALTP+G  P       SQ  ++  L N+               HRVS
Sbjct: 256  ITPAGQGSRLGSGALTPDGLTPLEGSLLDSQITEVASLANSDHGSSRHNDEAAVVPHRVS 315

Query: 528  FELRGEDIPTSIVKGTTK--------GKDLATEVALSFRTQTSVRSNDGRDDRTTSFGSS 373
            FEL GED+   +     +        G+ L        +T     S   +  R+ S GSS
Sbjct: 316  FELTGEDVARCLASKLNRSGSHEKASGEHLRPN---GCKTSGETESEQSQKLRSFSTGSS 372

Query: 372  KXXXXXXXXDEV---------------GIKELGPQKNWNFFPMLQSG 277
            K        +E+               G  +  P+ +W FFP+L+SG
Sbjct: 373  KEFKFDNTNEEMIEKVRSEWWANEKVAGKGDHSPRNSWTFFPVLRSG 419


>ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutrema salsugineum]
            gi|557102915|gb|ESQ43278.1| hypothetical protein
            EUTSA_v10013563mg [Eutrema salsugineum]
          Length = 440

 Score =  154 bits (388), Expect = 1e-34
 Identities = 132/442 (29%), Positives = 172/442 (38%), Gaps = 104/442 (23%)
 Frame = -1

Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 1105
            MR+V++S                 VQP +V KRRW   WS+  CFGS K++KRIG+ + +
Sbjct: 1    MRNVNNSVETVNAAATAIVTAESRVQPSSVPKRRWRNCWSLNSCFGSQKNNKRIGNAMLV 60

Query: 1104 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 925
              E +          Q    +S+  LPF                                
Sbjct: 61   VPEPVATGGAPVVTVQNSATSSSIVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNT 120

Query: 924  XXP----HIFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFA 760
                    +F  GPYA+ETQ V+PPVFS+F T+PS+APFTPPPE SV +TTPSSPE PFA
Sbjct: 121  FSTTEPQSVFTIGPYANETQPVTPPVFSAFITEPSTAPFTPPPESSVHITTPSSPEVPFA 180

Query: 759  QLLSSSLAQKWRNTE---------------------------------------VPSPLF 697
            QLL+SSL    RN+                                          SP  
Sbjct: 181  QLLTSSLELTRRNSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYP 240

Query: 696  DKRANVDLRLVEAPEFVGYEHFMNYKWGS------------NSSALTPNGKEPPSQECDI 553
             K   V+ R+ E P+F+G+EHF   KWGS             S ALTPNG    S+    
Sbjct: 241  GKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGALTPNGPGMVSESLTP 300

Query: 552  LENN-----------------------------HRVSFELRGEDIPTSIVKGTTKGKDLA 460
              NN                             HRVSFEL GED+   +     +  D  
Sbjct: 301  NNNNNTTWPLTSQVSEVASLANSDHGSEVVAADHRVSFELTGEDVARCLASKLNRSHDRM 360

Query: 459  T---EVALSFRTQTSVRSNDGRDDR----------------TTSFGSSKXXXXXXXXDEV 337
                 V    R   S +  +   +R                ++S GSSK        +E 
Sbjct: 361  NNDERVETDERRSISFQKRENNVERVSGDREIEQQRIHKLSSSSIGSSKEFKFDNTKEEN 420

Query: 336  GIKELGPQKNWNFFPMLQSGGS 271
              K  G   +W+FFP L+SG S
Sbjct: 421  IEKVAG--NSWSFFPGLRSGVS 440


>ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Capsella rubella]
            gi|482552442|gb|EOA16635.1| hypothetical protein
            CARUB_v10004810mg [Capsella rubella]
          Length = 444

 Score =  154 bits (388), Expect = 1e-34
 Identities = 127/450 (28%), Positives = 176/450 (39%), Gaps = 113/450 (25%)
 Frame = -1

Query: 1284 MRSVHDSXXXXXXXXXXXXXXXXXVQPPA---VQKRRWGEWWSMYWCFGSYKHSKRIGHT 1114
            MRSV++S                 +Q P+   + K++WG WWS+YWCFGS K++KRIGH 
Sbjct: 1    MRSVNNSVDTVTAAASAIVSADSRLQQPSSSLLHKKKWGSWWSLYWCFGSKKNNKRIGHA 60

Query: 1113 LAISQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 934
            +   +   +GV+ +       + +++  +PF                             
Sbjct: 61   VLAPEPAASGVAVAPVQNSSSSNSTSIFMPFIAPPSSPASFLPSGPPSVSHTPDPCRLRC 120

Query: 933  XXXXXP--HIFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFA 760
                      F  GPYA ETQ V+PPVFS+FTT+PS+APFTPPPES     PSSPE PFA
Sbjct: 121  SLLVNEPPSAFAIGPYAHETQPVTPPVFSAFTTEPSTAPFTPPPES-----PSSPEVPFA 175

Query: 759  QLLSSSLAQKWRNTE----------------------------------VPSPLFDKRAN 682
            QLL+SSL +  RN+                                     SP   K + 
Sbjct: 176  QLLTSSLERARRNSSGGMNHKFSAAHYEFKSHQVYPGSPGGNLISPGSGTSSPYPGKCSI 235

Query: 681  VDLRLVEAPEFVGYEHFMNYKWGS----------------NSSALTPNG----------- 583
            ++ R+ E P+F+G+EHF   KWGS                 S ALTP+G           
Sbjct: 236  IEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDGGGGMGSKIASG 295

Query: 582  ---------KEPPSQECDILENN---------------HRVSFELRGEDIPTSIVKGTTK 475
                      +    E   L N+               HRVSFEL GED+   +     +
Sbjct: 296  ALTPLEDSLLDSQVSEVASLANSDHGSSRHNDEAVVVAHRVSFELTGEDVARCLASKLNR 355

Query: 474  --------GKDLATEVALSFRTQTSVRSNDGRDDRTTSFGSSKXXXXXXXXDE------- 340
                    G+ L        +T     S   +  R+ S GSSK        +E       
Sbjct: 356  SGSHERASGEHLRPN---GCKTSGETESEQSQKLRSFSLGSSKEFKFDNTEEETIEKVRS 412

Query: 339  --------VGIKELGPQKNWNFFPMLQSGG 274
                     G  +  P  +W FFP+L+S G
Sbjct: 413  EWWANEKVAGKGDHSPANSWTFFPVLRSSG 442


>ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|26449762|dbj|BAC42004.1| unknown protein
            [Arabidopsis thaliana] gi|28951011|gb|AAO63429.1|
            At4g25620 [Arabidopsis thaliana]
            gi|332659684|gb|AEE85084.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 449

 Score =  149 bits (377), Expect = 2e-33
 Identities = 125/430 (29%), Positives = 168/430 (39%), Gaps = 119/430 (27%)
 Frame = -1

Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030
            QP +VQK+R G WWS+YWCFGS K++KRIGH + + +   +G + +       N TS   
Sbjct: 27   QPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSIFM 85

Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSS 850
                                                 P  F  GPYA ETQ V+PPVFS+
Sbjct: 86   PFIAPPSSPASFLPSGPPSASHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSA 145

Query: 849  FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------------- 721
            FTT+PS+APFTPPPES     PSSPE PFAQLL+SSL +  RN                 
Sbjct: 146  FTTEPSTAPFTPPPES-----PSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFK 200

Query: 720  -----------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS------ 610
                             +   SP   K + ++ R+ E P+F+G+EHF   KWGS      
Sbjct: 201  SCQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGS 260

Query: 609  ----------NSSALTPNGKEPPS-------------------------------QECDI 553
                       S ALTP+G +  S                                E   
Sbjct: 261  ITPAGQGSRLGSGALTPDGSKLTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVAS 320

Query: 552  LENN---------------HRVSFELRGEDIPTSIVKGTTK--------GKDLATEVALS 442
            L N+               HRVSFEL GED+   +     +        G+ L       
Sbjct: 321  LANSDHGSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCC-- 378

Query: 441  FRTQTSVRSNDGRDDRTTSFGSSKXXXXXXXXDEV---------------GIKELGPQKN 307
             +T     S   +  R+ S GS+K        +E+               G  +  P+ +
Sbjct: 379  -KTSGETESEQSQKLRSFSTGSNKEFKFDSTNEEMIEKIRSEWWANEKVAGKGDHSPRNS 437

Query: 306  WNFFPMLQSG 277
            W FFP+L+SG
Sbjct: 438  WTFFPVLRSG 447


>emb|CAA18164.1| putative protein [Arabidopsis thaliana] gi|7269412|emb|CAB81372.1|
            putative protein [Arabidopsis thaliana]
          Length = 424

 Score =  145 bits (367), Expect = 3e-32
 Identities = 121/425 (28%), Positives = 164/425 (38%), Gaps = 119/425 (28%)
 Frame = -1

Query: 1194 QKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTTLPFXX 1015
            QK++ G WWS+YWCFGS K++KRIGH + + +   +G + +       N TS        
Sbjct: 6    QKKKRGSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSIFMPFIAP 65

Query: 1014 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSSFTTQP 835
                                            P  F  GPYA ETQ V+PPVFS+FTT+P
Sbjct: 66   PSSPASFLPSGPPSASHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSAFTTEP 125

Query: 834  SSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN---------------------- 721
            S+APFTPPPES     PSSPE PFAQLL+SSL +  RN                      
Sbjct: 126  STAPFTPPPES-----PSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQVY 180

Query: 720  ------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS----------- 610
                        +   SP   K + ++ R+ E P+F+G+EHF   KWGS           
Sbjct: 181  PGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAG 240

Query: 609  -----NSSALTPNGKEPPS-------------------------------QECDILENN- 541
                  S ALTP+G +  S                                E   L N+ 
Sbjct: 241  QGSRLGSGALTPDGSKLTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVASLANSD 300

Query: 540  --------------HRVSFELRGEDIPTSIVKGTTK--------GKDLATEVALSFRTQT 427
                          HRVSFEL GED+   +     +        G+ L        +T  
Sbjct: 301  HGSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCC---KTSG 357

Query: 426  SVRSNDGRDDRTTSFGSSKXXXXXXXXDEV---------------GIKELGPQKNWNFFP 292
               S   +  R+ S GS+K        +E+               G  +  P+ +W FFP
Sbjct: 358  ETESEQSQKLRSFSTGSNKEFKFDSTNEEMIEKIRSEWWANEKVAGKGDHSPRNSWTFFP 417

Query: 291  MLQSG 277
            +L+SG
Sbjct: 418  VLRSG 422


>ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494240 [Cicer arietinum]
          Length = 492

 Score =  145 bits (366), Expect = 5e-32
 Identities = 92/253 (36%), Positives = 122/253 (48%), Gaps = 44/253 (17%)
 Frame = -1

Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 1030
            QP    K+RWG  +S+  CFGS+K SKRIGH + + +     V  + S    PNP++   
Sbjct: 27   QPSTSPKKRWGSCFSLSSCFGSHKSSKRIGHAVLVPEPVAPIVPVAHS---APNPSTVIV 83

Query: 1029 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH------IFLKGPYADETQLVS 868
            +PF                                          IF  GPYA ETQLVS
Sbjct: 84   MPFIAPPSSPASFLQSDPPSSTHSPAAGLLSPSVNAAYSSSGSASIFTIGPYAYETQLVS 143

Query: 867  PPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------- 721
            PPVFS+FTT+PS+A FTPPPESVQMTTPSSPE PFAQLL+SSL +  +N           
Sbjct: 144  PPVFSNFTTEPSTASFTPPPESVQMTTPSSPEVPFAQLLASSLDRARKNNGSHKFALYNY 203

Query: 720  -------------------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKW 616
                                     +   +P  D+R++++L   E P+ +G+EHF   +W
Sbjct: 204  EFQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSLELSRGETPKILGFEHFSTRRW 263

Query: 615  GS--NSSALTPNG 583
             S   S +LTP+G
Sbjct: 264  NSRIGSGSLTPDG 276


>ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris]
            gi|561016644|gb|ESW15448.1| hypothetical protein
            PHAVU_007G073100g [Phaseolus vulgaris]
          Length = 479

 Score =  144 bits (364), Expect = 8e-32
 Identities = 106/333 (31%), Positives = 139/333 (41%), Gaps = 98/333 (29%)
 Frame = -1

Query: 1209 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQ--ETINGVSTSTSYAQKPNPTST 1036
            QP    K+RWG  WS+YWCFG +K+SKRIG+ + + +  E    + +  + A  PNP++ 
Sbjct: 23   QPATSPKKRWGSCWSLYWCFGPHKNSKRIGNAVLVPEPVEPAGQIGSHLATAA-PNPSTA 81

Query: 1035 TTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-----IFLKGPYADETQLV 871
              +PF                                         IF  GPY  ETQLV
Sbjct: 82   VAMPFIVPPSSPASFLESDSSSATQSPVGLFSLSSLNANASCGPASIFAIGPYTYETQLV 141

Query: 870  SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSS----------------- 742
            SPPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SS                 
Sbjct: 142  SPPVFSNFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRDCKDKGTNQRFALS 201

Query: 741  -----LAQKWRNTEVP---------------SPLFDKRANVDLRLVEAPEFVGYEHFMNY 622
                 L Q++  +  P               +P  D    ++    EA   +G+EHF  +
Sbjct: 202  NYEFQLYQQYPGSPGPQLISPASIISTSGSSTPFPDTHPLLEFHKGEASNLLGFEHFSTH 261

Query: 621  KWG--------------------------------SNSSALTPNGKEPPSQ--------- 565
            KW                                 S+S  LTP G  P ++         
Sbjct: 262  KWNSRLGSGSLTPDSTGQGSGLGSGSLTPNAVKLVSSSGCLTPEGVAPTARNGIYVGKQT 321

Query: 564  -ECDILEN------------NHRVSFELRGEDI 505
             E   L N            +HRVSFEL GED+
Sbjct: 322  SELTPLANSENECQPNAALVDHRVSFELTGEDV 354


Top