BLASTX nr result

ID: Mentha22_contig00026187 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00026187
         (1095 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   197   8e-48
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   183   1e-43
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   174   6e-41
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]     173   1e-40
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   168   3e-39
ref|XP_002865912.1| hydroxyproline-rich glycoprotein family prot...   162   2e-37
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   160   1e-36
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   160   1e-36
ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein...   159   1e-36
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   159   3e-36
ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Caps...   157   7e-36
ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Caps...   152   2e-34
ref|XP_002867602.1| hydroxyproline-rich glycoprotein family prot...   152   2e-34
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   152   3e-34
ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutr...   151   4e-34
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   147   8e-33
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   146   2e-32
ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494...   145   3e-32
ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein...   145   3e-32
ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phas...   144   5e-32

>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
            lycopersicum]
          Length = 470

 Score =  197 bits (500), Expect = 8e-48
 Identities = 138/404 (34%), Positives = 176/404 (43%), Gaps = 118/404 (29%)
 Frame = -2

Query: 863  QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684
            QP  VQKRRWG  WS+YWCFGS+KHSKRIGH + + +    G +   +  + PN ++T  
Sbjct: 26   QPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVT--ENPNHSATIV 83

Query: 683  LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQLV 525
            +PF                                   +       IF  GPYA ETQLV
Sbjct: 84   IPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLV 143

Query: 524  SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR----------- 378
            SPPVFS+FTT+PS+A FTPPPE V MTTP SPE PFAQLL+SSLA+  R           
Sbjct: 144  SPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLS 203

Query: 377  -------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYK 273
                                     N+   SP   K   ++ R  E P+F+GYEHF   K
Sbjct: 204  QYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRK 263

Query: 272  WGS------------------------------NSSALTPNGKEPPSQECDILEN----- 198
            WGS                               S  +TPNG EPPS++  +LEN     
Sbjct: 264  WGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEV 323

Query: 197  -----------------NHRVSFELRGEDIPTSIVK-------GTTKGKDLATEVALSFR 90
                             +HRVSFEL  ED+P+   K         T   D++  +A   R
Sbjct: 324  ASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMDVSNLLASEMR 383

Query: 89   TQTSV-----------RSNDGRDD-----RTTSFGSSKDFDFNN 6
            + +S+            S  G D+     R  +FGSSKDFDF+N
Sbjct: 384  SGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDN 427


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  183 bits (465), Expect = 1e-43
 Identities = 122/331 (36%), Positives = 158/331 (47%), Gaps = 65/331 (19%)
 Frame = -2

Query: 938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759
           MRSV++S                 VQP  VQKRRWG   S+YWCFGS++HSKRIGH + +
Sbjct: 1   MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60

Query: 758 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579
            +  + G     S  +  N +++  LPF                                
Sbjct: 61  PEPMVPGAVAPAS--ENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALS 118

Query: 578 XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 420
              +       +F  GPYA ETQLVSPPVFS+F T+PS+APFTPPPESVQ+TTPSSPE P
Sbjct: 119 VNAYSPSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVP 178

Query: 419 FAQLLSSSLAQKWRNT--------------------EVP--------------SPLFDKR 342
           FAQLL+SSL +  RN+                    E P              SP  D+R
Sbjct: 179 FAQLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISPISNSGTSSPFPDRR 238

Query: 341 ANVDLRLVEAPEFVGYEHFMNYKWGS--NSSALTPNGKEPPSQECDILEN---------- 198
                 +VEAP+ +G+EHF   +WGS   S +LTP+G  P S++  +LEN          
Sbjct: 239 P-----IVEAPKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLAN 293

Query: 197 ------------NHRVSFELRGEDIPTSIVK 141
                       +HRVSFEL GED+   + K
Sbjct: 294 SESGSQNGETVIDHRVSFELAGEDVAVCVEK 324


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  174 bits (441), Expect = 6e-41
 Identities = 137/444 (30%), Positives = 183/444 (41%), Gaps = 133/444 (29%)
 Frame = -2

Query: 938  MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759
            MRSV+DS                 VQP  VQK+RWG  W +YWCFGS K+SKRIGH + +
Sbjct: 1    MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60

Query: 758  SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579
             +  + G S ST+     NPT    LPF                                
Sbjct: 61   PEPVVPGASVSTA-ENVSNPTGII-LPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 118

Query: 578  XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 420
               +       IF  GPYA ETQLV+PPVFS+ TT+PS+APFTPPPESVQ+TTPSSPE P
Sbjct: 119  VNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVP 178

Query: 419  FAQLLSSSLAQKWRNTEV-------------------------------------PSPLF 351
            FAQLL+SSL +  RN+ +                                      SP  
Sbjct: 179  FAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFP 238

Query: 350  DKRANVDLRLVEAPEFVGYEHFMNYKWGS------------------------------- 264
            D+R  ++ R+ EAP+ +G+E+F   KWGS                               
Sbjct: 239  DRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLG 298

Query: 263  ---NSSALTPNGKEPPSQ----------ECDILEN------------NHRVSFELRGEDI 159
                S +LTP+G  P S+          E  +L N            +HRVSFEL GED+
Sbjct: 299  SRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDV 358

Query: 158  ------------------PTSIV-------KGTTKGKDLATEVALSFRTQTSVRSNDGRD 54
                              P  +V        G  K  + + E+ +   +  +V    G  
Sbjct: 359  APCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEA 418

Query: 53   D--------RTTSFGSSKDFDFNN 6
            +        R+ + GS K+F+F+N
Sbjct: 419  EEEHSYQKHRSVTLGSIKEFNFDN 442


>gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]
          Length = 521

 Score =  173 bits (438), Expect = 1e-40
 Identities = 106/281 (37%), Positives = 134/281 (47%), Gaps = 47/281 (16%)
 Frame = -2

Query: 938 MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759
           MR+V++S                  QP AV KRRWG  WS+YWCFGS+K+SKRIGH + +
Sbjct: 1   MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFGSHKNSKRIGHAVLV 60

Query: 758 SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579
            +  + G +      Q P+  +   LPF                                
Sbjct: 61  PEPVLPGAAAPAPENQAPS--TAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 118

Query: 578 XXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAP 420
              +       IF  GPYA ETQLVSPPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE P
Sbjct: 119 INAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 178

Query: 419 FAQLLSSSLAQKWRNTE--------------------------------------VPSPL 354
           FAQLL+SSL +  RN+                                         SP 
Sbjct: 179 FAQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPF 238

Query: 353 FDKRANVDLRLVEAPEFVGYEHFMNYKWGS--NSSALTPNG 237
            DK   +  R+ EAP  +G+EHF  +KWGS   S +LTP+G
Sbjct: 239 PDKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDG 279


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  168 bits (426), Expect = 3e-39
 Identities = 137/448 (30%), Positives = 183/448 (40%), Gaps = 137/448 (30%)
 Frame = -2

Query: 938  MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQ----KRRWGEWWSMYWCFGSYKHSKRIGH 771
            MRSV+DS                 VQP  VQ    K+RWG  W +YWCFGS K+SKRIGH
Sbjct: 1    MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60

Query: 770  TLAISQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXX 591
             + + +  + G S ST+     NPT    LPF                            
Sbjct: 61   AVLVPEPVVPGASVSTA-ENVSNPTGII-LPFIAPPSSPASFLQSDPPSATQSPAGLLSL 118

Query: 590  XXXXXXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSS 432
                   +       IF  GPYA ETQLV+PPVFS+ TT+PS+APFTPPPESVQ+TTPSS
Sbjct: 119  TSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSS 178

Query: 431  PEAPFAQLLSSSLAQKWRNTEV-------------------------------------P 363
            PE PFAQLL+SSL +  RN+ +                                      
Sbjct: 179  PEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTS 238

Query: 362  SPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS--------------------------- 264
            SP  D+R  ++ R+ EAP+ +G+E+F   KWGS                           
Sbjct: 239  SPFPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDG 298

Query: 263  -------NSSALTPNGKEPPSQ----------ECDILEN------------NHRVSFELR 171
                    S +LTP+G  P S+          E  +L N            +HRVSFEL 
Sbjct: 299  MGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELS 358

Query: 170  GEDI------------------PTSIV-------KGTTKGKDLATEVALSFRTQTSVRSN 66
            GED+                  P  +V        G  K  + + E+ +   +  +V   
Sbjct: 359  GEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKA 418

Query: 65   DGRDD--------RTTSFGSSKDFDFNN 6
             G  +        R+ + GS K+F+F+N
Sbjct: 419  SGEAEEEHSYQKHRSVTLGSIKEFNFDN 446


>ref|XP_002865912.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
            subsp. lyrata] gi|297311747|gb|EFH42171.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 437

 Score =  162 bits (411), Expect = 2e-37
 Identities = 130/414 (31%), Positives = 176/414 (42%), Gaps = 103/414 (24%)
 Frame = -2

Query: 938  MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759
            MR+V++S                 VQP +VQK RWG+ WS+Y CFG+ K++KRIG+ + +
Sbjct: 1    MRNVNNSVETVNAAATAIVTAESRVQPSSVQKGRWGKCWSLYSCFGTQKNNKRIGNAVLV 60

Query: 758  SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579
             +   +GV   T   Q    ++T  LPF                                
Sbjct: 61   PEPVASGVPVVT--VQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPGGQLSLTSNT 118

Query: 578  XXPH----IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFA 414
              P     +F  GPYA+ETQ V+PPVFS+F T+PS+AP+TPPPE SV +TTPSSPE PFA
Sbjct: 119  FSPKEPQSVFTVGPYANETQPVTPPVFSAFVTEPSTAPYTPPPESSVHITTPSSPEVPFA 178

Query: 413  QLLSSSLAQKWRNTE----------------------------------------VPSPL 354
            QLL+SSL    RN+                                           SP 
Sbjct: 179  QLLTSSLELTRRNSSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 238

Query: 353  FDKRANVDLRLVEAPEFVGYEHFMNYKWGSN----------------SSALTPNGKE--- 231
              K   V+ R+ E P+F+G+EHF   KWGS                 S ALTPNG E   
Sbjct: 239  PGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASGALTPNGLEIIS 298

Query: 230  ---PPSQE--------------------CDILENNHRVSFELRGEDIPTSIVKGTTKGKD 120
                PS                       +++  +HRVSFEL GED+   +     +  D
Sbjct: 299  GNLTPSNTTWPLHNQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHD 358

Query: 119  -------LATEVALS--FRTQTSVRSNDGRDDR-------TTSFGSSKDFDFNN 6
                   + TE + S   R     RS D   ++       ++S GSSK+F F+N
Sbjct: 359  RMNNNDRIETEESSSTDLRRNMEKRSADRETEQQRIQKLNSSSIGSSKEFKFDN 412


>ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
            gi|462415503|gb|EMJ20240.1| hypothetical protein
            PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  160 bits (404), Expect = 1e-36
 Identities = 130/461 (28%), Positives = 175/461 (37%), Gaps = 150/461 (32%)
 Frame = -2

Query: 938  MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759
            MRSV+ S                  QP  V KRRWG  WS+YWCFG +K+ KRIGH + +
Sbjct: 1    MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHKN-KRIGHAVLV 59

Query: 758  SQETINGVSTSTSYAQKPNPTSTTTL--PFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 585
             +  + G + S       N T++T +  PF                              
Sbjct: 60   PEPVVPGAAVSAI----DNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKS 115

Query: 584  XXXXPH-------IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPE 426
                 +       IF  GPYA ETQLVSPPVFS+F T+PS+APFTPPPESVQ+TTPSSPE
Sbjct: 116  LSANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPE 175

Query: 425  APFAQLLSSSLAQKWR-------------------------------------NTEVPSP 357
             PFAQLL+SSL +  R                                     N+   SP
Sbjct: 176  VPFAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSP 235

Query: 356  LFDKRANVDLRLVEAPEFVGYEHFMNYKWGS----------------------------- 264
              D+   ++ R+ EAP+  G++HF   KWGS                             
Sbjct: 236  FPDRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNE 295

Query: 263  ---------------------NSSALTPNGKEPPSQECDILEN----------------- 198
                                  S  LTP+G  P S++  +LEN                 
Sbjct: 296  LGSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQT 355

Query: 197  -----NHRVSFELRGEDIPTSIVKGTTKGKDLAT----EVALSFRTQTSVRSNDG----- 60
                 +HRVSFEL GED+   +          A+     +A  + ++    S+D      
Sbjct: 356  VETVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCE 415

Query: 59   -----------------------RDDRTTSFGSSKDFDFNN 6
                                   R  R+ + GS+KDF+F+N
Sbjct: 416  FSVEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDN 456


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
           gi|223547583|gb|EEF49078.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 510

 Score =  160 bits (404), Expect = 1e-36
 Identities = 96/255 (37%), Positives = 127/255 (49%), Gaps = 46/255 (18%)
 Frame = -2

Query: 863 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684
           QP  VQKRRWG  WS+YWCFGS+K +KRIGH +   +  + G   ++  A+  + ++  T
Sbjct: 40  QPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTS--AENQSQSTAIT 96

Query: 683 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQLV 525
           +PF                                   +       IF  GPYA ETQLV
Sbjct: 97  VPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLV 156

Query: 524 SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR----------- 378
           +PP FS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SSL +  R           
Sbjct: 157 TPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALS 216

Query: 377 --------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNY 276
                                     N+   SP  D+   ++ R+ EAP+ +G+EHF   
Sbjct: 217 HYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHFTTR 276

Query: 275 KWGS--NSSALTPNG 237
           KWGS   S  +TP+G
Sbjct: 277 KWGSRLGSGTVTPDG 291


>ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|10177409|dbj|BAB10540.1| unnamed protein
            product [Arabidopsis thaliana] gi|40823427|gb|AAR92282.1|
            At5g52430 [Arabidopsis thaliana]
            gi|56381929|gb|AAV85683.1| At5g52430 [Arabidopsis
            thaliana] gi|110738650|dbj|BAF01250.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332008830|gb|AED96213.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 438

 Score =  159 bits (403), Expect = 1e-36
 Identities = 124/389 (31%), Positives = 169/389 (43%), Gaps = 103/389 (26%)
 Frame = -2

Query: 863  QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684
            QP + QK RWG+ WS+Y CFG+ K++KRIG+ + + +   +GV   T   Q    ++T  
Sbjct: 27   QPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTSGVPVVT--VQNSATSTTVV 84

Query: 683  LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH----IFLKGPYADETQLVSPP 516
            LPF                                  P     +F  GPYA+ETQ V+PP
Sbjct: 85   LPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQSVFTVGPYANETQPVTPP 144

Query: 515  VFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFAQLLSSSLA-----------QKW--- 381
            VFS+F T+PS+AP+TPPPE SV +TTPSSPE PFAQLL+SSL            QK+   
Sbjct: 145  VFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQKFSSS 204

Query: 380  --------------------------RNTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMN 279
                                       N+   SP   K   V+ R+ E P+F+G+EHF  
Sbjct: 205  HYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLGFEHFTA 264

Query: 278  YKWGSN----------------SSALTPNGKEPPS------------------------- 222
             KWGS                 S ALTPNG E  S                         
Sbjct: 265  RKWGSRFGSGSITPVGHGSGLASGALTPNGPEIVSGNLTPNNTTWPLQNQISEVASLANS 324

Query: 221  -QECDILENNHRVSFELRGEDIPTSIVKGTTKGKD-------LATEVALS--FRTQTSVR 72
                +++  +HRVSFEL GED+   +     +  D       + TE + S   R     R
Sbjct: 325  DHGSEVMVADHRVSFELTGEDVARCLASKLNRSHDRMNNNDRIETEESSSTDIRRNIEKR 384

Query: 71   SNDGRDDR-------TTSFGSSKDFDFNN 6
            S D  +++       ++S GSSK+F F+N
Sbjct: 385  SGDRENEQHRIQKLSSSSIGSSKEFKFDN 413


>ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
           gi|222858882|gb|EEE96429.1| hydroxyproline-rich
           glycoprotein [Populus trichocarpa]
          Length = 507

 Score =  159 bits (401), Expect = 3e-36
 Identities = 100/255 (39%), Positives = 128/255 (50%), Gaps = 50/255 (19%)
 Frame = -2

Query: 854 AVQKRRWGEWWSMYWCFGSY---KHSKRIGHTLAISQETING-VSTSTSYAQKPNPTSTT 687
           +VQKRRWG  WS+YWCFGS+   K+SKRIGH + + +  + G VS+ST    +  P    
Sbjct: 32  SVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVLVPEPEVPGAVSSSTENQTQSTPI--- 88

Query: 686 TLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-------IFLKGPYADETQL 528
            LPF                                   +       IF  GPYA ETQL
Sbjct: 89  LLPFIAPPSSPASFLQSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPYAHETQL 148

Query: 527 VSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWR---------- 378
           V+PPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SSL +  R          
Sbjct: 149 VTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGPNQKFSL 208

Query: 377 ---------------------------NTEVPSPLFDKRANVDLRLVEAPEFVGYEHFMN 279
                                      N+   SP  D+   ++ R+ EAP+ +G+EHF  
Sbjct: 209 SHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLLGFEHFST 268

Query: 278 YKWGS--NSSALTPN 240
            KWGS   S +LTP+
Sbjct: 269 RKWGSRLGSGSLTPD 283


>ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Capsella rubella]
            gi|482549191|gb|EOA13385.1| hypothetical protein
            CARUB_v10026425mg [Capsella rubella]
          Length = 437

 Score =  157 bits (397), Expect = 7e-36
 Identities = 131/416 (31%), Positives = 175/416 (42%), Gaps = 105/416 (25%)
 Frame = -2

Query: 938  MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759
            MR+V++S                 VQP +VQKRRW + WS+Y CFGS K++KRIG+ + +
Sbjct: 1    MRNVNNSVETVNAAATAIITAESRVQPSSVQKRRWAKCWSLYSCFGSQKNNKRIGNAVLV 60

Query: 758  SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579
             +   +GV   T   Q    ++T  LPF                                
Sbjct: 61   PEPVASGVPVVT--VQNSATSTTVVLPFIAPPSSPASFLPSDPSSVSHSPVGPLSLTSNT 118

Query: 578  XXPH----IFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQ 411
              P     +F  GPYA+ETQ V+PPVFS+F T+PS+AP+TPPPES    TPSSPE PFAQ
Sbjct: 119  FSPKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPES--SVTPSSPEVPFAQ 176

Query: 410  LLSSSLA----------QKW-----------------------------RNTEVPSPLFD 348
            LL+SSL           QK+                              N+   SP   
Sbjct: 177  LLTSSLELTRRDSSGINQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPG 236

Query: 347  KRANVDLRLVEAPEFVGYEHFMNYKWGSN----------------SSALTPNGKE----- 231
            K   V+ R+ E P+F+G+EHF   KWGS                 S ALTPN  E     
Sbjct: 237  KSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGMASGALTPNAPEIISGN 296

Query: 230  -PPSQECDILEN--------------------NHRVSFELRGEDIPTSIVKGTTKGKD-- 120
              PS     L+N                    +HRVSFEL GED+   +     +  D  
Sbjct: 297  LTPSNTTWPLQNQISEVASLANSDHGSEVIVADHRVSFELTGEDVARCLASKLNRSHDRM 356

Query: 119  -----LATEVAL--------SFRTQTSVRSNDGRDDR-----TTSFGSSKDFDFNN 6
                 +ATE +         SF+   S  + +    R     ++S GSSK+F F+N
Sbjct: 357  NNNDRIATEESSSTDRGRRNSFQKIESTENRETEQQRIQKLSSSSIGSSKEFKFDN 412


>ref|XP_006283737.1| hypothetical protein CARUB_v10004810mg [Capsella rubella]
            gi|482552442|gb|EOA16635.1| hypothetical protein
            CARUB_v10004810mg [Capsella rubella]
          Length = 444

 Score =  152 bits (385), Expect = 2e-34
 Identities = 120/409 (29%), Positives = 166/409 (40%), Gaps = 98/409 (23%)
 Frame = -2

Query: 938  MRSVHDSXXXXXXXXXXXXXXXXXVQPPA---VQKRRWGEWWSMYWCFGSYKHSKRIGHT 768
            MRSV++S                 +Q P+   + K++WG WWS+YWCFGS K++KRIGH 
Sbjct: 1    MRSVNNSVDTVTAAASAIVSADSRLQQPSSSLLHKKKWGSWWSLYWCFGSKKNNKRIGHA 60

Query: 767  LAISQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 588
            +   +   +GV+ +       + +++  +PF                             
Sbjct: 61   VLAPEPAASGVAVAPVQNSSSSNSTSIFMPFIAPPSSPASFLPSGPPSVSHTPDPCRLRC 120

Query: 587  XXXXXP--HIFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFA 414
                      F  GPYA ETQ V+PPVFS+FTT+PS+APFTPPPES     PSSPE PFA
Sbjct: 121  SLLVNEPPSAFAIGPYAHETQPVTPPVFSAFTTEPSTAPFTPPPES-----PSSPEVPFA 175

Query: 413  QLLSSSLAQKWRNTE----------------------------------VPSPLFDKRAN 336
            QLL+SSL +  RN+                                     SP   K + 
Sbjct: 176  QLLTSSLERARRNSSGGMNHKFSAAHYEFKSHQVYPGSPGGNLISPGSGTSSPYPGKCSI 235

Query: 335  VDLRLVEAPEFVGYEHFMNYKWGS----------------NSSALTPNG----------- 237
            ++ R+ E P+F+G+EHF   KWGS                 S ALTP+G           
Sbjct: 236  IEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDGGGGMGSKIASG 295

Query: 236  ---------KEPPSQECDILENN---------------HRVSFELRGEDIPTSIVKGTTK 129
                      +    E   L N+               HRVSFEL GED+   +     +
Sbjct: 296  ALTPLEDSLLDSQVSEVASLANSDHGSSRHNDEAVVVAHRVSFELTGEDVARCLASKLNR 355

Query: 128  --------GKDLATEVALSFRTQTSVRSNDGRDDRTTSFGSSKDFDFNN 6
                    G+ L        +T     S   +  R+ S GSSK+F F+N
Sbjct: 356  SGSHERASGEHLRPN---GCKTSGETESEQSQKLRSFSLGSSKEFKFDN 401


>ref|XP_002867602.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
            subsp. lyrata] gi|297313438|gb|EFH43861.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  152 bits (385), Expect = 2e-34
 Identities = 118/367 (32%), Positives = 156/367 (42%), Gaps = 81/367 (22%)
 Frame = -2

Query: 863  QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684
            QP +V K+ WG WWS+Y CFGS K++KRIGH + + +   +G + +       N TS   
Sbjct: 22   QPSSVHKK-WGSWWSLYLCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSMFM 80

Query: 683  LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSS 504
                                                 P  F  GPYA ETQ V+PPVFS+
Sbjct: 81   PFIAPPSSPASFLPSGPPSVSHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSA 140

Query: 503  FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------------- 375
            FTT+PS+APFTPPPES     PSSPE PFAQLL+SSL +  RN                 
Sbjct: 141  FTTEPSTAPFTPPPES-----PSSPEVPFAQLLTSSLEKARRNIGGGMHHKFSAAHYEFK 195

Query: 374  -----------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS------ 264
                             +   SP   K + ++ R+ E P+F+G+EHF   KWGS      
Sbjct: 196  SHQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGS 255

Query: 263  ----------NSSALTPNGKEP------PSQECDI--LENN---------------HRVS 183
                       S ALTP+G  P       SQ  ++  L N+               HRVS
Sbjct: 256  ITPAGQGSRLGSGALTPDGLTPLEGSLLDSQITEVASLANSDHGSSRHNDEAAVVPHRVS 315

Query: 182  FELRGEDIPTSIVKGTTK--------GKDLATEVALSFRTQTSVRSNDGRDDRTTSFGSS 27
            FEL GED+   +     +        G+ L        +T     S   +  R+ S GSS
Sbjct: 316  FELTGEDVARCLASKLNRSGSHEKASGEHLRPN---GCKTSGETESEQSQKLRSFSTGSS 372

Query: 26   KDFDFNN 6
            K+F F+N
Sbjct: 373  KEFKFDN 379


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  152 bits (383), Expect = 3e-34
 Identities = 115/379 (30%), Positives = 159/379 (41%), Gaps = 97/379 (25%)
 Frame = -2

Query: 851  VQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTTLPFX 672
            VQKRRWG WWSMYWCFG  +H KRIGH + + + T  G       A+ P  T +  LPF 
Sbjct: 37   VQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPR--AENPIQTPSIVLPFV 94

Query: 671  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH----IFLKGPYADETQLVSPPVFSS 504
                                             P     IF  GPYA ETQLVSPPVFS+
Sbjct: 95   APPSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTSIFAIGPYAHETQLVSPPVFST 154

Query: 503  FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLL--------------------------- 405
            FTT+PS+APFTPPPESV +TTPSSPE PFAQLL                           
Sbjct: 155  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYP 214

Query: 404  ----------SSSLAQKWRNTEVPSPLFDKRAN--VDLRLVEAPEFVGYEHFMNYKWGS- 264
                      SS ++    ++  P   F  R +  ++ R  + P+ +  +      WGS 
Sbjct: 215  GSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSR 274

Query: 263  -NSSALTPNGKEPPSQECDILEN----------------------NHRVSFELRGEDI-- 159
              S ++TP+G +  S +  +L+                       NHRVSFEL  E++  
Sbjct: 275  LGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIR 334

Query: 158  -----PTSIVKGTT---------KGKDLATEVALSFRTQTSVRSNDGRD----------- 54
                 P ++ +  +         + K+  ++V  S        SND  +           
Sbjct: 335  CVEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEKAVADGEEAQL 394

Query: 53   ---DRTTSFGSSKDFDFNN 6
                R+ + GS K+F+F+N
Sbjct: 395  HPKQRSITLGSVKEFNFDN 413


>ref|XP_006401825.1| hypothetical protein EUTSA_v10013563mg [Eutrema salsugineum]
            gi|557102915|gb|ESQ43278.1| hypothetical protein
            EUTSA_v10013563mg [Eutrema salsugineum]
          Length = 440

 Score =  151 bits (382), Expect = 4e-34
 Identities = 124/415 (29%), Positives = 162/415 (39%), Gaps = 104/415 (25%)
 Frame = -2

Query: 938  MRSVHDSXXXXXXXXXXXXXXXXXVQPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAI 759
            MR+V++S                 VQP +V KRRW   WS+  CFGS K++KRIG+ + +
Sbjct: 1    MRNVNNSVETVNAAATAIVTAESRVQPSSVPKRRWRNCWSLNSCFGSQKNNKRIGNAMLV 60

Query: 758  SQETINGVSTSTSYAQKPNPTSTTTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579
              E +          Q    +S+  LPF                                
Sbjct: 61   VPEPVATGGAPVVTVQNSATSSSIVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNT 120

Query: 578  XXP----HIFLKGPYADETQLVSPPVFSSFTTQPSSAPFTPPPE-SVQMTTPSSPEAPFA 414
                    +F  GPYA+ETQ V+PPVFS+F T+PS+APFTPPPE SV +TTPSSPE PFA
Sbjct: 121  FSTTEPQSVFTIGPYANETQPVTPPVFSAFITEPSTAPFTPPPESSVHITTPSSPEVPFA 180

Query: 413  QLLSSSLAQKWRNTE---------------------------------------VPSPLF 351
            QLL+SSL    RN+                                          SP  
Sbjct: 181  QLLTSSLELTRRNSSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYP 240

Query: 350  DKRANVDLRLVEAPEFVGYEHFMNYKWGS------------NSSALTPNGKEPPSQECDI 207
             K   V+ R+ E P+F+G+EHF   KWGS             S ALTPNG    S+    
Sbjct: 241  GKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGALTPNGPGMVSESLTP 300

Query: 206  LENN-----------------------------HRVSFELRGEDIPTSIVKGTTKGKDLA 114
              NN                             HRVSFEL GED+   +     +  D  
Sbjct: 301  NNNNNTTWPLTSQVSEVASLANSDHGSEVVAADHRVSFELTGEDVARCLASKLNRSHDRM 360

Query: 113  T---EVALSFRTQTSVRSNDGRDDR----------------TTSFGSSKDFDFNN 6
                 V    R   S +  +   +R                ++S GSSK+F F+N
Sbjct: 361  NNDERVETDERRSISFQKRENNVERVSGDREIEQQRIHKLSSSSIGSSKEFKFDN 415


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  147 bits (371), Expect = 8e-33
 Identities = 113/373 (30%), Positives = 153/373 (41%), Gaps = 90/373 (24%)
 Frame = -2

Query: 854  AVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVS--TSTSYAQKPNPTSTTTL 681
            ++QKRRWG  WSMYWCFGS K +KRIGH + I + T +G    +S + +Q P+       
Sbjct: 36   SIQKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPETTASGADRPSSNTSSQAPSIVLPFIA 95

Query: 680  PFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSSF 501
            P                                     IF  GPYA ETQLVSPPVFS+F
Sbjct: 96   PPSSPASFLPSEPPSATHSPVGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAF 155

Query: 500  TTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRNTEVP-------------- 363
            TT+PS+APFTPPPESV +TTPSSPE PFA+LL  +          P              
Sbjct: 156  TTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPG 215

Query: 362  -------------------SPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS--NSSALT 246
                               SP  D+           P+F+  E    ++WGS   S  LT
Sbjct: 216  SPVSNLISPGSAISVSGTSSPFLDREYTPG-----RPQFLNLEKIAPHEWGSRQGSGTLT 270

Query: 245  PNGKEPPSQE----------------------CDILENNHRVSFELRGEDI-------PT 153
            P    P   +                       D+   +HRVSFE+  ED+       PT
Sbjct: 271  PEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPT 330

Query: 152  SIV-----------KGTTKGKDLAT----------EVALSFRTQTSVRSNDG---RDDRT 45
             ++           + T + ++LA           E +      +S    DG   +  R+
Sbjct: 331  MMMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHEGSSTDGEDGQRQQKHRS 390

Query: 44   TSFGSSKDFDFNN 6
             + GSSK+F+F+N
Sbjct: 391  ITLGSSKEFNFDN 403


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  146 bits (368), Expect = 2e-32
 Identities = 111/368 (30%), Positives = 154/368 (41%), Gaps = 85/368 (23%)
 Frame = -2

Query: 854  AVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVS--TSTSYAQKPNPTSTTTL 681
            ++QKRRWG  WSMYWCFGS K +KRIGH + I + T +     +S + +Q P+       
Sbjct: 36   SIQKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADRPSSNTSSQAPSIVLPFIA 95

Query: 680  PFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSSF 501
            P                                     IF  GPYA ETQLVSPPVFS+F
Sbjct: 96   PPSSPASFLPSEPPSATHSPVGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAF 155

Query: 500  TTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRNTEVP-------------- 363
            TT+PS+APFTPPPESV +TTPSSPE PFA+LL  +          P              
Sbjct: 156  TTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPG 215

Query: 362  ---SPLFDKRANVDLRLVEA-----------PEFVGYEHFMNYKWGS--NSSALTPNGKE 231
               S L    + + +    +           P+F+  E    ++WGS   S  LTP    
Sbjct: 216  SPVSNLISPGSAISVSGTSSPFLEREYTPGRPQFLNLEKIAPHEWGSRQGSGTLTPEAVN 275

Query: 230  PPSQEC----------------------DILENNHRVSFELRGEDI-------PTSIV-- 144
            P   +                       D+   +HRVSFE+  ED+       PT ++  
Sbjct: 276  PKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRT 335

Query: 143  ---------KGTTKGKDLAT----------EVALSFRTQTSVRSNDG---RDDRTTSFGS 30
                     + T + ++LA           E +      +S    DG   +  R+ + GS
Sbjct: 336  GSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQQKHRSITLGS 395

Query: 29   SKDFDFNN 6
            SK+F+F+N
Sbjct: 396  SKEFNFDN 403


>ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494240 [Cicer arietinum]
          Length = 492

 Score =  145 bits (366), Expect = 3e-32
 Identities = 92/253 (36%), Positives = 122/253 (48%), Gaps = 44/253 (17%)
 Frame = -2

Query: 863 QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684
           QP    K+RWG  +S+  CFGS+K SKRIGH + + +     V  + S    PNP++   
Sbjct: 27  QPSTSPKKRWGSCFSLSSCFGSHKSSKRIGHAVLVPEPVAPIVPVAHS---APNPSTVIV 83

Query: 683 LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH------IFLKGPYADETQLVS 522
           +PF                                          IF  GPYA ETQLVS
Sbjct: 84  MPFIAPPSSPASFLQSDPPSSTHSPAAGLLSPSVNAAYSSSGSASIFTIGPYAYETQLVS 143

Query: 521 PPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------- 375
           PPVFS+FTT+PS+A FTPPPESVQMTTPSSPE PFAQLL+SSL +  +N           
Sbjct: 144 PPVFSNFTTEPSTASFTPPPESVQMTTPSSPEVPFAQLLASSLDRARKNNGSHKFALYNY 203

Query: 374 -------------------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKW 270
                                    +   +P  D+R++++L   E P+ +G+EHF   +W
Sbjct: 204 EFQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSLELSRGETPKILGFEHFSTRRW 263

Query: 269 GS--NSSALTPNG 237
            S   S +LTP+G
Sbjct: 264 NSRIGSGSLTPDG 276


>ref|NP_194292.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|26449762|dbj|BAC42004.1| unknown protein
            [Arabidopsis thaliana] gi|28951011|gb|AAO63429.1|
            At4g25620 [Arabidopsis thaliana]
            gi|332659684|gb|AEE85084.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 449

 Score =  145 bits (366), Expect = 3e-32
 Identities = 117/390 (30%), Positives = 156/390 (40%), Gaps = 104/390 (26%)
 Frame = -2

Query: 863  QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQETINGVSTSTSYAQKPNPTSTTT 684
            QP +VQK+R G WWS+YWCFGS K++KRIGH + + +   +G + +       N TS   
Sbjct: 27   QPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAPVQNSSSNSTSIFM 85

Query: 683  LPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPHIFLKGPYADETQLVSPPVFSS 504
                                                 P  F  GPYA ETQ V+PPVFS+
Sbjct: 86   PFIAPPSSPASFLPSGPPSASHTPDPGLLCSLTVNEPPSAFTIGPYAHETQPVTPPVFSA 145

Query: 503  FTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSSLAQKWRN----------------- 375
            FTT+PS+APFTPPPES     PSSPE PFAQLL+SSL +  RN                 
Sbjct: 146  FTTEPSTAPFTPPPES-----PSSPEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFK 200

Query: 374  -----------------TEVPSPLFDKRANVDLRLVEAPEFVGYEHFMNYKWGS------ 264
                             +   SP   K + ++ R+ E P+F+G+EHF   KWGS      
Sbjct: 201  SCQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGS 260

Query: 263  ----------NSSALTPNGKEPPS-------------------------------QECDI 207
                       S ALTP+G +  S                                E   
Sbjct: 261  ITPAGQGSRLGSGALTPDGSKLTSGVVTPNGAETVIRMSYGNLTPLEGSLLDSQISEVAS 320

Query: 206  LENN---------------HRVSFELRGEDIPTSIVKGTTK--------GKDLATEVALS 96
            L N+               HRVSFEL GED+   +     +        G+ L       
Sbjct: 321  LANSDHGSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHLRPNCC-- 378

Query: 95   FRTQTSVRSNDGRDDRTTSFGSSKDFDFNN 6
             +T     S   +  R+ S GS+K+F F++
Sbjct: 379  -KTSGETESEQSQKLRSFSTGSNKEFKFDS 407


>ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris]
            gi|561016644|gb|ESW15448.1| hypothetical protein
            PHAVU_007G073100g [Phaseolus vulgaris]
          Length = 479

 Score =  144 bits (364), Expect = 5e-32
 Identities = 106/333 (31%), Positives = 139/333 (41%), Gaps = 98/333 (29%)
 Frame = -2

Query: 863  QPPAVQKRRWGEWWSMYWCFGSYKHSKRIGHTLAISQ--ETINGVSTSTSYAQKPNPTST 690
            QP    K+RWG  WS+YWCFG +K+SKRIG+ + + +  E    + +  + A  PNP++ 
Sbjct: 23   QPATSPKKRWGSCWSLYWCFGPHKNSKRIGNAVLVPEPVEPAGQIGSHLATAA-PNPSTA 81

Query: 689  TTLPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPH-----IFLKGPYADETQLV 525
              +PF                                         IF  GPY  ETQLV
Sbjct: 82   VAMPFIVPPSSPASFLESDSSSATQSPVGLFSLSSLNANASCGPASIFAIGPYTYETQLV 141

Query: 524  SPPVFSSFTTQPSSAPFTPPPESVQMTTPSSPEAPFAQLLSSS----------------- 396
            SPPVFS+FTT+PS+APFTPPPESVQ+TTPSSPE PFAQLL+SS                 
Sbjct: 142  SPPVFSNFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRDCKDKGTNQRFALS 201

Query: 395  -----LAQKWRNTEVP---------------SPLFDKRANVDLRLVEAPEFVGYEHFMNY 276
                 L Q++  +  P               +P  D    ++    EA   +G+EHF  +
Sbjct: 202  NYEFQLYQQYPGSPGPQLISPASIISTSGSSTPFPDTHPLLEFHKGEASNLLGFEHFSTH 261

Query: 275  KWG--------------------------------SNSSALTPNGKEPPSQ--------- 219
            KW                                 S+S  LTP G  P ++         
Sbjct: 262  KWNSRLGSGSLTPDSTGQGSGLGSGSLTPNAVKLVSSSGCLTPEGVAPTARNGIYVGKQT 321

Query: 218  -ECDILEN------------NHRVSFELRGEDI 159
             E   L N            +HRVSFEL GED+
Sbjct: 322  SELTPLANSENECQPNAALVDHRVSFELTGEDV 354


Top