BLASTX nr result

ID: Rheum21_contig00011415 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00011415
         (1842 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   429   e-117
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   424   e-116
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   422   e-115
ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-...   422   e-115
ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...   421   e-115
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   421   e-115
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   420   e-115
gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus pe...   410   e-111
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   407   e-111
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   407   e-111
ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210...   380   e-102
ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225...   379   e-102
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   370   1e-99
gb|ESW15448.1| hypothetical protein PHAVU_007G073100g [Phaseolus...   347   1e-92
ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein...   343   1e-91
ref|XP_002865912.1| hydroxyproline-rich glycoprotein family prot...   343   2e-91
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]     338   5e-90
ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Caps...   338   5e-90
ref|XP_006589528.1| PREDICTED: uncharacterized protein LOC100798...   337   1e-89
gb|ESW24210.1| hypothetical protein PHAVU_004G111400g [Phaseolus...   336   2e-89

>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  429 bits (1104), Expect = e-117
 Identities = 239/437 (54%), Positives = 282/437 (64%), Gaps = 37/437 (8%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            ESRVQP+T VQKR WG C SLY CFG  + SKR+ HA+LVPEP+V     P  EN N ST
Sbjct: 22   ESRVQPTT-VQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAVAPASENLNLST 80

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + +PF+        FL+SDP ++ QSP G ++L +LSVN+YSPSGPA++FA GPYA+ET
Sbjct: 81   SIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHET 140

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPNQKH 854
            QLVSPPVFSTF TEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL+R+RRNSG NQK 
Sbjct: 141  QLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKL 200

Query: 855  GLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESKNF 1028
             LS YEF  +QLY  SP GHLISP    S SGTSSP+P + P     I +A KL   ++F
Sbjct: 201  SLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRRP-----IVEAPKLLGFEHF 252

Query: 1029 VTTHKWGSRLGSGSLTPDGLGPASRDSLLLENQISEVASLANSEHGSESGEAVVDQRVSF 1208
             +T +WGSRLGSGSLTPDG GPASRDS LLENQISEVASLANSE GS++GE V+D RVSF
Sbjct: 253  -STRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSF 311

Query: 1209 ELAGEDVPTCVEIKKTSPHSPQDNVVPACL-------------XXXXXXCELCV-EVTTT 1346
            ELAGEDV  CVE K  +      N +   +                   CE CV E    
Sbjct: 312  ELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKA 371

Query: 1347 EMPEGDPQEENNNCKHKHSSVSLGSVKEFKFDSADRD---------------------GG 1463
               +   + E   C  KH  +  GS+KEF FD+   +                     G 
Sbjct: 372  ASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGT 431

Query: 1464 GPSNSWTFFPLLHPGVS 1514
            GP  +WTFFPLL PG+S
Sbjct: 432  GPQTNWTFFPLLQPGIS 448


>gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 485

 Score =  424 bits (1091), Expect = e-116
 Identities = 241/468 (51%), Positives = 294/468 (62%), Gaps = 68/468 (14%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            +SRVQP+T VQK+ WG CW LY CFG QK+SKR+ HA+LVPEP+V   ++ T EN ++ T
Sbjct: 22   DSRVQPTT-VQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPT 80

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + +PF+        FL+SDP +A QSP G ++L SLSVN+YSP GPA++FA GPYA+ET
Sbjct: 81   GIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHET 140

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPNQKH 854
            QLV+PPVFS  TTEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL RARRNSG NQK 
Sbjct: 141  QLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKF 200

Query: 855  GLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESKNF 1028
            GLS+YEF  +Q+Y GSP G+LISPGS  S SGTSSP+P + P+L FR+ +A KL   +NF
Sbjct: 201  GLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFENF 260

Query: 1029 VTTHKWGSRLGSGS--------------------------------LTPDGLGPASRDSL 1112
             TT KWGSRLGSGS                                LTPDGLGPASRD  
Sbjct: 261  -TTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGF 319

Query: 1113 LLENQISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVEIKKTSP-----HSPQD 1277
            L+ +QISEVA LAN  +G ++ E +VD RVSFEL+GEDV  C+E K   P       P+D
Sbjct: 320  LVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKD 379

Query: 1278 NVV------PACLXXXXXXCELCVEVT---TTEMPEGDPQEENNNCKHKHSSVSLGSVKE 1430
             V                 CEL +  T   T E   G+ +EE++    KH SV+LGS+KE
Sbjct: 380  LVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHS--YQKHRSVTLGSIKE 437

Query: 1431 FKFDSADRDGGG--------------------PSNSWTFFPLLHPGVS 1514
            F FD+   +                       P NSWTFFP+L P VS
Sbjct: 438  FNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|222858882|gb|EEE96429.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 507

 Score =  422 bits (1086), Expect = e-115
 Identities = 246/486 (50%), Positives = 302/486 (62%), Gaps = 86/486 (17%)
 Frame = +3

Query: 315  ESRVQPSTS-VQKRSWGGCWSLYSCFGCQ---KSSKRVDHAILVPEPIVHRNTIPTGENP 482
            ESRVQPS+S VQKR WGGCWSLY CFG     K+SKR+ HA+LVPEP V      + EN 
Sbjct: 23   ESRVQPSSSSVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVLVPEPEVPGAVSSSTENQ 82

Query: 483  NHSTPMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPY 662
              STP+ +PF+        FL+SDP ++ QSP G ++L SLS N+YSP GPA++FA GPY
Sbjct: 83   TQSTPILLPFIAPPSSPASFLQSDPPSSTQSPAGLLSLTSLSANAYSPRGPASIFAIGPY 142

Query: 663  AYETQLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGP 842
            A+ETQLV+PPVFS FTTEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL RARRNSGP
Sbjct: 143  AHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGP 202

Query: 843  NQKHGLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFE 1016
            NQK  LS+YEF  + LY GSP G +ISPGS  S SGTSSP+P +HP+L FR+ +A KL  
Sbjct: 203  NQKFSLSHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLLG 262

Query: 1017 SKNFVTTHKWGSRLGSGS-------------------LTPDGLG---------------- 1091
             ++F +T KWGSRLGSGS                   +TPDG+G                
Sbjct: 263  FEHF-STRKWGSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMGLSRLCSGTATPDGAGL 321

Query: 1092 ---------------PASRDSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGED 1226
                           PAS+   LLENQISEVASL NSE+GS++ E VV  RVSFEL+GE+
Sbjct: 322  RSRLGSGTLTPDCFVPASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEE 381

Query: 1227 VPTCVEIK-----KTSPHSPQDNVV--PACLXXXXXXCELCVE--VTTTEMPEGDPQE-E 1376
            V  C+EIK     +T P  PQD +   P          E C++    ++EMPE + +E E
Sbjct: 382  VARCLEIKSVASTRTFPEYPQDTMPEDPVRGDRLAMNGERCLQNGEASSEMPEKNSEETE 441

Query: 1377 NNNCKHKHSSVSLGSVKEFKFDSADRD--------------------GGGPSNSWTFFPL 1496
             ++   KH S++LGS+KEF FD++  +                       P+NSWTFFPL
Sbjct: 442  EDHVYRKHRSITLGSIKEFNFDNSKGEVSDKPAISSEWWANETIAGKEARPANSWTFFPL 501

Query: 1497 LHPGVS 1514
            L P VS
Sbjct: 502  LQPEVS 507


>ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis]
          Length = 500

 Score =  422 bits (1085), Expect = e-115
 Identities = 239/482 (49%), Positives = 295/482 (61%), Gaps = 82/482 (17%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            ESR++P+ ++QKR WG CWSLY CFG  K+SKR+ HA+L+PEP+V     P  E   HST
Sbjct: 22   ESRLRPA-AIQKRRWGSCWSLYWCFGSHKTSKRISHAVLLPEPMVTGAAAPAAETQAHST 80

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + +PF+        FL+SDPS+A QSP G ++L SLSVN+YSP GPA++FA GPYA+ET
Sbjct: 81   AIVLPFIAPPSSPASFLQSDPSSATQSPAGLLSLNSLSVNAYSPGGPASMFAIGPYAHET 140

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPNQKH 854
            QLV+PPVFS FTTEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL RARRNSG NQK 
Sbjct: 141  QLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKL 200

Query: 855  GLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESKNF 1028
             LS+Y +  +QLY GSP G LISPGSV S SGTSSP+P +HP+L F  A A KL   ++F
Sbjct: 201  SLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDRHPILDFSAAAAPKLLGFEHF 260

Query: 1029 VTTHKW------------------------------------------------GSRLGS 1064
             TT KW                                                GSRLGS
Sbjct: 261  -TTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSRLGSGTVTPDGAGLGSRLGS 319

Query: 1065 GSLTPDGLGPASRDSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVE 1244
            GSLTPDG+GP SRD  + ENQISEVASLANS++G++S E ++D RVSFEL+GE+V  C+ 
Sbjct: 320  GSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDHRVSFELSGEEVARCLA 379

Query: 1245 IKKTS-----PHSPQDNVVP-------ACLXXXXXXCELCVEVTTTEMPEGDPQE-ENNN 1385
             K  +     P  PQD +VP         L       ELC E ++  MPE   ++ E   
Sbjct: 380  NKSAASPRIVPEFPQD-IVPEGEIRRDGKLTDSENHFELCPEESSNRMPEKTMRDGEEEY 438

Query: 1386 CKHKHSSVSLGSVKEFKFDSADRDGGG-------------------PSNSWTFFPLLHPG 1508
            C  KH S++LGS+KEF FD+ + +                      PSN+WTFFP+L   
Sbjct: 439  CYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENVGKESKPSNNWTFFPMLQSE 498

Query: 1509 VS 1514
             S
Sbjct: 499  AS 500


>ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina]
            gi|557523850|gb|ESR35217.1| hypothetical protein
            CICLE_v10004813mg [Citrus clementina]
          Length = 500

 Score =  421 bits (1083), Expect = e-115
 Identities = 239/482 (49%), Positives = 294/482 (60%), Gaps = 82/482 (17%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            ESR++P+ ++QKR WG CWSLY CFG  K+SKR+ HA+LVPEP+V     P  E   HST
Sbjct: 22   ESRLRPA-AIQKRRWGSCWSLYWCFGSHKTSKRISHAVLVPEPMVTGAAAPAAETQAHST 80

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + +PF+        FL+SDP +A QSP G ++L SLSVN+YSP GPA++FA GPYA+ET
Sbjct: 81   AIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLSVNAYSPGGPASMFAIGPYAHET 140

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPNQKH 854
            QLV+PPVFS FTTEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL RARRNSG NQK 
Sbjct: 141  QLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKL 200

Query: 855  GLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESKNF 1028
             LS+Y +  +QLY GSP G LISPGSV S SGTSSP+P +HP+L F  A A KL   ++F
Sbjct: 201  SLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDRHPILDFSAAAAPKLLGFEHF 260

Query: 1029 VTTHKW------------------------------------------------GSRLGS 1064
             TT KW                                                GSRLGS
Sbjct: 261  -TTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSRLGSGTVTPDGAGLGSRLGS 319

Query: 1065 GSLTPDGLGPASRDSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVE 1244
            GSLTPDG+GP SRD  + ENQISEVASLANS++G++S E ++D RVSFEL+GE+V  C+ 
Sbjct: 320  GSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDHRVSFELSGEEVARCLA 379

Query: 1245 IKKTS-----PHSPQDNVVP-------ACLXXXXXXCELCVEVTTTEMPEGDPQE-ENNN 1385
             K  +     P  PQD +VP         L       ELC E ++  MPE   ++ E   
Sbjct: 380  NKSAASPRIVPEFPQD-IVPEGEIRRDGKLTDSENHFELCPEESSNRMPEKTMRDGEEEY 438

Query: 1386 CKHKHSSVSLGSVKEFKFDSADRDGGG-------------------PSNSWTFFPLLHPG 1508
            C  KH S++LGS+KEF FD+ + +                      PSN+WTFFP+L   
Sbjct: 439  CYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENVGKESKPSNNWTFFPMLQSE 498

Query: 1509 VS 1514
             S
Sbjct: 499  AS 500


>gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 489

 Score =  421 bits (1082), Expect = e-115
 Identities = 240/471 (50%), Positives = 293/471 (62%), Gaps = 71/471 (15%)
 Frame = +3

Query: 315  ESRVQPST---SVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPN 485
            +SRVQP+T    V K+ WG CW LY CFG QK+SKR+ HA+LVPEP+V   ++ T EN +
Sbjct: 22   DSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVS 81

Query: 486  HSTPMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYA 665
            + T + +PF+        FL+SDP +A QSP G ++L SLSVN+YSP GPA++FA GPYA
Sbjct: 82   NPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYA 141

Query: 666  YETQLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPN 845
            +ETQLV+PPVFS  TTEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL RARRNSG N
Sbjct: 142  HETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGIN 201

Query: 846  QKHGLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFES 1019
            QK GLS+YEF  +Q+Y GSP G+LISPGS  S SGTSSP+P + P+L FR+ +A KL   
Sbjct: 202  QKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGF 261

Query: 1020 KNFVTTHKWGSRLGSGS--------------------------------LTPDGLGPASR 1103
            +NF TT KWGSRLGSGS                                LTPDGLGPASR
Sbjct: 262  ENF-TTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASR 320

Query: 1104 DSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVEIKKTSP-----HS 1268
            D  L+ +QISEVA LAN  +G ++ E +VD RVSFEL+GEDV  C+E K   P       
Sbjct: 321  DGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEY 380

Query: 1269 PQDNVV------PACLXXXXXXCELCVEVT---TTEMPEGDPQEENNNCKHKHSSVSLGS 1421
            P+D V                 CEL +  T   T E   G+ +EE++    KH SV+LGS
Sbjct: 381  PKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHS--YQKHRSVTLGS 438

Query: 1422 VKEFKFDSADRDGGG--------------------PSNSWTFFPLLHPGVS 1514
            +KEF FD+   +                       P NSWTFFP+L P VS
Sbjct: 439  IKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  420 bits (1080), Expect = e-115
 Identities = 240/478 (50%), Positives = 292/478 (61%), Gaps = 78/478 (16%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            ESRVQP+T VQKR WGGCWSLY CFG  K+ KR+ HA+L PEP V    + + EN + ST
Sbjct: 36   ESRVQPTT-VQKRRWGGCWSLYWCFGSHKT-KRIGHAVLAPEPEVQGAVVTSAENQSQST 93

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             +++PF+        FL+SDP +A QSP G ++L SLSVN+YSP GPA++FA GPYA+ET
Sbjct: 94   AITVPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHET 153

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPNQKH 854
            QLV+PP FS FTTEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL RARRNSG NQK 
Sbjct: 154  QLVTPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKF 213

Query: 855  GLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESKNF 1028
             LS+YEF  + LY GSP G LISPGSV S SGTSSP+P ++P+L FR+ +A KL   ++F
Sbjct: 214  ALSHYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHF 273

Query: 1029 VTTHKWGSR------------------------------------------------LGS 1064
             TT KWGSR                                                LGS
Sbjct: 274  -TTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGS 332

Query: 1065 GSLTPDGLGPASRDSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVE 1244
            GSLTPD +GPASRD   LENQISEVASLANSE+GS++ E +VD RVSFEL+GE+V  C+E
Sbjct: 333  GSLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLE 392

Query: 1245 IK-----KTSPHSPQDNVVPACLXXXXXXC---ELCVEVTTTEMPEGDPQE-ENNNCKHK 1397
             K     +     P D++    +           L    T+ E PE    E E  +C  K
Sbjct: 393  SKSLASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTGETSGETPEKPSGEMEEEHCYRK 452

Query: 1398 HSSVSLGSVKEFKFDSADR-------------------DGGGPSNSWTFFPLLHPGVS 1514
            H S++LGS+KEF FD++                         P+N+WTFFPLL P VS
Sbjct: 453  HRSITLGSIKEFNFDNSKEVPDKPSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510


>gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  410 bits (1054), Expect = e-111
 Identities = 236/480 (49%), Positives = 284/480 (59%), Gaps = 81/480 (16%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            E+R QP+T V KR WG CWSLY CFG  K+ KR+ HA+LVPEP+V    +   +N   ST
Sbjct: 22   EARPQPTT-VPKRRWGSCWSLYWCFGPHKN-KRIGHAVLVPEPVVPGAAVSAIDNQTTST 79

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + +PF+        FL SDP +A QSP G ++L SLS N+YSP GPA++F+ GPYAYET
Sbjct: 80   AIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSANAYSPGGPASIFSIGPYAYET 139

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPNQKH 854
            QLVSPPVFSTF TEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL+R RRNSG NQK 
Sbjct: 140  QLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRNRRNSGTNQKF 199

Query: 855  GLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESKNF 1028
             LS+YEF  +Q Y GSP G+LISPGS  S SGTSSP+P +HP+L FR+ +A KLF   +F
Sbjct: 200  ALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDRHPVLEFRMGEAPKLFGFDHF 259

Query: 1029 VTTHKWGSRLGSGSLTPDGL---------------------------------------- 1088
             TT KWGSR+GSGSLTPDG+                                        
Sbjct: 260  -TTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSRLGSGCVTPNGAGIGSRLGS 318

Query: 1089 --------GPASRDSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVE 1244
                    GPASRDS LLENQISEVASLANSE G ++ E V D RVSFEL GEDV  C+ 
Sbjct: 319  GCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETVFDHRVSFELTGEDVACCLA 378

Query: 1245 IKKTSPH---SPQDNVV--------PACLXXXXXXCELCVEVTTTEMPEGDPQEENNNCK 1391
             K  + +   S    V+         A        CE  VE +++ +PE    E  +   
Sbjct: 379  NKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSVEESSSRIPENVSGEGEDQGY 438

Query: 1392 HKHSSVSLGSVKEFKFDSADRD--------------------GGGPSNSWTFFPLLHPGV 1511
             KH S++LGS K+F FD+   +                       P N WTFFP+L PGV
Sbjct: 439  RKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNVAAKESKPCNDWTFFPILQPGV 498


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  407 bits (1046), Expect = e-111
 Identities = 234/451 (51%), Positives = 272/451 (60%), Gaps = 51/451 (11%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            ESRVQPST VQKR WG CWSLY CFG  K SKR+ HA+LVPEP      +P  ENPNHS 
Sbjct: 22   ESRVQPST-VQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPAAPGPAVPVTENPNHSA 80

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + IPF+        FL SDP +A QSP G ++L SLS+N+YSP G A++FA GPYA+ET
Sbjct: 81   TIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSINAYSPGGTASIFAIGPYAHET 140

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPNQKH 854
            QLVSPPVFSTFTTEPSTA  TPPPE V MTTP SPEVPFAQLLTSSL R RR SG N K 
Sbjct: 141  QLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKF 200

Query: 855  GLSYYEFHQLYS-GSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESKNFV 1031
             LS YEF      GSP  +LISPGSV S SGTSSP+P K P++ FR  +  K    ++F 
Sbjct: 201  PLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHF- 259

Query: 1032 TTHKWGSRLGSGSLTPDGLG----------------------------PASRDSLLLENQ 1127
            +T KWGSR+GSGSLTP G G                            P SRDS LLE Q
Sbjct: 260  STRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLEYQ 319

Query: 1128 ISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVEIKKTSPHSPQ--DNVVPACLX 1301
            ISEVASLANS++GSE GE V+D RVSFEL GEDVP+C E +    HS Q     V   L 
Sbjct: 320  ISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKEPVMSHSQQTLPMDVSNLLA 379

Query: 1302 XXXXXCELCVEVTTTEMPEGDPQEENNNCKHKHSSVSLGSVKEFKFD------------- 1442
                      E  T   P    +   + C  KH +++ GS K+F FD             
Sbjct: 380  NEMKSGSSMAEEKTYGSPRKASESGEDQCHRKHRNITFGSSKDFDFDNVKIEVLEKDSID 439

Query: 1443 ----SADRDGG---GPSNSWTFFPLLHPGVS 1514
                ++D+  G   G  N+WTFFP+L PGVS
Sbjct: 440  CEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
            lycopersicum]
          Length = 470

 Score =  407 bits (1046), Expect = e-111
 Identities = 233/453 (51%), Positives = 274/453 (60%), Gaps = 53/453 (11%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            ESRVQPST VQKR WG CWSLY CFG  K SKR+ HA+LVPEP+     +P  ENPNHS 
Sbjct: 22   ESRVQPST-VQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVTENPNHSA 80

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + IPF+        FL SDP +A QSP G ++L +LS+N+YSP G A++FA GPYA+ET
Sbjct: 81   TIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHET 140

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPNQKH 854
            QLVSPPVFSTFTTEPSTA  TPPPE V MTTP SPEVPFAQLLTSSL R RR SG N K 
Sbjct: 141  QLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKF 200

Query: 855  GLSYYEFHQLYS-GSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESKNFV 1031
             LS YEF      GSP  +LISPGSV S SGTSSP+P K P++ FR  +  K    ++F 
Sbjct: 201  PLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHF- 259

Query: 1032 TTHKWGSRLGSGSLTPDGLG----------------------------PASRDSLLLENQ 1127
            +T KWGSR+GSGS+TP G G                            P SRDS LLENQ
Sbjct: 260  STRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQ 319

Query: 1128 ISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVEIKKTSPHS----PQDNVVPAC 1295
            ISEVASLANS++GSE GEAV+D RVSFEL  EDVP+C E +    HS    P D  V   
Sbjct: 320  ISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMD--VSNL 377

Query: 1296 LXXXXXXCELCVEVTTTEMPEGDPQEENNNCKHKHSSVSLGSVKEFKFDSA-----DRDG 1460
            L           E  T   P    +   + C  KH +++ GS K+F FD+      ++D 
Sbjct: 378  LASEMRSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDS 437

Query: 1461 ---------------GGPSNSWTFFPLLHPGVS 1514
                            G  N+WTFFP+L PGVS
Sbjct: 438  IDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470


>ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210841 [Cucumis sativus]
          Length = 497

 Score =  380 bits (976), Expect = e-102
 Identities = 228/482 (47%), Positives = 279/482 (57%), Gaps = 82/482 (17%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGC--QKSSKRVDHAILVPEPIVHRNTIPTGENPNH 488
            E+RVQP+T   KR WG CWSLY CFG   QKS+KR+ HA+LVPEP V     P  E+   
Sbjct: 22   EARVQPTTP-PKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTP 80

Query: 489  STPMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAY 668
            ST M +PF+        FL+S+P++  QSP G ++L +LSVN+YSP+GPA++FA GPY Y
Sbjct: 81   STTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTY 140

Query: 669  ETQLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPNQ 848
            +TQLVSPPVFS FTTEPSTA +TPPPESVQ+TTPSSPEVPFA+LLTSSL+   ++ G NQ
Sbjct: 141  DTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ 200

Query: 849  KHGLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESK 1022
            K  LS+ +F  +Q Y GSP  HLISPGSV S SGTSSP+P KHP+L FR+ADA KL   +
Sbjct: 201  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLE 260

Query: 1023 NFVTTHKWGSRLGSGSLTPDGLGPASR--------------------------------- 1103
            +F TT KW SR+GSGSLTPDG G  SR                                 
Sbjct: 261  HF-TTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRLGSGSVTPNGMRQDSRL 319

Query: 1104 ---------------DSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTC 1238
                           DS LL+NQISEVASLANSE G ++   V + RVSFEL GEDV  C
Sbjct: 320  GSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQND--VTNHRVSFELTGEDVARC 377

Query: 1239 VEIK-----KTSPHSPQDNVVP-----ACLXXXXXXCELCVEVTTTEMPEGDPQEENNNC 1388
            +  K     +T   SP+                   CE   ++ T+  PE  P E+ + C
Sbjct: 378  LANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEF-FDIKTSAAPEKTPGED-DQC 435

Query: 1389 KHKHSSVSLGSVKEFKFDSADRD--------------------GGGPSNSWTFFPLLHPG 1508
                 +V+LGS KEF FD    +                       P N+WTFFPLL PG
Sbjct: 436  YQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPG 495

Query: 1509 VS 1514
            VS
Sbjct: 496  VS 497


>ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225370 [Cucumis sativus]
          Length = 497

 Score =  379 bits (972), Expect = e-102
 Identities = 227/482 (47%), Positives = 278/482 (57%), Gaps = 82/482 (17%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGC--QKSSKRVDHAILVPEPIVHRNTIPTGENPNH 488
            E+RVQP+T   KR WG CWSLY CFG   QKS+KR+ HA+LVPEP V     P  E+   
Sbjct: 22   EARVQPTTP-PKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTP 80

Query: 489  STPMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAY 668
            ST M +PF+        FL+S+P++  QSP G ++  +LSVN+YSP+GPA++FA GPY Y
Sbjct: 81   STTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSFTALSVNNYSPNGPASIFAIGPYTY 140

Query: 669  ETQLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPNQ 848
            +TQLVSPPVFS FTTEPSTA +TPPPESVQ+TTPSSPEVPFA+LLTSSL+   ++ G NQ
Sbjct: 141  DTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQ 200

Query: 849  KHGLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESK 1022
            K  LS+ +F  +Q Y GSP  HLISPGSV S SGTSSP+P KHP+L FR+ADA KL   +
Sbjct: 201  KFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLE 260

Query: 1023 NFVTTHKWGSRLGSGSLTPDGLGPASR--------------------------------- 1103
            +F TT KW SR+GSGSLTPDG G  SR                                 
Sbjct: 261  HF-TTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRLGSGSVTPNGMRQDSRL 319

Query: 1104 ---------------DSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTC 1238
                           DS LL+NQISEVASLANSE G ++   V + RVSFEL GEDV  C
Sbjct: 320  GSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQND--VTNHRVSFELTGEDVARC 377

Query: 1239 VEIK-----KTSPHSPQDNVVP-----ACLXXXXXXCELCVEVTTTEMPEGDPQEENNNC 1388
            +  K     +T   SP+                   CE   ++ T+  PE  P E+ + C
Sbjct: 378  LANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEF-FDIKTSAAPEKTPGED-DQC 435

Query: 1389 KHKHSSVSLGSVKEFKFDSADRD--------------------GGGPSNSWTFFPLLHPG 1508
                 +V+LGS KEF FD    +                       P N+WTFFPLL PG
Sbjct: 436  YQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPG 495

Query: 1509 VS 1514
            VS
Sbjct: 496  VS 497


>emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]
          Length = 385

 Score =  370 bits (949), Expect = 1e-99
 Identities = 208/387 (53%), Positives = 246/387 (63%), Gaps = 37/387 (9%)
 Frame = +3

Query: 465  PTGENPNHSTPMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANV 644
            P  EN N ST + +PF+        FL+SDP ++ QSP G ++L +LSVN+YSPSGPA++
Sbjct: 8    PASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASM 67

Query: 645  FATGPYAYETQLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRA 824
            FA GPYA+ETQLVSPPVFSTF TEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL+R+
Sbjct: 68   FAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRS 127

Query: 825  RRNSGPNQKHGLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIAD 998
            RRNSG NQK  LS YEF  +QLY  SP GHLISP    S SGTSSP+P + P     I +
Sbjct: 128  RRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRRP-----IVE 179

Query: 999  ASKLFESKNFVTTHKWGSRLGSGSLTPDGLGPASRDSLLLENQISEVASLANSEHGSESG 1178
            A KL   ++F +T +WGSRLGSGSLTPDG GPASRDS LLENQISEVASLANSE GS++G
Sbjct: 180  APKLLGFEHF-STRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNG 238

Query: 1179 EAVVDQRVSFELAGEDVPTCVEIKKTSPHSPQDNVVPACL-------------XXXXXXC 1319
            E V+D RVSFELAGEDV  CVE K  +      N +   +                   C
Sbjct: 239  ETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCC 298

Query: 1320 ELCV-EVTTTEMPEGDPQEENNNCKHKHSSVSLGSVKEFKFDSADRD------------- 1457
            E CV E       +   + E   C  KH  +  GS+KEF FD+   +             
Sbjct: 299  EFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWW 358

Query: 1458 --------GGGPSNSWTFFPLLHPGVS 1514
                    G GP  +WTFFPLL PG+S
Sbjct: 359  VNEKVVGKGTGPQTNWTFFPLLQPGIS 385


>gb|ESW15448.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris]
          Length = 479

 Score =  347 bits (890), Expect = 1e-92
 Identities = 214/464 (46%), Positives = 265/464 (57%), Gaps = 69/464 (14%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTG---ENPN 485
            ESRVQP+TS +KR WG CWSLY CFG  K+SKR+ +A+LVPEP+     I +      PN
Sbjct: 19   ESRVQPATSPKKR-WGSCWSLYWCFGPHKNSKRIGNAVLVPEPVEPAGQIGSHLATAAPN 77

Query: 486  HSTPMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYA 665
             ST +++PF+        FL SD S+A QSP G  +L+SL+ N+    GPA++FA GPY 
Sbjct: 78   PSTAVAMPFIVPPSSPASFLESDSSSATQSPVGLFSLSSLNANA--SCGPASIFAIGPYT 135

Query: 666  YETQLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPN 845
            YETQLVSPPVFS FTTEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL+R  ++ G N
Sbjct: 136  YETQLVSPPVFSNFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRDCKDKGTN 195

Query: 846  QKHGLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFES 1019
            Q+  LS YEF  +Q Y GSP   LISP S+ STSG+S+P+P  HPLL F   +AS L   
Sbjct: 196  QRFALSNYEFQLYQQYPGSPGPQLISPASIISTSGSSTPFPDTHPLLEFHKGEASNLLGF 255

Query: 1020 KNFVTTHKWGSRLGSGSLTPD------------------------------GLGPASRDS 1109
            ++F +THKW SRLGSGSLTPD                              G+ P +R+ 
Sbjct: 256  EHF-STHKWNSRLGSGSLTPDSTGQGSGLGSGSLTPNAVKLVSSSGCLTPEGVAPTARNG 314

Query: 1110 LLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVEIKKTSP------HSP 1271
            + +  Q SE+  LANSE+  +   A+VD RVSFEL GEDV  C+  K  SP       S 
Sbjct: 315  IYVGKQTSELTPLANSENECQPNAALVDHRVSFELTGEDVARCLANKSGSPLIGNISGSS 374

Query: 1272 QDNVVPACL------XXXXXXCELCVEVTTTEMPEGDPQEENNNCKHKHSSVSLGSVKEF 1433
            Q  +V   +            C+LC   T+ + PE  P E    C  KH+S S  S K+F
Sbjct: 375  QGALVGEPVDRERIHKNSDSDCDLCSRKTSNDKPENSPGEGEEQCCLKHNSSS--SSKDF 432

Query: 1434 KFDSADRDG----------------------GGPSNSWTFFPLL 1499
             FDS  R G                      G  SN   FFP+L
Sbjct: 433  NFDS--RKGVVSDNPANASEWWTNKKIVGKEGSSSNGSAFFPML 474


>ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|10177409|dbj|BAB10540.1| unnamed protein
            product [Arabidopsis thaliana] gi|40823427|gb|AAR92282.1|
            At5g52430 [Arabidopsis thaliana]
            gi|56381929|gb|AAV85683.1| At5g52430 [Arabidopsis
            thaliana] gi|110738650|dbj|BAF01250.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332008830|gb|AED96213.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 438

 Score =  343 bits (881), Expect = 1e-91
 Identities = 215/433 (49%), Positives = 259/433 (59%), Gaps = 33/433 (7%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            ESRVQPS+S QK  WG CWSLYSCFG QK++KR+ +A+LVPEP+     + T +N   ST
Sbjct: 23   ESRVQPSSS-QKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTSGVPVVTVQNSATST 81

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + +PF+        FL+SDPS+   SP GP++L S   N++SP  P +VF  GPYA ET
Sbjct: 82   TVVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTS---NTFSPKEPQSVFTVGPYANET 138

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPE-SVQMTTPSSPEVPFAQLLTSSLNRARRN--SGPN 845
            Q V+PPVFS F TEPSTA  TPPPE SV +TTPSSPEVPFAQLLTSSL   RR+  SG N
Sbjct: 139  QPVTPPVFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMN 198

Query: 846  QKHGLSYYEF--HQLYSGSP-AGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFE 1016
            QK   S+YEF  +Q+  GSP  G+LISPGSV S SGTSSPYP K P++ FRI +  K   
Sbjct: 199  QKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLG 258

Query: 1017 SKNFVTTHKWGSRLGSGSLTPDGLGPASRDSLL--------------------LENQISE 1136
             ++F T  KWGSR GSGS+TP G G       L                    L+NQISE
Sbjct: 259  FEHF-TARKWGSRFGSGSITPVGHGSGLASGALTPNGPEIVSGNLTPNNTTWPLQNQISE 317

Query: 1137 VASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVEIKKTSPHSPQDNVVPACLXXXXXX 1316
            VASLANS+HGSE    V D RVSFEL GEDV  C+  K    H   +N            
Sbjct: 318  VASLANSDHGSE--VMVADHRVSFELTGEDVARCLASKLNRSHDRMNN---------NDR 366

Query: 1317 CELCVEVTT-----TEMPEGDPQEENNNCKHKHSSVSLGSVKEFKFDSADRDG--GGPSN 1475
             E     +T      E   GD + E +  + K SS S+GS KEFKFD+   +       N
Sbjct: 367  IETEESSSTDIRRNIEKRSGDRENEQHRIQ-KLSSSSIGSSKEFKFDNTKDENIEKVAGN 425

Query: 1476 SWTFFPLLHPGVS 1514
            SW+FFP L  GVS
Sbjct: 426  SWSFFPGLRSGVS 438


>ref|XP_002865912.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
            subsp. lyrata] gi|297311747|gb|EFH42171.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 437

 Score =  343 bits (879), Expect = 2e-91
 Identities = 215/433 (49%), Positives = 256/433 (59%), Gaps = 33/433 (7%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            ESRVQPS SVQK  WG CWSLYSCFG QK++KR+ +A+LVPEP+     + T +N   ST
Sbjct: 22   ESRVQPS-SVQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVASGVPVVTVQNSATST 80

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + +PF+        FL+SDPS+   SPGG ++L S   N++SP  P +VF  GPYA ET
Sbjct: 81   TVVLPFIAPPSSPASFLQSDPSSVSHSPGGQLSLTS---NTFSPKEPQSVFTVGPYANET 137

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPE-SVQMTTPSSPEVPFAQLLTSSLNRARRN--SGPN 845
            Q V+PPVFS F TEPSTA  TPPPE SV +TTPSSPEVPFAQLLTSSL   RRN  SG N
Sbjct: 138  QPVTPPVFSAFVTEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRNSSSGMN 197

Query: 846  QKHGLSYYEF--HQLYSGSP-AGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFE 1016
            QK   S+YEF  +Q+  GSP  G+LISPGSV S SGTSSPYP K P++ FRI +  K   
Sbjct: 198  QKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLG 257

Query: 1017 SKNFVTTHKWGSRLGSGSLTPDGLGPASRDSLL--------------------LENQISE 1136
             ++F T  KWGSR GSGS+TP G G       L                    L NQISE
Sbjct: 258  FEHF-TARKWGSRFGSGSITPVGHGSGLASGALTPNGLEIISGNLTPSNTTWPLHNQISE 316

Query: 1137 VASLANSEHGSESGEAVVDQRVSFELAGEDVPTCVEIKKTSPHSPQDNVVPACLXXXXXX 1316
            VASLANS+HGSE    V D RVSFEL GEDV  C+  K    H   +N            
Sbjct: 317  VASLANSDHGSE--VIVADHRVSFELTGEDVARCLASKLNRSHDRMNN---------NDR 365

Query: 1317 CELCVEVTT-----TEMPEGDPQEENNNCKHKHSSVSLGSVKEFKFDSADRDG--GGPSN 1475
             E     +T      E    D + E    +  +SS S+GS KEFKFD+   +       N
Sbjct: 366  IETEESSSTDLRRNMEKRSADRETEQQRIQKLNSS-SIGSSKEFKFDNTKDENIEKVAGN 424

Query: 1476 SWTFFPLLHPGVS 1514
            SW+FFP L  GVS
Sbjct: 425  SWSFFPGLRSGVS 437


>gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]
          Length = 521

 Score =  338 bits (867), Expect = 5e-90
 Identities = 172/266 (64%), Positives = 204/266 (76%), Gaps = 3/266 (1%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            E+R QP+ +V KR WG CWSLY CFG  K+SKR+ HA+LVPEP++     P  EN   ST
Sbjct: 22   EARAQPA-AVPKRRWGSCWSLYWCFGSHKNSKRIGHAVLVPEPVLPGAAAPAPENQAPST 80

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + +PF+        FL+SDP +A QSP G ++L SLS+N+YSP GP ++FA GPYAYET
Sbjct: 81   AIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSINAYSPGGPTSIFAIGPYAYET 140

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRN-SGPNQK 851
            QLVSPPVFSTFTTEPSTA  TPPPESVQ+TTPSSPEVPFAQLLTSSL+R RRN SG NQK
Sbjct: 141  QLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRTRRNSSGANQK 200

Query: 852  HGLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESKN 1025
              LS+ EF  +QLY GSP G+LISPGSV S SGTSSP+P KHP+L FR+ +A +L   ++
Sbjct: 201  FSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPFPDKHPILGFRMGEAPRLLGFEH 260

Query: 1026 FVTTHKWGSRLGSGSLTPDGLGPASR 1103
            F TT KWGSRLGSGSLTPDG+G  SR
Sbjct: 261  F-TTWKWGSRLGSGSLTPDGVGLGSR 285



 Score =  128 bits (321), Expect = 1e-26
 Identities = 80/191 (41%), Positives = 105/191 (54%), Gaps = 35/191 (18%)
 Frame = +3

Query: 1047 GSRLGSGSLTPDGLGPASRDSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGED 1226
            GSRLGSG+LTPDG    S DS LLENQISEVASLANS++G ++  +VVD RVSFEL GED
Sbjct: 331  GSRLGSGTLTPDGFLVVSGDSFLLENQISEVASLANSDNGCQNDGSVVDHRVSFELTGED 390

Query: 1227 VPTCVEIK------KTSPHSPQDNVVPACLXXXXXXC--------ELCVEVTTTEMPEGD 1364
            V  C+  K      +T+  S +D+                     + CVE T+ + P+ D
Sbjct: 391  VARCLASKSASSNGRTTSESLEDSPAECPTKKDGISANNVDSPNDQSCVEETSNKTPQSD 450

Query: 1365 PQE-ENNNCKHKHSSVSLGSVKEFKFDSADRD--------------------GGGPSNSW 1481
             +E E+++   KH S++LGS+KEF FD+   D                         NSW
Sbjct: 451  CREGEDDHFYQKHRSITLGSIKEFNFDNTKADVSVKPTIGSEWWANEKVAGKEAKAGNSW 510

Query: 1482 TFFPLLHPGVS 1514
            +FFP+L PGVS
Sbjct: 511  SFFPILQPGVS 521


>ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Capsella rubella]
            gi|482549191|gb|EOA13385.1| hypothetical protein
            CARUB_v10026425mg [Capsella rubella]
          Length = 437

 Score =  338 bits (867), Expect = 5e-90
 Identities = 212/430 (49%), Positives = 253/430 (58%), Gaps = 30/430 (6%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTIPTGENPNHST 494
            ESRVQPS SVQKR W  CWSLYSCFG QK++KR+ +A+LVPEP+     + T +N   ST
Sbjct: 22   ESRVQPS-SVQKRRWAKCWSLYSCFGSQKNNKRIGNAVLVPEPVASGVPVVTVQNSATST 80

Query: 495  PMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYAYET 674
             + +PF+        FL SDPS+   SP GP++L S   N++SP  P +VF  GPYA ET
Sbjct: 81   TVVLPFIAPPSSPASFLPSDPSSVSHSPVGPLSLTS---NTFSPKEPQSVFTVGPYANET 137

Query: 675  QLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARR-NSGPNQK 851
            Q V+PPVFS F TEPSTA  TPPPES    TPSSPEVPFAQLLTSSL   RR +SG NQK
Sbjct: 138  QPVTPPVFSAFITEPSTAPYTPPPES--SVTPSSPEVPFAQLLTSSLELTRRDSSGINQK 195

Query: 852  HGLSYYEF--HQLYSGSP-AGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFESK 1022
               S+YEF  +Q+  GSP  G+LISPGSV S SGTSSPYP K P++ FRI +  K    +
Sbjct: 196  FSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLGFE 255

Query: 1023 NFVTTHKWGSRLGSGSLTPDGLGPASRDSLL--------------------LENQISEVA 1142
            +F T  KWGSR GSGS+TP G G       L                    L+NQISEVA
Sbjct: 256  HF-TARKWGSRFGSGSITPVGHGSGMASGALTPNAPEIISGNLTPSNTTWPLQNQISEVA 314

Query: 1143 SLANSEHGSESGEAVVDQRVSFELAGEDVPTCVEIKKTSPHSPQDN----VVPACLXXXX 1310
            SLANS+HGSE    V D RVSFEL GEDV  C+  K    H   +N              
Sbjct: 315  SLANSDHGSE--VIVADHRVSFELTGEDVARCLASKLNRSHDRMNNNDRIATEESSSTDR 372

Query: 1311 XXCELCVEVTTTEMPEGDPQEENNNCKHKHSSVSLGSVKEFKFDSADRDG--GGPSNSWT 1484
                   ++ +TE  E + Q        K SS S+GS KEFKFD+   +       NSW+
Sbjct: 373  GRRNSFQKIESTENRETEQQR-----IQKLSSSSIGSSKEFKFDNTKDENIEKVAGNSWS 427

Query: 1485 FFPLLHPGVS 1514
            FFP L  GVS
Sbjct: 428  FFPGLRSGVS 437


>ref|XP_006589528.1| PREDICTED: uncharacterized protein LOC100798631 isoform X1 [Glycine
            max]
          Length = 504

 Score =  337 bits (864), Expect = 1e-89
 Identities = 210/487 (43%), Positives = 261/487 (53%), Gaps = 92/487 (18%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKSSKRVDHAILVPEPIVHRNTI---PTGENPN 485
            ESR+QP+T+V K+ WG CWSL  CFG  K+SKRV +A+LVPEP+     +   P    PN
Sbjct: 22   ESRIQPTTTVPKKRWGSCWSLCWCFGPHKNSKRVGNAVLVPEPVEPIGPVGFHPATAAPN 81

Query: 486  HSTPMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVFATGPYA 665
             ST + +PF+        FL+SDP +A QSP G  +L+SL+VN+    GPA++FA GPY 
Sbjct: 82   PSTAIVMPFIVPPSSPASFLQSDPPSATQSPVGLFSLSSLTVNA--SGGPASIFAIGPYT 139

Query: 666  YETQLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRARRNSGPN 845
            YETQLVSPPVFSTFTTEPSTA  TPPPESVQ+TTPSSPEVPFAQLL SSL+R  +++G N
Sbjct: 140  YETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLASSLDRNCKSNGTN 199

Query: 846  QKHGLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADASKLFES 1019
            Q+  LS YEF  +Q Y GSP   L+SP S+ STSG+S+P+P +HP+L F   +A KL   
Sbjct: 200  QRFALSNYEFQPYQQYPGSPGTQLVSPRSIISTSGSSTPFPDRHPVLEFHKGEAPKLLGF 259

Query: 1020 KNFVTTHKWGSRLGSGSLTPDGLG------------------------------------ 1091
            +NF+ THKW SRLGSGSLTPD  G                                    
Sbjct: 260  ENFL-THKWNSRLGSGSLTPDSAGQGSRLGSGSFTPDAVKLASQLGSGCLTPDGLCQDSR 318

Query: 1092 ------------PASRDSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELAGEDVPT 1235
                        P +R+ + +  QISEV S+ NSE+  +   A+VD RVSFEL G DVP 
Sbjct: 319  FGSGSLTPDAVAPTARNDIDIGKQISEVTSIVNSENECQPKAALVDHRVSFELTGVDVPR 378

Query: 1236 CVEIKK--------------TSPHSPQDNVVPACLXXXXXXCELCVEVTTTEMPE---GD 1364
            C+  K               T    P D  +          C  C   T+    +     
Sbjct: 379  CLANKSGSSLLGNMSGSSQGTLVEDPVD--IEKIQKNSNSSCAFCSRKTSNASNDKSCNS 436

Query: 1365 PQEENNNCKHKHSSVSLGSVKEFKFDSADRDG----------------------GGPSNS 1478
            P E    C  KH   S  S KEF FD  +R G                      G  SNS
Sbjct: 437  PGEGAEQCCRKHH--SFNSSKEFNFD--NRKGVVSDTPANSSNWWTNKKIVGKEGRSSNS 492

Query: 1479 WTFFPLL 1499
            WTFFP+L
Sbjct: 493  WTFFPML 499


>gb|ESW24210.1| hypothetical protein PHAVU_004G111400g [Phaseolus vulgaris]
          Length = 500

 Score =  336 bits (861), Expect = 2e-89
 Identities = 215/488 (44%), Positives = 270/488 (55%), Gaps = 93/488 (19%)
 Frame = +3

Query: 315  ESRVQPSTSVQKRSWGGCWSLYSCFGCQKS---SKRVDHAILVPEPIVHRNTIPTGEN-- 479
            ESRVQP+T V K+ WGGCWS Y CFG  KS   SKR+ HA+LVPEP+      PTG    
Sbjct: 23   ESRVQPTT-VPKKRWGGCWSQYWCFGSYKSTKSSKRIGHAVLVPEPVA-----PTGPAAA 76

Query: 480  ----PNHSTPMSIPFVXXXXXXXXFLRSDPSTAIQSPGGPINLASLSVNSYSPSGPANVF 647
                PN ST + +PF+         ++SDP +AIQSP G ++L+SL+ ++YS  GPA++F
Sbjct: 77   AAAPPNPSTAIVMPFIAPPSSPASLIQSDPPSAIQSPPGLLSLSSLAASAYSSGGPASMF 136

Query: 648  ATGPYAYETQLVSPPVFSTFTTEPSTAAVTPPPESVQMTTPSSPEVPFAQLLTSSLNRAR 827
              GPYAYETQLVSPPVFS FTTEPSTA  TPPPESV  TTPSSP+VPFAQLL SSL+RAR
Sbjct: 137  TIGPYAYETQLVSPPVFSNFTTEPSTAPFTPPPESVHQTTPSSPDVPFAQLLASSLDRAR 196

Query: 828  RNSGPNQKHGLSYYEF--HQLYSGSPAGHLISPGSVNSTSGTSSPYPVKHPLLAFRIADA 1001
            +++G NQK  L  Y+F  +  Y GSP G LISPGS  STSGTS+P+P + P L FR  + 
Sbjct: 197  KSNG-NQKFALYNYDFQPYHQYPGSPGGQLISPGSAFSTSGTSTPFPDRPPTLEFRKGET 255

Query: 1002 SKLFESKNFVTTHKWGSRLGSGSLTPDGLGPASR-------------------------- 1103
             K+   ++F +T +W SRLGSGSLTPDG G  SR                          
Sbjct: 256  PKILGVEHF-STQRWSSRLGSGSLTPDGAGQGSRLGSGSVTPDGVGLASRLGSGCATPDG 314

Query: 1104 ----------------------DSLLLENQISEVASLANSEHGSESGEAVVDQRVSFELA 1217
                                  ++L ++NQIS+ A+LANS++G  S   ++D RVSFEL 
Sbjct: 315  LGQESRLGSGCLTPDGVGQINENNLPVQNQISKEATLANSDNGHPSNATLIDHRVSFELT 374

Query: 1218 GEDVPTC------VEIKKTSPHS-------PQDNVVPACLXXXXXXCELCVEVTTTEMPE 1358
            GEDV  C      V ++  S  S       P D      L      C +C E   T+   
Sbjct: 375  GEDVARCLANKTGVLLRNMSGSSQGILAKDPVDR--ERVLRDTDASCNVCTE--KTDDKP 430

Query: 1359 GDPQEENNNCKHKHSSVSLGSVKEFKFDS---------------------ADRDGGGPSN 1475
             +P  E   C HK +SV+  S KEF FDS                     A R+G   +N
Sbjct: 431  YNPIGEGEQCFHKQNSVN--SSKEFNFDSSKGVVSGTGGSGSEWWTNRRVAGREGRS-AN 487

Query: 1476 SWTFFPLL 1499
            SW FFP+L
Sbjct: 488  SWAFFPML 495

