BLASTX nr result

ID: Rheum21_contig00001353 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00001353
         (2779 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI29877.3| unnamed protein product [Vitis vinifera]             1203   0.0  
ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267...  1201   0.0  
ref|XP_002308967.2| exostosin family protein [Populus trichocarp...  1197   0.0  
gb|EMJ03138.1| hypothetical protein PRUPE_ppa001595mg [Prunus pe...  1191   0.0  
ref|XP_004307567.1| PREDICTED: uncharacterized protein LOC101304...  1190   0.0  
ref|XP_006493901.1| PREDICTED: uncharacterized protein LOC102626...  1181   0.0  
ref|XP_006421449.1| hypothetical protein CICLE_v10004353mg [Citr...  1181   0.0  
gb|EOY09341.1| Exostosin family protein isoform 1 [Theobroma cac...  1167   0.0  
ref|XP_004516917.1| PREDICTED: uncharacterized protein LOC101503...  1160   0.0  
ref|XP_003519065.1| PREDICTED: uncharacterized protein LOC100783...  1155   0.0  
ref|XP_006402860.1| hypothetical protein EUTSA_v10005794mg [Eutr...  1155   0.0  
ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807...  1151   0.0  
ref|XP_004161484.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...  1150   0.0  
gb|ESW17624.1| hypothetical protein PHAVU_007G255200g [Phaseolus...  1149   0.0  
ref|XP_004136589.1| PREDICTED: uncharacterized protein LOC101206...  1147   0.0  
ref|XP_006349551.1| PREDICTED: uncharacterized protein LOC102592...  1147   0.0  
ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata...  1145   0.0  
ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]...  1135   0.0  
ref|XP_006290615.1| hypothetical protein CARUB_v10016706mg [Caps...  1128   0.0  
ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]...  1126   0.0  

>emb|CBI29877.3| unnamed protein product [Vitis vinifera]
          Length = 822

 Score = 1203 bits (3113), Expect = 0.0
 Identities = 555/790 (70%), Positives = 634/790 (80%), Gaps = 2/790 (0%)
 Frame = +2

Query: 194  EIMFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTAL 373
            E+ F +QK  CS+SL+ T+AS+VAL+SV HLF F L PS + F + Q Q +C   N +  
Sbjct: 28   EMTFFLQKWKCSWSLLATVASVVALISVAHLFLFPLAPSLEYFSMGQGQKTCTPINASIR 87

Query: 374  G-GRDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNG 550
            G   D   ++   DLD RFP D H +VVYRGAPWKAEIGRWFSGCDS            G
Sbjct: 88   GVDHDGKNLQPSFDLDHRFPADSHKSVVYRGAPWKAEIGRWFSGCDSIAAEVSIIEKIGG 147

Query: 551  KPCQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSH 730
            K C+NDCSGQG+CN ELGQCRCFHGF GEGC+++L L CNYP +P+ P G WVVSIC + 
Sbjct: 148  KDCKNDCSGQGICNHELGQCRCFHGFSGEGCSERLHLDCNYPSSPEQPYGPWVVSICPAS 207

Query: 731  CDTMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPD-PNIFTTNASLPG 907
            CDT RAMCFCGEGTKYP+RP  E+CGF + +PT PG  K+ DW K D  NIFTTN S PG
Sbjct: 208  CDTTRAMCFCGEGTKYPHRPVAEACGFQMNLPTTPGDPKLVDWTKADLDNIFTTNDSKPG 267

Query: 908  WCNVDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXX 1087
            WCNVDP EAYA K+++KEECDCKYDCL+G+FCEIPV C+C+N                  
Sbjct: 268  WCNVDPTEAYALKMQYKEECDCKYDCLLGRFCEIPVLCTCVNQCSGHGHCRGGFCQCHRG 327

Query: 1088 WYGVDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPP 1267
            WYG DCSIPSV SS++EWPRWLRPA V+VP  +    ++ +L A V KKRPLIYVYDLPP
Sbjct: 328  WYGTDCSIPSVLSSVREWPRWLRPAHVEVPDDMHLSGSLVNLDAVVKKKRPLIYVYDLPP 387

Query: 1268 QFNSKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFF 1447
            +FNS LLEGRHFKFECVNRIYD +N T WTEQLYGAQMAIYES+L+SPHRT +GEEADFF
Sbjct: 388  EFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYGAQMAIYESILASPHRTLDGEEADFF 447

Query: 1448 FVPVLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIW 1627
            FVPVLDSCII RADDAPH  +  + GLRSS TLE YK AY+HI EQY +WN SSG+DHIW
Sbjct: 448  FVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTLEFYKTAYDHIVEQYPFWNRSSGRDHIW 507

Query: 1628 FFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPS 1807
            FFSWDEGACYAPKEIW+SMMLVHWGNTNSKHNHSTTAYWADNWD +S  RRG HPCFDP 
Sbjct: 508  FFSWDEGACYAPKEIWDSMMLVHWGNTNSKHNHSTTAYWADNWDSVSSDRRGNHPCFDPY 567

Query: 1808 KDLVLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVA 1987
            KDLVLPAWKRPD  +L S LW+R REQR  LFYFNGNLGPAY  GRPE TYSMGIRQKVA
Sbjct: 568  KDLVLPAWKRPDVVSLSSKLWSRPREQRKTLFYFNGNLGPAYEGGRPETTYSMGIRQKVA 627

Query: 1988 EEFGSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQG 2167
            EEFGSSPNKEG+LGKQH  DVIVTPLRS +Y   LA+SVFCGV PGDGWSGR EDSILQG
Sbjct: 628  EEFGSSPNKEGKLGKQHAEDVIVTPLRSGNYHESLASSVFCGVMPGDGWSGRFEDSILQG 687

Query: 2168 CIPVIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQR 2347
            CIPV+IQDGI+LP+EN+LNYESFAVRI EDEIPN++KILR  NETEI+FKL NV+KIWQR
Sbjct: 688  CIPVVIQDGIFLPFENMLNYESFAVRIREDEIPNLIKILRGMNETEIEFKLENVRKIWQR 747

Query: 2348 FLYRDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQL 2527
            FLYRDSILLEAERQ++++ +V+DWA++L Q SEDDVFATLIQVLHYKLHN+PWR+QL  L
Sbjct: 748  FLYRDSILLEAERQKTAFGNVEDWAVQLLQLSEDDVFATLIQVLHYKLHNDPWRQQLAHL 807

Query: 2528 KKEFGLPKQC 2557
            KK+FGL ++C
Sbjct: 808  KKDFGLAQEC 817


>ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267584 [Vitis vinifera]
          Length = 794

 Score = 1201 bits (3108), Expect = 0.0
 Identities = 554/787 (70%), Positives = 632/787 (80%), Gaps = 2/787 (0%)
 Frame = +2

Query: 203  FIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALG-G 379
            F +QK  CS+SL+ T+AS+VAL+SV HLF F L PS + F + Q Q +C   N +  G  
Sbjct: 3    FFLQKWKCSWSLLATVASVVALISVAHLFLFPLAPSLEYFSMGQGQKTCTPINASIRGVD 62

Query: 380  RDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKPC 559
             D   ++   DLD RFP D H +VVYRGAPWKAEIGRWFSGCDS            GK C
Sbjct: 63   HDGKNLQPSFDLDHRFPADSHKSVVYRGAPWKAEIGRWFSGCDSIAAEVSIIEKIGGKDC 122

Query: 560  QNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCDT 739
            +NDCSGQG+CN ELGQCRCFHGF GEGC+++L L CNYP +P+ P G WVVSIC + CDT
Sbjct: 123  KNDCSGQGICNHELGQCRCFHGFSGEGCSERLHLDCNYPSSPEQPYGPWVVSICPASCDT 182

Query: 740  MRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPD-PNIFTTNASLPGWCN 916
             RAMCFCGEGTKYP+RP  E+CGF + +PT PG  K+ DW K D  NIFTTN S PGWCN
Sbjct: 183  TRAMCFCGEGTKYPHRPVAEACGFQMNLPTTPGDPKLVDWTKADLDNIFTTNDSKPGWCN 242

Query: 917  VDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYG 1096
            VDP EAYA K+++KEECDCKYDCL+G+FCEIPV C+C+N                  WYG
Sbjct: 243  VDPTEAYALKMQYKEECDCKYDCLLGRFCEIPVLCTCVNQCSGHGHCRGGFCQCHRGWYG 302

Query: 1097 VDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFN 1276
             DCSIPSV SS++EWPRWLRPA V+VP  +    ++ +L A V KKRPLIYVYDLPP+FN
Sbjct: 303  TDCSIPSVLSSVREWPRWLRPAHVEVPDDMHLSGSLVNLDAVVKKKRPLIYVYDLPPEFN 362

Query: 1277 SKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVP 1456
            S LLEGRHFKFECVNRIYD +N T WTEQLYGAQMAIYES+L+SPHRT +GEEADFFFVP
Sbjct: 363  SLLLEGRHFKFECVNRIYDDRNATYWTEQLYGAQMAIYESILASPHRTLDGEEADFFFVP 422

Query: 1457 VLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFS 1636
            VLDSCII RADDAPH  +  + GLRSS TLE YK AY+HI EQY +WN SSG+DHIWFFS
Sbjct: 423  VLDSCIIVRADDAPHLNMHAHGGLRSSLTLEFYKTAYDHIVEQYPFWNRSSGRDHIWFFS 482

Query: 1637 WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDL 1816
            WDEGACYAPKEIW+SMMLVHWGNTNSKHNHSTTAYWADNWD +S  RRG HPCFDP KDL
Sbjct: 483  WDEGACYAPKEIWDSMMLVHWGNTNSKHNHSTTAYWADNWDSVSSDRRGNHPCFDPYKDL 542

Query: 1817 VLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEF 1996
            VLPAWKRPD  +L S LW+R REQR  LFYFNGNLGPAY  GRPE TYSMGIRQKVAEEF
Sbjct: 543  VLPAWKRPDVVSLSSKLWSRPREQRKTLFYFNGNLGPAYEGGRPETTYSMGIRQKVAEEF 602

Query: 1997 GSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIP 2176
            GSSPNKEG+LGKQH  DVIVTPLRS +Y   LA+SVFCGV PGDGWSGR EDSILQGCIP
Sbjct: 603  GSSPNKEGKLGKQHAEDVIVTPLRSGNYHESLASSVFCGVMPGDGWSGRFEDSILQGCIP 662

Query: 2177 VIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLY 2356
            V+IQDGI+LP+EN+LNYESFAVRI EDEIPN++KILR  NETEI+FKL NV+KIWQRFLY
Sbjct: 663  VVIQDGIFLPFENMLNYESFAVRIREDEIPNLIKILRGMNETEIEFKLENVRKIWQRFLY 722

Query: 2357 RDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQLKKE 2536
            RDSILLEAERQ++++ +V+DWA++L Q SEDDVFATLIQVLHYKLHN+PWR+QL  LKK+
Sbjct: 723  RDSILLEAERQKTAFGNVEDWAVQLLQLSEDDVFATLIQVLHYKLHNDPWRQQLAHLKKD 782

Query: 2537 FGLPKQC 2557
            FGL ++C
Sbjct: 783  FGLAQEC 789


>ref|XP_002308967.2| exostosin family protein [Populus trichocarpa]
            gi|550335517|gb|EEE92490.2| exostosin family protein
            [Populus trichocarpa]
          Length = 793

 Score = 1197 bits (3096), Expect = 0.0
 Identities = 553/789 (70%), Positives = 637/789 (80%), Gaps = 3/789 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALGG 379
            M  I K  CS+SL+ TIASIVALVSVVHLF F + PS+D F + Q Q SC G N  ++ G
Sbjct: 1    MITISKWKCSWSLMATIASIVALVSVVHLFLFPVVPSFDPFSVWQVQDSC-GPNNESVDG 59

Query: 380  R---DELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNG 550
            R   D   ++  LDL+ +FP DLH AV YR APWKAEIGRW SGCD+           +G
Sbjct: 60   RTGHDPGNLQPVLDLEHKFPADLHRAVFYRNAPWKAEIGRWLSGCDAVTKEVSVVETISG 119

Query: 551  KPCQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSH 730
            + C+NDCSGQGVCN ELGQCRCFHGF GEGC+++L L+CNYP +P+LP GRWVVSICS+H
Sbjct: 120  RSCKNDCSGQGVCNYELGQCRCFHGFSGEGCSERLHLECNYPKSPELPYGRWVVSICSAH 179

Query: 731  CDTMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGW 910
            CD  RAMCFCGEGTKYPNRPA E+CGF + +P+E GA +  DW KPD +I+TTN S  GW
Sbjct: 180  CDPTRAMCFCGEGTKYPNRPAAETCGFQLSLPSEIGAPRQVDWAKPDLDIYTTNKSKLGW 239

Query: 911  CNVDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXW 1090
            CNVDP E YA KVKFKEECDCKYDCL G+FCE+PVQCSCIN                  W
Sbjct: 240  CNVDPAEGYANKVKFKEECDCKYDCLSGRFCEVPVQCSCINQCSGHGHCRGGFCQCANGW 299

Query: 1091 YGVDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQ 1270
            YG DCSIPSV SS++EWPRWLRPAQ+DVP +      +  L+A V KKRPLIY+YDLPP+
Sbjct: 300  YGTDCSIPSVTSSVREWPRWLRPAQLDVPDNAHLTGKLVDLNAVVKKKRPLIYIYDLPPK 359

Query: 1271 FNSKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFF 1450
            FNS LLEGRHFKFECVNR+Y+  N T+WT+QLYGAQMA+YES+L+SP+RT NGEEADFFF
Sbjct: 360  FNSLLLEGRHFKFECVNRLYNDNNATIWTDQLYGAQMALYESILASPYRTLNGEEADFFF 419

Query: 1451 VPVLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWF 1630
            VPVLDSCIITRADDAPH ++  + GLRSS TLE Y+KAY+HI E Y +WN SSG+DHIW 
Sbjct: 420  VPVLDSCIITRADDAPHLSMEQHLGLRSSLTLEFYRKAYDHIVEHYPFWNRSSGRDHIWS 479

Query: 1631 FSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSK 1810
            FSWDEGACYAPKEIWNSMM+VHWGNTNSKHNHSTTAYWADNWD IS  RRGKHPCFDP K
Sbjct: 480  FSWDEGACYAPKEIWNSMMVVHWGNTNSKHNHSTTAYWADNWDKISSDRRGKHPCFDPDK 539

Query: 1811 DLVLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAE 1990
            DLVLPAWKRPD   L + LWAR  E+R  LFYFNGNLGPAY NGRPEA YSMGIRQK+AE
Sbjct: 540  DLVLPAWKRPDVNALSTKLWARPLEKRKTLFYFNGNLGPAYLNGRPEALYSMGIRQKLAE 599

Query: 1991 EFGSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGC 2170
            EFGS+PNK+G LGKQH  +VIV+PLRS  Y  DLA+SVFCGV PGDGWSGRMEDSILQGC
Sbjct: 600  EFGSTPNKDGNLGKQHAENVIVSPLRSESYHEDLASSVFCGVMPGDGWSGRMEDSILQGC 659

Query: 2171 IPVIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRF 2350
            IPV+IQDGIYLPYENVLNYESFAVRI EDEIPN++KIL+ FNETEI+ KL +VQKI QRF
Sbjct: 660  IPVVIQDGIYLPYENVLNYESFAVRILEDEIPNLIKILQGFNETEIENKLTSVQKIGQRF 719

Query: 2351 LYRDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQLK 2530
            LYRDS+LLEAERQ++++ +V+DWA+E  + +EDDV AT +QVLHYKLHN+PWRRQL   K
Sbjct: 720  LYRDSMLLEAERQKTAFGYVEDWAVEFLRLTEDDVVATFVQVLHYKLHNDPWRRQLGSQK 779

Query: 2531 KEFGLPKQC 2557
            K+FGLP++C
Sbjct: 780  KDFGLPQEC 788


>gb|EMJ03138.1| hypothetical protein PRUPE_ppa001595mg [Prunus persica]
          Length = 795

 Score = 1191 bits (3080), Expect = 0.0
 Identities = 551/794 (69%), Positives = 641/794 (80%), Gaps = 8/794 (1%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSV-----VHLFFFNLGPSWDNFGIRQTQTSCLGTNG 364
            M  IQK  CS+S I TIASIVAL S+     VHLF+F L PS++ F   Q Q SC+  NG
Sbjct: 1    MLSIQKWKCSWSQIATIASIVALASIILGSIVHLFWFPLVPSFNYFS--QAQNSCVPING 58

Query: 365  TALGGRDELP--VEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXX 538
            +A    D +    + P+DLD +FP DLH AVV+RGAPWKAEIGRW SGCD          
Sbjct: 59   SAEAVIDNVKGNFKPPIDLDRQFPSDLHKAVVFRGAPWKAEIGRWLSGCDPISDEVNIVE 118

Query: 539  XXNGKPCQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSI 718
               G  C+NDCSGQGVCN ELGQCRC+HG+ GEGC+++L+L+CNYPG+PD P GRWVVSI
Sbjct: 119  VIGGSGCKNDCSGQGVCNRELGQCRCYHGYSGEGCSERLQLECNYPGSPDQPYGRWVVSI 178

Query: 719  CSSHCDTMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPD-PNIFTTNA 895
            CS+HCDT RA CFCGEGTKYPNRP  E+CGF + +P+EPGA K+TDW K D  N+FT N 
Sbjct: 179  CSAHCDTTRAFCFCGEGTKYPNRPVAEACGFQVQLPSEPGAPKLTDWAKADLDNVFTKNG 238

Query: 896  SLPGWCNVDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXX 1075
            S PGWCNVDP E YA KV+FKEECDCKYDC  G+FCE+PV C+CIN              
Sbjct: 239  SKPGWCNVDPAEVYAHKVQFKEECDCKYDCFWGRFCEVPVLCTCINQCSGHGHCRGGFCQ 298

Query: 1076 XXXXWYGVDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVY 1255
                WYG+DCSIPSV SS++EWP+WLRPAQVDVP S      + +L+A V KKRPLIYVY
Sbjct: 299  CDNGWYGIDCSIPSVTSSVREWPQWLRPAQVDVPDSSHLPGKVVNLNAVVKKKRPLIYVY 358

Query: 1256 DLPPQFNSKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEE 1435
            DLPP FNS LLEGRHF+ ECVNRIYD +N T+WT+QLYGAQ+A+YES+L+SP+RT NGEE
Sbjct: 359  DLPPDFNSLLLEGRHFRLECVNRIYDGKNSTLWTDQLYGAQVALYESILASPYRTLNGEE 418

Query: 1436 ADFFFVPVLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGK 1615
            ADFFFVPVLDSCIITRADDAPH +++ ++GLRSS TLE Y+KAY+HI EQY +WN SSG+
Sbjct: 419  ADFFFVPVLDSCIITRADDAPHLSMQ-HKGLRSSLTLEYYRKAYDHIVEQYPFWNRSSGR 477

Query: 1616 DHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPC 1795
            DHIWFFSWDEGACYAPKEIWNSMMLVHWGNTN KH HSTTAYWADNWD I   +RG HPC
Sbjct: 478  DHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNLKHKHSTTAYWADNWDTIPSDKRGNHPC 537

Query: 1796 FDPSKDLVLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIR 1975
            FDP KDLVLP+WK PD  +L S LWARS + R  LFYFNGNLGPAY NGRPEA+YSMGIR
Sbjct: 538  FDPDKDLVLPSWKSPDVNSLSSKLWARSHDTRKTLFYFNGNLGPAYPNGRPEASYSMGIR 597

Query: 1976 QKVAEEFGSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDS 2155
            QK+AEEFGSSPNKEG+LGKQH  DVIVTPLRS +Y GDLA+S+FCGVFPGDGWSGRMEDS
Sbjct: 598  QKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSENYHGDLASSIFCGVFPGDGWSGRMEDS 657

Query: 2156 ILQGCIPVIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQK 2335
            ILQGCIPV+IQDGI+LPYENVLNY+S+AVRI EDEIP+++ ILRAFNETEI+F+L NVQK
Sbjct: 658  ILQGCIPVVIQDGIFLPYENVLNYDSYAVRIREDEIPDLINILRAFNETEIKFRLENVQK 717

Query: 2336 IWQRFLYRDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQ 2515
            IWQRFLYRDSI+LEAERQ++ + H++DWA + SQ  EDDV AT +QVLHYKLHN+PWR+ 
Sbjct: 718  IWQRFLYRDSIMLEAERQKTDFGHMEDWAAQFSQLIEDDVVATFVQVLHYKLHNDPWRQH 777

Query: 2516 LQQLKKEFGLPKQC 2557
            +  +KKEFGLP++C
Sbjct: 778  V-HVKKEFGLPQEC 790


>ref|XP_004307567.1| PREDICTED: uncharacterized protein LOC101304329 [Fragaria vesca
            subsp. vesca]
          Length = 791

 Score = 1190 bits (3078), Expect = 0.0
 Identities = 552/792 (69%), Positives = 642/792 (81%), Gaps = 6/792 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIV-----ALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNG 364
            MF I +   S+S+I TIASIV     AL S+VHLFFF L PS++ F   Q Q SC+  NG
Sbjct: 1    MFSILRWKGSWSMIATIASIVGLISLALASIVHLFFFPLVPSFNYFS--QAQNSCVPING 58

Query: 365  TALGGRDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXX 544
            +A    D +     +DL+ +FP DLH AVVYRGAPWKAEIGRW +GC S           
Sbjct: 59   SAEAITDHIK---GIDLEYQFPSDLHKAVVYRGAPWKAEIGRWLAGCLSITNEVNIVELI 115

Query: 545  NGKPCQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICS 724
             G  C+NDCSGQGVCN ELGQCRCFHG+ GEGC++ L+L+CNYPG+PD P GRWVVSICS
Sbjct: 116  GGSGCKNDCSGQGVCNRELGQCRCFHGYSGEGCSETLQLECNYPGSPDQPYGRWVVSICS 175

Query: 725  SHCDTMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPD-PNIFTTNASL 901
            +HCDT +AMCFCGEGTKYPNRP  E+CGF +  P++PGA K+TDW K D  N+ TTN+S 
Sbjct: 176  AHCDTKKAMCFCGEGTKYPNRPVAEACGFQVKPPSKPGAPKLTDWEKADLDNLLTTNSSK 235

Query: 902  PGWCNVDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXX 1081
            PGWCNVDP EAYA KV+FK+ECDCKYDCL+G+FCE+PV C+CIN                
Sbjct: 236  PGWCNVDPAEAYALKVQFKQECDCKYDCLLGRFCEVPVLCTCINQCSGHGHCRGGFCQCN 295

Query: 1082 XXWYGVDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDL 1261
              WYG+DCSIPSV SS++EWP+WLRPAQV++P +      + +L+A V KKRPLIYVYDL
Sbjct: 296  NGWYGIDCSIPSVASSVREWPQWLRPAQVNIPDNSHLTGKVVNLNAVVKKKRPLIYVYDL 355

Query: 1262 PPQFNSKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEAD 1441
            PP FNS LLEGRHFKFECVNRIYD  N TVWT+ LYG+QMA+YES+L+SP+RT NGEEAD
Sbjct: 356  PPDFNSLLLEGRHFKFECVNRIYDDLNSTVWTDMLYGSQMALYESILASPYRTLNGEEAD 415

Query: 1442 FFFVPVLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDH 1621
            FFFVPVLDSCIITRADDAPH ++++++GLRSS TLE YKKAY+HI EQY +WNHSSG+DH
Sbjct: 416  FFFVPVLDSCIITRADDAPHLSMQEHKGLRSSLTLEYYKKAYDHIVEQYPFWNHSSGRDH 475

Query: 1622 IWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFD 1801
            IWFFSWDEGACYAPKEIWNSMML+HWGNTNSKH HSTTAYW DNW+ IS  RRG HPCFD
Sbjct: 476  IWFFSWDEGACYAPKEIWNSMMLIHWGNTNSKHKHSTTAYWGDNWNDISSDRRGNHPCFD 535

Query: 1802 PSKDLVLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQK 1981
            P KDLVLPAWK PD  +L S LWAR  E R  LFYFNGNLGPAY NGRPE TYSMGIRQK
Sbjct: 536  PEKDLVLPAWKSPDVNSLSSKLWARPHEMRKTLFYFNGNLGPAYPNGRPENTYSMGIRQK 595

Query: 1982 VAEEFGSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSIL 2161
            +AEEFGSSPNKEG+LGKQH  DVIVTPLRS +Y  D+A+S+FCGVFPGDGWSGRMEDSIL
Sbjct: 596  LAEEFGSSPNKEGKLGKQHAEDVIVTPLRSENYHEDIASSIFCGVFPGDGWSGRMEDSIL 655

Query: 2162 QGCIPVIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIW 2341
            QGCIPV+IQDGI+LPYENVLNYESFAVRI EDEI N++ ILRAFNETEI+F+LANVQ+IW
Sbjct: 656  QGCIPVVIQDGIFLPYENVLNYESFAVRIREDEISNLINILRAFNETEIKFRLANVQQIW 715

Query: 2342 QRFLYRDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQ 2521
            QRFLYRDSILLEAERQ++S+  + DWA++ SQ  EDDVF T +QVLHYKLHN+PWR+ + 
Sbjct: 716  QRFLYRDSILLEAERQKTSFGRMGDWAVQFSQLIEDDVFQTFVQVLHYKLHNDPWRQHV- 774

Query: 2522 QLKKEFGLPKQC 2557
            ++KKEFGLP++C
Sbjct: 775  RVKKEFGLPQEC 786


>ref|XP_006493901.1| PREDICTED: uncharacterized protein LOC102626477 isoform X1 [Citrus
            sinensis]
          Length = 791

 Score = 1181 bits (3056), Expect = 0.0
 Identities = 543/790 (68%), Positives = 633/790 (80%), Gaps = 4/790 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQT-QTSCLGTNGTALG 376
            M  I+K   S++L+ T+AS++ LVSVVHLF F L PS+D F  RQ  Q SC+    +A G
Sbjct: 1    MISIEKWRFSWTLVATVASVLTLVSVVHLFLFPLVPSFDYFTARQQIQNSCVPIKESAEG 60

Query: 377  GRDELPVEVP--LDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNG 550
              + +    P  L+LD RFP DLHNAVVYR APWKAEIGRW SGCDS            G
Sbjct: 61   VTNRVWENSPPQLNLDHRFPADLHNAVVYRNAPWKAEIGRWLSGCDSVAKEVDLVEMIGG 120

Query: 551  KPCQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSH 730
            K C++DCSGQGVCN ELGQCRCFHGF G+GC++++  QCN+P TP+LP GRWVVSIC +H
Sbjct: 121  KSCKSDCSGQGVCNHELGQCRCFHGFRGKGCSERIHFQCNFPKTPELPYGRWVVSICPTH 180

Query: 731  CDTMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPD-PNIFTTNASLPG 907
            CDT RAMCFCGEGTKYPNRP  E+CGF + +P++PGA K TDW K D  NIFTTN S PG
Sbjct: 181  CDTTRAMCFCGEGTKYPNRPVAEACGFQVNLPSQPGAPKSTDWAKADLDNIFTTNGSKPG 240

Query: 908  WCNVDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXX 1087
            WCNVDP EAYA KV+FKEECDCKYD L+GQFCE+PV  +C+N                  
Sbjct: 241  WCNVDPEEAYALKVQFKEECDCKYDGLLGQFCEVPVSSTCVNQCSGHGHCRGGFCQCDNG 300

Query: 1088 WYGVDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPP 1267
            WYGVDCSIPSV SS+ EWP+WLRPA +D+P +     N+ +L+A V KKRPL+YVYDLPP
Sbjct: 301  WYGVDCSIPSVMSSMSEWPQWLRPAHIDIPINANITGNLVNLNAVVKKKRPLVYVYDLPP 360

Query: 1268 QFNSKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFF 1447
            +FNS LLEGRH+K ECVNRIY+ +N+T+WT+ LYG+QMA YES+L+SPHRT NGEEADFF
Sbjct: 361  EFNSLLLEGRHYKLECVNRIYNEKNETLWTDMLYGSQMAFYESILASPHRTLNGEEADFF 420

Query: 1448 FVPVLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIW 1627
            FVPVLDSCIITRADDAPH + ++++GLRSS TLE YKKAYEHI E Y YWN +SG+DHIW
Sbjct: 421  FVPVLDSCIITRADDAPHLSAQEHRGLRSSLTLEFYKKAYEHIIEHYPYWNRTSGRDHIW 480

Query: 1628 FFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPS 1807
            FFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWD IS  RRG H CFDP 
Sbjct: 481  FFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDRISSSRRGNHSCFDPE 540

Query: 1808 KDLVLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVA 1987
            KDLVLPAWK PD   L+S LWA  RE+R  LFYFNGNLG AY NGRPE++YSMG+RQK+A
Sbjct: 541  KDLVLPAWKAPDAFVLRSKLWASPREKRKTLFYFNGNLGSAYPNGRPESSYSMGVRQKLA 600

Query: 1988 EEFGSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQG 2167
            EE+GSSPNKEG+LGKQH  DVIVT LRS +Y  DL++SVFCGV PGDGWSGRMEDSILQG
Sbjct: 601  EEYGSSPNKEGKLGKQHAEDVIVTSLRSENYHEDLSSSVFCGVLPGDGWSGRMEDSILQG 660

Query: 2168 CIPVIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQR 2347
            CIPV+IQDGI+LPYENVLNYESF VRI EDEIPN++ ILR  NETEIQF+LANVQK+WQR
Sbjct: 661  CIPVVIQDGIFLPYENVLNYESFVVRISEDEIPNLINILRGLNETEIQFRLANVQKVWQR 720

Query: 2348 FLYRDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQL 2527
            FLYRDSILLEA+RQ + +  ++DWA+E  +  EDDVF TLIQ+LHYKLHN+PWRR+L   
Sbjct: 721  FLYRDSILLEAKRQNAKFGRMNDWAVEFLKLREDDVFTTLIQILHYKLHNDPWRRELVHQ 780

Query: 2528 KKEFGLPKQC 2557
            KK+FG+P++C
Sbjct: 781  KKDFGIPQEC 790


>ref|XP_006421449.1| hypothetical protein CICLE_v10004353mg [Citrus clementina]
            gi|557523322|gb|ESR34689.1| hypothetical protein
            CICLE_v10004353mg [Citrus clementina]
          Length = 791

 Score = 1181 bits (3056), Expect = 0.0
 Identities = 542/790 (68%), Positives = 635/790 (80%), Gaps = 4/790 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQT-QTSCLGTNGTALG 376
            M  I+K   S++L+ T+AS++ LVSVVHLF F L PS+D F  RQ  Q SC+    +A G
Sbjct: 1    MISIEKWRFSWTLVATVASVLTLVSVVHLFLFPLVPSFDYFTARQQIQNSCVPIKESAEG 60

Query: 377  GRDELPVEVP--LDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNG 550
              + +    P  L+LD RFP DLHNAVVYR APWKAEIGRW SGCDS            G
Sbjct: 61   VTNRVWENSPPQLNLDHRFPADLHNAVVYRNAPWKAEIGRWLSGCDSVAKEVDLVEMIGG 120

Query: 551  KPCQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSH 730
            K C++DCSGQGVCN ELGQCRCFHGF G+GC++++  QCN+P TP+LP GRWVVSIC +H
Sbjct: 121  KSCKSDCSGQGVCNHELGQCRCFHGFRGKGCSERIHFQCNFPKTPELPYGRWVVSICPTH 180

Query: 731  CDTMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPD-PNIFTTNASLPG 907
            CDT RAMCFCGEGTKYPNRP  E+CGF + +P++PGA K+T+W K D  NIFTTN S PG
Sbjct: 181  CDTTRAMCFCGEGTKYPNRPVAEACGFQVNLPSQPGAPKLTNWAKADLDNIFTTNGSKPG 240

Query: 908  WCNVDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXX 1087
            WCN+DP EAYA KV+FKEECDCKYD L+GQFCE+PV  +C+N                  
Sbjct: 241  WCNIDPKEAYALKVQFKEECDCKYDGLLGQFCEVPVSSTCVNQCSGHGHCRGGFCQCDNG 300

Query: 1088 WYGVDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPP 1267
            WYGVDCSIPSV SS+ EWP+WLRPA +D+P +     N+ +L+A V KKRPL+YVYDLPP
Sbjct: 301  WYGVDCSIPSVMSSMSEWPQWLRPAHIDIPINANITGNLVNLNAVVKKKRPLLYVYDLPP 360

Query: 1268 QFNSKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFF 1447
            +FNS LLEGRH+K ECVNRIY+ +N+T+WT+ LYG+QMA YES+L+SPHRT NGEEADFF
Sbjct: 361  EFNSLLLEGRHYKLECVNRIYNEKNETLWTDMLYGSQMAFYESILASPHRTLNGEEADFF 420

Query: 1448 FVPVLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIW 1627
            FVPVLDSCIITRADDAPH + ++++ LRSS TLE YKKAYEHI E Y YWNH+SG+DHIW
Sbjct: 421  FVPVLDSCIITRADDAPHLSAQEHRSLRSSLTLEFYKKAYEHIIEHYPYWNHTSGRDHIW 480

Query: 1628 FFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPS 1807
            FFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWD IS  RRG H CFDP 
Sbjct: 481  FFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDRISSSRRGNHSCFDPE 540

Query: 1808 KDLVLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVA 1987
            KDLVLPAWK PD   L+S LWA  RE+R  LFYFNGNLG AY NGRPE++YSMGIRQK+A
Sbjct: 541  KDLVLPAWKAPDAFVLRSKLWASPREKRKTLFYFNGNLGSAYPNGRPESSYSMGIRQKLA 600

Query: 1988 EEFGSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQG 2167
            EE+GSSPNKEG+LGKQH  DVIVT LRS +Y  DL++SVFCGV PGDGWSGRMEDSILQG
Sbjct: 601  EEYGSSPNKEGKLGKQHAEDVIVTSLRSENYHEDLSSSVFCGVLPGDGWSGRMEDSILQG 660

Query: 2168 CIPVIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQR 2347
            CIPV+IQDGI+LPYENVLNYESF VRI EDEIPN++ ILR  NETEIQF+LANVQK+WQR
Sbjct: 661  CIPVVIQDGIFLPYENVLNYESFVVRISEDEIPNLINILRGLNETEIQFRLANVQKVWQR 720

Query: 2348 FLYRDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQL 2527
            FLYRDSILLEA+RQ +++  ++DWA+E  +  EDDVF TLIQ+LHYKLHN+PWRR+L   
Sbjct: 721  FLYRDSILLEAKRQNATFGRMNDWAVEFLKLREDDVFTTLIQILHYKLHNDPWRRELVHQ 780

Query: 2528 KKEFGLPKQC 2557
            KK+FG+P++C
Sbjct: 781  KKDFGIPQEC 790


>gb|EOY09341.1| Exostosin family protein isoform 1 [Theobroma cacao]
            gi|508717445|gb|EOY09342.1| Exostosin family protein
            isoform 1 [Theobroma cacao]
          Length = 794

 Score = 1167 bits (3018), Expect = 0.0
 Identities = 533/790 (67%), Positives = 629/790 (79%), Gaps = 3/790 (0%)
 Frame = +2

Query: 197  IMFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALG 376
            +MF +QK  CS+SL+ T+AS++  VSVVHLF F + PS+D F   Q Q  C+  N +   
Sbjct: 1    MMFSVQKWKCSWSLVATVASVIVPVSVVHLFLFPVVPSFDYFRAPQVQYKCVPINASVEK 60

Query: 377  GRDEL--PVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNG 550
              D +   ++  LDLD RFP DLHN VVY  APWKAEIG+W S CD+            G
Sbjct: 61   VADHVWENIQPGLDLDHRFPSDLHNGVVYHNAPWKAEIGQWLSSCDAIAREVNIVETIGG 120

Query: 551  KPCQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSH 730
            + C+ DCSGQGVCN E+GQCRCFHGF GE C++++ L CNYP TP+LP GRWVVSIC +H
Sbjct: 121  RRCKADCSGQGVCNHEMGQCRCFHGFSGEECSERVHLSCNYPKTPELPYGRWVVSICPAH 180

Query: 731  CDTMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPD-PNIFTTNASLPG 907
            CDT RAMCFCGEGTKYPNRP  E+CGF + +P+EPG  K+TDW K D  NIFTTN S PG
Sbjct: 181  CDTTRAMCFCGEGTKYPNRPVAEACGFQMNLPSEPGGPKLTDWSKADLDNIFTTNGSKPG 240

Query: 908  WCNVDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXX 1087
            WCNVDP  AYA KV FKEECDCKYD L G+FCE+PV+  CIN                  
Sbjct: 241  WCNVDPDAAYASKVLFKEECDCKYDGLWGRFCEVPVESVCINQCSGHGHCRGGFCQCYNG 300

Query: 1088 WYGVDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPP 1267
            WYG DCSIPSV S + EWP+WLRPAQVD+P S+++  ++ +L A V KKRPLIYVYDLPP
Sbjct: 301  WYGTDCSIPSVVSPMGEWPKWLRPAQVDIP-SIEHTGSLVNLDAAVKKKRPLIYVYDLPP 359

Query: 1268 QFNSKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFF 1447
            +FNS LLEGRHFKFECVNRIYD +N T+WT+QLYG+QMA+YES+L+SP+RT NGEEADFF
Sbjct: 360  EFNSLLLEGRHFKFECVNRIYDDRNATLWTDQLYGSQMALYESILASPYRTLNGEEADFF 419

Query: 1448 FVPVLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIW 1627
            FVPVLDSCIITRADDAPH ++ ++ GLRSS TLE Y+KAY+HI E+Y YWN S+G+DH+W
Sbjct: 420  FVPVLDSCIITRADDAPHLSMENHTGLRSSLTLEFYRKAYDHIVEKYAYWNRSAGRDHVW 479

Query: 1628 FFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPS 1807
             FSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWD I   RRG HPCFDP+
Sbjct: 480  SFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDKIPSDRRGNHPCFDPA 539

Query: 1808 KDLVLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVA 1987
            KDLVLPAWK PD   L + LW+R RE+R  LFYFNGNLGPA+ +GRPE TYSMGIRQK+A
Sbjct: 540  KDLVLPAWKHPDVTALSAKLWSRPREKRKTLFYFNGNLGPAFTSGRPETTYSMGIRQKLA 599

Query: 1988 EEFGSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQG 2167
            +EFGS+PNKEG+LGKQH  DVIVT LRS++Y  D+A S FCGV PGDGWSGRMEDS+LQG
Sbjct: 600  DEFGSTPNKEGKLGKQHAEDVIVTSLRSNNYHEDIANSTFCGVLPGDGWSGRMEDSVLQG 659

Query: 2168 CIPVIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQR 2347
            CIPV+IQDGI+LPYENVLNYESFAVRI EDEIPN++KIL+  NE+EI+FKLANVQKI QR
Sbjct: 660  CIPVVIQDGIFLPYENVLNYESFAVRIREDEIPNLIKILQGINESEIEFKLANVQKIQQR 719

Query: 2348 FLYRDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQL 2527
            FLYR+SILLEAERQ++ +  ++DWA++  Q +EDDVF T +QVLHYKLHN+PWRRQL  L
Sbjct: 720  FLYRNSILLEAERQKTLFGRLEDWAVQFLQQTEDDVFTTFLQVLHYKLHNDPWRRQLAHL 779

Query: 2528 KKEFGLPKQC 2557
            KKE+G+P +C
Sbjct: 780  KKEYGVPPEC 789


>ref|XP_004516917.1| PREDICTED: uncharacterized protein LOC101503851 isoform X1 [Cicer
            arietinum] gi|502181977|ref|XP_004516918.1| PREDICTED:
            uncharacterized protein LOC101503851 isoform X2 [Cicer
            arietinum]
          Length = 796

 Score = 1160 bits (3001), Expect = 0.0
 Identities = 539/787 (68%), Positives = 621/787 (78%), Gaps = 1/787 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALGG 379
            +F ++   CS+SL  +IAS+VA+VSVVHLF F L PS+D F +     SC+  N ++   
Sbjct: 8    LFSMKNWRCSWSLAASIASVVAMVSVVHLFLFPLTPSFDYFKL--ASDSCVSNNVSSADL 65

Query: 380  RDELPVEVP-LDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKP 556
                 +E P +DL  RFP DLH++V Y+GA WKAEIGRW SGCDS            G  
Sbjct: 66   VSNHGLEEPAIDLKYRFPADLHSSVAYKGALWKAEIGRWLSGCDSITKDVNISEIIGGND 125

Query: 557  CQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCD 736
            C+NDCSG GVCN ELGQCRCFHG+VG+GC    EL+CN+PG+   P GRWVVSIC ++CD
Sbjct: 126  CKNDCSGLGVCNRELGQCRCFHGYVGDGCVDIQELECNFPGSLHEPFGRWVVSICPANCD 185

Query: 737  TMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGWCN 916
              RAMCFCGEGTKYP RP  ESCGF    P+EPG  K+ +W K D ++FTTN S+PGWCN
Sbjct: 186  KTRAMCFCGEGTKYPYRPLAESCGFQYNQPSEPGGPKIVNWTKVDQDVFTTNGSIPGWCN 245

Query: 917  VDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYG 1096
            VDP++AY GKVKFKEEC C YD  +G+FCE+PVQ  CIN                  WYG
Sbjct: 246  VDPVDAYEGKVKFKEECHCPYDGFIGRFCEVPVQSICINQCNGHGQCRGGFCQCDNGWYG 305

Query: 1097 VDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFN 1276
             DCSIPSV SSI+EWP WLRPA+VDVP ++   + + +L+A V KKRPLIY+YDLPP+FN
Sbjct: 306  ADCSIPSVISSIREWPSWLRPARVDVPDNIHVSEKLINLNAVVAKKRPLIYIYDLPPEFN 365

Query: 1277 SKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVP 1456
            S LLEGRHFK ECVNRIYD  N T+WTEQLYGAQMAIYESLL+SPHRT NGEEADFFFVP
Sbjct: 366  SLLLEGRHFKLECVNRIYDGNNATIWTEQLYGAQMAIYESLLASPHRTLNGEEADFFFVP 425

Query: 1457 VLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFS 1636
            +LDSCIITR DDAPH +L+++ GLRSS TLE  KKAY HI EQY YWNHSSG+DHIWFFS
Sbjct: 426  ILDSCIITRGDDAPHLSLQEHSGLRSSLTLEYSKKAYYHIVEQYPYWNHSSGRDHIWFFS 485

Query: 1637 WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDL 1816
            WDEGACYAPKEIWNSMMLVHWGNTN+KHNHSTTAYWADNWD IS  RRG HPCFDP KDL
Sbjct: 486  WDEGACYAPKEIWNSMMLVHWGNTNTKHNHSTTAYWADNWDTISSDRRGIHPCFDPDKDL 545

Query: 1817 VLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEF 1996
            VLPAWK PD   L   LWAR RE+R  LFYFNGNLGPAY +GRPE +YSMGIRQK+ EEF
Sbjct: 546  VLPAWKVPDANMLTMKLWARPREKRKTLFYFNGNLGPAYPHGRPEYSYSMGIRQKLGEEF 605

Query: 1997 GSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIP 2176
            GSSPNK+G+LGKQH  DVIVTP+RS +Y  D+A SVFCGVFPGDGWSGRMEDS+LQGCIP
Sbjct: 606  GSSPNKDGKLGKQHAEDVIVTPVRSDNYHADIANSVFCGVFPGDGWSGRMEDSVLQGCIP 665

Query: 2177 VIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLY 2356
            V+IQDGI+LPYENVLNY+SFAVRI E+EIPNM+KILR FN+TEI  KLANVQKIWQRFLY
Sbjct: 666  VVIQDGIFLPYENVLNYDSFAVRIPEEEIPNMIKILRGFNDTEINLKLANVQKIWQRFLY 725

Query: 2357 RDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQLKKE 2536
            R+SILLEAERQ++++ HVDDWA+E  + +EDDV ATLIQVLHYKLHN+PWR+ +   KK 
Sbjct: 726  RNSILLEAERQKTAFGHVDDWAVEFLRLTEDDVTATLIQVLHYKLHNDPWRKLVGHNKK- 784

Query: 2537 FGLPKQC 2557
            FGLP QC
Sbjct: 785  FGLPNQC 791


>ref|XP_003519065.1| PREDICTED: uncharacterized protein LOC100783624 [Glycine max]
          Length = 795

 Score = 1155 bits (2989), Expect = 0.0
 Identities = 539/789 (68%), Positives = 624/789 (79%), Gaps = 3/789 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTAL-- 373
            +F + K  CS+SL  TIAS+VALVSVVHLF F L P+++ F I   Q SC  TN +A   
Sbjct: 8    LFSMNKWRCSWSLAATIASVVALVSVVHLFLFPLTPTFNYFKI--AQDSCFPTNASAEFP 65

Query: 374  GGRDELPVEVP-LDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNG 550
              RD+   E P +D   +FP DLH A VY+GAPWKAEIG+W +GCDS            G
Sbjct: 66   SNRDQ---EWPAVDFKRQFPADLHGAFVYQGAPWKAEIGQWLAGCDSVIKEVNITEIIGG 122

Query: 551  KPCQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSH 730
              C+ DCSGQGVCN ELGQCRCFHG+ G+GC ++L+LQCN+ G+PD P GRWVVSIC ++
Sbjct: 123  NNCKKDCSGQGVCNLELGQCRCFHGYSGDGCTEKLQLQCNFLGSPDQPFGRWVVSICPAN 182

Query: 731  CDTMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGW 910
            CD  RAMCFCGEGTKYPNRP  E+CGF    P+EP   ++ +W K D ++FTTN S+PGW
Sbjct: 183  CDKTRAMCFCGEGTKYPNRPLAETCGFQFNPPSEPDGPRIVNWTKIDQDVFTTNRSIPGW 242

Query: 911  CNVDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXW 1090
            CNVDP EAYAGK K KEECDCKYD L G+ CE+PV+  CIN                  W
Sbjct: 243  CNVDPAEAYAGKAKIKEECDCKYDGLAGRLCEVPVESVCINQCSGHGHCRGGFCQCDNGW 302

Query: 1091 YGVDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQ 1270
            YGVDCS+PSV SSIKEWP WLRPA++D+       + + +L+A V KKRPL+YVYDLPP+
Sbjct: 303  YGVDCSMPSVISSIKEWPSWLRPARIDIADDTHANEKMINLNAVVAKKRPLVYVYDLPPE 362

Query: 1271 FNSKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFF 1450
            FNS LLEGRHFK ECVNRIYD  N TVWT+QLYGAQ+A+YESLL+SPHRT NGEEADFFF
Sbjct: 363  FNSLLLEGRHFKLECVNRIYDGNNITVWTDQLYGAQIALYESLLASPHRTLNGEEADFFF 422

Query: 1451 VPVLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWF 1630
            VPVLDSCIITRADDAPH +++++ GLRSS TLE YKKAY HI EQY YWN SSG+DH+W 
Sbjct: 423  VPVLDSCIITRADDAPHLSMQEHMGLRSSLTLEYYKKAYIHIVEQYPYWNRSSGRDHVWS 482

Query: 1631 FSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSK 1810
            FSWDEGACYAPKEIWNSMMLVHWGNTN+KHNHSTTAYWADNWD IS  +RG HPCFDP K
Sbjct: 483  FSWDEGACYAPKEIWNSMMLVHWGNTNTKHNHSTTAYWADNWDKISSDKRGTHPCFDPDK 542

Query: 1811 DLVLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAE 1990
            DLVLPAWK PD   L S LWA S E+R  LFYFNGNLGPAY +GRPE TYSMGIRQK+AE
Sbjct: 543  DLVLPAWKVPDANVLTSKLWAWSHEKRKTLFYFNGNLGPAYPHGRPEDTYSMGIRQKLAE 602

Query: 1991 EFGSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGC 2170
            EFGSSPNK+G+LGKQH  DVIVTP RS +Y  DLA+SVFCGVFPGDGWSGRMEDSILQGC
Sbjct: 603  EFGSSPNKDGKLGKQHAKDVIVTPERSENYHLDLASSVFCGVFPGDGWSGRMEDSILQGC 662

Query: 2171 IPVIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRF 2350
            IPV+IQDGI+LPYENVLNY+SFAVRI E EIPN++KILR FN+TEI+FKL NVQKIWQRF
Sbjct: 663  IPVVIQDGIFLPYENVLNYDSFAVRIPEAEIPNLIKILRGFNDTEIEFKLENVQKIWQRF 722

Query: 2351 LYRDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQLK 2530
            +YRDS+LLEAERQ+++  HVDDWA+E  + +EDDVF TLIQ+LHYKLHN+PWR+Q++   
Sbjct: 723  MYRDSVLLEAERQKTAIGHVDDWAVEFLKLTEDDVFVTLIQILHYKLHNDPWRKQVRH-N 781

Query: 2531 KEFGLPKQC 2557
            K FGLP QC
Sbjct: 782  KHFGLPHQC 790


>ref|XP_006402860.1| hypothetical protein EUTSA_v10005794mg [Eutrema salsugineum]
            gi|557103959|gb|ESQ44313.1| hypothetical protein
            EUTSA_v10005794mg [Eutrema salsugineum]
          Length = 791

 Score = 1155 bits (2987), Expect = 0.0
 Identities = 522/789 (66%), Positives = 623/789 (78%), Gaps = 1/789 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALGG 379
            MF  QK  CS+S I T+AS++ LVS+VH+F   + PS+D+  +RQ Q +  GT+  ++  
Sbjct: 1    MFSHQKWKCSWSQIATVASVIVLVSLVHIFLGPVVPSFDSVSVRQAQ-NLSGTSNDSIRQ 59

Query: 380  RDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKPC 559
              E   +  +  D RFP DLH AVVYR A WKAEIG+W S CD+            G+ C
Sbjct: 60   VSEDSSKTVVAFDRRFPADLHGAVVYRNASWKAEIGQWLSSCDAVAKDVDIIEPIGGRKC 119

Query: 560  QNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCDT 739
             NDCS QGVCN E G CRCFHG+ GE C+Q+L L+CNY  TP++P GRWVVSICS HCDT
Sbjct: 120  LNDCSSQGVCNHEFGICRCFHGYTGEDCSQKLRLECNYEKTPEMPYGRWVVSICSRHCDT 179

Query: 740  MRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGWCNV 919
             RAMCFCGEGTKYPNRP PESCGF I  P  P   K+TDW KPD +I TTN+S  GWCNV
Sbjct: 180  TRAMCFCGEGTKYPNRPVPESCGFQINSPVNPDEPKMTDWSKPDLDILTTNSSKQGWCNV 239

Query: 920  DPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYGV 1099
            DP +AYA KV+ KEECDCKYDCL G+FCE+PVQC+C+N                  W+G 
Sbjct: 240  DPEDAYALKVQIKEECDCKYDCLWGRFCEVPVQCTCVNQCSGHGKCRGGFCQCDKGWFGT 299

Query: 1100 DCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFNS 1279
            DCSIPS  S++ EWP+WLRPA ++VP       N++++SA V KKRPLIY+YDLPP FNS
Sbjct: 300  DCSIPSTLSTVGEWPQWLRPAHLEVPSDKNVPGNLSNISAVVKKKRPLIYIYDLPPDFNS 359

Query: 1280 KLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVPV 1459
             LLEGRHFK ECVNRIYD +N T+WT+ LYG+QMA YE++L++ HRT NGEEADFFFVPV
Sbjct: 360  LLLEGRHFKLECVNRIYDDRNATIWTDYLYGSQMAFYENILATAHRTLNGEEADFFFVPV 419

Query: 1460 LDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFSW 1639
            LDSCIITRADDAPH +++++ GLRSS TLE YK+AYEHI E+Y YWN SSG+DHIWFFSW
Sbjct: 420  LDSCIITRADDAPHLSMQNHTGLRSSLTLEFYKRAYEHIVEKYPYWNRSSGRDHIWFFSW 479

Query: 1640 DEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDLV 1819
            DEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYW DNWD IS+ RRG HPCFDP KDLV
Sbjct: 480  DEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDEISNERRGDHPCFDPRKDLV 539

Query: 1820 LPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEFG 1999
            +PAWK PD  ++++N WAR RE+R  LFYFNGNLGPAY  GRPE +YSMGIRQK+AEEFG
Sbjct: 540  IPAWKVPDPFSMRANYWARPREKRKTLFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEFG 599

Query: 2000 SSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIPV 2179
            SSPNKEG+LGKQH +DV+VTPLRS +Y  D+ATS+FCGVFPGDGWSGRMEDSILQGC+PV
Sbjct: 600  SSPNKEGKLGKQHADDVVVTPLRSDNYHNDIATSIFCGVFPGDGWSGRMEDSILQGCVPV 659

Query: 2180 IIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLYR 2359
            IIQDGIYLPYEN+LNYESFAVR+ ED+IP+++  LR FNETEIQF+LANV+KIWQRFL+R
Sbjct: 660  IIQDGIYLPYENMLNYESFAVRVSEDDIPSLINTLRGFNETEIQFRLANVKKIWQRFLFR 719

Query: 2360 DSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRR-QLQQLKKE 2536
            DSILLEAERQ++S+ H +DWA++ S+   DD+FAT IQ LH+KLHN+PWRR Q+    KE
Sbjct: 720  DSILLEAERQKASFGHEEDWAVQFSKLKHDDIFATFIQTLHFKLHNDPWRREQVVNRTKE 779

Query: 2537 FGLPKQCFK 2563
            +GLP++C +
Sbjct: 780  YGLPQECLQ 788


>ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807663 [Glycine max]
          Length = 795

 Score = 1151 bits (2978), Expect = 0.0
 Identities = 532/786 (67%), Positives = 617/786 (78%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALGG 379
            +F + K  CS+SL  TIAS+VALVSVVHLF F L P+++ F I   Q SC  TN +A   
Sbjct: 8    LFSMNKWRCSWSLAATIASVVALVSVVHLFLFPLTPTFNYFKI--AQDSCFPTNASAEFP 65

Query: 380  RDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKPC 559
             +       +D   +FP DLH A VY G PWKAEIG+W +GCDS            G  C
Sbjct: 66   SNHDQERPAVDFKHQFPADLHGAFVYHGVPWKAEIGQWLAGCDSVIKDVNITEIIGGINC 125

Query: 560  QNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCDT 739
            +NDCSGQG+CN +LGQCRCFHG+ G+GC + L+L+CN+ G+PD P GRWVVSIC ++CD 
Sbjct: 126  KNDCSGQGICNRQLGQCRCFHGYSGDGCTKNLQLECNFLGSPDQPFGRWVVSICPANCDK 185

Query: 740  MRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGWCNV 919
             RAMCFCGEG KYPNRP  E+CGF    P+EP   ++ +W K D ++FTTN S+PGWCNV
Sbjct: 186  TRAMCFCGEGAKYPNRPLAETCGFQFDPPSEPDGPRIVNWTKIDQDVFTTNRSIPGWCNV 245

Query: 920  DPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYGV 1099
            DP EAYAGK K KEECDCKYD L G+FCE+PV+  CIN                  WYGV
Sbjct: 246  DPAEAYAGKAKVKEECDCKYDGLAGRFCEVPVESVCINQCSGHGHCRGGFCQVSAGWYGV 305

Query: 1100 DCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFNS 1279
            DCS+PSV SSIKEWP WLRPA++ +       + + +L+A V KKRPL+YVYDLPP+FNS
Sbjct: 306  DCSMPSVISSIKEWPSWLRPARIHIADDTHANEKMINLNAVVAKKRPLVYVYDLPPEFNS 365

Query: 1280 KLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVPV 1459
             LLEGRH+K ECVNRIYD  N TVWT+QLYGAQ+A+YESLL+SPHRT NGEEADFFFVPV
Sbjct: 366  LLLEGRHYKLECVNRIYDDNNITVWTDQLYGAQIALYESLLASPHRTLNGEEADFFFVPV 425

Query: 1460 LDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFSW 1639
            LDSCIITRADDAPH +++++ GLRSS TLE YK  Y HI EQY YW+HSSG+DHIW FSW
Sbjct: 426  LDSCIITRADDAPHLSMQEHMGLRSSLTLEYYKNTYTHIVEQYPYWSHSSGRDHIWSFSW 485

Query: 1640 DEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDLV 1819
            DEGACYAPKEIWNSMMLVHWGNTN+KHNHSTTAYWADNWD IS  RRG HPCFDP KDLV
Sbjct: 486  DEGACYAPKEIWNSMMLVHWGNTNTKHNHSTTAYWADNWDKISSDRRGIHPCFDPDKDLV 545

Query: 1820 LPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEFG 1999
            LPAWK PD   L S LWARS E+R  LFYFNGNLGPAY +GRPE TYSMGIRQK+AEEFG
Sbjct: 546  LPAWKVPDAYVLTSKLWARSHEKRKTLFYFNGNLGPAYPHGRPEDTYSMGIRQKLAEEFG 605

Query: 2000 SSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIPV 2179
            SSPNK+G+LGKQH  DVIVTP RS DY  DLA+SVFCGVFPGDGWSGRMEDSILQGCIPV
Sbjct: 606  SSPNKDGKLGKQHAKDVIVTPERSEDYHMDLASSVFCGVFPGDGWSGRMEDSILQGCIPV 665

Query: 2180 IIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLYR 2359
            +IQDGI+LPYENVLNY+SFAVRI E EIPN++K LR FN+TEI+FKLANVQKIWQRFLYR
Sbjct: 666  VIQDGIFLPYENVLNYDSFAVRIPEAEIPNLIKTLRGFNDTEIEFKLANVQKIWQRFLYR 725

Query: 2360 DSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQLKKEF 2539
            DS+LLEAERQ+++  HVDDWA+E  + +EDD FATLIQ+LHYKLHN+ WR+Q++   K+F
Sbjct: 726  DSVLLEAERQKTAIGHVDDWAVEFLKLTEDDAFATLIQILHYKLHNDRWRKQVRH-NKQF 784

Query: 2540 GLPKQC 2557
            GLP QC
Sbjct: 785  GLPHQC 790


>ref|XP_004161484.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101226446
            [Cucumis sativus]
          Length = 859

 Score = 1150 bits (2976), Expect = 0.0
 Identities = 524/787 (66%), Positives = 618/787 (78%), Gaps = 1/787 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALGG 379
            M   QK NCS+SL  +IASI+ LV+VVHLFFF L PS DN          +  +  A   
Sbjct: 1    MAFAQKWNCSWSLGASIASIIGLVTVVHLFFFPLVPSLDNLRRFPNSGFAVNVSTEAYNN 60

Query: 380  RDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKPC 559
              +     P+DL  +FPPD HNAVVY GAPWK+ IG+W SGCD+            G  C
Sbjct: 61   HAKEDPAPPIDLTHKFPPDSHNAVVYHGAPWKSHIGQWLSGCDANTKDLQIVELVGGSGC 120

Query: 560  QNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCDT 739
            +NDC+GQGVCN E GQCRCFHG+ GEGC++++ L+CN+PG+   P G WVVSICS+HCDT
Sbjct: 121  KNDCNGQGVCNYEFGQCRCFHGYSGEGCSEKVNLECNHPGSEGEPYGPWVVSICSAHCDT 180

Query: 740  MRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPD-PNIFTTNASLPGWCN 916
             RAMCFCGEGTKYPNRP  E+CGF +  P+EP   KVTDW K D  NIFTTN S  GWCN
Sbjct: 181  TRAMCFCGEGTKYPNRPVAEACGFQMRPPSEPNGSKVTDWTKADLDNIFTTNGSKSGWCN 240

Query: 917  VDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYG 1096
            VDP EAYA KV+FKEECDCKYDC +G+FCE+PV C+CIN                  WYG
Sbjct: 241  VDPAEAYASKVQFKEECDCKYDCSLGRFCELPVSCTCINQCSGHGHCMGGFCQCNEGWYG 300

Query: 1097 VDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFN 1276
            VDCSIPSV +S++EWP+WL PA++D+P  +   +   +L   VNK+RPLIY+YDLPP FN
Sbjct: 301  VDCSIPSVQTSVREWPQWLLPARIDIPDRLHITEKSFNLKPMVNKRRPLIYIYDLPPGFN 360

Query: 1277 SKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVP 1456
            S+LL+GRH+KFECVNR+Y+ +N T+WT+ LYGA+MA YES+L+SPHRT NGEEADFFFVP
Sbjct: 361  SQLLQGRHWKFECVNRMYNERNATMWTDDLYGAEMAFYESILASPHRTLNGEEADFFFVP 420

Query: 1457 VLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFS 1636
            VLDSCIITRADDAPH +LRDY GLRS  TL+ YKKA++HI EQY YWN SSG+DHIWFFS
Sbjct: 421  VLDSCIITRADDAPHLSLRDYMGLRSFLTLDFYKKAHDHIVEQYPYWNRSSGRDHIWFFS 480

Query: 1637 WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDL 1816
            WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYW DNWD I   +RG HPCFDP KDL
Sbjct: 481  WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDNIPSSKRGNHPCFDPEKDL 540

Query: 1817 VLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEF 1996
            V+PAWKRPDG  L   LWAR RE+R   F+FNGNLGPAY  GRPE+TYSMGIRQKVAEEF
Sbjct: 541  VVPAWKRPDGSRLSKKLWARPREERKTFFFFNGNLGPAYERGRPESTYSMGIRQKVAEEF 600

Query: 1997 GSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIP 2176
            GSSPNKEG+LGKQH  DVIVTPLRS +Y  DLA+SVFCGV PGDGWSGRMEDSILQGCIP
Sbjct: 601  GSSPNKEGKLGKQHAADVIVTPLRSENYHEDLASSVFCGVMPGDGWSGRMEDSILQGCIP 660

Query: 2177 VIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLY 2356
            VIIQDGI+LPYENVLNY+SFAVRIGED+IPN++ ILR FNE+EI+FKL+NV+KIWQRF+Y
Sbjct: 661  VIIQDGIFLPYENVLNYDSFAVRIGEDDIPNLINILRGFNESEIEFKLSNVRKIWQRFMY 720

Query: 2357 RDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQLKKE 2536
            R++++LEA+RQ++ Y   +DWA E SQ  +DD  AT++QVLH+KLH++PWRR ++   KE
Sbjct: 721  REAVMLEAQRQKAVYGIQEDWADEYSQLIDDDAVATVLQVLHHKLHSDPWRRHVKS-NKE 779

Query: 2537 FGLPKQC 2557
            FGLP +C
Sbjct: 780  FGLPHEC 786


>gb|ESW17624.1| hypothetical protein PHAVU_007G255200g [Phaseolus vulgaris]
          Length = 795

 Score = 1149 bits (2972), Expect = 0.0
 Identities = 533/781 (68%), Positives = 617/781 (79%)
 Frame = +2

Query: 215  KGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALGGRDELP 394
            K  CS+SL  TIAS+VALVSVVHLF F L P+++ F I   + SC+  N +A    +   
Sbjct: 13   KWRCSWSLAVTIASVVALVSVVHLFMFPLTPTFNYFKI--AKDSCIQANASAEFPSNRDQ 70

Query: 395  VEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKPCQNDCS 574
             +  +D   +FP DLH +VVY+GAPWKAEIG W + CDS               C+NDCS
Sbjct: 71   EQPAVDFKLQFPADLHGSVVYQGAPWKAEIGHWLAACDSVIKEVNITEIIGVNNCKNDCS 130

Query: 575  GQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCDTMRAMC 754
            GQGVCN ELGQCRCFHG+ G+GC +Q +L+CNY G+PDL  GRWVVSIC ++CD  RAMC
Sbjct: 131  GQGVCNRELGQCRCFHGYSGDGCTEQRQLECNYEGSPDLQFGRWVVSICPANCDKTRAMC 190

Query: 755  FCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGWCNVDPIEA 934
            FCGEGTKYPNRP  E+CGF  I P+EP   K+ +W K D ++FTTN S+ GWCNVDP +A
Sbjct: 191  FCGEGTKYPNRPLAETCGFQYIPPSEPDGPKIVNWTKIDQDVFTTNGSIRGWCNVDPADA 250

Query: 935  YAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYGVDCSIP 1114
            YAGK K KEECDCKYD L G+ CE+PV+  CIN                  WYGVDCS+P
Sbjct: 251  YAGKAKIKEECDCKYDGLSGRLCEVPVESVCINQCSRHGHCRGGFCQCDKGWYGVDCSMP 310

Query: 1115 SVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFNSKLLEG 1294
            S  SSI EWP WLRPA++D+         + +L+A V KKRPLIYVYDLPP+FNS LLEG
Sbjct: 311  SAISSIIEWPSWLRPARIDIVDDTHANGKMINLNAVVAKKRPLIYVYDLPPEFNSLLLEG 370

Query: 1295 RHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVPVLDSCI 1474
            RHFK ECVNRIYD +N T+WT+QLYGAQMA+YESLL+SPHRT NGEEADFFFVPVLDSCI
Sbjct: 371  RHFKLECVNRIYDDKNVTIWTDQLYGAQMALYESLLASPHRTVNGEEADFFFVPVLDSCI 430

Query: 1475 ITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFSWDEGAC 1654
            ITRADDAPH +L+++ GLRSS TLE YKKAY HI +QY YWNHSSG+DHIWFFSWDEGAC
Sbjct: 431  ITRADDAPHLSLQEHMGLRSSLTLEYYKKAYTHIVDQYPYWNHSSGRDHIWFFSWDEGAC 490

Query: 1655 YAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDLVLPAWK 1834
            YAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWD I   +RG HPCFDP KDLVLPAWK
Sbjct: 491  YAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDTIPSDKRGIHPCFDPDKDLVLPAWK 550

Query: 1835 RPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEFGSSPNK 2014
             PD   L S LWAR+ E+R  LFYFNGNLGPAY +GRPE +YSMGIRQK+AEEFGSSPNK
Sbjct: 551  VPDANVLTSKLWARTHEERKTLFYFNGNLGPAYPHGRPEDSYSMGIRQKLAEEFGSSPNK 610

Query: 2015 EGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIPVIIQDG 2194
            +G+LGKQH  DVIVT  R+ +Y  DLA+SVFCGVFPGDGWSGRMEDSILQGCIPV+IQDG
Sbjct: 611  DGKLGKQHAKDVIVTQERTENYHLDLASSVFCGVFPGDGWSGRMEDSILQGCIPVVIQDG 670

Query: 2195 IYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLYRDSILL 2374
            I+LPYEN+LNY+SFAVR+ E+EIPN++KILR FNETEI+FKLANVQKIWQRFLYRDS+LL
Sbjct: 671  IFLPYENILNYDSFAVRLSEEEIPNLLKILRGFNETEIKFKLANVQKIWQRFLYRDSVLL 730

Query: 2375 EAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQLKKEFGLPKQ 2554
            EAERQ+++  +VDDWA+E  +  EDDV ATLIQVLHYKLHNEPWR+QL+   K+FGLP Q
Sbjct: 731  EAERQKTAIGYVDDWAIEFLKLIEDDVSATLIQVLHYKLHNEPWRKQLRH-NKQFGLPHQ 789

Query: 2555 C 2557
            C
Sbjct: 790  C 790


>ref|XP_004136589.1| PREDICTED: uncharacterized protein LOC101206674 [Cucumis sativus]
          Length = 791

 Score = 1147 bits (2968), Expect = 0.0
 Identities = 523/787 (66%), Positives = 617/787 (78%), Gaps = 1/787 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALGG 379
            M   QK NCS+SL  +IASI+ LV+VVHLFFF L PS DN          +  +  A   
Sbjct: 1    MAFAQKWNCSWSLGASIASIIGLVTVVHLFFFPLVPSLDNLRRFPNSGFAVNVSTEAYNN 60

Query: 380  RDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKPC 559
              +      +DL  +FPPD HNAVVY GAPWK+ IG+W SGCD+            G  C
Sbjct: 61   HAKEDPAPAIDLTHKFPPDSHNAVVYHGAPWKSHIGQWLSGCDANTKDLQIVELVGGSGC 120

Query: 560  QNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCDT 739
            +NDC+GQGVCN E GQCRCFHG+ GEGC++++ L+CN+PG+   P G WVVSICS+HCDT
Sbjct: 121  KNDCNGQGVCNYEFGQCRCFHGYSGEGCSEKVNLECNHPGSEGEPYGPWVVSICSAHCDT 180

Query: 740  MRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPD-PNIFTTNASLPGWCN 916
             RAMCFCGEGTKYPNRP  E+CGF +  P+EP   KVTDW K D  NIFTTN S  GWCN
Sbjct: 181  TRAMCFCGEGTKYPNRPVAEACGFQMRPPSEPNGSKVTDWTKADLDNIFTTNGSKSGWCN 240

Query: 917  VDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYG 1096
            VDP EAYA KV+FKEECDCKYDC +G+FCE+PV C+CIN                  WYG
Sbjct: 241  VDPAEAYASKVQFKEECDCKYDCSLGRFCELPVSCTCINQCSGHGHCMGGFCQCNEGWYG 300

Query: 1097 VDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFN 1276
            VDCSIPSV +S++EWP+WL PA++D+P  +   +   +L   VNK+RPLIY+YDLPP FN
Sbjct: 301  VDCSIPSVQTSVREWPQWLLPARIDIPDRLHITEKSFNLKPMVNKRRPLIYIYDLPPGFN 360

Query: 1277 SKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVP 1456
            S+LL+GRH+KFECVNR+Y+ +N T+WT+ LYGA+MA YES+L+SPHRT NGEEADFFFVP
Sbjct: 361  SQLLQGRHWKFECVNRMYNERNATMWTDDLYGAEMAFYESILASPHRTLNGEEADFFFVP 420

Query: 1457 VLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFS 1636
            VLDSCIITRADDAPH +LRDY GLRS  TL+ YKKA++HI EQY YWN SSG+DHIWFFS
Sbjct: 421  VLDSCIITRADDAPHLSLRDYMGLRSFLTLDFYKKAHDHIVEQYPYWNRSSGRDHIWFFS 480

Query: 1637 WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDL 1816
            WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYW DNWD I   +RG HPCFDP KDL
Sbjct: 481  WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDNIPSSKRGNHPCFDPEKDL 540

Query: 1817 VLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEF 1996
            V+PAWKRPDG  L   LWAR RE+R   F+FNGNLGPAY  GRPE+TYSMGIRQKVAEEF
Sbjct: 541  VVPAWKRPDGSRLSKKLWARPREERKTFFFFNGNLGPAYERGRPESTYSMGIRQKVAEEF 600

Query: 1997 GSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIP 2176
            GSSPNKEG+LGKQH  DVIVTPLRS +Y  DLA+SVFCGV PGDGWSGRMEDSILQGCIP
Sbjct: 601  GSSPNKEGKLGKQHAADVIVTPLRSENYHEDLASSVFCGVMPGDGWSGRMEDSILQGCIP 660

Query: 2177 VIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLY 2356
            VIIQDGI+LPYENVLNY+SFAVRIGED+IPN++ ILR FNE+EI+FKL+NV+KIWQRF+Y
Sbjct: 661  VIIQDGIFLPYENVLNYDSFAVRIGEDDIPNLINILRGFNESEIEFKLSNVRKIWQRFMY 720

Query: 2357 RDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQLKKE 2536
            R++++LEA+RQ++ Y   +DWA E SQ  +DD  AT++QVLH+KLH++PWRR ++   KE
Sbjct: 721  REAVMLEAQRQKAVYGIQEDWADEYSQLIDDDAVATVLQVLHHKLHSDPWRRHVKS-NKE 779

Query: 2537 FGLPKQC 2557
            FGLP +C
Sbjct: 780  FGLPHEC 786


>ref|XP_006349551.1| PREDICTED: uncharacterized protein LOC102592127 [Solanum tuberosum]
          Length = 790

 Score = 1147 bits (2966), Expect = 0.0
 Identities = 538/789 (68%), Positives = 617/789 (78%), Gaps = 2/789 (0%)
 Frame = +2

Query: 197  IMFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALG 376
            +M+  QK  CS+S +  IASIV LVSVVHLF + + PS D F  RQ + SC+  N T   
Sbjct: 1    MMWFKQKRMCSWSSVTIIASIVTLVSVVHLFLYPVVPSLDYF--RQYKNSCIPINSTK-- 56

Query: 377  GRDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKP 556
                    + +    +FP DLHN VVYRGAPWK ++G+W +GCDS            GK 
Sbjct: 57   STQPTHNNIIISNQTKFPLDLHNGVVYRGAPWKNQVGQWLAGCDSITSPLKVIEHIGGKS 116

Query: 557  CQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCD 736
            C+NDCSGQG+CN ELGQCRCFHGF GE CA++ EL CNYP + + P G WVVSIC ++CD
Sbjct: 117  CRNDCSGQGICNRELGQCRCFHGFTGEECAERQELSCNYPRSKEKPFGHWVVSICPAYCD 176

Query: 737  TMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGWCN 916
            T RAMCFCGEGTKYPNRP PE+CGFTI  P++PG   VTD+ K D ++FTTN S  GWCN
Sbjct: 177  TTRAMCFCGEGTKYPNRPVPETCGFTINPPSKPGGAPVTDFTKADLDVFTTNGSKRGWCN 236

Query: 917  VDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYG 1096
            VDP EAYA KV FKEECDCKYD L G+FCE+ V  +CIN                  W+G
Sbjct: 237  VDPEEAYASKVLFKEECDCKYDGLWGRFCEVSVLSTCINQCSGHGLCRGGFCQCDSGWFG 296

Query: 1097 VDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFN 1276
             DCS+PSV SSI+EWP WLRPAQV VP +V +  N+ +L A V KKRPLIYVYDLPP FN
Sbjct: 297  TDCSVPSVLSSIREWPLWLRPAQVTVPENVNSNGNLINLDAIVEKKRPLIYVYDLPPDFN 356

Query: 1277 SKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVP 1456
            S LLEGRHFK EC+NRIYD +N TVWT+QLYGAQMA+YES+L+SPHRT NGEEADFFFVP
Sbjct: 357  SLLLEGRHFKLECINRIYDQRNATVWTDQLYGAQMALYESMLASPHRTLNGEEADFFFVP 416

Query: 1457 VLDSCIITRADDAPHFTLRDY--QGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWF 1630
            VLDSCIITRADDAPH +++++   GLRSS TLE YKKAY+HI  QY YW+ S+GKDHIWF
Sbjct: 417  VLDSCIITRADDAPHLSMQEHIHGGLRSSLTLEFYKKAYDHIITQYPYWSRSAGKDHIWF 476

Query: 1631 FSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSK 1810
            FSWDEGACYAPKEIWNS+MLVHWGNTNSKHNHSTTAYW DNWD IS  RRG H CFDP K
Sbjct: 477  FSWDEGACYAPKEIWNSIMLVHWGNTNSKHNHSTTAYWGDNWDPISSDRRGNHTCFDPDK 536

Query: 1811 DLVLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAE 1990
            DLVLPAWKRPD  +L +  W+R RE+R   FYFNGNLGPAY NGRPEATYSMGIRQKVAE
Sbjct: 537  DLVLPAWKRPDEGSLNAKHWSRVREERKTFFYFNGNLGPAYENGRPEATYSMGIRQKVAE 596

Query: 1991 EFGSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGC 2170
            EFGS+ NKEG+LGKQH  DVIVTPLR+ +Y  +LA+SVFCGV PGDGWSGRMEDSILQGC
Sbjct: 597  EFGSTLNKEGKLGKQHAEDVIVTPLRAGNYHEELASSVFCGVMPGDGWSGRMEDSILQGC 656

Query: 2171 IPVIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRF 2350
            IPV+IQDGIYLPYEN LNYESFAVRI EDEIPN++ ILR+FNETEI+FKL NV+KIWQRF
Sbjct: 657  IPVVIQDGIYLPYENFLNYESFAVRIREDEIPNLLNILRSFNETEIEFKLENVKKIWQRF 716

Query: 2351 LYRDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRRQLQQLK 2530
            LYRDS++LEAERQ++    V+DW L+ SQ  EDDVFAT IQVLHYKLHN+ WR+QL   K
Sbjct: 717  LYRDSVVLEAERQKAVRGSVEDWGLKFSQLKEDDVFATFIQVLHYKLHNDTWRQQLILQK 776

Query: 2531 KEFGLPKQC 2557
            KEFGLPK+C
Sbjct: 777  KEFGLPKEC 785


>ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297324003|gb|EFH54424.1| exostosin family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 792

 Score = 1145 bits (2963), Expect = 0.0
 Identities = 520/787 (66%), Positives = 613/787 (77%), Gaps = 1/787 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALGG 379
            MF  QK   S+S I T+AS++ LVS+VHLF   + PS+D+  +RQ Q     TN +    
Sbjct: 1    MFSHQKWKFSWSQIATVASVIVLVSLVHLFLGPVVPSFDSIIVRQAQNLSGPTNESITQV 60

Query: 380  RDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKPC 559
              +L   + +  D RFP D H AVVYR A WKAEIG+W S CD+            G+ C
Sbjct: 61   TKDLSQSLVVAFDRRFPADSHGAVVYRNASWKAEIGQWLSSCDAVAKEVDVIEPIGGRKC 120

Query: 560  QNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCDT 739
             NDCSGQGVCN E G CRCFHGF G+ C+Q+L L CNY  TP++P G+WVVSICS HCDT
Sbjct: 121  MNDCSGQGVCNYEFGLCRCFHGFTGDDCSQKLHLDCNYEKTPEMPYGKWVVSICSRHCDT 180

Query: 740  MRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGWCNV 919
             RAMCFCGEGTKYPNRP PESCGF I  P  P   K+TDW KPD +I TTN+S  GWCNV
Sbjct: 181  TRAMCFCGEGTKYPNRPVPESCGFQINSPANPDEPKMTDWSKPDLDILTTNSSKQGWCNV 240

Query: 920  DPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYGV 1099
            DP +AYA KV+ KEECDCKYDCL G+FCEIPVQC+C+N                  W+G 
Sbjct: 241  DPEDAYALKVQIKEECDCKYDCLWGRFCEIPVQCTCVNQCSGHGKCRGGFCQCDKGWFGT 300

Query: 1100 DCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFNS 1279
            DCS PS  S++ EWP+WLRPA ++VP       N+T+LSA V KKRPLIY+YDLPP FNS
Sbjct: 301  DCSTPSTLSTVGEWPQWLRPAHLEVPSEKNVPGNLTNLSAVVKKKRPLIYIYDLPPDFNS 360

Query: 1280 KLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVPV 1459
             L+EGRHFK ECVNRIYD +N TVWT+ LYG+QMA YE++L++ HRT NGEEADFFFVPV
Sbjct: 361  LLIEGRHFKLECVNRIYDERNATVWTDYLYGSQMAFYENILATAHRTLNGEEADFFFVPV 420

Query: 1460 LDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFSW 1639
            LDSCII RADDAPH  ++++ GLRSSFTLE YK+AYEHI E+Y YWN S+G+DHIWFFSW
Sbjct: 421  LDSCIINRADDAPHINMQNHTGLRSSFTLEFYKRAYEHIVEKYPYWNRSAGRDHIWFFSW 480

Query: 1640 DEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDLV 1819
            DEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYW DNWD ISD RRG HPCFDP KDLV
Sbjct: 481  DEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDDISDERRGDHPCFDPRKDLV 540

Query: 1820 LPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEFG 1999
            +PAWK PD  ++++N WAR RE+R  LFYFNGNLGPAY  GRPE +YSMGIRQK+AEEFG
Sbjct: 541  IPAWKVPDPYSMRANYWARPREKRKTLFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEFG 600

Query: 2000 SSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIPV 2179
            SSPNKEG+LGKQH  DVIVTPLRS +Y  D+A S+FCG FPGDGWSGRMEDSILQGC+PV
Sbjct: 601  SSPNKEGKLGKQHAEDVIVTPLRSDNYHKDIANSIFCGAFPGDGWSGRMEDSILQGCVPV 660

Query: 2180 IIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLYR 2359
            IIQDGIYLPYEN+LNYESFAVR+ ED+IPN++  LR F+ETEIQF+LANV+K+WQRFL+R
Sbjct: 661  IIQDGIYLPYENMLNYESFAVRVSEDDIPNLINTLRGFSETEIQFRLANVKKLWQRFLFR 720

Query: 2360 DSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRR-QLQQLKKE 2536
            DSILLEAERQ++SY H ++WA++ S+   DD+FAT IQ LH+KLHN+PWRR Q+    K+
Sbjct: 721  DSILLEAERQKASYGHEEEWAVQFSKLKHDDIFATFIQTLHFKLHNDPWRREQVVNRTKD 780

Query: 2537 FGLPKQC 2557
            +GLP++C
Sbjct: 781  YGLPQEC 787


>ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]
            gi|44917463|gb|AAS49056.1| At3g57630 [Arabidopsis
            thaliana] gi|46931284|gb|AAT06446.1| At3g57630
            [Arabidopsis thaliana] gi|332646159|gb|AEE79680.1|
            exostosin family protein [Arabidopsis thaliana]
          Length = 793

 Score = 1135 bits (2937), Expect = 0.0
 Identities = 517/788 (65%), Positives = 612/788 (77%), Gaps = 2/788 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGT-ALG 376
            MF  QK   S+S I T+AS++ LVS+VHLF   + PS+D+  +RQ Q  C  +N + +  
Sbjct: 1    MFSHQKWKFSWSQIATVASVIVLVSLVHLFLGPVVPSFDSITVRQAQNLCGPSNESISQV 60

Query: 377  GRDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKP 556
             ++     V +  D RFP D H AVVYR A WKAEIG+W S CD+            G+ 
Sbjct: 61   TKNSSQSLVVVAFDRRFPADSHGAVVYRNASWKAEIGQWLSSCDAVAKEVDIIEPIGGRK 120

Query: 557  CQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCD 736
            C +DCSGQGVCN E G CRCFHGF GE C+Q+L L CNY  TP++P G+WVVSICS HCD
Sbjct: 121  CMSDCSGQGVCNHEFGLCRCFHGFTGEDCSQKLRLDCNYEKTPEMPYGKWVVSICSRHCD 180

Query: 737  TMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGWCN 916
            T RAMCFCGEGTKYPNRP PESCGF I  PT P   K+TDW KPD +I TTN+S  GWCN
Sbjct: 181  TTRAMCFCGEGTKYPNRPVPESCGFQINSPTNPDEPKMTDWSKPDLDILTTNSSKQGWCN 240

Query: 917  VDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYG 1096
            VDP +AYA KVK KEECDCKYDCL G+FCEIPVQC+C+N                  W+G
Sbjct: 241  VDPEDAYAMKVKIKEECDCKYDCLWGRFCEIPVQCTCVNQCSGHGKCRGGFCQCDKGWFG 300

Query: 1097 VDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFN 1276
             DCSIPS  S++ EWP+WLRPA ++VP       N+ +LSA V KKRPLIY+YDLPP FN
Sbjct: 301  TDCSIPSTLSTVGEWPQWLRPAHLEVPSEKNVPGNLINLSAVVKKKRPLIYIYDLPPDFN 360

Query: 1277 SKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVP 1456
            S L+EGRHFKFECVNRIYD +N TVWT+ LYG+QMA YE++L++ HRT NGEEADFFFVP
Sbjct: 361  SLLIEGRHFKFECVNRIYDERNATVWTDYLYGSQMAFYENILATAHRTMNGEEADFFFVP 420

Query: 1457 VLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFS 1636
            VLDSCII RADDAPH  ++++ GLRSS TLE YK+AYEHI E+Y YWN S+G+DHIWFFS
Sbjct: 421  VLDSCIINRADDAPHINMQNHTGLRSSLTLEFYKRAYEHIVEKYPYWNRSAGRDHIWFFS 480

Query: 1637 WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDL 1816
            WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAY+ DNWD ISD RRG HPCFDP KDL
Sbjct: 481  WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYFGDNWDDISDERRGDHPCFDPRKDL 540

Query: 1817 VLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEF 1996
            V+PAWK PD  +++ N W R RE+R  LFYFNGNLGPAY  GRPE +YSMGIRQK+AEEF
Sbjct: 541  VIPAWKVPDPYSMRKNYWERPREKRKTLFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEF 600

Query: 1997 GSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIP 2176
            GSSPNKEG+LGKQH  DVIVTPLRS +Y  D+A S+FCG FPGDGWSGRMEDSILQGC+P
Sbjct: 601  GSSPNKEGKLGKQHAEDVIVTPLRSDNYHKDIANSIFCGAFPGDGWSGRMEDSILQGCVP 660

Query: 2177 VIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLY 2356
            VIIQDGIYLPYEN+LNYESFAVR+ ED+IPN++  LR F+E EIQF+L NV+++WQRFL+
Sbjct: 661  VIIQDGIYLPYENMLNYESFAVRVNEDDIPNLINTLRGFSEAEIQFRLGNVKELWQRFLF 720

Query: 2357 RDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRR-QLQQLKK 2533
            RDSILLEAERQ+++Y H +DWA++ S+   DD+FAT+IQ LH+KLHN+PWRR Q     K
Sbjct: 721  RDSILLEAERQKATYGHEEDWAVQFSKLKHDDIFATIIQTLHFKLHNDPWRREQAVNRTK 780

Query: 2534 EFGLPKQC 2557
            ++GLP++C
Sbjct: 781  DYGLPQEC 788


>ref|XP_006290615.1| hypothetical protein CARUB_v10016706mg [Capsella rubella]
            gi|482559322|gb|EOA23513.1| hypothetical protein
            CARUB_v10016706mg [Capsella rubella]
          Length = 796

 Score = 1128 bits (2917), Expect = 0.0
 Identities = 517/784 (65%), Positives = 610/784 (77%), Gaps = 2/784 (0%)
 Frame = +2

Query: 212  QKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGTALGGRDEL 391
            QK N S+S I  +AS++ LVS+VHLF   + PS+D   +RQ Q     +N +     ++ 
Sbjct: 8    QKWNFSWSQIAIVASVIVLVSLVHLFLGPVLPSFDTVSVRQAQNLSGPSNESITQVTEDS 67

Query: 392  PVEVPLD-LDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKPCQND 568
              E  L   D RFP D H AVVYR A WKAEIG+W S CD+            G+ C +D
Sbjct: 68   TSESVLAAFDRRFPADSHGAVVYRNASWKAEIGQWLSSCDAVAKEVDIIEPIGGRKCLSD 127

Query: 569  CSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCDTMRA 748
            CSGQGVCN E G CRCFHGF G+ C+Q+  L+CNY  TP++P G WVVSICS+HCDT RA
Sbjct: 128  CSGQGVCNHEFGICRCFHGFTGQDCSQKQRLECNYEKTPEMPYGPWVVSICSTHCDTTRA 187

Query: 749  MCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGWCNVDPI 928
            MCFCGEGTKYPNRP PESCGF    P  P   K+TDW KPD +I TTN+S  GWCNVDP 
Sbjct: 188  MCFCGEGTKYPNRPVPESCGFQSNSPANPDEPKMTDWSKPDLDILTTNSSKQGWCNVDPE 247

Query: 929  EAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYGVDCS 1108
            +AYA KV+ KEECDCKYDCL G+FCEIPVQC+C+N                  W+G DCS
Sbjct: 248  DAYALKVQIKEECDCKYDCLWGRFCEIPVQCTCVNQCSGHGKCRGGFCQCDKGWFGTDCS 307

Query: 1109 IPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFNSKLL 1288
            IPS  S++ EWP+WLRPA ++VP   +   NI +LSA V KKRPLIY+YDLPP FNS LL
Sbjct: 308  IPSTLSTVGEWPQWLRPAHLEVPSDKEVPGNIINLSAVVKKKRPLIYIYDLPPDFNSLLL 367

Query: 1289 EGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVPVLDS 1468
            EGRHFK ECVNRIYD +N TVWT+ LYG+QMA YE++L++ HRT NGEEADFFFVPVLDS
Sbjct: 368  EGRHFKLECVNRIYDERNATVWTDYLYGSQMAFYENILATGHRTLNGEEADFFFVPVLDS 427

Query: 1469 CIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFSWDEG 1648
            CIITRADDAPHF ++++  LRSSFTLE YK+AYEHI E+Y YWN +SG+DHIWFFSWDEG
Sbjct: 428  CIITRADDAPHFNMKNHSRLRSSFTLEFYKRAYEHIVEKYPYWNRTSGRDHIWFFSWDEG 487

Query: 1649 ACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDLVLPA 1828
            ACYAPKEIWNSMMLVHWGNTNSKHNHSTTAY  DNWD ISD RRG HPCFDP KDLV+PA
Sbjct: 488  ACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYRDDNWDSISDERRGDHPCFDPRKDLVIPA 547

Query: 1829 WKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEFGSSP 2008
            WK PD  ++++N WAR RE+R  LFYFNGNLGPAY  GRPE +YSMGIRQK+AEEFGSSP
Sbjct: 548  WKLPDPYSVRANYWARPREKRKTLFYFNGNLGPAYAEGRPEDSYSMGIRQKLAEEFGSSP 607

Query: 2009 NKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIPVIIQ 2188
            N+EG+LGKQH  DVIVTPLRS +Y  D++ S+FCG FPGDGWSGRMEDSILQGC+PVIIQ
Sbjct: 608  NREGKLGKQHTEDVIVTPLRSDNYHKDISNSIFCGAFPGDGWSGRMEDSILQGCVPVIIQ 667

Query: 2189 DGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLYRDSI 2368
            DGIYLPYEN+LNYESFAVR+ ED+IPN++  LR F+ETEIQF+LANV+K+WQRFL+RDSI
Sbjct: 668  DGIYLPYENMLNYESFAVRVSEDDIPNLINTLRGFSETEIQFRLANVKKLWQRFLFRDSI 727

Query: 2369 LLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRR-QLQQLKKEFGL 2545
            LLEAERQ++SY H +DWA++ S+   DD+FAT IQ LH+KLHN+PWRR Q+    KE+GL
Sbjct: 728  LLEAERQKASYGHEEDWAVQFSKLKHDDIFATFIQTLHFKLHNDPWRREQVINRTKEYGL 787

Query: 2546 PKQC 2557
            P++C
Sbjct: 788  PQEC 791


>ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]
            gi|110740929|dbj|BAE98560.1| hypothetical protein
            [Arabidopsis thaliana] gi|332646160|gb|AEE79681.1|
            exostosin family protein [Arabidopsis thaliana]
          Length = 791

 Score = 1126 bits (2913), Expect = 0.0
 Identities = 515/788 (65%), Positives = 610/788 (77%), Gaps = 2/788 (0%)
 Frame = +2

Query: 200  MFIIQKGNCSYSLIGTIASIVALVSVVHLFFFNLGPSWDNFGIRQTQTSCLGTNGT-ALG 376
            MF  QK   S+S I T+AS++ LVS+VHLF   + PS+D+  +RQ Q  C  +N + +  
Sbjct: 1    MFSHQKWKFSWSQIATVASVIVLVSLVHLFLGPVVPSFDSITVRQAQNLCGPSNESISQV 60

Query: 377  GRDELPVEVPLDLDARFPPDLHNAVVYRGAPWKAEIGRWFSGCDSXXXXXXXXXXXNGKP 556
             ++     V +  D RFP D H AVVYR A WKAEIG+W S CD+            G+ 
Sbjct: 61   TKNSSQSLVVVAFDRRFPADSHGAVVYRNASWKAEIGQWLSSCDAVAKEVDIIEPIGGRK 120

Query: 557  CQNDCSGQGVCNGELGQCRCFHGFVGEGCAQQLELQCNYPGTPDLPNGRWVVSICSSHCD 736
            C +DCSGQGVCN E G CRCFHGF    C+Q+L L CNY  TP++P G+WVVSICS HCD
Sbjct: 121  CMSDCSGQGVCNHEFGLCRCFHGFTD--CSQKLRLDCNYEKTPEMPYGKWVVSICSRHCD 178

Query: 737  TMRAMCFCGEGTKYPNRPAPESCGFTIIMPTEPGAFKVTDWGKPDPNIFTTNASLPGWCN 916
            T RAMCFCGEGTKYPNRP PESCGF I  PT P   K+TDW KPD +I TTN+S  GWCN
Sbjct: 179  TTRAMCFCGEGTKYPNRPVPESCGFQINSPTNPDEPKMTDWSKPDLDILTTNSSKQGWCN 238

Query: 917  VDPIEAYAGKVKFKEECDCKYDCLVGQFCEIPVQCSCINXXXXXXXXXXXXXXXXXXWYG 1096
            VDP +AYA KVK KEECDCKYDCL G+FCEIPVQC+C+N                  W+G
Sbjct: 239  VDPEDAYAMKVKIKEECDCKYDCLWGRFCEIPVQCTCVNQCSGHGKCRGGFCQCDKGWFG 298

Query: 1097 VDCSIPSVFSSIKEWPRWLRPAQVDVPGSVKNLDNITSLSATVNKKRPLIYVYDLPPQFN 1276
             DCSIPS  S++ EWP+WLRPA ++VP       N+ +LSA V KKRPLIY+YDLPP FN
Sbjct: 299  TDCSIPSTLSTVGEWPQWLRPAHLEVPSEKNVPGNLINLSAVVKKKRPLIYIYDLPPDFN 358

Query: 1277 SKLLEGRHFKFECVNRIYDHQNKTVWTEQLYGAQMAIYESLLSSPHRTTNGEEADFFFVP 1456
            S L+EGRHFKFECVNRIYD +N TVWT+ LYG+QMA YE++L++ HRT NGEEADFFFVP
Sbjct: 359  SLLIEGRHFKFECVNRIYDERNATVWTDYLYGSQMAFYENILATAHRTMNGEEADFFFVP 418

Query: 1457 VLDSCIITRADDAPHFTLRDYQGLRSSFTLELYKKAYEHIAEQYTYWNHSSGKDHIWFFS 1636
            VLDSCII RADDAPH  ++++ GLRSS TLE YK+AYEHI E+Y YWN S+G+DHIWFFS
Sbjct: 419  VLDSCIINRADDAPHINMQNHTGLRSSLTLEFYKRAYEHIVEKYPYWNRSAGRDHIWFFS 478

Query: 1637 WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDMISDVRRGKHPCFDPSKDL 1816
            WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAY+ DNWD ISD RRG HPCFDP KDL
Sbjct: 479  WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYFGDNWDDISDERRGDHPCFDPRKDL 538

Query: 1817 VLPAWKRPDGPTLKSNLWARSREQRTKLFYFNGNLGPAYNNGRPEATYSMGIRQKVAEEF 1996
            V+PAWK PD  +++ N W R RE+R  LFYFNGNLGPAY  GRPE +YSMGIRQK+AEEF
Sbjct: 539  VIPAWKVPDPYSMRKNYWERPREKRKTLFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEF 598

Query: 1997 GSSPNKEGRLGKQHQNDVIVTPLRSSDYQGDLATSVFCGVFPGDGWSGRMEDSILQGCIP 2176
            GSSPNKEG+LGKQH  DVIVTPLRS +Y  D+A S+FCG FPGDGWSGRMEDSILQGC+P
Sbjct: 599  GSSPNKEGKLGKQHAEDVIVTPLRSDNYHKDIANSIFCGAFPGDGWSGRMEDSILQGCVP 658

Query: 2177 VIIQDGIYLPYENVLNYESFAVRIGEDEIPNMMKILRAFNETEIQFKLANVQKIWQRFLY 2356
            VIIQDGIYLPYEN+LNYESFAVR+ ED+IPN++  LR F+E EIQF+L NV+++WQRFL+
Sbjct: 659  VIIQDGIYLPYENMLNYESFAVRVNEDDIPNLINTLRGFSEAEIQFRLGNVKELWQRFLF 718

Query: 2357 RDSILLEAERQRSSYNHVDDWALELSQSSEDDVFATLIQVLHYKLHNEPWRR-QLQQLKK 2533
            RDSILLEAERQ+++Y H +DWA++ S+   DD+FAT+IQ LH+KLHN+PWRR Q     K
Sbjct: 719  RDSILLEAERQKATYGHEEDWAVQFSKLKHDDIFATIIQTLHFKLHNDPWRREQAVNRTK 778

Query: 2534 EFGLPKQC 2557
            ++GLP++C
Sbjct: 779  DYGLPQEC 786

