BLASTX nr result

ID: Mentha24_contig00031028 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00031028
         (1335 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU30534.1| hypothetical protein MIMGU_mgv1a017955mg, partial...   697   0.0  
emb|CBI29877.3| unnamed protein product [Vitis vinifera]              612   e-173
ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267...   612   e-173
ref|XP_007028839.1| Exostosin family protein isoform 1 [Theobrom...   610   e-172
ref|XP_006493902.1| PREDICTED: uncharacterized protein LOC102626...   606   e-171
ref|XP_006493901.1| PREDICTED: uncharacterized protein LOC102626...   606   e-171
ref|XP_006421449.1| hypothetical protein CICLE_v10004353mg [Citr...   605   e-170
ref|XP_007201939.1| hypothetical protein PRUPE_ppa001595mg [Prun...   602   e-169
ref|XP_006349551.1| PREDICTED: uncharacterized protein LOC102592...   598   e-168
ref|XP_003519065.1| PREDICTED: uncharacterized protein LOC100783...   598   e-168
ref|XP_002308967.2| exostosin family protein [Populus trichocarp...   596   e-168
ref|XP_004234838.1| PREDICTED: uncharacterized protein LOC101249...   587   e-165
ref|XP_004307567.1| PREDICTED: uncharacterized protein LOC101304...   586   e-165
ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807...   585   e-164
ref|XP_007145630.1| hypothetical protein PHAVU_007G255200g [Phas...   585   e-164
ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]...   583   e-164
ref|XP_006402860.1| hypothetical protein EUTSA_v10005794mg [Eutr...   582   e-163
ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata...   582   e-163
ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]...   573   e-161
ref|XP_004516917.1| PREDICTED: uncharacterized protein LOC101503...   572   e-160

>gb|EYU30534.1| hypothetical protein MIMGU_mgv1a017955mg, partial [Mimulus guttatus]
          Length = 475

 Score =  697 bits (1799), Expect = 0.0
 Identities = 314/425 (73%), Positives = 344/425 (80%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGS 240
            MFSLQKWKCSW                HLFLYP++P +DY S+ QA++SC+   GST G 
Sbjct: 1    MFSLQKWKCSWSLAATIASILALISVVHLFLYPVIPSMDYFSLRQAESSCITVTGSTEGG 60

Query: 241  EKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCDS 420
            EKY       EG  +N  K++ H  VDLN +Y AD HNAVTYRGAPWKAEIGRWLSGCDS
Sbjct: 61   EKYFPRTGSNEGTKDNA-KENVHRAVDLNVRYTADLHNAVTYRGAPWKAEIGRWLSGCDS 119

Query: 421  KVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDNL 600
               AV+IVEKIGG+ C++ECSGQG+CN DLG C CFHGFSGE CSERLQLNCNYP  D  
Sbjct: 120  NFSAVQIVEKIGGESCENECSGQGVCNHDLGQCRCFHGFSGEACSERLQLNCNYPGSDTE 179

Query: 601  PYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAAD 780
            PYGHWVVSIC  YCDTSRAMCFCGEGTKYPNRP  ESCGF IN PSEPG PR TDW   D
Sbjct: 180  PYGHWVVSICSTYCDTSRAMCFCGEGTKYPNRPAAESCGFVINPPSEPGAPRFTDWAIPD 239

Query: 781  RDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGHG 960
            +DIFTTNSS  GWCNVDPAE YA+NV+ KE+CDCKYDGLFGRFCET V SVCINQCSGHG
Sbjct: 240  QDIFTTNSSKEGWCNVDPAEAYASNVTFKEDCDCKYDGLFGRFCETTVSSVCINQCSGHG 299

Query: 961  HCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQK 1140
            +CRGGFC+CE GWYGVDCS+PSVLSSI EWP+WLRP+HI VPDS R  G L +LDAVVQK
Sbjct: 300  YCRGGFCQCENGWYGVDCSIPSVLSSITEWPKWLRPAHISVPDSKRDTGNLVSLDAVVQK 359

Query: 1141 KRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLASP 1320
            KRPLIYVYDLPPDFNSLLLEGRHFK ECVNR+YDHRN TIWT+QLYGAQMA YES+LASP
Sbjct: 360  KRPLIYVYDLPPDFNSLLLEGRHFKFECVNRIYDHRNGTIWTEQLYGAQMAIYESILASP 419

Query: 1321 HRTLN 1335
            +RTLN
Sbjct: 420  YRTLN 424


>emb|CBI29877.3| unnamed protein product [Vitis vinifera]
          Length = 822

 Score =  612 bits (1579), Expect = e-173
 Identities = 277/425 (65%), Positives = 317/425 (74%), Gaps = 1/425 (0%)
 Frame = +1

Query: 64   FSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGSE 243
            F LQKWKCSW                HLFL+PL P L+Y S+ Q Q +C P N S  G +
Sbjct: 31   FFLQKWKCSWSLLATVASVVALISVAHLFLFPLAPSLEYFSMGQGQKTCTPINASIRGVD 90

Query: 244  KYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCDSK 423
                     +G    P  D DH       ++PADSH +V YRGAPWKAEIGRW SGCDS 
Sbjct: 91   H--------DGKNLQPSFDLDH-------RFPADSHKSVVYRGAPWKAEIGRWFSGCDSI 135

Query: 424  VEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDNLP 603
               V I+EKIGGK CK++CSGQGICN +LG C CFHGFSGEGCSERL L+CNYP+    P
Sbjct: 136  AAEVSIIEKIGGKDCKNDCSGQGICNHELGQCRCFHGFSGEGCSERLHLDCNYPSSPEQP 195

Query: 604  YGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAADR 783
            YG WVVSICPA CDT+RAMCFCGEGTKYP+RPV E+CGF++N P+ PG P++ DW  AD 
Sbjct: 196  YGPWVVSICPASCDTTRAMCFCGEGTKYPHRPVAEACGFQMNLPTTPGDPKLVDWTKADL 255

Query: 784  D-IFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGHG 960
            D IFTTN S  GWCNVDP E YA  +  KEECDCKYD L GRFCE PV   C+NQCSGHG
Sbjct: 256  DNIFTTNDSKPGWCNVDPTEAYALKMQYKEECDCKYDCLLGRFCEIPVLCTCVNQCSGHG 315

Query: 961  HCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQK 1140
            HCRGGFC+C +GWYG DCS+PSVLSS+ EWP WLRP+H++VPD    +G+L NLDAVV+K
Sbjct: 316  HCRGGFCQCHRGWYGTDCSIPSVLSSVREWPRWLRPAHVEVPDDMHLSGSLVNLDAVVKK 375

Query: 1141 KRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLASP 1320
            KRPLIYVYDLPP+FNSLLLEGRHFK ECVNR+YD RNAT WT+QLYGAQMA YES+LASP
Sbjct: 376  KRPLIYVYDLPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYGAQMAIYESILASP 435

Query: 1321 HRTLN 1335
            HRTL+
Sbjct: 436  HRTLD 440


>ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267584 [Vitis vinifera]
          Length = 794

 Score =  612 bits (1579), Expect = e-173
 Identities = 277/425 (65%), Positives = 317/425 (74%), Gaps = 1/425 (0%)
 Frame = +1

Query: 64   FSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGSE 243
            F LQKWKCSW                HLFL+PL P L+Y S+ Q Q +C P N S  G +
Sbjct: 3    FFLQKWKCSWSLLATVASVVALISVAHLFLFPLAPSLEYFSMGQGQKTCTPINASIRGVD 62

Query: 244  KYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCDSK 423
                     +G    P  D DH       ++PADSH +V YRGAPWKAEIGRW SGCDS 
Sbjct: 63   H--------DGKNLQPSFDLDH-------RFPADSHKSVVYRGAPWKAEIGRWFSGCDSI 107

Query: 424  VEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDNLP 603
               V I+EKIGGK CK++CSGQGICN +LG C CFHGFSGEGCSERL L+CNYP+    P
Sbjct: 108  AAEVSIIEKIGGKDCKNDCSGQGICNHELGQCRCFHGFSGEGCSERLHLDCNYPSSPEQP 167

Query: 604  YGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAADR 783
            YG WVVSICPA CDT+RAMCFCGEGTKYP+RPV E+CGF++N P+ PG P++ DW  AD 
Sbjct: 168  YGPWVVSICPASCDTTRAMCFCGEGTKYPHRPVAEACGFQMNLPTTPGDPKLVDWTKADL 227

Query: 784  D-IFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGHG 960
            D IFTTN S  GWCNVDP E YA  +  KEECDCKYD L GRFCE PV   C+NQCSGHG
Sbjct: 228  DNIFTTNDSKPGWCNVDPTEAYALKMQYKEECDCKYDCLLGRFCEIPVLCTCVNQCSGHG 287

Query: 961  HCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQK 1140
            HCRGGFC+C +GWYG DCS+PSVLSS+ EWP WLRP+H++VPD    +G+L NLDAVV+K
Sbjct: 288  HCRGGFCQCHRGWYGTDCSIPSVLSSVREWPRWLRPAHVEVPDDMHLSGSLVNLDAVVKK 347

Query: 1141 KRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLASP 1320
            KRPLIYVYDLPP+FNSLLLEGRHFK ECVNR+YD RNAT WT+QLYGAQMA YES+LASP
Sbjct: 348  KRPLIYVYDLPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYGAQMAIYESILASP 407

Query: 1321 HRTLN 1335
            HRTL+
Sbjct: 408  HRTLD 412


>ref|XP_007028839.1| Exostosin family protein isoform 1 [Theobroma cacao]
            gi|590636390|ref|XP_007028840.1| Exostosin family protein
            isoform 1 [Theobroma cacao] gi|508717444|gb|EOY09341.1|
            Exostosin family protein isoform 1 [Theobroma cacao]
            gi|508717445|gb|EOY09342.1| Exostosin family protein
            isoform 1 [Theobroma cacao]
          Length = 794

 Score =  610 bits (1574), Expect = e-172
 Identities = 277/433 (63%), Positives = 322/433 (74%), Gaps = 7/433 (1%)
 Frame = +1

Query: 58   VMFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGG 237
            +MFS+QKWKCSW                HLFL+P+VP  DY    Q Q  C+P N S   
Sbjct: 1    MMFSVQKWKCSWSLVATVASVIVPVSVVHLFLFPVVPSFDYFRAPQVQYKCVPINASV-- 58

Query: 238  SEKYVNGMEPKEGLTENPEKDSDH------PVVDLNAQYPADSHNAVTYRGAPWKAEIGR 399
                              EK +DH      P +DL+ ++P+D HN V Y  APWKAEIG+
Sbjct: 59   ------------------EKVADHVWENIQPGLDLDHRFPSDLHNGVVYHNAPWKAEIGQ 100

Query: 400  WLSGCDSKVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCN 579
            WLS CD+    V IVE IGG+RCK +CSGQG+CN ++G C CFHGFSGE CSER+ L+CN
Sbjct: 101  WLSSCDAIAREVNIVETIGGRRCKADCSGQGVCNHEMGQCRCFHGFSGEECSERVHLSCN 160

Query: 580  YPAEDNLPYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRM 759
            YP    LPYG WVVSICPA+CDT+RAMCFCGEGTKYPNRPV E+CGF++N PSEPGGP++
Sbjct: 161  YPKTPELPYGRWVVSICPAHCDTTRAMCFCGEGTKYPNRPVAEACGFQMNLPSEPGGPKL 220

Query: 760  TDWGAADRD-IFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVC 936
            TDW  AD D IFTTN S  GWCNVDP   YA+ V  KEECDCKYDGL+GRFCE PVESVC
Sbjct: 221  TDWSKADLDNIFTTNGSKPGWCNVDPDAAYASKVLFKEECDCKYDGLWGRFCEVPVESVC 280

Query: 937  INQCSGHGHCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLS 1116
            INQCSGHGHCRGGFC+C  GWYG DCS+PSV+S +GEWP+WLRP+ +D+P S    G+L 
Sbjct: 281  INQCSGHGHCRGGFCQCYNGWYGTDCSIPSVVSPMGEWPKWLRPAQVDIP-SIEHTGSLV 339

Query: 1117 NLDAVVQKKRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMAT 1296
            NLDA V+KKRPLIYVYDLPP+FNSLLLEGRHFK ECVNR+YD RNAT+WTDQLYG+QMA 
Sbjct: 340  NLDAAVKKKRPLIYVYDLPPEFNSLLLEGRHFKFECVNRIYDDRNATLWTDQLYGSQMAL 399

Query: 1297 YESMLASPHRTLN 1335
            YES+LASP+RTLN
Sbjct: 400  YESILASPYRTLN 412


>ref|XP_006493902.1| PREDICTED: uncharacterized protein LOC102626477 isoform X2 [Citrus
            sinensis]
          Length = 697

 Score =  606 bits (1563), Expect = e-171
 Identities = 270/427 (63%), Positives = 322/427 (75%), Gaps = 2/427 (0%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSI-SQAQNSCLPTNGSTGG 237
            M S++KW+ SW                HLFL+PLVP  DY +   Q QNSC+P       
Sbjct: 1    MISIEKWRFSWTLVATVASVLTLVSVVHLFLFPLVPSFDYFTARQQIQNSCVPIK----- 55

Query: 238  SEKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCD 417
                    E  EG+T    ++S  P ++L+ ++PAD HNAV YR APWKAEIGRWLSGCD
Sbjct: 56   --------ESAEGVTNRVWENSP-PQLNLDHRFPADLHNAVVYRNAPWKAEIGRWLSGCD 106

Query: 418  SKVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDN 597
            S  + V++VE IGGK CK +CSGQG+CN +LG C CFHGF G+GCSER+   CN+P    
Sbjct: 107  SVAKEVDLVEMIGGKSCKSDCSGQGVCNHELGQCRCFHGFRGKGCSERIHFQCNFPKTPE 166

Query: 598  LPYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAA 777
            LPYG WVVSICP +CDT+RAMCFCGEGTKYPNRPV E+CGF++N PS+PG P+ TDW  A
Sbjct: 167  LPYGRWVVSICPTHCDTTRAMCFCGEGTKYPNRPVAEACGFQVNLPSQPGAPKSTDWAKA 226

Query: 778  DRD-IFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSG 954
            D D IFTTN S  GWCNVDP E YA  V  KEECDCKYDGL G+FCE PV S C+NQCSG
Sbjct: 227  DLDNIFTTNGSKPGWCNVDPEEAYALKVQFKEECDCKYDGLLGQFCEVPVSSTCVNQCSG 286

Query: 955  HGHCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVV 1134
            HGHCRGGFC+C+ GWYGVDCS+PSV+SS+ EWP+WLRP+HID+P +A   G L NL+AVV
Sbjct: 287  HGHCRGGFCQCDNGWYGVDCSIPSVMSSMSEWPQWLRPAHIDIPINANITGNLVNLNAVV 346

Query: 1135 QKKRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLA 1314
            +KKRPL+YVYDLPP+FNSLLLEGRH+KLECVNR+Y+ +N T+WTD LYG+QMA YES+LA
Sbjct: 347  KKKRPLVYVYDLPPEFNSLLLEGRHYKLECVNRIYNEKNETLWTDMLYGSQMAFYESILA 406

Query: 1315 SPHRTLN 1335
            SPHRTLN
Sbjct: 407  SPHRTLN 413


>ref|XP_006493901.1| PREDICTED: uncharacterized protein LOC102626477 isoform X1 [Citrus
            sinensis]
          Length = 791

 Score =  606 bits (1563), Expect = e-171
 Identities = 270/427 (63%), Positives = 322/427 (75%), Gaps = 2/427 (0%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSI-SQAQNSCLPTNGSTGG 237
            M S++KW+ SW                HLFL+PLVP  DY +   Q QNSC+P       
Sbjct: 1    MISIEKWRFSWTLVATVASVLTLVSVVHLFLFPLVPSFDYFTARQQIQNSCVPIK----- 55

Query: 238  SEKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCD 417
                    E  EG+T    ++S  P ++L+ ++PAD HNAV YR APWKAEIGRWLSGCD
Sbjct: 56   --------ESAEGVTNRVWENSP-PQLNLDHRFPADLHNAVVYRNAPWKAEIGRWLSGCD 106

Query: 418  SKVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDN 597
            S  + V++VE IGGK CK +CSGQG+CN +LG C CFHGF G+GCSER+   CN+P    
Sbjct: 107  SVAKEVDLVEMIGGKSCKSDCSGQGVCNHELGQCRCFHGFRGKGCSERIHFQCNFPKTPE 166

Query: 598  LPYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAA 777
            LPYG WVVSICP +CDT+RAMCFCGEGTKYPNRPV E+CGF++N PS+PG P+ TDW  A
Sbjct: 167  LPYGRWVVSICPTHCDTTRAMCFCGEGTKYPNRPVAEACGFQVNLPSQPGAPKSTDWAKA 226

Query: 778  DRD-IFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSG 954
            D D IFTTN S  GWCNVDP E YA  V  KEECDCKYDGL G+FCE PV S C+NQCSG
Sbjct: 227  DLDNIFTTNGSKPGWCNVDPEEAYALKVQFKEECDCKYDGLLGQFCEVPVSSTCVNQCSG 286

Query: 955  HGHCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVV 1134
            HGHCRGGFC+C+ GWYGVDCS+PSV+SS+ EWP+WLRP+HID+P +A   G L NL+AVV
Sbjct: 287  HGHCRGGFCQCDNGWYGVDCSIPSVMSSMSEWPQWLRPAHIDIPINANITGNLVNLNAVV 346

Query: 1135 QKKRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLA 1314
            +KKRPL+YVYDLPP+FNSLLLEGRH+KLECVNR+Y+ +N T+WTD LYG+QMA YES+LA
Sbjct: 347  KKKRPLVYVYDLPPEFNSLLLEGRHYKLECVNRIYNEKNETLWTDMLYGSQMAFYESILA 406

Query: 1315 SPHRTLN 1335
            SPHRTLN
Sbjct: 407  SPHRTLN 413


>ref|XP_006421449.1| hypothetical protein CICLE_v10004353mg [Citrus clementina]
            gi|557523322|gb|ESR34689.1| hypothetical protein
            CICLE_v10004353mg [Citrus clementina]
          Length = 791

 Score =  605 bits (1559), Expect = e-170
 Identities = 268/427 (62%), Positives = 323/427 (75%), Gaps = 2/427 (0%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSI-SQAQNSCLPTNGSTGG 237
            M S++KW+ SW                HLFL+PLVP  DY +   Q QNSC+P       
Sbjct: 1    MISIEKWRFSWTLVATVASVLTLVSVVHLFLFPLVPSFDYFTARQQIQNSCVPIK----- 55

Query: 238  SEKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCD 417
                    E  EG+T    ++S  P ++L+ ++PAD HNAV YR APWKAEIGRWLSGCD
Sbjct: 56   --------ESAEGVTNRVWENSP-PQLNLDHRFPADLHNAVVYRNAPWKAEIGRWLSGCD 106

Query: 418  SKVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDN 597
            S  + V++VE IGGK CK +CSGQG+CN +LG C CFHGF G+GCSER+   CN+P    
Sbjct: 107  SVAKEVDLVEMIGGKSCKSDCSGQGVCNHELGQCRCFHGFRGKGCSERIHFQCNFPKTPE 166

Query: 598  LPYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAA 777
            LPYG WVVSICP +CDT+RAMCFCGEGTKYPNRPV E+CGF++N PS+PG P++T+W  A
Sbjct: 167  LPYGRWVVSICPTHCDTTRAMCFCGEGTKYPNRPVAEACGFQVNLPSQPGAPKLTNWAKA 226

Query: 778  DRD-IFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSG 954
            D D IFTTN S  GWCN+DP E YA  V  KEECDCKYDGL G+FCE PV S C+NQCSG
Sbjct: 227  DLDNIFTTNGSKPGWCNIDPKEAYALKVQFKEECDCKYDGLLGQFCEVPVSSTCVNQCSG 286

Query: 955  HGHCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVV 1134
            HGHCRGGFC+C+ GWYGVDCS+PSV+SS+ EWP+WLRP+HID+P +A   G L NL+AVV
Sbjct: 287  HGHCRGGFCQCDNGWYGVDCSIPSVMSSMSEWPQWLRPAHIDIPINANITGNLVNLNAVV 346

Query: 1135 QKKRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLA 1314
            +KKRPL+YVYDLPP+FNSLLLEGRH+KLECVNR+Y+ +N T+WTD LYG+QMA YES+LA
Sbjct: 347  KKKRPLLYVYDLPPEFNSLLLEGRHYKLECVNRIYNEKNETLWTDMLYGSQMAFYESILA 406

Query: 1315 SPHRTLN 1335
            SPHRTLN
Sbjct: 407  SPHRTLN 413


>ref|XP_007201939.1| hypothetical protein PRUPE_ppa001595mg [Prunus persica]
            gi|462397470|gb|EMJ03138.1| hypothetical protein
            PRUPE_ppa001595mg [Prunus persica]
          Length = 795

 Score =  602 bits (1552), Expect = e-169
 Identities = 275/431 (63%), Positives = 324/431 (75%), Gaps = 6/431 (1%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXX-----HLFLYPLVPPLDYLSISQAQNSCLPTNG 225
            M S+QKWKCSW                     HLF +PLVP  +Y S  QAQNSC+P NG
Sbjct: 1    MLSIQKWKCSWSQIATIASIVALASIILGSIVHLFWFPLVPSFNYFS--QAQNSCVPING 58

Query: 226  STGGSEKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWL 405
            S              E + +N  K +  P +DL+ Q+P+D H AV +RGAPWKAEIGRWL
Sbjct: 59   SA-------------EAVIDNV-KGNFKPPIDLDRQFPSDLHKAVVFRGAPWKAEIGRWL 104

Query: 406  SGCDSKVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYP 585
            SGCD   + V IVE IGG  CK++CSGQG+CNR+LG C C+HG+SGEGCSERLQL CNYP
Sbjct: 105  SGCDPISDEVNIVEVIGGSGCKNDCSGQGVCNRELGQCRCYHGYSGEGCSERLQLECNYP 164

Query: 586  AEDNLPYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTD 765
               + PYG WVVSIC A+CDT+RA CFCGEGTKYPNRPV E+CGF++  PSEPG P++TD
Sbjct: 165  GSPDQPYGRWVVSICSAHCDTTRAFCFCGEGTKYPNRPVAEACGFQVQLPSEPGAPKLTD 224

Query: 766  WGAADRD-IFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCIN 942
            W  AD D +FT N S  GWCNVDPAE YA  V  KEECDCKYD  +GRFCE PV   CIN
Sbjct: 225  WAKADLDNVFTKNGSKPGWCNVDPAEVYAHKVQFKEECDCKYDCFWGRFCEVPVLCTCIN 284

Query: 943  QCSGHGHCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNL 1122
            QCSGHGHCRGGFC+C+ GWYG+DCS+PSV SS+ EWP+WLRP+ +DVPDS+   G + NL
Sbjct: 285  QCSGHGHCRGGFCQCDNGWYGIDCSIPSVTSSVREWPQWLRPAQVDVPDSSHLPGKVVNL 344

Query: 1123 DAVVQKKRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYE 1302
            +AVV+KKRPLIYVYDLPPDFNSLLLEGRHF+LECVNR+YD +N+T+WTDQLYGAQ+A YE
Sbjct: 345  NAVVKKKRPLIYVYDLPPDFNSLLLEGRHFRLECVNRIYDGKNSTLWTDQLYGAQVALYE 404

Query: 1303 SMLASPHRTLN 1335
            S+LASP+RTLN
Sbjct: 405  SILASPYRTLN 415


>ref|XP_006349551.1| PREDICTED: uncharacterized protein LOC102592127 [Solanum tuberosum]
          Length = 790

 Score =  598 bits (1543), Expect = e-168
 Identities = 268/426 (62%), Positives = 319/426 (74%)
 Frame = +1

Query: 58   VMFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGG 237
            +M+  QK  CSW                HLFLYP+VP LDY    Q +NSC+P N     
Sbjct: 1    MMWFKQKRMCSWSSVTIIASIVTLVSVVHLFLYPVVPSLDYFR--QYKNSCIPINS---- 54

Query: 238  SEKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCD 417
                          T++ +   ++ ++    ++P D HN V YRGAPWK ++G+WL+GCD
Sbjct: 55   --------------TKSTQPTHNNIIISNQTKFPLDLHNGVVYRGAPWKNQVGQWLAGCD 100

Query: 418  SKVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDN 597
            S    ++++E IGGK C+++CSGQGICNR+LG C CFHGF+GE C+ER +L+CNYP    
Sbjct: 101  SITSPLKVIEHIGGKSCRNDCSGQGICNRELGQCRCFHGFTGEECAERQELSCNYPRSKE 160

Query: 598  LPYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAA 777
             P+GHWVVSICPAYCDT+RAMCFCGEGTKYPNRPVPE+CGF IN PS+PGG  +TD+  A
Sbjct: 161  KPFGHWVVSICPAYCDTTRAMCFCGEGTKYPNRPVPETCGFTINPPSKPGGAPVTDFTKA 220

Query: 778  DRDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGH 957
            D D+FTTN S RGWCNVDP E YA+ V  KEECDCKYDGL+GRFCE  V S CINQCSGH
Sbjct: 221  DLDVFTTNGSKRGWCNVDPEEAYASKVLFKEECDCKYDGLWGRFCEVSVLSTCINQCSGH 280

Query: 958  GHCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQ 1137
            G CRGGFC+C+ GW+G DCSVPSVLSSI EWP WLRP+ + VP++  S G L NLDA+V+
Sbjct: 281  GLCRGGFCQCDSGWFGTDCSVPSVLSSIREWPLWLRPAQVTVPENVNSNGNLINLDAIVE 340

Query: 1138 KKRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLAS 1317
            KKRPLIYVYDLPPDFNSLLLEGRHFKLEC+NR+YD RNAT+WTDQLYGAQMA YESMLAS
Sbjct: 341  KKRPLIYVYDLPPDFNSLLLEGRHFKLECINRIYDQRNATVWTDQLYGAQMALYESMLAS 400

Query: 1318 PHRTLN 1335
            PHRTLN
Sbjct: 401  PHRTLN 406


>ref|XP_003519065.1| PREDICTED: uncharacterized protein LOC100783624 [Glycine max]
          Length = 795

 Score =  598 bits (1543), Expect = e-168
 Identities = 267/426 (62%), Positives = 314/426 (73%), Gaps = 1/426 (0%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGS 240
            +FS+ KW+CSW                HLFL+PL P  +Y  I  AQ+SC PTN S    
Sbjct: 8    LFSMNKWRCSWSLAATIASVVALVSVVHLFLFPLTPTFNYFKI--AQDSCFPTNASA--- 62

Query: 241  EKYVNGMEPKEGLTENPE-KDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCD 417
                          E P  +D + P VD   Q+PAD H A  Y+GAPWKAEIG+WL+GCD
Sbjct: 63   --------------EFPSNRDQEWPAVDFKRQFPADLHGAFVYQGAPWKAEIGQWLAGCD 108

Query: 418  SKVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDN 597
            S ++ V I E IGG  CK +CSGQG+CN +LG C CFHG+SG+GC+E+LQL CN+    +
Sbjct: 109  SVIKEVNITEIIGGNNCKKDCSGQGVCNLELGQCRCFHGYSGDGCTEKLQLQCNFLGSPD 168

Query: 598  LPYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAA 777
             P+G WVVSICPA CD +RAMCFCGEGTKYPNRP+ E+CGF+ N PSEP GPR+ +W   
Sbjct: 169  QPFGRWVVSICPANCDKTRAMCFCGEGTKYPNRPLAETCGFQFNPPSEPDGPRIVNWTKI 228

Query: 778  DRDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGH 957
            D+D+FTTN SI GWCNVDPAE YA    IKEECDCKYDGL GR CE PVESVCINQCSGH
Sbjct: 229  DQDVFTTNRSIPGWCNVDPAEAYAGKAKIKEECDCKYDGLAGRLCEVPVESVCINQCSGH 288

Query: 958  GHCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQ 1137
            GHCRGGFC+C+ GWYGVDCS+PSV+SSI EWP WLRP+ ID+ D   +   + NL+AVV 
Sbjct: 289  GHCRGGFCQCDNGWYGVDCSMPSVISSIKEWPSWLRPARIDIADDTHANEKMINLNAVVA 348

Query: 1138 KKRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLAS 1317
            KKRPL+YVYDLPP+FNSLLLEGRHFKLECVNR+YD  N T+WTDQLYGAQ+A YES+LAS
Sbjct: 349  KKRPLVYVYDLPPEFNSLLLEGRHFKLECVNRIYDGNNITVWTDQLYGAQIALYESLLAS 408

Query: 1318 PHRTLN 1335
            PHRTLN
Sbjct: 409  PHRTLN 414


>ref|XP_002308967.2| exostosin family protein [Populus trichocarpa]
            gi|550335517|gb|EEE92490.2| exostosin family protein
            [Populus trichocarpa]
          Length = 793

 Score =  596 bits (1537), Expect = e-168
 Identities = 270/425 (63%), Positives = 311/425 (73%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGS 240
            M ++ KWKCSW                HLFL+P+VP  D  S+ Q Q+SC P N S  G 
Sbjct: 1    MITISKWKCSWSLMATIASIVALVSVVHLFLFPVVPSFDPFSVWQVQDSCGPNNESVDGR 60

Query: 241  EKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCDS 420
                 G +P           +  PV+DL  ++PAD H AV YR APWKAEIGRWLSGCD+
Sbjct: 61   ----TGHDP----------GNLQPVLDLEHKFPADLHRAVFYRNAPWKAEIGRWLSGCDA 106

Query: 421  KVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDNL 600
              + V +VE I G+ CK++CSGQG+CN +LG C CFHGFSGEGCSERL L CNYP    L
Sbjct: 107  VTKEVSVVETISGRSCKNDCSGQGVCNYELGQCRCFHGFSGEGCSERLHLECNYPKSPEL 166

Query: 601  PYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAAD 780
            PYG WVVSIC A+CD +RAMCFCGEGTKYPNRP  E+CGF+++ PSE G PR  DW   D
Sbjct: 167  PYGRWVVSICSAHCDPTRAMCFCGEGTKYPNRPAAETCGFQLSLPSEIGAPRQVDWAKPD 226

Query: 781  RDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGHG 960
             DI+TTN S  GWCNVDPAEGYA  V  KEECDCKYD L GRFCE PV+  CINQCSGHG
Sbjct: 227  LDIYTTNKSKLGWCNVDPAEGYANKVKFKEECDCKYDCLSGRFCEVPVQCSCINQCSGHG 286

Query: 961  HCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQK 1140
            HCRGGFC+C  GWYG DCS+PSV SS+ EWP WLRP+ +DVPD+A   G L +L+AVV+K
Sbjct: 287  HCRGGFCQCANGWYGTDCSIPSVTSSVREWPRWLRPAQLDVPDNAHLTGKLVDLNAVVKK 346

Query: 1141 KRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLASP 1320
            KRPLIY+YDLPP FNSLLLEGRHFK ECVNR+Y+  NATIWTDQLYGAQMA YES+LASP
Sbjct: 347  KRPLIYIYDLPPKFNSLLLEGRHFKFECVNRLYNDNNATIWTDQLYGAQMALYESILASP 406

Query: 1321 HRTLN 1335
            +RTLN
Sbjct: 407  YRTLN 411


>ref|XP_004234838.1| PREDICTED: uncharacterized protein LOC101249053 [Solanum
            lycopersicum]
          Length = 785

 Score =  587 bits (1514), Expect = e-165
 Identities = 264/426 (61%), Positives = 312/426 (73%)
 Frame = +1

Query: 58   VMFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGG 237
            +M   QK   SW                HLF YP VP  DY    Q QNSC+P N +   
Sbjct: 1    MMLFNQKRMFSWSTVTIIVLIVTLVSVVHLFFYPFVPSFDYFR--QYQNSCIPINST--- 55

Query: 238  SEKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCD 417
                               K + + ++    ++  D HN V YRGAPWK E+G+WL+GCD
Sbjct: 56   -------------------KSTHNNIISNQTKFAVDLHNGVVYRGAPWKNEVGQWLAGCD 96

Query: 418  SKVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDN 597
            S   AV+++E+IGGK C+++CSGQGICNR+LG C CFHGF+GE C+ER +L+CNYP    
Sbjct: 97   SVTSAVKVIEQIGGKSCRNDCSGQGICNRELGQCRCFHGFTGEECAERQELSCNYPRSKE 156

Query: 598  LPYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAA 777
             P+GHWVVSICPAYCDT+RAMCFCG+GTKYPNRP+ E+CGF IN PS+PGG  +TD+  A
Sbjct: 157  KPFGHWVVSICPAYCDTTRAMCFCGDGTKYPNRPLAETCGFTINPPSKPGGAPVTDFTKA 216

Query: 778  DRDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGH 957
            D D+FTTN S RGWCNVDP E YA+ V  KEECDCKYDGL+GRFCE  V S CINQCSGH
Sbjct: 217  DLDVFTTNGSKRGWCNVDPEEAYASKVLFKEECDCKYDGLWGRFCEVSVLSTCINQCSGH 276

Query: 958  GHCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQ 1137
            G CRGGFC+C+ GW+G DCSVPSVLSSI EWP WLRP+ + VP++  S G L NLDA+V+
Sbjct: 277  GLCRGGFCQCDSGWFGTDCSVPSVLSSIREWPLWLRPAQVTVPENVNSKGNLVNLDAIVE 336

Query: 1138 KKRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLAS 1317
            KKRPL+YVYDLPPDFNSLLLEGRHFKLEC+NR+YD RNAT+WTDQLYGAQMA YESMLAS
Sbjct: 337  KKRPLLYVYDLPPDFNSLLLEGRHFKLECINRIYDQRNATVWTDQLYGAQMAIYESMLAS 396

Query: 1318 PHRTLN 1335
            PHRTLN
Sbjct: 397  PHRTLN 402


>ref|XP_004307567.1| PREDICTED: uncharacterized protein LOC101304329 [Fragaria vesca
            subsp. vesca]
          Length = 791

 Score =  586 bits (1510), Expect = e-165
 Identities = 266/431 (61%), Positives = 313/431 (72%), Gaps = 6/431 (1%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXX-----HLFLYPLVPPLDYLSISQAQNSCLPTNG 225
            MFS+ +WK SW                     HLF +PLVP  +Y S  QAQNSC+P NG
Sbjct: 1    MFSILRWKGSWSMIATIASIVGLISLALASIVHLFFFPLVPSFNYFS--QAQNSCVPING 58

Query: 226  STGGSEKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWL 405
            S      ++ G                   +DL  Q+P+D H AV YRGAPWKAEIGRWL
Sbjct: 59   SAEAITDHIKG-------------------IDLEYQFPSDLHKAVVYRGAPWKAEIGRWL 99

Query: 406  SGCDSKVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYP 585
            +GC S    V IVE IGG  CK++CSGQG+CNR+LG C CFHG+SGEGCSE LQL CNYP
Sbjct: 100  AGCLSITNEVNIVELIGGSGCKNDCSGQGVCNRELGQCRCFHGYSGEGCSETLQLECNYP 159

Query: 586  AEDNLPYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTD 765
               + PYG WVVSIC A+CDT +AMCFCGEGTKYPNRPV E+CGF++  PS+PG P++TD
Sbjct: 160  GSPDQPYGRWVVSICSAHCDTKKAMCFCGEGTKYPNRPVAEACGFQVKPPSKPGAPKLTD 219

Query: 766  WGAADRD-IFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCIN 942
            W  AD D + TTNSS  GWCNVDPAE YA  V  K+ECDCKYD L GRFCE PV   CIN
Sbjct: 220  WEKADLDNLLTTNSSKPGWCNVDPAEAYALKVQFKQECDCKYDCLLGRFCEVPVLCTCIN 279

Query: 943  QCSGHGHCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNL 1122
            QCSGHGHCRGGFC+C  GWYG+DCS+PSV SS+ EWP+WLRP+ +++PD++   G + NL
Sbjct: 280  QCSGHGHCRGGFCQCNNGWYGIDCSIPSVASSVREWPQWLRPAQVNIPDNSHLTGKVVNL 339

Query: 1123 DAVVQKKRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYE 1302
            +AVV+KKRPLIYVYDLPPDFNSLLLEGRHFK ECVNR+YD  N+T+WTD LYG+QMA YE
Sbjct: 340  NAVVKKKRPLIYVYDLPPDFNSLLLEGRHFKFECVNRIYDDLNSTVWTDMLYGSQMALYE 399

Query: 1303 SMLASPHRTLN 1335
            S+LASP+RTLN
Sbjct: 400  SILASPYRTLN 410


>ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807663 [Glycine max]
          Length = 795

 Score =  585 bits (1508), Expect = e-164
 Identities = 260/425 (61%), Positives = 307/425 (72%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGS 240
            +FS+ KW+CSW                HLFL+PL P  +Y  I  AQ+SC PTN S    
Sbjct: 8    LFSMNKWRCSWSLAATIASVVALVSVVHLFLFPLTPTFNYFKI--AQDSCFPTNASAEFP 65

Query: 241  EKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCDS 420
              +                D + P VD   Q+PAD H A  Y G PWKAEIG+WL+GCDS
Sbjct: 66   SNH----------------DQERPAVDFKHQFPADLHGAFVYHGVPWKAEIGQWLAGCDS 109

Query: 421  KVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDNL 600
             ++ V I E IGG  CK++CSGQGICNR LG C CFHG+SG+GC++ LQL CN+    + 
Sbjct: 110  VIKDVNITEIIGGINCKNDCSGQGICNRQLGQCRCFHGYSGDGCTKNLQLECNFLGSPDQ 169

Query: 601  PYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAAD 780
            P+G WVVSICPA CD +RAMCFCGEG KYPNRP+ E+CGF+ + PSEP GPR+ +W   D
Sbjct: 170  PFGRWVVSICPANCDKTRAMCFCGEGAKYPNRPLAETCGFQFDPPSEPDGPRIVNWTKID 229

Query: 781  RDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGHG 960
            +D+FTTN SI GWCNVDPAE YA    +KEECDCKYDGL GRFCE PVESVCINQCSGHG
Sbjct: 230  QDVFTTNRSIPGWCNVDPAEAYAGKAKVKEECDCKYDGLAGRFCEVPVESVCINQCSGHG 289

Query: 961  HCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQK 1140
            HCRGGFC+   GWYGVDCS+PSV+SSI EWP WLRP+ I + D   +   + NL+AVV K
Sbjct: 290  HCRGGFCQVSAGWYGVDCSMPSVISSIKEWPSWLRPARIHIADDTHANEKMINLNAVVAK 349

Query: 1141 KRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLASP 1320
            KRPL+YVYDLPP+FNSLLLEGRH+KLECVNR+YD  N T+WTDQLYGAQ+A YES+LASP
Sbjct: 350  KRPLVYVYDLPPEFNSLLLEGRHYKLECVNRIYDDNNITVWTDQLYGAQIALYESLLASP 409

Query: 1321 HRTLN 1335
            HRTLN
Sbjct: 410  HRTLN 414


>ref|XP_007145630.1| hypothetical protein PHAVU_007G255200g [Phaseolus vulgaris]
            gi|561018820|gb|ESW17624.1| hypothetical protein
            PHAVU_007G255200g [Phaseolus vulgaris]
          Length = 795

 Score =  585 bits (1507), Expect = e-164
 Identities = 261/426 (61%), Positives = 311/426 (73%), Gaps = 1/426 (0%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGS 240
            + S  KW+CSW                HLF++PL P  +Y  I  A++SC+  N S    
Sbjct: 8    LLSKNKWRCSWSLAVTIASVVALVSVVHLFMFPLTPTFNYFKI--AKDSCIQANASA--- 62

Query: 241  EKYVNGMEPKEGLTENPE-KDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCD 417
                          E P  +D + P VD   Q+PAD H +V Y+GAPWKAEIG WL+ CD
Sbjct: 63   --------------EFPSNRDQEQPAVDFKLQFPADLHGSVVYQGAPWKAEIGHWLAACD 108

Query: 418  SKVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDN 597
            S ++ V I E IG   CK++CSGQG+CNR+LG C CFHG+SG+GC+E+ QL CNY    +
Sbjct: 109  SVIKEVNITEIIGVNNCKNDCSGQGVCNRELGQCRCFHGYSGDGCTEQRQLECNYEGSPD 168

Query: 598  LPYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAA 777
            L +G WVVSICPA CD +RAMCFCGEGTKYPNRP+ E+CGF+   PSEP GP++ +W   
Sbjct: 169  LQFGRWVVSICPANCDKTRAMCFCGEGTKYPNRPLAETCGFQYIPPSEPDGPKIVNWTKI 228

Query: 778  DRDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGH 957
            D+D+FTTN SIRGWCNVDPA+ YA    IKEECDCKYDGL GR CE PVESVCINQCS H
Sbjct: 229  DQDVFTTNGSIRGWCNVDPADAYAGKAKIKEECDCKYDGLSGRLCEVPVESVCINQCSRH 288

Query: 958  GHCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQ 1137
            GHCRGGFC+C+KGWYGVDCS+PS +SSI EWP WLRP+ ID+ D   + G + NL+AVV 
Sbjct: 289  GHCRGGFCQCDKGWYGVDCSMPSAISSIIEWPSWLRPARIDIVDDTHANGKMINLNAVVA 348

Query: 1138 KKRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLAS 1317
            KKRPLIYVYDLPP+FNSLLLEGRHFKLECVNR+YD +N TIWTDQLYGAQMA YES+LAS
Sbjct: 349  KKRPLIYVYDLPPEFNSLLLEGRHFKLECVNRIYDDKNVTIWTDQLYGAQMALYESLLAS 408

Query: 1318 PHRTLN 1335
            PHRT+N
Sbjct: 409  PHRTVN 414


>ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]
            gi|44917463|gb|AAS49056.1| At3g57630 [Arabidopsis
            thaliana] gi|46931284|gb|AAT06446.1| At3g57630
            [Arabidopsis thaliana] gi|332646159|gb|AEE79680.1|
            exostosin family protein [Arabidopsis thaliana]
            gi|591401994|gb|AHL38724.1| glycosyltransferase, partial
            [Arabidopsis thaliana]
          Length = 793

 Score =  583 bits (1502), Expect = e-164
 Identities = 257/425 (60%), Positives = 314/425 (73%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGS 240
            MFS QKWK SW                HLFL P+VP  D +++ QAQN C P+N S    
Sbjct: 1    MFSHQKWKFSWSQIATVASVIVLVSLVHLFLGPVVPSFDSITVRQAQNLCGPSNESISQ- 59

Query: 241  EKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCDS 420
                        +T+N  +     VV  + ++PADSH AV YR A WKAEIG+WLS CD+
Sbjct: 60   ------------VTKNSSQSL--VVVAFDRRFPADSHGAVVYRNASWKAEIGQWLSSCDA 105

Query: 421  KVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDNL 600
              + V+I+E IGG++C  +CSGQG+CN + GLC CFHGF+GE CS++L+L+CNY     +
Sbjct: 106  VAKEVDIIEPIGGRKCMSDCSGQGVCNHEFGLCRCFHGFTGEDCSQKLRLDCNYEKTPEM 165

Query: 601  PYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAAD 780
            PYG WVVSIC  +CDT+RAMCFCGEGTKYPNRPVPESCGF+IN P+ P  P+MTDW   D
Sbjct: 166  PYGKWVVSICSRHCDTTRAMCFCGEGTKYPNRPVPESCGFQINSPTNPDEPKMTDWSKPD 225

Query: 781  RDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGHG 960
             DI TTNSS +GWCNVDP + YA  V IKEECDCKYD L+GRFCE PV+  C+NQCSGHG
Sbjct: 226  LDILTTNSSKQGWCNVDPEDAYAMKVKIKEECDCKYDCLWGRFCEIPVQCTCVNQCSGHG 285

Query: 961  HCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQK 1140
             CRGGFC+C+KGW+G DCS+PS LS++GEWP+WLRP+H++VP      G L NL AVV+K
Sbjct: 286  KCRGGFCQCDKGWFGTDCSIPSTLSTVGEWPQWLRPAHLEVPSEKNVPGNLINLSAVVKK 345

Query: 1141 KRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLASP 1320
            KRPLIY+YDLPPDFNSLL+EGRHFK ECVNR+YD RNAT+WTD LYG+QMA YE++LA+ 
Sbjct: 346  KRPLIYIYDLPPDFNSLLIEGRHFKFECVNRIYDERNATVWTDYLYGSQMAFYENILATA 405

Query: 1321 HRTLN 1335
            HRT+N
Sbjct: 406  HRTMN 410


>ref|XP_006402860.1| hypothetical protein EUTSA_v10005794mg [Eutrema salsugineum]
            gi|557103959|gb|ESQ44313.1| hypothetical protein
            EUTSA_v10005794mg [Eutrema salsugineum]
          Length = 791

 Score =  582 bits (1501), Expect = e-163
 Identities = 257/425 (60%), Positives = 315/425 (74%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGS 240
            MFS QKWKCSW                H+FL P+VP  D +S+ QAQN     +G++  S
Sbjct: 1    MFSHQKWKCSWSQIATVASVIVLVSLVHIFLGPVVPSFDSVSVRQAQN----LSGTSNDS 56

Query: 241  EKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCDS 420
             + V+             +DS   VV  + ++PAD H AV YR A WKAEIG+WLS CD+
Sbjct: 57   IRQVS-------------EDSSKTVVAFDRRFPADLHGAVVYRNASWKAEIGQWLSSCDA 103

Query: 421  KVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDNL 600
              + V+I+E IGG++C ++CS QG+CN + G+C CFHG++GE CS++L+L CNY     +
Sbjct: 104  VAKDVDIIEPIGGRKCLNDCSSQGVCNHEFGICRCFHGYTGEDCSQKLRLECNYEKTPEM 163

Query: 601  PYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAAD 780
            PYG WVVSIC  +CDT+RAMCFCGEGTKYPNRPVPESCGF+IN P  P  P+MTDW   D
Sbjct: 164  PYGRWVVSICSRHCDTTRAMCFCGEGTKYPNRPVPESCGFQINSPVNPDEPKMTDWSKPD 223

Query: 781  RDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGHG 960
             DI TTNSS +GWCNVDP + YA  V IKEECDCKYD L+GRFCE PV+  C+NQCSGHG
Sbjct: 224  LDILTTNSSKQGWCNVDPEDAYALKVQIKEECDCKYDCLWGRFCEVPVQCTCVNQCSGHG 283

Query: 961  HCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQK 1140
             CRGGFC+C+KGW+G DCS+PS LS++GEWP+WLRP+H++VP      G LSN+ AVV+K
Sbjct: 284  KCRGGFCQCDKGWFGTDCSIPSTLSTVGEWPQWLRPAHLEVPSDKNVPGNLSNISAVVKK 343

Query: 1141 KRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLASP 1320
            KRPLIY+YDLPPDFNSLLLEGRHFKLECVNR+YD RNATIWTD LYG+QMA YE++LA+ 
Sbjct: 344  KRPLIYIYDLPPDFNSLLLEGRHFKLECVNRIYDDRNATIWTDYLYGSQMAFYENILATA 403

Query: 1321 HRTLN 1335
            HRTLN
Sbjct: 404  HRTLN 408


>ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297324003|gb|EFH54424.1| exostosin family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 792

 Score =  582 bits (1501), Expect = e-163
 Identities = 257/425 (60%), Positives = 313/425 (73%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGS 240
            MFS QKWK SW                HLFL P+VP  D + + QAQN   PTN      
Sbjct: 1    MFSHQKWKFSWSQIATVASVIVLVSLVHLFLGPVVPSFDSIIVRQAQNLSGPTN------ 54

Query: 241  EKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCDS 420
                      E +T+  +  S   VV  + ++PADSH AV YR A WKAEIG+WLS CD+
Sbjct: 55   ----------ESITQVTKDLSQSLVVAFDRRFPADSHGAVVYRNASWKAEIGQWLSSCDA 104

Query: 421  KVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDNL 600
              + V+++E IGG++C ++CSGQG+CN + GLC CFHGF+G+ CS++L L+CNY     +
Sbjct: 105  VAKEVDVIEPIGGRKCMNDCSGQGVCNYEFGLCRCFHGFTGDDCSQKLHLDCNYEKTPEM 164

Query: 601  PYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAAD 780
            PYG WVVSIC  +CDT+RAMCFCGEGTKYPNRPVPESCGF+IN P+ P  P+MTDW   D
Sbjct: 165  PYGKWVVSICSRHCDTTRAMCFCGEGTKYPNRPVPESCGFQINSPANPDEPKMTDWSKPD 224

Query: 781  RDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGHG 960
             DI TTNSS +GWCNVDP + YA  V IKEECDCKYD L+GRFCE PV+  C+NQCSGHG
Sbjct: 225  LDILTTNSSKQGWCNVDPEDAYALKVQIKEECDCKYDCLWGRFCEIPVQCTCVNQCSGHG 284

Query: 961  HCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQK 1140
             CRGGFC+C+KGW+G DCS PS LS++GEWP+WLRP+H++VP      G L+NL AVV+K
Sbjct: 285  KCRGGFCQCDKGWFGTDCSTPSTLSTVGEWPQWLRPAHLEVPSEKNVPGNLTNLSAVVKK 344

Query: 1141 KRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLASP 1320
            KRPLIY+YDLPPDFNSLL+EGRHFKLECVNR+YD RNAT+WTD LYG+QMA YE++LA+ 
Sbjct: 345  KRPLIYIYDLPPDFNSLLIEGRHFKLECVNRIYDERNATVWTDYLYGSQMAFYENILATA 404

Query: 1321 HRTLN 1335
            HRTLN
Sbjct: 405  HRTLN 409


>ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]
            gi|110740929|dbj|BAE98560.1| hypothetical protein
            [Arabidopsis thaliana] gi|332646160|gb|AEE79681.1|
            exostosin family protein [Arabidopsis thaliana]
          Length = 791

 Score =  573 bits (1478), Expect = e-161
 Identities = 255/425 (60%), Positives = 312/425 (73%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGS 240
            MFS QKWK SW                HLFL P+VP  D +++ QAQN C P+N S    
Sbjct: 1    MFSHQKWKFSWSQIATVASVIVLVSLVHLFLGPVVPSFDSITVRQAQNLCGPSNESISQ- 59

Query: 241  EKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCDS 420
                        +T+N  +     VV  + ++PADSH AV YR A WKAEIG+WLS CD+
Sbjct: 60   ------------VTKNSSQSL--VVVAFDRRFPADSHGAVVYRNASWKAEIGQWLSSCDA 105

Query: 421  KVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDNL 600
              + V+I+E IGG++C  +CSGQG+CN + GLC CFHGF+   CS++L+L+CNY     +
Sbjct: 106  VAKEVDIIEPIGGRKCMSDCSGQGVCNHEFGLCRCFHGFTD--CSQKLRLDCNYEKTPEM 163

Query: 601  PYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAAD 780
            PYG WVVSIC  +CDT+RAMCFCGEGTKYPNRPVPESCGF+IN P+ P  P+MTDW   D
Sbjct: 164  PYGKWVVSICSRHCDTTRAMCFCGEGTKYPNRPVPESCGFQINSPTNPDEPKMTDWSKPD 223

Query: 781  RDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGHG 960
             DI TTNSS +GWCNVDP + YA  V IKEECDCKYD L+GRFCE PV+  C+NQCSGHG
Sbjct: 224  LDILTTNSSKQGWCNVDPEDAYAMKVKIKEECDCKYDCLWGRFCEIPVQCTCVNQCSGHG 283

Query: 961  HCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQK 1140
             CRGGFC+C+KGW+G DCS+PS LS++GEWP+WLRP+H++VP      G L NL AVV+K
Sbjct: 284  KCRGGFCQCDKGWFGTDCSIPSTLSTVGEWPQWLRPAHLEVPSEKNVPGNLINLSAVVKK 343

Query: 1141 KRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLASP 1320
            KRPLIY+YDLPPDFNSLL+EGRHFK ECVNR+YD RNAT+WTD LYG+QMA YE++LA+ 
Sbjct: 344  KRPLIYIYDLPPDFNSLLIEGRHFKFECVNRIYDERNATVWTDYLYGSQMAFYENILATA 403

Query: 1321 HRTLN 1335
            HRT+N
Sbjct: 404  HRTMN 408


>ref|XP_004516917.1| PREDICTED: uncharacterized protein LOC101503851 isoform X1 [Cicer
            arietinum] gi|502181977|ref|XP_004516918.1| PREDICTED:
            uncharacterized protein LOC101503851 isoform X2 [Cicer
            arietinum]
          Length = 796

 Score =  572 bits (1474), Expect = e-160
 Identities = 254/425 (59%), Positives = 309/425 (72%)
 Frame = +1

Query: 61   MFSLQKWKCSWXXXXXXXXXXXXXXXXHLFLYPLVPPLDYLSISQAQNSCLPTNGSTGGS 240
            +FS++ W+CSW                HLFL+PL P  DY  +  A +SC+  N S+   
Sbjct: 8    LFSMKNWRCSWSLAASIASVVAMVSVVHLFLFPLTPSFDYFKL--ASDSCVSNNVSSAD- 64

Query: 241  EKYVNGMEPKEGLTENPEKDSDHPVVDLNAQYPADSHNAVTYRGAPWKAEIGRWLSGCDS 420
                  +    GL E        P +DL  ++PAD H++V Y+GA WKAEIGRWLSGCDS
Sbjct: 65   ------LVSNHGLEE--------PAIDLKYRFPADLHSSVAYKGALWKAEIGRWLSGCDS 110

Query: 421  KVEAVEIVEKIGGKRCKDECSGQGICNRDLGLCHCFHGFSGEGCSERLQLNCNYPAEDNL 600
              + V I E IGG  CK++CSG G+CNR+LG C CFHG+ G+GC +  +L CN+P   + 
Sbjct: 111  ITKDVNISEIIGGNDCKNDCSGLGVCNRELGQCRCFHGYVGDGCVDIQELECNFPGSLHE 170

Query: 601  PYGHWVVSICPAYCDTSRAMCFCGEGTKYPNRPVPESCGFKINEPSEPGGPRMTDWGAAD 780
            P+G WVVSICPA CD +RAMCFCGEGTKYP RP+ ESCGF+ N+PSEPGGP++ +W   D
Sbjct: 171  PFGRWVVSICPANCDKTRAMCFCGEGTKYPYRPLAESCGFQYNQPSEPGGPKIVNWTKVD 230

Query: 781  RDIFTTNSSIRGWCNVDPAEGYAANVSIKEECDCKYDGLFGRFCETPVESVCINQCSGHG 960
            +D+FTTN SI GWCNVDP + Y   V  KEEC C YDG  GRFCE PV+S+CINQC+GHG
Sbjct: 231  QDVFTTNGSIPGWCNVDPVDAYEGKVKFKEECHCPYDGFIGRFCEVPVQSICINQCNGHG 290

Query: 961  HCRGGFCECEKGWYGVDCSVPSVLSSIGEWPEWLRPSHIDVPDSARSAGTLSNLDAVVQK 1140
             CRGGFC+C+ GWYG DCS+PSV+SSI EWP WLRP+ +DVPD+   +  L NL+AVV K
Sbjct: 291  QCRGGFCQCDNGWYGADCSIPSVISSIREWPSWLRPARVDVPDNIHVSEKLINLNAVVAK 350

Query: 1141 KRPLIYVYDLPPDFNSLLLEGRHFKLECVNRMYDHRNATIWTDQLYGAQMATYESMLASP 1320
            KRPLIY+YDLPP+FNSLLLEGRHFKLECVNR+YD  NATIWT+QLYGAQMA YES+LASP
Sbjct: 351  KRPLIYIYDLPPEFNSLLLEGRHFKLECVNRIYDGNNATIWTEQLYGAQMAIYESLLASP 410

Query: 1321 HRTLN 1335
            HRTLN
Sbjct: 411  HRTLN 415


Top