BLASTX nr result

ID: Sinomenium21_contig00030871 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00030871
         (1831 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006482345.1| PREDICTED: probable galacturonosyltransferas...   792   0.0  
ref|XP_006430887.1| hypothetical protein CICLE_v10011396mg [Citr...   791   0.0  
ref|XP_006384242.1| hypothetical protein POPTR_0004s10980g [Popu...   790   0.0  
ref|XP_002530802.1| Glycosyltransferase QUASIMODO1, putative [Ri...   788   0.0  
gb|EXB85815.1| putative galacturonosyltransferase 9 [Morus notab...   787   0.0  
ref|XP_007032943.1| Glycosyltransferase isoform 1 [Theobroma cac...   777   0.0  
ref|XP_006373520.1| hypothetical protein POPTR_0017s144802g, par...   757   0.0  
ref|XP_006841268.1| hypothetical protein AMTR_s00135p00111710 [A...   754   0.0  
ref|XP_002882224.1| hypothetical protein ARALYDRAFT_477468 [Arab...   753   0.0  
ref|XP_002282423.2| PREDICTED: LOW QUALITY PROTEIN: probable gal...   752   0.0  
emb|CBI16902.3| unnamed protein product [Vitis vinifera]              752   0.0  
ref|XP_006338312.1| PREDICTED: probable galacturonosyltransferas...   751   0.0  
ref|XP_006297346.1| hypothetical protein CARUB_v10013365mg [Caps...   751   0.0  
ref|XP_004232120.1| PREDICTED: probable galacturonosyltransferas...   751   0.0  
gb|AHL38785.1| glycosyltransferase, partial [Arabidopsis thaliana]    750   0.0  
ref|NP_566170.1| putative galacturonosyltransferase 9 [Arabidops...   750   0.0  
ref|XP_006408424.1| hypothetical protein EUTSA_v10020416mg [Eutr...   750   0.0  
emb|CAN73730.1| hypothetical protein VITISV_022574 [Vitis vinifera]   749   0.0  
gb|ABD96860.1| hypothetical protein [Cleome spinosa]                  741   0.0  
ref|XP_007151434.1| hypothetical protein PHAVU_004G045900g [Phas...   739   0.0  

>ref|XP_006482345.1| PREDICTED: probable galacturonosyltransferase 9-like [Citrus
            sinensis]
          Length = 559

 Score =  792 bits (2046), Expect = 0.0
 Identities = 397/546 (72%), Positives = 445/546 (81%), Gaps = 23/546 (4%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSN----NHHSVLPSSSGPPVHAYVHRTFLAIKS 1499
            RS  SYRIFVS MF+LLFLATLSV+LTS+    +H   LPSS     +AYV RTFLA+KS
Sbjct: 19   RSLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPGLPSSG----NAYVQRTFLALKS 74

Query: 1498 DPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLKLR----S 1331
            DPLKTRLDLI KQ ND++              LD S+QLR+F+DL+ NFS L ++    S
Sbjct: 75   DPLKTRLDLIQKQANDHITLVNTYAAYARKLKLDISRQLRMFDDLAHNFSDLLMKPGYKS 134

Query: 1330 SAFNPEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLF 1151
            + F  E   DED+L+Q EKE KD+VK ARL+I E+KES+DNQ+KIQKLKDTIFAVNE L 
Sbjct: 135  ALFESEGAVDEDVLRQFEKEVKDKVKVARLMIGESKESYDNQLKIQKLKDTIFAVNELLI 194

Query: 1150 KAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVF 971
            KAKK G F+SLI+AKS PKSLHCLAMRLV ERI+HPDKYK+ E  K EFEDP+LYHYA+F
Sbjct: 195  KAKKNGAFASLISAKSVPKSLHCLAMRLVEERISHPDKYKE-EPPKEEFEDPTLYHYAIF 253

Query: 970  SDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVED 791
            SD               ++EPWKHVFHVVTDRM+ AAMKVWF+MRP  GGAHVE+KAVED
Sbjct: 254  SDNVIAVSVVVRSVVKNAQEPWKHVFHVVTDRMNIAAMKVWFRMRPVEGGAHVEVKAVED 313

Query: 790  FKFLNSSYVPVLRQTES-------------NSTS--NNMKFRNPRYLSMLNHLRFYLPEM 656
            + FLNSSYVPVLRQ ES             N+T   NNMKFRNP+YLSMLNHLRFYLPEM
Sbjct: 314  YSFLNSSYVPVLRQLESAKMQKFYFDHKAENTTKDVNNMKFRNPKYLSMLNHLRFYLPEM 373

Query: 655  YPKLHKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEK 476
            YPKLHKILFLDDDVVVQ+DLTGLW IDLDGKVNGAVETCFGSFHRYAQY+NFSHPLI+EK
Sbjct: 374  YPKLHKILFLDDDVVVQKDLTGLWNIDLDGKVNGAVETCFGSFHRYAQYLNFSHPLIREK 433

Query: 475  FNPKACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKP 296
            FNPKACAWA+GMNIFDLDAWRREKCT+QYH+WQN+NE+RTLWKLGTLPPGLITFYSTTK 
Sbjct: 434  FNPKACAWAFGMNIFDLDAWRREKCTDQYHYWQNLNEDRTLWKLGTLPPGLITFYSTTKS 493

Query: 295  LDKSWHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQ 116
            LDKSWHVLGLGYNPSISMDEIN AAVIHYNGNMKPWLDIAMNQ++ LWTKYVD +MEFVQ
Sbjct: 494  LDKSWHVLGLGYNPSISMDEINKAAVIHYNGNMKPWLDIAMNQYKSLWTKYVDNDMEFVQ 553

Query: 115  MCNFGL 98
            +CNFGL
Sbjct: 554  VCNFGL 559


>ref|XP_006430887.1| hypothetical protein CICLE_v10011396mg [Citrus clementina]
            gi|557532944|gb|ESR44127.1| hypothetical protein
            CICLE_v10011396mg [Citrus clementina]
          Length = 559

 Score =  791 bits (2043), Expect = 0.0
 Identities = 396/546 (72%), Positives = 445/546 (81%), Gaps = 23/546 (4%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSN----NHHSVLPSSSGPPVHAYVHRTFLAIKS 1499
            RS  +YRIFVS MF+LLFLATLSV+LTS+    +H   LPSS     +AYV RTFLA+KS
Sbjct: 19   RSLFAYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPGLPSSG----NAYVQRTFLALKS 74

Query: 1498 DPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLKLR----S 1331
            DPLKTRLDLI KQ ND++              LD S+QLR+F+DL+ NFS L ++    S
Sbjct: 75   DPLKTRLDLIQKQANDHITLVNTYAAYARKLKLDISRQLRMFDDLAHNFSDLLMKPGYKS 134

Query: 1330 SAFNPEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLF 1151
            + F  E   DED+L+Q EKE KD+VK ARL+I E+KES+DNQ+KIQKLKDTIFAVNE L 
Sbjct: 135  ALFESEGAVDEDVLRQFEKEVKDKVKVARLMIGESKESYDNQLKIQKLKDTIFAVNELLI 194

Query: 1150 KAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVF 971
            KAKK G F+SLI+AKS PKSLHCLAMRLV ERI+HPDKYK+ E  K EFEDP+LYHYA+F
Sbjct: 195  KAKKNGAFASLISAKSVPKSLHCLAMRLVEERISHPDKYKE-EPPKEEFEDPTLYHYAIF 253

Query: 970  SDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVED 791
            SD               ++EPWKHVFHVVTDRM+ AAMKVWF+MRP  GGAHVE+KAVED
Sbjct: 254  SDNVIAVSVVVRSVVKNAQEPWKHVFHVVTDRMNIAAMKVWFRMRPVEGGAHVEVKAVED 313

Query: 790  FKFLNSSYVPVLRQTES-------------NSTS--NNMKFRNPRYLSMLNHLRFYLPEM 656
            + FLNSSYVPVLRQ ES             N+T   NNMKFRNP+YLSMLNHLRFYLPEM
Sbjct: 314  YSFLNSSYVPVLRQLESAKMQKFYFDHKAENTTKDVNNMKFRNPKYLSMLNHLRFYLPEM 373

Query: 655  YPKLHKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEK 476
            YPKLHKILFLDDDVVVQ+DLTGLW IDLDGKVNGAVETCFGSFHRYAQY+NFSHPLI+EK
Sbjct: 374  YPKLHKILFLDDDVVVQKDLTGLWNIDLDGKVNGAVETCFGSFHRYAQYLNFSHPLIREK 433

Query: 475  FNPKACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKP 296
            FNPKACAWA+GMNIFDLDAWRREKCT+QYH+WQN+NE+RTLWKLGTLPPGLITFYSTTK 
Sbjct: 434  FNPKACAWAFGMNIFDLDAWRREKCTDQYHYWQNLNEDRTLWKLGTLPPGLITFYSTTKS 493

Query: 295  LDKSWHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQ 116
            LDKSWHVLGLGYNPSISMDEIN AAVIHYNGNMKPWLDIAMNQ++ LWTKYVD +MEFVQ
Sbjct: 494  LDKSWHVLGLGYNPSISMDEINKAAVIHYNGNMKPWLDIAMNQYKSLWTKYVDNDMEFVQ 553

Query: 115  MCNFGL 98
            +CNFGL
Sbjct: 554  VCNFGL 559


>ref|XP_006384242.1| hypothetical protein POPTR_0004s10980g [Populus trichocarpa]
            gi|550340788|gb|ERP62039.1| hypothetical protein
            POPTR_0004s10980g [Populus trichocarpa]
          Length = 562

 Score =  790 bits (2039), Expect = 0.0
 Identities = 395/547 (72%), Positives = 445/547 (81%), Gaps = 24/547 (4%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSNNHH-----SVLPSSSGPPVHAYVHRTFLAIK 1502
            R F SYRIF+S +FTLLFLAT S++ +S++HH       LPSS     +AYV RTFLA+K
Sbjct: 21   RPFFSYRIFISAIFTLLFLATFSILFSSHHHHHHHEDDSLPSSG----NAYVQRTFLAVK 76

Query: 1501 SDPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLKLR---- 1334
            SDPLKTRLDLIYKQ ND++              LD SKQLR+F++L++N + L L+    
Sbjct: 77   SDPLKTRLDLIYKQANDHMTLVNAYAAYARKLKLDISKQLRMFDELAKNLTDLPLKPSYK 136

Query: 1333 SSAFNPEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQL 1154
            SS F P  P DED+L+Q EKE KD VK ARL+I E+KES+DNQIKIQKLKDTIFAVNE L
Sbjct: 137  SSLFEPGSPVDEDVLRQFEKEVKDIVKVARLMIVESKESYDNQIKIQKLKDTIFAVNELL 196

Query: 1153 FKAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAV 974
             KAKK G F+SLI+AKS PKSLHCLAMRLV ER+AHP+KYK+ EG K EFEDPSLYHYA+
Sbjct: 197  IKAKKNGAFASLISAKSVPKSLHCLAMRLVEERVAHPEKYKE-EGYKEEFEDPSLYHYAI 255

Query: 973  FSDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVE 794
            FSD               +EEPWKHVFHVVTDRM+ AAMKVWF+MRP  GGA V IKAVE
Sbjct: 256  FSDNVIAVSVLIRSVVKNAEEPWKHVFHVVTDRMNVAAMKVWFRMRPVEGGAFVGIKAVE 315

Query: 793  DFKFLNSSYVPVLRQTES-------------NST--SNNMKFRNPRYLSMLNHLRFYLPE 659
            +++FLNSSYVPVLRQ E+             N+T  S NMKFRNP+YLSMLNHLRFYLPE
Sbjct: 316  EYRFLNSSYVPVLRQLENANMQKFYFENQAENATKDSTNMKFRNPKYLSMLNHLRFYLPE 375

Query: 658  MYPKLHKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKE 479
            MYPKLHKILFLDDDVVVQ+DLTGLWK+DLDGKVNGAVETCFGSFHRYAQY+NFSHPLIKE
Sbjct: 376  MYPKLHKILFLDDDVVVQKDLTGLWKVDLDGKVNGAVETCFGSFHRYAQYLNFSHPLIKE 435

Query: 478  KFNPKACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTK 299
            +FNPKACAWA+GMNIFDLDAWRREKCTEQYH+WQ++NE RTLWKLGTLPPGLITFYSTTK
Sbjct: 436  RFNPKACAWAFGMNIFDLDAWRREKCTEQYHYWQSLNEERTLWKLGTLPPGLITFYSTTK 495

Query: 298  PLDKSWHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFV 119
             LDKSWHVLGLGYNPSISMDEI+NAAVIHYNGNMKPWLDIAMNQ++ LWTKYVD +MEFV
Sbjct: 496  SLDKSWHVLGLGYNPSISMDEISNAAVIHYNGNMKPWLDIAMNQYKNLWTKYVDNDMEFV 555

Query: 118  QMCNFGL 98
            Q CNFGL
Sbjct: 556  QTCNFGL 562


>ref|XP_002530802.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
            gi|223529623|gb|EEF31570.1| Glycosyltransferase
            QUASIMODO1, putative [Ricinus communis]
          Length = 563

 Score =  788 bits (2034), Expect = 0.0
 Identities = 396/545 (72%), Positives = 444/545 (81%), Gaps = 23/545 (4%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSNN----HHSVLPSSSGPPVHAYVHRTFLAIKS 1499
            RSF SYRI VS MFTLLFLATLSV+LT++     H S LPSS      AYV RTFLA+ S
Sbjct: 23   RSFFSYRILVSAMFTLLFLATLSVLLTTHPPTSPHESSLPSSGD----AYVQRTFLALNS 78

Query: 1498 DPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHL----KLRS 1331
            DPLKTRLDLIYKQ +D++              LD S+QLR+F+DL++NF+ +      + 
Sbjct: 79   DPLKTRLDLIYKQASDHMTLVNAYAAYARKLKLDISRQLRMFDDLAKNFTDITSKPNYKI 138

Query: 1330 SAFNPEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLF 1151
            S F  E   DED+L+Q EKE K+RVK ARL+I+ETKES+DNQIKIQKLKDTIFAVNE L 
Sbjct: 139  SLFESEGAIDEDILRQFEKEIKERVKVARLMIAETKESYDNQIKIQKLKDTIFAVNELLV 198

Query: 1150 KAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVF 971
            KA+K G F+SLI+AKS PKSLHCLAMRLV ERI+HP+KY+D++  K EFEDPSLYHYA+F
Sbjct: 199  KARKNGAFASLISAKSIPKSLHCLAMRLVEERISHPEKYRDED-PKLEFEDPSLYHYAIF 257

Query: 970  SDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVED 791
            SD               +EEPWKHVFHVVTDRM+ AAMKVWF+MRP  GGAHVE+KAVED
Sbjct: 258  SDNVIAVSVVVRSVVKNAEEPWKHVFHVVTDRMNVAAMKVWFRMRPVEGGAHVEVKAVED 317

Query: 790  FKFLNSSYVPVLRQTES-------------NSTSN--NMKFRNPRYLSMLNHLRFYLPEM 656
            F FLNSSYVPVLRQ E+             N+T +  NMKFRNP+YLSMLNHLRFYLPEM
Sbjct: 318  FSFLNSSYVPVLRQLENLKLQKFYFENQAENATKDVSNMKFRNPKYLSMLNHLRFYLPEM 377

Query: 655  YPKLHKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEK 476
            YPKLHKILFLDDDVVVQ+DLTGLWKIDLDGKVNGA ETCFGSFHRYAQY+NFSHPLIKEK
Sbjct: 378  YPKLHKILFLDDDVVVQKDLTGLWKIDLDGKVNGAAETCFGSFHRYAQYLNFSHPLIKEK 437

Query: 475  FNPKACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKP 296
            FNPKACAWAYGMN+FDLDAWRREK TEQYH+WQN+NE+RTLWKLGTLPPGLITFYSTTK 
Sbjct: 438  FNPKACAWAYGMNVFDLDAWRREKSTEQYHYWQNLNEDRTLWKLGTLPPGLITFYSTTKS 497

Query: 295  LDKSWHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQ 116
            LDKSWHVLGLGYNPSISMDEI+NAAVIHYNGNMKPWLDIAMNQ++ LWTKYVD +MEFVQ
Sbjct: 498  LDKSWHVLGLGYNPSISMDEISNAAVIHYNGNMKPWLDIAMNQYKNLWTKYVDSDMEFVQ 557

Query: 115  MCNFG 101
            MCNFG
Sbjct: 558  MCNFG 562


>gb|EXB85815.1| putative galacturonosyltransferase 9 [Morus notabilis]
          Length = 612

 Score =  787 bits (2032), Expect = 0.0
 Identities = 393/545 (72%), Positives = 446/545 (81%), Gaps = 23/545 (4%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSN----NHHSVLPSSSGPPVHAYVHRTFLAIKS 1499
            RSF SYRIFVS MF+LLF+ATLSV+L++N    +H S LP++     +AYVHRTFLA+KS
Sbjct: 23   RSFFSYRIFVSAMFSLLFIATLSVLLSTNPYTPHHDSALPTTG----NAYVHRTFLALKS 78

Query: 1498 DPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLKL----RS 1331
            DPLKTRLDLIYKQ ND++              L+ SKQLR+F DL+   S L++    R+
Sbjct: 79   DPLKTRLDLIYKQANDHMTLVNSYAAYARKLKLEISKQLRMFNDLASEISDLQMKPGYRT 138

Query: 1330 SAFNPEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLF 1151
            S F  + P DED+L+Q EKE KD+VKTAR +++E+KE++DNQ+KIQKLKDTIFAVNE L 
Sbjct: 139  SLFESDGPVDEDVLRQFEKEVKDKVKTARAMVAESKENYDNQLKIQKLKDTIFAVNELLT 198

Query: 1150 KAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVF 971
            KAKK G F+SLIAAKS PKSLHCLAMRLVGE I++P+KY+D EG KPEFEDPSLYHYA+F
Sbjct: 199  KAKKNGAFASLIAAKSAPKSLHCLAMRLVGEMISNPEKYRD-EGPKPEFEDPSLYHYAIF 257

Query: 970  SDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVED 791
            SD               + EPWKHVFHVVTDRM+ AAMKVWFKMRP  GGAHVE+KAVED
Sbjct: 258  SDNVIAVSVVVRSVVKNANEPWKHVFHVVTDRMNLAAMKVWFKMRPADGGAHVEVKAVED 317

Query: 790  FKFLNSSYVPVLRQTES-------------NST--SNNMKFRNPRYLSMLNHLRFYLPEM 656
            F FLNSSYVPVLRQ ES             N+T  ++NMKFRNP+YLSMLNHLRFYLPE+
Sbjct: 318  FSFLNSSYVPVLRQLESANLQKFYFESREENATKDASNMKFRNPKYLSMLNHLRFYLPEI 377

Query: 655  YPKLHKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEK 476
            YPKLHKILFLDDDVVVQ+DLTGLWKIDLDGKVNGAVETCFGSFHRYAQY+NFSHPLIKEK
Sbjct: 378  YPKLHKILFLDDDVVVQKDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYLNFSHPLIKEK 437

Query: 475  FNPKACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKP 296
            FNPKACAWAYGMNIFDLDAWRREK TEQYH+WQN+NE+RTLWKLGTLPPGLITFYSTTK 
Sbjct: 438  FNPKACAWAYGMNIFDLDAWRREKSTEQYHYWQNLNEDRTLWKLGTLPPGLITFYSTTKS 497

Query: 295  LDKSWHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQ 116
            LDKSWHVLGLGYNPSISMD IN A+VIHYNGNMKPWLDIAMNQ++ LWTKYVD +MEFVQ
Sbjct: 498  LDKSWHVLGLGYNPSISMDAINRASVIHYNGNMKPWLDIAMNQYKNLWTKYVDNDMEFVQ 557

Query: 115  MCNFG 101
            +   G
Sbjct: 558  IAKLG 562


>ref|XP_007032943.1| Glycosyltransferase isoform 1 [Theobroma cacao]
            gi|508711972|gb|EOY03869.1| Glycosyltransferase isoform 1
            [Theobroma cacao]
          Length = 554

 Score =  777 bits (2007), Expect = 0.0
 Identities = 390/540 (72%), Positives = 443/540 (82%), Gaps = 16/540 (2%)
 Frame = -1

Query: 1669 FRSFLSYRIFVSGMFTLLFLATLSVVLTSN----NHHSVLPSSSGPPVHAYVHRTFLAIK 1502
            FRS  SYRIFVS MF+LLF+ATLSV+LTS+    +HHS LPS      +AY+HRTFLA+ 
Sbjct: 20   FRSLFSYRIFVSAMFSLLFVATLSVLLTSHPSTTHHHSRLPSGG----NAYMHRTFLALN 75

Query: 1501 SDPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHL----KLR 1334
            SDPLKTRLDLI+KQ ND++              L+ S+QL++F+DL++NFS L      +
Sbjct: 76   SDPLKTRLDLIHKQANDHITLVKAYSAYARKLKLEISRQLKMFDDLAKNFSDLTSKPSYK 135

Query: 1333 SSAFNPEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQL 1154
            SS F      DED+L+Q EKE KDRVK ARLLI+E+KE++DNQ+KIQKLKDTIFAVNE L
Sbjct: 136  SSLFETSGNLDEDVLRQFEKEVKDRVKFARLLIAESKENYDNQLKIQKLKDTIFAVNELL 195

Query: 1153 FKAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAV 974
             KAKK G F+SLIAAKS PKSLHCLAMRLV ERI+HP+KYK+D   K EFEDPSLYHYA+
Sbjct: 196  GKAKKNGAFASLIAAKSIPKSLHCLAMRLVEERISHPEKYKEDL-PKAEFEDPSLYHYAI 254

Query: 973  FSDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVE 794
            FSD               +EEP KHVFHVVTDRM+ AAMKVWF+MRP  GGAHVE+KAVE
Sbjct: 255  FSDNVIAVSVVVRSVVKNAEEPSKHVFHVVTDRMNVAAMKVWFRMRPVEGGAHVEVKAVE 314

Query: 793  DFKFLNSSYVPVLRQTES------NSTS--NNMKFRNPRYLSMLNHLRFYLPEMYPKLHK 638
            D+ FL+SSYVPV+RQ ES      N+T   +NMKFRNP Y+ MLNHLRFYLPEMYPKLHK
Sbjct: 315  DYDFLSSSYVPVVRQIESANVQMENATKEGSNMKFRNPNYMPMLNHLRFYLPEMYPKLHK 374

Query: 637  ILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPKAC 458
            IL LDDDVVVQ+DLTGLWKIDL GKVNGAVETCFGSFHR++QY+NFSHPLIKE+FNPKAC
Sbjct: 375  ILLLDDDVVVQKDLTGLWKIDLAGKVNGAVETCFGSFHRFSQYLNFSHPLIKERFNPKAC 434

Query: 457  AWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKSWH 278
            AWAYGMNIFDLDAWRREKCTE YH+WQN+NE+RTLWKLGTLPPGLITFYS TK LDKSWH
Sbjct: 435  AWAYGMNIFDLDAWRREKCTETYHNWQNLNEDRTLWKLGTLPPGLITFYSLTKSLDKSWH 494

Query: 277  VLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNFGL 98
            VLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQ++ LWTKYVD +MEFVQMCNFG+
Sbjct: 495  VLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQYKNLWTKYVDNDMEFVQMCNFGV 554


>ref|XP_006373520.1| hypothetical protein POPTR_0017s144802g, partial [Populus
            trichocarpa] gi|550320342|gb|ERP51317.1| hypothetical
            protein POPTR_0017s144802g, partial [Populus trichocarpa]
          Length = 504

 Score =  757 bits (1955), Expect = 0.0
 Identities = 375/508 (73%), Positives = 419/508 (82%), Gaps = 19/508 (3%)
 Frame = -1

Query: 1564 LPSSSGPPVHAYVHRTFLAIKSDPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQ 1385
            LPSS     +AYV RTFLAIKSDPLKTRLDLIYKQ ND++              LD S+Q
Sbjct: 2    LPSSG----NAYVQRTFLAIKSDPLKTRLDLIYKQANDHMTLVNAYAAYARKLKLDISRQ 57

Query: 1384 LRIFEDLSRNFSHLKLR----SSAFNPEEPFDEDLLKQLEKEAKDRVKTARLLISETKES 1217
            LR+F++L +N + L L+    SS F P    DED+L+Q EKE K++VK ARL+I+E KES
Sbjct: 58   LRMFDELDKNLTDLPLKPSYKSSLFEPGSDVDEDVLRQFEKEVKEKVKVARLMIAEAKES 117

Query: 1216 FDNQIKIQKLKDTIFAVNEQLFKAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDK 1037
            +DNQIKIQKLKDTIFAVNE L KAKK G F+SLI+AKS PKSLHCLAMRLVGERIAHP+K
Sbjct: 118  YDNQIKIQKLKDTIFAVNELLIKAKKNGAFASLISAKSVPKSLHCLAMRLVGERIAHPEK 177

Query: 1036 YKDDEGEKPEFEDPSLYHYAVFSDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAM 857
            YK+ EG K EFEDPSLYHYA+FSD               +EEPWKHVFHVVTD+M+ AAM
Sbjct: 178  YKE-EGYKAEFEDPSLYHYAIFSDNVIAVSVVIRSVVKNAEEPWKHVFHVVTDKMNVAAM 236

Query: 856  KVWFKMRPPLGGAHVEIKAVEDFKFLNSSYVPVLRQTES-------------NSTSN--N 722
            KVWF+MRP  GGAHVEI AVEDF FLNSSYVPVL+Q ES             N+T +  N
Sbjct: 237  KVWFRMRPVEGGAHVEINAVEDFSFLNSSYVPVLKQLESAKMQKFYFDNQAENATKDGSN 296

Query: 721  MKFRNPRYLSMLNHLRFYLPEMYPKLHKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVET 542
            MKFRNP+Y+SMLNHLRFYLPEMYPKLHKILFLDDDVVVQ+DLTGLWK+DLDGKVNGAVET
Sbjct: 297  MKFRNPKYMSMLNHLRFYLPEMYPKLHKILFLDDDVVVQKDLTGLWKVDLDGKVNGAVET 356

Query: 541  CFGSFHRYAQYMNFSHPLIKEKFNPKACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNEN 362
            CFGSFHRYAQY+NFSHPLIKE+FNPKACAWA+GMNIFDLDAWRREKCTE YH+WQ++NE+
Sbjct: 357  CFGSFHRYAQYLNFSHPLIKERFNPKACAWAFGMNIFDLDAWRREKCTEHYHYWQSLNED 416

Query: 361  RTLWKLGTLPPGLITFYSTTKPLDKSWHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLD 182
            RTLWKLGTLPPGLITFYSTTK LDKSWHVLGLGYNPSISMDEI+NAAVIHYNGNMKPWLD
Sbjct: 417  RTLWKLGTLPPGLITFYSTTKSLDKSWHVLGLGYNPSISMDEISNAAVIHYNGNMKPWLD 476

Query: 181  IAMNQFRPLWTKYVDYEMEFVQMCNFGL 98
            IAMNQ++ LWTKYVD +MEFVQMCNFGL
Sbjct: 477  IAMNQYKNLWTKYVDNDMEFVQMCNFGL 504


>ref|XP_006841268.1| hypothetical protein AMTR_s00135p00111710 [Amborella trichopoda]
            gi|548843184|gb|ERN02943.1| hypothetical protein
            AMTR_s00135p00111710 [Amborella trichopoda]
          Length = 561

 Score =  754 bits (1948), Expect = 0.0
 Identities = 382/544 (70%), Positives = 428/544 (78%), Gaps = 25/544 (4%)
 Frame = -1

Query: 1654 SYRIFVSGMFTLLFLATLSVVLTSN---NHHSVLPS----SSGPPVHAYVHRTFLAIKSD 1496
            SYR+ VS  F LLFLA  S+ L +N   N  + L +    SS  P+ A V RTFLA+KSD
Sbjct: 20   SYRLCVSATFALLFLAGASLFLATNPNPNTPTTLQTHGHYSSHQPLQA-VSRTFLALKSD 78

Query: 1495 PLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLKLRSS---A 1325
            P K RLDLI++Q ND+V              L+NSKQLR F DL RN++ L  + S    
Sbjct: 79   PFKARLDLIHRQANDHVALVTAYEAFARKLKLENSKQLRFFADLRRNYTDLMAKPSYRSL 138

Query: 1324 FNPEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLFKA 1145
             + + P +ED+L+Q EKE K+R+K  R +ISE KESFDNQ+KIQKLKDTIFAVNEQL KA
Sbjct: 139  LDSDAPIEEDILRQFEKEVKERIKLTRQVISEAKESFDNQLKIQKLKDTIFAVNEQLTKA 198

Query: 1144 KKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVFSD 965
            KKQG FSSLIAAKS PKSLHCLAMRL+ ERIAHP+KY+D   E PE EDPSLYHYA+FSD
Sbjct: 199  KKQGAFSSLIAAKSIPKSLHCLAMRLMEERIAHPEKYEDARPEPPELEDPSLYHYAIFSD 258

Query: 964  XXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVEDFK 785
                           + +P KHVFHVVTD+M+ AAMKVWFK RP L GAHVE+KAVED+K
Sbjct: 259  NVIAASVVVNSAVKNANDPSKHVFHVVTDKMNLAAMKVWFKTRP-LHGAHVEVKAVEDYK 317

Query: 784  FLNSSYVPVLRQTES-------------NSTSN--NMKFRNPRYLSMLNHLRFYLPEMYP 650
            FLNSSYVPVLRQ ES             N+T +  NMKFRNP+YLSMLNHLRFYLPEMYP
Sbjct: 318  FLNSSYVPVLRQLESANLQKFYFENKLENATKDTTNMKFRNPKYLSMLNHLRFYLPEMYP 377

Query: 649  KLHKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFN 470
            KLH+ILFLDDDVVVQ+DLTGLW ID+DGKVNGAVETCFGSFHRY +YMNFSHPLIKE+FN
Sbjct: 378  KLHRILFLDDDVVVQKDLTGLWNIDMDGKVNGAVETCFGSFHRYDKYMNFSHPLIKERFN 437

Query: 469  PKACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLD 290
            PKAC WAYGMN FDLDAWR EKCTEQYH+WQN+NENRTLWKLGTLPPGLITFYSTTKPLD
Sbjct: 438  PKACGWAYGMNFFDLDAWRSEKCTEQYHYWQNLNENRTLWKLGTLPPGLITFYSTTKPLD 497

Query: 289  KSWHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMC 110
            KSWHVLGLGYNPSISMDEI NAAV+H+NGNMKPWLDIAMNQFR LWTKYVDY+MEFVQMC
Sbjct: 498  KSWHVLGLGYNPSISMDEIQNAAVVHFNGNMKPWLDIAMNQFRHLWTKYVDYDMEFVQMC 557

Query: 109  NFGL 98
            NFGL
Sbjct: 558  NFGL 561


>ref|XP_002882224.1| hypothetical protein ARALYDRAFT_477468 [Arabidopsis lyrata subsp.
            lyrata] gi|297328064|gb|EFH58483.1| hypothetical protein
            ARALYDRAFT_477468 [Arabidopsis lyrata subsp. lyrata]
          Length = 561

 Score =  753 bits (1944), Expect = 0.0
 Identities = 375/542 (69%), Positives = 431/542 (79%), Gaps = 19/542 (3%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSNNHHSVLPSSSGPPVHAYVHRTFLAIKSDPLK 1487
            RSF SYRIF+S +F+ LFLAT SVVL S+ H      +     +AY+ RTFLA++SDPLK
Sbjct: 21   RSFFSYRIFISALFSFLFLATFSVVLNSSRHQPHQDHTLPSMGNAYMQRTFLALQSDPLK 80

Query: 1486 TRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLK----LRSSAFN 1319
            TRLDLI+KQ  D++              LD SKQL++FEDL+ NFS L+    L+S+  +
Sbjct: 81   TRLDLIHKQATDHLTLVNAYAAYARKLKLDASKQLKLFEDLAINFSDLQSKPGLKSAVSD 140

Query: 1318 PEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLFKAKK 1139
                 +ED  +QLEKE KD+VKTAR++I E+KES+D Q+KIQKLKDTIFAV EQL KAKK
Sbjct: 141  NGNALEEDSFRQLEKEVKDKVKTARMMIVESKESYDTQLKIQKLKDTIFAVQEQLTKAKK 200

Query: 1138 QGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVFSDXX 959
             G  +SLI+AKS PKSLHCLAMRLVGERI++PDKYKD   + P  EDP+LYHYA+FSD  
Sbjct: 201  NGAVASLISAKSVPKSLHCLAMRLVGERISNPDKYKDAPPD-PAAEDPTLYHYAIFSDNV 259

Query: 958  XXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVEDFKFL 779
                         +EEPWKHVFHVVTDRM+ AAMKVWFKMRP   GAHVEIK+VEDFKFL
Sbjct: 260  IAVSVVVRSVVMNAEEPWKHVFHVVTDRMNLAAMKVWFKMRPLDRGAHVEIKSVEDFKFL 319

Query: 778  NSSYVPVLRQTES-------------NST--SNNMKFRNPRYLSMLNHLRFYLPEMYPKL 644
            NSSY PVLRQ ES             N+T  S+N+KF+NP+YLSMLNHLRFYLPEMYPKL
Sbjct: 320  NSSYAPVLRQLESAKLQKFYFENQAENATKDSHNLKFKNPKYLSMLNHLRFYLPEMYPKL 379

Query: 643  HKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPK 464
            +KILFLDDDVVVQ+D+TGLWKI+LDGKVNGAVETCFGSFHRY QY+NFSHPLIKE FNP 
Sbjct: 380  NKILFLDDDVVVQKDVTGLWKINLDGKVNGAVETCFGSFHRYGQYLNFSHPLIKESFNPN 439

Query: 463  ACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKS 284
            ACAWA+GMNIFDL+AWRREKCT+QYH+WQN+NE+RTLWKLGTLPPGLITFYS TK LDKS
Sbjct: 440  ACAWAFGMNIFDLNAWRREKCTDQYHYWQNLNEDRTLWKLGTLPPGLITFYSKTKSLDKS 499

Query: 283  WHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNF 104
            WHVLGLGYNP +SMDEI NA VIHYNGNMKPWLDIAMNQ++ LWTKYVD EMEFVQMCNF
Sbjct: 500  WHVLGLGYNPGVSMDEIRNAGVIHYNGNMKPWLDIAMNQYKSLWTKYVDNEMEFVQMCNF 559

Query: 103  GL 98
            GL
Sbjct: 560  GL 561


>ref|XP_002282423.2| PREDICTED: LOW QUALITY PROTEIN: probable galacturonosyltransferase 9,
            partial [Vitis vinifera]
          Length = 595

 Score =  752 bits (1941), Expect = 0.0
 Identities = 374/532 (70%), Positives = 436/532 (81%), Gaps = 9/532 (1%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSN----NHHSVLPSSSGPPVHAYVHRTFLAIKS 1499
            R+ +SYRIF S +FT+LF+AT+SV+L +N    +H SV+PSS     +AY+ RTFLA+KS
Sbjct: 70   RNIVSYRIFASALFTILFIATISVLLNTNPAPPSHDSVIPSSG----NAYMQRTFLALKS 125

Query: 1498 DPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLKLRSSAFN 1319
            DPL+TRLDLI+KQ ND++              LD S+QLR+F+DL+RNFS L  R     
Sbjct: 126  DPLRTRLDLIHKQANDHITLVNAYAAYARKLKLDISRQLRMFDDLARNFSDLAARPGPRT 185

Query: 1318 -PEEPF----DEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQL 1154
             PE+      DEDL++QLEKE KDRVK ARL+I+E+KES+DNQIKIQKLKDTIF+VNE L
Sbjct: 186  APEDEVEGTGDEDLVRQLEKEVKDRVKIARLMIAESKESYDNQIKIQKLKDTIFSVNELL 245

Query: 1153 FKAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAV 974
             KAKK G  +SLIAAKS PKSLHCLAMRLV ERIAHPDKY ++E +  EFEDPSLYHYA+
Sbjct: 246  VKAKKNGQVASLIAAKSIPKSLHCLAMRLVEERIAHPDKYTEEE-DSAEFEDPSLYHYAI 304

Query: 973  FSDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVE 794
            FS+               ++EPWKHVFHVV+DRM+ AAMKVWFKMRP  GGA VE+KAVE
Sbjct: 305  FSNNVIAVSVVVNSAVKNAQEPWKHVFHVVSDRMNVAAMKVWFKMRPVGGGARVEVKAVE 364

Query: 793  DFKFLNSSYVPVLRQTESNSTSNNMKFRNPRYLSMLNHLRFYLPEMYPKLHKILFLDDDV 614
            D+ FLNSSYVPVLRQ ES +  +N K RNP Y S+LNHLRFYLPEMYPKLH+ILFLDDDV
Sbjct: 365  DYAFLNSSYVPVLRQMESANYGDNAKLRNPNY-SLLNHLRFYLPEMYPKLHRILFLDDDV 423

Query: 613  VVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPKACAWAYGMNI 434
            VVQ+DL+ LW+IDLDGKVNGAVETCFGSFHRYA Y+NFS+ +I+EKFNPKACAWAYGMNI
Sbjct: 424  VVQKDLSALWRIDLDGKVNGAVETCFGSFHRYAHYLNFSNSVIREKFNPKACAWAYGMNI 483

Query: 433  FDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKSWHVLGLGYNP 254
            FDLDAWRREKCT+QYH+WQN+NE+ TLWK G LPPGLITFYSTTK LDKSWHVLGLGYNP
Sbjct: 484  FDLDAWRREKCTDQYHYWQNLNEDGTLWKSGMLPPGLITFYSTTKSLDKSWHVLGLGYNP 543

Query: 253  SISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNFGL 98
            SISMDEIN+AAVIH+NGNMKPWLDIA+NQF+ LWTKYVD +MEFVQ+CNFGL
Sbjct: 544  SISMDEINHAAVIHFNGNMKPWLDIAINQFKNLWTKYVDNDMEFVQVCNFGL 595


>emb|CBI16902.3| unnamed protein product [Vitis vinifera]
          Length = 543

 Score =  752 bits (1941), Expect = 0.0
 Identities = 374/532 (70%), Positives = 436/532 (81%), Gaps = 9/532 (1%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSN----NHHSVLPSSSGPPVHAYVHRTFLAIKS 1499
            R+ +SYRIF S +FT+LF+AT+SV+L +N    +H SV+PSS     +AY+ RTFLA+KS
Sbjct: 18   RNIVSYRIFASALFTILFIATISVLLNTNPAPPSHDSVIPSSG----NAYMQRTFLALKS 73

Query: 1498 DPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLKLRSSAFN 1319
            DPL+TRLDLI+KQ ND++              LD S+QLR+F+DL+RNFS L  R     
Sbjct: 74   DPLRTRLDLIHKQANDHITLVNAYAAYARKLKLDISRQLRMFDDLARNFSDLAARPGPRT 133

Query: 1318 -PEEPF----DEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQL 1154
             PE+      DEDL++QLEKE KDRVK ARL+I+E+KES+DNQIKIQKLKDTIF+VNE L
Sbjct: 134  APEDEVEGTGDEDLVRQLEKEVKDRVKIARLMIAESKESYDNQIKIQKLKDTIFSVNELL 193

Query: 1153 FKAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAV 974
             KAKK G  +SLIAAKS PKSLHCLAMRLV ERIAHPDKY ++E +  EFEDPSLYHYA+
Sbjct: 194  VKAKKNGQVASLIAAKSIPKSLHCLAMRLVEERIAHPDKYTEEE-DSAEFEDPSLYHYAI 252

Query: 973  FSDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVE 794
            FS+               ++EPWKHVFHVV+DRM+ AAMKVWFKMRP  GGA VE+KAVE
Sbjct: 253  FSNNVIAVSVVVNSAVKNAQEPWKHVFHVVSDRMNVAAMKVWFKMRPVGGGARVEVKAVE 312

Query: 793  DFKFLNSSYVPVLRQTESNSTSNNMKFRNPRYLSMLNHLRFYLPEMYPKLHKILFLDDDV 614
            D+ FLNSSYVPVLRQ ES +  +N K RNP Y S+LNHLRFYLPEMYPKLH+ILFLDDDV
Sbjct: 313  DYAFLNSSYVPVLRQMESANYGDNAKLRNPNY-SLLNHLRFYLPEMYPKLHRILFLDDDV 371

Query: 613  VVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPKACAWAYGMNI 434
            VVQ+DL+ LW+IDLDGKVNGAVETCFGSFHRYA Y+NFS+ +I+EKFNPKACAWAYGMNI
Sbjct: 372  VVQKDLSALWRIDLDGKVNGAVETCFGSFHRYAHYLNFSNSVIREKFNPKACAWAYGMNI 431

Query: 433  FDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKSWHVLGLGYNP 254
            FDLDAWRREKCT+QYH+WQN+NE+ TLWK G LPPGLITFYSTTK LDKSWHVLGLGYNP
Sbjct: 432  FDLDAWRREKCTDQYHYWQNLNEDGTLWKSGMLPPGLITFYSTTKSLDKSWHVLGLGYNP 491

Query: 253  SISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNFGL 98
            SISMDEIN+AAVIH+NGNMKPWLDIA+NQF+ LWTKYVD +MEFVQ+CNFGL
Sbjct: 492  SISMDEINHAAVIHFNGNMKPWLDIAINQFKNLWTKYVDNDMEFVQVCNFGL 543


>ref|XP_006338312.1| PREDICTED: probable galacturonosyltransferase 9-like [Solanum
            tuberosum]
          Length = 547

 Score =  751 bits (1939), Expect = 0.0
 Identities = 372/542 (68%), Positives = 441/542 (81%), Gaps = 19/542 (3%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSNNHHSVLPSSSGPPVHAYVHRTFLAIKSDPLK 1487
            R+F SYRIFVS MFTLLFLATLSV+ +S  HH    S+ G   +AY+HR+ +++ SDPLK
Sbjct: 16   RNFFSYRIFVSAMFTLLFLATLSVLFSS--HHD---STIG---NAYLHRSLVSLSSDPLK 67

Query: 1486 TRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLK----LRSSAFN 1319
            TRLDLI+KQ ND+V              L+ +KQL++FEDL++NFS L+     RS+ F+
Sbjct: 68   TRLDLIHKQANDHVALVNAYAAYARKLKLEIAKQLKMFEDLAQNFSDLQSKQNYRSNLFD 127

Query: 1318 PEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLFKAKK 1139
             + P D+D LKQ EKE KD+VK ARLLI+++KES+DNQ+KIQKLKDTIFAVNE   KAKK
Sbjct: 128  TDGPLDDDSLKQFEKEVKDKVKFARLLIADSKESYDNQLKIQKLKDTIFAVNELFVKAKK 187

Query: 1138 QGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVFSDXX 959
             G F+S IAAKSTPKSLHCLAMRL+ ERI+HP+KY+D++  KPE+EDP+LYHYA+FSD  
Sbjct: 188  NGAFASSIAAKSTPKSLHCLAMRLMEERISHPEKYRDED-PKPEYEDPTLYHYAIFSDNV 246

Query: 958  XXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVEDFKFL 779
                         +EEPWKHVFHVVTDRM+ AAMKVWFKMRP +  AH+EIK+VEDF FL
Sbjct: 247  IAVSVVVNSVIKNAEEPWKHVFHVVTDRMNLAAMKVWFKMRP-IQQAHIEIKSVEDFTFL 305

Query: 778  NSSYVPVLRQTES-------------NSTS--NNMKFRNPRYLSMLNHLRFYLPEMYPKL 644
             SSYVPVL+Q ES             N+T   NNMKF+NP+YLSMLNHLRFYLPEMYP L
Sbjct: 306  TSSYVPVLKQLESAKLQNFYFQNSAENATKDVNNMKFKNPKYLSMLNHLRFYLPEMYPTL 365

Query: 643  HKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPK 464
            H+ILFLDDDVVVQ+DLT LW IDLDGKVNGAVETCFGSFHRY+QY+NFSHPL++EKFNPK
Sbjct: 366  HRILFLDDDVVVQKDLTALWTIDLDGKVNGAVETCFGSFHRYSQYLNFSHPLVREKFNPK 425

Query: 463  ACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKS 284
            ACAWA+GMNIFDLDAWRREKCTEQYH+WQN+NE+RTLWKLGTLP GL+TF+S TK LDKS
Sbjct: 426  ACAWAFGMNIFDLDAWRREKCTEQYHYWQNLNEDRTLWKLGTLPAGLMTFFSKTKSLDKS 485

Query: 283  WHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNF 104
            WHVLGLG+NPS+SMDEI+ AAVIHYNG+MKPWLDIA+NQ++ LWTKY+D EMEFVQMCNF
Sbjct: 486  WHVLGLGFNPSVSMDEIHKAAVIHYNGDMKPWLDIALNQYKELWTKYIDSEMEFVQMCNF 545

Query: 103  GL 98
            G+
Sbjct: 546  GV 547


>ref|XP_006297346.1| hypothetical protein CARUB_v10013365mg [Capsella rubella]
            gi|482566055|gb|EOA30244.1| hypothetical protein
            CARUB_v10013365mg [Capsella rubella]
          Length = 561

 Score =  751 bits (1939), Expect = 0.0
 Identities = 373/542 (68%), Positives = 432/542 (79%), Gaps = 19/542 (3%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSNNHHSVLPSSSGPPVHAYVHRTFLAIKSDPLK 1487
            RSF SYRIF+S +F+ LFLAT SVVL S+ H      +     +AY+ RTFLA++SDPLK
Sbjct: 21   RSFFSYRIFISALFSFLFLATFSVVLNSSRHQPHQDHTLPSMGNAYMQRTFLALQSDPLK 80

Query: 1486 TRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLK----LRSSAFN 1319
            TRLDLI+KQ  D++              LD SKQL++FEDL+ NFS L+    L+S+  +
Sbjct: 81   TRLDLIHKQATDHLTLVNAYAAYARKLKLDASKQLKLFEDLAINFSDLQSKPGLKSAVSD 140

Query: 1318 PEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLFKAKK 1139
                 +ED  +QLEKE KD+VKTAR++I E+KES+D Q+KIQKLKDTIFAV EQL KAKK
Sbjct: 141  TGNALEEDTFRQLEKEVKDKVKTARMMIVESKESYDTQLKIQKLKDTIFAVQEQLTKAKK 200

Query: 1138 QGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVFSDXX 959
             G  +SLI+AKS PKSLHCLAMRLVGERI++P+KYKD   + P  EDP+LYHYAVFSD  
Sbjct: 201  NGAVASLISAKSVPKSLHCLAMRLVGERISNPEKYKDASPD-PAAEDPTLYHYAVFSDNV 259

Query: 958  XXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVEDFKFL 779
                         +EEPWKHVFHVVTDRM+ AAMKVWFKMRP   GAH+EIK+VEDFKFL
Sbjct: 260  IAVSVVVRSVVMNAEEPWKHVFHVVTDRMNLAAMKVWFKMRPLDRGAHIEIKSVEDFKFL 319

Query: 778  NSSYVPVLRQTES-------------NST--SNNMKFRNPRYLSMLNHLRFYLPEMYPKL 644
            NSSY PVLRQ ES             N+T  S+N+KF+NP+YLSMLNHLRFYLPEMYPKL
Sbjct: 320  NSSYAPVLRQLESAKLQKFYFENQAENATKDSHNLKFKNPKYLSMLNHLRFYLPEMYPKL 379

Query: 643  HKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPK 464
            +KILFLDDDVVVQ+D+TGLWKI+LDGKVNGAVETCFGSFHRY QY+NF+HPLIKE F+P 
Sbjct: 380  NKILFLDDDVVVQKDVTGLWKINLDGKVNGAVETCFGSFHRYGQYLNFTHPLIKESFSPS 439

Query: 463  ACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKS 284
            ACAWA+GMNIFDL+AWRREKCT+QYH+WQN+NE+RTLWKLGTLPPGLITFYS TK LDKS
Sbjct: 440  ACAWAFGMNIFDLNAWRREKCTDQYHYWQNLNEDRTLWKLGTLPPGLITFYSKTKSLDKS 499

Query: 283  WHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNF 104
            WHVLGLGYNP +SMDEI NAAVIHYNGNMKPWLDIAMNQ++ LWTKYVD EMEFVQMCNF
Sbjct: 500  WHVLGLGYNPGVSMDEIRNAAVIHYNGNMKPWLDIAMNQYKSLWTKYVDNEMEFVQMCNF 559

Query: 103  GL 98
            GL
Sbjct: 560  GL 561


>ref|XP_004232120.1| PREDICTED: probable galacturonosyltransferase 9-like [Solanum
            lycopersicum]
          Length = 547

 Score =  751 bits (1939), Expect = 0.0
 Identities = 372/542 (68%), Positives = 441/542 (81%), Gaps = 19/542 (3%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSNNHHSVLPSSSGPPVHAYVHRTFLAIKSDPLK 1487
            R+F SYRIFVS MFTLLFLATLSV+ +S  HH    S+ G   +AY+HR+ ++I SDPLK
Sbjct: 16   RNFFSYRIFVSAMFTLLFLATLSVLFSS--HHD---STIG---NAYLHRSLVSINSDPLK 67

Query: 1486 TRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLK----LRSSAFN 1319
            TRLDLI+KQ ND++              L+ +KQL++FEDL++NFS L+     RS+ F+
Sbjct: 68   TRLDLIHKQANDHIALVNAYTAYARKLKLEIAKQLKMFEDLAQNFSDLQSKQNYRSNLFD 127

Query: 1318 PEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLFKAKK 1139
             + P D+D LKQ EKE KD+VK ARLLI+++KES+DNQ+KIQKLKDTIFAVNE   KAKK
Sbjct: 128  TDGPLDDDSLKQFEKEVKDKVKFARLLIADSKESYDNQLKIQKLKDTIFAVNELFVKAKK 187

Query: 1138 QGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVFSDXX 959
             G F+S IAAKSTPKSLHCLAMRL+ ERI+HP+KY+D++ E PE+EDP+LYHYA+FSD  
Sbjct: 188  NGAFASSIAAKSTPKSLHCLAMRLMEERISHPEKYRDEDPE-PEYEDPTLYHYAIFSDNV 246

Query: 958  XXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVEDFKFL 779
                         +EEPWKHVFHVVTDRM+ AAMKVWFKMRP +  AH+EIK+VEDF FL
Sbjct: 247  IAVSVVVNSVIKNAEEPWKHVFHVVTDRMNLAAMKVWFKMRP-IQQAHIEIKSVEDFTFL 305

Query: 778  NSSYVPVLRQTES-------------NSTS--NNMKFRNPRYLSMLNHLRFYLPEMYPKL 644
             SSYVPVL+Q ES             N+T   NNMKF+NP+YLSMLNHLRFYLPEMYP L
Sbjct: 306  TSSYVPVLKQLESAKLQNFYFQNSAENATQDVNNMKFKNPKYLSMLNHLRFYLPEMYPTL 365

Query: 643  HKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPK 464
            H+ILFLDDDVVVQ+DLT LW IDLDGKVNGAVETCFGSFHRY+QY+NFSHPL++EKFNPK
Sbjct: 366  HRILFLDDDVVVQKDLTALWTIDLDGKVNGAVETCFGSFHRYSQYLNFSHPLVREKFNPK 425

Query: 463  ACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKS 284
            ACAWA+GMNIFDLDAWRREKCTEQYH+WQN+NE+RTLWKLGTLP GL+TF+S TK LDKS
Sbjct: 426  ACAWAFGMNIFDLDAWRREKCTEQYHYWQNLNEDRTLWKLGTLPAGLMTFFSKTKSLDKS 485

Query: 283  WHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNF 104
            WHVLGLG+NPS+SMDEI+ AAVIHYNG+MKPWLDIA+NQ++ LWTKY+D EMEFVQMCNF
Sbjct: 486  WHVLGLGFNPSVSMDEIHKAAVIHYNGDMKPWLDIALNQYKELWTKYIDSEMEFVQMCNF 545

Query: 103  GL 98
            G+
Sbjct: 546  GV 547


>gb|AHL38785.1| glycosyltransferase, partial [Arabidopsis thaliana]
          Length = 561

 Score =  750 bits (1937), Expect = 0.0
 Identities = 374/542 (69%), Positives = 431/542 (79%), Gaps = 19/542 (3%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSNNHHSVLPSSSGPPVHAYVHRTFLAIKSDPLK 1487
            RSF SYRIF+S +F+ LFLAT SVVL S+ H      +     +AY+ RTFLA++SDPLK
Sbjct: 21   RSFFSYRIFISALFSFLFLATFSVVLNSSRHQPHQDHTLPSMGNAYMQRTFLALQSDPLK 80

Query: 1486 TRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLK----LRSSAFN 1319
            TRLDLI+KQ  D++              LD SKQL++FEDL+ NFS L+    L+S+  +
Sbjct: 81   TRLDLIHKQAIDHLTLVNAYAAYARKLKLDASKQLKLFEDLAINFSDLQSKPGLKSAVSD 140

Query: 1318 PEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLFKAKK 1139
                 +ED  +QLEKE KD+VKTAR++I E+KES+D Q+KIQKLKDTIFAV EQL KAKK
Sbjct: 141  NGNALEEDSFRQLEKEVKDKVKTARMMIVESKESYDTQLKIQKLKDTIFAVQEQLTKAKK 200

Query: 1138 QGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVFSDXX 959
             G  +SLI+AKS PKSLHCLAMRLVGERI++P+KYKD   + P  EDP+LYHYA+FSD  
Sbjct: 201  NGAVASLISAKSVPKSLHCLAMRLVGERISNPEKYKDAPPD-PAAEDPTLYHYAIFSDNV 259

Query: 958  XXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVEDFKFL 779
                         +EEPWKHVFHVVTDRM+ AAMKVWFKMRP   GAHVEIK+VEDFKFL
Sbjct: 260  IAVSGVVRSVVMNAEEPWKHVFHVVTDRMNLAAMKVWFKMRPLDRGAHVEIKSVEDFKFL 319

Query: 778  NSSYVPVLRQTES-------------NST--SNNMKFRNPRYLSMLNHLRFYLPEMYPKL 644
            NSSY PVLRQ ES             N+T  S+N+KF+NP+YLSMLNHLRFYLPEMYPKL
Sbjct: 320  NSSYAPVLRQLESAKLQKFYFENQAENATKDSHNLKFKNPKYLSMLNHLRFYLPEMYPKL 379

Query: 643  HKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPK 464
            +KILFLDDDVVVQ+D+TGLWKI+LDGKVNGAVETCFGSFHRY QY+NFSHPLIKE FNP 
Sbjct: 380  NKILFLDDDVVVQKDVTGLWKINLDGKVNGAVETCFGSFHRYGQYLNFSHPLIKENFNPS 439

Query: 463  ACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKS 284
            ACAWA+GMNIFDL+AWRREKCT+QYH+WQN+NE+RTLWKLGTLPPGLITFYS TK LDKS
Sbjct: 440  ACAWAFGMNIFDLNAWRREKCTDQYHYWQNLNEDRTLWKLGTLPPGLITFYSKTKSLDKS 499

Query: 283  WHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNF 104
            WHVLGLGYNP +SMDEI NA VIHYNGNMKPWLDIAMNQ++ LWTKYVD EMEFVQMCNF
Sbjct: 500  WHVLGLGYNPGVSMDEIRNAGVIHYNGNMKPWLDIAMNQYKSLWTKYVDNEMEFVQMCNF 559

Query: 103  GL 98
            GL
Sbjct: 560  GL 561


>ref|NP_566170.1| putative galacturonosyltransferase 9 [Arabidopsis thaliana]
            gi|26394254|sp|Q9FWA4.1|GAUT9_ARATH RecName:
            Full=Probable galacturonosyltransferase 9
            gi|10092184|gb|AAG12603.1|AC068900_9 unknown protein;
            9779-11709 [Arabidopsis thaliana]
            gi|19310441|gb|AAL84957.1| AT3g02350/F11A12_103
            [Arabidopsis thaliana] gi|21536764|gb|AAM61096.1|
            glycosyl transferase, putative [Arabidopsis thaliana]
            gi|28416491|gb|AAO42776.1| At3g02350/F11A12_103
            [Arabidopsis thaliana] gi|332640274|gb|AEE73795.1|
            putative galacturonosyltransferase 9 [Arabidopsis
            thaliana]
          Length = 561

 Score =  750 bits (1937), Expect = 0.0
 Identities = 374/542 (69%), Positives = 431/542 (79%), Gaps = 19/542 (3%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSNNHHSVLPSSSGPPVHAYVHRTFLAIKSDPLK 1487
            RSF SYRIF+S +F+ LFLAT SVVL S+ H      +     +AY+ RTFLA++SDPLK
Sbjct: 21   RSFFSYRIFISALFSFLFLATFSVVLNSSRHQPHQDHTLPSMGNAYMQRTFLALQSDPLK 80

Query: 1486 TRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLK----LRSSAFN 1319
            TRLDLI+KQ  D++              LD SKQL++FEDL+ NFS L+    L+S+  +
Sbjct: 81   TRLDLIHKQAIDHLTLVNAYAAYARKLKLDASKQLKLFEDLAINFSDLQSKPGLKSAVSD 140

Query: 1318 PEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLFKAKK 1139
                 +ED  +QLEKE KD+VKTAR++I E+KES+D Q+KIQKLKDTIFAV EQL KAKK
Sbjct: 141  NGNALEEDSFRQLEKEVKDKVKTARMMIVESKESYDTQLKIQKLKDTIFAVQEQLTKAKK 200

Query: 1138 QGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVFSDXX 959
             G  +SLI+AKS PKSLHCLAMRLVGERI++P+KYKD   + P  EDP+LYHYA+FSD  
Sbjct: 201  NGAVASLISAKSVPKSLHCLAMRLVGERISNPEKYKDAPPD-PAAEDPTLYHYAIFSDNV 259

Query: 958  XXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVEDFKFL 779
                         +EEPWKHVFHVVTDRM+ AAMKVWFKMRP   GAHVEIK+VEDFKFL
Sbjct: 260  IAVSVVVRSVVMNAEEPWKHVFHVVTDRMNLAAMKVWFKMRPLDRGAHVEIKSVEDFKFL 319

Query: 778  NSSYVPVLRQTES-------------NST--SNNMKFRNPRYLSMLNHLRFYLPEMYPKL 644
            NSSY PVLRQ ES             N+T  S+N+KF+NP+YLSMLNHLRFYLPEMYPKL
Sbjct: 320  NSSYAPVLRQLESAKLQKFYFENQAENATKDSHNLKFKNPKYLSMLNHLRFYLPEMYPKL 379

Query: 643  HKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPK 464
            +KILFLDDDVVVQ+D+TGLWKI+LDGKVNGAVETCFGSFHRY QY+NFSHPLIKE FNP 
Sbjct: 380  NKILFLDDDVVVQKDVTGLWKINLDGKVNGAVETCFGSFHRYGQYLNFSHPLIKENFNPS 439

Query: 463  ACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKS 284
            ACAWA+GMNIFDL+AWRREKCT+QYH+WQN+NE+RTLWKLGTLPPGLITFYS TK LDKS
Sbjct: 440  ACAWAFGMNIFDLNAWRREKCTDQYHYWQNLNEDRTLWKLGTLPPGLITFYSKTKSLDKS 499

Query: 283  WHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNF 104
            WHVLGLGYNP +SMDEI NA VIHYNGNMKPWLDIAMNQ++ LWTKYVD EMEFVQMCNF
Sbjct: 500  WHVLGLGYNPGVSMDEIRNAGVIHYNGNMKPWLDIAMNQYKSLWTKYVDNEMEFVQMCNF 559

Query: 103  GL 98
            GL
Sbjct: 560  GL 561


>ref|XP_006408424.1| hypothetical protein EUTSA_v10020416mg [Eutrema salsugineum]
            gi|557109570|gb|ESQ49877.1| hypothetical protein
            EUTSA_v10020416mg [Eutrema salsugineum]
          Length = 561

 Score =  750 bits (1936), Expect = 0.0
 Identities = 373/542 (68%), Positives = 430/542 (79%), Gaps = 19/542 (3%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSNNHHSVLPSSSGPPVHAYVHRTFLAIKSDPLK 1487
            RSF SYRIF+S +F+ LFLAT SVVL S+ H      +     +AY+ RTFLA++SDPLK
Sbjct: 21   RSFFSYRIFISALFSFLFLATFSVVLNSSRHQPHQDHTLPSMGNAYMQRTFLALQSDPLK 80

Query: 1486 TRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLK----LRSSAFN 1319
            TRLDLI+KQ  D++              LD SKQL++FEDL+ NFS L+    L+S+   
Sbjct: 81   TRLDLIHKQATDHLTLVNAYAAYARKLKLDASKQLKLFEDLAINFSDLQSKPGLKSAVSE 140

Query: 1318 PEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLFKAKK 1139
                 +ED  +QLEKE KD+VKTAR++I E+KES+D Q+KIQKLKDTIFAV EQL KAKK
Sbjct: 141  NGNALEEDTFRQLEKEVKDKVKTARMMIVESKESYDTQLKIQKLKDTIFAVQEQLTKAKK 200

Query: 1138 QGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVFSDXX 959
             G  +SLI+AKS PKSLHCLAMRLVGERI++P+KYKD   + P  EDPSLYHYA+FSD  
Sbjct: 201  SGAVASLISAKSVPKSLHCLAMRLVGERISNPEKYKDAPPD-PAAEDPSLYHYAIFSDNV 259

Query: 958  XXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVEDFKFL 779
                         +EEPWKHVFHVVTDRM+ AAMKVWFKMRP   GAHVEIK+VEDFKFL
Sbjct: 260  IAVSVVVRSVVMNAEEPWKHVFHVVTDRMNLAAMKVWFKMRPLDRGAHVEIKSVEDFKFL 319

Query: 778  NSSYVPVLRQTES-------------NST--SNNMKFRNPRYLSMLNHLRFYLPEMYPKL 644
            N+SY PVLRQ ES             N+T  ++N+KF+NP+YLSMLNHLRFYLPEMYPKL
Sbjct: 320  NTSYAPVLRQLESAKLQKFYFENQAENATKDAHNLKFKNPKYLSMLNHLRFYLPEMYPKL 379

Query: 643  HKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPK 464
            +KILFLDDDVVVQ+D+TGLWKI+LDGKVNGAVETCFGSFHRY QY+NF+HPLIKE FNP 
Sbjct: 380  NKILFLDDDVVVQKDVTGLWKINLDGKVNGAVETCFGSFHRYGQYLNFTHPLIKESFNPN 439

Query: 463  ACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKS 284
            ACAWA+GMNIFDL+AWRREKCT QYH+WQN+NE+RTLWKLGTLPPGLITFYS TK LDKS
Sbjct: 440  ACAWAFGMNIFDLNAWRREKCTNQYHYWQNLNEDRTLWKLGTLPPGLITFYSKTKSLDKS 499

Query: 283  WHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNF 104
            WHVLGLGYNP +SMDEI NAAVIHYNGNMKPWLDIAMNQ++ LWTKYVD EMEFVQMCNF
Sbjct: 500  WHVLGLGYNPGVSMDEIRNAAVIHYNGNMKPWLDIAMNQYKSLWTKYVDNEMEFVQMCNF 559

Query: 103  GL 98
            GL
Sbjct: 560  GL 561


>emb|CAN73730.1| hypothetical protein VITISV_022574 [Vitis vinifera]
          Length = 543

 Score =  749 bits (1935), Expect = 0.0
 Identities = 373/532 (70%), Positives = 435/532 (81%), Gaps = 9/532 (1%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSN----NHHSVLPSSSGPPVHAYVHRTFLAIKS 1499
            R+ +SYRIF S +FT+LF+AT+SV+L +N    +H SV+PSS     +AY+ RTFLA+KS
Sbjct: 18   RNIVSYRIFASALFTILFIATISVLLNTNPAPPSHDSVIPSSG----NAYMQRTFLALKS 73

Query: 1498 DPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLKLRSSAFN 1319
            DPL+TRLDLI+KQ ND++              LD S+QLR+F+DL+RNFS L  R     
Sbjct: 74   DPLRTRLDLIHKQANDHITLVNAYAAYARKLKLDISRQLRMFDDLARNFSDLAARPGPRT 133

Query: 1318 -PEEPF----DEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQL 1154
             PE+      DEDL++QLEKE KDRVK ARL+I+E+KES+DNQIKIQKLKDTIF+VNE L
Sbjct: 134  APEDEVEGTGDEDLVRQLEKEVKDRVKIARLMIAESKESYDNQIKIQKLKDTIFSVNELL 193

Query: 1153 FKAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAV 974
             KAKK G  +SLIAAKS PKSLHCLAMRLV ERIAHPDKY ++E +  EFEDPSLYHYA+
Sbjct: 194  VKAKKNGQVASLIAAKSIPKSLHCLAMRLVXERIAHPDKYTEEE-DSAEFEDPSLYHYAI 252

Query: 973  FSDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVE 794
            FS+               ++EPWKHVFHVV+DRM+ AAMKVWFKMRP  GGA VE+KAVE
Sbjct: 253  FSNNVIAVSVVVNSAVKNAQEPWKHVFHVVSDRMNVAAMKVWFKMRPVGGGARVEVKAVE 312

Query: 793  DFKFLNSSYVPVLRQTESNSTSNNMKFRNPRYLSMLNHLRFYLPEMYPKLHKILFLDDDV 614
            D+ FLNSSYVPVLRQ ES +  +N K RNP Y S+LNHLRFYLPEMYPKLH+ILFLDDDV
Sbjct: 313  DYAFLNSSYVPVLRQMESANYGDNAKLRNPNY-SLLNHLRFYLPEMYPKLHRILFLDDDV 371

Query: 613  VVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPKACAWAYGMNI 434
            VVQ+DL+ LW+IDLDGKVNGAVETCFGSFHRYA Y+NFS+ +I+EK NPKACAWAYGMNI
Sbjct: 372  VVQKDLSALWRIDLDGKVNGAVETCFGSFHRYAHYLNFSNSVIREKXNPKACAWAYGMNI 431

Query: 433  FDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKSWHVLGLGYNP 254
            FDLDAWRREKCT+QYH+WQN+NE+ TLWK G LPPGLITFYSTTK LDKSWHVLGLGYNP
Sbjct: 432  FDLDAWRREKCTDQYHYWQNLNEDGTLWKSGMLPPGLITFYSTTKSLDKSWHVLGLGYNP 491

Query: 253  SISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNFGL 98
            SISMDEIN+AAVIH+NGNMKPWLDIA+NQF+ LWTKYVD +MEFVQ+CNFGL
Sbjct: 492  SISMDEINHAAVIHFNGNMKPWLDIAINQFKNLWTKYVDNDMEFVQVCNFGL 543


>gb|ABD96860.1| hypothetical protein [Cleome spinosa]
          Length = 556

 Score =  741 bits (1914), Expect = 0.0
 Identities = 369/544 (67%), Positives = 431/544 (79%), Gaps = 21/544 (3%)
 Frame = -1

Query: 1666 RSFLSYRIFVSGMFTLLFLATLSVVLTSNNHHS--VLPSSSGPPVHAYVHRTFLAIKSDP 1493
            R   SYRIFVS MF+LLFLAT SVVL S+  H    LP++      AY+HRTFLA++SDP
Sbjct: 18   RGLFSYRIFVSAMFSLLFLATFSVVLNSSRQHQDPTLPNTGS----AYMHRTFLALQSDP 73

Query: 1492 LKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLK----LRSSA 1325
            LKTR+DLI+KQ  D++              +D SKQL++FEDL+ NFS L+    L+S  
Sbjct: 74   LKTRVDLIHKQATDHLTLVNAYAAYARKLKVDASKQLKLFEDLAINFSDLQSKPGLKSVL 133

Query: 1324 FNPEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQLFKA 1145
                   +ED L+Q+EKE KD+VKTAR++I+E+KES+D Q+KIQKLKDTIFAV+EQL KA
Sbjct: 134  SENGNAVEEDTLRQVEKEVKDKVKTARMMIAESKESYDTQLKIQKLKDTIFAVHEQLTKA 193

Query: 1144 KKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAVFSD 965
            KK G  +SLIAAKS PKS+HCLAMRLV ERI+HP+KYK+   + P  EDPSLYHYA+FSD
Sbjct: 194  KKSGTVASLIAAKSVPKSIHCLAMRLVEERISHPEKYKEAPPD-PAVEDPSLYHYAIFSD 252

Query: 964  XXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVEDFK 785
                           +EEPWKHVFHVVTDRM+ AAM VWF MRP   GAH+EIK VEDFK
Sbjct: 253  NVIAVSVVVRSVVMNAEEPWKHVFHVVTDRMNLAAMNVWFNMRPLGRGAHIEIKMVEDFK 312

Query: 784  FLNSSYVPVLRQTES-------------NST--SNNMKFRNPRYLSMLNHLRFYLPEMYP 650
            FLNSSYVPVLRQ ES             NST  ++N+KF+N ++LSMLNHLRFYLPEMYP
Sbjct: 313  FLNSSYVPVLRQLESAKLQKFYFENQAENSTMDAHNLKFKNAKHLSMLNHLRFYLPEMYP 372

Query: 649  KLHKILFLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFN 470
            KL K+LFLDDDVVVQ+DLTGLWKI+LDGKVNGAVETCFGSFHRYAQY+NFSHPLIKE FN
Sbjct: 373  KLRKMLFLDDDVVVQKDLTGLWKINLDGKVNGAVETCFGSFHRYAQYLNFSHPLIKESFN 432

Query: 469  PKACAWAYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLD 290
            P +CAWA+GMNIFDLDAWRREKCTEQYH+WQN+NE+++LW++GTLPPGLITFYS TK LD
Sbjct: 433  PNSCAWAFGMNIFDLDAWRREKCTEQYHYWQNLNEDQSLWRVGTLPPGLITFYSKTKSLD 492

Query: 289  KSWHVLGLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMC 110
            K+WHV+GLGYNPS+ MDEI NAAVIHYNGNMKPWLDIAMNQ++ LWTKYVD EMEFVQMC
Sbjct: 493  KAWHVMGLGYNPSVGMDEIRNAAVIHYNGNMKPWLDIAMNQYKSLWTKYVDGEMEFVQMC 552

Query: 109  NFGL 98
            NFGL
Sbjct: 553  NFGL 556


>ref|XP_007151434.1| hypothetical protein PHAVU_004G045900g [Phaseolus vulgaris]
            gi|561024743|gb|ESW23428.1| hypothetical protein
            PHAVU_004G045900g [Phaseolus vulgaris]
          Length = 546

 Score =  739 bits (1908), Expect = 0.0
 Identities = 363/538 (67%), Positives = 430/538 (79%), Gaps = 14/538 (2%)
 Frame = -1

Query: 1669 FRSFLSYRIFVSGMFTLLFLATLSVVLTSN----NHHSVLPSSSGPPVHAYVHRTFLAIK 1502
            FR   S+RIF+S MF+LLF+ATLSV+ TSN    N  S LP++     +AYV RTFLA+K
Sbjct: 14   FRGLFSFRIFISAMFSLLFIATLSVLFTSNPSTSNDESDLPTTG----NAYVQRTFLALK 69

Query: 1501 SDPLKTRLDLIYKQKNDYVXXXXXXXXXXXXXXLDNSKQLRIFEDLSRNFSHLKLR---- 1334
            SDPL+TR+DLI++Q  D++              LD SKQL+ F++L+RNFS + L+    
Sbjct: 70   SDPLRTRVDLIHQQAKDHIALVNAYGAYARKLKLDISKQLKTFDELARNFSDIALKPVYQ 129

Query: 1333 SSAFNPEEPFDEDLLKQLEKEAKDRVKTARLLISETKESFDNQIKIQKLKDTIFAVNEQL 1154
             S F  + P DED+LKQ EKE K+RVK ARL+I E KE++DNQ+KIQKLKDTIFAV+E L
Sbjct: 130  KSLFESDGPIDEDVLKQFEKEVKERVKIARLIIVEAKENYDNQLKIQKLKDTIFAVHESL 189

Query: 1153 FKAKKQGVFSSLIAAKSTPKSLHCLAMRLVGERIAHPDKYKDDEGEKPEFEDPSLYHYAV 974
             KAKK G  +SLI+A+S PKSLHCLAMRL+GE+I++P+KY+D EG KPEFEDP+LYHYA+
Sbjct: 190  AKAKKNGALASLISARSIPKSLHCLAMRLMGEKISNPEKYRD-EGPKPEFEDPALYHYAI 248

Query: 973  FSDXXXXXXXXXXXXXXXSEEPWKHVFHVVTDRMSFAAMKVWFKMRPPLGGAHVEIKAVE 794
            FSD               +EEPWKHVFHVV +RM+  AMKVWFKMRP  GGA +E+KAVE
Sbjct: 249  FSDNVIAVSVVVRSVVKNAEEPWKHVFHVVANRMNVGAMKVWFKMRPIEGGAFLEVKAVE 308

Query: 793  DFKFLNSSYVPVLRQTESNSTS------NNMKFRNPRYLSMLNHLRFYLPEMYPKLHKIL 632
            +F FLNSSYVP+LRQ ES   +      N+   +N + LSM++HLRFYLP+MYPKL+KIL
Sbjct: 309  EFAFLNSSYVPILRQLESEKMNQPENGTNDANMKNAKSLSMMDHLRFYLPDMYPKLYKIL 368

Query: 631  FLDDDVVVQRDLTGLWKIDLDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPKACAW 452
             LD+DVVVQ+DLTGLWKIDLDGKVNGAVE CFGSFHRYA YMNFSHPLIKEKFNPKACAW
Sbjct: 369  LLDEDVVVQKDLTGLWKIDLDGKVNGAVEICFGSFHRYAHYMNFSHPLIKEKFNPKACAW 428

Query: 451  AYGMNIFDLDAWRREKCTEQYHHWQNMNENRTLWKLGTLPPGLITFYSTTKPLDKSWHVL 272
            AYGMNIF+LDAWRREKCT+ Y +WQN+NE++TLWK G LPPGLITFYSTTK LDKSWHVL
Sbjct: 429  AYGMNIFNLDAWRREKCTDNYQYWQNLNEDQTLWKAGPLPPGLITFYSTTKSLDKSWHVL 488

Query: 271  GLGYNPSISMDEINNAAVIHYNGNMKPWLDIAMNQFRPLWTKYVDYEMEFVQMCNFGL 98
            GLGYNPSISMDEINNAAVIHYNGNMKPWLDIA+NQ++ LWTKYVD +MEFVQMCNFGL
Sbjct: 489  GLGYNPSISMDEINNAAVIHYNGNMKPWLDIALNQYKNLWTKYVDNDMEFVQMCNFGL 546

