BLASTX nr result

ID: Papaver27_contig00022675 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver27_contig00022675
         (1488 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276821.2| PREDICTED: uncharacterized protein At4g19900...   253   1e-64
ref|XP_002524140.1| lactosylceramide 4-alpha-galactosyltransfera...   245   4e-62
ref|XP_004233237.1| PREDICTED: uncharacterized protein At4g19900...   238   5e-60
ref|XP_006367891.1| PREDICTED: uncharacterized protein At4g19900...   235   4e-59
ref|XP_006448706.1| hypothetical protein CICLE_v10014513mg [Citr...   234   1e-58
ref|XP_006468482.1| PREDICTED: uncharacterized protein At4g19900...   232   3e-58
ref|XP_007024945.1| Alpha 1,4-glycosyltransferase family protein...   219   2e-54
ref|XP_007024944.1| Alpha 1,4-glycosyltransferase family protein...   219   2e-54
ref|XP_007024943.1| Alpha 1,4-glycosyltransferase family protein...   219   2e-54
gb|EXC24771.1| Uncharacterized protein L484_018485 [Morus notabi...   218   4e-54
gb|EYU43677.1| hypothetical protein MIMGU_mgv1a002953mg [Mimulus...   215   4e-53
ref|NP_193724.2| alpha 1,4-glycosyltransferase-like protein [Ara...   213   2e-52
emb|CAB52870.1| putative protein [Arabidopsis thaliana] gi|72687...   213   2e-52
ref|XP_007214630.1| hypothetical protein PRUPE_ppa002948mg [Prun...   212   4e-52
ref|XP_006285648.1| hypothetical protein CARUB_v10007104mg [Caps...   209   3e-51
ref|XP_006413925.1| hypothetical protein EUTSA_v10024627mg [Eutr...   208   4e-51
ref|XP_004158677.1| PREDICTED: uncharacterized protein At4g19900...   206   2e-50
ref|XP_004134884.1| PREDICTED: uncharacterized protein At4g19900...   206   2e-50
ref|XP_004158676.1| PREDICTED: uncharacterized protein At4g19900...   201   9e-49
ref|XP_004293757.1| PREDICTED: uncharacterized protein At4g19900...   192   3e-46

>ref|XP_002276821.2| PREDICTED: uncharacterized protein At4g19900-like [Vitis vinifera]
          Length = 707

 Score =  253 bits (647), Expect = 1e-64
 Identities = 178/466 (38%), Positives = 235/466 (50%), Gaps = 48/466 (10%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDS----- 1094
            LR+RRR RYG Q+CA+ AA+        LHSRL F R S  S     G+ F         
Sbjct: 5    LRSRRRPRYGAQVCAVIAALLLLLSVTVLHSRLSFSRDSRLSPKVGLGLRFPNSKVPPID 64

Query: 1093 DDIDAIVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTD-DQSKVSDEEEI 917
               DA+V++PL +D                       RIDELDV++ + DQ+ +S+EEEI
Sbjct: 65   PQNDAVVLDPLTQDSDPGGNSGADD------------RIDELDVMEEEADQAGLSNEEEI 112

Query: 916  LRGVDFEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXX 737
            LRGV+ E+E   ++  +R      Y +DH+ GV RRAFDKRSI  W+             
Sbjct: 113  LRGVESEDE---EVGESRVS---GYFFDHVSGVIRRAFDKRSIDQWEDYVGFDVGSGMED 166

Query: 736  XXXXXSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKGDF 569
                  K  F SDD  VDE VRRK+  +  IED LLLKT    +PLREGW  WF+ K DF
Sbjct: 167  RS----KGVFASDDVVVDEEVRRKVGEVDGIEDMLLLKTGRRANPLREGWGPWFDTKSDF 222

Query: 568  LRRDKMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPF- 392
            LRRD+MFKS            LQDPDG+G+T+LTRGD+++ K LLN+F+K+P +VK P  
Sbjct: 223  LRRDRMFKSNLEVLNPMNNPLLQDPDGIGITSLTRGDRLVQKFLLNKFKKVPFLVKKPLG 282

Query: 391  ------FGNNVKERDANV-----DEMKVNKDRMINQVKG-------VERRTLDDNGSSNS 266
                   G+ + E    V     D + V K  + + V+G        ERRTL D  S   
Sbjct: 283  VSATTNLGSRLVEDGGQVAIKIRDSLNVQKTTLGSDVEGRRTEIRRAERRTLHD--SYGF 340

Query: 265  NDGAENLGSIERVIAGYENSNSHSNDERK--LHLPNGKEPLKLNVPNGKV--------NV 116
                + +  +  V+ G    NS    +R   +   + +   +L   NG          N 
Sbjct: 341  GLDTKKIVDVNEVLNGTTTGNSSYKHDRNETVEYKSVQNISELGHKNGDSKARRLGHNNE 400

Query: 115  D---------SGPIYADGSRWGYFPGLPPHLSFTDFVTEFFRQGKC 5
            D         SG IYADG RWGYFPGL P LSF++F+  F R+GKC
Sbjct: 401  DSKARRKSELSGHIYADGKRWGYFPGLHPRLSFSNFMNAFIRKGKC 446


>ref|XP_002524140.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ricinus
            communis] gi|223536607|gb|EEF38251.1| lactosylceramide
            4-alpha-galactosyltransferase, putative [Ricinus
            communis]
          Length = 691

 Score =  245 bits (625), Expect = 4e-62
 Identities = 176/447 (39%), Positives = 228/447 (51%), Gaps = 29/447 (6%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDIDA 1079
            LR+RRR RYG Q+CA+ +A+        LH+R+     SSSS      +  + D D+   
Sbjct: 5    LRSRRRPRYGAQVCAVISALLLLLSVSLLHTRI-----SSSSHHHHHSVHQNDDDDETST 59

Query: 1078 IV-VNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVI-DTDDQSKVSDEEEILRGV 905
            I+  NPLL D                      D+IDELD   D  D + + + +     +
Sbjct: 60   IIHQNPLLSD------------SADDNSNDVVDKIDELDTFEDQKDTTGIRNYDNNEGSL 107

Query: 904  DFEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFD-KRSIHPWQXXXXXXXXXXXXXXXX 728
            + E    + +       S  YV DHI G  RRAF+ KRSI  W                 
Sbjct: 108  EDESGLQAQIKKTAVSASGYYV-DHITGSIRRAFNNKRSIDEWDYDYSSFSAVEDHQKS- 165

Query: 727  XXSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLK----TSPLREGWANWFEKKGDFLRR 560
               K AFGSDD P+DE+VRRK++ +  IEDALLLK     SPLREGW +WF+KKGDFLRR
Sbjct: 166  ---KAAFGSDDIPIDEDVRRKVNEVDGIEDALLLKIGKRVSPLREGWGDWFDKKGDFLRR 222

Query: 559  DKMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPF---- 392
            D+MFKS            LQDPD VG T LTRGDK++ K LLNEF++ P ++KNP     
Sbjct: 223  DRMFKSNLEVLNPLNNPLLQDPDAVGFTGLTRGDKVVQKFLLNEFKRNPFLIKNPLRVLR 282

Query: 391  FGNNVKERDANVDEMKVNKD-RMINQVKGVERRTLDDNGSSNS-----NDGAENLGSIER 230
              + V+E   +V+  K   D    +  K  ERR  D+N S+ S     N+  ENL   E+
Sbjct: 283  MTHEVEENGNDVEIRKSASDFNSRDGSKIAERRIFDENVSTESYGKRVNNVQENLNEDEK 342

Query: 229  --VIAGYENSNSHSNDERKLHLPNGKEPLKLNVPNGKVNVDS----------GPIYADGS 86
              V  G   S+  SND RK         ++L   +G  N +S            IYADG 
Sbjct: 343  TNVTQGDNLSDRLSNDSRKDLSSANSITVELKQMDGVENRESKIIQRKSEELSYIYADGK 402

Query: 85   RWGYFPGLPPHLSFTDFVTEFFRQGKC 5
            RWGYFPGL PHLSF+DF+  FFR+GKC
Sbjct: 403  RWGYFPGLHPHLSFSDFMDSFFRKGKC 429


>ref|XP_004233237.1| PREDICTED: uncharacterized protein At4g19900-like [Solanum
            lycopersicum]
          Length = 681

 Score =  238 bits (607), Expect = 5e-60
 Identities = 167/446 (37%), Positives = 218/446 (48%), Gaps = 27/446 (6%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDIDA 1079
            LR RRR  YG  +CA+ AAI        L+SRL F  Q ++        D          
Sbjct: 5    LRTRRRPLYGAHICALAAAILLLLSVSLLYSRLNFFLQPNNPHPHPLQYDTIS------- 57

Query: 1078 IVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGVDF 899
             + NPL++D+                      RIDELDV D+++ +   D+E +L     
Sbjct: 58   -LNNPLVDDLADADYRSSDD------------RIDELDVADSNNNN---DDEFLLSNES- 100

Query: 898  EEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXXXS 719
             EE D ++ N   + S  Y +D   GV RRAF+KRSI  W+                   
Sbjct: 101  -EEDDEEIINQYPRVSSTYFYDQRHGVVRRAFNKRSIEEWEDYVNFESRMKLGLGFKSDE 159

Query: 718  -KIAFGSDDQPVDENVRRKLDHIRVIEDALLLKTSPLREGWANWFEKKGDFLRRDKMFKS 542
             K AFGSDD PVD  +R KL  I  +EDALLLK SPLREGW  WFEKK DFLRRD+MFKS
Sbjct: 160  SKAAFGSDDLPVDVQMRMKLSEIESVEDALLLKGSPLREGWGEWFEKKSDFLRRDRMFKS 219

Query: 541  XXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGNNV----- 377
                        LQDPDG G T LT+GDKI+ K L+NEF+K+P +VK P   + +     
Sbjct: 220  NLEALNPNNNPMLQDPDGAGTTGLTKGDKIVLKGLMNEFKKVPFLVKKPLSVSELTKSEL 279

Query: 376  ---------------------KERDANVDEMKVNKDRMINQVKGVERRTLDDNGSSNSND 260
                                 KE   N D +K N D  +N+ K V+RRTL+D+       
Sbjct: 280  VNDALELQKMAGLAKNDVFESKELKFNSDLVKTN-DEDVNRGKRVKRRTLNDDARIG--- 335

Query: 259  GAENLGSIERVIAGYENSNSHSNDERKLHLPNGKEPLKLNVPNGKVNVDSGPIYADGSRW 80
                    +RV+    +S   S    K  + NG   +  +   G+V   SG ++ADG RW
Sbjct: 336  --------KRVV---HDSGGDSAPRSKEDIRNGNMKVVEDDSRGEV---SGLVFADGKRW 381

Query: 79   GYFPGLPPHLSFTDFVTEFFRQGKCS 2
            GYFPGL P LSFT+F+  FFR+ KC+
Sbjct: 382  GYFPGLHPRLSFTNFMDSFFRKAKCT 407


>ref|XP_006367891.1| PREDICTED: uncharacterized protein At4g19900-like [Solanum tuberosum]
          Length = 681

 Score =  235 bits (599), Expect = 4e-59
 Identities = 162/445 (36%), Positives = 218/445 (48%), Gaps = 26/445 (5%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDIDA 1079
            LR RRR  YG  +CA+ AA+        L+SRL F  Q ++        D          
Sbjct: 5    LRTRRRPLYGAHICALAAAVLLLLSVSLLYSRLNFFLQPNNPHPHPLQYDTIS------- 57

Query: 1078 IVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGVDF 899
             + NPL++D+                      RIDELDV D+++ +   D+E +L     
Sbjct: 58   -LNNPLVDDLADADYRSSDD------------RIDELDVADSNNNN---DDEFLLSNES- 100

Query: 898  EEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXXXS 719
             EE D ++ N   + S  Y +D   GV RRAF+KRSI  W+                   
Sbjct: 101  -EEDDEEIINQYPRVSSTYFYDQRHGVVRRAFNKRSIEEWEDYVNFESRMKLGLGFKSDE 159

Query: 718  -KIAFGSDDQPVDENVRRKLDHIRVIEDALLLKTSPLREGWANWFEKKGDFLRRDKMFKS 542
             K AFGSDD PVD  +R KL  I  +EDALLLK SPLREGW  WFEKK DFLRRD+MFKS
Sbjct: 160  SKAAFGSDDLPVDVQMRMKLSEIESVEDALLLKGSPLREGWGEWFEKKSDFLRRDRMFKS 219

Query: 541  XXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGNNVKERDA 362
                        LQDPDG G T LT+GDKI+ K L+NEF+K+P +VK P   + + + + 
Sbjct: 220  NLEALNPNNNPMLQDPDGAGTTGLTKGDKIVLKGLMNEFKKVPFLVKKPLSVSELTKSEL 279

Query: 361  NVDEMKVNK-------------------------DRMINQVKGVERRTLDDNGSSNSNDG 257
              D +++ K                         D  +N+ K V+RRTL+D+        
Sbjct: 280  VNDALELQKMAGLAKNDVFESKELKFNSQLVKTNDEDVNRGKRVKRRTLNDDARIG---- 335

Query: 256  AENLGSIERVIAGYENSNSHSNDERKLHLPNGKEPLKLNVPNGKVNVDSGPIYADGSRWG 77
                   +RV     +S+  S    K  + NG   +  +   G+V   SG ++ADG RWG
Sbjct: 336  -------KRV---DHDSDGDSAPRSKEEIRNGNMKVVEDDARGEV---SGLLFADGKRWG 382

Query: 76   YFPGLPPHLSFTDFVTEFFRQGKCS 2
            YFPGL P LSFT+F+  FFR+ KC+
Sbjct: 383  YFPGLQPRLSFTNFMDSFFRKAKCT 407


>ref|XP_006448706.1| hypothetical protein CICLE_v10014513mg [Citrus clementina]
            gi|557551317|gb|ESR61946.1| hypothetical protein
            CICLE_v10014513mg [Citrus clementina]
          Length = 667

 Score =  234 bits (596), Expect = 1e-58
 Identities = 167/437 (38%), Positives = 219/437 (50%), Gaps = 19/437 (4%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDIDA 1079
            LR RRR RYG Q+CA+ AA+        LH+RL    Q          +   Q + D DA
Sbjct: 5    LRARRRPRYGAQVCALIAALLLLLSVSLLHTRLSQPNQI---------LRHHQIASD-DA 54

Query: 1078 IVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDT-DDQSKVSDEEEILRGVD 902
            + ++PLL D                        +D++D +DT DD   V D EE  +   
Sbjct: 55   VFIDPLLSDSDDSNDN----------------NVDKIDELDTLDDNDVVVDNEEKPK--- 95

Query: 901  FEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXXX 722
                          K S  Y +DH+ G  RRAF+KRSI  W                   
Sbjct: 96   --------------KSSSGYYFDHLSGSIRRAFNKRSIDDWDFDYSGFPTLQSNVEDKS- 140

Query: 721  SKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKGDFLRRDK 554
             K AFGSDD PVD+ VRRK+  ++ IEDALLLKT    SPLRE W  WF+KKG+FLRRDK
Sbjct: 141  -KTAFGSDDFPVDDEVRRKMTLVKDIEDALLLKTGKGKSPLRETWGEWFDKKGEFLRRDK 199

Query: 553  MFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGNNVK 374
            MFKS            LQDPDGVG++ LTRGDK++ K LLNEF+ +P + K P     V 
Sbjct: 200  MFKSHLEVLNPMNNPLLQDPDGVGISGLTRGDKVLQKLLLNEFKLVPFIGKKPL---GVL 256

Query: 373  ERDANVDEMKVNKDRM--INQVKGVERRTLDDNGSSNSNDGAENLGSIERV------IAG 218
            +   N++     ++ +   +++K  ERRTLDD  S N+   ++ + + E V       A 
Sbjct: 257  DSSGNLNFRGNGREELGRRSEIKRAERRTLDD--SVNNESYSKRVNNEEPVKDESSGNAT 314

Query: 217  YENSNSHSNDERKLHLPNGKEPLKLN--VPNGKV----NVDSGPIYADGSRWGYFPGLPP 56
             E  +   ND  K     G E  K +  V + K     N  S  IYADG RWGY+PGL P
Sbjct: 315  GELYDKEVNDSNKYLSARGNESSKTDEAVRDSKAYQSKNEFSSHIYADGKRWGYYPGLHP 374

Query: 55   HLSFTDFVTEFFRQGKC 5
             LSF++F+  FFR+GKC
Sbjct: 375  RLSFSNFMDAFFRKGKC 391


>ref|XP_006468482.1| PREDICTED: uncharacterized protein At4g19900-like [Citrus sinensis]
          Length = 667

 Score =  232 bits (592), Expect = 3e-58
 Identities = 167/436 (38%), Positives = 218/436 (50%), Gaps = 18/436 (4%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDIDA 1079
            LR RRR RYG Q+CA+ AA+        LH+RL    Q          +   Q + D DA
Sbjct: 5    LRARRRPRYGAQVCALIAALLLLLSVSLLHTRLSQPNQI---------LRHHQLASD-DA 54

Query: 1078 IVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGVDF 899
            + ++PLL D                       +IDELD +D +D             VD 
Sbjct: 55   VFIDPLLSDSDDSNDNNVD-------------KIDELDTLDDNDVV-----------VDN 90

Query: 898  EEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXXXS 719
            EE+            S  Y +DH+ G  RRAF+KRSI  W                    
Sbjct: 91   EEKPKMS--------SSSYYFDHLSGSIRRAFNKRSIDDWDFDYSGFTTLQSNVEDKS-- 140

Query: 718  KIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKGDFLRRDKM 551
            K AFGSDD PVD+ VRRK+  ++ IEDALLLKT    SPLRE W  WF+KKG+FLRRDKM
Sbjct: 141  KTAFGSDDFPVDDEVRRKMTLVKDIEDALLLKTGKGKSPLREKWGEWFDKKGEFLRRDKM 200

Query: 550  FKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGNNVKE 371
            FKS            LQDPDGVG++ LTRGDK++ K LLNEF+ +P + K P     V +
Sbjct: 201  FKSHLEVLNPMNNPLLQDPDGVGISGLTRGDKVLQKLLLNEFKLVPFIGKKPL---GVLD 257

Query: 370  RDANVDEMKVNKDRM--INQVKGVERRTLDDNGSSNSNDGAENLGSIERV------IAGY 215
               N++     ++ +   +++K  ERRTLDD  S N+   ++ + + E V       A  
Sbjct: 258  SSGNLNFRGNGREELGRRSEIKRAERRTLDD--SVNNESYSKRVNNEEHVKDESSGNATG 315

Query: 214  ENSNSHSNDERKLHLPNGKEPLKLN--VPNGKV----NVDSGPIYADGSRWGYFPGLPPH 53
            E  +   ND  K     G E  K +  V + K     N  S  IYADG RWGY+PGL P 
Sbjct: 316  ELYDKEVNDSNKYLSARGNESSKTDEAVRDSKAYQSKNEFSSHIYADGKRWGYYPGLHPR 375

Query: 52   LSFTDFVTEFFRQGKC 5
            LSF++F+  FFR+GKC
Sbjct: 376  LSFSNFMDAFFRKGKC 391


>ref|XP_007024945.1| Alpha 1,4-glycosyltransferase family protein, putative isoform 3
            [Theobroma cacao] gi|508780311|gb|EOY27567.1| Alpha
            1,4-glycosyltransferase family protein, putative isoform
            3 [Theobroma cacao]
          Length = 539

 Score =  219 bits (559), Expect = 2e-54
 Identities = 157/432 (36%), Positives = 208/432 (48%), Gaps = 14/432 (3%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDIDA 1079
            L++RRR RYG Q+CA  +A+        L+SRL     S SS         S D +D  A
Sbjct: 5    LQSRRRPRYGAQVCAAISALLLLFSVSLLYSRL-----SLSSKPHIYPHHSSIDKNDDVA 59

Query: 1078 IVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGVDF 899
               NPLL D                       +IDE D ++ D+ + +++++     ++ 
Sbjct: 60   FPNNPLLSDSDDDVSTTNDD------------KIDEFDTLE-DNDTVLTEDDNNNNEIEQ 106

Query: 898  EEESDSDLNN--NRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXX 725
            EEE +    N  N+   S  + +DH+ G  +RA +KRSI  W                  
Sbjct: 107  EEEQEITTMNQKNKIFSSGHFYFDHLSGSIKRASNKRSIEDWDYDGGFLNEGFLGEDAKI 166

Query: 724  XSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLK------TSPLREGWANWFEKKGDFLR 563
              KIAFGSDD P+DE VRRK+  +  +EDALL+K       +PLRE W +WF+KKGDFLR
Sbjct: 167  --KIAFGSDDIPLDEEVRRKMSEVEGVEDALLVKKVGGKKANPLREKWGDWFDKKGDFLR 224

Query: 562  RDKMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGN 383
            RD+MFKS            LQDPDGVGVT LTRGD+I+ K +L+EF+K+P   K P    
Sbjct: 225  RDRMFKSNLEVLNPLNNPLLQDPDGVGVTGLTRGDRIVQKWILSEFKKVPFTGKKPL--- 281

Query: 382  NVKERDANVDEMKVNKDRMINQVKGVERRTLDDNGSSNSNDGAENLGSIERVIAGYENSN 203
             + E                   KG E +     G    ND A N      V++  ENS 
Sbjct: 282  GILE-------------------KGSEDK---KGGEGKKNDNARN------VLSKRENSI 313

Query: 202  SHSNDERKLHLPNGKEPLKLNVPNGKVNVD------SGPIYADGSRWGYFPGLPPHLSFT 41
              S      +  N     K  V NG +  D      SG IYADG RWGY+PGL   LSF+
Sbjct: 314  KDSGSNTNGNKTNESNSRKNEVKNGGLEADKMNTEFSGHIYADGKRWGYYPGLDSRLSFS 373

Query: 40   DFVTEFFRQGKC 5
            DF+  F R+GKC
Sbjct: 374  DFMDAFLRKGKC 385


>ref|XP_007024944.1| Alpha 1,4-glycosyltransferase family protein, putative isoform 2
            [Theobroma cacao] gi|508780310|gb|EOY27566.1| Alpha
            1,4-glycosyltransferase family protein, putative isoform
            2 [Theobroma cacao]
          Length = 541

 Score =  219 bits (559), Expect = 2e-54
 Identities = 157/432 (36%), Positives = 208/432 (48%), Gaps = 14/432 (3%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDIDA 1079
            L++RRR RYG Q+CA  +A+        L+SRL     S SS         S D +D  A
Sbjct: 5    LQSRRRPRYGAQVCAAISALLLLFSVSLLYSRL-----SLSSKPHIYPHHSSIDKNDDVA 59

Query: 1078 IVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGVDF 899
               NPLL D                       +IDE D ++ D+ + +++++     ++ 
Sbjct: 60   FPNNPLLSDSDDDVSTTNDD------------KIDEFDTLE-DNDTVLTEDDNNNNEIEQ 106

Query: 898  EEESDSDLNN--NRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXX 725
            EEE +    N  N+   S  + +DH+ G  +RA +KRSI  W                  
Sbjct: 107  EEEQEITTMNQKNKIFSSGHFYFDHLSGSIKRASNKRSIEDWDYDGGFLNEGFLGEDAKI 166

Query: 724  XSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLK------TSPLREGWANWFEKKGDFLR 563
              KIAFGSDD P+DE VRRK+  +  +EDALL+K       +PLRE W +WF+KKGDFLR
Sbjct: 167  --KIAFGSDDIPLDEEVRRKMSEVEGVEDALLVKKVGGKKANPLREKWGDWFDKKGDFLR 224

Query: 562  RDKMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGN 383
            RD+MFKS            LQDPDGVGVT LTRGD+I+ K +L+EF+K+P   K P    
Sbjct: 225  RDRMFKSNLEVLNPLNNPLLQDPDGVGVTGLTRGDRIVQKWILSEFKKVPFTGKKPL--- 281

Query: 382  NVKERDANVDEMKVNKDRMINQVKGVERRTLDDNGSSNSNDGAENLGSIERVIAGYENSN 203
             + E                   KG E +     G    ND A N      V++  ENS 
Sbjct: 282  GILE-------------------KGSEDK---KGGEGKKNDNARN------VLSKRENSI 313

Query: 202  SHSNDERKLHLPNGKEPLKLNVPNGKVNVD------SGPIYADGSRWGYFPGLPPHLSFT 41
              S      +  N     K  V NG +  D      SG IYADG RWGY+PGL   LSF+
Sbjct: 314  KDSGSNTNGNKTNESNSRKNEVKNGGLEADKMNTEFSGHIYADGKRWGYYPGLDSRLSFS 373

Query: 40   DFVTEFFRQGKC 5
            DF+  F R+GKC
Sbjct: 374  DFMDAFLRKGKC 385


>ref|XP_007024943.1| Alpha 1,4-glycosyltransferase family protein, putative isoform 1
            [Theobroma cacao] gi|508780309|gb|EOY27565.1| Alpha
            1,4-glycosyltransferase family protein, putative isoform
            1 [Theobroma cacao]
          Length = 655

 Score =  219 bits (559), Expect = 2e-54
 Identities = 157/432 (36%), Positives = 208/432 (48%), Gaps = 14/432 (3%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDIDA 1079
            L++RRR RYG Q+CA  +A+        L+SRL     S SS         S D +D  A
Sbjct: 5    LQSRRRPRYGAQVCAAISALLLLFSVSLLYSRL-----SLSSKPHIYPHHSSIDKNDDVA 59

Query: 1078 IVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGVDF 899
               NPLL D                       +IDE D ++ D+ + +++++     ++ 
Sbjct: 60   FPNNPLLSDSDDDVSTTNDD------------KIDEFDTLE-DNDTVLTEDDNNNNEIEQ 106

Query: 898  EEESDSDLNN--NRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXX 725
            EEE +    N  N+   S  + +DH+ G  +RA +KRSI  W                  
Sbjct: 107  EEEQEITTMNQKNKIFSSGHFYFDHLSGSIKRASNKRSIEDWDYDGGFLNEGFLGEDAKI 166

Query: 724  XSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLK------TSPLREGWANWFEKKGDFLR 563
              KIAFGSDD P+DE VRRK+  +  +EDALL+K       +PLRE W +WF+KKGDFLR
Sbjct: 167  --KIAFGSDDIPLDEEVRRKMSEVEGVEDALLVKKVGGKKANPLREKWGDWFDKKGDFLR 224

Query: 562  RDKMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGN 383
            RD+MFKS            LQDPDGVGVT LTRGD+I+ K +L+EF+K+P   K P    
Sbjct: 225  RDRMFKSNLEVLNPLNNPLLQDPDGVGVTGLTRGDRIVQKWILSEFKKVPFTGKKPL--- 281

Query: 382  NVKERDANVDEMKVNKDRMINQVKGVERRTLDDNGSSNSNDGAENLGSIERVIAGYENSN 203
             + E                   KG E +     G    ND A N      V++  ENS 
Sbjct: 282  GILE-------------------KGSEDK---KGGEGKKNDNARN------VLSKRENSI 313

Query: 202  SHSNDERKLHLPNGKEPLKLNVPNGKVNVD------SGPIYADGSRWGYFPGLPPHLSFT 41
              S      +  N     K  V NG +  D      SG IYADG RWGY+PGL   LSF+
Sbjct: 314  KDSGSNTNGNKTNESNSRKNEVKNGGLEADKMNTEFSGHIYADGKRWGYYPGLDSRLSFS 373

Query: 40   DFVTEFFRQGKC 5
            DF+  F R+GKC
Sbjct: 374  DFMDAFLRKGKC 385


>gb|EXC24771.1| Uncharacterized protein L484_018485 [Morus notabilis]
          Length = 624

 Score =  218 bits (556), Expect = 4e-54
 Identities = 157/423 (37%), Positives = 206/423 (48%), Gaps = 5/423 (1%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDIDA 1079
            LR+RRR RYG   CA  AA+        L+SRL      + S L R  I         DA
Sbjct: 12   LRSRRRPRYGAYACAAAAALLLLLSVSLLYSRLS---HHNHSILLRRRIP--------DA 60

Query: 1078 I-VVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGVD 902
            + V NPL+ D                      DRIDELD +  +D  +  D+E +     
Sbjct: 61   VSVANPLISD----------DSQDEDAAVAVDDRIDELDDVVFEDSPR--DDEPL----- 103

Query: 901  FEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXXX 722
             E++ +   + NR + S  + +DH+ G  RR F  RSI  W                   
Sbjct: 104  -EDDDEQIPDKNRLRVS-GFFYDHVNGAIRRRFSHRSIDDWDDEYSGFSLGLVAEDQS-- 159

Query: 721  SKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKGDFLRRDK 554
             K AFGSDD PVDE VRRK   +  IEDAL+LK     SPLREGW +WF+KK DF RRD+
Sbjct: 160  -KAAFGSDDVPVDETVRRKASEVVGIEDALMLKVGKRVSPLREGWGDWFDKKSDFFRRDR 218

Query: 553  MFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGNNVK 374
            MFKS            LQDPDG+GVT+LTRGDK++ K+LLNEF+++P+++K P     V 
Sbjct: 219  MFKSNLEILNPLNNPMLQDPDGIGVTSLTRGDKLVQKSLLNEFKRVPLLMKKPL---GVV 275

Query: 373  ERDANVDEMKVNKDRMINQVKGVERRTLDDNGSSNSNDGAENLGSIERVIAGYENSNSHS 194
            E      + KV ++   N++K  ERRTLD N              + R  + +E+     
Sbjct: 276  ELPRTSLKSKVGENG--NEIKKAERRTLDSN--------------VVRRRSEFESY---- 315

Query: 193  NDERKLHLPNGKEPLKLNVPNGKVNVDSGPIYADGSRWGYFPGLPPHLSFTDFVTEFFRQ 14
                                          +YADG RWGY+PGL PHLSF+DF+ EFFR+
Sbjct: 316  ------------------------------VYADGKRWGYYPGLQPHLSFSDFMDEFFRK 345

Query: 13   GKC 5
            GKC
Sbjct: 346  GKC 348


>gb|EYU43677.1| hypothetical protein MIMGU_mgv1a002953mg [Mimulus guttatus]
          Length = 622

 Score =  215 bits (548), Expect = 4e-53
 Identities = 150/419 (35%), Positives = 200/419 (47%), Gaps = 1/419 (0%)
 Frame = -3

Query: 1258 LRNRRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDIDA 1079
            LR+RRR RYG  +CA+ AA+        LHSRL F   S S        D+S        
Sbjct: 5    LRSRRRPRYGAHVCALIAAVLLLLSVSLLHSRLSFFN-SHSQHPHPLPHDYS-------- 55

Query: 1078 IVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGVDF 899
              +NPLL+D+                      RIDELD    DD S  ++EE +      
Sbjct: 56   --LNPLLDDLDSEGITTTNSNDD---------RIDELDDAVLDDGSSNNNEEILAE---- 100

Query: 898  EEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXXXS 719
            EE+ D+   N     S +Y +D + GV RR+F++RSI  W+                   
Sbjct: 101  EEDEDALQQNQNTAVSSNYFFDPVKGVIRRSFNRRSIEEWEDYVPFSWKLTSDLGFKNDD 160

Query: 718  -KIAFGSDDQPVDENVRRKLDHIRVIEDALLLKTSPLREGWANWFEKKGDFLRRDKMFKS 542
             +  FGSDD  VDE +R+KL  ++ IEDALLLK S LREGW  WF+KKGDFLRRD+MFKS
Sbjct: 161  TEPVFGSDDVLVDEKLRKKLSEVKKIEDALLLKGSVLREGWGEWFDKKGDFLRRDRMFKS 220

Query: 541  XXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGNNVKERDA 362
                        LQDPDG GVT LTRGDKI  K L++EF++ P ++K P     + E + 
Sbjct: 221  NIEILNPLNNPILQDPDGTGVTGLTRGDKIFQKGLMDEFKRTPFLIKKPL---AISESET 277

Query: 361  NVDEMKVNKDRMINQVKGVERRTLDDNGSSNSNDGAENLGSIERVIAGYENSNSHSNDER 182
             +   K N+     +V+ VER+TLD+N                                 
Sbjct: 278  GIVGEKGNE----KEVRRVERKTLDNN--------------------------------- 300

Query: 181  KLHLPNGKEPLKLNVPNGKVNVDSGPIYADGSRWGYFPGLPPHLSFTDFVTEFFRQGKC 5
            +++   G + L            +   YADG RWGY+PGL   LSF +F+  FFR+G C
Sbjct: 301  QINKVRGSKAL------------AKEYYADGKRWGYYPGLNGRLSFGNFMDAFFRRGMC 347


>ref|NP_193724.2| alpha 1,4-glycosyltransferase-like protein [Arabidopsis thaliana]
            gi|223635837|sp|P0C8Q4.1|Y4990_ARATH RecName:
            Full=Uncharacterized protein At4g19900
            gi|332658843|gb|AEE84243.1| alpha
            1,4-glycosyltransferase-like protein [Arabidopsis
            thaliana] gi|591401914|gb|AHL38684.1|
            glycosyltransferase, partial [Arabidopsis thaliana]
          Length = 644

 Score =  213 bits (541), Expect = 2e-52
 Identities = 149/425 (35%), Positives = 208/425 (48%), Gaps = 5/425 (1%)
 Frame = -3

Query: 1261 MLRNRR-RSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDI 1085
            MLR+RR RSR+G Q CA+ +A+        L++RL      S + L       S  S+D 
Sbjct: 1    MLRSRRSRSRHGAQACAVMSAVLLLASVSLLYTRLSLFSSHSPNHLR------SGSSEDT 54

Query: 1084 DAIVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGV 905
                 + L+ D                       RIDE D  D  +   VS+EE+     
Sbjct: 55   VLFPDSVLVSDSDVETTGGGGRGSTTSTED----RIDEHD--DAIEDDGVSNEED--ENQ 106

Query: 904  DFEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXX 725
            D E+E + DLN N+   S  + +DH+ GV RRAF+KRSI  W                  
Sbjct: 107  DAEQEQEVDLNRNKAASSSGFYFDHVNGVIRRAFNKRSIDEWDYDYTGFSIDSDSSGDKS 166

Query: 724  XSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKGDFLRRD 557
              + AFGSDD P+DE++RRK+  +  +EDALLLK+    SPLR+GW +WF+KKGDFLRRD
Sbjct: 167  S-RAAFGSDDVPLDESIRRKIVEVTSVEDALLLKSGKKVSPLRQGWGDWFDKKGDFLRRD 225

Query: 556  KMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGNNV 377
            +MFKS            LQDPD VG T LTRGDK++ K  LN+ ++ P + K P    +V
Sbjct: 226  RMFKSNIETLNPLNNPMLQDPDSVGNTGLTRGDKVVQKWRLNQIKRNPFMAKKPL---SV 282

Query: 376  KERDANVDEMKVNKDRMINQVKGVERRTLDDNGSSNSNDGAENLGSIERVIAGYENSNSH 197
                   +E ++     + ++K  ER+TLD N      +  +N+ S              
Sbjct: 283  VSEKKEPNEFRLLSS--VGEIKRGERKTLD-NDEKIEREEQKNVES------------ER 327

Query: 196  SNDERKLHLPNGKEPLKLNVPNGKVNVDSGPIYADGSRWGYFPGLPPHLSFTDFVTEFFR 17
             +DE   H+                       YADG++WGY+PG+ P LSF+DF+  FFR
Sbjct: 328  KHDEVTEHM-----------------------YADGTKWGYYPGIEPSLSFSDFMDSFFR 364

Query: 16   QGKCS 2
            + KCS
Sbjct: 365  KEKCS 369


>emb|CAB52870.1| putative protein [Arabidopsis thaliana] gi|7268785|emb|CAB78991.1|
            putative protein [Arabidopsis thaliana]
          Length = 1302

 Score =  213 bits (541), Expect = 2e-52
 Identities = 149/425 (35%), Positives = 208/425 (48%), Gaps = 5/425 (1%)
 Frame = -3

Query: 1261 MLRNRR-RSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDI 1085
            MLR+RR RSR+G Q CA+ +A+        L++RL      S + L       S  S+D 
Sbjct: 1    MLRSRRSRSRHGAQACAVMSAVLLLASVSLLYTRLSLFSSHSPNHLR------SGSSEDT 54

Query: 1084 DAIVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGV 905
                 + L+ D                       RIDE D  D  +   VS+EE+     
Sbjct: 55   VLFPDSVLVSDSDVETTGGGGRGSTTSTED----RIDEHD--DAIEDDGVSNEED--ENQ 106

Query: 904  DFEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXX 725
            D E+E + DLN N+   S  + +DH+ GV RRAF+KRSI  W                  
Sbjct: 107  DAEQEQEVDLNRNKAASSSGFYFDHVNGVIRRAFNKRSIDEWDYDYTGFSIDSDSSGDKS 166

Query: 724  XSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKGDFLRRD 557
              + AFGSDD P+DE++RRK+  +  +EDALLLK+    SPLR+GW +WF+KKGDFLRRD
Sbjct: 167  S-RAAFGSDDVPLDESIRRKIVEVTSVEDALLLKSGKKVSPLRQGWGDWFDKKGDFLRRD 225

Query: 556  KMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGNNV 377
            +MFKS            LQDPD VG T LTRGDK++ K  LN+ ++ P + K P    +V
Sbjct: 226  RMFKSNIETLNPLNNPMLQDPDSVGNTGLTRGDKVVQKWRLNQIKRNPFMAKKPL---SV 282

Query: 376  KERDANVDEMKVNKDRMINQVKGVERRTLDDNGSSNSNDGAENLGSIERVIAGYENSNSH 197
                   +E ++     + ++K  ER+TLD N      +  +N+ S              
Sbjct: 283  VSEKKEPNEFRLLSS--VGEIKRGERKTLD-NDEKIEREEQKNVES------------ER 327

Query: 196  SNDERKLHLPNGKEPLKLNVPNGKVNVDSGPIYADGSRWGYFPGLPPHLSFTDFVTEFFR 17
             +DE   H+                       YADG++WGY+PG+ P LSF+DF+  FFR
Sbjct: 328  KHDEVTEHM-----------------------YADGTKWGYYPGIEPSLSFSDFMDSFFR 364

Query: 16   QGKCS 2
            + KCS
Sbjct: 365  KEKCS 369


>ref|XP_007214630.1| hypothetical protein PRUPE_ppa002948mg [Prunus persica]
            gi|462410495|gb|EMJ15829.1| hypothetical protein
            PRUPE_ppa002948mg [Prunus persica]
          Length = 619

 Score =  212 bits (539), Expect = 4e-52
 Identities = 155/435 (35%), Positives = 215/435 (49%), Gaps = 16/435 (3%)
 Frame = -3

Query: 1258 LRNRR--RSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDI 1085
            +R+RR  R RYGV +CA+ +A         L++RL     S S S        + ++   
Sbjct: 5    IRSRRPHRPRYGVYICAVISAFLLLLSVSLLYTRL-----SHSESHHFHYRHQNPNAQYG 59

Query: 1084 DAIVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGV 905
            D  + NPL+ D                         D++D +D   +    DEE     V
Sbjct: 60   DVSLTNPLISD--------------ELNDGVSIATEDKIDELDDVVEETPKDEE-----V 100

Query: 904  DFEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXX 725
            + EE+  SD++ ++      YV+DH+ GV RR F+KR I  W                  
Sbjct: 101  EDEEDPQSDISQSKVS---GYVFDHVTGVIRRGFNKRKIEDWDEDYNGFTAGLGALDKS- 156

Query: 724  XSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKGDFLRRD 557
              K+AFGSDD PVD  VRR++  +  IEDALLLK     SPLREGW  WF+KKGDFLRRD
Sbjct: 157  --KVAFGSDDVPVDMEVRRRMSEVVGIEDALLLKVGRKVSPLREGWGEWFDKKGDFLRRD 214

Query: 556  KMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGNNV 377
            +MFKS            LQDPD  GVT LTRGDK++ K  LN F+K+P   K    G + 
Sbjct: 215  RMFKSNLEMLNPLHNPMLQDPDAFGVTGLTRGDKVLQKWWLNHFKKVPFTGKKQ-LGISS 273

Query: 376  KERDANVDEM--------KVNKDRMINQVKGVERRT-LDDNGSSNSNDGAENLGSIERVI 224
            + R+  + E           + D ++N V G+   T LD+N   N     ++L S     
Sbjct: 274  RAREVKLYENGGEGGKKGSSSGDGVVN-VSGIGLGTELDEN--ENDRKAGKDLNSGANGK 330

Query: 223  AGYENSNSHSNDERKLHLPNGKEPLK-LNVPNGKVNVDSGPIYADGSRWGYFPGLPPHLS 47
            +  + + S+ ++     + N  E +   +   G  +  SG IYADG RWGY+PGL P LS
Sbjct: 331  SNTDRNLSYMSNATDKEIGNTVEQISDSDQVGGFKDEFSGVIYADGKRWGYYPGLSPFLS 390

Query: 46   FTDFVTEFFRQGKCS 2
            F+DFV  FFR+GKC+
Sbjct: 391  FSDFVDTFFRKGKCN 405


>ref|XP_006285648.1| hypothetical protein CARUB_v10007104mg [Capsella rubella]
            gi|482554353|gb|EOA18546.1| hypothetical protein
            CARUB_v10007104mg [Capsella rubella]
          Length = 555

 Score =  209 bits (531), Expect = 3e-51
 Identities = 150/436 (34%), Positives = 214/436 (49%), Gaps = 16/436 (3%)
 Frame = -3

Query: 1261 MLRNRR-RSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDI 1085
            MLR+RR RSR+G Q CA  +A+        L++RL      S + L       S   +D 
Sbjct: 1    MLRSRRSRSRHGAQACAAMSAVLLLASVSFLYTRLSLFSSQSPTHLR------SGSGEDA 54

Query: 1084 DAIVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGV 905
                 + L+ D                       RIDE D  D  +   VS+EE+     
Sbjct: 55   VLFPESLLVSDSDVVETTAGGGRGSTTSTED---RIDEHD--DAIEDDGVSNEED--ENQ 107

Query: 904  DFEEESDSDLNNNRFKHSPD-----YVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXX 740
            D E+E + DLN N+   S       + +DH+ GV RRAF+KRSI  W             
Sbjct: 108  DAEQEQEVDLNRNKGSSSSSSSSSGFYFDHVNGVIRRAFNKRSIDEWDYDYAGFSIDSDL 167

Query: 739  XXXXXXSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKGD 572
                   + AFGSDD P+DE++RRK+  +  +EDALLLK+    SPLREGW +WF+KKGD
Sbjct: 168  GDKS---RAAFGSDDIPLDESIRRKVVEVSSVEDALLLKSGKKVSPLREGWGDWFDKKGD 224

Query: 571  FLRRDKMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPF 392
            FLRRD+MFKS            LQDPDGVG+T LTRGDK++ K  LN+ ++ P + K P 
Sbjct: 225  FLRRDRMFKSNIETLNPLNNPMLQDPDGVGITGLTRGDKVVQKWRLNQIKRNPFMAKKPL 284

Query: 391  FGNNVKERDANVDE------MKVNKDRMINQVKGVERRTLDDNGSSNSNDGAENLGSIER 230
               + K+  + V E      ++ + D    ++K  ER+TLDD+         + + + ++
Sbjct: 285  SVVSEKKEPSEVRESGERIRLQNSVDERNGEIKRGERKTLDDD---------KKIETKKQ 335

Query: 229  VIAGYENSNSHSNDERKLHLPNGKEPLKLNVPNGKVNVDSGPIYADGSRWGYFPGLPPHL 50
             I   E                           GK++  +  +YADG++WGY+PG+   L
Sbjct: 336  SIVESE---------------------------GKLDEVTEHMYADGTKWGYYPGIELSL 368

Query: 49   SFTDFVTEFFRQGKCS 2
            SF+DF+  FFR+ KCS
Sbjct: 369  SFSDFMDSFFRKEKCS 384


>ref|XP_006413925.1| hypothetical protein EUTSA_v10024627mg [Eutrema salsugineum]
            gi|557115095|gb|ESQ55378.1| hypothetical protein
            EUTSA_v10024627mg [Eutrema salsugineum]
          Length = 661

 Score =  208 bits (530), Expect = 4e-51
 Identities = 155/440 (35%), Positives = 212/440 (48%), Gaps = 20/440 (4%)
 Frame = -3

Query: 1261 MLRNRR-RSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDI 1085
            MLR+RR RSR+G Q CA+ +A+        L++RL      S + L       S   +D 
Sbjct: 1    MLRSRRSRSRHGAQACAVMSAVLLLASVSLLYTRLSLFSSHSPNHLR------SGSGEDA 54

Query: 1084 DAIVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELD-VIDTDDQSKVSDEEEILRG 908
                 + L+ D                       RIDE D  I+ D    VS+EE+  + 
Sbjct: 55   VLFPDSLLVSDSDVVETAGGRGSSNED-------RIDEHDDAIEDDRNDGVSNEEDENQD 107

Query: 907  VDFEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXX 728
             + E+E D D N      S  + +DH+ GV RRAF+KRSI  W                 
Sbjct: 108  AEQEQEVDPDRNK---ASSSGFYFDHVNGVIRRAFNKRSIDEWDYDYAGFSIGSGIGNDD 164

Query: 727  XXS---KIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKGDF 569
                  K AFGSDD P+DE++RRK+  +  +EDALLLK+    SPLREGW +WF+KKGDF
Sbjct: 165  SFGEKSKAAFGSDDVPLDESIRRKIVEVSSVEDALLLKSGRMVSPLREGWGDWFDKKGDF 224

Query: 568  LRRDKMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFF 389
            LRRD+MFKS            LQDPDGVG+T LTRGDK + K  L+E ++ P +VK P  
Sbjct: 225  LRRDRMFKSNIETLNPLNIPMLQDPDGVGITGLTRGDKAVQKWRLSEIKRNPFMVKKPL- 283

Query: 388  GNNVKERDANVDEMKVNKDRMIN-----------QVKGVERRTLDDNGSSNSNDGAENLG 242
             +  ++R+ N         R+ N           ++K  ER+TLD++  + + +      
Sbjct: 284  -SVAEKREPNEFRESRKGIRLQNSVDESGEVRNGEIKRGERKTLDNDSKAETKEE----- 337

Query: 241  SIERVIAGYENSNSHSNDERKLHLPNGKEPLKLNVPNGKVNVDSGPIYADGSRWGYFPGL 62
              E V   +EN      DE   H+                       YADG+RWGY+P L
Sbjct: 338  --ENVEFDWEN------DEFTEHM-----------------------YADGTRWGYYPRL 366

Query: 61   PPHLSFTDFVTEFFRQGKCS 2
             P LSF+DF+  FFR+ KCS
Sbjct: 367  EPGLSFSDFMDSFFRKEKCS 386


>ref|XP_004158677.1| PREDICTED: uncharacterized protein At4g19900-like isoform 2 [Cucumis
            sativus]
          Length = 537

 Score =  206 bits (524), Expect = 2e-50
 Identities = 157/455 (34%), Positives = 209/455 (45%), Gaps = 36/455 (7%)
 Frame = -3

Query: 1261 MLRN---RRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSL---ERTGIDFSQ 1100
            MLRN   RRR  YG   CA  AA+        L++RL   +  + S     +  G     
Sbjct: 1    MLRNLHTRRRGSYGACFCAFAAALLLLFSVSLLYTRLSRSQSHTHSPHMYPKSLGNILVS 60

Query: 1099 DSDDIDAIVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEE 920
            DSDD   IV+     D                      D+IDELD +D D QS+ S +E+
Sbjct: 61   DSDDDSDIVLGTTTTD---------------------EDKIDELDFVDEDLQSRASGDED 99

Query: 919  ILRGVDFEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFD-KRSIHPWQXXXXXXXXXXX 743
            +  G D E++SD             + +DH+ G  R+ FD KRSI  W            
Sbjct: 100  L--GED-EDQSDQ-------VRVSGFYFDHVSGAIRKVFDNKRSIEDWSDDTSGFPIGLG 149

Query: 742  XXXXXXXSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKG 575
                    K AFGSDD PVDE VRRK   +  IEDALLLK     SPLR+GW +WF+KKG
Sbjct: 150  EVDRS---KSAFGSDDVPVDEEVRRKASEMTGIEDALLLKVGGRVSPLRDGWGDWFDKKG 206

Query: 574  DFLRRDKMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNP 395
            DFLRRD+MFKS            LQDPDG+GV +LTRGD+I+ K  +NEF++ P +V  P
Sbjct: 207  DFLRRDRMFKSNWEVLNPLNNPLLQDPDGLGVASLTRGDRIVQKWWINEFKRAPFLVNKP 266

Query: 394  F------FGNNVKERDANVDEMKV-------------NKDRMINQVKGVERRT---LDDN 281
                   F   V+    +    K              N  + +N++   + RT   L   
Sbjct: 267  LGVTRKVFNTEVENGSMHASIKKSGSLSGQTDINFMDNGKKTVNEIGTSDERTRNNLSRK 326

Query: 280  GSSNSNDGAENLGSIERVIAGYENSNSHSNDERKLHLPNGKEPLKLNVPNGK---VNVDS 110
               N ++ + +  S  R        N  S + R      G +P+       K   V    
Sbjct: 327  KVINFDEDSSSRFSGYRTSISRSTKNEKSGERRTEKADVGDKPVLTKGAGFKPKAVPHTL 386

Query: 109  GPIYADGSRWGYFPGLPPHLSFTDFVTEFFRQGKC 5
              +YADG RWGY+PGL PHLSF+ F+  FF++ KC
Sbjct: 387  TSVYADGKRWGYYPGLHPHLSFSRFMDAFFKKNKC 421


>ref|XP_004134884.1| PREDICTED: uncharacterized protein At4g19900-like [Cucumis sativus]
          Length = 634

 Score =  206 bits (524), Expect = 2e-50
 Identities = 157/455 (34%), Positives = 209/455 (45%), Gaps = 36/455 (7%)
 Frame = -3

Query: 1261 MLRN---RRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSL---ERTGIDFSQ 1100
            MLRN   RRR  YG   CA  AA+        L++RL   +  + S     +  G     
Sbjct: 1    MLRNLHTRRRGSYGACFCAFAAALLLLFSVSLLYTRLSRSQSHTHSPHMYPKSLGNILVS 60

Query: 1099 DSDDIDAIVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEE 920
            DSDD   IV+     D                      D+IDELD +D D QS+ S +E+
Sbjct: 61   DSDDDSDIVLGTTTTD---------------------EDKIDELDFVDEDLQSRASGDED 99

Query: 919  ILRGVDFEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFD-KRSIHPWQXXXXXXXXXXX 743
            +  G D E++SD             + +DH+ G  R+ FD KRSI  W            
Sbjct: 100  L--GED-EDQSDQ-------VRVSGFYFDHVSGAIRKVFDNKRSIEDWSDDTSGFPIGLG 149

Query: 742  XXXXXXXSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKG 575
                    K AFGSDD PVDE VRRK   +  IEDALLLK     SPLR+GW +WF+KKG
Sbjct: 150  EVDRS---KSAFGSDDVPVDEEVRRKASEMTGIEDALLLKVGGRVSPLRDGWGDWFDKKG 206

Query: 574  DFLRRDKMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNP 395
            DFLRRD+MFKS            LQDPDG+GV +LTRGD+I+ K  +NEF++ P +V  P
Sbjct: 207  DFLRRDRMFKSNWEVLNPLNNPLLQDPDGLGVASLTRGDRIVQKWWINEFKRAPFLVNKP 266

Query: 394  F------FGNNVKERDANVDEMKV-------------NKDRMINQVKGVERRT---LDDN 281
                   F   V+    +    K              N  + +N++   + RT   L   
Sbjct: 267  LGVTRKVFNTEVENGSMHASIKKSGSLSGQTDINFMDNGKKTVNEIGTSDERTRNNLSRK 326

Query: 280  GSSNSNDGAENLGSIERVIAGYENSNSHSNDERKLHLPNGKEPLKLNVPNGK---VNVDS 110
               N ++ + +  S  R        N  S + R      G +P+       K   V    
Sbjct: 327  KVINFDEDSSSRFSGYRTSISRSTKNEKSGERRTEKADVGDKPVLTKGAGFKPKAVPHTL 386

Query: 109  GPIYADGSRWGYFPGLPPHLSFTDFVTEFFRQGKC 5
              +YADG RWGY+PGL PHLSF+ F+  FF++ KC
Sbjct: 387  TSVYADGKRWGYYPGLHPHLSFSRFMDAFFKKNKC 421


>ref|XP_004158676.1| PREDICTED: uncharacterized protein At4g19900-like isoform 1 [Cucumis
            sativus]
          Length = 631

 Score =  201 bits (510), Expect = 9e-49
 Identities = 156/430 (36%), Positives = 203/430 (47%), Gaps = 11/430 (2%)
 Frame = -3

Query: 1261 MLRN---RRRSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSL---ERTGIDFSQ 1100
            MLRN   RRR  YG   CA  AA+        L++RL   +  + S     +  G     
Sbjct: 1    MLRNLHTRRRGSYGACFCAFAAALLLLFSVSLLYTRLSRSQSHTHSPHMYPKSLGNILVS 60

Query: 1099 DSDDIDAIVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEE 920
            DSDD   IV+     D                      D+IDELD +D D QS+ S +E+
Sbjct: 61   DSDDDSDIVLGTTTTD---------------------EDKIDELDFVDEDLQSRASGDED 99

Query: 919  ILRGVDFEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFD-KRSIHPWQXXXXXXXXXXX 743
            +  G D E++SD             + +DH+ G  R+ FD KRSI  W            
Sbjct: 100  L--GED-EDQSDQ-------VRVSGFYFDHVSGAIRKVFDNKRSIEDWSDDTSGFPIGLG 149

Query: 742  XXXXXXXSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKG 575
                    K AFGSDD PVDE VRRK   +  IEDALLLK     SPLR+GW +WF+KKG
Sbjct: 150  EVDRS---KSAFGSDDVPVDEEVRRKASEMTGIEDALLLKVGGRVSPLRDGWGDWFDKKG 206

Query: 574  DFLRRDKMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNP 395
            DFLRRD+MFKS            LQDPDG+GV +LTRGD+I+ K  +NEF++ P +V  P
Sbjct: 207  DFLRRDRMFKSNWEVLNPLNNPLLQDPDGLGVASLTRGDRIVQKWWINEFKRAPFLVNKP 266

Query: 394  FFGNNVKERDANVDEMKVNKDRMINQVKGVERRTLDDNGSSNSNDGAENLGSIERVIAGY 215
                  ++R+ N    + +  R     K  ERRT                   E+   G 
Sbjct: 267  L--GVTRKREPN--GYRTSISRSTKNEKSGERRT-------------------EKADVG- 302

Query: 214  ENSNSHSNDERKLHLPNGKEPLKLNVPNGKVNVDSGPIYADGSRWGYFPGLPPHLSFTDF 35
                    D+  L    G +P    VP+   +V     YADG RWGY+PGL PHLSF+ F
Sbjct: 303  --------DKPVLTKGAGFKPKA--VPHTLTSV-----YADGKRWGYYPGLHPHLSFSRF 347

Query: 34   VTEFFRQGKC 5
            +  FF++ KC
Sbjct: 348  MDAFFKKNKC 357


>ref|XP_004293757.1| PREDICTED: uncharacterized protein At4g19900-like [Fragaria vesca
            subsp. vesca]
          Length = 627

 Score =  192 bits (488), Expect = 3e-46
 Identities = 140/422 (33%), Positives = 196/422 (46%), Gaps = 6/422 (1%)
 Frame = -3

Query: 1258 LRNRR--RSRYGVQLCAITAAIXXXXXXXXLHSRLGFDRQSSSSSLERTGIDFSQDSDDI 1085
            +R+RR  R RYGV +CA+ +A+        L++RL   +    +    T ++     DD+
Sbjct: 5    IRSRRPHRPRYGVYICAVISALLLLLSVSLLYTRLSHSQSHHLNYRYPTPLN-----DDV 59

Query: 1084 DAIVVNPLLEDIQXXXXXXXXXXXXXXXXXXXXDRIDELDVIDTDDQSKVSDEEEILRGV 905
               + NPL+ D                      D+ID LD +  DD  +   ++E+    
Sbjct: 60   S--LSNPLISD-------------DAAATVTADDKIDVLDEVVADDAPR---DDEV---- 97

Query: 904  DFEEESDSDLNNNRFKHSPDYVWDHIVGVTRRAFDKRSIHPWQXXXXXXXXXXXXXXXXX 725
               E+ D          S  + +DH+ GV RR  +KR I  W                  
Sbjct: 98   ---EDDDDPQPGQGGGSSAGFFFDHVAGVIRRGLNKRKIEDWDEDYSGFSVGLSVVDKSV 154

Query: 724  XSKIAFGSDDQPVDENVRRKLDHIRVIEDALLLKT----SPLREGWANWFEKKGDFLRRD 557
               +AFGSDD PVD  VRR++  +  +EDAL++K     SPLREGW  WF+KK DFLRRD
Sbjct: 155  ---VAFGSDDVPVDMEVRRRMTEVAGVEDALMVKVGKRGSPLREGWGEWFDKKSDFLRRD 211

Query: 556  KMFKSXXXXXXXXXXXXLQDPDGVGVTTLTRGDKIMHKALLNEFRKLPVVVKNPFFGNNV 377
            KMFKS            LQDPDGVGV+ LTRGDK + K  L+ F+K+P       F +  
Sbjct: 212  KMFKSNLELLNPLHNPMLQDPDGVGVSGLTRGDKAVQKWWLSHFKKVP-------FRSRK 264

Query: 376  KERDANVDEMKVNKDRMINQVKGVERRTLDDNGSSNSNDGAENLGSIERVIAGYENSNSH 197
            KE  +    + V     +++V+  ER+ LD++G           G +E  + G     S 
Sbjct: 265  KENASGSGGVGVE----VSEVERAERKALDESGG----------GKVEVAVGGTVGQISE 310

Query: 196  SNDERKLHLPNGKEPLKLNVPNGKVNVDSGPIYADGSRWGYFPGLPPHLSFTDFVTEFFR 17
            S                        N  SG +YADG RWG++PGL PHLSF DF+ EFF 
Sbjct: 311  SVQ----------------------NEFSGLVYADGKRWGFYPGLHPHLSFPDFMEEFFS 348

Query: 16   QG 11
            +G
Sbjct: 349  KG 350

