BLASTX nr result

ID: Akebia24_contig00009997 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00009997
         (1860 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXC01433.1| hypothetical protein L484_022004 [Morus notabilis]     668   0.0  
ref|XP_004153209.1| PREDICTED: uncharacterized protein LOC101204...   667   0.0  
ref|XP_002525649.1| conserved hypothetical protein [Ricinus comm...   667   0.0  
ref|XP_004145318.1| PREDICTED: uncharacterized protein LOC101204...   667   0.0  
ref|XP_004167091.1| PREDICTED: uncharacterized protein LOC101228...   666   0.0  
ref|XP_002299807.2| hypothetical protein POPTR_0001s25900g [Popu...   662   0.0  
ref|XP_007016099.1| F10K1.7 protein [Theobroma cacao] gi|5087864...   660   0.0  
ref|XP_002892388.1| hypothetical protein ARALYDRAFT_887933 [Arab...   659   0.0  
ref|NP_172202.1| uncharacterized protein [Arabidopsis thaliana] ...   652   0.0  
dbj|BAD94602.1| hypothetical protein [Arabidopsis thaliana]           650   0.0  
ref|XP_006307276.1| hypothetical protein CARUB_v10008891mg [Caps...   648   0.0  
ref|XP_003632132.1| PREDICTED: protein O-glucosyltransferase 1-l...   642   0.0  
emb|CAN70836.1| hypothetical protein VITISV_015872 [Vitis vinifera]   640   0.0  
ref|XP_004242062.1| PREDICTED: uncharacterized protein LOC101246...   634   e-179
ref|XP_006350920.1| PREDICTED: uncharacterized protein LOC102588...   631   e-178
ref|XP_004500267.1| PREDICTED: uncharacterized protein LOC101501...   624   e-176
ref|XP_003552954.1| PREDICTED: O-glucosyltransferase rumi-like [...   618   e-174
ref|XP_006488529.1| PREDICTED: uncharacterized protein LOC102623...   615   e-173
ref|XP_006848230.1| hypothetical protein AMTR_s00029p00246540 [A...   615   e-173
ref|XP_003539326.2| PREDICTED: O-glucosyltransferase rumi-like [...   610   e-172

>gb|EXC01433.1| hypothetical protein L484_022004 [Morus notabilis]
          Length = 558

 Score =  668 bits (1724), Expect = 0.0
 Identities = 307/463 (66%), Positives = 371/463 (80%), Gaps = 5/463 (1%)
 Frame = -3

Query: 1654 KVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKIIQCSYLKCRRSSPEPDPS 1475
            +VD   +Q KT+AG NLDPTPWHLFPPK F+  ++++R  KI+ CSYL C  S+ + +PS
Sbjct: 95   QVDDIAAQTKTVAGDNLDPTPWHLFPPKTFSGETRHSRLYKILHCSYLACSHSAYKYNPS 154

Query: 1474 HHSRNT-----RSKCPSFFHWIHRDLDPWTQSHISLSNLLEAQKFAAFRVVIVGGRLYAD 1310
               R +       KCP FF WIH+DL+PW ++ IS  +L EA++FAAFR VIVGGRL+ D
Sbjct: 155  VKRRRSDPDSAARKCPEFFRWIHQDLEPWARTGISAGHLEEAREFAAFRAVIVGGRLFVD 214

Query: 1309 FYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVNKTEYGSSTIWPPPPLF 1130
             YY CVQSR MFTIWGLLQLL+RYPGMVPDVD++FDCMD+P +N TE+GS     P PLF
Sbjct: 215  LYYACVQSRTMFTIWGLLQLLRRYPGMVPDVDMVFDCMDKPSINGTEHGSF----PLPLF 270

Query: 1129 RYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALSWTEKWLLAYWKGNPDV 950
            RYCTT  HFDIPFPDWSFWGWPE N+ PWDEEFR IK+GS+  SWT+K   AYWKGNPDV
Sbjct: 271  RYCTTQAHFDIPFPDWSFWGWPETNLNPWDEEFRDIKRGSERTSWTKKHPRAYWKGNPDV 330

Query: 949  DSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCKHRYKIYAEGYAWSVSL 770
            DSPVR +LL CNHSR WGAQI RQ+W EEA+GGYE+S+L+NQC +RYKIYAEGYAWSVSL
Sbjct: 331  DSPVRTELLNCNHSRTWGAQIWRQDWTEEAKGGYEKSRLSNQCNNRYKIYAEGYAWSVSL 390

Query: 769  KYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSIKFAVDWGNKNPSEAEA 590
            KYILSCGSLALIISPQYEDFF R L+P +NYWP+S +TDLC SIK+ V+WGN +PSEA+A
Sbjct: 391  KYILSCGSLALIISPQYEDFFIRGLIPMKNYWPIS-STDLCPSIKYGVEWGNAHPSEAKA 449

Query: 589  IGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQEVCMESLLCFTDPKQRH 410
            IGKGGQ+FME LSM RVY+YM+HLINEYSKLQ FKP  PSSA EVC ESLLC  D KQR 
Sbjct: 450  IGKGGQEFMESLSMNRVYDYMFHLINEYSKLQTFKPVRPSSALEVCPESLLCHADSKQRK 509

Query: 409  FLERSTASLSSSHPCTFPPGDRDVVKSWIQRKKKIITNVQELE 281
             LE+STA  S + PC+  P D D++KSW+Q+++K I ++++++
Sbjct: 510  LLEKSTAHPSPNPPCSLQPPDSDIIKSWVQQRRKTIKDIEDMK 552


>ref|XP_004153209.1| PREDICTED: uncharacterized protein LOC101204904 [Cucumis sativus]
          Length = 472

 Score =  667 bits (1721), Expect = 0.0
 Identities = 310/465 (66%), Positives = 366/465 (78%), Gaps = 1/465 (0%)
 Frame = -3

Query: 1705 PFVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKII 1526
            P VV + F S T  L +KVD F +Q KT+AGHNLDPTPWHLFPPK F+D +++ RA KII
Sbjct: 13   PSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSDETRHARAVKII 72

Query: 1525 QCSYLKCRRSSPEPDP-SHHSRNTRSKCPSFFHWIHRDLDPWTQSHISLSNLLEAQKFAA 1349
             CSYL CR ++        HS  +  KCP FF WIH DLDPW ++ IS++ L E+QKFAA
Sbjct: 73   HCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISMTQLEESQKFAA 132

Query: 1348 FRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVNKTE 1169
            FRVVIV GRLY D YY CVQSRA+FTIWGL+Q+L+RYPGMVPDVD+MFDCMD+P +N+TE
Sbjct: 133  FRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE 192

Query: 1168 YGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALSWTE 989
              +     P PLFRYCTT  HFDIPFPDWSFWGWPEVN+  W EEF  IK+GS+ LSW  
Sbjct: 193  NKAM----PLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFN 248

Query: 988  KWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCKHRY 809
            K+  AYWKGNPDVDSP R +LL+CNHSR+WGAQIMRQ+W +EAR GYEQSKL+NQC HRY
Sbjct: 249  KFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRY 308

Query: 808  KIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSIKFA 629
            KIYAEG+AWSVSLKYILSCGS++LIISPQYEDFFSR L P +NYWP+ P T++C SIK A
Sbjct: 309  KIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPI-PFTNMCESIKHA 367

Query: 628  VDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQEVCM 449
            VDWGN +  EAE IG+ GQKFME+LSM+ VY YM+HLI EYSKLQDFKP+PP SA EVC 
Sbjct: 368  VDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPPSALEVCT 427

Query: 448  ESLLCFTDPKQRHFLERSTASLSSSHPCTFPPGDRDVVKSWIQRK 314
            +SLLC  D KQ  FLE+S AS+SS  PC+   G  D++ SW+Q+K
Sbjct: 428  DSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 472


>ref|XP_002525649.1| conserved hypothetical protein [Ricinus communis]
            gi|223535085|gb|EEF36767.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 491

 Score =  667 bits (1721), Expect = 0.0
 Identities = 309/478 (64%), Positives = 374/478 (78%), Gaps = 2/478 (0%)
 Frame = -3

Query: 1702 FVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKIIQ 1523
            F  LL   S T    ++VD F S+ KT+AGHNLDPTPWH+FPP+ F++ ++  RA KIIQ
Sbjct: 17   FPCLLGLVSLTLLFFYQVDNFASRTKTVAGHNLDPTPWHIFPPRTFDEETRQARAYKIIQ 76

Query: 1522 CSYLKC--RRSSPEPDPSHHSRNTRSKCPSFFHWIHRDLDPWTQSHISLSNLLEAQKFAA 1349
            CSYL C    ++     S  S    +KCP FF +IH DL PW ++ I+  ++ EA+KFAA
Sbjct: 77   CSYLTCPYTNTTTTRRRSQSSSQANAKCPEFFRFIHHDLQPWARTGITKKHIAEAKKFAA 136

Query: 1348 FRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVNKTE 1169
            FRVVI  GRLY D YY CVQSR MFT+WGLLQLL RYPGMVPDVD+MFDCMDRPV+NKTE
Sbjct: 137  FRVVIFEGRLYLDLYYACVQSRMMFTVWGLLQLLNRYPGMVPDVDIMFDCMDRPVINKTE 196

Query: 1168 YGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALSWTE 989
            + S     P P+FRYCTT  HFDIPFPDWSFWGWPE+NI  W+EEFR IK+GSQ+ SW++
Sbjct: 197  HISF----PLPIFRYCTTQNHFDIPFPDWSFWGWPEINIRSWNEEFRDIKRGSQSKSWSK 252

Query: 988  KWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCKHRY 809
            KW  AYWKGNPDV SP+R +L++CNHSR WGA IMRQ+W EEAR G+E+SKL+NQC +RY
Sbjct: 253  KWPRAYWKGNPDVLSPIRTELMQCNHSRKWGAHIMRQDWGEEARAGFERSKLSNQCNYRY 312

Query: 808  KIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSIKFA 629
            KIYAEG+AWSVSLKYI+SCGSLALIISPQYEDFFSR L+P  NYWPV+ + +LCRSIKFA
Sbjct: 313  KIYAEGFAWSVSLKYIISCGSLALIISPQYEDFFSRGLVPASNYWPVA-SDELCRSIKFA 371

Query: 628  VDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQEVCM 449
            VDWGN NPSEAE+IGK GQ FME LSME VY+YM+HLI EYSKLQ FKP  PSSA EVC 
Sbjct: 372  VDWGNANPSEAESIGKAGQDFMETLSMEGVYDYMFHLITEYSKLQVFKPVLPSSALEVCA 431

Query: 448  ESLLCFTDPKQRHFLERSTASLSSSHPCTFPPGDRDVVKSWIQRKKKIITNVQELEKM 275
            +SLLCF DPKQ+ FLERS A  S    C+  P D + +KSW+Q K++++ +V++++K+
Sbjct: 432  DSLLCFADPKQKQFLERSAAFPSPKPACSLQPADGNAIKSWLQEKQRVMEDVRKMKKV 489


>ref|XP_004145318.1| PREDICTED: uncharacterized protein LOC101204476 [Cucumis sativus]
          Length = 472

 Score =  667 bits (1720), Expect = 0.0
 Identities = 310/465 (66%), Positives = 365/465 (78%), Gaps = 1/465 (0%)
 Frame = -3

Query: 1705 PFVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKII 1526
            P VV + F S T  L +KVD F +Q KT+AGHNLDPTPWHLFPPK F+D +++ RA KII
Sbjct: 13   PSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSDETRHARAVKII 72

Query: 1525 QCSYLKCRRSSPEPDP-SHHSRNTRSKCPSFFHWIHRDLDPWTQSHISLSNLLEAQKFAA 1349
             CSYL CR ++        HS  +  KCP FF WIH DLDPW ++ IS++ L E+QKFAA
Sbjct: 73   HCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISMTQLEESQKFAA 132

Query: 1348 FRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVNKTE 1169
            FRVVIV GRLY D YY CVQSRA+FTIWGL+Q+L+RYPGMVPDVD+MFDCMD+P +N+TE
Sbjct: 133  FRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE 192

Query: 1168 YGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALSWTE 989
              +     P PLFRYCTT  HFDIPFPDWSFWGWPEVN+  W EEF  IK+GS+ LSW  
Sbjct: 193  NKAM----PLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFN 248

Query: 988  KWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCKHRY 809
            K+  AYWKGNPDVDSP R +LL+CNHSR+WGAQIMRQ+W +EAR GYEQSKL+NQC HRY
Sbjct: 249  KFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRY 308

Query: 808  KIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSIKFA 629
            KIYAEG+AWSVSLKYILSCGS++LIISPQYEDFFSR L P +NYWP+ P T++C SIK A
Sbjct: 309  KIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPI-PFTNMCESIKHA 367

Query: 628  VDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQEVCM 449
            VDWGN +  EAE IG+ GQKFME LSM+ VY YM+HLI EYSKLQDFKP+PP SA EVC 
Sbjct: 368  VDWGNTHFPEAETIGRQGQKFMESLSMDTVYSYMFHLITEYSKLQDFKPTPPPSALEVCT 427

Query: 448  ESLLCFTDPKQRHFLERSTASLSSSHPCTFPPGDRDVVKSWIQRK 314
            +SLLC  D KQ  FLE+S AS+SS  PC+   G  D++ SW+Q+K
Sbjct: 428  DSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 472


>ref|XP_004167091.1| PREDICTED: uncharacterized protein LOC101228589 [Cucumis sativus]
          Length = 472

 Score =  666 bits (1718), Expect = 0.0
 Identities = 309/465 (66%), Positives = 366/465 (78%), Gaps = 1/465 (0%)
 Frame = -3

Query: 1705 PFVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKII 1526
            P VV + F S T  L +KVD F +Q KT+AGHNLDPTPWHLFPPK F+D +++ RA KII
Sbjct: 13   PSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSDETRHARAVKII 72

Query: 1525 QCSYLKCRRSSPEPDP-SHHSRNTRSKCPSFFHWIHRDLDPWTQSHISLSNLLEAQKFAA 1349
             CSYL CR ++        HS  +  KCP FF WIH DLDPW ++ IS++ L E+QKFAA
Sbjct: 73   HCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISMTQLEESQKFAA 132

Query: 1348 FRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVNKTE 1169
            FRVVIV GRLY D YY CVQSRA+FTIWGL+Q+L+RYPGMVPDVD+MFDCMD+P +N+TE
Sbjct: 133  FRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE 192

Query: 1168 YGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALSWTE 989
              +     P PLFRYCTT  HFDIPFPDWSFWGWPEVN+  W EEF  IK+GS+ LSW  
Sbjct: 193  NKAM----PLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFN 248

Query: 988  KWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCKHRY 809
            K+  AYWKGNPDVDSP R +LL+CNHSR+WGAQIMRQ+W +EA+ GYEQSKL+NQC HRY
Sbjct: 249  KFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKLSNQCNHRY 308

Query: 808  KIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSIKFA 629
            KIYAEG+AWSVSLKYILSCGS++LIISPQYEDFFSR L P +NYWP+ P T++C SIK A
Sbjct: 309  KIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPI-PFTNMCESIKHA 367

Query: 628  VDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQEVCM 449
            VDWGN +  EAE IG+ GQKFME+LSM+ VY YM+HLI EYSKLQDFKP+PP SA EVC 
Sbjct: 368  VDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPPSALEVCT 427

Query: 448  ESLLCFTDPKQRHFLERSTASLSSSHPCTFPPGDRDVVKSWIQRK 314
            +SLLC  D KQ  FLE+S AS+SS  PC+   G  D++ SW+Q+K
Sbjct: 428  DSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 472


>ref|XP_002299807.2| hypothetical protein POPTR_0001s25900g [Populus trichocarpa]
            gi|550348193|gb|EEE84612.2| hypothetical protein
            POPTR_0001s25900g [Populus trichocarpa]
          Length = 462

 Score =  662 bits (1707), Expect = 0.0
 Identities = 305/460 (66%), Positives = 366/460 (79%), Gaps = 2/460 (0%)
 Frame = -3

Query: 1654 KVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKIIQCSYLKCRRSSPEPDPS 1475
            +VD    Q KT+AGHNL PTPWHLFPPK F+D S++ RA +I+ CSYL C  S+      
Sbjct: 5    QVDNLALQTKTVAGHNLPPTPWHLFPPKNFDDQSRHARAYQILHCSYLTCPYSNTTVSKG 64

Query: 1474 H--HSRNTRSKCPSFFHWIHRDLDPWTQSHISLSNLLEAQKFAAFRVVIVGGRLYADFYY 1301
            H  +S ++  KCP  F +IH DL+PW QS I++ +++ A+ +A+FRVVI  GRLY D YY
Sbjct: 65   HGFNSPSSSPKCPRLFMFIHHDLEPWAQSRITVDHIMGAKNYASFRVVIYKGRLYLDPYY 124

Query: 1300 DCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVNKTEYGSSTIWPPPPLFRYC 1121
             CVQSR MFTIWG LQLL+RYPGMVPDVD+MFDCMD+P +NKTE+ S     P PLFRYC
Sbjct: 125  ACVQSRMMFTIWGFLQLLKRYPGMVPDVDIMFDCMDKPSINKTEHDSF----PLPLFRYC 180

Query: 1120 TTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALSWTEKWLLAYWKGNPDVDSP 941
            TT  HFDIPFPDWSFWGWPEVNI PWDEEFR IK+G+QA SW +KW  AYWKGNPDV SP
Sbjct: 181  TTKDHFDIPFPDWSFWGWPEVNIRPWDEEFRDIKRGAQARSWPKKWPRAYWKGNPDVGSP 240

Query: 940  VRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCKHRYKIYAEGYAWSVSLKYI 761
            +R  LL CNH++ WGAQIMRQ+W EEA+GGY  SKL++QC +RYKIYAEG+AWSVSLKYI
Sbjct: 241  IRTSLLECNHTKKWGAQIMRQDWEEEAKGGYVSSKLSHQCDYRYKIYAEGFAWSVSLKYI 300

Query: 760  LSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSIKFAVDWGNKNPSEAEAIGK 581
            +SCGSLALIISPQYEDFFSR L+P +NYWPVS +  LC+SIKFAVDWGN NP+EA+ IGK
Sbjct: 301  ISCGSLALIISPQYEDFFSRGLIPEKNYWPVS-SDGLCQSIKFAVDWGNTNPTEAQKIGK 359

Query: 580  GGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQEVCMESLLCFTDPKQRHFLE 401
             GQ  ME LSM+RVY+YM+HLI+EYSKLQDFKP PPSSA EVC++SL CF D KQ+ F E
Sbjct: 360  AGQDLMESLSMDRVYDYMFHLISEYSKLQDFKPVPPSSALEVCVDSLTCFADEKQKRFFE 419

Query: 400  RSTASLSSSHPCTFPPGDRDVVKSWIQRKKKIITNVQELE 281
            R+TA  S S PCT  P + D +KSW+Q+K++ ITNV+E+E
Sbjct: 420  RATAFPSPSPPCTLQPANSDFIKSWMQQKQRTITNVREME 459


>ref|XP_007016099.1| F10K1.7 protein [Theobroma cacao] gi|508786462|gb|EOY33718.1| F10K1.7
            protein [Theobroma cacao]
          Length = 508

 Score =  660 bits (1703), Expect = 0.0
 Identities = 314/494 (63%), Positives = 383/494 (77%), Gaps = 11/494 (2%)
 Frame = -3

Query: 1699 VVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKIIQC 1520
            VV L+F S  A L++KVD F SQ KT+AGHNL+PTPWH+FP K+F   ++  RA KIIQC
Sbjct: 21   VVALSFLSLAALLIYKVDDFASQTKTVAGHNLEPTPWHIFPAKKFTGETRQARAYKIIQC 80

Query: 1519 SYLKCR-------RSSPEPDPSHH---SRNTRSKCPSFFHWIHRDLDPWTQSHISLSNLL 1370
            SYL CR       R S E         S     KCP+FF +I+RDL+PW ++ IS+++ +
Sbjct: 81   SYLTCRHATNDAARLSEEQKKQRRRFMSSQVSEKCPNFFKFIYRDLEPWAKTRISINHTM 140

Query: 1369 EAQKFAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDR 1190
            +A++ AA RVVIV GRLY D YY CVQSR MFTIWGLLQLL+RYPGMVPDVD+MFDCMD+
Sbjct: 141  QAKQHAALRVVIVEGRLYVDLYYACVQSRLMFTIWGLLQLLKRYPGMVPDVDMMFDCMDK 200

Query: 1189 PVVNKTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGS 1010
            P +++ E+GS     P PLFRYCTT  HFDIPFPDWSFWGWPE NI PWD++F+ IKQGS
Sbjct: 201  PTIDRIEHGSF----PLPLFRYCTTESHFDIPFPDWSFWGWPETNIQPWDKQFKDIKQGS 256

Query: 1009 QALSWTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLA 830
            QA +WT K   A+WKGNPDV++P+R +L++CNHSR WGAQI+RQNW EEA+GG+ QSKL+
Sbjct: 257  QAENWTRKLPWAFWKGNPDVEAPIRQELMQCNHSRQWGAQIIRQNWAEEAKGGFAQSKLS 316

Query: 829  NQCKHRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDL 650
            NQCKHRYKIYAEGYAWSVSLKYILSCGSLAL+ISPQYED F+R L+P+ NYWPVSP  DL
Sbjct: 317  NQCKHRYKIYAEGYAWSVSLKYILSCGSLALLISPQYEDIFTRGLIPKLNYWPVSP-VDL 375

Query: 649  CRSIKFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPS 470
            C SIKFAVDWGN NPSEAEAIGK GQ+ ME LSM++VY+YM+HLI+EYSKLQDFKP PPS
Sbjct: 376  CHSIKFAVDWGNTNPSEAEAIGKRGQQLMESLSMDQVYDYMFHLISEYSKLQDFKPVPPS 435

Query: 469  SAQEVCMESLLCFTDPKQRHFLERSTASLSSSHPCTF-PPGDRDVVKSWIQRKKKIITNV 293
            SAQEVC ESLLC  +PKQ+ +L+R+ A  S + PC+   P + +      + KKK+I +V
Sbjct: 436  SAQEVCEESLLCLAEPKQKEYLKRAAAVGSPTPPCSLAKPPNSNFFNILTEHKKKLIQHV 495

Query: 292  QELEKM*ARIALYH 251
            +++E    R AL H
Sbjct: 496  KDME---MRNALRH 506


>ref|XP_002892388.1| hypothetical protein ARALYDRAFT_887933 [Arabidopsis lyrata subsp.
            lyrata] gi|297338230|gb|EFH68647.1| hypothetical protein
            ARALYDRAFT_887933 [Arabidopsis lyrata subsp. lyrata]
          Length = 508

 Score =  659 bits (1700), Expect = 0.0
 Identities = 301/479 (62%), Positives = 372/479 (77%), Gaps = 6/479 (1%)
 Frame = -3

Query: 1699 VVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKIIQC 1520
            V+ L+F SFTA LL+KVD F++Q KT+AGHNL+PTPWH+FP K F++G+K+++A +I+QC
Sbjct: 26   VLALSFFSFTALLLYKVDDFIAQTKTLAGHNLEPTPWHIFPRKSFSEGTKHSQAYRILQC 85

Query: 1519 SYLKCRRSSPEPDPSHHS------RNTRSKCPSFFHWIHRDLDPWTQSHISLSNLLEAQK 1358
            SY  C  ++     S  S      R  + +CP FF WIHRDL+PW ++ ++  ++  A+ 
Sbjct: 86   SYFSCPYNAVVQPKSLQSESVSGRRTHQPQCPDFFRWIHRDLEPWAKTGVTKEHVKRAKA 145

Query: 1357 FAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVN 1178
             AAFRVVI+ G+LY D YY CVQSR MFTIWG+LQLL +YPGMVPDVD+MFDCMD+P++N
Sbjct: 146  NAAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLNKYPGMVPDVDMMFDCMDKPIIN 205

Query: 1177 KTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALS 998
            +TEY S     P PLFRYCT   H DIPFPDWSFWGW E N+ PW+EEF  IKQGS+  S
Sbjct: 206  QTEYQSF----PVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWEEEFGDIKQGSRRRS 261

Query: 997  WTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCK 818
            W  K   AYWKGNPDV SP+R++L++CNHSR+WGAQIMRQ+W EEA+GG+EQSKL+NQC 
Sbjct: 262  WDNKQPRAYWKGNPDVVSPIRLELMKCNHSRLWGAQIMRQDWAEEAKGGFEQSKLSNQCN 321

Query: 817  HRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSI 638
            HRYKIYAEGYAWSVSLKYILSCGS+ LIISP+YEDFFSR L+P++NYWP+SP TDLCRSI
Sbjct: 322  HRYKIYAEGYAWSVSLKYILSCGSMTLIISPEYEDFFSRGLLPKENYWPISP-TDLCRSI 380

Query: 637  KFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQE 458
            K+AVDWGN NPS+AE IGK GQ +ME +SM+RVY+YM+HLI EYSKLQ FKP  P+SA E
Sbjct: 381  KYAVDWGNANPSQAETIGKRGQGYMESISMDRVYDYMFHLITEYSKLQKFKPEKPASANE 440

Query: 457  VCMESLLCFTDPKQRHFLERSTASLSSSHPCTFPPGDRDVVKSWIQRKKKIITNVQELE 281
            VC  SLLCF + K+R  LERS    S   PC  P  DR  ++  IQ+KKK I NV+ +E
Sbjct: 441  VCAGSLLCFAEQKERELLERSRVVPSLDQPCKLPVADRSRLERLIQQKKKTIENVRYME 499


>ref|NP_172202.1| uncharacterized protein [Arabidopsis thaliana]
            gi|8954024|gb|AAF82198.1|AC067971_6 Contains similarity
            to an unknown protein T2J13.180 gi|6522568 from
            Arabidopsis thaliana BAC T2J13 gb|AL132967. ESTs
            gb|Z29835 and gb|Z29836 come from this gene [Arabidopsis
            thaliana] gi|332189973|gb|AEE28094.1| uncharacterized
            protein AT1G07220 [Arabidopsis thaliana]
            gi|591402476|gb|AHL38965.1| glycosyltransferase, partial
            [Arabidopsis thaliana]
          Length = 507

 Score =  652 bits (1683), Expect = 0.0
 Identities = 300/479 (62%), Positives = 367/479 (76%), Gaps = 6/479 (1%)
 Frame = -3

Query: 1699 VVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKIIQC 1520
            V+ L+F SFTA L +KVD F++Q KT+AGHNL+PTPWH+FP K F+  +K+++A +I+QC
Sbjct: 25   VLALSFFSFTALLFYKVDDFIAQTKTLAGHNLEPTPWHIFPRKSFSAATKHSQAYRILQC 84

Query: 1519 SYLKCRRSSPEPDPSHHSRNTRSK------CPSFFHWIHRDLDPWTQSHISLSNLLEAQK 1358
            SY  C   +     S HS +   +      CP FF WIHRDL+PW ++ ++  ++  A+ 
Sbjct: 85   SYFSCPYKAVVQPKSLHSESGSGRQTHQPQCPDFFRWIHRDLEPWAKTGVTKEHVKRAKA 144

Query: 1357 FAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVN 1178
             AAFRVVI+ G+LY D YY CVQSR MFTIWG+LQLL +YPGMVPDVD+MFDCMD+P++N
Sbjct: 145  NAAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLTKYPGMVPDVDMMFDCMDKPIIN 204

Query: 1177 KTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALS 998
            +TEY S     P PLFRYCT   H DIPFPDWSFWGW E N+ PW+EEF  IKQGS+  S
Sbjct: 205  QTEYQSF----PVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWEEEFGDIKQGSRRRS 260

Query: 997  WTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCK 818
            W  K   AYWKGNPDV SP+R++L++CNHSR+WGAQIMRQ+W EEA+GG+EQSKL+NQC 
Sbjct: 261  WYNKQPRAYWKGNPDVVSPIRLELMKCNHSRLWGAQIMRQDWAEEAKGGFEQSKLSNQCN 320

Query: 817  HRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSI 638
            HRYKIYAEGYAWSVSLKYILSCGS+ LIISP+YEDFFSR L+P++NYWP+SP TDLCRSI
Sbjct: 321  HRYKIYAEGYAWSVSLKYILSCGSMTLIISPEYEDFFSRGLLPKENYWPISP-TDLCRSI 379

Query: 637  KFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQE 458
            K+AVDWGN NPSEAE IGK GQ +ME LSM RVY+YM+HLI EYSKLQ FKP  P+SA E
Sbjct: 380  KYAVDWGNSNPSEAETIGKRGQGYMESLSMNRVYDYMFHLITEYSKLQKFKPEKPASANE 439

Query: 457  VCMESLLCFTDPKQRHFLERSTASLSSSHPCTFPPGDRDVVKSWIQRKKKIITNVQELE 281
            VC  SLLC  + K+R  LERS    S   PC FP  DR+ ++  IQ+K K I NV+ +E
Sbjct: 440  VCAGSLLCIAEQKERELLERSRVVPSLDQPCKFPVEDRNRLEWLIQQKNKTIENVRYME 498


>dbj|BAD94602.1| hypothetical protein [Arabidopsis thaliana]
          Length = 507

 Score =  650 bits (1676), Expect = 0.0
 Identities = 299/479 (62%), Positives = 366/479 (76%), Gaps = 6/479 (1%)
 Frame = -3

Query: 1699 VVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKIIQC 1520
            V+ L+F SFTA L +KVD F++Q KT+AGHNL+PTPWH+FP K F+  +K+++A +I+QC
Sbjct: 25   VLALSFFSFTALLFYKVDDFIAQTKTLAGHNLEPTPWHIFPRKSFSAATKHSQAYRILQC 84

Query: 1519 SYLKCRRSSPEPDPSHHSRNTRSK------CPSFFHWIHRDLDPWTQSHISLSNLLEAQK 1358
            SY  C   +     S HS +   +      CP FF WIHRDL+PW ++ ++  ++  A+ 
Sbjct: 85   SYFSCPYKAVVQPKSLHSESGSGRQTHQPQCPDFFRWIHRDLEPWAKTGVTKEHVKRAKA 144

Query: 1357 FAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVN 1178
             AAFRVVI+ G+LY D YY CVQSR MFTIWG+LQLL +YPGMVPDVD+MFDCMD+P++N
Sbjct: 145  NAAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLTKYPGMVPDVDMMFDCMDKPIIN 204

Query: 1177 KTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALS 998
            +TEY S     P PLFRYCT   H DIPFPDWSFWGW E N+ PW+ EF  IKQGS+  S
Sbjct: 205  QTEYQSF----PVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWEVEFGDIKQGSRRRS 260

Query: 997  WTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCK 818
            W  K   AYWKGNPDV SP+R++L++CNHSR+WGAQIMRQ+W EEA+GG+EQSKL+NQC 
Sbjct: 261  WYNKQPRAYWKGNPDVVSPIRLELMKCNHSRLWGAQIMRQDWAEEAKGGFEQSKLSNQCN 320

Query: 817  HRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSI 638
            HRYKIYAEGYAWSVSLKYILSCGS+ LIISP+YEDFFSR L+P++NYWP+SP TDLCRSI
Sbjct: 321  HRYKIYAEGYAWSVSLKYILSCGSMTLIISPEYEDFFSRGLLPKENYWPISP-TDLCRSI 379

Query: 637  KFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQE 458
            K+AVDWGN NPSEAE IGK GQ +ME LSM RVY+YM+HLI EYSKLQ FKP  P+SA E
Sbjct: 380  KYAVDWGNSNPSEAETIGKRGQGYMESLSMNRVYDYMFHLITEYSKLQKFKPEKPASANE 439

Query: 457  VCMESLLCFTDPKQRHFLERSTASLSSSHPCTFPPGDRDVVKSWIQRKKKIITNVQELE 281
            VC  SLLC  + K+R  LERS    S   PC FP  DR+ ++  IQ+K K I NV+ +E
Sbjct: 440  VCAGSLLCIAEQKERELLERSRVVPSLDQPCKFPVEDRNRLEWLIQQKNKTIENVRYME 498


>ref|XP_006307276.1| hypothetical protein CARUB_v10008891mg [Capsella rubella]
            gi|482575987|gb|EOA40174.1| hypothetical protein
            CARUB_v10008891mg [Capsella rubella]
          Length = 509

 Score =  648 bits (1671), Expect = 0.0
 Identities = 300/480 (62%), Positives = 371/480 (77%), Gaps = 7/480 (1%)
 Frame = -3

Query: 1699 VVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKIIQC 1520
            V+ L+F SFTA LL+KVD FV+Q KT+AGHNL+PTPWH+FP K F++ +++++A +I+QC
Sbjct: 25   VLALSFFSFTALLLYKVDDFVAQTKTLAGHNLEPTPWHIFPRKSFSEATRHSQAYRILQC 84

Query: 1519 SYLKCR-RSSPEP-----DPSHHSRNTRSKCPSFFHWIHRDLDPWTQSHISLSNLLEAQK 1358
            SY  C  ++  +P     D     R  + +CP FF WIHRDL+PW ++ ++  ++  A+ 
Sbjct: 85   SYFSCPYKAVVQPKGLLSDSGSGRRTHQPQCPDFFRWIHRDLEPWAETGVTKEHVKRAKA 144

Query: 1357 FAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVN 1178
             AAFRVVI+ G+LY D YY CVQSR MFTIWG+LQLL +YPGMVPDVD+MFDCMD+P++N
Sbjct: 145  NAAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLNKYPGMVPDVDMMFDCMDKPIIN 204

Query: 1177 KTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALS 998
            +TE+ S     P PLFRYCT   H DIPFPDWSFWGW E N+ PW+EEF  IKQGS+  S
Sbjct: 205  RTEHQSF----PAPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWEEEFGDIKQGSRKRS 260

Query: 997  WTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCK 818
            W  K   AYWKGNPDV SPVR++L++CNHSR+WGAQIMRQ+W EEA+GG+EQSKL+NQC 
Sbjct: 261  WDSKQPRAYWKGNPDVVSPVRLELMKCNHSRLWGAQIMRQDWAEEAKGGFEQSKLSNQCN 320

Query: 817  HRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSI 638
            HRYKIYAEGYAWSVSLKYILSCGS+ LIISP+YEDFFSR L+P++NYWPVSP TDLCRSI
Sbjct: 321  HRYKIYAEGYAWSVSLKYILSCGSMTLIISPEYEDFFSRGLLPKENYWPVSP-TDLCRSI 379

Query: 637  KFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQE 458
            K AVDWGN NPS+AE IGK GQ +ME +SM RVY+YM+HLI EYSKLQ FKP  P+SA E
Sbjct: 380  KHAVDWGNANPSDAEKIGKRGQGYMESISMNRVYDYMFHLITEYSKLQKFKPEKPASANE 439

Query: 457  VCMESLLCFTDPKQRHFLERS-TASLSSSHPCTFPPGDRDVVKSWIQRKKKIITNVQELE 281
            VC  SLLCF + K+R  LERS   + S    C  P  DR+ ++  IQ+KKK I NV+ +E
Sbjct: 440  VCAGSLLCFAEQKERELLERSRVVAPSVDQQCKLPDADRNRLERLIQQKKKTIENVRYME 499


>ref|XP_003632132.1| PREDICTED: protein O-glucosyltransferase 1-like [Vitis vinifera]
            gi|297745896|emb|CBI15952.3| unnamed protein product
            [Vitis vinifera]
          Length = 464

 Score =  642 bits (1657), Expect = 0.0
 Identities = 302/455 (66%), Positives = 354/455 (77%), Gaps = 7/455 (1%)
 Frame = -3

Query: 1705 PFVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKII 1526
            P ++   F      ++++VD F SQ KT+ GHNL+PTPWHLFPP  F + ++Y R SKII
Sbjct: 14   PCIIFSPFFFLIILVIYQVDDFASQTKTVVGHNLNPTPWHLFPPNTFTEKTRYARVSKII 73

Query: 1525 QCSYLKCRRSSPEPD----PSHHSR---NTRSKCPSFFHWIHRDLDPWTQSHISLSNLLE 1367
            QCSYL CRR S  P     P  H+R   NT  KCP FF  I  DL PW +S ISLS+++E
Sbjct: 74   QCSYLTCRRRSITPTTTKIPEWHTRQSSNTVGKCPMFFTRIDHDLQPWVRSGISLSSVME 133

Query: 1366 AQKFAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRP 1187
            AQKFAAFRVVIVGG+LY DF+Y CVQSRAMFT+WGLLQLL+RYPG VPDVDLMFDCMD+P
Sbjct: 134  AQKFAAFRVVIVGGKLYVDFFYACVQSRAMFTVWGLLQLLRRYPGTVPDVDLMFDCMDKP 193

Query: 1186 VVNKTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQ 1007
             +++ E+GS     P PLFRYCTT  HFDIPFPDWSFWGWPE++IGPWDEEF  IKQGSQ
Sbjct: 194  TISREEHGSK----PLPLFRYCTTMDHFDIPFPDWSFWGWPEIDIGPWDEEFIGIKQGSQ 249

Query: 1006 ALSWTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLAN 827
             L+WT+K   AYWKGNPDV SPVR+DLL+CN+S + GAQIMRQ+W+EEA+ G+++SKL+N
Sbjct: 250  VLNWTQKLSYAYWKGNPDVQSPVRVDLLQCNNSDIIGAQIMRQDWVEEAKNGFKESKLSN 309

Query: 826  QCKHRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLC 647
            QC HRYKIYAEGYAWSVSLKYILSCGSLALII+PQYE+FF+  L+   NYWP+S   D+C
Sbjct: 310  QCNHRYKIYAEGYAWSVSLKYILSCGSLALIIAPQYEEFFNHGLISMTNYWPIS-RLDIC 368

Query: 646  RSIKFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSS 467
             SIKFAV WGN + SEA+AIGK GQ  ME +SM RVY+YMYHLI EYSKL  FKP PP S
Sbjct: 369  PSIKFAVSWGNTHHSEAKAIGKSGQDLMESMSMARVYDYMYHLITEYSKLLRFKPEPPPS 428

Query: 466  AQEVCMESLLCFTDPKQRHFLERSTASLSSSHPCT 362
            A E+C ESLLCF DP QR  LERST   S + PCT
Sbjct: 429  AHEICEESLLCFADPTQRQCLERSTTYPSPTPPCT 463


>emb|CAN70836.1| hypothetical protein VITISV_015872 [Vitis vinifera]
          Length = 922

 Score =  640 bits (1652), Expect = 0.0
 Identities = 302/437 (69%), Positives = 348/437 (79%), Gaps = 7/437 (1%)
 Frame = -3

Query: 1651 VDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKIIQCSYLKCRRSSPEPD--- 1481
            VD F SQ KT+ GHNL+PTPWHLFPPK F + ++YTR SKIIQCSYL CRR S  P    
Sbjct: 490  VDDFASQTKTVVGHNLNPTPWHLFPPKTFTEKTRYTRVSKIIQCSYLTCRRRSITPTTTK 549

Query: 1480 -PSHHSR---NTRSKCPSFFHWIHRDLDPWTQSHISLSNLLEAQKFAAFRVVIVGGRLYA 1313
             P  H+R   NT  KCP FF  I  DL PW +S ISLS+++EAQKFAAFRVVIVGG+LY 
Sbjct: 550  IPEWHTRQSSNTVGKCPMFFTRIXHDLQPWVRSGISLSSVMEAQKFAAFRVVIVGGKLYV 609

Query: 1312 DFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVNKTEYGSSTIWPPPPL 1133
            DF+Y CVQSRAMFT+WGLLQLL+RYPG VPDVDLMFDCMD+P +++ E+GS     P PL
Sbjct: 610  DFFYACVQSRAMFTVWGLLQLLRRYPGTVPDVDLMFDCMDKPTISREEHGSK----PLPL 665

Query: 1132 FRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALSWTEKWLLAYWKGNPD 953
            FRYCTT  HFDIPFPDWSFWGWPE++IGPWDEEF  IKQGSQ L+WT+K   AYWKGNPD
Sbjct: 666  FRYCTTMDHFDIPFPDWSFWGWPEIDIGPWDEEFIGIKQGSQVLNWTQKLSYAYWKGNPD 725

Query: 952  VDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCKHRYKIYAEGYAWSVS 773
            V SPVR+DLL+CN+S + GAQIMRQ+W+EEA+ G+++SKL+NQC HRYKIYAEGYAWSVS
Sbjct: 726  VQSPVRVDLLQCNNSDIIGAQIMRQDWVEEAKNGFKESKLSNQCNHRYKIYAEGYAWSVS 785

Query: 772  LKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSIKFAVDWGNKNPSEAE 593
            LKYILSCGSLALII+PQYE+FF+  L+   NYWP+S   D+C SIKFAV WGN + SEA+
Sbjct: 786  LKYILSCGSLALIIAPQYEEFFNHGLISMTNYWPIS-RLDICPSIKFAVSWGNTHHSEAK 844

Query: 592  AIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQEVCMESLLCFTDPKQR 413
            AIGK GQ  ME +SM RVY+YMYHLI EYSKL  FKP PP SA E+C ESLLCF DP QR
Sbjct: 845  AIGKSGQDLMESMSMARVYDYMYHLITEYSKLLRFKPEPPPSAHEICEESLLCFADPTQR 904

Query: 412  HFLERSTASLSSSHPCT 362
              LERST   S + PCT
Sbjct: 905  QCLERSTTYPSPTPPCT 921


>ref|XP_004242062.1| PREDICTED: uncharacterized protein LOC101246258 [Solanum
            lycopersicum]
          Length = 463

 Score =  634 bits (1634), Expect = e-179
 Identities = 296/467 (63%), Positives = 365/467 (78%), Gaps = 3/467 (0%)
 Frame = -3

Query: 1747 MGMSSK-NIATTLRAPFVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPK 1571
            MG+ S+ N   +    +VV  AF      L F+VD  VSQ KTI GHNL+PTPWH+FP K
Sbjct: 1    MGIFSRHNRRPSFLPRYVVFFAFLFLALILFFEVDNLVSQTKTIVGHNLEPTPWHIFPAK 60

Query: 1570 EFNDGSKYTRASKIIQCSYLKCRRSSPEPD--PSHHSRNTRSKCPSFFHWIHRDLDPWTQ 1397
             F+D S Y++AS IIQCSYL C  SS   D   S   ++   KCP FF  I  DL+PW +
Sbjct: 61   SFDDESTYSKASTIIQCSYLTCSSSSHVVDIPRSTKPQSKSHKCPDFFKSIRYDLEPWAK 120

Query: 1396 SHISLSNLLEAQKFAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDV 1217
            S IS+++++EAQK AAFRVVIVGG+L+ DFYY CVQSRAMFTIWG+LQLL++YPG VPDV
Sbjct: 121  SRISINHVMEAQKNAAFRVVIVGGKLFVDFYYACVQSRAMFTIWGILQLLRKYPGKVPDV 180

Query: 1216 DLMFDCMDRPVVNKTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDE 1037
            DLMFDCMD+P++N+TE+ S     P PLFRYCTT  H+DIPFPDWSFWGW E+NI PW+E
Sbjct: 181  DLMFDCMDKPIINRTEHSSM----PVPLFRYCTTPNHYDIPFPDWSFWGWSEINIRPWNE 236

Query: 1036 EFRSIKQGSQALSWTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEAR 857
            EF+SIK+GS + SWT K  +AYWKGNPDV SP+R++LL CN +++W AQIMRQNW EEA+
Sbjct: 237  EFKSIKEGSNSKSWTSKIPVAYWKGNPDVVSPIRLELLNCNDTKMWRAQIMRQNWTEEAK 296

Query: 856  GGYEQSKLANQCKHRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNY 677
             G+E+SKL+ QC HRYKIYAEGYAWSVSLKYILSCGSL LII+PQY+DFFSR L+P++NY
Sbjct: 297  VGFEKSKLSKQCNHRYKIYAEGYAWSVSLKYILSCGSLPLIITPQYQDFFSRGLIPKKNY 356

Query: 676  WPVSPTTDLCRSIKFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKL 497
            WP+ P  DLC SIK AVDWGN NP EAEAIGK GQ FME LS++R+Y+YMYHLI+EY+KL
Sbjct: 357  WPL-PPFDLCPSIKQAVDWGNANPLEAEAIGKAGQDFMESLSIDRIYDYMYHLISEYAKL 415

Query: 496  QDFKPSPPSSAQEVCMESLLCFTDPKQRHFLERSTASLSSSHPCTFP 356
            QDF P PPSSA E+C++++LCF D +Q+ FL++S    S+  PC+ P
Sbjct: 416  QDFVPVPPSSALELCIDTVLCFADDQQKRFLKKSLVFPSNESPCSLP 462


>ref|XP_006350920.1| PREDICTED: uncharacterized protein LOC102588367 [Solanum tuberosum]
          Length = 463

 Score =  631 bits (1627), Expect = e-178
 Identities = 294/467 (62%), Positives = 365/467 (78%), Gaps = 3/467 (0%)
 Frame = -3

Query: 1747 MGMSSK-NIATTLRAPFVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPK 1571
            MG+ S+ N   +    +VV  AF      L F+VD  VSQ KTI GHNL+PTPWH+FP K
Sbjct: 1    MGIFSRHNRRPSFLPRYVVFFAFLFLALLLFFEVDNLVSQTKTIVGHNLEPTPWHVFPAK 60

Query: 1570 EFNDGSKYTRASKIIQCSYLKCRRSSPEPD--PSHHSRNTRSKCPSFFHWIHRDLDPWTQ 1397
             F++ S Y++AS IIQCSYL C  +S   +   S   ++  +KCP FF  I  DL+PW +
Sbjct: 61   SFDEESTYSKASTIIQCSYLTCSSNSHVTNIPRSTKPQSETNKCPDFFKSIRYDLEPWAK 120

Query: 1396 SHISLSNLLEAQKFAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDV 1217
            S IS+++++EAQK AAFRVVIVGG+L+ DFYY CVQSRAMFTIWG+LQLL++YPG VPDV
Sbjct: 121  SRISINHVMEAQKNAAFRVVIVGGKLFVDFYYACVQSRAMFTIWGILQLLRKYPGKVPDV 180

Query: 1216 DLMFDCMDRPVVNKTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDE 1037
            DLMFDCMD+P++N+TEY S     P PLFRYCTT  H+DIPFPDWSFWGW E+NI PW+E
Sbjct: 181  DLMFDCMDKPIINRTEYSSM----PLPLFRYCTTPNHYDIPFPDWSFWGWSEINIRPWNE 236

Query: 1036 EFRSIKQGSQALSWTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEAR 857
            EF+SIK+GS + SWT K  +AYWKGNPDV SP+R++LL CN +++W AQIMRQNW EEA+
Sbjct: 237  EFKSIKEGSNSRSWTSKIPVAYWKGNPDVVSPIRLELLNCNDTQMWRAQIMRQNWTEEAK 296

Query: 856  GGYEQSKLANQCKHRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNY 677
             G+E+SKL+ QC HRYKIYAEGYAWSVSLKYIL+CGSL LII+PQY+DFFSR L+P++NY
Sbjct: 297  VGFEKSKLSKQCNHRYKIYAEGYAWSVSLKYILACGSLPLIITPQYQDFFSRGLIPKKNY 356

Query: 676  WPVSPTTDLCRSIKFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKL 497
            WP+ P  DLC SIK AVDWGN NP EAEAIGK GQ FME LS++R+Y+YMYHLI+EY+KL
Sbjct: 357  WPL-PPFDLCSSIKDAVDWGNSNPLEAEAIGKAGQDFMESLSIDRIYDYMYHLISEYAKL 415

Query: 496  QDFKPSPPSSAQEVCMESLLCFTDPKQRHFLERSTASLSSSHPCTFP 356
            QDF P PPSSA E+C+ S+LCF D +Q+ FL++S    S+  PC+ P
Sbjct: 416  QDFVPVPPSSALELCINSVLCFADDQQKQFLKKSLVFPSNESPCSLP 462


>ref|XP_004500267.1| PREDICTED: uncharacterized protein LOC101501069 [Cicer arietinum]
          Length = 466

 Score =  624 bits (1609), Expect = e-176
 Identities = 301/468 (64%), Positives = 359/468 (76%), Gaps = 6/468 (1%)
 Frame = -3

Query: 1747 MGMSSKNI--ATTLRAPFVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPP 1574
            MG+SSK+     T   P V+ L+F S TA LL+KVD   S+  T+ GHNL+PTPWH+FP 
Sbjct: 1    MGLSSKHYPRTPTYLLPCVLALSFFSLTALLLYKVDDVASRTGTVVGHNLEPTPWHVFPA 60

Query: 1573 KEFNDGSKYTRASKIIQCSYLKCRRSSPEPDPSHHSRNT----RSKCPSFFHWIHRDLDP 1406
            K F++ ++  RA KIIQCSYL CR SS   D       T    R  CP FF  I +DL+P
Sbjct: 61   KPFDEETRQRRAYKIIQCSYLTCR-SSVSGDRKRLGFATGDSKRQDCPDFFRAIRKDLEP 119

Query: 1405 WTQSHISLSNLLEAQKFAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMV 1226
            W  + IS  +L+ AQ +AAFRVVIVGG+++ D+YY CVQSRAMFT+W LLQLL++YPG+V
Sbjct: 120  WKVTKISKGHLMAAQNYAAFRVVIVGGKMFVDWYYACVQSRAMFTVWSLLQLLRKYPGLV 179

Query: 1225 PDVDLMFDCMDRPVVNKTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGP 1046
            PDVDLMFDCMD+P +NKTE+ S     P PLFRYCTT  HFDIPFPDWSFWGWPE+NIGP
Sbjct: 180  PDVDLMFDCMDKPTINKTEHSSM----PLPLFRYCTTKQHFDIPFPDWSFWGWPEINIGP 235

Query: 1045 WDEEFRSIKQGSQALSWTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLE 866
            W EEF  IKQGSQA+SW  K   AYWKGNPDV SPVR +LL CNHSR +GAQIMRQ+W  
Sbjct: 236  WQEEFPDIKQGSQAVSWVNKMPRAYWKGNPDVFSPVRTELLNCNHSRKYGAQIMRQDWGA 295

Query: 865  EARGGYEQSKLANQCKHRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPR 686
             AR G+++SKL+NQC HRYKIYAEGYAWSVSLKYILSC S+ALIISPQYEDFFSR L+P 
Sbjct: 296  AARSGFKESKLSNQCNHRYKIYAEGYAWSVSLKYILSCSSVALIISPQYEDFFSRGLIPN 355

Query: 685  QNYWPVSPTTDLCRSIKFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEY 506
            QN+  + P  D+C SIK AVDWGN +P+EAEA+GK GQ FM  L+++R+Y+YM+HLI+EY
Sbjct: 356  QNFLLIDP-LDMCPSIKRAVDWGNGHPNEAEALGKRGQDFMGSLNIDRIYDYMFHLISEY 414

Query: 505  SKLQDFKPSPPSSAQEVCMESLLCFTDPKQRHFLERSTASLSSSHPCT 362
            SKL DFKP+PPS+A EVC ES+LCF D KQR FL RSTAS S + PCT
Sbjct: 415  SKLLDFKPTPPSTALEVCAESVLCFADDKQRTFLSRSTASPSQTPPCT 462


>ref|XP_003552954.1| PREDICTED: O-glucosyltransferase rumi-like [Glycine max]
          Length = 464

 Score =  618 bits (1593), Expect = e-174
 Identities = 293/468 (62%), Positives = 358/468 (76%), Gaps = 3/468 (0%)
 Frame = -3

Query: 1747 MGMSSKNI--ATTLRAPFVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPP 1574
            MG SSK+   + T   P V+ LA  S T  LL+KVD   S+  T+ GHNL+PTPWH+FP 
Sbjct: 1    MGPSSKHTPRSPTYLIPCVIALALFSLTGLLLYKVDDVASRTGTVVGHNLEPTPWHVFPH 60

Query: 1573 KEFNDGSKYTRASKIIQCSYLKCRRSSPEPDPSHHS-RNTRSKCPSFFHWIHRDLDPWTQ 1397
            K F++ S+  R  KI+QCSYL CR ++     S  S    R +CP FF  IHRDL PW +
Sbjct: 61   KPFDEESRQQRTYKILQCSYLTCRYAAGAVGGSRRSFAGGREECPEFFRAIHRDLAPWLE 120

Query: 1396 SHISLSNLLEAQKFAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDV 1217
            S IS +++  AQ++AAFRVVIV G+++ D+YY CVQSRAMFT+WGLLQL++RYPG VPDV
Sbjct: 121  SRISKAHVAAAQRYAAFRVVIVEGKVFVDWYYACVQSRAMFTLWGLLQLMRRYPGKVPDV 180

Query: 1216 DLMFDCMDRPVVNKTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDE 1037
            D+MFDCMD+P VN+TE+ +     P PLFRYCTT  HFDIPFPDWSFWGW E+NI PW E
Sbjct: 181  DMMFDCMDKPSVNRTEHQAM----PLPLFRYCTTKEHFDIPFPDWSFWGWSEINIRPWQE 236

Query: 1036 EFRSIKQGSQALSWTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEAR 857
            EF  IKQGS+ +SW  K+  AYWKGNPDV SP+R +L+ CN SR WGA+IMRQ+W E AR
Sbjct: 237  EFPDIKQGSRNVSWKNKFPWAYWKGNPDVASPIRTELINCNDSRKWGAEIMRQDWGEAAR 296

Query: 856  GGYEQSKLANQCKHRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNY 677
             G++QSKL+NQC HRYKIYAEGYAWSVSLKYILSCGS+ALIISPQYEDFFSR L+P  N+
Sbjct: 297  SGFKQSKLSNQCNHRYKIYAEGYAWSVSLKYILSCGSVALIISPQYEDFFSRGLIPNHNF 356

Query: 676  WPVSPTTDLCRSIKFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKL 497
            W V  + +LC SIK+AV+WGN++P EAEAIGK GQ FM  L+M+R+YEYM+HLI+EYSKL
Sbjct: 357  WLVD-SLNLCPSIKYAVEWGNQHPVEAEAIGKRGQDFMGSLNMDRIYEYMFHLISEYSKL 415

Query: 496  QDFKPSPPSSAQEVCMESLLCFTDPKQRHFLERSTASLSSSHPCTFPP 353
            QDFKP+PP++A EVC+ES+LCF D KQR FL +STA  S   PC   P
Sbjct: 416  QDFKPTPPTTALEVCVESVLCFADEKQRMFLNKSTAFPSHKPPCNLKP 463


>ref|XP_006488529.1| PREDICTED: uncharacterized protein LOC102623006 [Citrus sinensis]
          Length = 463

 Score =  615 bits (1587), Expect = e-173
 Identities = 288/452 (63%), Positives = 356/452 (78%), Gaps = 4/452 (0%)
 Frame = -3

Query: 1705 PFVVLLAFCSFTA-FLLFKVDTFVSQIKTIAGHNLDPTPWHLFPPKEFNDGSKYTRASKI 1529
            P V+ L+  S  A FL +KVD F S+ KT+AGHNL+PTPWHLFP + F + S+ ++A KI
Sbjct: 17   PCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESRRSQAYKI 76

Query: 1528 IQCSYLKCRRSSPEPDPSHHSRNTRSK---CPSFFHWIHRDLDPWTQSHISLSNLLEAQK 1358
            + C+YL C  S+  P P      +  K   CP FF  IH+DL+PW +S I++ +++EA++
Sbjct: 77   VHCTYLTCL-SAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKR 135

Query: 1357 FAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDLMFDCMDRPVVN 1178
            FAA R++IV G+LY D YYDCVQSRAMFTIWG LQLL+RYPGMVPDVD+MFDCMD+PV++
Sbjct: 136  FAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVID 195

Query: 1177 KTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDEEFRSIKQGSQALS 998
            K E+GS     P PLFRYCT + HFDIPFPDWSFWGW EVN+ PW+EEF+ IK GSQA S
Sbjct: 196  KKEHGSF----PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKS 251

Query: 997  WTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEARGGYEQSKLANQCK 818
            W EK   AYWKGNPDV SP+R++L++CN S++WGA+I+RQNW EEA+ G+++SKL+NQC 
Sbjct: 252  WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN 311

Query: 817  HRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNYWPVSPTTDLCRSI 638
            HRYKIYAEGYAWSVSLKYILSC S+ALIIS QY+DFFSR L+P +N++P+ P+ DLCRSI
Sbjct: 312  HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPI-PSADLCRSI 370

Query: 637  KFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKLQDFKPSPPSSAQE 458
            K  VDWGN NPSEAE IGK GQ FME L+M+RVY+YM HLI EYSKL D+KP+PPSSA E
Sbjct: 371  KSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFE 430

Query: 457  VCMESLLCFTDPKQRHFLERSTASLSSSHPCT 362
             C+ESLLC  DPKQR  LE++ AS S   PCT
Sbjct: 431  ACVESLLCLADPKQRQNLEKAAASPSPYPPCT 462


>ref|XP_006848230.1| hypothetical protein AMTR_s00029p00246540 [Amborella trichopoda]
            gi|548851535|gb|ERN09811.1| hypothetical protein
            AMTR_s00029p00246540 [Amborella trichopoda]
          Length = 527

 Score =  615 bits (1587), Expect = e-173
 Identities = 301/498 (60%), Positives = 367/498 (73%), Gaps = 24/498 (4%)
 Frame = -3

Query: 1702 FVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHL--FPPKEFNDGSKYTRASKI 1529
            F+ LL+  +    LL+KV       KTI GHNL+PTPWH    PP      +KY   SKI
Sbjct: 29   FLSLLSVAAL--LLLYKVAIISQTTKTIVGHNLEPTPWHRHQLPPDNNQWTAKY---SKI 83

Query: 1528 IQCSYL--KCRRSS-----PEPDPSHHSRN-----TRSKCPSFFHWIHRDLDPWTQSHIS 1385
             +CSYL   C  SS     P   PS    N     +R +CP+FF WIH DL PW  S + 
Sbjct: 84   FRCSYLLNACSSSSSSSTKPYKQPSIAYANPNLSSSRGECPAFFRWIHDDLSPWRDSGVR 143

Query: 1384 LSN--LLEAQKFAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDVDL 1211
            ++   ++EA+K+AAFRVVIVGG+LY D YY CVQSRAMFTIWGLL+LL R+PG+VPDVDL
Sbjct: 144  ITQQKVMEARKWAAFRVVIVGGKLYVDLYYACVQSRAMFTIWGLLRLLDRFPGLVPDVDL 203

Query: 1210 MFDCMDRPVVNKTEYGSSTI------WPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIG 1049
            MFDCMDRP+++++ + + +       WPPPPLFRYC++  HFDIPFPDWSFWGW EVN+ 
Sbjct: 204  MFDCMDRPMIHRSSFNNQSATSNSWNWPPPPLFRYCSSTKHFDIPFPDWSFWGWSEVNLA 263

Query: 1048 PWDEEFRSIKQGSQALSWTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWL 869
            PWDEEFRSIK+GSQ L WTE+   AYWKGNPDV SPVR DLLRCN S +WGAQIMRQ+W+
Sbjct: 264  PWDEEFRSIKRGSQNLKWTEREARAYWKGNPDVQSPVREDLLRCNDSAIWGAQIMRQDWV 323

Query: 868  EEARGGYEQSKLANQCKHRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMP 689
            EEAR GYE+SKLANQC HRYKIYAEGYAWSVSLKYILSCGSLALIISPQY DFFSR L+P
Sbjct: 324  EEARAGYEKSKLANQCTHRYKIYAEGYAWSVSLKYILSCGSLALIISPQYYDFFSRGLIP 383

Query: 688  RQNYWPVSPTTDLCRSIKFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINE 509
            R+++WP+S +T+LC SIKFAVDWGN++  EAEAIG+GGQ FM+ ++ME VY+YM+HL+ E
Sbjct: 384  RKSFWPIS-STNLCPSIKFAVDWGNEHQIEAEAIGRGGQDFMQAMNMETVYDYMFHLLME 442

Query: 508  YSKLQDFKPSPPSSAQEVCMESLLCFTDPKQRHFLERSTASLSSSH--PCTFPPGDRDVV 335
            YSKLQDFKP  P SAQ +C ES+LCF   ++R FL R T +  SSH  PC+ P  D+ ++
Sbjct: 443  YSKLQDFKPKAPPSAQLLCTESVLCFAGARERDFLLRETPAADSSHSLPCSLPSPDKKLI 502

Query: 334  KSWIQRKKKIITNVQELE 281
             +W    + II  +Q  E
Sbjct: 503  GAWTMHNQHIINEIQGKE 520


>ref|XP_003539326.2| PREDICTED: O-glucosyltransferase rumi-like [Glycine max]
          Length = 496

 Score =  610 bits (1572), Expect = e-172
 Identities = 288/468 (61%), Positives = 356/468 (76%), Gaps = 3/468 (0%)
 Frame = -3

Query: 1747 MGMSSKNI--ATTLRAPFVVLLAFCSFTAFLLFKVDTFVSQIKTIAGHNLDPTPWHLFPP 1574
            MG SS +   + T   P V+ LA  S T  LL+KVD   S+  T+ GHNL+PTPWH+FP 
Sbjct: 33   MGPSSTHTPRSPTYLIPCVIALALFSLTGLLLYKVDDVASRTGTVVGHNLEPTPWHVFPH 92

Query: 1573 KEFNDGSKYTRASKIIQCSYLKCRRSSPEPDPSHH-SRNTRSKCPSFFHWIHRDLDPWTQ 1397
            K F++ S+  RA KI+QCSYL CR ++     +   +   R +CP FF  IHRDL PW++
Sbjct: 93   KPFDEESRQQRAYKILQCSYLTCRYAAEALGGARRRTGGGREECPKFFRAIHRDLAPWSE 152

Query: 1396 SHISLSNLLEAQKFAAFRVVIVGGRLYADFYYDCVQSRAMFTIWGLLQLLQRYPGMVPDV 1217
            S IS +++  AQ++AAFRVVIV G+++ D+YY CVQSRAMFT+WGLLQL++RYPGMVPDV
Sbjct: 153  SRISKAHVAAAQRYAAFRVVIVEGKVFVDWYYACVQSRAMFTLWGLLQLMRRYPGMVPDV 212

Query: 1216 DLMFDCMDRPVVNKTEYGSSTIWPPPPLFRYCTTNGHFDIPFPDWSFWGWPEVNIGPWDE 1037
            D+MFDCMD+P VNKTE+ +     P PLFRYCTT  HFDIPFPDWSFWGW E+NI PW E
Sbjct: 213  DMMFDCMDKPSVNKTEHQAM----PLPLFRYCTTKEHFDIPFPDWSFWGWSEINIRPWQE 268

Query: 1036 EFRSIKQGSQALSWTEKWLLAYWKGNPDVDSPVRIDLLRCNHSRVWGAQIMRQNWLEEAR 857
            EF  IK+GS++++W  K   AYWKGNPDV SP+R +L+ CN SR WGA+IMRQ+W E AR
Sbjct: 269  EFPDIKRGSRSVTWKNKLPWAYWKGNPDVASPIRTELINCNDSRKWGAEIMRQDWGEAAR 328

Query: 856  GGYEQSKLANQCKHRYKIYAEGYAWSVSLKYILSCGSLALIISPQYEDFFSRALMPRQNY 677
             G++QSKL++QC HRYKIYAEGYAWSVSLKYILSCGS+ALIISPQYEDFFSR L+P  N+
Sbjct: 329  NGFKQSKLSDQCNHRYKIYAEGYAWSVSLKYILSCGSVALIISPQYEDFFSRGLIPNHNF 388

Query: 676  WPVSPTTDLCRSIKFAVDWGNKNPSEAEAIGKGGQKFMEDLSMERVYEYMYHLINEYSKL 497
            W V P  +LC SIK+AV+WGN++P EAEAIGK GQ  ME L+M R+YEYM+HLI++YSKL
Sbjct: 389  WLVDP-LNLCPSIKYAVEWGNQHPVEAEAIGKRGQDLMESLNMNRIYEYMFHLISDYSKL 447

Query: 496  QDFKPSPPSSAQEVCMESLLCFTDPKQRHFLERSTASLSSSHPCTFPP 353
            QDFKP+PP +A EVC+ES+LCF D KQR FL +S    S   PC   P
Sbjct: 448  QDFKPTPPPTALEVCVESVLCFADEKQRMFLNKSFTFPSHKPPCNLKP 495

