BLASTX nr result

ID: Akebia23_contig00016989 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00016989
         (2834 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI18781.3| unnamed protein product [Vitis vinifera]              978   0.0  
emb|CAN70213.1| hypothetical protein VITISV_038741 [Vitis vinifera]   973   0.0  
ref|XP_002272372.2| PREDICTED: uncharacterized protein LOC100258...   972   0.0  
gb|EXB64628.1| putative galacturonosyltransferase 4 [Morus notab...   951   0.0  
ref|XP_007026430.1| Galacturonosyltransferase 4 isoform 1 [Theob...   939   0.0  
ref|XP_007026431.1| Galacturonosyltransferase 4 isoform 2 [Theob...   937   0.0  
ref|XP_002525229.1| Glycosyltransferase QUASIMODO1, putative [Ri...   934   0.0  
ref|XP_004293423.1| PREDICTED: probable galacturonosyltransferas...   923   0.0  
ref|XP_007213869.1| hypothetical protein PRUPE_ppa018681mg [Prun...   916   0.0  
ref|XP_006350232.1| PREDICTED: probable galacturonosyltransferas...   912   0.0  
ref|XP_006857626.1| hypothetical protein AMTR_s00061p00126570 [A...   911   0.0  
ref|XP_006467237.1| PREDICTED: probable galacturonosyltransferas...   910   0.0  
ref|XP_006398488.1| hypothetical protein EUTSA_v10000819mg [Eutr...   909   0.0  
ref|XP_006350235.1| PREDICTED: probable galacturonosyltransferas...   908   0.0  
ref|XP_006449976.1| hypothetical protein CICLE_v10014426mg [Citr...   908   0.0  
ref|XP_004236640.1| PREDICTED: probable galacturonosyltransferas...   906   0.0  
gb|EYU33824.1| hypothetical protein MIMGU_mgv1a002625mg [Mimulus...   905   0.0  
ref|XP_004499343.1| PREDICTED: probable galacturonosyltransferas...   898   0.0  
ref|XP_006600275.1| PREDICTED: probable galacturonosyltransferas...   896   0.0  
ref|XP_006584115.1| PREDICTED: probable galacturonosyltransferas...   892   0.0  

>emb|CBI18781.3| unnamed protein product [Vitis vinifera]
          Length = 638

 Score =  978 bits (2529), Expect = 0.0
 Identities = 488/658 (74%), Positives = 548/658 (83%), Gaps = 11/658 (1%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLG-SF-ISLDSRNEFIEDVSTLSFGGEIRKLNVLP 2298
            M+ RK VLFLL+VTVL+PIVLYTD LG SF  S  + +EF EDV+ L+ GG   KLN+LP
Sbjct: 1    MIKRKTVLFLLLVTVLSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNLLP 60

Query: 2297 QESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT-------- 2142
            QESS TLKEPIGIVYSDN       S ++++S  +L L  S EHKTRVLS T        
Sbjct: 61   QESSTTLKEPIGIVYSDND------SLDVDESAADLQLGGSVEHKTRVLSTTYEEGDRSQ 114

Query: 2141 -ENPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGK 1965
             ENPI+QV DG     + D LQ    +      + +              GQQS    GK
Sbjct: 115  RENPIRQVTDG-----KDDNLQRGSELTSHNASQNSETEH----------GQQSAQTSGK 159

Query: 1964 TNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKE 1785
             +  EP K +  +P  + V+ DARV+ LKDQLIRAKV+L L ATRNN HFIRELR R+KE
Sbjct: 160  GDHKEPVKTRNEKPIDQTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARMKE 219

Query: 1784 VQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVH 1605
            VQRALGDATKDSELP+NAY+KLK MEQTLAKGKQIQDDCAAV+KKLRAILHS EEQLRVH
Sbjct: 220  VQRALGDATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLRVH 279

Query: 1604 KKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDN 1425
            KKQTM+LTQL AKTLPKGLHCLPLRLSTEYY+L+S++QQFPN++KLEDP L+HYALFSDN
Sbjct: 280  KKQTMYLTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFSDN 339

Query: 1424 ILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWL 1245
            ILAAAVVVNSTV++AK+P+ HVFHIV+DRLNYAAMRMWFLANPPGKATIQVQNI+EFTWL
Sbjct: 340  ILAAAVVVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFTWL 399

Query: 1244 NSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKV 1065
            NSSYSPVLKQLGS SMIDYYFK HR+NSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKV
Sbjct: 400  NSSYSPVLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKV 459

Query: 1064 LFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACG 885
            LFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD  ACG
Sbjct: 460  LFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHACG 519

Query: 884  WAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHV 705
            WAYGMN+FDL++WKKQ+ITEVYHTWQKLN  RQLWKLGTLPPGLITFW RTFPIDRSWHV
Sbjct: 520  WAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTFPIDRSWHV 579

Query: 704  LGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531
            LGLGYNP+VN++EI+RAAVIHYNGN+KPWLEIG+PK++ YWAKF D+D  YLRDCNIN
Sbjct: 580  LGLGYNPSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNIN 637


>emb|CAN70213.1| hypothetical protein VITISV_038741 [Vitis vinifera]
          Length = 759

 Score =  973 bits (2516), Expect = 0.0
 Identities = 486/660 (73%), Positives = 546/660 (82%), Gaps = 11/660 (1%)
 Frame = -1

Query: 2477 KTMMVRKPVLFLLVVTVLAPIVLYTDRLG-SF-ISLDSRNEFIEDVSTLSFGGEIRKLNV 2304
            K M+ RK VLFLL+VTV +PIVLYTD LG SF  S  + +EF EDV+ L+ GG   KLN+
Sbjct: 120  KEMIKRKTVLFLLLVTVXSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNL 179

Query: 2303 LPQESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT------ 2142
            LPQESS TLKEPIGIVYSDN       S ++++S  +L L  S EHKTR LS T      
Sbjct: 180  LPQESSTTLKEPIGIVYSDND------SLDVDESAADLQLGGSVEHKTRXLSTTYEEGDR 233

Query: 2141 ---ENPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNAD 1971
               ENPI+QV DG     + D LQ    +      + +              GQQS    
Sbjct: 234  SQRENPIRQVTDG-----KDDSLQRGSELTSHNASQNSETEH----------GQQSAQTS 278

Query: 1970 GKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRV 1791
            GK +  EP K +  +P  + V+ DARV+ LKDQLIRAKV+L L ATRNN HFIRELR R+
Sbjct: 279  GKGDHKEPVKTRNEKPIDQTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARM 338

Query: 1790 KEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLR 1611
            KEVQRALGDATKDSELP+NAY+KLK MEQTLAKGKQIQDDCAAV+KKLRAILHS EEQLR
Sbjct: 339  KEVQRALGDATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLR 398

Query: 1610 VHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFS 1431
            VHKKQTM+LTQL AKTLPKGLHCLPLRLSTEYY+L+S++QQFPN++KLEDP L+HYALFS
Sbjct: 399  VHKKQTMYLTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFS 458

Query: 1430 DNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFT 1251
            DNILAAAVVVNSTV++AK+P+ HVFHIV+DRLNYAAMRMWFLANPPGKATIQVQNI+EFT
Sbjct: 459  DNILAAAVVVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFT 518

Query: 1250 WLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLN 1071
            WLNSSYSPVLKQLGS SMIDYYFK HR+NSDSNLKFRNPKYLSILNHLRFYLPEIFPKLN
Sbjct: 519  WLNSSYSPVLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLN 578

Query: 1070 KVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRA 891
            KVLFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD  A
Sbjct: 579  KVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHA 638

Query: 890  CGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSW 711
            CGWAYGMN+FDL++WKKQ+ITEVYHTWQKLN  RQLWKLGTLPPGLITFW RT PIDRSW
Sbjct: 639  CGWAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTXPIDRSW 698

Query: 710  HVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531
            HVLGLGYNP+VN++EI+RAAVIHYNGN+KPWLEIG+PK++ YWAKF D+D  YLRDCNIN
Sbjct: 699  HVLGLGYNPSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNIN 758


>ref|XP_002272372.2| PREDICTED: uncharacterized protein LOC100258406 [Vitis vinifera]
          Length = 1286

 Score =  972 bits (2513), Expect = 0.0
 Identities = 484/652 (74%), Positives = 544/652 (83%), Gaps = 11/652 (1%)
 Frame = -1

Query: 2453 VLFLLVVTVLAPIVLYTDRLG-SF-ISLDSRNEFIEDVSTLSFGGEIRKLNVLPQESSNT 2280
            +LFLL+VTVL+PIVLYTD LG SF  S  + +EF EDV+ L+ GG   KLN+LPQESS T
Sbjct: 655  LLFLLLVTVLSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNLLPQESSTT 714

Query: 2279 LKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT---------ENPIK 2127
            LKEPIGIVYSDN       S ++++S  +L L  S EHKTRVLS T         ENPI+
Sbjct: 715  LKEPIGIVYSDND------SLDVDESAADLQLGGSVEHKTRVLSTTYEEGDRSQRENPIR 768

Query: 2126 QVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTNSDEP 1947
            QV DG     + D LQ    +      + +              GQQS    GK +  EP
Sbjct: 769  QVTDG-----KDDNLQRGSELTSHNASQNSETEH----------GQQSAQTSGKGDHKEP 813

Query: 1946 PKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRALG 1767
             K +  +P  + V+ DARV+ LKDQLIRAKV+L L ATRNN HFIRELR R+KEVQRALG
Sbjct: 814  VKTRNEKPIDQTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARMKEVQRALG 873

Query: 1766 DATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQTMF 1587
            DATKDSELP+NAY+KLK MEQTLAKGKQIQDDCAAV+KKLRAILHS EEQLRVHKKQTM+
Sbjct: 874  DATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLRVHKKQTMY 933

Query: 1586 LTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAAAV 1407
            LTQL AKTLPKGLHCLPLRLSTEYY+L+S++QQFPN++KLEDP L+HYALFSDNILAAAV
Sbjct: 934  LTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFSDNILAAAV 993

Query: 1406 VVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSYSP 1227
            VVNSTV++AK+P+ HVFHIV+DRLNYAAMRMWFLANPPGKATIQVQNI+EFTWLNSSYSP
Sbjct: 994  VVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFTWLNSSYSP 1053

Query: 1226 VLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDD 1047
            VLKQLGS SMIDYYFK HR+NSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDD
Sbjct: 1054 VLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDD 1113

Query: 1046 IVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMN 867
            IVVQ+DLTGLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD  ACGWAYGMN
Sbjct: 1114 IVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHACGWAYGMN 1173

Query: 866  VFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLGLGYN 687
            +FDL++WKKQ+ITEVYHTWQKLN  RQLWKLGTLPPGLITFW RTFPIDRSWHVLGLGYN
Sbjct: 1174 IFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTFPIDRSWHVLGLGYN 1233

Query: 686  PTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531
            P+VN++EI+RAAVIHYNGN+KPWLEIG+PK++ YWAKF D+D  YLRDCNIN
Sbjct: 1234 PSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNIN 1285


>gb|EXB64628.1| putative galacturonosyltransferase 4 [Morus notabilis]
          Length = 657

 Score =  951 bits (2458), Expect = 0.0
 Identities = 475/671 (70%), Positives = 541/671 (80%), Gaps = 24/671 (3%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLD-SRNEFIEDVSTLSFGGEIRKLNVLPQ 2295
            MMVR  V+ +L VTV+APIVLYTDRLG+F S   S NEF+EDV+T+              
Sbjct: 1    MMVRNVVIGMLFVTVIAPIVLYTDRLGTFQSYSASTNEFVEDVTTV-------------- 46

Query: 2294 ESSNTLKEPIGIVYSDNSRNSTL--------FSDEIEDSVEELPLAESTEH-KTRVLSAT 2142
            E S  +KEPIGIVYSDNS  S           S +  +S ++  L +S EH   RVLS T
Sbjct: 47   EPSTKIKEPIGIVYSDNSNQSLPNSGDAVKESSTDTSNSEQDWQLGDSMEHVSARVLSTT 106

Query: 2141 --------ENPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAG-- 1992
                    EN I++V D  +EG++ + L I    GE KGE                +G  
Sbjct: 107  NDENNSRKENAIREVTDRDQEGDQ-ETLDIVDGEGETKGEAIDAEVKEIQQKVDDGSGDT 165

Query: 1991 ----QQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNN 1824
                +Q+     + +  EP K +  + N   V+PDARVRHLKDQL+RA+VYL L ATRNN
Sbjct: 166  EVKPEQTTETSSRVDKREPRKTRPEKQNDRTVIPDARVRHLKDQLVRARVYLSLPATRNN 225

Query: 1823 PHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLR 1644
            PHF RELR+R+KEVQRALGDA+KDSELPRNAYD+LKAMEQ+LAKGKQIQDDCAA +KKLR
Sbjct: 226  PHFTRELRVRMKEVQRALGDASKDSELPRNAYDRLKAMEQSLAKGKQIQDDCAAAVKKLR 285

Query: 1643 AILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLE 1464
            A+LHSTEEQLRVHKKQT+FLTQL AKTLPKGLHCLPLRL+TEYYSLN SEQ FPNE+KLE
Sbjct: 286  AMLHSTEEQLRVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNYSEQHFPNEDKLE 345

Query: 1463 DPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKA 1284
            DP LYHYALFSDN+LAAAVVVNST+ HAK+P+ HVFHIVTDRLNYAAMRMWFL NPPGKA
Sbjct: 346  DPQLYHYALFSDNVLAAAVVVNSTITHAKDPSKHVFHIVTDRLNYAAMRMWFLVNPPGKA 405

Query: 1283 TIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLR 1104
            T+QVQNIEEFTWLNSSYSPVLKQLGSQSMI+YYF+ HRA+SDSNLKFRNPKYLSILNHLR
Sbjct: 406  TVQVQNIEEFTWLNSSYSPVLKQLGSQSMINYYFRTHRASSDSNLKFRNPKYLSILNHLR 465

Query: 1103 FYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSN 924
            FYLP+IFPKL+KVLF+DDDIVVQ+DLT LWSL+LKG VNGAVETCGESFHRFDRYLNFSN
Sbjct: 466  FYLPQIFPKLDKVLFVDDDIVVQKDLTALWSLDLKGNVNGAVETCGESFHRFDRYLNFSN 525

Query: 923  PLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITF 744
            PLISKNFDPRACGWAYGMN+FDL+EWK+Q IT+VYH+WQKLN  RQLWKLGTLPPGLITF
Sbjct: 526  PLISKNFDPRACGWAYGMNIFDLKEWKRQQITDVYHSWQKLNHDRQLWKLGTLPPGLITF 585

Query: 743  WNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDY 564
            W RT+P+DRSWHVLGLGYNP V QK+I+RAAVIHYNGNMKPWLEIGIPKY+ YWAK+VDY
Sbjct: 586  WKRTYPLDRSWHVLGLGYNPNVGQKDIERAAVIHYNGNMKPWLEIGIPKYRNYWAKYVDY 645

Query: 563  DQVYLRDCNIN 531
            DQ+YLR+CN+N
Sbjct: 646  DQLYLRECNLN 656


>ref|XP_007026430.1| Galacturonosyltransferase 4 isoform 1 [Theobroma cacao]
            gi|508781796|gb|EOY29052.1| Galacturonosyltransferase 4
            isoform 1 [Theobroma cacao]
          Length = 626

 Score =  939 bits (2426), Expect = 0.0
 Identities = 473/656 (72%), Positives = 533/656 (81%), Gaps = 9/656 (1%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292
            M VR  VL LL VTV+API LYTDR+ +F    S  +F++DV+T +  G+ R+LNVLPQE
Sbjct: 1    MKVRHLVLGLLSVTVIAPIFLYTDRVATFNPSSSGRDFLDDVATFTLLGDTRRLNVLPQE 60

Query: 2291 SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHK-TRVLSATE-------- 2139
            +S  +KEP GIVYSD+S NS               + E+ EHK TRVLSAT+        
Sbjct: 61   TSTAIKEPAGIVYSDHSNNSFR------------KVTETREHKSTRVLSATDEERQPQLH 108

Query: 2138 NPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTN 1959
            NPI+QV D     N +  L  + +     G +                 Q + N D K +
Sbjct: 109  NPIRQVTDPA-PANLTTPLDSHPNASHHLGTKLEQQPT-----------QLAGNIDQKEH 156

Query: 1958 SDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQ 1779
            SD    +++A P       DA+VRHLKDQLIRAKVYL L A ++N H  RELRLR+KEV 
Sbjct: 157  SDNKT-SRLAEP------VDAQVRHLKDQLIRAKVYLSLPAIKSNQHVTRELRLRIKEVS 209

Query: 1778 RALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKK 1599
            RALGDATKDS+LP+NA+DKLKAMEQ+L KGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKK
Sbjct: 210  RALGDATKDSDLPKNAFDKLKAMEQSLEKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKK 269

Query: 1598 QTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNIL 1419
            QTMFLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+Q F N+EKLEDP LYHYALFSDN+L
Sbjct: 270  QTMFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQNFLNQEKLEDPRLYHYALFSDNVL 329

Query: 1418 AAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNS 1239
            AAAVVVNSTV+HAK P+NHVFHIVTDRLNYAAMRMWFL NPPGKATIQVQNIEEFTWLNS
Sbjct: 330  AAAVVVNSTVSHAKHPSNHVFHIVTDRLNYAAMRMWFLNNPPGKATIQVQNIEEFTWLNS 389

Query: 1238 SYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 1059
            SYSPVLKQLGS SMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF
Sbjct: 390  SYSPVLKQLGSPSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 449

Query: 1058 LDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWA 879
            LDDDIVV++D++GLWSL+LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFDP ACGWA
Sbjct: 450  LDDDIVVRKDISGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPHACGWA 509

Query: 878  YGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLG 699
            YGMN+FDLEEW++QNITEVYH WQKLN  RQLWKLGTLPPGLITFW RT+P+DRSWHVLG
Sbjct: 510  YGMNIFDLEEWRRQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKRTYPLDRSWHVLG 569

Query: 698  LGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531
            LGYNP VNQ+E++RAAVIHYNGN+KPWLEIGIPKYK YWAK+VDYD +YLRDCNIN
Sbjct: 570  LGYNPNVNQREVERAAVIHYNGNLKPWLEIGIPKYKNYWAKYVDYDNMYLRDCNIN 625


>ref|XP_007026431.1| Galacturonosyltransferase 4 isoform 2 [Theobroma cacao]
            gi|508781797|gb|EOY29053.1| Galacturonosyltransferase 4
            isoform 2 [Theobroma cacao]
          Length = 624

 Score =  937 bits (2423), Expect = 0.0
 Identities = 473/656 (72%), Positives = 532/656 (81%), Gaps = 9/656 (1%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292
            M VR  VL LL VTV+API LYTDR+ +F    S  +F++DV+T +  G+ R+LNVLPQE
Sbjct: 1    MKVRHLVLGLLSVTVIAPIFLYTDRVATFNPSSSGRDFLDDVATFTLLGDTRRLNVLPQE 60

Query: 2291 SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHK-TRVLSATE-------- 2139
            +S  +KEP GIVYSD+S NS                 E+ EHK TRVLSAT+        
Sbjct: 61   TSTAIKEPAGIVYSDHSNNSFR--------------KETREHKSTRVLSATDEERQPQLH 106

Query: 2138 NPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTN 1959
            NPI+QV D     N +  L  + +     G +                 Q + N D K +
Sbjct: 107  NPIRQVTDPA-PANLTTPLDSHPNASHHLGTKLEQQPT-----------QLAGNIDQKEH 154

Query: 1958 SDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQ 1779
            SD    +++A P       DA+VRHLKDQLIRAKVYL L A ++N H  RELRLR+KEV 
Sbjct: 155  SDNKT-SRLAEP------VDAQVRHLKDQLIRAKVYLSLPAIKSNQHVTRELRLRIKEVS 207

Query: 1778 RALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKK 1599
            RALGDATKDS+LP+NA+DKLKAMEQ+L KGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKK
Sbjct: 208  RALGDATKDSDLPKNAFDKLKAMEQSLEKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKK 267

Query: 1598 QTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNIL 1419
            QTMFLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+Q F N+EKLEDP LYHYALFSDN+L
Sbjct: 268  QTMFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQNFLNQEKLEDPRLYHYALFSDNVL 327

Query: 1418 AAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNS 1239
            AAAVVVNSTV+HAK P+NHVFHIVTDRLNYAAMRMWFL NPPGKATIQVQNIEEFTWLNS
Sbjct: 328  AAAVVVNSTVSHAKHPSNHVFHIVTDRLNYAAMRMWFLNNPPGKATIQVQNIEEFTWLNS 387

Query: 1238 SYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 1059
            SYSPVLKQLGS SMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF
Sbjct: 388  SYSPVLKQLGSPSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 447

Query: 1058 LDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWA 879
            LDDDIVV++D++GLWSL+LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFDP ACGWA
Sbjct: 448  LDDDIVVRKDISGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPHACGWA 507

Query: 878  YGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLG 699
            YGMN+FDLEEW++QNITEVYH WQKLN  RQLWKLGTLPPGLITFW RT+P+DRSWHVLG
Sbjct: 508  YGMNIFDLEEWRRQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKRTYPLDRSWHVLG 567

Query: 698  LGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531
            LGYNP VNQ+E++RAAVIHYNGN+KPWLEIGIPKYK YWAK+VDYD +YLRDCNIN
Sbjct: 568  LGYNPNVNQREVERAAVIHYNGNLKPWLEIGIPKYKNYWAKYVDYDNMYLRDCNIN 623


>ref|XP_002525229.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
            gi|223535526|gb|EEF37195.1| Glycosyltransferase
            QUASIMODO1, putative [Ricinus communis]
          Length = 647

 Score =  934 bits (2415), Expect = 0.0
 Identities = 462/650 (71%), Positives = 529/650 (81%), Gaps = 3/650 (0%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTD-RLGSFISLDSRNEFIEDVSTLSFGGEIRK-LNVLP 2298
            M +R  V+ +L+VTV+API+LYTD R  +F S  S  EF+EDV++L+  G+ R  LNVLP
Sbjct: 3    MKLRNLVVGMLLVTVIAPIILYTDNRFSTFNSSSSTTEFLEDVASLTLSGDSRDHLNVLP 62

Query: 2297 QESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHK-TRVLSATENPIKQV 2121
            QES++ LKEPIGIVY+DNS  S   +  I+     LP  ++ EHK TRVLSAT +  +  
Sbjct: 63   QESTSLLKEPIGIVYTDNSTISPPHTSTIQFHSSPLP-QDTREHKSTRVLSATNDQHQSQ 121

Query: 2120 NDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTNSDEPPK 1941
             D +     +           K  ++  +              QQS     K     PPK
Sbjct: 122  TDTIIRQVTNQQASRTTDANNKNSKQNPSDGGSQNAVV-----QQSSLTSEKVTEKGPPK 176

Query: 1940 NKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRALGDA 1761
            ++  +   +  +PDARVR L+DQLIRAKVYL L +T+NNPHF RELRLR+KEVQR LGDA
Sbjct: 177  SRTDKQTAQTPVPDARVRQLRDQLIRAKVYLSLPSTKNNPHFTRELRLRIKEVQRVLGDA 236

Query: 1760 TKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQTMFLT 1581
            TKDS+LP+NA DKLKAM+Q+LAKGKQ+QDDCA+V+KKLRA+LHS+EEQLRVHKKQTMFLT
Sbjct: 237  TKDSDLPKNANDKLKAMDQSLAKGKQVQDDCASVVKKLRAMLHSSEEQLRVHKKQTMFLT 296

Query: 1580 QLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAAAVVV 1401
            QL AKTLPKGLHC PLRL+ EYYSLNSS+QQFPN+EKLEDP LYHYALFSDN+LAAAVVV
Sbjct: 297  QLTAKTLPKGLHCFPLRLTNEYYSLNSSQQQFPNQEKLEDPQLYHYALFSDNVLAAAVVV 356

Query: 1400 NSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSYSPVL 1221
            NST+ HAK+P+ HVFHIVTDRLNYAAMRMWFL NPPG+ATIQVQNIEE TWLNSSYSPVL
Sbjct: 357  NSTITHAKDPSKHVFHIVTDRLNYAAMRMWFLVNPPGQATIQVQNIEELTWLNSSYSPVL 416

Query: 1220 KQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIV 1041
            KQLGSQSMIDYYF+ HRANSDSNLK+RNPKYLSILNHLRFYLPEIFP LNKVLFLDDDIV
Sbjct: 417  KQLGSQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIFPMLNKVLFLDDDIV 476

Query: 1040 VQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMNVF 861
            VQ+DLTGLWSL+LKG VNGAVETCGE FHRFDRYLNFSNPLISKNFDP ACGWAYGMNVF
Sbjct: 477  VQKDLTGLWSLDLKGNVNGAVETCGERFHRFDRYLNFSNPLISKNFDPHACGWAYGMNVF 536

Query: 860  DLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLGLGYNPT 681
            DL++WK+QNIT VYHTWQKLN  R LWKLGTLPPGLITFW +T+ IDRSWHVLGLGYNP 
Sbjct: 537  DLDQWKRQNITGVYHTWQKLNHDRLLWKLGTLPPGLITFWKQTYSIDRSWHVLGLGYNPN 596

Query: 680  VNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531
            VNQ+EI+RAAVIHYNGN+KPWLEIGI KY+ YWAK+VDYD VYLR+CNIN
Sbjct: 597  VNQREIERAAVIHYNGNLKPWLEIGISKYRNYWAKYVDYDHVYLRECNIN 646


>ref|XP_004293423.1| PREDICTED: probable galacturonosyltransferase 4-like [Fragaria vesca
            subsp. vesca]
          Length = 654

 Score =  923 bits (2385), Expect = 0.0
 Identities = 464/663 (69%), Positives = 524/663 (79%), Gaps = 16/663 (2%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGS-----------FISLDSRNEFIEDVSTLSFGG 2325
            MMVR  V+ LL VTV+API+LYTDRLGS           FIS  +++EF+EDV+   F  
Sbjct: 1    MMVRNVVMILLFVTVIAPIILYTDRLGSIHTSSSSSSFPFISA-AQDEFVEDVTAFPFNA 59

Query: 2324 EIR-KLNVLPQESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAE----STEHKT 2160
                +LN+LPQE S TLKEPIG+VYSDNS  S   + E + S            ST    
Sbjct: 60   HSGGRLNLLPQELS-TLKEPIGVVYSDNSTESFPETKESQASTNHSHQVSARVLSTTTNE 118

Query: 2159 RVLSATENPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSV 1980
            + LS  +NPI QV   + +GN+         +  + G +                 Q+S 
Sbjct: 119  QDLSQKDNPIIQVTQTLDQGNQL--------LAAESGAKTATSEKKTDNASQNTLNQKST 170

Query: 1979 NADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELR 1800
                K +  E  K    +   E  + D RVRHLKDQLIRA+VYL L A RNNP F RE+R
Sbjct: 171  QTSIKVDQRESVKTVSVKNIHETTITDGRVRHLKDQLIRARVYLSLPAARNNPQFAREIR 230

Query: 1799 LRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEE 1620
            LR+KEVQRAL DA+KDS+LPRNA D+LKAMEQTLAKGKQIQDDCAA++KKLRA+LHS +E
Sbjct: 231  LRIKEVQRALVDASKDSDLPRNANDRLKAMEQTLAKGKQIQDDCAAMVKKLRAMLHSMDE 290

Query: 1619 QLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYA 1440
            QLRVHKKQTMFLTQL AKT+PKGLHCLPLRL+TEYYSLNSS+  FPN+E+LEDP +YHYA
Sbjct: 291  QLRVHKKQTMFLTQLTAKTVPKGLHCLPLRLTTEYYSLNSSQMNFPNQERLEDPLMYHYA 350

Query: 1439 LFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIE 1260
            +FSDN+LA AVVVNSTV HAK+PA HVFHIVTDRLNYAAMRMWFL NPPG+ATIQVQNIE
Sbjct: 351  IFSDNVLATAVVVNSTVTHAKDPAKHVFHIVTDRLNYAAMRMWFLVNPPGQATIQVQNIE 410

Query: 1259 EFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFP 1080
            EFTWLNSSYSPVLKQLGS SMIDYYF+ HR++SDSNLKFRNPKYLSILNHLRFYLPEIFP
Sbjct: 411  EFTWLNSSYSPVLKQLGSASMIDYYFRTHRSSSDSNLKFRNPKYLSILNHLRFYLPEIFP 470

Query: 1079 KLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFD 900
            KLNKVLFLDDDIVV++DLTGLWSL+LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD
Sbjct: 471  KLNKVLFLDDDIVVRKDLTGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFD 530

Query: 899  PRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPID 720
            P ACGWAYGMNVFDLE+WKKQNITEVYH WQKLN  RQLWKLGTLPPGLITFW  T+P+D
Sbjct: 531  PHACGWAYGMNVFDLEQWKKQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKHTYPLD 590

Query: 719  RSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDC 540
            RSWHVLGLGYNP+V+QKEIDRAAVIHYNGNMKPWLEIGIPKY+ YWAK+VDYD  Y+R+C
Sbjct: 591  RSWHVLGLGYNPSVSQKEIDRAAVIHYNGNMKPWLEIGIPKYRSYWAKYVDYDHKYMREC 650

Query: 539  NIN 531
            NIN
Sbjct: 651  NIN 653


>ref|XP_007213869.1| hypothetical protein PRUPE_ppa018681mg [Prunus persica]
            gi|462409734|gb|EMJ15068.1| hypothetical protein
            PRUPE_ppa018681mg [Prunus persica]
          Length = 659

 Score =  916 bits (2367), Expect = 0.0
 Identities = 465/679 (68%), Positives = 522/679 (76%), Gaps = 32/679 (4%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292
            MMVR  V+ +L VTV+API+LYTDRLGSF            VS+ S      +LN+LPQE
Sbjct: 1    MMVRNVVMVMLFVTVIAPIILYTDRLGSF-----------QVSSSSC-----RLNLLPQE 44

Query: 2291 SSNTLKEPIGIVYSDNSRNSTL----FSDEIEDSVEELPLAESTEH-KTRVLSAT----- 2142
            SS TLKEP+G+VYSDNS NS       S     S ++ P  +S EH   RVLS T     
Sbjct: 45   SSTTLKEPVGVVYSDNSTNSYPETRGSSAHPNHSHKDGPSVDSMEHVSARVLSTTNDQNL 104

Query: 2141 ---ENPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNAD 1971
               +NPI+QV   + +GN     Q    +  K G                K  +QS    
Sbjct: 105  SQTDNPIRQVTQTLEQGN-----QFMSDLHAKGGGASEQSIDNASQTTEIKNERQSTQTS 159

Query: 1970 GKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRV 1791
             + +  +P K    + N E  +PD RVRHLKDQLIRAKVYL L ATRNNPHF RELRLR+
Sbjct: 160  SRVDQRKPKKTMTEKQNDETAVPDVRVRHLKDQLIRAKVYLSLPATRNNPHFTRELRLRI 219

Query: 1790 KEVQRALGDATK-------------------DSELPRNAYDKLKAMEQTLAKGKQIQDDC 1668
            KEV++  G   +                      +  +AYDKLKAMEQTL KGKQIQDDC
Sbjct: 220  KEVKKHFGRQPRILTCQGIFTPSDQVLGSGPSIHVVCDAYDKLKAMEQTLTKGKQIQDDC 279

Query: 1667 AAVIKKLRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQ 1488
            AA++KKLRA+LHS EEQLRVH+KQTMFLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+Q 
Sbjct: 280  AAMVKKLRAMLHSMEEQLRVHRKQTMFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQV 339

Query: 1487 FPNEEKLEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWF 1308
            FPN+EKLEDP LYHYALFSDN+LAAAVVVNST+ HAK+PANHVFHIVTDRLNYAAMRMWF
Sbjct: 340  FPNQEKLEDPLLYHYALFSDNVLAAAVVVNSTITHAKDPANHVFHIVTDRLNYAAMRMWF 399

Query: 1307 LANPPGKATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKY 1128
            L N PGKATIQVQNIEEFTWLNSSYSPVLKQLGS SMI+YYF+ HRANSDSNLKFRNPKY
Sbjct: 400  LVNSPGKATIQVQNIEEFTWLNSSYSPVLKQLGSASMINYYFRTHRANSDSNLKFRNPKY 459

Query: 1127 LSILNHLRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRF 948
            LSILNHLRFYLPE+FPKLNKVLFLDDD+VVQ+DLTGLW+L+LKG VNGAVETCGESFHRF
Sbjct: 460  LSILNHLRFYLPEVFPKLNKVLFLDDDVVVQKDLTGLWALDLKGNVNGAVETCGESFHRF 519

Query: 947  DRYLNFSNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGT 768
            DRYLNFSNPLISKNFD RACGWAYGMN+FDLEEWKKQNITEVYH WQ+LN  RQLWKLGT
Sbjct: 520  DRYLNFSNPLISKNFDARACGWAYGMNIFDLEEWKKQNITEVYHRWQELNHDRQLWKLGT 579

Query: 767  LPPGLITFWNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKG 588
            LPPGLITFW RT+P+DRSWHVLGLGYNP+VNQKEIDRAAVIHYNGNMKPWLEIGIPKY+ 
Sbjct: 580  LPPGLITFWKRTYPLDRSWHVLGLGYNPSVNQKEIDRAAVIHYNGNMKPWLEIGIPKYRN 639

Query: 587  YWAKFVDYDQVYLRDCNIN 531
            YW K+VDYD +Y+R+CNIN
Sbjct: 640  YWVKYVDYDHMYMRECNIN 658


>ref|XP_006350232.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1
            [Solanum tuberosum] gi|565367133|ref|XP_006350233.1|
            PREDICTED: probable galacturonosyltransferase 4-like
            isoform X2 [Solanum tuberosum]
            gi|565367135|ref|XP_006350234.1| PREDICTED: probable
            galacturonosyltransferase 4-like isoform X3 [Solanum
            tuberosum]
          Length = 680

 Score =  912 bits (2358), Expect = 0.0
 Identities = 457/675 (67%), Positives = 534/675 (79%), Gaps = 27/675 (4%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISL--DSRNEFIEDVSTLSFGGEIRKLNVLP 2298
            M +RKPVLFLL+VTV APIVLYTD LG++ +    SR EFIED+ST +FGG++R LNVLP
Sbjct: 3    MKLRKPVLFLLLVTVFAPIVLYTDTLGTYFTSPSSSRTEFIEDLSTFTFGGDVRPLNVLP 62

Query: 2297 QESSNTLKEPIGIVYSDNSRNST-----LFSDEIEDSVEELPLAESTEHKTRVLSATE-- 2139
            QESS +LKEP G VYS+NS +S        S E      +L  AES +H+T   S+ +  
Sbjct: 63   QESSTSLKEPRGDVYSENSSHSLSNASDTLSSEDARKTRQLTEAESMKHQTATGSSNDGV 122

Query: 2138 ------NPIKQVNDGVREGNESDGLQIN------------KSIGEKKGEERTNXXXXXXX 2013
                  + I QV   + E  ++D                 ++I +KK     +       
Sbjct: 123  EVAMNGSHISQVTANLHEPQQTDKTSPKLVSAGKNESIAMETISKKKTSPTDSNQTLDST 182

Query: 2012 XXXXKAGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGAT 1833
                +  Q++V   GK  S E  + K    N +IV PDARVR LKDQLIRAKVYL L AT
Sbjct: 183  KTETRHDQRTVQTSGKFVSGETARGKDEERNVQIVPPDARVRQLKDQLIRAKVYLSLSAT 242

Query: 1832 RNNPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIK 1653
            R+NPHFIRELRLR+KEV RALG+ATKDS+L R+A +KLKAMEQTLAKGKQIQDDCA ++K
Sbjct: 243  RSNPHFIRELRLRIKEVLRALGEATKDSDLSRSANEKLKAMEQTLAKGKQIQDDCATIVK 302

Query: 1652 KLRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEE 1473
            KLRA+LHS EEQLRVHKKQT++LT L AKTLPKGLHCLPLRLSTEY+ LNSS+Q FP++E
Sbjct: 303  KLRAMLHSAEEQLRVHKKQTLYLTHLTAKTLPKGLHCLPLRLSTEYFKLNSSQQHFPHQE 362

Query: 1472 KLEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPP 1293
             LE+P LYHYALFSDNILAAAVVVNSTV+HAK+P+ HVFHIVTDRLN+AAMRMWFLANPP
Sbjct: 363  NLENPKLYHYALFSDNILAAAVVVNSTVSHAKDPSKHVFHIVTDRLNFAAMRMWFLANPP 422

Query: 1292 GKATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILN 1113
              AT+ VQN+EEFTWLNSSYSPVLKQL SQSMIDYYF++ RA+SD N+KFRNPKYLSI+N
Sbjct: 423  KYATVDVQNVEEFTWLNSSYSPVLKQLNSQSMIDYYFRS-RADSDPNVKFRNPKYLSIMN 481

Query: 1112 HLRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLN 933
            HLRFYLPEIFPKL+KVLFLDDDIVVQ+DL GLWSL+LKGKV G VETCGESFHRFDRYLN
Sbjct: 482  HLRFYLPEIFPKLDKVLFLDDDIVVQKDLGGLWSLDLKGKVIGVVETCGESFHRFDRYLN 541

Query: 932  FSNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGL 753
            FSNPLISKNFDPRACGWA+GMN+ DL +W++QNITEVYH+WQ  N  RQLWKLGTLPPGL
Sbjct: 542  FSNPLISKNFDPRACGWAFGMNIIDLNQWRRQNITEVYHSWQNRNHERQLWKLGTLPPGL 601

Query: 752  ITFWNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKF 573
            ITFW RT+ +DRSWHVLGLGYNP V+QK+I RAAVIHYNGN+KPWLEI IPK++ YW+KF
Sbjct: 602  ITFWKRTYALDRSWHVLGLGYNPNVSQKDIQRAAVIHYNGNLKPWLEISIPKFRDYWSKF 661

Query: 572  VDYDQVYLRDCNINR 528
            VDYDQ +LR+CNIN+
Sbjct: 662  VDYDQAFLRECNINK 676


>ref|XP_006857626.1| hypothetical protein AMTR_s00061p00126570 [Amborella trichopoda]
            gi|548861722|gb|ERN19093.1| hypothetical protein
            AMTR_s00061p00126570 [Amborella trichopoda]
          Length = 672

 Score =  911 bits (2354), Expect = 0.0
 Identities = 453/671 (67%), Positives = 535/671 (79%), Gaps = 24/671 (3%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292
            M  R PVL LL  +VLAPIVLYTDRLGSF S  ++  F E+ S +++G +I KL VLPQE
Sbjct: 1    MKFRMPVLLLLCFSVLAPIVLYTDRLGSFSSSIAKAGFSEEFSPINYGRDINKLKVLPQE 60

Query: 2291 SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSATENPIKQVNDG 2112
            S N LKEP G+VY  +   S   S + E  +    + +S      V +  E  I +V+  
Sbjct: 61   SVNALKEPSGVVYLSDKDPSEAISVKEEPKMARSRVLQSNVKPLEVETHIEQVIDKVHRE 120

Query: 2111 VREGNESDG--------------LQINKS-IGEKK----GEERTNXXXXXXXXXXXKAGQ 1989
             + G E  G              LQ N+  IG K+    G +  +            A +
Sbjct: 121  EKNGQEIAGDSQAETIEESQQVLLQSNEQKIGAKREEQFGHQDASIKEEIGLSSRTDAEK 180

Query: 1988 QSVNA----DGKTNSDEPPKNKMARPN-GEIVMPDARVRHLKDQLIRAKVYLGLGATRNN 1824
            Q  +      GK++ D P +    R N  +  MPDARV HL+DQLI+AKVYL LG TR+N
Sbjct: 181  QEPDKPEIESGKSDPDGPSQPSPERQNDNKKPMPDARVHHLRDQLIKAKVYLSLGTTRSN 240

Query: 1823 PHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLR 1644
            PHFI+ELR+R++EVQRALGDATKDSELPR AYDKLKAME+TLAKGKQIQDDCAAVIKKLR
Sbjct: 241  PHFIKELRVRIREVQRALGDATKDSELPRGAYDKLKAMEETLAKGKQIQDDCAAVIKKLR 300

Query: 1643 AILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLE 1464
            AILHSTEEQLRVHKKQ+MFL QL+AKTLPKGLHCLPLRL+TEYYSLNS++QQFPN+EKLE
Sbjct: 301  AILHSTEEQLRVHKKQSMFLMQLSAKTLPKGLHCLPLRLTTEYYSLNSTQQQFPNQEKLE 360

Query: 1463 DPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKA 1284
            +P++YHYALFSDN+LAAAVVVNSTV++A++P NHVFHIVTDRLNYAAMRMWF+ANPPGKA
Sbjct: 361  NPNIYHYALFSDNVLAAAVVVNSTVSNARDPRNHVFHIVTDRLNYAAMRMWFIANPPGKA 420

Query: 1283 TIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLR 1104
            TIQVQ++EEFTWLNSSYSPVLKQLGS SMIDYYF+ HRAN DSNLK+RNPKYLSILNHLR
Sbjct: 421  TIQVQSVEEFTWLNSSYSPVLKQLGSTSMIDYYFRTHRANPDSNLKYRNPKYLSILNHLR 480

Query: 1103 FYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSN 924
            FY+PEIFPKL+KVLFLDDDIVVQRDLT LW ++LKGK+NGAVETC ESFHRFDRYLNFSN
Sbjct: 481  FYMPEIFPKLHKVLFLDDDIVVQRDLTQLWKIDLKGKINGAVETCRESFHRFDRYLNFSN 540

Query: 923  PLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITF 744
            PLISKNF+  ACGWA+GMN+FDL+EWKKQ ITE+YH+WQKLN  RQLWKLGTLPPGLITF
Sbjct: 541  PLISKNFEAHACGWAFGMNIFDLKEWKKQEITEIYHSWQKLNNDRQLWKLGTLPPGLITF 600

Query: 743  WNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDY 564
            +NRTFP++R WHVLGLGY+P+VNQ++I RAA IHYNGN+KPWLEIG+PK++GYW K+++Y
Sbjct: 601  YNRTFPLNRGWHVLGLGYDPSVNQRDIQRAAAIHYNGNLKPWLEIGLPKFRGYWQKYINY 660

Query: 563  DQVYLRDCNIN 531
            +Q YL+DCNIN
Sbjct: 661  NQPYLQDCNIN 671


>ref|XP_006467237.1| PREDICTED: probable galacturonosyltransferase 4-like [Citrus
            sinensis]
          Length = 646

 Score =  910 bits (2351), Expect = 0.0
 Identities = 453/654 (69%), Positives = 526/654 (80%), Gaps = 7/654 (1%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRL-GSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQ 2295
            M  R  V+ +L  TVLAPI+++T     S+ S     EF+ED++  + GG+ R LN+LPQ
Sbjct: 1    MKTRNLVVGMLCATVLAPILIFTSTFKDSYPSSSESGEFLEDLTAFTVGGDARHLNLLPQ 60

Query: 2294 ESSNTL--KEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKT-RVLSATENPIKQ 2124
            ESS TL  K+PI +V SD     +  S              S EHK+ RVLSAT N + Q
Sbjct: 61   ESSTTLSLKQPI-LVISDKIAQHSAHSQSQSQG--------SWEHKSARVLSATTNGLDQ 111

Query: 2123 --VNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNAD-GKTNSD 1953
               ++ +R+  +    QINK   +++ +   N              QQ  +   G     
Sbjct: 112  SKTDNPIRQVTDLTKTQINKHADQEQIKASDNHISAHHSQILDTKHQQESSLTYGVLEKK 171

Query: 1952 EPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRA 1773
            EP K    +   +   PD RVR LKDQLI+AKVYL L A RNN +F+RELRLR+KEVQRA
Sbjct: 172  EPTKINNEKQTEQTTPPDFRVRQLKDQLIKAKVYLSLPAMRNNANFVRELRLRIKEVQRA 231

Query: 1772 LGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQT 1593
            LGDATKDS+LPR A D+LKAMEQ+LAKGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKKQT
Sbjct: 232  LGDATKDSDLPRIANDRLKAMEQSLAKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKKQT 291

Query: 1592 MFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAA 1413
            +FLTQL AKTLPKGLHCLPLRL+TEYY+LNSS++ FPN+EKLEDP L+HYALFSDN+LAA
Sbjct: 292  LFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQRHFPNQEKLEDPRLFHYALFSDNVLAA 351

Query: 1412 AVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSY 1233
            AVVVNSTV HAK P+NHVFHIVTDRLNYAAMRMWFLANPPG+AT+QVQNIEEFTWLNSSY
Sbjct: 352  AVVVNSTVTHAKHPSNHVFHIVTDRLNYAAMRMWFLANPPGRATVQVQNIEEFTWLNSSY 411

Query: 1232 SPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLD 1053
            SPVLKQL SQSMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPE+FP+LNKVLFLD
Sbjct: 412  SPVLKQLNSQSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEVFPRLNKVLFLD 471

Query: 1052 DDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYG 873
            DD+VVQ+DL+GLWS++LKGKVNGAVETCGE+FHRFDRYLNFSNPLISKNFDPRACGWAYG
Sbjct: 472  DDVVVQKDLSGLWSIDLKGKVNGAVETCGETFHRFDRYLNFSNPLISKNFDPRACGWAYG 531

Query: 872  MNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLGLG 693
            MN+FDL+EW++QNIT+VYHTWQK+N  RQLWKLGTLPPGLITFW RT+P+DR WHVLGLG
Sbjct: 532  MNIFDLDEWRRQNITDVYHTWQKMNHDRQLWKLGTLPPGLITFWKRTYPLDRFWHVLGLG 591

Query: 692  YNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531
            YNP+VNQ++I+RAAVIHYNGNMKPWLEI IPKY+ YW K VDYDQ+YLR+CNIN
Sbjct: 592  YNPSVNQRDIERAAVIHYNGNMKPWLEINIPKYRNYWTKHVDYDQLYLRECNIN 645


>ref|XP_006398488.1| hypothetical protein EUTSA_v10000819mg [Eutrema salsugineum]
            gi|557099577|gb|ESQ39941.1| hypothetical protein
            EUTSA_v10000819mg [Eutrema salsugineum]
          Length = 631

 Score =  909 bits (2350), Expect = 0.0
 Identities = 439/645 (68%), Positives = 520/645 (80%), Gaps = 1/645 (0%)
 Frame = -1

Query: 2462 RKPVLFLLVVTVLAPIVLYTD-RLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQESS 2286
            R  VLF L++TV API+LYTD    SF +  S+ +F+EDV+ L+F  +  +LN+LP+ES 
Sbjct: 5    RNLVLFFLLLTVAAPILLYTDPSSASFKTPFSKRDFLEDVTALTFNSDENRLNLLPRESP 64

Query: 2285 NTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSATENPIKQVNDGVR 2106
              ++  +G+VYS  + +S+    E  D +    L+ + +      S TE+PIKQV DG  
Sbjct: 65   EVVRGVVGVVYSKQNSDSSR-RQEARDQLSARVLSTTDDDNQ---SQTEDPIKQVTDGAS 120

Query: 2105 EGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTNSDEPPKNKMAR 1926
            E ++ + +  +    + +                    Q +    GK +  EP      +
Sbjct: 121  EMDKPNDMHASDDNSQNREGMHV---------------QLTQQTSGKVDEQEPKSFGGEK 165

Query: 1925 PNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRALGDATKDSE 1746
              G +VMPD +V+HLKDQLIRAKVYL L A + N HF+RELRLR+KEVQRAL DATKDS+
Sbjct: 166  ERGNVVMPDTQVKHLKDQLIRAKVYLSLPAAKANAHFVRELRLRIKEVQRALSDATKDSD 225

Query: 1745 LPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQTMFLTQLAAK 1566
            LP+NA +KLKAMEQTLAKGKQIQDDC+ V+KKLRA+LHS EEQLRVHKKQTMFLTQL AK
Sbjct: 226  LPKNAVEKLKAMEQTLAKGKQIQDDCSTVVKKLRAMLHSAEEQLRVHKKQTMFLTQLTAK 285

Query: 1565 TLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAAAVVVNSTVN 1386
            T+PKGLHCLPLRL+T+YY+LNSSEQQFPN+E LED  LYHYALFSDN+LA +VVVNST+ 
Sbjct: 286  TIPKGLHCLPLRLTTDYYALNSSEQQFPNQENLEDNQLYHYALFSDNVLATSVVVNSTIT 345

Query: 1385 HAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSYSPVLKQLGS 1206
            +AK P+ HVFHIVTDRLNYAAMRMWFL NPPGKATIQVQN+EEFTWLNSSYSPVLKQL S
Sbjct: 346  NAKHPSKHVFHIVTDRLNYAAMRMWFLDNPPGKATIQVQNVEEFTWLNSSYSPVLKQLSS 405

Query: 1205 QSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIVVQRDL 1026
            QSMIDYYF+AH  NSD+NLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIVVQ+DL
Sbjct: 406  QSMIDYYFRAHHTNSDTNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIVVQKDL 465

Query: 1025 TGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMNVFDLEEW 846
            +GLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMN+FDL+EW
Sbjct: 466  SGLWSVDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMNIFDLDEW 525

Query: 845  KKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLGLGYNPTVNQKE 666
            KKQNITEVYH WQ LN+GR+LWKLGTLPPGLITFW RT+P+DR WH+LGLGYNP+VNQ++
Sbjct: 526  KKQNITEVYHRWQTLNEGRELWKLGTLPPGLITFWRRTYPLDRKWHILGLGYNPSVNQRD 585

Query: 665  IDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531
            I+R AVIHYNGN+KPWLEIGIP+Y+G+WAK VDY+ VYLR+CNIN
Sbjct: 586  IERGAVIHYNGNLKPWLEIGIPRYRGFWAKHVDYEHVYLRECNIN 630


>ref|XP_006350235.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X4
            [Solanum tuberosum]
          Length = 679

 Score =  908 bits (2347), Expect = 0.0
 Identities = 456/674 (67%), Positives = 536/674 (79%), Gaps = 26/674 (3%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISL--DSRNEFIEDVSTLSFGGEIRKLNVLP 2298
            M +RKPVLFLL+VTV APIVLYTD LG++ +    SR EFIED+ST +FGG++R LNVLP
Sbjct: 3    MKLRKPVLFLLLVTVFAPIVLYTDTLGTYFTSPSSSRTEFIEDLSTFTFGGDVRPLNVLP 62

Query: 2297 QESSNTLKEPIGIVYSDNSRNSTLFSDEI---EDSVEELPLAE-STEHKTRVLSATE--- 2139
            QESS +LKEP G VYS+NS +S   + +    ED+ +   L E S +H+T   S+ +   
Sbjct: 63   QESSTSLKEPRGDVYSENSSHSLSNASDTLSSEDARKTRQLTEESMKHQTATGSSNDGVE 122

Query: 2138 -----NPIKQVNDGVREGNESDGLQIN------------KSIGEKKGEERTNXXXXXXXX 2010
                 + I QV   + E  ++D                 ++I +KK     +        
Sbjct: 123  VAMNGSHISQVTANLHEPQQTDKTSPKLVSAGKNESIAMETISKKKTSPTDSNQTLDSTK 182

Query: 2009 XXXKAGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATR 1830
               +  Q++V   GK  S E  + K    N +IV PDARVR LKDQLIRAKVYL L ATR
Sbjct: 183  TETRHDQRTVQTSGKFVSGETARGKDEERNVQIVPPDARVRQLKDQLIRAKVYLSLSATR 242

Query: 1829 NNPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKK 1650
            +NPHFIRELRLR+KEV RALG+ATKDS+L R+A +KLKAMEQTLAKGKQIQDDCA ++KK
Sbjct: 243  SNPHFIRELRLRIKEVLRALGEATKDSDLSRSANEKLKAMEQTLAKGKQIQDDCATIVKK 302

Query: 1649 LRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEK 1470
            LRA+LHS EEQLRVHKKQT++LT L AKTLPKGLHCLPLRLSTEY+ LNSS+Q FP++E 
Sbjct: 303  LRAMLHSAEEQLRVHKKQTLYLTHLTAKTLPKGLHCLPLRLSTEYFKLNSSQQHFPHQEN 362

Query: 1469 LEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPG 1290
            LE+P LYHYALFSDNILAAAVVVNSTV+HAK+P+ HVFHIVTDRLN+AAMRMWFLANPP 
Sbjct: 363  LENPKLYHYALFSDNILAAAVVVNSTVSHAKDPSKHVFHIVTDRLNFAAMRMWFLANPPK 422

Query: 1289 KATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNH 1110
             AT+ VQN+EEFTWLNSSYSPVLKQL SQSMIDYYF++ RA+SD N+KFRNPKYLSI+NH
Sbjct: 423  YATVDVQNVEEFTWLNSSYSPVLKQLNSQSMIDYYFRS-RADSDPNVKFRNPKYLSIMNH 481

Query: 1109 LRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNF 930
            LRFYLPEIFPKL+KVLFLDDDIVVQ+DL GLWSL+LKGKV G VETCGESFHRFDRYLNF
Sbjct: 482  LRFYLPEIFPKLDKVLFLDDDIVVQKDLGGLWSLDLKGKVIGVVETCGESFHRFDRYLNF 541

Query: 929  SNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLI 750
            SNPLISKNFDPRACGWA+GMN+ DL +W++QNITEVYH+WQ  N  RQLWKLGTLPPGLI
Sbjct: 542  SNPLISKNFDPRACGWAFGMNIIDLNQWRRQNITEVYHSWQNRNHERQLWKLGTLPPGLI 601

Query: 749  TFWNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFV 570
            TFW RT+ +DRSWHVLGLGYNP V+QK+I RAAVIHYNGN+KPWLEI IPK++ YW+KFV
Sbjct: 602  TFWKRTYALDRSWHVLGLGYNPNVSQKDIQRAAVIHYNGNLKPWLEISIPKFRDYWSKFV 661

Query: 569  DYDQVYLRDCNINR 528
            DYDQ +LR+CNIN+
Sbjct: 662  DYDQAFLRECNINK 675


>ref|XP_006449976.1| hypothetical protein CICLE_v10014426mg [Citrus clementina]
            gi|557552587|gb|ESR63216.1| hypothetical protein
            CICLE_v10014426mg [Citrus clementina]
          Length = 646

 Score =  908 bits (2347), Expect = 0.0
 Identities = 453/654 (69%), Positives = 525/654 (80%), Gaps = 7/654 (1%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRL-GSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQ 2295
            M  R  V+ +L  TV API+++T     S+ S     EF+ED++  + GG+ R LN+LPQ
Sbjct: 1    MKTRNLVVGMLCATVFAPILIFTSTFKDSYPSSSESGEFLEDLTAFTVGGDARHLNLLPQ 60

Query: 2294 ESSNTL--KEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKT-RVLSATENPIKQ 2124
            ESS TL  K+PI +V SD     +  S              S EHK+ RVLSAT N + Q
Sbjct: 61   ESSTTLSLKQPI-LVISDKIAQHSAHSQSQSQG--------SWEHKSARVLSATTNGLDQ 111

Query: 2123 --VNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQ-SVNADGKTNSD 1953
               ++ +R+  +     INK   +++ +   N              QQ S    G     
Sbjct: 112  SKTDNPIRQVTDLTKTPINKHADQEQIKASDNHISAHHSQILDTKHQQESSQTYGVLEKK 171

Query: 1952 EPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRA 1773
            EP K    +   +   PD RVR LKDQLI+AKVYL L ATRNN +F+RELRLR+KEVQRA
Sbjct: 172  EPTKINNEKQTEQTAPPDFRVRQLKDQLIKAKVYLSLPATRNNANFVRELRLRIKEVQRA 231

Query: 1772 LGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQT 1593
            LGDA+KDS+LPR A D+LKAMEQ+LAKGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKKQT
Sbjct: 232  LGDASKDSDLPRIANDRLKAMEQSLAKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKKQT 291

Query: 1592 MFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAA 1413
            +FLTQL AKTLPKGLHCLPLRL+TEYYSLNSS++ FPN+EKLEDP L+HYALFSDN+LAA
Sbjct: 292  LFLTQLTAKTLPKGLHCLPLRLTTEYYSLNSSQRYFPNQEKLEDPRLFHYALFSDNVLAA 351

Query: 1412 AVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSY 1233
            AVVVNSTV HAK P+NHVFHIVTDRLNYAAMRMWFLANPPG+AT+QVQNIEEFTWLNSSY
Sbjct: 352  AVVVNSTVTHAKHPSNHVFHIVTDRLNYAAMRMWFLANPPGRATVQVQNIEEFTWLNSSY 411

Query: 1232 SPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLD 1053
            SPVLKQL SQSMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPE+FP+LNKVLFLD
Sbjct: 412  SPVLKQLNSQSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEVFPRLNKVLFLD 471

Query: 1052 DDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYG 873
            DD+VVQ+DL+GLWS++LKGKVNGAVETCGE+FHRFDRYLNFSNPLISKNFDPRACGWAYG
Sbjct: 472  DDVVVQKDLSGLWSIDLKGKVNGAVETCGETFHRFDRYLNFSNPLISKNFDPRACGWAYG 531

Query: 872  MNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLGLG 693
            MN+FDL+EW++QNIT+VYHTWQK+N  RQLWKLGTLPPGLITFW RT+P+DR WHVLGLG
Sbjct: 532  MNIFDLDEWRRQNITDVYHTWQKMNHDRQLWKLGTLPPGLITFWKRTYPLDRFWHVLGLG 591

Query: 692  YNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531
            YNP+VNQ++I+RAAVIHYNGNMKPWLEI IPKY+ YW K VDYDQ+YLR+CNIN
Sbjct: 592  YNPSVNQRDIERAAVIHYNGNMKPWLEINIPKYRNYWTKHVDYDQLYLRECNIN 645


>ref|XP_004236640.1| PREDICTED: probable galacturonosyltransferase 4-like [Solanum
            lycopersicum]
          Length = 680

 Score =  906 bits (2342), Expect = 0.0
 Identities = 457/676 (67%), Positives = 532/676 (78%), Gaps = 28/676 (4%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISL--DSRNEFIEDVSTLSFGGEIRKLNVLP 2298
            M +RKPVLFLL+VTV APIVLYTD LG++ +    SR EFIED+ST +FGG++R LNVLP
Sbjct: 3    MKLRKPVLFLLLVTVFAPIVLYTDTLGTYFTSPSSSRTEFIEDLSTFTFGGDVRPLNVLP 62

Query: 2297 QESSNTLKEPIGIVYSDNSRNS------TLFSDEIEDSVEELPLAESTEHKTRVLSATE- 2139
            QESS +LKEP G VYS+NS  +      TL S++   +  +L  AES +H+T   S+ + 
Sbjct: 63   QESSTSLKEPRGDVYSENSSQTISNASDTLGSEDARKT-RQLTEAESLKHQTATGSSNDG 121

Query: 2138 -------NPIKQVNDGVREGNESDGLQ-----------INKSIGEKKGEERTNXXXXXXX 2013
                   N I QV D + E  ++D              I      KK    T+       
Sbjct: 122  VEVAMNGNHISQVTDNLHEPQQTDKTSPKLVSAGKDESIAMETNSKKKTSSTDPNQTLDS 181

Query: 2012 XXXXKA-GQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGA 1836
                    Q +V   GK  S E  + K    N +IV PDARVR LKDQLIRAKVYL L A
Sbjct: 182  TKTETRHDQHTVQTSGKVVSGETARGKDEERNAQIVPPDARVRQLKDQLIRAKVYLSLSA 241

Query: 1835 TRNNPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVI 1656
            TR+NPHFIRELRLR+KE  RALG+ATKDS+L R+A +KLKAMEQTLAKGKQIQDDCA ++
Sbjct: 242  TRSNPHFIRELRLRIKESLRALGEATKDSDLSRSANEKLKAMEQTLAKGKQIQDDCATIV 301

Query: 1655 KKLRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNE 1476
            KKLRA+LHS EEQLRVHKKQT++LT L AKTLPKGLHCLPLRLSTEY+ LNSS+Q FP++
Sbjct: 302  KKLRAMLHSAEEQLRVHKKQTLYLTHLTAKTLPKGLHCLPLRLSTEYFKLNSSQQHFPHQ 361

Query: 1475 EKLEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANP 1296
            E LE+P LYHYALFSDNILAAAVVVNSTV+HAK+P+ HVFHIVTDRLN+AAMRMWFLAN 
Sbjct: 362  ENLENPKLYHYALFSDNILAAAVVVNSTVSHAKDPSKHVFHIVTDRLNFAAMRMWFLANQ 421

Query: 1295 PGKATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSIL 1116
            P  AT+ VQ++EEFTWLNSSYSPVLKQL SQSMIDYYF++ RA+SD N+KFRNPKYLSI+
Sbjct: 422  PKYATVDVQSVEEFTWLNSSYSPVLKQLNSQSMIDYYFRS-RADSDPNVKFRNPKYLSIM 480

Query: 1115 NHLRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYL 936
            NHLRFYLPEIFPKL+KVLFLDDDIVVQ+DL GLWSL+LKGKV G VETCGESFHRFDRYL
Sbjct: 481  NHLRFYLPEIFPKLDKVLFLDDDIVVQKDLGGLWSLDLKGKVIGVVETCGESFHRFDRYL 540

Query: 935  NFSNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPG 756
            NFSNPLIS+NFDPRACGWA+GMN+ DL EW++QNITEVYH+WQ  N  RQLWKLGTLPPG
Sbjct: 541  NFSNPLISENFDPRACGWAFGMNIIDLNEWRRQNITEVYHSWQNRNHERQLWKLGTLPPG 600

Query: 755  LITFWNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAK 576
            LITFW RT+ +DRSWHVLGLGYNP V+QK+I RAAVIHYNGN+KPWLEI IPK++ YW+K
Sbjct: 601  LITFWKRTYALDRSWHVLGLGYNPNVSQKDIQRAAVIHYNGNLKPWLEISIPKFRDYWSK 660

Query: 575  FVDYDQVYLRDCNINR 528
            FVDYDQ +LR+CNIN+
Sbjct: 661  FVDYDQTFLRECNINK 676


>gb|EYU33824.1| hypothetical protein MIMGU_mgv1a002625mg [Mimulus guttatus]
          Length = 653

 Score =  905 bits (2338), Expect = 0.0
 Identities = 449/660 (68%), Positives = 522/660 (79%), Gaps = 16/660 (2%)
 Frame = -1

Query: 2465 VRKPVLFLLVVTVLAPIVLYTDRLGSFIS-LDSRNEFIEDVSTLSFGGEIRKLNVLPQES 2289
            +RKPVLFLL+VTV APIVLYTD LG + +   SRNEF+ED ST +F GE+R LNVLPQES
Sbjct: 5    LRKPVLFLLLVTVFAPIVLYTDTLGLYSTPSSSRNEFMEDGSTFTFAGEVRPLNVLPQES 64

Query: 2288 SNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT------ENPIK 2127
            S TLKEP+G+VYS+NS  ++    E    +      ESTE KT  LS +      ENPI+
Sbjct: 65   STTLKEPLGVVYSENSIEASSNKSEESTRITRQLTEESTEDKTTNLSGSSGGSKDENPIR 124

Query: 2126 QVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTNSDEP 1947
            QV   V E     G + +      +  E  N              Q  V ++  +   E 
Sbjct: 125  QVISTVHEDEVGTGKEKSNKPQLHENTEIENR-------------QDDVTSENVSEKKEL 171

Query: 1946 PKNK---------MARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLR 1794
             + K          +R N   V+ DARVR LKDQLI+ +VYL L ATRNNPHFIR+LRLR
Sbjct: 172  KRIKHSSRTREEVKSRQNERAVLSDARVRQLKDQLIQGRVYLSLSATRNNPHFIRDLRLR 231

Query: 1793 VKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQL 1614
            +KEVQR LG+ATKDSELPRNA +K+KAMEQTL KGKQIQDDCAAV+KKLRA+LH  EEQL
Sbjct: 232  IKEVQRVLGEATKDSELPRNANEKMKAMEQTLLKGKQIQDDCAAVVKKLRAMLHLAEEQL 291

Query: 1613 RVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALF 1434
            R HKKQ +FLT L AKT+PKGLHC PLRLS+EY+ LNSS++ F N+E LE+P LYHYALF
Sbjct: 292  RAHKKQALFLTHLTAKTVPKGLHCFPLRLSSEYFMLNSSQRDFSNKENLENPKLYHYALF 351

Query: 1433 SDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEF 1254
            SDN+LAAAVVVNST+ HAK+P+ HVFH+VTDRLNYAAM+MWFLANPPGKATIQVQN+EEF
Sbjct: 352  SDNVLAAAVVVNSTITHAKDPSKHVFHVVTDRLNYAAMKMWFLANPPGKATIQVQNVEEF 411

Query: 1253 TWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKL 1074
            TWLNSSYSPVLKQL S+SMIDYYFK  RA SDSNLK+RNPKYLSI+NHLRFYLPEIFPKL
Sbjct: 412  TWLNSSYSPVLKQLSSRSMIDYYFKGKRAESDSNLKYRNPKYLSIMNHLRFYLPEIFPKL 471

Query: 1073 NKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPR 894
            +KVLFLDDDIVVQ+DL+G++SLNLKGKV G VETCGE+FHRFDRYLNFSNP+ISKNFDPR
Sbjct: 472  DKVLFLDDDIVVQKDLSGIFSLNLKGKVIGVVETCGETFHRFDRYLNFSNPIISKNFDPR 531

Query: 893  ACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRS 714
            ACGWA+GMN+FDL+EW+KQNITEVYH WQ LN+ R LWKLGTLPPGLITF NRT+ +D+S
Sbjct: 532  ACGWAFGMNIFDLDEWRKQNITEVYHKWQNLNEDRLLWKLGTLPPGLITFSNRTYALDKS 591

Query: 713  WHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNI 534
            WHVLGLGYNP V  K+I+RAAVIHYNGN+KPWLEIG+PK++ YWAKFVDYD  YLR+CNI
Sbjct: 592  WHVLGLGYNPNVPLKDIERAAVIHYNGNLKPWLEIGLPKFRNYWAKFVDYDHQYLRECNI 651


>ref|XP_004499343.1| PREDICTED: probable galacturonosyltransferase 4-like [Cicer
            arietinum]
          Length = 658

 Score =  898 bits (2320), Expect = 0.0
 Identities = 451/668 (67%), Positives = 521/668 (77%), Gaps = 23/668 (3%)
 Frame = -1

Query: 2462 RKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGG-EIRKLNVLPQESS 2286
            R  V  LL +TV+ PI+LYTDRL  F    + +EFI+DV+  + GG +   LN+LPQE+S
Sbjct: 5    RNIVFLLLCITVVTPILLYTDRLTDFNYPSAEHEFIQDVTAFAVGGAKSSHLNLLPQETS 64

Query: 2285 NTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT--------ENPI 2130
              LKEPIG+VYS+++ N           ++ LP  E     TRVLSAT        +NPI
Sbjct: 65   TILKEPIGVVYSEDTSN-----------IKSLPQREHV--LTRVLSATNEEDWSKGDNPI 111

Query: 2129 KQVNDGVREGNESDGLQ--------------INKSIGEKKGEERTNXXXXXXXXXXXKAG 1992
            K + DGV+  N+S  L+              I+    + K  + +N           K G
Sbjct: 112  KLLTDGVKPINQSSYLEKADITGGSVNGEDAIDVDDNDGKLTKSSNASDQVSETILTKQG 171

Query: 1991 QQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFI 1812
            +Q   +  K N+      +  + NG+    DARVR LKDQLI+AKVYL L A RNNPH  
Sbjct: 172  KQRTGSSSKGNNKGTILQETTKHNGQ-TPSDARVRKLKDQLIQAKVYLSLQAVRNNPHLT 230

Query: 1811 RELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILH 1632
            RELRLRVKEV R LGDA+KDS+LPRNA +++K+MEQ+L KG+QIQDDCA  +KKLRA+LH
Sbjct: 231  RELRLRVKEVSRTLGDASKDSDLPRNANERMKSMEQSLMKGRQIQDDCATSVKKLRAMLH 290

Query: 1631 STEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSL 1452
            S+E+QLRVHKKQT FLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+QQFPN+EKLEDP L
Sbjct: 291  SSEDQLRVHKKQTSFLTQLTAKTLPKGLHCLPLRLTTEYYNLNSSQQQFPNQEKLEDPGL 350

Query: 1451 YHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQV 1272
            YHYA+FSDNILA AVVVNST  HAK+ + HVFHIVTDRLNYAAMRMWFLANPPGKA IQV
Sbjct: 351  YHYAIFSDNILATAVVVNSTAAHAKDASKHVFHIVTDRLNYAAMRMWFLANPPGKAAIQV 410

Query: 1271 QNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLP 1092
            QNIE+FTWLNSSYSPVLKQLGS SMIDYYFK HRA SDSNLKFRNPKYLS+LNHLRFYLP
Sbjct: 411  QNIEDFTWLNSSYSPVLKQLGSPSMIDYYFKTHRATSDSNLKFRNPKYLSMLNHLRFYLP 470

Query: 1091 EIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLIS 912
            EIFPKL KVLFLDDD+VVQ+DLTGLWS++LKG VNGAVETC ESFHRFDRYLNFSNPL++
Sbjct: 471  EIFPKLKKVLFLDDDVVVQKDLTGLWSIDLKGNVNGAVETCAESFHRFDRYLNFSNPLVA 530

Query: 911  KNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRT 732
            +NFDPRACGWAYGMNVFDL  WKKQNITEVYH WQKLN  RQLWKLGTLPPGLITFW RT
Sbjct: 531  RNFDPRACGWAYGMNVFDLVGWKKQNITEVYHNWQKLNHDRQLWKLGTLPPGLITFWKRT 590

Query: 731  FPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVY 552
            FP++RSWHVLGLGYNP VNQK+I+RAAVIHYNGNMKPWLEI IPK++ YW K+VDYD VY
Sbjct: 591  FPLNRSWHVLGLGYNPNVNQKDIERAAVIHYNGNMKPWLEISIPKFRAYWTKYVDYDIVY 650

Query: 551  LRDCNINR 528
            LR+CNIN+
Sbjct: 651  LRECNINQ 658


>ref|XP_006600275.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1
            [Glycine max] gi|571532515|ref|XP_006600276.1| PREDICTED:
            probable galacturonosyltransferase 4-like isoform X2
            [Glycine max]
          Length = 661

 Score =  896 bits (2316), Expect = 0.0
 Identities = 447/667 (67%), Positives = 524/667 (78%), Gaps = 20/667 (2%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292
            ++ R  VL LL +T +APIVL+TDRLG+F    +  EFIE V+      +   LN+LPQE
Sbjct: 2    VVTRNIVLLLLSITFVAPIVLFTDRLGTFKYPFAEQEFIEAVTAFVSAADSGHLNLLPQE 61

Query: 2291 SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEH-KTRVLSAT--------E 2139
            SS   KEPIG+VY++++ N+       E+ +  L  A+  EH   RVLSAT        E
Sbjct: 62   SSTVFKEPIGLVYTEDTSNT-------ENLLHGLHFAKPGEHVSARVLSATNDEGQTKGE 114

Query: 2138 NPIKQVNDGVREGNESDGLQINKSIGEK-KGEERTNXXXXXXXXXXXK----------AG 1992
            NPIK V DG+ +GN++  +    + G+   GE+  +                        
Sbjct: 115  NPIKLVTDGINQGNQNSYMVKADTTGDSVNGEDAIDVDDNDGKLAKSSDLVSETTDTKQE 174

Query: 1991 QQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFI 1812
            Q+ + +  +    EP  ++  + N +   PDARV+ LKDQLI+A+VYL L A R+NPH  
Sbjct: 175  QEHIKSSSQVTQKEPILSEADKHNDQ-TPPDARVQQLKDQLIQARVYLSLQAVRSNPHLT 233

Query: 1811 RELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILH 1632
            RELRLRVKEV R LGDA+KDS+LPRNA +++KAMEQTL KG+QIQ+DCAA +KKLRA+LH
Sbjct: 234  RELRLRVKEVSRTLGDASKDSDLPRNANERMKAMEQTLMKGRQIQNDCAAAVKKLRAMLH 293

Query: 1631 STEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSL 1452
            STEEQL VHKKQT+FLTQL AKTLPKGLHCLPLRL+TEYYSLN+S+QQF N++KLEDP L
Sbjct: 294  STEEQLHVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNTSQQQFRNQQKLEDPRL 353

Query: 1451 YHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQV 1272
            YHYA+FSDNILA AVVVNSTV HAK+ + HVFHIVTDRLNYAAMRMWFL NPP KATIQV
Sbjct: 354  YHYAIFSDNILATAVVVNSTVAHAKDTSKHVFHIVTDRLNYAAMRMWFLVNPPQKATIQV 413

Query: 1271 QNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLP 1092
            QNIE+FTWLNSSYSPVLKQLGS SMID+YFK HRA+SDSNLKFRNPKYLSILNHLRFYLP
Sbjct: 414  QNIEDFTWLNSSYSPVLKQLGSPSMIDFYFKTHRASSDSNLKFRNPKYLSILNHLRFYLP 473

Query: 1091 EIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLIS 912
            EIFPKLNKVLFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGE FHRFDRYLNFSNPLI+
Sbjct: 474  EIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLNFSNPLIA 533

Query: 911  KNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRT 732
            KNFDPRACGWAYGMNVFDL +WK+QNIT+VYH WQK+N  RQLWKLGTLPPGLITFW RT
Sbjct: 534  KNFDPRACGWAYGMNVFDLVQWKRQNITDVYHKWQKMNHDRQLWKLGTLPPGLITFWKRT 593

Query: 731  FPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVY 552
            F + RSWHVLGLGYNP +NQKEI+RAAVIHYNGNMKPWLEI IPK++GYW K+VDY+ VY
Sbjct: 594  FQLHRSWHVLGLGYNPNINQKEIERAAVIHYNGNMKPWLEISIPKFRGYWTKYVDYNLVY 653

Query: 551  LRDCNIN 531
            LR+CNIN
Sbjct: 654  LRECNIN 660


>ref|XP_006584115.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1
            [Glycine max] gi|571468064|ref|XP_006584116.1| PREDICTED:
            probable galacturonosyltransferase 4-like isoform X2
            [Glycine max]
          Length = 664

 Score =  892 bits (2304), Expect = 0.0
 Identities = 446/670 (66%), Positives = 523/670 (78%), Gaps = 23/670 (3%)
 Frame = -1

Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292
            ++ R  VL LL +T +APIVLYTDR G+F    +  EFI+ V+      +   LN+LPQE
Sbjct: 2    VVTRNIVLLLLSITFVAPIVLYTDRFGTFKYPFAEQEFIDAVTAFVSAADSGHLNLLPQE 61

Query: 2291 SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEH-KTRVLSAT--------E 2139
            +S   KEPIG+VY++++ N+       ++ +  L  A+  EH   RVLSAT        E
Sbjct: 62   TSTVFKEPIGLVYTEDAANT-------KNLLHGLHFAKPGEHVSARVLSATKDEGQTKGE 114

Query: 2138 NPIKQVNDGVREGNESDGL--------------QINKSIGEKKGEERTNXXXXXXXXXXX 2001
            NPIK V DG+ +GN++  L               I+    + K  + ++           
Sbjct: 115  NPIKLVTDGINQGNQNSYLVKADITGDSVNGEDAIDVDDNDGKLAKSSDASDLASETMDT 174

Query: 2000 KAGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNP 1821
            K  QQ + +  +  + +  K   A  + +   PDARVR+LKDQLI+ +VYL L A RNNP
Sbjct: 175  KQEQQHIKSSSQV-TQKGSKLSEADKHIDQTPPDARVRYLKDQLIQVRVYLSLQAVRNNP 233

Query: 1820 HFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRA 1641
            H  RELRLRVKEV R LGDA+KDS+LPRNA +++KAMEQTL KG+QIQ+DCAA +KKLRA
Sbjct: 234  HLTRELRLRVKEVSRTLGDASKDSDLPRNANERMKAMEQTLMKGRQIQNDCAAAVKKLRA 293

Query: 1640 ILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLED 1461
            +LHSTEEQL VHKKQT+FLTQL AKTLPKGLHCLPLRL+TEYYSLN+S+QQ PN++KLE+
Sbjct: 294  MLHSTEEQLHVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNTSQQQLPNQQKLEN 353

Query: 1460 PSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKAT 1281
            P LYHYA+FSDNILA AVVVNSTV HAK+ +NHVFHIVTDRLNYAAMRMWFL NPP KAT
Sbjct: 354  PRLYHYAIFSDNILATAVVVNSTVAHAKDTSNHVFHIVTDRLNYAAMRMWFLVNPPKKAT 413

Query: 1280 IQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRF 1101
            IQVQNIE+FTWLNSSYSPVLKQLGS SM+D+YFK HRA+SDSNLKFRNPKYLSILNHLRF
Sbjct: 414  IQVQNIEDFTWLNSSYSPVLKQLGSPSMVDFYFKTHRASSDSNLKFRNPKYLSILNHLRF 473

Query: 1100 YLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNP 921
            YLPEIFPKLNKVLFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGE FHRFDRYLNFSNP
Sbjct: 474  YLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLNFSNP 533

Query: 920  LISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFW 741
             I+KNFDPRACGWAYGMNVFDL +WK+QNITEVYH WQKLN  RQLWKLGTLPPGLITFW
Sbjct: 534  HIAKNFDPRACGWAYGMNVFDLVQWKRQNITEVYHNWQKLNHDRQLWKLGTLPPGLITFW 593

Query: 740  NRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYD 561
             RTF ++RSWHVLGLGYNP +NQKEI+RAAVIHYNGNMKPWLEI  PK++GYW K+VDYD
Sbjct: 594  KRTFQLNRSWHVLGLGYNPNINQKEIERAAVIHYNGNMKPWLEISFPKFRGYWTKYVDYD 653

Query: 560  QVYLRDCNIN 531
             VYLR+CNIN
Sbjct: 654  LVYLRECNIN 663


Top