BLASTX nr result

ID: Akebia24_contig00003195 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00003195
         (2740 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI18781.3| unnamed protein product [Vitis vinifera]              972   0.0  
emb|CAN70213.1| hypothetical protein VITISV_038741 [Vitis vinifera]   967   0.0  
ref|XP_002272372.2| PREDICTED: uncharacterized protein LOC100258...   966   0.0  
gb|EXB64628.1| putative galacturonosyltransferase 4 [Morus notab...   947   0.0  
ref|XP_002525229.1| Glycosyltransferase QUASIMODO1, putative [Ri...   934   0.0  
ref|XP_007026430.1| Galacturonosyltransferase 4 isoform 1 [Theob...   933   0.0  
ref|XP_007026431.1| Galacturonosyltransferase 4 isoform 2 [Theob...   932   0.0  
ref|XP_004293423.1| PREDICTED: probable galacturonosyltransferas...   920   0.0  
ref|XP_006350232.1| PREDICTED: probable galacturonosyltransferas...   913   0.0  
ref|XP_006350235.1| PREDICTED: probable galacturonosyltransferas...   911   0.0  
ref|XP_007213869.1| hypothetical protein PRUPE_ppa018681mg [Prun...   911   0.0  
ref|XP_004236640.1| PREDICTED: probable galacturonosyltransferas...   907   0.0  
gb|EYU33824.1| hypothetical protein MIMGU_mgv1a002625mg [Mimulus...   905   0.0  
ref|XP_006467237.1| PREDICTED: probable galacturonosyltransferas...   904   0.0  
ref|XP_006398488.1| hypothetical protein EUTSA_v10000819mg [Eutr...   903   0.0  
ref|XP_006857626.1| hypothetical protein AMTR_s00061p00126570 [A...   903   0.0  
ref|XP_006449976.1| hypothetical protein CICLE_v10014426mg [Citr...   902   0.0  
ref|XP_006600275.1| PREDICTED: probable galacturonosyltransferas...   894   0.0  
ref|XP_004499343.1| PREDICTED: probable galacturonosyltransferas...   892   0.0  
ref|XP_006584115.1| PREDICTED: probable galacturonosyltransferas...   891   0.0  

>emb|CBI18781.3| unnamed protein product [Vitis vinifera]
          Length = 638

 Score =  973 bits (2514), Expect = 0.0
 Identities = 486/658 (73%), Positives = 548/658 (83%), Gaps = 11/658 (1%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLG-SF-ISLDSRNEFIEDVSTLSFGGEIRKLNVLP 601
            M+ RK VLFLL+VTVL+PIVLYTD LG SF  S  + +EF EDV+ L+ GG   KLN+LP
Sbjct: 1    MIKRKTVLFLLLVTVLSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNLLP 60

Query: 602  QESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT-------- 757
            QESS TLKEPIGIVYSDN       S ++++S  +L L  S EHKTRVLS T        
Sbjct: 61   QESSTTLKEPIGIVYSDND------SLDVDESAADLQLGGSVEHKTRVLSTTYEEGDRSQ 114

Query: 758  -ENPIKQVNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSVNADGK 934
             ENPI+QV DG ++ N   G ++      +  E                 GQQS    GK
Sbjct: 115  RENPIRQVTDG-KDDNLQRGSELTSHNASQNSETEH--------------GQQSAQTSGK 159

Query: 935  TNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKE 1114
             +  EP K +  +P  + V+ DARV+ LKDQLIRAKV+L L ATRNN HFIRELR R+KE
Sbjct: 160  GDHKEPVKTRNEKPIDQTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARMKE 219

Query: 1115 VQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVH 1294
            VQRALGDATKDSELP+NAY+KLK MEQTLAKGKQIQDDCAAV+KKLRAILHS EEQLRVH
Sbjct: 220  VQRALGDATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLRVH 279

Query: 1295 KKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDN 1474
            KKQTM+LTQL AKTLPKGLHCLPLRLSTEYY+L+S++QQFPN++KLEDP L+HYALFSDN
Sbjct: 280  KKQTMYLTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFSDN 339

Query: 1475 ILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWL 1654
            ILAAAVVVNSTV++AK+P+ HVFHIV+DRLNYAAMRMWFLANPPGKATIQVQNI+EFTWL
Sbjct: 340  ILAAAVVVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFTWL 399

Query: 1655 NSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKV 1834
            NSSYSPVLKQLGS SMIDYYFK HR+NSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKV
Sbjct: 400  NSSYSPVLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKV 459

Query: 1835 LFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACG 2014
            LFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD  ACG
Sbjct: 460  LFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHACG 519

Query: 2015 WAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSIDRSWHV 2194
            WAYGMN+FDL++WKKQ+ITEVYHTWQKLN  RQLWKLGTLPPGLITFW RTF IDRSWHV
Sbjct: 520  WAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTFPIDRSWHV 579

Query: 2195 LGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 2368
            LGLGYNP+V+++EI+RAAVIHYNGN+KPWLEIG+PK++ YWAKF D+D  YLRDCNIN
Sbjct: 580  LGLGYNPSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNIN 637


>emb|CAN70213.1| hypothetical protein VITISV_038741 [Vitis vinifera]
          Length = 759

 Score =  967 bits (2500), Expect = 0.0
 Identities = 484/660 (73%), Positives = 544/660 (82%), Gaps = 11/660 (1%)
 Frame = +2

Query: 422  KTMMVRKPVLFLLVVTVLAPIVLYTDRLG-SF-ISLDSRNEFIEDVSTLSFGGEIRKLNV 595
            K M+ RK VLFLL+VTV +PIVLYTD LG SF  S  + +EF EDV+ L+ GG   KLN+
Sbjct: 120  KEMIKRKTVLFLLLVTVXSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNL 179

Query: 596  LPQESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT------ 757
            LPQESS TLKEPIGIVYSDN       S ++++S  +L L  S EHKTR LS T      
Sbjct: 180  LPQESSTTLKEPIGIVYSDND------SLDVDESAADLQLGGSVEHKTRXLSTTYEEGDR 233

Query: 758  ---ENPIKQVNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSVNAD 928
               ENPI+QV DG       D LQ    +      + +              GQQS    
Sbjct: 234  SQRENPIRQVTDGK-----DDSLQRGSELTSHNASQNSETEH----------GQQSAQTS 278

Query: 929  GKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRV 1108
            GK +  EP K +  +P  + V+ DARV+ LKDQLIRAKV+L L ATRNN HFIRELR R+
Sbjct: 279  GKGDHKEPVKTRNEKPIDQTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARM 338

Query: 1109 KEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLR 1288
            KEVQRALGDATKDSELP+NAY+KLK MEQTLAKGKQIQDDCAAV+KKLRAILHS EEQLR
Sbjct: 339  KEVQRALGDATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLR 398

Query: 1289 VHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFS 1468
            VHKKQTM+LTQL AKTLPKGLHCLPLRLSTEYY+L+S++QQFPN++KLEDP L+HYALFS
Sbjct: 399  VHKKQTMYLTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFS 458

Query: 1469 DNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFT 1648
            DNILAAAVVVNSTV++AK+P+ HVFHIV+DRLNYAAMRMWFLANPPGKATIQVQNI+EFT
Sbjct: 459  DNILAAAVVVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFT 518

Query: 1649 WLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLN 1828
            WLNSSYSPVLKQLGS SMIDYYFK HR+NSDSNLKFRNPKYLSILNHLRFYLPEIFPKLN
Sbjct: 519  WLNSSYSPVLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLN 578

Query: 1829 KVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRA 2008
            KVLFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD  A
Sbjct: 579  KVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHA 638

Query: 2009 CGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSIDRSW 2188
            CGWAYGMN+FDL++WKKQ+ITEVYHTWQKLN  RQLWKLGTLPPGLITFW RT  IDRSW
Sbjct: 639  CGWAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTXPIDRSW 698

Query: 2189 HVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 2368
            HVLGLGYNP+V+++EI+RAAVIHYNGN+KPWLEIG+PK++ YWAKF D+D  YLRDCNIN
Sbjct: 699  HVLGLGYNPSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNIN 758


>ref|XP_002272372.2| PREDICTED: uncharacterized protein LOC100258406 [Vitis vinifera]
          Length = 1286

 Score =  966 bits (2498), Expect = 0.0
 Identities = 482/652 (73%), Positives = 544/652 (83%), Gaps = 11/652 (1%)
 Frame = +2

Query: 446  VLFLLVVTVLAPIVLYTDRLG-SF-ISLDSRNEFIEDVSTLSFGGEIRKLNVLPQESSNT 619
            +LFLL+VTVL+PIVLYTD LG SF  S  + +EF EDV+ L+ GG   KLN+LPQESS T
Sbjct: 655  LLFLLLVTVLSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNLLPQESSTT 714

Query: 620  LKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT---------ENPIK 772
            LKEPIGIVYSDN       S ++++S  +L L  S EHKTRVLS T         ENPI+
Sbjct: 715  LKEPIGIVYSDND------SLDVDESAADLQLGGSVEHKTRVLSTTYEEGDRSQRENPIR 768

Query: 773  QVNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSVNADGKTNSDEP 952
            QV DG ++ N   G ++      +  E                 GQQS    GK +  EP
Sbjct: 769  QVTDG-KDDNLQRGSELTSHNASQNSETEH--------------GQQSAQTSGKGDHKEP 813

Query: 953  PKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRALG 1132
             K +  +P  + V+ DARV+ LKDQLIRAKV+L L ATRNN HFIRELR R+KEVQRALG
Sbjct: 814  VKTRNEKPIDQTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARMKEVQRALG 873

Query: 1133 DATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQTMF 1312
            DATKDSELP+NAY+KLK MEQTLAKGKQIQDDCAAV+KKLRAILHS EEQLRVHKKQTM+
Sbjct: 874  DATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLRVHKKQTMY 933

Query: 1313 LTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAAAV 1492
            LTQL AKTLPKGLHCLPLRLSTEYY+L+S++QQFPN++KLEDP L+HYALFSDNILAAAV
Sbjct: 934  LTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFSDNILAAAV 993

Query: 1493 VVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSYSP 1672
            VVNSTV++AK+P+ HVFHIV+DRLNYAAMRMWFLANPPGKATIQVQNI+EFTWLNSSYSP
Sbjct: 994  VVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFTWLNSSYSP 1053

Query: 1673 VLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDD 1852
            VLKQLGS SMIDYYFK HR+NSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDD
Sbjct: 1054 VLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDD 1113

Query: 1853 IVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMN 2032
            IVVQ+DLTGLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD  ACGWAYGMN
Sbjct: 1114 IVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHACGWAYGMN 1173

Query: 2033 VFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSIDRSWHVLGLGYN 2212
            +FDL++WKKQ+ITEVYHTWQKLN  RQLWKLGTLPPGLITFW RTF IDRSWHVLGLGYN
Sbjct: 1174 IFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTFPIDRSWHVLGLGYN 1233

Query: 2213 PTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 2368
            P+V+++EI+RAAVIHYNGN+KPWLEIG+PK++ YWAKF D+D  YLRDCNIN
Sbjct: 1234 PSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNIN 1285


>gb|EXB64628.1| putative galacturonosyltransferase 4 [Morus notabilis]
          Length = 657

 Score =  947 bits (2448), Expect = 0.0
 Identities = 474/671 (70%), Positives = 539/671 (80%), Gaps = 24/671 (3%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLD-SRNEFIEDVSTLSFGGEIRKLNVLPQ 604
            MMVR  V+ +L VTV+APIVLYTDRLG+F S   S NEF+EDV+T+              
Sbjct: 1    MMVRNVVIGMLFVTVIAPIVLYTDRLGTFQSYSASTNEFVEDVTTV-------------- 46

Query: 605  ESSNTLKEPIGIVYSDNSRNSTL--------FSDEIEDSVEELPLAESTEH-KTRVLSAT 757
            E S  +KEPIGIVYSDNS  S           S +  +S ++  L +S EH   RVLS T
Sbjct: 47   EPSTKIKEPIGIVYSDNSNQSLPNSGDAVKESSTDTSNSEQDWQLGDSMEHVSARVLSTT 106

Query: 758  --------ENPIKQVNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAG-- 907
                    EN I++V D  +EG+  + L I    GE KGE                +G  
Sbjct: 107  NDENNSRKENAIREVTDRDQEGD-QETLDIVDGEGETKGEAIDAEVKEIQQKVDDGSGDT 165

Query: 908  ----QQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNN 1075
                +Q+     + +  EP K +  + N   V+PDARVRHLKDQL+RA+VYL L ATRNN
Sbjct: 166  EVKPEQTTETSSRVDKREPRKTRPEKQNDRTVIPDARVRHLKDQLVRARVYLSLPATRNN 225

Query: 1076 PHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLR 1255
            PHF RELR+R+KEVQRALGDA+KDSELPRNAYD+LKAMEQ+LAKGKQIQDDCAA +KKLR
Sbjct: 226  PHFTRELRVRMKEVQRALGDASKDSELPRNAYDRLKAMEQSLAKGKQIQDDCAAAVKKLR 285

Query: 1256 AILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLE 1435
            A+LHSTEEQLRVHKKQT+FLTQL AKTLPKGLHCLPLRL+TEYYSLN SEQ FPNE+KLE
Sbjct: 286  AMLHSTEEQLRVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNYSEQHFPNEDKLE 345

Query: 1436 DPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKA 1615
            DP LYHYALFSDN+LAAAVVVNST+ HAK+P+ HVFHIVTDRLNYAAMRMWFL NPPGKA
Sbjct: 346  DPQLYHYALFSDNVLAAAVVVNSTITHAKDPSKHVFHIVTDRLNYAAMRMWFLVNPPGKA 405

Query: 1616 TIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLR 1795
            T+QVQNIEEFTWLNSSYSPVLKQLGSQSMI+YYF+ HRA+SDSNLKFRNPKYLSILNHLR
Sbjct: 406  TVQVQNIEEFTWLNSSYSPVLKQLGSQSMINYYFRTHRASSDSNLKFRNPKYLSILNHLR 465

Query: 1796 FYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSN 1975
            FYLP+IFPKL+KVLF+DDDIVVQ+DLT LWSL+LKG VNGAVETCGESFHRFDRYLNFSN
Sbjct: 466  FYLPQIFPKLDKVLFVDDDIVVQKDLTALWSLDLKGNVNGAVETCGESFHRFDRYLNFSN 525

Query: 1976 PLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITF 2155
            PLISKNFDPRACGWAYGMN+FDL+EWK+Q IT+VYH+WQKLN  RQLWKLGTLPPGLITF
Sbjct: 526  PLISKNFDPRACGWAYGMNIFDLKEWKRQQITDVYHSWQKLNHDRQLWKLGTLPPGLITF 585

Query: 2156 WNRTFSIDRSWHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDY 2335
            W RT+ +DRSWHVLGLGYNP V QK+I+RAAVIHYNGNMKPWLEIGIPKY+ YWAK+VDY
Sbjct: 586  WKRTYPLDRSWHVLGLGYNPNVGQKDIERAAVIHYNGNMKPWLEIGIPKYRNYWAKYVDY 645

Query: 2336 DQVYLRDCNIN 2368
            DQ+YLR+CN+N
Sbjct: 646  DQLYLRECNLN 656


>ref|XP_002525229.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
            gi|223535526|gb|EEF37195.1| Glycosyltransferase
            QUASIMODO1, putative [Ricinus communis]
          Length = 647

 Score =  934 bits (2414), Expect = 0.0
 Identities = 462/650 (71%), Positives = 530/650 (81%), Gaps = 3/650 (0%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTD-RLGSFISLDSRNEFIEDVSTLSFGGEIRK-LNVLP 601
            M +R  V+ +L+VTV+API+LYTD R  +F S  S  EF+EDV++L+  G+ R  LNVLP
Sbjct: 3    MKLRNLVVGMLLVTVIAPIILYTDNRFSTFNSSSSTTEFLEDVASLTLSGDSRDHLNVLP 62

Query: 602  QESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHK-TRVLSATENPIKQV 778
            QES++ LKEPIGIVY+DNS  S   +  I+     LP  ++ EHK TRVLSAT +  +  
Sbjct: 63   QESTSLLKEPIGIVYTDNSTISPPHTSTIQFHSSPLP-QDTREHKSTRVLSATNDQHQSQ 121

Query: 779  NDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSVNADGKTNSDEPPK 958
             D +     +           K  ++  +              QQS     K     PPK
Sbjct: 122  TDTIIRQVTNQQASRTTDANNKNSKQNPSDGGSQNAVV-----QQSSLTSEKVTEKGPPK 176

Query: 959  NKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRALGDA 1138
            ++  +   +  +PDARVR L+DQLIRAKVYL L +T+NNPHF RELRLR+KEVQR LGDA
Sbjct: 177  SRTDKQTAQTPVPDARVRQLRDQLIRAKVYLSLPSTKNNPHFTRELRLRIKEVQRVLGDA 236

Query: 1139 TKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQTMFLT 1318
            TKDS+LP+NA DKLKAM+Q+LAKGKQ+QDDCA+V+KKLRA+LHS+EEQLRVHKKQTMFLT
Sbjct: 237  TKDSDLPKNANDKLKAMDQSLAKGKQVQDDCASVVKKLRAMLHSSEEQLRVHKKQTMFLT 296

Query: 1319 QLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAAAVVV 1498
            QL AKTLPKGLHC PLRL+ EYYSLNSS+QQFPN+EKLEDP LYHYALFSDN+LAAAVVV
Sbjct: 297  QLTAKTLPKGLHCFPLRLTNEYYSLNSSQQQFPNQEKLEDPQLYHYALFSDNVLAAAVVV 356

Query: 1499 NSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSYSPVL 1678
            NST+ HAK+P+ HVFHIVTDRLNYAAMRMWFL NPPG+ATIQVQNIEE TWLNSSYSPVL
Sbjct: 357  NSTITHAKDPSKHVFHIVTDRLNYAAMRMWFLVNPPGQATIQVQNIEELTWLNSSYSPVL 416

Query: 1679 KQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIV 1858
            KQLGSQSMIDYYF+ HRANSDSNLK+RNPKYLSILNHLRFYLPEIFP LNKVLFLDDDIV
Sbjct: 417  KQLGSQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIFPMLNKVLFLDDDIV 476

Query: 1859 VQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMNVF 2038
            VQ+DLTGLWSL+LKG VNGAVETCGE FHRFDRYLNFSNPLISKNFDP ACGWAYGMNVF
Sbjct: 477  VQKDLTGLWSLDLKGNVNGAVETCGERFHRFDRYLNFSNPLISKNFDPHACGWAYGMNVF 536

Query: 2039 DLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSIDRSWHVLGLGYNPT 2218
            DL++WK+QNIT VYHTWQKLN  R LWKLGTLPPGLITFW +T+SIDRSWHVLGLGYNP 
Sbjct: 537  DLDQWKRQNITGVYHTWQKLNHDRLLWKLGTLPPGLITFWKQTYSIDRSWHVLGLGYNPN 596

Query: 2219 VSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 2368
            V+Q+EI+RAAVIHYNGN+KPWLEIGI KY+ YWAK+VDYD VYLR+CNIN
Sbjct: 597  VNQREIERAAVIHYNGNLKPWLEIGISKYRNYWAKYVDYDHVYLRECNIN 646


>ref|XP_007026430.1| Galacturonosyltransferase 4 isoform 1 [Theobroma cacao]
            gi|508781796|gb|EOY29052.1| Galacturonosyltransferase 4
            isoform 1 [Theobroma cacao]
          Length = 626

 Score =  933 bits (2412), Expect = 0.0
 Identities = 471/656 (71%), Positives = 532/656 (81%), Gaps = 9/656 (1%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 607
            M VR  VL LL VTV+API LYTDR+ +F    S  +F++DV+T +  G+ R+LNVLPQE
Sbjct: 1    MKVRHLVLGLLSVTVIAPIFLYTDRVATFNPSSSGRDFLDDVATFTLLGDTRRLNVLPQE 60

Query: 608  SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHK-TRVLSATE-------- 760
            +S  +KEP GIVYSD+S NS               + E+ EHK TRVLSAT+        
Sbjct: 61   TSTAIKEPAGIVYSDHSNNSFR------------KVTETREHKSTRVLSATDEERQPQLH 108

Query: 761  NPIKQVNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSVNADGKTN 940
            NPI+QV D     N +  L  + +     G +                 Q + N D K +
Sbjct: 109  NPIRQVTDPA-PANLTTPLDSHPNASHHLGTKLEQQPT-----------QLAGNIDQKEH 156

Query: 941  SDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQ 1120
            SD    +++A P       DA+VRHLKDQLIRAKVYL L A ++N H  RELRLR+KEV 
Sbjct: 157  SDNKT-SRLAEP------VDAQVRHLKDQLIRAKVYLSLPAIKSNQHVTRELRLRIKEVS 209

Query: 1121 RALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKK 1300
            RALGDATKDS+LP+NA+DKLKAMEQ+L KGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKK
Sbjct: 210  RALGDATKDSDLPKNAFDKLKAMEQSLEKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKK 269

Query: 1301 QTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNIL 1480
            QTMFLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+Q F N+EKLEDP LYHYALFSDN+L
Sbjct: 270  QTMFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQNFLNQEKLEDPRLYHYALFSDNVL 329

Query: 1481 AAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNS 1660
            AAAVVVNSTV+HAK P+NHVFHIVTDRLNYAAMRMWFL NPPGKATIQVQNIEEFTWLNS
Sbjct: 330  AAAVVVNSTVSHAKHPSNHVFHIVTDRLNYAAMRMWFLNNPPGKATIQVQNIEEFTWLNS 389

Query: 1661 SYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 1840
            SYSPVLKQLGS SMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF
Sbjct: 390  SYSPVLKQLGSPSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 449

Query: 1841 LDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWA 2020
            LDDDIVV++D++GLWSL+LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFDP ACGWA
Sbjct: 450  LDDDIVVRKDISGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPHACGWA 509

Query: 2021 YGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSIDRSWHVLG 2200
            YGMN+FDLEEW++QNITEVYH WQKLN  RQLWKLGTLPPGLITFW RT+ +DRSWHVLG
Sbjct: 510  YGMNIFDLEEWRRQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKRTYPLDRSWHVLG 569

Query: 2201 LGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 2368
            LGYNP V+Q+E++RAAVIHYNGN+KPWLEIGIPKYK YWAK+VDYD +YLRDCNIN
Sbjct: 570  LGYNPNVNQREVERAAVIHYNGNLKPWLEIGIPKYKNYWAKYVDYDNMYLRDCNIN 625


>ref|XP_007026431.1| Galacturonosyltransferase 4 isoform 2 [Theobroma cacao]
            gi|508781797|gb|EOY29053.1| Galacturonosyltransferase 4
            isoform 2 [Theobroma cacao]
          Length = 624

 Score =  932 bits (2409), Expect = 0.0
 Identities = 471/656 (71%), Positives = 531/656 (80%), Gaps = 9/656 (1%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 607
            M VR  VL LL VTV+API LYTDR+ +F    S  +F++DV+T +  G+ R+LNVLPQE
Sbjct: 1    MKVRHLVLGLLSVTVIAPIFLYTDRVATFNPSSSGRDFLDDVATFTLLGDTRRLNVLPQE 60

Query: 608  SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHK-TRVLSATE-------- 760
            +S  +KEP GIVYSD+S NS                 E+ EHK TRVLSAT+        
Sbjct: 61   TSTAIKEPAGIVYSDHSNNSFR--------------KETREHKSTRVLSATDEERQPQLH 106

Query: 761  NPIKQVNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSVNADGKTN 940
            NPI+QV D     N +  L  + +     G +                 Q + N D K +
Sbjct: 107  NPIRQVTDPA-PANLTTPLDSHPNASHHLGTKLEQQPT-----------QLAGNIDQKEH 154

Query: 941  SDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQ 1120
            SD    +++A P       DA+VRHLKDQLIRAKVYL L A ++N H  RELRLR+KEV 
Sbjct: 155  SDNKT-SRLAEP------VDAQVRHLKDQLIRAKVYLSLPAIKSNQHVTRELRLRIKEVS 207

Query: 1121 RALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKK 1300
            RALGDATKDS+LP+NA+DKLKAMEQ+L KGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKK
Sbjct: 208  RALGDATKDSDLPKNAFDKLKAMEQSLEKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKK 267

Query: 1301 QTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNIL 1480
            QTMFLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+Q F N+EKLEDP LYHYALFSDN+L
Sbjct: 268  QTMFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQNFLNQEKLEDPRLYHYALFSDNVL 327

Query: 1481 AAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNS 1660
            AAAVVVNSTV+HAK P+NHVFHIVTDRLNYAAMRMWFL NPPGKATIQVQNIEEFTWLNS
Sbjct: 328  AAAVVVNSTVSHAKHPSNHVFHIVTDRLNYAAMRMWFLNNPPGKATIQVQNIEEFTWLNS 387

Query: 1661 SYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 1840
            SYSPVLKQLGS SMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF
Sbjct: 388  SYSPVLKQLGSPSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 447

Query: 1841 LDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWA 2020
            LDDDIVV++D++GLWSL+LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFDP ACGWA
Sbjct: 448  LDDDIVVRKDISGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPHACGWA 507

Query: 2021 YGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSIDRSWHVLG 2200
            YGMN+FDLEEW++QNITEVYH WQKLN  RQLWKLGTLPPGLITFW RT+ +DRSWHVLG
Sbjct: 508  YGMNIFDLEEWRRQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKRTYPLDRSWHVLG 567

Query: 2201 LGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 2368
            LGYNP V+Q+E++RAAVIHYNGN+KPWLEIGIPKYK YWAK+VDYD +YLRDCNIN
Sbjct: 568  LGYNPNVNQREVERAAVIHYNGNLKPWLEIGIPKYKNYWAKYVDYDNMYLRDCNIN 623


>ref|XP_004293423.1| PREDICTED: probable galacturonosyltransferase 4-like [Fragaria vesca
            subsp. vesca]
          Length = 654

 Score =  920 bits (2379), Expect = 0.0
 Identities = 464/663 (69%), Positives = 523/663 (78%), Gaps = 16/663 (2%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGS-----------FISLDSRNEFIEDVSTLSFGG 574
            MMVR  V+ LL VTV+API+LYTDRLGS           FIS  +++EF+EDV+   F  
Sbjct: 1    MMVRNVVMILLFVTVIAPIILYTDRLGSIHTSSSSSSFPFISA-AQDEFVEDVTAFPFNA 59

Query: 575  EIR-KLNVLPQESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAE----STEHKT 739
                +LN+LPQE S TLKEPIG+VYSDNS  S   + E + S            ST    
Sbjct: 60   HSGGRLNLLPQELS-TLKEPIGVVYSDNSTESFPETKESQASTNHSHQVSARVLSTTTNE 118

Query: 740  RVLSATENPIKQVNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSV 919
            + LS  +NPI QV   + +GN        + +  + G +                 Q+S 
Sbjct: 119  QDLSQKDNPIIQVTQTLDQGN--------QLLAAESGAKTATSEKKTDNASQNTLNQKST 170

Query: 920  NADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELR 1099
                K +  E  K    +   E  + D RVRHLKDQLIRA+VYL L A RNNP F RE+R
Sbjct: 171  QTSIKVDQRESVKTVSVKNIHETTITDGRVRHLKDQLIRARVYLSLPAARNNPQFAREIR 230

Query: 1100 LRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEE 1279
            LR+KEVQRAL DA+KDS+LPRNA D+LKAMEQTLAKGKQIQDDCAA++KKLRA+LHS +E
Sbjct: 231  LRIKEVQRALVDASKDSDLPRNANDRLKAMEQTLAKGKQIQDDCAAMVKKLRAMLHSMDE 290

Query: 1280 QLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYA 1459
            QLRVHKKQTMFLTQL AKT+PKGLHCLPLRL+TEYYSLNSS+  FPN+E+LEDP +YHYA
Sbjct: 291  QLRVHKKQTMFLTQLTAKTVPKGLHCLPLRLTTEYYSLNSSQMNFPNQERLEDPLMYHYA 350

Query: 1460 LFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIE 1639
            +FSDN+LA AVVVNSTV HAK+PA HVFHIVTDRLNYAAMRMWFL NPPG+ATIQVQNIE
Sbjct: 351  IFSDNVLATAVVVNSTVTHAKDPAKHVFHIVTDRLNYAAMRMWFLVNPPGQATIQVQNIE 410

Query: 1640 EFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFP 1819
            EFTWLNSSYSPVLKQLGS SMIDYYF+ HR++SDSNLKFRNPKYLSILNHLRFYLPEIFP
Sbjct: 411  EFTWLNSSYSPVLKQLGSASMIDYYFRTHRSSSDSNLKFRNPKYLSILNHLRFYLPEIFP 470

Query: 1820 KLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFD 1999
            KLNKVLFLDDDIVV++DLTGLWSL+LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD
Sbjct: 471  KLNKVLFLDDDIVVRKDLTGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFD 530

Query: 2000 PRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSID 2179
            P ACGWAYGMNVFDLE+WKKQNITEVYH WQKLN  RQLWKLGTLPPGLITFW  T+ +D
Sbjct: 531  PHACGWAYGMNVFDLEQWKKQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKHTYPLD 590

Query: 2180 RSWHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDC 2359
            RSWHVLGLGYNP+VSQKEIDRAAVIHYNGNMKPWLEIGIPKY+ YWAK+VDYD  Y+R+C
Sbjct: 591  RSWHVLGLGYNPSVSQKEIDRAAVIHYNGNMKPWLEIGIPKYRSYWAKYVDYDHKYMREC 650

Query: 2360 NIN 2368
            NIN
Sbjct: 651  NIN 653


>ref|XP_006350232.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1
            [Solanum tuberosum] gi|565367133|ref|XP_006350233.1|
            PREDICTED: probable galacturonosyltransferase 4-like
            isoform X2 [Solanum tuberosum]
            gi|565367135|ref|XP_006350234.1| PREDICTED: probable
            galacturonosyltransferase 4-like isoform X3 [Solanum
            tuberosum]
          Length = 680

 Score =  913 bits (2359), Expect = 0.0
 Identities = 458/675 (67%), Positives = 533/675 (78%), Gaps = 27/675 (4%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISL--DSRNEFIEDVSTLSFGGEIRKLNVLP 601
            M +RKPVLFLL+VTV APIVLYTD LG++ +    SR EFIED+ST +FGG++R LNVLP
Sbjct: 3    MKLRKPVLFLLLVTVFAPIVLYTDTLGTYFTSPSSSRTEFIEDLSTFTFGGDVRPLNVLP 62

Query: 602  QESSNTLKEPIGIVYSDNSRNST-----LFSDEIEDSVEELPLAESTEHKTRVLSATE-- 760
            QESS +LKEP G VYS+NS +S        S E      +L  AES +H+T   S+ +  
Sbjct: 63   QESSTSLKEPRGDVYSENSSHSLSNASDTLSSEDARKTRQLTEAESMKHQTATGSSNDGV 122

Query: 761  ------NPIKQVNDGVREGNGSDGLQIN------------KSIGEKKGEERTNXXXXXXX 886
                  + I QV   + E   +D                 ++I +KK     +       
Sbjct: 123  EVAMNGSHISQVTANLHEPQQTDKTSPKLVSAGKNESIAMETISKKKTSPTDSNQTLDST 182

Query: 887  XXXXXAGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGAT 1066
                   Q++V   GK  S E  + K    N +IV PDARVR LKDQLIRAKVYL L AT
Sbjct: 183  KTETRHDQRTVQTSGKFVSGETARGKDEERNVQIVPPDARVRQLKDQLIRAKVYLSLSAT 242

Query: 1067 RNNPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIK 1246
            R+NPHFIRELRLR+KEV RALG+ATKDS+L R+A +KLKAMEQTLAKGKQIQDDCA ++K
Sbjct: 243  RSNPHFIRELRLRIKEVLRALGEATKDSDLSRSANEKLKAMEQTLAKGKQIQDDCATIVK 302

Query: 1247 KLRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEE 1426
            KLRA+LHS EEQLRVHKKQT++LT L AKTLPKGLHCLPLRLSTEY+ LNSS+Q FP++E
Sbjct: 303  KLRAMLHSAEEQLRVHKKQTLYLTHLTAKTLPKGLHCLPLRLSTEYFKLNSSQQHFPHQE 362

Query: 1427 KLEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPP 1606
             LE+P LYHYALFSDNILAAAVVVNSTV+HAK+P+ HVFHIVTDRLN+AAMRMWFLANPP
Sbjct: 363  NLENPKLYHYALFSDNILAAAVVVNSTVSHAKDPSKHVFHIVTDRLNFAAMRMWFLANPP 422

Query: 1607 GKATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILN 1786
              AT+ VQN+EEFTWLNSSYSPVLKQL SQSMIDYYF++ RA+SD N+KFRNPKYLSI+N
Sbjct: 423  KYATVDVQNVEEFTWLNSSYSPVLKQLNSQSMIDYYFRS-RADSDPNVKFRNPKYLSIMN 481

Query: 1787 HLRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLN 1966
            HLRFYLPEIFPKL+KVLFLDDDIVVQ+DL GLWSL+LKGKV G VETCGESFHRFDRYLN
Sbjct: 482  HLRFYLPEIFPKLDKVLFLDDDIVVQKDLGGLWSLDLKGKVIGVVETCGESFHRFDRYLN 541

Query: 1967 FSNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGL 2146
            FSNPLISKNFDPRACGWA+GMN+ DL +W++QNITEVYH+WQ  N  RQLWKLGTLPPGL
Sbjct: 542  FSNPLISKNFDPRACGWAFGMNIIDLNQWRRQNITEVYHSWQNRNHERQLWKLGTLPPGL 601

Query: 2147 ITFWNRTFSIDRSWHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKF 2326
            ITFW RT+++DRSWHVLGLGYNP VSQK+I RAAVIHYNGN+KPWLEI IPK++ YW+KF
Sbjct: 602  ITFWKRTYALDRSWHVLGLGYNPNVSQKDIQRAAVIHYNGNLKPWLEISIPKFRDYWSKF 661

Query: 2327 VDYDQVYLRDCNINR 2371
            VDYDQ +LR+CNIN+
Sbjct: 662  VDYDQAFLRECNINK 676


>ref|XP_006350235.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X4
            [Solanum tuberosum]
          Length = 679

 Score =  911 bits (2354), Expect = 0.0
 Identities = 457/678 (67%), Positives = 534/678 (78%), Gaps = 30/678 (4%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISL--DSRNEFIEDVSTLSFGGEIRKLNVLP 601
            M +RKPVLFLL+VTV APIVLYTD LG++ +    SR EFIED+ST +FGG++R LNVLP
Sbjct: 3    MKLRKPVLFLLLVTVFAPIVLYTDTLGTYFTSPSSSRTEFIEDLSTFTFGGDVRPLNVLP 62

Query: 602  QESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSATENPIKQVN 781
            QESS +LKEP G VYS+NS +S   + +   S +     + TE   +  +AT +     N
Sbjct: 63   QESSTSLKEPRGDVYSENSSHSLSNASDTLSSEDARKTRQLTEESMKHQTATGSS----N 118

Query: 782  DGVREG-NGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXA------------------ 904
            DGV    NGS   Q+  ++ E +  ++T+            A                  
Sbjct: 119  DGVEVAMNGSHISQVTANLHEPQQTDKTSPKLVSAGKNESIAMETISKKKTSPTDSNQTL 178

Query: 905  ---------GQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGL 1057
                      Q++V   GK  S E  + K    N +IV PDARVR LKDQLIRAKVYL L
Sbjct: 179  DSTKTETRHDQRTVQTSGKFVSGETARGKDEERNVQIVPPDARVRQLKDQLIRAKVYLSL 238

Query: 1058 GATRNNPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAA 1237
             ATR+NPHFIRELRLR+KEV RALG+ATKDS+L R+A +KLKAMEQTLAKGKQIQDDCA 
Sbjct: 239  SATRSNPHFIRELRLRIKEVLRALGEATKDSDLSRSANEKLKAMEQTLAKGKQIQDDCAT 298

Query: 1238 VIKKLRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFP 1417
            ++KKLRA+LHS EEQLRVHKKQT++LT L AKTLPKGLHCLPLRLSTEY+ LNSS+Q FP
Sbjct: 299  IVKKLRAMLHSAEEQLRVHKKQTLYLTHLTAKTLPKGLHCLPLRLSTEYFKLNSSQQHFP 358

Query: 1418 NEEKLEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLA 1597
            ++E LE+P LYHYALFSDNILAAAVVVNSTV+HAK+P+ HVFHIVTDRLN+AAMRMWFLA
Sbjct: 359  HQENLENPKLYHYALFSDNILAAAVVVNSTVSHAKDPSKHVFHIVTDRLNFAAMRMWFLA 418

Query: 1598 NPPGKATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLS 1777
            NPP  AT+ VQN+EEFTWLNSSYSPVLKQL SQSMIDYYF++ RA+SD N+KFRNPKYLS
Sbjct: 419  NPPKYATVDVQNVEEFTWLNSSYSPVLKQLNSQSMIDYYFRS-RADSDPNVKFRNPKYLS 477

Query: 1778 ILNHLRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDR 1957
            I+NHLRFYLPEIFPKL+KVLFLDDDIVVQ+DL GLWSL+LKGKV G VETCGESFHRFDR
Sbjct: 478  IMNHLRFYLPEIFPKLDKVLFLDDDIVVQKDLGGLWSLDLKGKVIGVVETCGESFHRFDR 537

Query: 1958 YLNFSNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLP 2137
            YLNFSNPLISKNFDPRACGWA+GMN+ DL +W++QNITEVYH+WQ  N  RQLWKLGTLP
Sbjct: 538  YLNFSNPLISKNFDPRACGWAFGMNIIDLNQWRRQNITEVYHSWQNRNHERQLWKLGTLP 597

Query: 2138 PGLITFWNRTFSIDRSWHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYW 2317
            PGLITFW RT+++DRSWHVLGLGYNP VSQK+I RAAVIHYNGN+KPWLEI IPK++ YW
Sbjct: 598  PGLITFWKRTYALDRSWHVLGLGYNPNVSQKDIQRAAVIHYNGNLKPWLEISIPKFRDYW 657

Query: 2318 AKFVDYDQVYLRDCNINR 2371
            +KFVDYDQ +LR+CNIN+
Sbjct: 658  SKFVDYDQAFLRECNINK 675


>ref|XP_007213869.1| hypothetical protein PRUPE_ppa018681mg [Prunus persica]
            gi|462409734|gb|EMJ15068.1| hypothetical protein
            PRUPE_ppa018681mg [Prunus persica]
          Length = 659

 Score =  911 bits (2354), Expect = 0.0
 Identities = 462/679 (68%), Positives = 520/679 (76%), Gaps = 32/679 (4%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 607
            MMVR  V+ +L VTV+API+LYTDRLGSF            VS+ S      +LN+LPQE
Sbjct: 1    MMVRNVVMVMLFVTVIAPIILYTDRLGSF-----------QVSSSSC-----RLNLLPQE 44

Query: 608  SSNTLKEPIGIVYSDNSRNSTL----FSDEIEDSVEELPLAESTEH-KTRVLSAT----- 757
            SS TLKEP+G+VYSDNS NS       S     S ++ P  +S EH   RVLS T     
Sbjct: 45   SSTTLKEPVGVVYSDNSTNSYPETRGSSAHPNHSHKDGPSVDSMEHVSARVLSTTNDQNL 104

Query: 758  ---ENPIKQVNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSVNAD 928
               +NPI+QV   + +GN     Q    +  K G                   +QS    
Sbjct: 105  SQTDNPIRQVTQTLEQGN-----QFMSDLHAKGGGASEQSIDNASQTTEIKNERQSTQTS 159

Query: 929  GKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRV 1108
             + +  +P K    + N E  +PD RVRHLKDQLIRAKVYL L ATRNNPHF RELRLR+
Sbjct: 160  SRVDQRKPKKTMTEKQNDETAVPDVRVRHLKDQLIRAKVYLSLPATRNNPHFTRELRLRI 219

Query: 1109 KEVQRALGDATK-------------------DSELPRNAYDKLKAMEQTLAKGKQIQDDC 1231
            KEV++  G   +                      +  +AYDKLKAMEQTL KGKQIQDDC
Sbjct: 220  KEVKKHFGRQPRILTCQGIFTPSDQVLGSGPSIHVVCDAYDKLKAMEQTLTKGKQIQDDC 279

Query: 1232 AAVIKKLRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQ 1411
            AA++KKLRA+LHS EEQLRVH+KQTMFLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+Q 
Sbjct: 280  AAMVKKLRAMLHSMEEQLRVHRKQTMFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQV 339

Query: 1412 FPNEEKLEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWF 1591
            FPN+EKLEDP LYHYALFSDN+LAAAVVVNST+ HAK+PANHVFHIVTDRLNYAAMRMWF
Sbjct: 340  FPNQEKLEDPLLYHYALFSDNVLAAAVVVNSTITHAKDPANHVFHIVTDRLNYAAMRMWF 399

Query: 1592 LANPPGKATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKY 1771
            L N PGKATIQVQNIEEFTWLNSSYSPVLKQLGS SMI+YYF+ HRANSDSNLKFRNPKY
Sbjct: 400  LVNSPGKATIQVQNIEEFTWLNSSYSPVLKQLGSASMINYYFRTHRANSDSNLKFRNPKY 459

Query: 1772 LSILNHLRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRF 1951
            LSILNHLRFYLPE+FPKLNKVLFLDDD+VVQ+DLTGLW+L+LKG VNGAVETCGESFHRF
Sbjct: 460  LSILNHLRFYLPEVFPKLNKVLFLDDDVVVQKDLTGLWALDLKGNVNGAVETCGESFHRF 519

Query: 1952 DRYLNFSNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGT 2131
            DRYLNFSNPLISKNFD RACGWAYGMN+FDLEEWKKQNITEVYH WQ+LN  RQLWKLGT
Sbjct: 520  DRYLNFSNPLISKNFDARACGWAYGMNIFDLEEWKKQNITEVYHRWQELNHDRQLWKLGT 579

Query: 2132 LPPGLITFWNRTFSIDRSWHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKG 2311
            LPPGLITFW RT+ +DRSWHVLGLGYNP+V+QKEIDRAAVIHYNGNMKPWLEIGIPKY+ 
Sbjct: 580  LPPGLITFWKRTYPLDRSWHVLGLGYNPSVNQKEIDRAAVIHYNGNMKPWLEIGIPKYRN 639

Query: 2312 YWAKFVDYDQVYLRDCNIN 2368
            YW K+VDYD +Y+R+CNIN
Sbjct: 640  YWVKYVDYDHMYMRECNIN 658


>ref|XP_004236640.1| PREDICTED: probable galacturonosyltransferase 4-like [Solanum
            lycopersicum]
          Length = 680

 Score =  907 bits (2345), Expect = 0.0
 Identities = 457/676 (67%), Positives = 532/676 (78%), Gaps = 28/676 (4%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISL--DSRNEFIEDVSTLSFGGEIRKLNVLP 601
            M +RKPVLFLL+VTV APIVLYTD LG++ +    SR EFIED+ST +FGG++R LNVLP
Sbjct: 3    MKLRKPVLFLLLVTVFAPIVLYTDTLGTYFTSPSSSRTEFIEDLSTFTFGGDVRPLNVLP 62

Query: 602  QESSNTLKEPIGIVYSDNSRNS------TLFSDEIEDSVEELPLAESTEHKTRVLSATE- 760
            QESS +LKEP G VYS+NS  +      TL S++   +  +L  AES +H+T   S+ + 
Sbjct: 63   QESSTSLKEPRGDVYSENSSQTISNASDTLGSEDARKT-RQLTEAESLKHQTATGSSNDG 121

Query: 761  -------NPIKQVNDGVREGN------------GSDGLQINKSIGEKKGEERTNXXXXXX 883
                   N I QV D + E              G D     ++  +KK            
Sbjct: 122  VEVAMNGNHISQVTDNLHEPQQTDKTSPKLVSAGKDESIAMETNSKKKTSSTDPNQTLDS 181

Query: 884  XXXXXXAGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGA 1063
                    Q +V   GK  S E  + K    N +IV PDARVR LKDQLIRAKVYL L A
Sbjct: 182  TKTETRHDQHTVQTSGKVVSGETARGKDEERNAQIVPPDARVRQLKDQLIRAKVYLSLSA 241

Query: 1064 TRNNPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVI 1243
            TR+NPHFIRELRLR+KE  RALG+ATKDS+L R+A +KLKAMEQTLAKGKQIQDDCA ++
Sbjct: 242  TRSNPHFIRELRLRIKESLRALGEATKDSDLSRSANEKLKAMEQTLAKGKQIQDDCATIV 301

Query: 1244 KKLRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNE 1423
            KKLRA+LHS EEQLRVHKKQT++LT L AKTLPKGLHCLPLRLSTEY+ LNSS+Q FP++
Sbjct: 302  KKLRAMLHSAEEQLRVHKKQTLYLTHLTAKTLPKGLHCLPLRLSTEYFKLNSSQQHFPHQ 361

Query: 1424 EKLEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANP 1603
            E LE+P LYHYALFSDNILAAAVVVNSTV+HAK+P+ HVFHIVTDRLN+AAMRMWFLAN 
Sbjct: 362  ENLENPKLYHYALFSDNILAAAVVVNSTVSHAKDPSKHVFHIVTDRLNFAAMRMWFLANQ 421

Query: 1604 PGKATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSIL 1783
            P  AT+ VQ++EEFTWLNSSYSPVLKQL SQSMIDYYF++ RA+SD N+KFRNPKYLSI+
Sbjct: 422  PKYATVDVQSVEEFTWLNSSYSPVLKQLNSQSMIDYYFRS-RADSDPNVKFRNPKYLSIM 480

Query: 1784 NHLRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYL 1963
            NHLRFYLPEIFPKL+KVLFLDDDIVVQ+DL GLWSL+LKGKV G VETCGESFHRFDRYL
Sbjct: 481  NHLRFYLPEIFPKLDKVLFLDDDIVVQKDLGGLWSLDLKGKVIGVVETCGESFHRFDRYL 540

Query: 1964 NFSNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPG 2143
            NFSNPLIS+NFDPRACGWA+GMN+ DL EW++QNITEVYH+WQ  N  RQLWKLGTLPPG
Sbjct: 541  NFSNPLISENFDPRACGWAFGMNIIDLNEWRRQNITEVYHSWQNRNHERQLWKLGTLPPG 600

Query: 2144 LITFWNRTFSIDRSWHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAK 2323
            LITFW RT+++DRSWHVLGLGYNP VSQK+I RAAVIHYNGN+KPWLEI IPK++ YW+K
Sbjct: 601  LITFWKRTYALDRSWHVLGLGYNPNVSQKDIQRAAVIHYNGNLKPWLEISIPKFRDYWSK 660

Query: 2324 FVDYDQVYLRDCNINR 2371
            FVDYDQ +LR+CNIN+
Sbjct: 661  FVDYDQTFLRECNINK 676


>gb|EYU33824.1| hypothetical protein MIMGU_mgv1a002625mg [Mimulus guttatus]
          Length = 653

 Score =  905 bits (2340), Expect = 0.0
 Identities = 449/660 (68%), Positives = 523/660 (79%), Gaps = 16/660 (2%)
 Frame = +2

Query: 434  VRKPVLFLLVVTVLAPIVLYTDRLGSFIS-LDSRNEFIEDVSTLSFGGEIRKLNVLPQES 610
            +RKPVLFLL+VTV APIVLYTD LG + +   SRNEF+ED ST +F GE+R LNVLPQES
Sbjct: 5    LRKPVLFLLLVTVFAPIVLYTDTLGLYSTPSSSRNEFMEDGSTFTFAGEVRPLNVLPQES 64

Query: 611  SNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT------ENPIK 772
            S TLKEP+G+VYS+NS  ++    E    +      ESTE KT  LS +      ENPI+
Sbjct: 65   STTLKEPLGVVYSENSIEASSNKSEESTRITRQLTEESTEDKTTNLSGSSGGSKDENPIR 124

Query: 773  QVNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSVNADGKTNSDEP 952
            QV   V E     G + +      +  E  N              Q  V ++  +   E 
Sbjct: 125  QVISTVHEDEVGTGKEKSNKPQLHENTEIENR-------------QDDVTSENVSEKKEL 171

Query: 953  PKNK---------MARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLR 1105
             + K          +R N   V+ DARVR LKDQLI+ +VYL L ATRNNPHFIR+LRLR
Sbjct: 172  KRIKHSSRTREEVKSRQNERAVLSDARVRQLKDQLIQGRVYLSLSATRNNPHFIRDLRLR 231

Query: 1106 VKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQL 1285
            +KEVQR LG+ATKDSELPRNA +K+KAMEQTL KGKQIQDDCAAV+KKLRA+LH  EEQL
Sbjct: 232  IKEVQRVLGEATKDSELPRNANEKMKAMEQTLLKGKQIQDDCAAVVKKLRAMLHLAEEQL 291

Query: 1286 RVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALF 1465
            R HKKQ +FLT L AKT+PKGLHC PLRLS+EY+ LNSS++ F N+E LE+P LYHYALF
Sbjct: 292  RAHKKQALFLTHLTAKTVPKGLHCFPLRLSSEYFMLNSSQRDFSNKENLENPKLYHYALF 351

Query: 1466 SDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEF 1645
            SDN+LAAAVVVNST+ HAK+P+ HVFH+VTDRLNYAAM+MWFLANPPGKATIQVQN+EEF
Sbjct: 352  SDNVLAAAVVVNSTITHAKDPSKHVFHVVTDRLNYAAMKMWFLANPPGKATIQVQNVEEF 411

Query: 1646 TWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKL 1825
            TWLNSSYSPVLKQL S+SMIDYYFK  RA SDSNLK+RNPKYLSI+NHLRFYLPEIFPKL
Sbjct: 412  TWLNSSYSPVLKQLSSRSMIDYYFKGKRAESDSNLKYRNPKYLSIMNHLRFYLPEIFPKL 471

Query: 1826 NKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPR 2005
            +KVLFLDDDIVVQ+DL+G++SLNLKGKV G VETCGE+FHRFDRYLNFSNP+ISKNFDPR
Sbjct: 472  DKVLFLDDDIVVQKDLSGIFSLNLKGKVIGVVETCGETFHRFDRYLNFSNPIISKNFDPR 531

Query: 2006 ACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSIDRS 2185
            ACGWA+GMN+FDL+EW+KQNITEVYH WQ LN+ R LWKLGTLPPGLITF NRT+++D+S
Sbjct: 532  ACGWAFGMNIFDLDEWRKQNITEVYHKWQNLNEDRLLWKLGTLPPGLITFSNRTYALDKS 591

Query: 2186 WHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNI 2365
            WHVLGLGYNP V  K+I+RAAVIHYNGN+KPWLEIG+PK++ YWAKFVDYD  YLR+CNI
Sbjct: 592  WHVLGLGYNPNVPLKDIERAAVIHYNGNLKPWLEIGLPKFRNYWAKFVDYDHQYLRECNI 651


>ref|XP_006467237.1| PREDICTED: probable galacturonosyltransferase 4-like [Citrus
            sinensis]
          Length = 646

 Score =  904 bits (2335), Expect = 0.0
 Identities = 451/654 (68%), Positives = 524/654 (80%), Gaps = 7/654 (1%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRL-GSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQ 604
            M  R  V+ +L  TVLAPI+++T     S+ S     EF+ED++  + GG+ R LN+LPQ
Sbjct: 1    MKTRNLVVGMLCATVLAPILIFTSTFKDSYPSSSESGEFLEDLTAFTVGGDARHLNLLPQ 60

Query: 605  ESSNTL--KEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKT-RVLSATENPIKQ 775
            ESS TL  K+PI +V SD     +  S              S EHK+ RVLSAT N + Q
Sbjct: 61   ESSTTLSLKQPI-LVISDKIAQHSAHSQSQSQG--------SWEHKSARVLSATTNGLDQ 111

Query: 776  --VNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSVNAD-GKTNSD 946
               ++ +R+       QINK   +++ +   N              QQ  +   G     
Sbjct: 112  SKTDNPIRQVTDLTKTQINKHADQEQIKASDNHISAHHSQILDTKHQQESSLTYGVLEKK 171

Query: 947  EPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRA 1126
            EP K    +   +   PD RVR LKDQLI+AKVYL L A RNN +F+RELRLR+KEVQRA
Sbjct: 172  EPTKINNEKQTEQTTPPDFRVRQLKDQLIKAKVYLSLPAMRNNANFVRELRLRIKEVQRA 231

Query: 1127 LGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQT 1306
            LGDATKDS+LPR A D+LKAMEQ+LAKGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKKQT
Sbjct: 232  LGDATKDSDLPRIANDRLKAMEQSLAKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKKQT 291

Query: 1307 MFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAA 1486
            +FLTQL AKTLPKGLHCLPLRL+TEYY+LNSS++ FPN+EKLEDP L+HYALFSDN+LAA
Sbjct: 292  LFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQRHFPNQEKLEDPRLFHYALFSDNVLAA 351

Query: 1487 AVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSY 1666
            AVVVNSTV HAK P+NHVFHIVTDRLNYAAMRMWFLANPPG+AT+QVQNIEEFTWLNSSY
Sbjct: 352  AVVVNSTVTHAKHPSNHVFHIVTDRLNYAAMRMWFLANPPGRATVQVQNIEEFTWLNSSY 411

Query: 1667 SPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLD 1846
            SPVLKQL SQSMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPE+FP+LNKVLFLD
Sbjct: 412  SPVLKQLNSQSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEVFPRLNKVLFLD 471

Query: 1847 DDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYG 2026
            DD+VVQ+DL+GLWS++LKGKVNGAVETCGE+FHRFDRYLNFSNPLISKNFDPRACGWAYG
Sbjct: 472  DDVVVQKDLSGLWSIDLKGKVNGAVETCGETFHRFDRYLNFSNPLISKNFDPRACGWAYG 531

Query: 2027 MNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSIDRSWHVLGLG 2206
            MN+FDL+EW++QNIT+VYHTWQK+N  RQLWKLGTLPPGLITFW RT+ +DR WHVLGLG
Sbjct: 532  MNIFDLDEWRRQNITDVYHTWQKMNHDRQLWKLGTLPPGLITFWKRTYPLDRFWHVLGLG 591

Query: 2207 YNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 2368
            YNP+V+Q++I+RAAVIHYNGNMKPWLEI IPKY+ YW K VDYDQ+YLR+CNIN
Sbjct: 592  YNPSVNQRDIERAAVIHYNGNMKPWLEINIPKYRNYWTKHVDYDQLYLRECNIN 645


>ref|XP_006398488.1| hypothetical protein EUTSA_v10000819mg [Eutrema salsugineum]
            gi|557099577|gb|ESQ39941.1| hypothetical protein
            EUTSA_v10000819mg [Eutrema salsugineum]
          Length = 631

 Score =  903 bits (2334), Expect = 0.0
 Identities = 437/645 (67%), Positives = 518/645 (80%), Gaps = 1/645 (0%)
 Frame = +2

Query: 437  RKPVLFLLVVTVLAPIVLYTD-RLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQESS 613
            R  VLF L++TV API+LYTD    SF +  S+ +F+EDV+ L+F  +  +LN+LP+ES 
Sbjct: 5    RNLVLFFLLLTVAAPILLYTDPSSASFKTPFSKRDFLEDVTALTFNSDENRLNLLPRESP 64

Query: 614  NTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSATENPIKQVNDGVR 793
              ++  +G+VYS  + +S+    E  D +    L+ + +      S TE+PIKQV DG  
Sbjct: 65   EVVRGVVGVVYSKQNSDSSR-RQEARDQLSARVLSTTDDDNQ---SQTEDPIKQVTDGAS 120

Query: 794  EGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQSVNADGKTNSDEPPKNKMAR 973
            E +  + +  +    + +                    Q +    GK +  EP      +
Sbjct: 121  EMDKPNDMHASDDNSQNREGMHV---------------QLTQQTSGKVDEQEPKSFGGEK 165

Query: 974  PNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRALGDATKDSE 1153
              G +VMPD +V+HLKDQLIRAKVYL L A + N HF+RELRLR+KEVQRAL DATKDS+
Sbjct: 166  ERGNVVMPDTQVKHLKDQLIRAKVYLSLPAAKANAHFVRELRLRIKEVQRALSDATKDSD 225

Query: 1154 LPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQTMFLTQLAAK 1333
            LP+NA +KLKAMEQTLAKGKQIQDDC+ V+KKLRA+LHS EEQLRVHKKQTMFLTQL AK
Sbjct: 226  LPKNAVEKLKAMEQTLAKGKQIQDDCSTVVKKLRAMLHSAEEQLRVHKKQTMFLTQLTAK 285

Query: 1334 TLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAAAVVVNSTVN 1513
            T+PKGLHCLPLRL+T+YY+LNSSEQQFPN+E LED  LYHYALFSDN+LA +VVVNST+ 
Sbjct: 286  TIPKGLHCLPLRLTTDYYALNSSEQQFPNQENLEDNQLYHYALFSDNVLATSVVVNSTIT 345

Query: 1514 HAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSYSPVLKQLGS 1693
            +AK P+ HVFHIVTDRLNYAAMRMWFL NPPGKATIQVQN+EEFTWLNSSYSPVLKQL S
Sbjct: 346  NAKHPSKHVFHIVTDRLNYAAMRMWFLDNPPGKATIQVQNVEEFTWLNSSYSPVLKQLSS 405

Query: 1694 QSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIVVQRDL 1873
            QSMIDYYF+AH  NSD+NLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIVVQ+DL
Sbjct: 406  QSMIDYYFRAHHTNSDTNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIVVQKDL 465

Query: 1874 TGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMNVFDLEEW 2053
            +GLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMN+FDL+EW
Sbjct: 466  SGLWSVDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMNIFDLDEW 525

Query: 2054 KKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSIDRSWHVLGLGYNPTVSQKE 2233
            KKQNITEVYH WQ LN+GR+LWKLGTLPPGLITFW RT+ +DR WH+LGLGYNP+V+Q++
Sbjct: 526  KKQNITEVYHRWQTLNEGRELWKLGTLPPGLITFWRRTYPLDRKWHILGLGYNPSVNQRD 585

Query: 2234 IDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 2368
            I+R AVIHYNGN+KPWLEIGIP+Y+G+WAK VDY+ VYLR+CNIN
Sbjct: 586  IERGAVIHYNGNLKPWLEIGIPRYRGFWAKHVDYEHVYLRECNIN 630


>ref|XP_006857626.1| hypothetical protein AMTR_s00061p00126570 [Amborella trichopoda]
            gi|548861722|gb|ERN19093.1| hypothetical protein
            AMTR_s00061p00126570 [Amborella trichopoda]
          Length = 672

 Score =  903 bits (2334), Expect = 0.0
 Identities = 450/671 (67%), Positives = 533/671 (79%), Gaps = 24/671 (3%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 607
            M  R PVL LL  +VLAPIVLYTDRLGSF S  ++  F E+ S +++G +I KL VLPQE
Sbjct: 1    MKFRMPVLLLLCFSVLAPIVLYTDRLGSFSSSIAKAGFSEEFSPINYGRDINKLKVLPQE 60

Query: 608  SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSATENPIKQVNDG 787
            S N LKEP G+VY  +   S   S + E  +    + +S      V +  E  I +V+  
Sbjct: 61   SVNALKEPSGVVYLSDKDPSEAISVKEEPKMARSRVLQSNVKPLEVETHIEQVIDKVHRE 120

Query: 788  VREGNGSDG--------------LQINKS-IGEKK----GEERTNXXXXXXXXXXXXAGQ 910
             + G    G              LQ N+  IG K+    G +  +            A +
Sbjct: 121  EKNGQEIAGDSQAETIEESQQVLLQSNEQKIGAKREEQFGHQDASIKEEIGLSSRTDAEK 180

Query: 911  QSVNA----DGKTNSDEPPKNKMARPN-GEIVMPDARVRHLKDQLIRAKVYLGLGATRNN 1075
            Q  +      GK++ D P +    R N  +  MPDARV HL+DQLI+AKVYL LG TR+N
Sbjct: 181  QEPDKPEIESGKSDPDGPSQPSPERQNDNKKPMPDARVHHLRDQLIKAKVYLSLGTTRSN 240

Query: 1076 PHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLR 1255
            PHFI+ELR+R++EVQRALGDATKDSELPR AYDKLKAME+TLAKGKQIQDDCAAVIKKLR
Sbjct: 241  PHFIKELRVRIREVQRALGDATKDSELPRGAYDKLKAMEETLAKGKQIQDDCAAVIKKLR 300

Query: 1256 AILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLE 1435
            AILHSTEEQLRVHKKQ+MFL QL+AKTLPKGLHCLPLRL+TEYYSLNS++QQFPN+EKLE
Sbjct: 301  AILHSTEEQLRVHKKQSMFLMQLSAKTLPKGLHCLPLRLTTEYYSLNSTQQQFPNQEKLE 360

Query: 1436 DPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKA 1615
            +P++YHYALFSDN+LAAAVVVNSTV++A++P NHVFHIVTDRLNYAAMRMWF+ANPPGKA
Sbjct: 361  NPNIYHYALFSDNVLAAAVVVNSTVSNARDPRNHVFHIVTDRLNYAAMRMWFIANPPGKA 420

Query: 1616 TIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLR 1795
            TIQVQ++EEFTWLNSSYSPVLKQLGS SMIDYYF+ HRAN DSNLK+RNPKYLSILNHLR
Sbjct: 421  TIQVQSVEEFTWLNSSYSPVLKQLGSTSMIDYYFRTHRANPDSNLKYRNPKYLSILNHLR 480

Query: 1796 FYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSN 1975
            FY+PEIFPKL+KVLFLDDDIVVQRDLT LW ++LKGK+NGAVETC ESFHRFDRYLNFSN
Sbjct: 481  FYMPEIFPKLHKVLFLDDDIVVQRDLTQLWKIDLKGKINGAVETCRESFHRFDRYLNFSN 540

Query: 1976 PLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITF 2155
            PLISKNF+  ACGWA+GMN+FDL+EWKKQ ITE+YH+WQKLN  RQLWKLGTLPPGLITF
Sbjct: 541  PLISKNFEAHACGWAFGMNIFDLKEWKKQEITEIYHSWQKLNNDRQLWKLGTLPPGLITF 600

Query: 2156 WNRTFSIDRSWHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDY 2335
            +NRTF ++R WHVLGLGY+P+V+Q++I RAA IHYNGN+KPWLEIG+PK++GYW K+++Y
Sbjct: 601  YNRTFPLNRGWHVLGLGYDPSVNQRDIQRAAAIHYNGNLKPWLEIGLPKFRGYWQKYINY 660

Query: 2336 DQVYLRDCNIN 2368
            +Q YL+DCNIN
Sbjct: 661  NQPYLQDCNIN 671


>ref|XP_006449976.1| hypothetical protein CICLE_v10014426mg [Citrus clementina]
            gi|557552587|gb|ESR63216.1| hypothetical protein
            CICLE_v10014426mg [Citrus clementina]
          Length = 646

 Score =  902 bits (2331), Expect = 0.0
 Identities = 451/654 (68%), Positives = 523/654 (79%), Gaps = 7/654 (1%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRL-GSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQ 604
            M  R  V+ +L  TV API+++T     S+ S     EF+ED++  + GG+ R LN+LPQ
Sbjct: 1    MKTRNLVVGMLCATVFAPILIFTSTFKDSYPSSSESGEFLEDLTAFTVGGDARHLNLLPQ 60

Query: 605  ESSNTL--KEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKT-RVLSATENPIKQ 775
            ESS TL  K+PI +V SD     +  S              S EHK+ RVLSAT N + Q
Sbjct: 61   ESSTTLSLKQPI-LVISDKIAQHSAHSQSQSQG--------SWEHKSARVLSATTNGLDQ 111

Query: 776  --VNDGVREGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXXAGQQ-SVNADGKTNSD 946
               ++ +R+        INK   +++ +   N              QQ S    G     
Sbjct: 112  SKTDNPIRQVTDLTKTPINKHADQEQIKASDNHISAHHSQILDTKHQQESSQTYGVLEKK 171

Query: 947  EPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRA 1126
            EP K    +   +   PD RVR LKDQLI+AKVYL L ATRNN +F+RELRLR+KEVQRA
Sbjct: 172  EPTKINNEKQTEQTAPPDFRVRQLKDQLIKAKVYLSLPATRNNANFVRELRLRIKEVQRA 231

Query: 1127 LGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQT 1306
            LGDA+KDS+LPR A D+LKAMEQ+LAKGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKKQT
Sbjct: 232  LGDASKDSDLPRIANDRLKAMEQSLAKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKKQT 291

Query: 1307 MFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAA 1486
            +FLTQL AKTLPKGLHCLPLRL+TEYYSLNSS++ FPN+EKLEDP L+HYALFSDN+LAA
Sbjct: 292  LFLTQLTAKTLPKGLHCLPLRLTTEYYSLNSSQRYFPNQEKLEDPRLFHYALFSDNVLAA 351

Query: 1487 AVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSY 1666
            AVVVNSTV HAK P+NHVFHIVTDRLNYAAMRMWFLANPPG+AT+QVQNIEEFTWLNSSY
Sbjct: 352  AVVVNSTVTHAKHPSNHVFHIVTDRLNYAAMRMWFLANPPGRATVQVQNIEEFTWLNSSY 411

Query: 1667 SPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLD 1846
            SPVLKQL SQSMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPE+FP+LNKVLFLD
Sbjct: 412  SPVLKQLNSQSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEVFPRLNKVLFLD 471

Query: 1847 DDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYG 2026
            DD+VVQ+DL+GLWS++LKGKVNGAVETCGE+FHRFDRYLNFSNPLISKNFDPRACGWAYG
Sbjct: 472  DDVVVQKDLSGLWSIDLKGKVNGAVETCGETFHRFDRYLNFSNPLISKNFDPRACGWAYG 531

Query: 2027 MNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFSIDRSWHVLGLG 2206
            MN+FDL+EW++QNIT+VYHTWQK+N  RQLWKLGTLPPGLITFW RT+ +DR WHVLGLG
Sbjct: 532  MNIFDLDEWRRQNITDVYHTWQKMNHDRQLWKLGTLPPGLITFWKRTYPLDRFWHVLGLG 591

Query: 2207 YNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 2368
            YNP+V+Q++I+RAAVIHYNGNMKPWLEI IPKY+ YW K VDYDQ+YLR+CNIN
Sbjct: 592  YNPSVNQRDIERAAVIHYNGNMKPWLEINIPKYRNYWTKHVDYDQLYLRECNIN 645


>ref|XP_006600275.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1
            [Glycine max] gi|571532515|ref|XP_006600276.1| PREDICTED:
            probable galacturonosyltransferase 4-like isoform X2
            [Glycine max]
          Length = 661

 Score =  894 bits (2309), Expect = 0.0
 Identities = 447/673 (66%), Positives = 525/673 (78%), Gaps = 26/673 (3%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 607
            ++ R  VL LL +T +APIVL+TDRLG+F    +  EFIE V+      +   LN+LPQE
Sbjct: 2    VVTRNIVLLLLSITFVAPIVLFTDRLGTFKYPFAEQEFIEAVTAFVSAADSGHLNLLPQE 61

Query: 608  SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEH-KTRVLSAT--------E 760
            SS   KEPIG+VY++++ N+       E+ +  L  A+  EH   RVLSAT        E
Sbjct: 62   SSTVFKEPIGLVYTEDTSNT-------ENLLHGLHFAKPGEHVSARVLSATNDEGQTKGE 114

Query: 761  NPIKQVNDGVREGN----------------GSDGLQINKSIGE-KKGEERTNXXXXXXXX 889
            NPIK V DG+ +GN                G D + ++ + G+  K  +  +        
Sbjct: 115  NPIKLVTDGINQGNQNSYMVKADTTGDSVNGEDAIDVDDNDGKLAKSSDLVSETTDTKQE 174

Query: 890  XXXXAGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATR 1069
                  Q+ + +  +    EP  ++  + N +   PDARV+ LKDQLI+A+VYL L A R
Sbjct: 175  ------QEHIKSSSQVTQKEPILSEADKHNDQ-TPPDARVQQLKDQLIQARVYLSLQAVR 227

Query: 1070 NNPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKK 1249
            +NPH  RELRLRVKEV R LGDA+KDS+LPRNA +++KAMEQTL KG+QIQ+DCAA +KK
Sbjct: 228  SNPHLTRELRLRVKEVSRTLGDASKDSDLPRNANERMKAMEQTLMKGRQIQNDCAAAVKK 287

Query: 1250 LRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEK 1429
            LRA+LHSTEEQL VHKKQT+FLTQL AKTLPKGLHCLPLRL+TEYYSLN+S+QQF N++K
Sbjct: 288  LRAMLHSTEEQLHVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNTSQQQFRNQQK 347

Query: 1430 LEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPG 1609
            LEDP LYHYA+FSDNILA AVVVNSTV HAK+ + HVFHIVTDRLNYAAMRMWFL NPP 
Sbjct: 348  LEDPRLYHYAIFSDNILATAVVVNSTVAHAKDTSKHVFHIVTDRLNYAAMRMWFLVNPPQ 407

Query: 1610 KATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNH 1789
            KATIQVQNIE+FTWLNSSYSPVLKQLGS SMID+YFK HRA+SDSNLKFRNPKYLSILNH
Sbjct: 408  KATIQVQNIEDFTWLNSSYSPVLKQLGSPSMIDFYFKTHRASSDSNLKFRNPKYLSILNH 467

Query: 1790 LRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNF 1969
            LRFYLPEIFPKLNKVLFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGE FHRFDRYLNF
Sbjct: 468  LRFYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLNF 527

Query: 1970 SNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLI 2149
            SNPLI+KNFDPRACGWAYGMNVFDL +WK+QNIT+VYH WQK+N  RQLWKLGTLPPGLI
Sbjct: 528  SNPLIAKNFDPRACGWAYGMNVFDLVQWKRQNITDVYHKWQKMNHDRQLWKLGTLPPGLI 587

Query: 2150 TFWNRTFSIDRSWHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFV 2329
            TFW RTF + RSWHVLGLGYNP ++QKEI+RAAVIHYNGNMKPWLEI IPK++GYW K+V
Sbjct: 588  TFWKRTFQLHRSWHVLGLGYNPNINQKEIERAAVIHYNGNMKPWLEISIPKFRGYWTKYV 647

Query: 2330 DYDQVYLRDCNIN 2368
            DY+ VYLR+CNIN
Sbjct: 648  DYNLVYLRECNIN 660


>ref|XP_004499343.1| PREDICTED: probable galacturonosyltransferase 4-like [Cicer
            arietinum]
          Length = 658

 Score =  892 bits (2306), Expect = 0.0
 Identities = 448/670 (66%), Positives = 519/670 (77%), Gaps = 25/670 (3%)
 Frame = +2

Query: 437  RKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGG-EIRKLNVLPQESS 613
            R  V  LL +TV+ PI+LYTDRL  F    + +EFI+DV+  + GG +   LN+LPQE+S
Sbjct: 5    RNIVFLLLCITVVTPILLYTDRLTDFNYPSAEHEFIQDVTAFAVGGAKSSHLNLLPQETS 64

Query: 614  NTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT--------ENPI 769
              LKEPIG+VYS+++ N           ++ LP  E     TRVLSAT        +NPI
Sbjct: 65   TILKEPIGVVYSEDTSN-----------IKSLPQREHV--LTRVLSATNEEDWSKGDNPI 111

Query: 770  KQVNDGVR----------------EGNGSDGLQINKSIGEKKGEERTNXXXXXXXXXXXX 901
            K + DGV+                  NG D + ++ + G  K  + +N            
Sbjct: 112  KLLTDGVKPINQSSYLEKADITGGSVNGEDAIDVDDNDG--KLTKSSNASDQVSETILTK 169

Query: 902  AGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPH 1081
             G+Q   +  K N+      +  + NG+    DARVR LKDQLI+AKVYL L A RNNPH
Sbjct: 170  QGKQRTGSSSKGNNKGTILQETTKHNGQ-TPSDARVRKLKDQLIQAKVYLSLQAVRNNPH 228

Query: 1082 FIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAI 1261
              RELRLRVKEV R LGDA+KDS+LPRNA +++K+MEQ+L KG+QIQDDCA  +KKLRA+
Sbjct: 229  LTRELRLRVKEVSRTLGDASKDSDLPRNANERMKSMEQSLMKGRQIQDDCATSVKKLRAM 288

Query: 1262 LHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDP 1441
            LHS+E+QLRVHKKQT FLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+QQFPN+EKLEDP
Sbjct: 289  LHSSEDQLRVHKKQTSFLTQLTAKTLPKGLHCLPLRLTTEYYNLNSSQQQFPNQEKLEDP 348

Query: 1442 SLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATI 1621
             LYHYA+FSDNILA AVVVNST  HAK+ + HVFHIVTDRLNYAAMRMWFLANPPGKA I
Sbjct: 349  GLYHYAIFSDNILATAVVVNSTAAHAKDASKHVFHIVTDRLNYAAMRMWFLANPPGKAAI 408

Query: 1622 QVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFY 1801
            QVQNIE+FTWLNSSYSPVLKQLGS SMIDYYFK HRA SDSNLKFRNPKYLS+LNHLRFY
Sbjct: 409  QVQNIEDFTWLNSSYSPVLKQLGSPSMIDYYFKTHRATSDSNLKFRNPKYLSMLNHLRFY 468

Query: 1802 LPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPL 1981
            LPEIFPKL KVLFLDDD+VVQ+DLTGLWS++LKG VNGAVETC ESFHRFDRYLNFSNPL
Sbjct: 469  LPEIFPKLKKVLFLDDDVVVQKDLTGLWSIDLKGNVNGAVETCAESFHRFDRYLNFSNPL 528

Query: 1982 ISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWN 2161
            +++NFDPRACGWAYGMNVFDL  WKKQNITEVYH WQKLN  RQLWKLGTLPPGLITFW 
Sbjct: 529  VARNFDPRACGWAYGMNVFDLVGWKKQNITEVYHNWQKLNHDRQLWKLGTLPPGLITFWK 588

Query: 2162 RTFSIDRSWHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQ 2341
            RTF ++RSWHVLGLGYNP V+QK+I+RAAVIHYNGNMKPWLEI IPK++ YW K+VDYD 
Sbjct: 589  RTFPLNRSWHVLGLGYNPNVNQKDIERAAVIHYNGNMKPWLEISIPKFRAYWTKYVDYDI 648

Query: 2342 VYLRDCNINR 2371
            VYLR+CNIN+
Sbjct: 649  VYLRECNINQ 658


>ref|XP_006584115.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1
            [Glycine max] gi|571468064|ref|XP_006584116.1| PREDICTED:
            probable galacturonosyltransferase 4-like isoform X2
            [Glycine max]
          Length = 664

 Score =  891 bits (2303), Expect = 0.0
 Identities = 445/672 (66%), Positives = 523/672 (77%), Gaps = 25/672 (3%)
 Frame = +2

Query: 428  MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 607
            ++ R  VL LL +T +APIVLYTDR G+F    +  EFI+ V+      +   LN+LPQE
Sbjct: 2    VVTRNIVLLLLSITFVAPIVLYTDRFGTFKYPFAEQEFIDAVTAFVSAADSGHLNLLPQE 61

Query: 608  SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEH-KTRVLSAT--------E 760
            +S   KEPIG+VY++++ N+       ++ +  L  A+  EH   RVLSAT        E
Sbjct: 62   TSTVFKEPIGLVYTEDAANT-------KNLLHGLHFAKPGEHVSARVLSATKDEGQTKGE 114

Query: 761  NPIKQVNDGVREGN----------------GSDGLQINKSIGEKKGEERTNXXXXXXXXX 892
            NPIK V DG+ +GN                G D + ++ + G  K  + ++         
Sbjct: 115  NPIKLVTDGINQGNQNSYLVKADITGDSVNGEDAIDVDDNDG--KLAKSSDASDLASETM 172

Query: 893  XXXAGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRN 1072
                 QQ + +  +  + +  K   A  + +   PDARVR+LKDQLI+ +VYL L A RN
Sbjct: 173  DTKQEQQHIKSSSQV-TQKGSKLSEADKHIDQTPPDARVRYLKDQLIQVRVYLSLQAVRN 231

Query: 1073 NPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKL 1252
            NPH  RELRLRVKEV R LGDA+KDS+LPRNA +++KAMEQTL KG+QIQ+DCAA +KKL
Sbjct: 232  NPHLTRELRLRVKEVSRTLGDASKDSDLPRNANERMKAMEQTLMKGRQIQNDCAAAVKKL 291

Query: 1253 RAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKL 1432
            RA+LHSTEEQL VHKKQT+FLTQL AKTLPKGLHCLPLRL+TEYYSLN+S+QQ PN++KL
Sbjct: 292  RAMLHSTEEQLHVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNTSQQQLPNQQKL 351

Query: 1433 EDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGK 1612
            E+P LYHYA+FSDNILA AVVVNSTV HAK+ +NHVFHIVTDRLNYAAMRMWFL NPP K
Sbjct: 352  ENPRLYHYAIFSDNILATAVVVNSTVAHAKDTSNHVFHIVTDRLNYAAMRMWFLVNPPKK 411

Query: 1613 ATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHL 1792
            ATIQVQNIE+FTWLNSSYSPVLKQLGS SM+D+YFK HRA+SDSNLKFRNPKYLSILNHL
Sbjct: 412  ATIQVQNIEDFTWLNSSYSPVLKQLGSPSMVDFYFKTHRASSDSNLKFRNPKYLSILNHL 471

Query: 1793 RFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFS 1972
            RFYLPEIFPKLNKVLFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGE FHRFDRYLNFS
Sbjct: 472  RFYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLNFS 531

Query: 1973 NPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLIT 2152
            NP I+KNFDPRACGWAYGMNVFDL +WK+QNITEVYH WQKLN  RQLWKLGTLPPGLIT
Sbjct: 532  NPHIAKNFDPRACGWAYGMNVFDLVQWKRQNITEVYHNWQKLNHDRQLWKLGTLPPGLIT 591

Query: 2153 FWNRTFSIDRSWHVLGLGYNPTVSQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVD 2332
            FW RTF ++RSWHVLGLGYNP ++QKEI+RAAVIHYNGNMKPWLEI  PK++GYW K+VD
Sbjct: 592  FWKRTFQLNRSWHVLGLGYNPNINQKEIERAAVIHYNGNMKPWLEISFPKFRGYWTKYVD 651

Query: 2333 YDQVYLRDCNIN 2368
            YD VYLR+CNIN
Sbjct: 652  YDLVYLRECNIN 663


Top