BLASTX nr result

ID: Stemona21_contig00003684 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Stemona21_contig00003684
         (2371 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI18781.3| unnamed protein product [Vitis vinifera]              871   0.0  
ref|XP_002525229.1| Glycosyltransferase QUASIMODO1, putative [Ri...   870   0.0  
gb|EXB64628.1| putative galacturonosyltransferase 4 [Morus notab...   870   0.0  
emb|CAN70213.1| hypothetical protein VITISV_038741 [Vitis vinifera]   865   0.0  
ref|XP_004973216.1| PREDICTED: probable galacturonosyltransferas...   863   0.0  
ref|XP_002272372.2| PREDICTED: uncharacterized protein LOC100258...   862   0.0  
ref|XP_003573698.1| PREDICTED: probable galacturonosyltransferas...   858   0.0  
gb|EOY29053.1| Galacturonosyltransferase 4 isoform 2 [Theobroma ...   852   0.0  
ref|NP_001061555.1| Os08g0327100 [Oryza sativa Japonica Group] g...   852   0.0  
gb|EOY29052.1| Galacturonosyltransferase 4 isoform 1 [Theobroma ...   851   0.0  
ref|XP_003573699.1| PREDICTED: probable galacturonosyltransferas...   850   0.0  
ref|XP_002444200.1| hypothetical protein SORBIDRAFT_07g014890 [S...   849   0.0  
ref|XP_006857626.1| hypothetical protein AMTR_s00061p00126570 [A...   845   0.0  
gb|ESW19558.1| hypothetical protein PHAVU_006G135100g [Phaseolus...   842   0.0  
ref|XP_006597630.1| PREDICTED: probable galacturonosyltransferas...   841   0.0  
ref|XP_006467237.1| PREDICTED: probable galacturonosyltransferas...   841   0.0  
ref|XP_006600275.1| PREDICTED: probable galacturonosyltransferas...   841   0.0  
ref|XP_003535002.1| PREDICTED: probable galacturonosyltransferas...   840   0.0  
ref|XP_006449976.1| hypothetical protein CICLE_v10014426mg [Citr...   838   0.0  
ref|XP_006350232.1| PREDICTED: probable galacturonosyltransferas...   838   0.0  

>emb|CBI18781.3| unnamed protein product [Vitis vinifera]
          Length = 638

 Score =  871 bits (2251), Expect = 0.0
 Identities = 439/658 (66%), Positives = 527/658 (80%), Gaps = 4/658 (0%)
 Frame = +3

Query: 156  MVKRKPVLVLLCVTVLAPIVLYTDMLA-STSNSIQNSD-FTGDASNLSAGKEVQRLHALP 329
            M+KRK VL LL VTVL+PIVLYTD L  S   S   +D F  D + L+ G    +L+ LP
Sbjct: 1    MIKRKTVLFLLLVTVLSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNLLP 60

Query: 330  LESSDALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATNER-TQ 506
             ESS  LKEPIGIVY DN S     SA        ++L LGGS E K+RVLS T E   +
Sbjct: 61   QESSTTLKEPIGIVYSDNDSLDVDESA--------ADLQLGGSVEHKTRVLSTTYEEGDR 112

Query: 507  SERIGAIKQVTSREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEI-QGSXXXXXXXXRPP 683
            S+R   I+QVT  +  N        ++   E+ S  A   + TE  Q S           
Sbjct: 113  SQRENPIRQVTDGKDDN--------LQRGSELTSHNASQNSETEHGQQSAQTSGKGDHKE 164

Query: 684  KVASRRHQILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELRLRVRDV 863
             V +R  + +D      V+ D +V+ LKDQLIRAKV+LSL ++R N HFIRELR R+++V
Sbjct: 165  PVKTRNEKPID----QTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARMKEV 220

Query: 864  QRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVEDQLQVHK 1043
            QRALGDATKDSELP+N  EK++ MEQTL+KGKQIQDDC+AVVKKLRA+LH+ E+QL+VHK
Sbjct: 221  QRALGDATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLRVHK 280

Query: 1044 KQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYALFSDNI 1223
            KQT++LTQL AKTLPKGLHCL LRLSTEYY +D +++QFP+Q+KLEDP+LFHYALFSDNI
Sbjct: 281  KQTMYLTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFSDNI 340

Query: 1224 LAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIEEFTWLN 1403
            LAA+VVVNSTV NAK+ + HVFHIV+DRLNYAAMRMWFLANPP  ATIQVQNI+EFTWLN
Sbjct: 341  LAAAVVVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFTWLN 400

Query: 1404 ASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFPKLDKVV 1583
            +SYSPV++QLGS SMID+YF+ +R+NSD+NLKFRNPKYLS+LNHLRFYLPEIFPKL+KV+
Sbjct: 401  SSYSPVLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVL 460

Query: 1584 FLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFDPRACGW 1763
            FLDDDIVVQ+DLT LWSIDL+G VNGAVETCGESFHRFDRYLNFSNPLI+KNFD  ACGW
Sbjct: 461  FLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHACGW 520

Query: 1764 AYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLNKSWHVL 1943
            AYGMN+FDL +W++Q ITEVYH+WQKLN DR LWKLGTLPPGLITFWK+TFP+++SWHVL
Sbjct: 521  AYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTFPIDRSWHVL 580

Query: 1944 GLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDCNINP 2117
            GLGYNP+VN+++I+RAAV+HYNGN+KPWLEIG+PKFRNYWAK+ D+D +YL+DCNINP
Sbjct: 581  GLGYNPSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNINP 638


>ref|XP_002525229.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
            gi|223535526|gb|EEF37195.1| Glycosyltransferase
            QUASIMODO1, putative [Ricinus communis]
          Length = 647

 Score =  870 bits (2249), Expect = 0.0
 Identities = 435/662 (65%), Positives = 532/662 (80%), Gaps = 7/662 (1%)
 Frame = +3

Query: 153  KMVKRKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNS-DFTGDASNLS-AGKEVQRLHAL 326
            KM  R  V+ +L VTV+API+LYTD   ST NS  ++ +F  D ++L+ +G     L+ L
Sbjct: 2    KMKLRNLVVGMLLVTVIAPIILYTDNRFSTFNSSSSTTEFLEDVASLTLSGDSRDHLNVL 61

Query: 327  PLESSDALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATNERTQ 506
            P ES+  LKEPIGIVY DNS+ S   +  S+    +S LP        +RVLSATN++ Q
Sbjct: 62   PQESTSLLKEPIGIVYTDNSTISPPHT--STIQFHSSPLPQDTREHKSTRVLSATNDQHQ 119

Query: 507  SERIGAIKQVTSREHSNDGLKNPKLVEGN-KEVGSQQAV----DLTSTEIQGSXXXXXXX 671
            S+    I+QVT+++ S     N K  + N  + GSQ AV     LTS ++          
Sbjct: 120  SQTDTIIRQVTNQQASRTTDANNKNSKQNPSDGGSQNAVVQQSSLTSEKVTEKG------ 173

Query: 672  XRPPKVASRRHQILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELRLR 851
              PPK  +      D Q +   +PD +VR L+DQLIRAKVYLSL S++ NPHF RELRLR
Sbjct: 174  --PPKSRT------DKQTAQTPVPDARVRQLRDQLIRAKVYLSLPSTKNNPHFTRELRLR 225

Query: 852  VRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVEDQL 1031
            +++VQR LGDATKDS+LP+N N+K++AM+Q+L+KGKQ+QDDC++VVKKLRAMLH+ E+QL
Sbjct: 226  IKEVQRVLGDATKDSDLPKNANDKLKAMDQSLAKGKQVQDDCASVVKKLRAMLHSSEEQL 285

Query: 1032 QVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYALF 1211
            +VHKKQT+FLTQL AKTLPKGLHC  LRL+ EYY ++ S++QFP+QEKLEDP+L+HYALF
Sbjct: 286  RVHKKQTMFLTQLTAKTLPKGLHCFPLRLTNEYYSLNSSQQQFPNQEKLEDPQLYHYALF 345

Query: 1212 SDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIEEF 1391
            SDN+LAA+VVVNST+ +AK+ + HVFHIVTDRLNYAAMRMWFL NPP  ATIQVQNIEE 
Sbjct: 346  SDNVLAAAVVVNSTITHAKDPSKHVFHIVTDRLNYAAMRMWFLVNPPGQATIQVQNIEEL 405

Query: 1392 TWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFPKL 1571
            TWLN+SYSPV++QLGSQSMID+YFRT+RANSD+NLK+RNPKYLS+LNHLRFYLPEIFP L
Sbjct: 406  TWLNSSYSPVLKQLGSQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIFPML 465

Query: 1572 DKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFDPR 1751
            +KV+FLDDDIVVQ+DLT LWS+DL+G VNGAVETCGE FHRFDRYLNFSNPLI+KNFDP 
Sbjct: 466  NKVLFLDDDIVVQKDLTGLWSLDLKGNVNGAVETCGERFHRFDRYLNFSNPLISKNFDPH 525

Query: 1752 ACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLNKS 1931
            ACGWAYGMNVFDL +W+RQ IT VYH+WQKLN DRLLWKLGTLPPGLITFWKQT+ +++S
Sbjct: 526  ACGWAYGMNVFDLDQWKRQNITGVYHTWQKLNHDRLLWKLGTLPPGLITFWKQTYSIDRS 585

Query: 1932 WHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDCNI 2111
            WHVLGLGYNPNVNQ++I+RAAV+HYNGN+KPWLEIGI K+RNYWAKYVDYD  YL++CNI
Sbjct: 586  WHVLGLGYNPNVNQREIERAAVIHYNGNLKPWLEIGISKYRNYWAKYVDYDHVYLRECNI 645

Query: 2112 NP 2117
            NP
Sbjct: 646  NP 647


>gb|EXB64628.1| putative galacturonosyltransferase 4 [Morus notabilis]
          Length = 657

 Score =  870 bits (2248), Expect = 0.0
 Identities = 437/673 (64%), Positives = 533/673 (79%), Gaps = 19/673 (2%)
 Frame = +3

Query: 156  MVKRKPVLVLLCVTVLAPIVLYTDMLAS-TSNSIQNSDFTGDASNLSAGKEVQRLHALPL 332
            M+ R  V+ +L VTV+APIVLYTD L +  S S   ++F  D + +              
Sbjct: 1    MMVRNVVIGMLFVTVIAPIVLYTDRLGTFQSYSASTNEFVEDVTTV-------------- 46

Query: 333  ESSDALKEPIGIVYYDNSS----NSEQISARSSDGITASELP--LGGSGE-LKSRVLSAT 491
            E S  +KEPIGIVY DNS+    NS      SS   + SE    LG S E + +RVLS T
Sbjct: 47   EPSTKIKEPIGIVYSDNSNQSLPNSGDAVKESSTDTSNSEQDWQLGDSMEHVSARVLSTT 106

Query: 492  NERTQSERIGAIKQVTSREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEIQ-----GSXX 656
            N+   S +  AI++VT R+   D  +   +V+G  E   + A+D    EIQ     GS  
Sbjct: 107  NDENNSRKENAIREVTDRDQEGDQ-ETLDIVDGEGETKGE-AIDAEVKEIQQKVDDGSGD 164

Query: 657  XXXXXXRPPKVASR------RHQILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRT 818
                  +  + +SR      R    + Q    V+PD +VR+LKDQL+RA+VYLSL ++R 
Sbjct: 165  TEVKPEQTTETSSRVDKREPRKTRPEKQNDRTVIPDARVRHLKDQLVRARVYLSLPATRN 224

Query: 819  NPHFIRELRLRVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKL 998
            NPHF RELR+R+++VQRALGDA+KDSELPRN  ++++AMEQ+L+KGKQIQDDC+A VKKL
Sbjct: 225  NPHFTRELRVRMKEVQRALGDASKDSELPRNAYDRLKAMEQSLAKGKQIQDDCAAAVKKL 284

Query: 999  RAMLHTVEDQLQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKL 1178
            RAMLH+ E+QL+VHKKQTLFLTQL AKTLPKGLHCL LRL+TEYY ++ S + FP+++KL
Sbjct: 285  RAMLHSTEEQLRVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNYSEQHFPNEDKL 344

Query: 1179 EDPKLFHYALFSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRN 1358
            EDP+L+HYALFSDN+LAA+VVVNST+ +AK+ + HVFHIVTDRLNYAAMRMWFL NPP  
Sbjct: 345  EDPQLYHYALFSDNVLAAAVVVNSTITHAKDPSKHVFHIVTDRLNYAAMRMWFLVNPPGK 404

Query: 1359 ATIQVQNIEEFTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHL 1538
            AT+QVQNIEEFTWLN+SYSPV++QLGSQSMI++YFRT+RA+SD+NLKFRNPKYLS+LNHL
Sbjct: 405  ATVQVQNIEEFTWLNSSYSPVLKQLGSQSMINYYFRTHRASSDSNLKFRNPKYLSILNHL 464

Query: 1539 RFYLPEIFPKLDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFS 1718
            RFYLP+IFPKLDKV+F+DDDIVVQ+DLTALWS+DL+G VNGAVETCGESFHRFDRYLNFS
Sbjct: 465  RFYLPQIFPKLDKVLFVDDDIVVQKDLTALWSLDLKGNVNGAVETCGESFHRFDRYLNFS 524

Query: 1719 NPLIAKNFDPRACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLIT 1898
            NPLI+KNFDPRACGWAYGMN+FDL EW+RQ+IT+VYHSWQKLN DR LWKLGTLPPGLIT
Sbjct: 525  NPLISKNFDPRACGWAYGMNIFDLKEWKRQQITDVYHSWQKLNHDRQLWKLGTLPPGLIT 584

Query: 1899 FWKQTFPLNKSWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVD 2078
            FWK+T+PL++SWHVLGLGYNPNV QKDI+RAAV+HYNGNMKPWLEIGIPK+RNYWAKYVD
Sbjct: 585  FWKRTYPLDRSWHVLGLGYNPNVGQKDIERAAVIHYNGNMKPWLEIGIPKYRNYWAKYVD 644

Query: 2079 YDQDYLQDCNINP 2117
            YDQ YL++CN+NP
Sbjct: 645  YDQLYLRECNLNP 657


>emb|CAN70213.1| hypothetical protein VITISV_038741 [Vitis vinifera]
          Length = 759

 Score =  865 bits (2236), Expect = 0.0
 Identities = 440/660 (66%), Positives = 529/660 (80%), Gaps = 4/660 (0%)
 Frame = +3

Query: 150  RKMVKRKPVLVLLCVTVLAPIVLYTDMLA-STSNSIQNSD-FTGDASNLSAGKEVQRLHA 323
            ++M+KRK VL LL VTV +PIVLYTD L  S   S   +D F  D + L+ G    +L+ 
Sbjct: 120  KEMIKRKTVLFLLLVTVXSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNL 179

Query: 324  LPLESSDALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATNER- 500
            LP ESS  LKEPIGIVY DN S     SA        ++L LGGS E K+R LS T E  
Sbjct: 180  LPQESSTTLKEPIGIVYSDNDSLDVDESA--------ADLQLGGSVEHKTRXLSTTYEEG 231

Query: 501  TQSERIGAIKQVTSREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEI-QGSXXXXXXXXR 677
             +S+R   I+QVT      DG K+  L  G+ E+ S  A   + TE  Q S         
Sbjct: 232  DRSQRENPIRQVT------DG-KDDSLQRGS-ELTSHNASQNSETEHGQQSAQTSGKGDH 283

Query: 678  PPKVASRRHQILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELRLRVR 857
               V +R  + +D      V+ D +V+ LKDQLIRAKV+LSL ++R N HFIRELR R++
Sbjct: 284  KEPVKTRNEKPID----QTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARMK 339

Query: 858  DVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVEDQLQV 1037
            +VQRALGDATKDSELP+N  EK++ MEQTL+KGKQIQDDC+AVVKKLRA+LH+ E+QL+V
Sbjct: 340  EVQRALGDATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLRV 399

Query: 1038 HKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYALFSD 1217
            HKKQT++LTQL AKTLPKGLHCL LRLSTEYY +D +++QFP+Q+KLEDP+LFHYALFSD
Sbjct: 400  HKKQTMYLTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFSD 459

Query: 1218 NILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIEEFTW 1397
            NILAA+VVVNSTV NAK+ + HVFHIV+DRLNYAAMRMWFLANPP  ATIQVQNI+EFTW
Sbjct: 460  NILAAAVVVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFTW 519

Query: 1398 LNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFPKLDK 1577
            LN+SYSPV++QLGS SMID+YF+ +R+NSD+NLKFRNPKYLS+LNHLRFYLPEIFPKL+K
Sbjct: 520  LNSSYSPVLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNK 579

Query: 1578 VVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFDPRAC 1757
            V+FLDDDIVVQ+DLT LWSIDL+G VNGAVETCGESFHRFDRYLNFSNPLI+KNFD  AC
Sbjct: 580  VLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHAC 639

Query: 1758 GWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLNKSWH 1937
            GWAYGMN+FDL +W++Q ITEVYH+WQKLN DR LWKLGTLPPGLITFWK+T P+++SWH
Sbjct: 640  GWAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTXPIDRSWH 699

Query: 1938 VLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDCNINP 2117
            VLGLGYNP+VN+++I+RAAV+HYNGN+KPWLEIG+PKFRNYWAK+ D+D +YL+DCNINP
Sbjct: 700  VLGLGYNPSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNINP 759


>ref|XP_004973216.1| PREDICTED: probable galacturonosyltransferase 4-like [Setaria
            italica]
          Length = 648

 Score =  863 bits (2229), Expect = 0.0
 Identities = 418/660 (63%), Positives = 519/660 (78%), Gaps = 8/660 (1%)
 Frame = +3

Query: 162  KRKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLSAGKEVQRLHALPLESS 341
            +R  +L+LL +TVL+P+ LYT  L++  N  Q  DF G+ +N   G +  +L+ALPLE+ 
Sbjct: 6    RRSVLLLLLALTVLSPLALYTSSLSAALNPTQTRDFPGEITNQGRGVKADKLNALPLETV 65

Query: 342  DALKEPIGIVYYDNSSNSEQIS--ARSSDGITASELPLGGSGELKSRVLSATNERTQSER 515
             +LKEP+GIV+      SE++   A+ S G    ELPL  +GE KSRVLS          
Sbjct: 66   SSLKEPVGIVF------SEELDGLAKESTGSEGQELPLRKAGEHKSRVLSEVMVAADGTE 119

Query: 516  IGAIKQVTSREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEIQGSXXXXXXXXRPPKVAS 695
            +  I+QVT RE  + G  +    E  K  GSQQ         Q S         P + ++
Sbjct: 120  V--IEQVTRREAQDGGSASAISDEQEKTTGSQQ---------QSSSKESLRETMPKQTSA 168

Query: 696  R------RHQILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELRLRVR 857
            +      +  I D +  N V+PD  +RN+KDQLI+AKVYL LGS R N  ++++LR R+R
Sbjct: 169  KVLVENSQAAITDGKTKNTVLPDTWIRNIKDQLIKAKVYLGLGSIRANSQYLKDLRQRIR 228

Query: 858  DVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVEDQLQV 1037
            +VQ+ LGDA+KD++LP+N NEK++A+EQ L KGKQ+QDDCS VVKKLRAMLH+ E+QL  
Sbjct: 229  EVQKVLGDASKDTDLPKNANEKVKALEQLLIKGKQMQDDCSIVVKKLRAMLHSAEEQLHA 288

Query: 1038 HKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYALFSD 1217
            HKKQT+FLTQLAAKTLPKGLHCL LRL+ EY+ +DP ++QFP+Q KL +PKL+HYALFSD
Sbjct: 289  HKKQTVFLTQLAAKTLPKGLHCLPLRLANEYFSLDPGQQQFPNQHKLSNPKLYHYALFSD 348

Query: 1218 NILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIEEFTW 1397
            +ILA +VVVNSTV NAK+ +DHVFHIVTDRLNYA MRMWFL NPP  ATI+VQ+I EFTW
Sbjct: 349  SILATAVVVNSTVLNAKHPSDHVFHIVTDRLNYAPMRMWFLTNPPGKATIEVQHIGEFTW 408

Query: 1398 LNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFPKLDK 1577
            LN SYSPV++QLGSQSMID+YF TNRAN D+NLK+RNPKYLS+LNHLRFYLPEI+PKLDK
Sbjct: 409  LNDSYSPVLQQLGSQSMIDYYFGTNRANPDSNLKYRNPKYLSILNHLRFYLPEIYPKLDK 468

Query: 1578 VVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFDPRAC 1757
            +VFLDDDIVV++DLT LWSI+++GKVNGAVETCGESFHRFD YLNFSNP+IAKNFDP AC
Sbjct: 469  MVFLDDDIVVKKDLTGLWSINMKGKVNGAVETCGESFHRFDHYLNFSNPVIAKNFDPHAC 528

Query: 1758 GWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLNKSWH 1937
            GWA+GMNVFDLAEW+RQ IT++YHSWQKLN+DR LWKLGTLPPGLITFW +TFPL++SWH
Sbjct: 529  GWAFGMNVFDLAEWKRQNITQIYHSWQKLNQDRSLWKLGTLPPGLITFWNKTFPLSRSWH 588

Query: 1938 VLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDCNINP 2117
            VLGLGYNP+VN +DI+RAAV+HYNGNMKPWLEIG+PKFR+YW++Y+DYDQ +L++CNINP
Sbjct: 589  VLGLGYNPHVNSRDIERAAVIHYNGNMKPWLEIGLPKFRSYWSRYLDYDQHFLRECNINP 648


>ref|XP_002272372.2| PREDICTED: uncharacterized protein LOC100258406 [Vitis vinifera]
          Length = 1286

 Score =  862 bits (2226), Expect = 0.0
 Identities = 434/652 (66%), Positives = 522/652 (80%), Gaps = 4/652 (0%)
 Frame = +3

Query: 174  VLVLLCVTVLAPIVLYTDMLA-STSNSIQNSD-FTGDASNLSAGKEVQRLHALPLESSDA 347
            +L LL VTVL+PIVLYTD L  S   S   +D F  D + L+ G    +L+ LP ESS  
Sbjct: 655  LLFLLLVTVLSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNLLPQESSTT 714

Query: 348  LKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATNER-TQSERIGA 524
            LKEPIGIVY DN S     SA        ++L LGGS E K+RVLS T E   +S+R   
Sbjct: 715  LKEPIGIVYSDNDSLDVDESA--------ADLQLGGSVEHKTRVLSTTYEEGDRSQRENP 766

Query: 525  IKQVTSREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEI-QGSXXXXXXXXRPPKVASRR 701
            I+QVT  +  N        ++   E+ S  A   + TE  Q S            V +R 
Sbjct: 767  IRQVTDGKDDN--------LQRGSELTSHNASQNSETEHGQQSAQTSGKGDHKEPVKTRN 818

Query: 702  HQILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELRLRVRDVQRALGD 881
             + +D      V+ D +V+ LKDQLIRAKV+LSL ++R N HFIRELR R+++VQRALGD
Sbjct: 819  EKPID----QTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARMKEVQRALGD 874

Query: 882  ATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVEDQLQVHKKQTLFL 1061
            ATKDSELP+N  EK++ MEQTL+KGKQIQDDC+AVVKKLRA+LH+ E+QL+VHKKQT++L
Sbjct: 875  ATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLRVHKKQTMYL 934

Query: 1062 TQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYALFSDNILAASVV 1241
            TQL AKTLPKGLHCL LRLSTEYY +D +++QFP+Q+KLEDP+LFHYALFSDNILAA+VV
Sbjct: 935  TQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFSDNILAAAVV 994

Query: 1242 VNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIEEFTWLNASYSPV 1421
            VNSTV NAK+ + HVFHIV+DRLNYAAMRMWFLANPP  ATIQVQNI+EFTWLN+SYSPV
Sbjct: 995  VNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFTWLNSSYSPV 1054

Query: 1422 MEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFPKLDKVVFLDDDI 1601
            ++QLGS SMID+YF+ +R+NSD+NLKFRNPKYLS+LNHLRFYLPEIFPKL+KV+FLDDDI
Sbjct: 1055 LKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDI 1114

Query: 1602 VVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFDPRACGWAYGMNV 1781
            VVQ+DLT LWSIDL+G VNGAVETCGESFHRFDRYLNFSNPLI+KNFD  ACGWAYGMN+
Sbjct: 1115 VVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHACGWAYGMNI 1174

Query: 1782 FDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLNKSWHVLGLGYNP 1961
            FDL +W++Q ITEVYH+WQKLN DR LWKLGTLPPGLITFWK+TFP+++SWHVLGLGYNP
Sbjct: 1175 FDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTFPIDRSWHVLGLGYNP 1234

Query: 1962 NVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDCNINP 2117
            +VN+++I+RAAV+HYNGN+KPWLEIG+PKFRNYWAK+ D+D +YL+DCNINP
Sbjct: 1235 SVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNINP 1286


>ref|XP_003573698.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 1
            [Brachypodium distachyon]
          Length = 660

 Score =  858 bits (2216), Expect = 0.0
 Identities = 420/664 (63%), Positives = 515/664 (77%), Gaps = 5/664 (0%)
 Frame = +3

Query: 141  RSERKMVK--RKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLSAG-KEVQ 311
            R+   MV+  R  +L LL +TVL+P+VLYT  L+   N IQ  D  G+ +N   G K   
Sbjct: 12   RNNSAMVRTGRSVLLFLLALTVLSPLVLYTRRLSVALNPIQRKDLPGEIANQGLGVKASS 71

Query: 312  RLHALPLESSDALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSAT 491
            RL+ALPLE+  +LKEP+G+V+ +   +    S  S D       P   +    S V +A 
Sbjct: 72   RLNALPLETVSSLKEPVGVVFSEEPRDLSNESIESKD---QESTPRKKANRALSEVTAAD 128

Query: 492  NERTQSERIGAIKQVTSREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEIQGSXXXXXXX 671
               ++ +  G I QVT +E  +  L +  + +  K  GSQQ     ++ ++         
Sbjct: 129  GAGSKED--GLIDQVTRQEGQDGSLVSSSIDQQEKATGSQQQSSSEASSLE--------- 177

Query: 672  XRPPKVASRRHQ--ILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELR 845
              P KV     Q    D +  NM +PD +VRN+KDQLI+AKVYL LG+ R N  ++R+LR
Sbjct: 178  -TPAKVLVENPQKESTDVKSKNMALPDTRVRNIKDQLIKAKVYLGLGAIRANSQYLRDLR 236

Query: 846  LRVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVED 1025
             R+R+VQ+ LGDATKDS+LP+N NEK++A+EQTL KGKQ QDDCS VVKKLRAMLH+ E+
Sbjct: 237  QRIREVQKVLGDATKDSDLPKNANEKVKALEQTLIKGKQTQDDCSVVVKKLRAMLHSAEE 296

Query: 1026 QLQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYA 1205
            QL   KKQT+FLTQLAAKTLPKGLHCL LRL+ EY+ +D  ++QFP+ EKL+DPKL+HYA
Sbjct: 297  QLLAQKKQTVFLTQLAAKTLPKGLHCLPLRLANEYFSLDSVQQQFPNHEKLDDPKLYHYA 356

Query: 1206 LFSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIE 1385
            LFSDNILA +VVVNSTV NAK+ + HVFHIVTDRLNYA M+MWFL+NPP  ATI+VQNI+
Sbjct: 357  LFSDNILATAVVVNSTVLNAKHPSRHVFHIVTDRLNYAPMKMWFLSNPPGKATIEVQNID 416

Query: 1386 EFTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFP 1565
            EFTWLN +YSPV++QLGSQSMID+YFR  RANSD+NLK+RNPKYLSMLNHLRFYLPEI+P
Sbjct: 417  EFTWLNETYSPVLKQLGSQSMIDYYFRAQRANSDSNLKYRNPKYLSMLNHLRFYLPEIYP 476

Query: 1566 KLDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFD 1745
            KLDK+VFLDDD+VV++DLT LWSID++GKVNGAVETCGESFHRFDRYLNFSNP+IAKNFD
Sbjct: 477  KLDKMVFLDDDVVVKKDLTGLWSIDMKGKVNGAVETCGESFHRFDRYLNFSNPVIAKNFD 536

Query: 1746 PRACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLN 1925
            P ACGWA+GMNVFDLAEWRRQ ITE+YHSWQKLN DRLLWKLGTLPPGLITFW +TFPLN
Sbjct: 537  PHACGWAFGMNVFDLAEWRRQDITEIYHSWQKLNEDRLLWKLGTLPPGLITFWNKTFPLN 596

Query: 1926 KSWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDC 2105
            +SWHVLGLGYNP+VN +DI+RAAV+HYNGNMKPWLEIG+PKFR+YW+KY+ YDQ +L++C
Sbjct: 597  RSWHVLGLGYNPHVNSRDIERAAVIHYNGNMKPWLEIGLPKFRSYWSKYLYYDQPFLREC 656

Query: 2106 NINP 2117
            NINP
Sbjct: 657  NINP 660


>gb|EOY29053.1| Galacturonosyltransferase 4 isoform 2 [Theobroma cacao]
          Length = 624

 Score =  852 bits (2202), Expect = 0.0
 Identities = 423/655 (64%), Positives = 514/655 (78%), Gaps = 1/655 (0%)
 Frame = +3

Query: 156  MVKRKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLSAGKEVQRLHALPLE 335
            M  R  VL LL VTV+API LYTD +A+ + S    DF  D +  +   + +RL+ LP E
Sbjct: 1    MKVRHLVLGLLSVTVIAPIFLYTDRVATFNPSSSGRDFLDDVATFTLLGDTRRLNVLPQE 60

Query: 336  SSDALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATNERTQSER 515
            +S A+KEP GIVY D+S+NS +   R                   +RVLSAT+E  Q + 
Sbjct: 61   TSTAIKEPAGIVYSDHSNNSFRKETREHKS---------------TRVLSATDEERQPQL 105

Query: 516  IGAIKQVTSREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEIQGSXXXXXXXXRPPKVAS 695
               I+QVT    +N  L  P     N        ++   T++ G+            +  
Sbjct: 106  HNPIRQVTDPAPAN--LTTPLDSHPNASHHLGTKLEQQPTQLAGN------------IDQ 151

Query: 696  RRHQILDTQRSNMVMP-DVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELRLRVRDVQRA 872
            + H   D + S +  P D +VR+LKDQLIRAKVYLSL + ++N H  RELRLR+++V RA
Sbjct: 152  KEHS--DNKTSRLAEPVDAQVRHLKDQLIRAKVYLSLPAIKSNQHVTRELRLRIKEVSRA 209

Query: 873  LGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVEDQLQVHKKQT 1052
            LGDATKDS+LP+N  +K++AMEQ+L KGKQIQDDC+AVVKKLRAMLH+ E+QL+VHKKQT
Sbjct: 210  LGDATKDSDLPKNAFDKLKAMEQSLEKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKKQT 269

Query: 1053 LFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYALFSDNILAA 1232
            +FLTQL AKTLPKGLHCL LRL+TEYY ++ S++ F +QEKLEDP+L+HYALFSDN+LAA
Sbjct: 270  MFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQNFLNQEKLEDPRLYHYALFSDNVLAA 329

Query: 1233 SVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIEEFTWLNASY 1412
            +VVVNSTV +AK+ ++HVFHIVTDRLNYAAMRMWFL NPP  ATIQVQNIEEFTWLN+SY
Sbjct: 330  AVVVNSTVSHAKHPSNHVFHIVTDRLNYAAMRMWFLNNPPGKATIQVQNIEEFTWLNSSY 389

Query: 1413 SPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFPKLDKVVFLD 1592
            SPV++QLGS SMID+YFR +RANSD+NLKFRNPKYLS+LNHLRFYLPEIFPKL+KV+FLD
Sbjct: 390  SPVLKQLGSPSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLD 449

Query: 1593 DDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFDPRACGWAYG 1772
            DDIVV++D++ LWS+DL+G VNGAVETCGESFHRFDRYLNFSNPLI+KNFDP ACGWAYG
Sbjct: 450  DDIVVRKDISGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPHACGWAYG 509

Query: 1773 MNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLNKSWHVLGLG 1952
            MN+FDL EWRRQ ITEVYH WQKLN DR LWKLGTLPPGLITFWK+T+PL++SWHVLGLG
Sbjct: 510  MNIFDLEEWRRQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKRTYPLDRSWHVLGLG 569

Query: 1953 YNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDCNINP 2117
            YNPNVNQ++++RAAV+HYNGN+KPWLEIGIPK++NYWAKYVDYD  YL+DCNINP
Sbjct: 570  YNPNVNQREVERAAVIHYNGNLKPWLEIGIPKYKNYWAKYVDYDNMYLRDCNINP 624


>ref|NP_001061555.1| Os08g0327100 [Oryza sativa Japonica Group]
            gi|38423965|dbj|BAD01674.1| putative glycosyltransferase
            [Oryza sativa Japonica Group] gi|38637194|dbj|BAD03445.1|
            putative glycosyltransferase [Oryza sativa Japonica
            Group] gi|113623524|dbj|BAF23469.1| Os08g0327100 [Oryza
            sativa Japonica Group] gi|222640351|gb|EEE68483.1|
            hypothetical protein OsJ_26894 [Oryza sativa Japonica
            Group]
          Length = 643

 Score =  852 bits (2201), Expect = 0.0
 Identities = 413/654 (63%), Positives = 517/654 (79%), Gaps = 2/654 (0%)
 Frame = +3

Query: 162  KRKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLSAGKEVQRLHALPLESS 341
            +R  +L+LL +TVL+P+VLYT  L++  N  Q  D  G+  N   G +  +L+ALPLE+ 
Sbjct: 6    RRSVLLLLLALTVLSPLVLYTRRLSAALNPNQRRDLPGEIVNQGRGVKASKLNALPLETV 65

Query: 342  DALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATN--ERTQSER 515
             +LKEP+GIV+ + S  S    A  S    + E  L  +GE K+RVLS     +  +SE 
Sbjct: 66   GSLKEPVGIVFSEESRES----ASKSTEPDSQEFLLRKAGEHKNRVLSEATAADSARSED 121

Query: 516  IGAIKQVTSREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEIQGSXXXXXXXXRPPKVAS 695
               I+QVTS++  +DGL           V  QQ   +T+   Q S          P+  S
Sbjct: 122  DDLIEQVTSKDGEDDGL-------ATVSVDQQQ---ITTASQQRSASEASSLENVPEQTS 171

Query: 696  RRHQILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELRLRVRDVQRAL 875
              + +   +   ++  D ++RN++D LI+AKVYL LG+ R NP ++++LR R+R+VQ+ L
Sbjct: 172  MENSLEGNKDGALL--DTRIRNIRDLLIKAKVYLGLGAIRANPQYLKDLRQRIREVQKVL 229

Query: 876  GDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVEDQLQVHKKQTL 1055
            GDA+KDS+LP+N NEK++ +EQTL KGK +QDDCS VVKKLRAMLH+ E+QL  HKKQT+
Sbjct: 230  GDASKDSDLPKNANEKVKTLEQTLIKGKLMQDDCSVVVKKLRAMLHSAEEQLHAHKKQTV 289

Query: 1056 FLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYALFSDNILAAS 1235
            FLTQLAAKTLPKGLHCL LRL+ EY+L+DPS +QFP++EKL+DPKL+HYALFSDNILAA+
Sbjct: 290  FLTQLAAKTLPKGLHCLPLRLANEYFLLDPSHQQFPNKEKLDDPKLYHYALFSDNILAAA 349

Query: 1236 VVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIEEFTWLNASYS 1415
            VVVNSTV NAK+ + HVFHIVTDRLNYA MRMWFL+NPP  ATI+V+NIEEFTWLNASYS
Sbjct: 350  VVVNSTVLNAKHPSHHVFHIVTDRLNYAPMRMWFLSNPPGKATIEVRNIEEFTWLNASYS 409

Query: 1416 PVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFPKLDKVVFLDD 1595
            PV++QL SQSMID+YFRT+RANSD+NLK+RNPKYLS+LNHLRFYLPEI+P L K+VFLDD
Sbjct: 410  PVLKQLESQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIYPNLHKIVFLDD 469

Query: 1596 DIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFDPRACGWAYGM 1775
            D+V+++DLT+LWSID++GKV G VETCGESFHRFDRYLNFSNP+I KNFDP ACGWA+GM
Sbjct: 470  DVVIKKDLTSLWSIDMKGKVIGVVETCGESFHRFDRYLNFSNPVIVKNFDPHACGWAFGM 529

Query: 1776 NVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLNKSWHVLGLGY 1955
            NVFDLAEWRRQ ITE+YHSWQKLN+DRLLWKLGTLPPGLITFW +T PLN+SWHVLGLGY
Sbjct: 530  NVFDLAEWRRQNITEIYHSWQKLNQDRLLWKLGTLPPGLITFWNKTLPLNRSWHVLGLGY 589

Query: 1956 NPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDCNINP 2117
            NP+V+ +DI+RAAV+HYNGNMKPWLEIG+PKFRNYW+ Y+DYDQ +L++CNINP
Sbjct: 590  NPHVSSRDIERAAVIHYNGNMKPWLEIGLPKFRNYWSAYLDYDQPFLRECNINP 643


>gb|EOY29052.1| Galacturonosyltransferase 4 isoform 1 [Theobroma cacao]
          Length = 626

 Score =  851 bits (2199), Expect = 0.0
 Identities = 422/655 (64%), Positives = 515/655 (78%), Gaps = 1/655 (0%)
 Frame = +3

Query: 156  MVKRKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLSAGKEVQRLHALPLE 335
            M  R  VL LL VTV+API LYTD +A+ + S    DF  D +  +   + +RL+ LP E
Sbjct: 1    MKVRHLVLGLLSVTVIAPIFLYTDRVATFNPSSSGRDFLDDVATFTLLGDTRRLNVLPQE 60

Query: 336  SSDALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATNERTQSER 515
            +S A+KEP GIVY D+S+NS +    + +                +RVLSAT+E  Q + 
Sbjct: 61   TSTAIKEPAGIVYSDHSNNSFRKVTETRE-------------HKSTRVLSATDEERQPQL 107

Query: 516  IGAIKQVTSREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEIQGSXXXXXXXXRPPKVAS 695
               I+QVT    +N  L  P     N        ++   T++ G+            +  
Sbjct: 108  HNPIRQVTDPAPAN--LTTPLDSHPNASHHLGTKLEQQPTQLAGN------------IDQ 153

Query: 696  RRHQILDTQRSNMVMP-DVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELRLRVRDVQRA 872
            + H   D + S +  P D +VR+LKDQLIRAKVYLSL + ++N H  RELRLR+++V RA
Sbjct: 154  KEHS--DNKTSRLAEPVDAQVRHLKDQLIRAKVYLSLPAIKSNQHVTRELRLRIKEVSRA 211

Query: 873  LGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVEDQLQVHKKQT 1052
            LGDATKDS+LP+N  +K++AMEQ+L KGKQIQDDC+AVVKKLRAMLH+ E+QL+VHKKQT
Sbjct: 212  LGDATKDSDLPKNAFDKLKAMEQSLEKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKKQT 271

Query: 1053 LFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYALFSDNILAA 1232
            +FLTQL AKTLPKGLHCL LRL+TEYY ++ S++ F +QEKLEDP+L+HYALFSDN+LAA
Sbjct: 272  MFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQNFLNQEKLEDPRLYHYALFSDNVLAA 331

Query: 1233 SVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIEEFTWLNASY 1412
            +VVVNSTV +AK+ ++HVFHIVTDRLNYAAMRMWFL NPP  ATIQVQNIEEFTWLN+SY
Sbjct: 332  AVVVNSTVSHAKHPSNHVFHIVTDRLNYAAMRMWFLNNPPGKATIQVQNIEEFTWLNSSY 391

Query: 1413 SPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFPKLDKVVFLD 1592
            SPV++QLGS SMID+YFR +RANSD+NLKFRNPKYLS+LNHLRFYLPEIFPKL+KV+FLD
Sbjct: 392  SPVLKQLGSPSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLD 451

Query: 1593 DDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFDPRACGWAYG 1772
            DDIVV++D++ LWS+DL+G VNGAVETCGESFHRFDRYLNFSNPLI+KNFDP ACGWAYG
Sbjct: 452  DDIVVRKDISGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPHACGWAYG 511

Query: 1773 MNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLNKSWHVLGLG 1952
            MN+FDL EWRRQ ITEVYH WQKLN DR LWKLGTLPPGLITFWK+T+PL++SWHVLGLG
Sbjct: 512  MNIFDLEEWRRQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKRTYPLDRSWHVLGLG 571

Query: 1953 YNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDCNINP 2117
            YNPNVNQ++++RAAV+HYNGN+KPWLEIGIPK++NYWAKYVDYD  YL+DCNINP
Sbjct: 572  YNPNVNQREVERAAVIHYNGNLKPWLEIGIPKYKNYWAKYVDYDNMYLRDCNINP 626


>ref|XP_003573699.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 2
            [Brachypodium distachyon]
          Length = 660

 Score =  850 bits (2196), Expect = 0.0
 Identities = 417/664 (62%), Positives = 513/664 (77%), Gaps = 5/664 (0%)
 Frame = +3

Query: 141  RSERKMVK--RKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLSAG-KEVQ 311
            R+   MV+  R  +L LL +TVL+P+VLYT  L+   N IQ  D  G+ +N   G K   
Sbjct: 12   RNNSAMVRTGRSVLLFLLALTVLSPLVLYTRRLSVALNPIQRKDLPGEIANQGLGVKASS 71

Query: 312  RLHALPLESSDALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSAT 491
            RL+ALPLE+  +LKEP+G+V+ +   +    S  S D       P   +    S V +A 
Sbjct: 72   RLNALPLETVSSLKEPVGVVFSEEPRDLSNESIESKD---QESTPRKKANRALSEVTAAD 128

Query: 492  NERTQSERIGAIKQVTSREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEIQGSXXXXXXX 671
               ++ +  G I QVT +E  +  L +  + +  K  GSQQ     ++ ++         
Sbjct: 129  GAGSKED--GLIDQVTRQEGQDGSLVSSSIDQQEKATGSQQQSSSEASSLE--------- 177

Query: 672  XRPPKVASRRHQ--ILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELR 845
              P KV     Q    D +  NM +PD +VRN+KDQLI+AKVYL LG+ R N  ++R+LR
Sbjct: 178  -TPAKVLVENPQKESTDVKSKNMALPDTRVRNIKDQLIKAKVYLGLGAIRANSQYLRDLR 236

Query: 846  LRVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVED 1025
             R+R+VQ+ LGDATKDS+LP+N NEK++A+EQTL KGKQ QDDCS VVKKLRAMLH+ E+
Sbjct: 237  QRIREVQKVLGDATKDSDLPKNANEKVKALEQTLIKGKQTQDDCSVVVKKLRAMLHSAEE 296

Query: 1026 QLQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYA 1205
            QL   KKQT+FLTQLAAKTLPKGLHCL LRL+ EY+ +D  ++QFP+ EKL+DPKL+HYA
Sbjct: 297  QLLAQKKQTVFLTQLAAKTLPKGLHCLPLRLANEYFSLDSVQQQFPNHEKLDDPKLYHYA 356

Query: 1206 LFSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIE 1385
            LFSDNILA +VVVNSTV NAK+ + HVFHIVTDRLNYA M+MWFL+NPP  ATI+VQNI+
Sbjct: 357  LFSDNILATAVVVNSTVLNAKHPSRHVFHIVTDRLNYAPMKMWFLSNPPGKATIEVQNID 416

Query: 1386 EFTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFP 1565
            EFTWLN +YSPV++QLGSQSMID+YFR  RANSD+NLK+RNPKYLSMLNHLRFYLPEI+P
Sbjct: 417  EFTWLNETYSPVLKQLGSQSMIDYYFRAQRANSDSNLKYRNPKYLSMLNHLRFYLPEIYP 476

Query: 1566 KLDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFD 1745
            KLDK+VFLDDD+VV++DLT LWSID++GKVNGAVETCGESFHRFDRYLNFSNP+IAKNFD
Sbjct: 477  KLDKMVFLDDDVVVKKDLTGLWSIDMKGKVNGAVETCGESFHRFDRYLNFSNPVIAKNFD 536

Query: 1746 PRACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLN 1925
            P ACGWA+GMNVFDLAEWRRQ ITE+YHSWQKL+   LLWKLGTLPPGLITFW +TFPLN
Sbjct: 537  PHACGWAFGMNVFDLAEWRRQDITEIYHSWQKLSSGLLLWKLGTLPPGLITFWNKTFPLN 596

Query: 1926 KSWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDC 2105
            +SWHVLGLGYNP+VN +DI+RAAV+HYNGNMKPWLEIG+PKFR+YW+KY+ YDQ +L++C
Sbjct: 597  RSWHVLGLGYNPHVNSRDIERAAVIHYNGNMKPWLEIGLPKFRSYWSKYLYYDQPFLREC 656

Query: 2106 NINP 2117
            NINP
Sbjct: 657  NINP 660


>ref|XP_002444200.1| hypothetical protein SORBIDRAFT_07g014890 [Sorghum bicolor]
            gi|241940550|gb|EES13695.1| hypothetical protein
            SORBIDRAFT_07g014890 [Sorghum bicolor]
          Length = 648

 Score =  849 bits (2193), Expect = 0.0
 Identities = 414/658 (62%), Positives = 515/658 (78%), Gaps = 7/658 (1%)
 Frame = +3

Query: 165  RKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLSAGKEVQRLHALPLESSD 344
            R  +L+LL +TVL+P+ LYT  L +  + IQ  DF G+ +N   G +  +L+ALPLE+  
Sbjct: 7    RLVLLLLLALTVLSPLALYTSRLPAALSPIQTQDFPGEITNQGRGGKADKLNALPLETVS 66

Query: 345  ALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATNERTQSERIGA 524
            +LKEP+GIV+ +  + S+           + +LPL   GE KSR+LS          + A
Sbjct: 67   SLKEPVGIVFSEELTESK-----------SQDLPLTKVGEHKSRMLSEVTVAADGTTLKA 115

Query: 525  ---IKQVTSREHSNDGLKNPKLV--EGNKEVGSQQAVDLTSTEIQGSXXXXXXXXRPPKV 689
               I+QVT+ E  +  L     +  E  K +GSQQ      +  + S         P KV
Sbjct: 116  DEVIEQVTTLEPQDGSLVKGAGISDEQEKNIGSQQ-----QSSSEESSQDTMLKQTPEKV 170

Query: 690  ASRRHQILDTQRSNM--VMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELRLRVRDV 863
                 Q   T       V+PDV++RN+KDQLI+AKVYL LGS R N  ++++LR R+R+V
Sbjct: 171  IVENSQSAKTDGKTKITVLPDVRIRNIKDQLIKAKVYLGLGSIRANSQYLKDLRQRIREV 230

Query: 864  QRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVEDQLQVHK 1043
            Q+ LGDA+KDS+L +N NEK++A+EQ L KGKQ+QDDCS VVKKLRAMLH+ E+QL  HK
Sbjct: 231  QKVLGDASKDSDLLKNANEKVKALEQMLIKGKQMQDDCSIVVKKLRAMLHSAEEQLHAHK 290

Query: 1044 KQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYALFSDNI 1223
            KQT+FLTQLAAKTLPKGLHCL LRL+ EY+ +DP R+QFP+Q+KL +PKL+HYALFSDNI
Sbjct: 291  KQTVFLTQLAAKTLPKGLHCLPLRLANEYFSLDPVRQQFPNQQKLINPKLYHYALFSDNI 350

Query: 1224 LAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIEEFTWLN 1403
            LA +VVVNSTV NAK+ +DHVFHIVTD+LNYA MRMWFL+NPP  ATI+VQ+I EFTWLN
Sbjct: 351  LATAVVVNSTVLNAKHPSDHVFHIVTDKLNYAPMRMWFLSNPPGKATIEVQHIGEFTWLN 410

Query: 1404 ASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFPKLDKVV 1583
             SYSPV++QLGS SMID+YF TNRANSD+NLK+RNPKYLS+LNHLRFYLPEI+PKLDK+V
Sbjct: 411  DSYSPVLKQLGSPSMIDYYFGTNRANSDSNLKYRNPKYLSILNHLRFYLPEIYPKLDKMV 470

Query: 1584 FLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFDPRACGW 1763
            FLDDDIVV++DLT LWSI+++GKVNGAVETCGESFHR+DRYLNFSNP+IAK+FDP ACGW
Sbjct: 471  FLDDDIVVKKDLTGLWSINMKGKVNGAVETCGESFHRYDRYLNFSNPIIAKSFDPHACGW 530

Query: 1764 AYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLNKSWHVL 1943
            A+GMNVFDLAEWRRQ IT++YHSWQKLN DR LWKLGTLPPGLITFW +TFPL++SWHVL
Sbjct: 531  AFGMNVFDLAEWRRQNITQIYHSWQKLNEDRSLWKLGTLPPGLITFWNKTFPLSRSWHVL 590

Query: 1944 GLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDCNINP 2117
            GLGYNP+VN +DI+RAAV+HYNGNMKPWLEIG+PK+R+YW+KY+DYDQ +L++CNINP
Sbjct: 591  GLGYNPHVNSRDIERAAVIHYNGNMKPWLEIGLPKYRSYWSKYLDYDQSFLRECNINP 648


>ref|XP_006857626.1| hypothetical protein AMTR_s00061p00126570 [Amborella trichopoda]
            gi|548861722|gb|ERN19093.1| hypothetical protein
            AMTR_s00061p00126570 [Amborella trichopoda]
          Length = 672

 Score =  845 bits (2184), Expect = 0.0
 Identities = 411/669 (61%), Positives = 517/669 (77%), Gaps = 18/669 (2%)
 Frame = +3

Query: 165  RKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLSAGKEVQRLHALPLESSD 344
            R PVL+LLC +VLAPIVLYTD L S S+SI  + F+ + S ++ G+++ +L  LP ES +
Sbjct: 4    RMPVLLLLCFSVLAPIVLYTDRLGSFSSSIAKAGFSEEFSPINYGRDINKLKVLPQESVN 63

Query: 345  ALKEPIGIVYYDNSSNSEQISARSSDGITASEL------PLGGSGELKSRVLSATNERTQ 506
            ALKEP G+VY  +   SE IS +    +  S +      PL     ++  +     E   
Sbjct: 64   ALKEPSGVVYLSDKDPSEAISVKEEPKMARSRVLQSNVKPLEVETHIEQVIDKVHREEKN 123

Query: 507  SERIGAIKQVTSREHSNDGL--KNPKLVEGNKE--VGSQQAVDLTSTEIQGSXXXXXXXX 674
             + I    Q  + E S   L   N + +   +E   G Q A       +           
Sbjct: 124  GQEIAGDSQAETIEESQQVLLQSNEQKIGAKREEQFGHQDASIKEEIGLSSRTDAEKQEP 183

Query: 675  RPPKVASRRHQ-------ILDTQRSNMV-MPDVKVRNLKDQLIRAKVYLSLGSSRTNPHF 830
              P++ S +           + Q  N   MPD +V +L+DQLI+AKVYLSLG++R+NPHF
Sbjct: 184  DKPEIESGKSDPDGPSQPSPERQNDNKKPMPDARVHHLRDQLIKAKVYLSLGTTRSNPHF 243

Query: 831  IRELRLRVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAML 1010
            I+ELR+R+R+VQRALGDATKDSELPR   +K++AME+TL+KGKQIQDDC+AV+KKLRA+L
Sbjct: 244  IKELRVRIREVQRALGDATKDSELPRGAYDKLKAMEETLAKGKQIQDDCAAVIKKLRAIL 303

Query: 1011 HTVEDQLQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPK 1190
            H+ E+QL+VHKKQ++FL QL+AKTLPKGLHCL LRL+TEYY ++ +++QFP+QEKLE+P 
Sbjct: 304  HSTEEQLRVHKKQSMFLMQLSAKTLPKGLHCLPLRLTTEYYSLNSTQQQFPNQEKLENPN 363

Query: 1191 LFHYALFSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQ 1370
            ++HYALFSDN+LAA+VVVNSTV NA++  +HVFHIVTDRLNYAAMRMWF+ANPP  ATIQ
Sbjct: 364  IYHYALFSDNVLAAAVVVNSTVSNARDPRNHVFHIVTDRLNYAAMRMWFIANPPGKATIQ 423

Query: 1371 VQNIEEFTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYL 1550
            VQ++EEFTWLN+SYSPV++QLGS SMID+YFRT+RAN D+NLK+RNPKYLS+LNHLRFY+
Sbjct: 424  VQSVEEFTWLNSSYSPVLKQLGSTSMIDYYFRTHRANPDSNLKYRNPKYLSILNHLRFYM 483

Query: 1551 PEIFPKLDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLI 1730
            PEIFPKL KV+FLDDDIVVQRDLT LW IDL+GK+NGAVETC ESFHRFDRYLNFSNPLI
Sbjct: 484  PEIFPKLHKVLFLDDDIVVQRDLTQLWKIDLKGKINGAVETCRESFHRFDRYLNFSNPLI 543

Query: 1731 AKNFDPRACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQ 1910
            +KNF+  ACGWA+GMN+FDL EW++Q+ITE+YHSWQKLN DR LWKLGTLPPGLITF+ +
Sbjct: 544  SKNFEAHACGWAFGMNIFDLKEWKKQEITEIYHSWQKLNNDRQLWKLGTLPPGLITFYNR 603

Query: 1911 TFPLNKSWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQD 2090
            TFPLN+ WHVLGLGY+P+VNQ+DI RAA +HYNGN+KPWLEIG+PKFR YW KY++Y+Q 
Sbjct: 604  TFPLNRGWHVLGLGYDPSVNQRDIQRAAAIHYNGNLKPWLEIGLPKFRGYWQKYINYNQP 663

Query: 2091 YLQDCNINP 2117
            YLQDCNINP
Sbjct: 664  YLQDCNINP 672


>gb|ESW19558.1| hypothetical protein PHAVU_006G135100g [Phaseolus vulgaris]
          Length = 661

 Score =  842 bits (2176), Expect = 0.0
 Identities = 420/665 (63%), Positives = 511/665 (76%), Gaps = 14/665 (2%)
 Frame = +3

Query: 165  RKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLS-AGKEVQRLHALPLESS 341
            R  VL+LLC+TV+APIVLYTD L +        +F  D +  + +  +   L+ LP E+S
Sbjct: 5    RNIVLLLLCITVVAPIVLYTDRLGTFEPPSNKQEFVEDVTAFAFSAADSSHLNLLPQETS 64

Query: 342  DALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATNERTQSERIG 521
              +KEP+ +VY +  S + +        + + E        L +RVLS T +  Q+++  
Sbjct: 65   TTVKEPVRVVYTEEDSTNRRDFTPGLQLVKSME-------HLSARVLSTTTDEDQTKKEN 117

Query: 522  AIKQVTS--REHSNDGLKNPKLVEGNKEVGSQQAVDL----------TSTEIQGSXXXXX 665
             IK VT   R+ +  G    K   G + V  + A+D+          TS   Q       
Sbjct: 118  PIKLVTGGIRQGNQVGGSLDKGATG-ENVNGEAAIDVDDNDGKLATSTSVSTQDPEIKEQ 176

Query: 666  XXXRPPKVASRRHQILDTQRSNMVMP-DVKVRNLKDQLIRAKVYLSLGSSRTNPHFIREL 842
                  KV  +  ++ +T + N   P D +V+ LKDQLI+AKV+LSL   ++NPH  REL
Sbjct: 177  ATEASRKVNHKGSRVSETNKHNDQTPSDYRVKQLKDQLIQAKVFLSLPVVKSNPHLTREL 236

Query: 843  RLRVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVE 1022
            RLRV+DV R LGDA+KDS+LPRN NE+M+AMEQTL KGKQ QDDC+AVVKKLRAMLH+ E
Sbjct: 237  RLRVKDVTRTLGDASKDSDLPRNANERMKAMEQTLMKGKQAQDDCAAVVKKLRAMLHSTE 296

Query: 1023 DQLQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHY 1202
            +QL + KKQTLFLTQL AKTLPKGLHCL LRL+TEYY M+ S++QFP+QE +EDP+L+HY
Sbjct: 297  EQLHILKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYNMNSSQQQFPNQENIEDPQLYHY 356

Query: 1203 ALFSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNI 1382
            A+FSDNILA +VVVNSTV NAK+++ HVFH+VTDRLNYAAMRMWFL NPP  ATIQVQNI
Sbjct: 357  AIFSDNILATTVVVNSTVSNAKDASKHVFHVVTDRLNYAAMRMWFLVNPPGKATIQVQNI 416

Query: 1383 EEFTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIF 1562
            E+FTWLN+SYSPV++QLGSQSMID+YF+ +RA SD+NLKFRNPKYLS+LNHLRFYLPEIF
Sbjct: 417  EDFTWLNSSYSPVLKQLGSQSMIDYYFKAHRATSDSNLKFRNPKYLSILNHLRFYLPEIF 476

Query: 1563 PKLDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNF 1742
            PKL+KV+FLDDDIVVQ+DLT LWSIDL+G VNGAVETCGESFHRFDRYLNFS+PLIAKNF
Sbjct: 477  PKLNKVLFLDDDIVVQKDLTDLWSIDLKGNVNGAVETCGESFHRFDRYLNFSHPLIAKNF 536

Query: 1743 DPRACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPL 1922
            DP ACGWAYGMN+FDL +W+RQKITEVYH+WQ LNRDR LWKLGTLPPGLITFWK+TFPL
Sbjct: 537  DPHACGWAYGMNIFDLVQWKRQKITEVYHNWQHLNRDRQLWKLGTLPPGLITFWKRTFPL 596

Query: 1923 NKSWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQD 2102
            N+SWHVLGLGYNPNV QKDI+++AVVHYNGNMKPWLEI IPKFR+YW KYVDY+  YL++
Sbjct: 597  NRSWHVLGLGYNPNVGQKDIEQSAVVHYNGNMKPWLEISIPKFRSYWTKYVDYNHVYLRE 656

Query: 2103 CNINP 2117
            CNINP
Sbjct: 657  CNINP 661


>ref|XP_006597630.1| PREDICTED: probable galacturonosyltransferase 4-like [Glycine max]
          Length = 657

 Score =  841 bits (2173), Expect = 0.0
 Identities = 421/666 (63%), Positives = 509/666 (76%), Gaps = 15/666 (2%)
 Frame = +3

Query: 165  RKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLS-AGKEVQRLHALPLESS 341
            R  VL+LLCVTV+APIVLYTD L +  +     +F  D +  + +  +   L+ LP E+S
Sbjct: 5    RNIVLLLLCVTVVAPIVLYTDRLGTFESPSNKQEFIEDVTAFTFSAADSSHLNLLPQETS 64

Query: 342  DALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATNERTQSERIG 521
             A+KEP+  VY +  S + +   +    + + E        + +R+LS T E  Q++   
Sbjct: 65   TAVKEPVRAVYTEEDSTNRRNLPQGLQLVESRE-------HVSARMLSTTTEEDQTKNEN 117

Query: 522  AIKQVTSREHSNDGLKNPKLVEGNKE-VGSQQAVDLTSTEIQGSXXXXXXXXRPP----- 683
             IK VT      DG+K     + + E V  + A+D+   + + +         P      
Sbjct: 118  PIKLVT------DGIKQGNQGDASGENVNREDAIDVDDNDGKLAKSTSASTQEPQLKEQQ 171

Query: 684  -------KVASRRHQILDTQRSNMVMP-DVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRE 839
                    +  +   + +T + N   P D +V+ LKDQLI+AKVYLSL   ++NPH  RE
Sbjct: 172  QATETSSNINHKGSGLSETNKQNDQPPSDARVKQLKDQLIQAKVYLSLPVVKSNPHLTRE 231

Query: 840  LRLRVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTV 1019
            LRLRV++V R LGDA+KDS+LP+N NE+MRAMEQTL KGKQ QDDC+AVVKKLRAMLH+ 
Sbjct: 232  LRLRVKEVSRTLGDASKDSDLPKNANERMRAMEQTLMKGKQAQDDCAAVVKKLRAMLHST 291

Query: 1020 EDQLQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFH 1199
            E+QL V KKQTLFLTQL AKTLPKGLHCL LRL+TEY+ M+ SR+QFP+QE LEDP L+H
Sbjct: 292  EEQLHVLKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYHNMNSSRQQFPNQENLEDPHLYH 351

Query: 1200 YALFSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQN 1379
            YA+FSDNILA +VVVNSTV+N K+++ HVFHIVTDRLNYAAMRMWFL NPP  ATIQVQN
Sbjct: 352  YAIFSDNILATAVVVNSTVYNTKDASKHVFHIVTDRLNYAAMRMWFLGNPPGKATIQVQN 411

Query: 1380 IEEFTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEI 1559
            IE+FTWLNASYSPV++QLGSQSMID+YF+ +RA SD+NLKFRNPKYLS+LNHLRFYLPEI
Sbjct: 412  IEDFTWLNASYSPVLKQLGSQSMIDYYFKAHRAASDSNLKFRNPKYLSILNHLRFYLPEI 471

Query: 1560 FPKLDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKN 1739
            FPKL+KV+FLDDDIVVQ+DLT LWSIDL+G VNGAVETCGESFHRFDRYLNFSNPLIAKN
Sbjct: 472  FPKLNKVLFLDDDIVVQKDLTDLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLIAKN 531

Query: 1740 FDPRACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFP 1919
            FDP ACGWAYGMNVFDLAEW+RQ IT VYH+WQ LN DR LWKLGTLPPGLITFWK+TFP
Sbjct: 532  FDPHACGWAYGMNVFDLAEWKRQNITGVYHNWQNLNHDRQLWKLGTLPPGLITFWKRTFP 591

Query: 1920 LNKSWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQ 2099
            LN+SWH+LGLGYNPNVNQ+DI+++AVVHYNGNMKPWLEI IPKFR+YW KYVDYD  YL+
Sbjct: 592  LNRSWHILGLGYNPNVNQRDIEQSAVVHYNGNMKPWLEISIPKFRSYWTKYVDYDHVYLR 651

Query: 2100 DCNINP 2117
            +CNINP
Sbjct: 652  ECNINP 657


>ref|XP_006467237.1| PREDICTED: probable galacturonosyltransferase 4-like [Citrus
            sinensis]
          Length = 646

 Score =  841 bits (2173), Expect = 0.0
 Identities = 421/663 (63%), Positives = 525/663 (79%), Gaps = 9/663 (1%)
 Frame = +3

Query: 156  MVKRKPVLVLLCVTVLAPIVLYTDMLA-STSNSIQNSDFTGDASNLSAGKEVQRLHALPL 332
            M  R  V+ +LC TVLAPI+++T     S  +S ++ +F  D +  + G + + L+ LP 
Sbjct: 1    MKTRNLVVGMLCATVLAPILIFTSTFKDSYPSSSESGEFLEDLTAFTVGGDARHLNLLPQ 60

Query: 333  ESSD--ALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKS-RVLSAT-NER 500
            ESS   +LK+PI ++       S++I+  S+   + S+    GS E KS RVLSAT N  
Sbjct: 61   ESSTTLSLKQPILVI-------SDKIAQHSAHSQSQSQ----GSWEHKSARVLSATTNGL 109

Query: 501  TQSERIGAIKQVT--SREHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEIQGSXXXXXXXX 674
             QS+    I+QVT  ++   N      ++   +  + +  +  L +   Q S        
Sbjct: 110  DQSKTDNPIRQVTDLTKTQINKHADQEQIKASDNHISAHHSQILDTKHQQESSLTYGVLE 169

Query: 675  R--PPKVASRRHQILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRELRL 848
            +  P K+ + +      Q      PD +VR LKDQLI+AKVYLSL + R N +F+RELRL
Sbjct: 170  KKEPTKINNEK------QTEQTTPPDFRVRQLKDQLIKAKVYLSLPAMRNNANFVRELRL 223

Query: 849  RVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVEDQ 1028
            R+++VQRALGDATKDS+LPR  N++++AMEQ+L+KGKQIQDDC+AVVKKLRAMLH+ E+Q
Sbjct: 224  RIKEVQRALGDATKDSDLPRIANDRLKAMEQSLAKGKQIQDDCAAVVKKLRAMLHSTEEQ 283

Query: 1029 LQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHYAL 1208
            L+VHKKQTLFLTQL AKTLPKGLHCL LRL+TEYY ++ S+R FP+QEKLEDP+LFHYAL
Sbjct: 284  LRVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQRHFPNQEKLEDPRLFHYAL 343

Query: 1209 FSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNIEE 1388
            FSDN+LAA+VVVNSTV +AK+ ++HVFHIVTDRLNYAAMRMWFLANPP  AT+QVQNIEE
Sbjct: 344  FSDNVLAAAVVVNSTVTHAKHPSNHVFHIVTDRLNYAAMRMWFLANPPGRATVQVQNIEE 403

Query: 1389 FTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIFPK 1568
            FTWLN+SYSPV++QL SQSMID+YFR +RANSD+NLKFRNPKYLS+LNHLRFYLPE+FP+
Sbjct: 404  FTWLNSSYSPVLKQLNSQSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEVFPR 463

Query: 1569 LDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNFDP 1748
            L+KV+FLDDD+VVQ+DL+ LWSIDL+GKVNGAVETCGE+FHRFDRYLNFSNPLI+KNFDP
Sbjct: 464  LNKVLFLDDDVVVQKDLSGLWSIDLKGKVNGAVETCGETFHRFDRYLNFSNPLISKNFDP 523

Query: 1749 RACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPLNK 1928
            RACGWAYGMN+FDL EWRRQ IT+VYH+WQK+N DR LWKLGTLPPGLITFWK+T+PL++
Sbjct: 524  RACGWAYGMNIFDLDEWRRQNITDVYHTWQKMNHDRQLWKLGTLPPGLITFWKRTYPLDR 583

Query: 1929 SWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQDCN 2108
             WHVLGLGYNP+VNQ+DI+RAAV+HYNGNMKPWLEI IPK+RNYW K+VDYDQ YL++CN
Sbjct: 584  FWHVLGLGYNPSVNQRDIERAAVIHYNGNMKPWLEINIPKYRNYWTKHVDYDQLYLRECN 643

Query: 2109 INP 2117
            INP
Sbjct: 644  INP 646


>ref|XP_006600275.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1
            [Glycine max] gi|571532515|ref|XP_006600276.1| PREDICTED:
            probable galacturonosyltransferase 4-like isoform X2
            [Glycine max]
          Length = 661

 Score =  841 bits (2172), Expect = 0.0
 Identities = 418/671 (62%), Positives = 515/671 (76%), Gaps = 17/671 (2%)
 Frame = +3

Query: 156  MVKRKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLSAGKEVQRLHALPLE 335
            +V R  VL+LL +T +APIVL+TD L +        +F    +   +  +   L+ LP E
Sbjct: 2    VVTRNIVLLLLSITFVAPIVLFTDRLGTFKYPFAEQEFIEAVTAFVSAADSGHLNLLPQE 61

Query: 336  SSDALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGE-LKSRVLSATNERTQSE 512
            SS   KEPIG+VY +++SN+E +            L     GE + +RVLSATN+  Q++
Sbjct: 62   SSTVFKEPIGLVYTEDTSNTENL---------LHGLHFAKPGEHVSARVLSATNDEGQTK 112

Query: 513  RIGAIKQVTSREHSNDGLKNPKLVEGNK---EVGSQQAVDLTS------------TEIQG 647
                IK VT  +  N G +N  +V+ +     V  + A+D+              +E   
Sbjct: 113  GENPIKLVT--DGINQGNQNSYMVKADTTGDSVNGEDAIDVDDNDGKLAKSSDLVSETTD 170

Query: 648  SXXXXXXXXRPPKVASRRHQILDTQRSN-MVMPDVKVRNLKDQLIRAKVYLSLGSSRTNP 824
            +           +V  +   + +  + N    PD +V+ LKDQLI+A+VYLSL + R+NP
Sbjct: 171  TKQEQEHIKSSSQVTQKEPILSEADKHNDQTPPDARVQQLKDQLIQARVYLSLQAVRSNP 230

Query: 825  HFIRELRLRVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRA 1004
            H  RELRLRV++V R LGDA+KDS+LPRN NE+M+AMEQTL KG+QIQ+DC+A VKKLRA
Sbjct: 231  HLTRELRLRVKEVSRTLGDASKDSDLPRNANERMKAMEQTLMKGRQIQNDCAAAVKKLRA 290

Query: 1005 MLHTVEDQLQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLED 1184
            MLH+ E+QL VHKKQTLFLTQL AKTLPKGLHCL LRL+TEYY ++ S++QF +Q+KLED
Sbjct: 291  MLHSTEEQLHVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNTSQQQFRNQQKLED 350

Query: 1185 PKLFHYALFSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNAT 1364
            P+L+HYA+FSDNILA +VVVNSTV +AK+++ HVFHIVTDRLNYAAMRMWFL NPP+ AT
Sbjct: 351  PRLYHYAIFSDNILATAVVVNSTVAHAKDTSKHVFHIVTDRLNYAAMRMWFLVNPPQKAT 410

Query: 1365 IQVQNIEEFTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRF 1544
            IQVQNIE+FTWLN+SYSPV++QLGS SMIDFYF+T+RA+SD+NLKFRNPKYLS+LNHLRF
Sbjct: 411  IQVQNIEDFTWLNSSYSPVLKQLGSPSMIDFYFKTHRASSDSNLKFRNPKYLSILNHLRF 470

Query: 1545 YLPEIFPKLDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNP 1724
            YLPEIFPKL+KV+FLDDDIVVQ+DLT LWSIDL+G VNGAVETCGE FHRFDRYLNFSNP
Sbjct: 471  YLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLNFSNP 530

Query: 1725 LIAKNFDPRACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFW 1904
            LIAKNFDPRACGWAYGMNVFDL +W+RQ IT+VYH WQK+N DR LWKLGTLPPGLITFW
Sbjct: 531  LIAKNFDPRACGWAYGMNVFDLVQWKRQNITDVYHKWQKMNHDRQLWKLGTLPPGLITFW 590

Query: 1905 KQTFPLNKSWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYD 2084
            K+TF L++SWHVLGLGYNPN+NQK+I+RAAV+HYNGNMKPWLEI IPKFR YW KYVDY+
Sbjct: 591  KRTFQLHRSWHVLGLGYNPNINQKEIERAAVIHYNGNMKPWLEISIPKFRGYWTKYVDYN 650

Query: 2085 QDYLQDCNINP 2117
              YL++CNINP
Sbjct: 651  LVYLRECNINP 661


>ref|XP_003535002.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 1
            [Glycine max]
          Length = 657

 Score =  840 bits (2170), Expect = 0.0
 Identities = 418/666 (62%), Positives = 508/666 (76%), Gaps = 15/666 (2%)
 Frame = +3

Query: 165  RKPVLVLLCVTVLAPIVLYTDMLASTSNSIQNSDFTGDASNLS-AGKEVQRLHALPLESS 341
            R  VL+LLC+TV+APIVLYTD L +  +     +F  D +  + +  +   L+ LP E+S
Sbjct: 5    RNIVLLLLCITVVAPIVLYTDRLGTFESPSNKQEFIEDVTAFAFSAADFSHLNLLPQETS 64

Query: 342  DALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKSRVLSATNERTQSERIG 521
             A+KEP+ +VY +  S +++   +    + + E        + +R+LS T E   +++  
Sbjct: 65   TAVKEPVRVVYTEEDSTNKRNLPQGLQLVKSRE-------HVFARMLSTTTEEDLAKKEN 117

Query: 522  AIKQVTSREHSNDGLKNPKLVEGNKE-VGSQQAVDLTSTEIQGSXXXXXXXXRPP----- 683
             IK VT      DG+K     + + E V  + A+D+   + + +         P      
Sbjct: 118  PIKLVT------DGIKQGNQGDASGENVNGEDAIDVDDNDGKLAKSISASTQEPEIKEQQ 171

Query: 684  -------KVASRRHQILDTQRSNMVMP-DVKVRNLKDQLIRAKVYLSLGSSRTNPHFIRE 839
                   K+  +  ++ +T + N   P D +V+ +KDQLI+AKVYLSL   ++NPH  RE
Sbjct: 172  LATETSSKINQKGSELSETNKQNDRTPSDARVKQIKDQLIQAKVYLSLPVVKSNPHLTRE 231

Query: 840  LRLRVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTV 1019
            LRLRV++V R LG+A KDS+LPRN NE+MRAMEQTL KGKQ QDDC+AVVKKLRAMLH+ 
Sbjct: 232  LRLRVKEVSRTLGEAIKDSDLPRNANERMRAMEQTLMKGKQAQDDCAAVVKKLRAMLHSS 291

Query: 1020 EDQLQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFH 1199
            E+QL V KKQTLFLTQL AKTLPKGLHCL LRL+TEY+ M+ S +QFPHQE LEDP L+H
Sbjct: 292  EEQLHVLKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYHNMNSSHQQFPHQENLEDPHLYH 351

Query: 1200 YALFSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQN 1379
            YA+FSDNILA +VVVNSTV N K+++ HVFHIVTDRLNYAAMRMWFL NPP  ATIQVQN
Sbjct: 352  YAIFSDNILATAVVVNSTVSNTKDASKHVFHIVTDRLNYAAMRMWFLVNPPGKATIQVQN 411

Query: 1380 IEEFTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEI 1559
            IE+FTWLNASYSPV++QLGSQSMID+YF+ +R  SD+NLKFRNPKYLS+LNHLRFYLPEI
Sbjct: 412  IEDFTWLNASYSPVLKQLGSQSMIDYYFKAHRVTSDSNLKFRNPKYLSILNHLRFYLPEI 471

Query: 1560 FPKLDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKN 1739
            FPKL+KV+FLDDDIVVQ+DLT LWSIDL+G VNGAVETCGESFHRFDRYLNFSNPLIAKN
Sbjct: 472  FPKLNKVLFLDDDIVVQKDLTDLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLIAKN 531

Query: 1740 FDPRACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFP 1919
            FDP ACGWAYGMNVFDLAEW+RQ ITEVYH+WQ LN DR LWKLGTLPPGLITFWK+TFP
Sbjct: 532  FDPHACGWAYGMNVFDLAEWKRQNITEVYHNWQNLNHDRQLWKLGTLPPGLITFWKRTFP 591

Query: 1920 LNKSWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQ 2099
            LN+SWH+LGLGYNPNVNQ+DI+++AVVHYNGNMKPWLEI IPKFR YW  YVDYD  YL+
Sbjct: 592  LNRSWHILGLGYNPNVNQRDIEQSAVVHYNGNMKPWLEISIPKFRRYWTNYVDYDHVYLR 651

Query: 2100 DCNINP 2117
            +CNINP
Sbjct: 652  ECNINP 657


>ref|XP_006449976.1| hypothetical protein CICLE_v10014426mg [Citrus clementina]
            gi|557552587|gb|ESR63216.1| hypothetical protein
            CICLE_v10014426mg [Citrus clementina]
          Length = 646

 Score =  838 bits (2165), Expect = 0.0
 Identities = 419/665 (63%), Positives = 523/665 (78%), Gaps = 11/665 (1%)
 Frame = +3

Query: 156  MVKRKPVLVLLCVTVLAPIVLYTDMLA-STSNSIQNSDFTGDASNLSAGKEVQRLHALPL 332
            M  R  V+ +LC TV API+++T     S  +S ++ +F  D +  + G + + L+ LP 
Sbjct: 1    MKTRNLVVGMLCATVFAPILIFTSTFKDSYPSSSESGEFLEDLTAFTVGGDARHLNLLPQ 60

Query: 333  ESSD--ALKEPIGIVYYDNSSNSEQISARSSDGITASELPLGGSGELKS-RVLSAT-NER 500
            ESS   +LK+PI ++       S++I+  S+   + S+    GS E KS RVLSAT N  
Sbjct: 61   ESSTTLSLKQPILVI-------SDKIAQHSAHSQSQSQ----GSWEHKSARVLSATTNGL 109

Query: 501  TQSERIGAIKQVTS------REHSNDGLKNPKLVEGNKEVGSQQAVDLTSTEIQGSXXXX 662
             QS+    I+QVT        +H++   +  K  + +      Q +D    +        
Sbjct: 110  DQSKTDNPIRQVTDLTKTPINKHADQ--EQIKASDNHISAHHSQILDTKHQQESSQTYGV 167

Query: 663  XXXXRPPKVASRRHQILDTQRSNMVMPDVKVRNLKDQLIRAKVYLSLGSSRTNPHFIREL 842
                 P K+ + +      Q      PD +VR LKDQLI+AKVYLSL ++R N +F+REL
Sbjct: 168  LEKKEPTKINNEK------QTEQTAPPDFRVRQLKDQLIKAKVYLSLPATRNNANFVREL 221

Query: 843  RLRVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDDCSAVVKKLRAMLHTVE 1022
            RLR+++VQRALGDA+KDS+LPR  N++++AMEQ+L+KGKQIQDDC+AVVKKLRAMLH+ E
Sbjct: 222  RLRIKEVQRALGDASKDSDLPRIANDRLKAMEQSLAKGKQIQDDCAAVVKKLRAMLHSTE 281

Query: 1023 DQLQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRRQFPHQEKLEDPKLFHY 1202
            +QL+VHKKQTLFLTQL AKTLPKGLHCL LRL+TEYY ++ S+R FP+QEKLEDP+LFHY
Sbjct: 282  EQLRVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNSSQRYFPNQEKLEDPRLFHY 341

Query: 1203 ALFSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMWFLANPPRNATIQVQNI 1382
            ALFSDN+LAA+VVVNSTV +AK+ ++HVFHIVTDRLNYAAMRMWFLANPP  AT+QVQNI
Sbjct: 342  ALFSDNVLAAAVVVNSTVTHAKHPSNHVFHIVTDRLNYAAMRMWFLANPPGRATVQVQNI 401

Query: 1383 EEFTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPKYLSMLNHLRFYLPEIF 1562
            EEFTWLN+SYSPV++QL SQSMID+YFR +RANSD+NLKFRNPKYLS+LNHLRFYLPE+F
Sbjct: 402  EEFTWLNSSYSPVLKQLNSQSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEVF 461

Query: 1563 PKLDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHRFDRYLNFSNPLIAKNF 1742
            P+L+KV+FLDDD+VVQ+DL+ LWSIDL+GKVNGAVETCGE+FHRFDRYLNFSNPLI+KNF
Sbjct: 462  PRLNKVLFLDDDVVVQKDLSGLWSIDLKGKVNGAVETCGETFHRFDRYLNFSNPLISKNF 521

Query: 1743 DPRACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLGTLPPGLITFWKQTFPL 1922
            DPRACGWAYGMN+FDL EWRRQ IT+VYH+WQK+N DR LWKLGTLPPGLITFWK+T+PL
Sbjct: 522  DPRACGWAYGMNIFDLDEWRRQNITDVYHTWQKMNHDRQLWKLGTLPPGLITFWKRTYPL 581

Query: 1923 NKSWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFRNYWAKYVDYDQDYLQD 2102
            ++ WHVLGLGYNP+VNQ+DI+RAAV+HYNGNMKPWLEI IPK+RNYW K+VDYDQ YL++
Sbjct: 582  DRFWHVLGLGYNPSVNQRDIERAAVIHYNGNMKPWLEINIPKYRNYWTKHVDYDQLYLRE 641

Query: 2103 CNINP 2117
            CNINP
Sbjct: 642  CNINP 646


>ref|XP_006350232.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1
            [Solanum tuberosum] gi|565367133|ref|XP_006350233.1|
            PREDICTED: probable galacturonosyltransferase 4-like
            isoform X2 [Solanum tuberosum]
            gi|565367135|ref|XP_006350234.1| PREDICTED: probable
            galacturonosyltransferase 4-like isoform X3 [Solanum
            tuberosum]
          Length = 680

 Score =  838 bits (2164), Expect = 0.0
 Identities = 427/680 (62%), Positives = 527/680 (77%), Gaps = 26/680 (3%)
 Frame = +3

Query: 153  KMVKRKPVLVLLCVTVLAPIVLYTDMLAS--TSNSIQNSDFTGDASNLSAGKEVQRLHAL 326
            KM  RKPVL LL VTV APIVLYTD L +  TS S   ++F  D S  + G +V+ L+ L
Sbjct: 2    KMKLRKPVLFLLLVTVFAPIVLYTDTLGTYFTSPSSSRTEFIEDLSTFTFGGDVRPLNVL 61

Query: 327  PLESSDALKEPIGIVYYDNSSNSEQISARSSDGITASEL----PLGGSGELKSRVLS-AT 491
            P ESS +LKEP G VY +NSS+S    + +SD +++ +      L  +  +K +  + ++
Sbjct: 62   PQESSTSLKEPRGDVYSENSSHS---LSNASDTLSSEDARKTRQLTEAESMKHQTATGSS 118

Query: 492  NERTQSERIGA-IKQVTSREHS--NDGLKNPKLVEGNKE---------------VGSQQA 617
            N+  +    G+ I QVT+  H        +PKLV   K                  S Q 
Sbjct: 119  NDGVEVAMNGSHISQVTANLHEPQQTDKTSPKLVSAGKNESIAMETISKKKTSPTDSNQT 178

Query: 618  VDLTSTEIQGSXXXXXXXXRPPKVASRRHQILDTQRS-NMVMPDVKVRNLKDQLIRAKVY 794
            +D T TE +          +     + R +  D +R+  +V PD +VR LKDQLIRAKVY
Sbjct: 179  LDSTKTETRHDQRTVQTSGKFVSGETARGK--DEERNVQIVPPDARVRQLKDQLIRAKVY 236

Query: 795  LSLGSSRTNPHFIRELRLRVRDVQRALGDATKDSELPRNINEKMRAMEQTLSKGKQIQDD 974
            LSL ++R+NPHFIRELRLR+++V RALG+ATKDS+L R+ NEK++AMEQTL+KGKQIQDD
Sbjct: 237  LSLSATRSNPHFIRELRLRIKEVLRALGEATKDSDLSRSANEKLKAMEQTLAKGKQIQDD 296

Query: 975  CSAVVKKLRAMLHTVEDQLQVHKKQTLFLTQLAAKTLPKGLHCLSLRLSTEYYLMDPSRR 1154
            C+ +VKKLRAMLH+ E+QL+VHKKQTL+LT L AKTLPKGLHCL LRLSTEY+ ++ S++
Sbjct: 297  CATIVKKLRAMLHSAEEQLRVHKKQTLYLTHLTAKTLPKGLHCLPLRLSTEYFKLNSSQQ 356

Query: 1155 QFPHQEKLEDPKLFHYALFSDNILAASVVVNSTVFNAKNSADHVFHIVTDRLNYAAMRMW 1334
             FPHQE LE+PKL+HYALFSDNILAA+VVVNSTV +AK+ + HVFHIVTDRLN+AAMRMW
Sbjct: 357  HFPHQENLENPKLYHYALFSDNILAAAVVVNSTVSHAKDPSKHVFHIVTDRLNFAAMRMW 416

Query: 1335 FLANPPRNATIQVQNIEEFTWLNASYSPVMEQLGSQSMIDFYFRTNRANSDANLKFRNPK 1514
            FLANPP+ AT+ VQN+EEFTWLN+SYSPV++QL SQSMID+YFR+ RA+SD N+KFRNPK
Sbjct: 417  FLANPPKYATVDVQNVEEFTWLNSSYSPVLKQLNSQSMIDYYFRS-RADSDPNVKFRNPK 475

Query: 1515 YLSMLNHLRFYLPEIFPKLDKVVFLDDDIVVQRDLTALWSIDLEGKVNGAVETCGESFHR 1694
            YLS++NHLRFYLPEIFPKLDKV+FLDDDIVVQ+DL  LWS+DL+GKV G VETCGESFHR
Sbjct: 476  YLSIMNHLRFYLPEIFPKLDKVLFLDDDIVVQKDLGGLWSLDLKGKVIGVVETCGESFHR 535

Query: 1695 FDRYLNFSNPLIAKNFDPRACGWAYGMNVFDLAEWRRQKITEVYHSWQKLNRDRLLWKLG 1874
            FDRYLNFSNPLI+KNFDPRACGWA+GMN+ DL +WRRQ ITEVYHSWQ  N +R LWKLG
Sbjct: 536  FDRYLNFSNPLISKNFDPRACGWAFGMNIIDLNQWRRQNITEVYHSWQNRNHERQLWKLG 595

Query: 1875 TLPPGLITFWKQTFPLNKSWHVLGLGYNPNVNQKDIDRAAVVHYNGNMKPWLEIGIPKFR 2054
            TLPPGLITFWK+T+ L++SWHVLGLGYNPNV+QKDI RAAV+HYNGN+KPWLEI IPKFR
Sbjct: 596  TLPPGLITFWKRTYALDRSWHVLGLGYNPNVSQKDIQRAAVIHYNGNLKPWLEISIPKFR 655

Query: 2055 NYWAKYVDYDQDYLQDCNIN 2114
            +YW+K+VDYDQ +L++CNIN
Sbjct: 656  DYWSKFVDYDQAFLRECNIN 675

