BLASTX nr result

ID: Achyranthes22_contig00020284 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00020284
         (3165 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI29877.3| unnamed protein product [Vitis vinifera]             1252   0.0  
ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267...  1250   0.0  
ref|XP_006493901.1| PREDICTED: uncharacterized protein LOC102626...  1211   0.0  
gb|EOY09341.1| Exostosin family protein isoform 1 [Theobroma cac...  1207   0.0  
ref|XP_004307567.1| PREDICTED: uncharacterized protein LOC101304...  1205   0.0  
ref|XP_006421449.1| hypothetical protein CICLE_v10004353mg [Citr...  1203   0.0  
ref|XP_002308967.2| exostosin family protein [Populus trichocarp...  1201   0.0  
gb|EMJ03138.1| hypothetical protein PRUPE_ppa001595mg [Prunus pe...  1188   0.0  
ref|XP_004516917.1| PREDICTED: uncharacterized protein LOC101503...  1187   0.0  
ref|XP_003519065.1| PREDICTED: uncharacterized protein LOC100783...  1185   0.0  
ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807...  1169   0.0  
ref|XP_006349551.1| PREDICTED: uncharacterized protein LOC102592...  1162   0.0  
gb|ESW17624.1| hypothetical protein PHAVU_007G255200g [Phaseolus...  1161   0.0  
ref|XP_002526728.1| catalytic, putative [Ricinus communis] gi|22...  1159   0.0  
ref|XP_004234838.1| PREDICTED: uncharacterized protein LOC101249...  1157   0.0  
ref|XP_006402860.1| hypothetical protein EUTSA_v10005794mg [Eutr...  1155   0.0  
emb|CAN80640.1| hypothetical protein VITISV_016911 [Vitis vinifera]  1155   0.0  
ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]...  1150   0.0  
ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata...  1150   0.0  
ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]...  1141   0.0  

>emb|CBI29877.3| unnamed protein product [Vitis vinifera]
          Length = 822

 Score = 1252 bits (3240), Expect = 0.0
 Identities = 575/794 (72%), Positives = 654/794 (82%), Gaps = 5/794 (0%)
 Frame = -1

Query: 2715 EMMLILQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNEST 2536
            EM   LQK  CSW L+A + SVV L+  V    L P   +L+YF + QGQ TC P N S 
Sbjct: 28   EMTFFLQKWKCSWSLLATVASVVALIS-VAHLFLFPLAPSLEYFSMGQGQKTCTPINASI 86

Query: 2535 KGTEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITE 2356
            +G +   HD +   P   L  RFP D H +VVYRGAPWKAEIGRW SGCD++  EV+I E
Sbjct: 87   RGVD---HDGKNLQPSFDLDHRFPADSHKSVVYRGAPWKAEIGRWFSGCDSIAAEVSIIE 143

Query: 2355 KISGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSI 2176
            KI GK CKNDCSGQG+CNHELGQCRCFHG+SGE CS+ L+L CNYP +P++PYG WVVSI
Sbjct: 144  KIGGKDCKNDCSGQGICNHELGQCRCFHGFSGEGCSERLHLDCNYPSSPEQPYGPWVVSI 203

Query: 2175 CPGQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLD-VFTTNS 2008
            CP  CDTTRAMCFCG GTKYP+RPVAE CGF+ N  +TP      DW K DLD +FTTN 
Sbjct: 204  CPASCDTTRAMCFCGEGTKYPHRPVAEACGFQMNLPTTPGDPKLVDWTKADLDNIFTTND 263

Query: 2007 SVPGWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQ 1828
            S PGWCNVDP E Y  K ++KEEC CKYD  +G FCEI V+ TC+NQCSGHG+CRGGFCQ
Sbjct: 264  SKPGWCNVDPTEAYALKMQYKEECDCKYDCLLGRFCEIPVLCTCVNQCSGHGHCRGGFCQ 323

Query: 1827 CHEGWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVY 1648
            CH GWYG DCSIPSV SS+R+WP+WLRPAH+++P   H + S+ ++ A+V KKRPLIYVY
Sbjct: 324  CHRGWYGTDCSIPSVLSSVREWPRWLRPAHVEVPDDMHLSGSLVNLDAVVKKKRPLIYVY 383

Query: 1647 DLPPEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEE 1468
            DLPPEF S LLEGRHFKF+CVNR+YDD+NAT+WTEQLYGAQMAIYES LASP+RTL+GEE
Sbjct: 384  DLPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYGAQMAIYESILASPHRTLDGEE 443

Query: 1467 ADFFFVPILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGK 1288
            ADFFFVP+LDSCII R DDAPH+NM  H G+RSS TLEFYK AYDHIVE YP+W RS+G+
Sbjct: 444  ADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTLEFYKTAYDHIVEQYPFWNRSSGR 503

Query: 1287 DHIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPC 1108
            DHIWFF+WDEGACYAPKEIW+SMMLVHWGNTNSKH +STTAYWADNW+ +S   RGNHPC
Sbjct: 504  DHIWFFSWDEGACYAPKEIWDSMMLVHWGNTNSKHNHSTTAYWADNWDSVSSDRRGNHPC 563

Query: 1107 FDPAKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIR 928
            FDP KDLVLP+WKRPD  S  +KLW+RPR QR TLFYFNGNLGPAY+ GRPE TYSMGIR
Sbjct: 564  FDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKTLFYFNGNLGPAYEGGRPETTYSMGIR 623

Query: 927  QKLAEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDS 748
            QK+AEEFGS+P+K+GKLG+QH +DVIVT LRS  YHE LASSVFCGV PGDGWSGR EDS
Sbjct: 624  QKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGNYHESLASSVFCGVMPGDGWSGRFEDS 683

Query: 747  ILQGCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQR 568
            ILQGCIPV+IQDGIFLP+EN LNYESFAVRI EDEIPNLIKILRG +ETEIEFKL NV++
Sbjct: 684  ILQGCIPVVIQDGIFLPFENMLNYESFAVRIREDEIPNLIKILRGMNETEIEFKLENVRK 743

Query: 567  IWQRFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKR 388
            IWQRFLYRDSIL EAERQK+ FG+  DWA QLLQ SEDDVFATLIQVLHYKLHNDPWR++
Sbjct: 744  IWQRFLYRDSILLEAERQKTAFGNVEDWAVQLLQLSEDDVFATLIQVLHYKLHNDPWRQQ 803

Query: 387  AAHLKA-YGVPQEC 349
             AHLK  +G+ QEC
Sbjct: 804  LAHLKKDFGLAQEC 817


>ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267584 [Vitis vinifera]
          Length = 794

 Score = 1250 bits (3235), Expect = 0.0
 Identities = 574/793 (72%), Positives = 653/793 (82%), Gaps = 5/793 (0%)
 Frame = -1

Query: 2712 MMLILQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTK 2533
            M   LQK  CSW L+A + SVV L+  V    L P   +L+YF + QGQ TC P N S +
Sbjct: 1    MTFFLQKWKCSWSLLATVASVVALIS-VAHLFLFPLAPSLEYFSMGQGQKTCTPINASIR 59

Query: 2532 GTEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEK 2353
            G +   HD +   P   L  RFP D H +VVYRGAPWKAEIGRW SGCD++  EV+I EK
Sbjct: 60   GVD---HDGKNLQPSFDLDHRFPADSHKSVVYRGAPWKAEIGRWFSGCDSIAAEVSIIEK 116

Query: 2352 ISGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSIC 2173
            I GK CKNDCSGQG+CNHELGQCRCFHG+SGE CS+ L+L CNYP +P++PYG WVVSIC
Sbjct: 117  IGGKDCKNDCSGQGICNHELGQCRCFHGFSGEGCSERLHLDCNYPSSPEQPYGPWVVSIC 176

Query: 2172 PGQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLD-VFTTNSS 2005
            P  CDTTRAMCFCG GTKYP+RPVAE CGF+ N  +TP      DW K DLD +FTTN S
Sbjct: 177  PASCDTTRAMCFCGEGTKYPHRPVAEACGFQMNLPTTPGDPKLVDWTKADLDNIFTTNDS 236

Query: 2004 VPGWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQC 1825
             PGWCNVDP E Y  K ++KEEC CKYD  +G FCEI V+ TC+NQCSGHG+CRGGFCQC
Sbjct: 237  KPGWCNVDPTEAYALKMQYKEECDCKYDCLLGRFCEIPVLCTCVNQCSGHGHCRGGFCQC 296

Query: 1824 HEGWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYD 1645
            H GWYG DCSIPSV SS+R+WP+WLRPAH+++P   H + S+ ++ A+V KKRPLIYVYD
Sbjct: 297  HRGWYGTDCSIPSVLSSVREWPRWLRPAHVEVPDDMHLSGSLVNLDAVVKKKRPLIYVYD 356

Query: 1644 LPPEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEA 1465
            LPPEF S LLEGRHFKF+CVNR+YDD+NAT+WTEQLYGAQMAIYES LASP+RTL+GEEA
Sbjct: 357  LPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYGAQMAIYESILASPHRTLDGEEA 416

Query: 1464 DFFFVPILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKD 1285
            DFFFVP+LDSCII R DDAPH+NM  H G+RSS TLEFYK AYDHIVE YP+W RS+G+D
Sbjct: 417  DFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTLEFYKTAYDHIVEQYPFWNRSSGRD 476

Query: 1284 HIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCF 1105
            HIWFF+WDEGACYAPKEIW+SMMLVHWGNTNSKH +STTAYWADNW+ +S   RGNHPCF
Sbjct: 477  HIWFFSWDEGACYAPKEIWDSMMLVHWGNTNSKHNHSTTAYWADNWDSVSSDRRGNHPCF 536

Query: 1104 DPAKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQ 925
            DP KDLVLP+WKRPD  S  +KLW+RPR QR TLFYFNGNLGPAY+ GRPE TYSMGIRQ
Sbjct: 537  DPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKTLFYFNGNLGPAYEGGRPETTYSMGIRQ 596

Query: 924  KLAEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSI 745
            K+AEEFGS+P+K+GKLG+QH +DVIVT LRS  YHE LASSVFCGV PGDGWSGR EDSI
Sbjct: 597  KVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGNYHESLASSVFCGVMPGDGWSGRFEDSI 656

Query: 744  LQGCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRI 565
            LQGCIPV+IQDGIFLP+EN LNYESFAVRI EDEIPNLIKILRG +ETEIEFKL NV++I
Sbjct: 657  LQGCIPVVIQDGIFLPFENMLNYESFAVRIREDEIPNLIKILRGMNETEIEFKLENVRKI 716

Query: 564  WQRFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRA 385
            WQRFLYRDSIL EAERQK+ FG+  DWA QLLQ SEDDVFATLIQVLHYKLHNDPWR++ 
Sbjct: 717  WQRFLYRDSILLEAERQKTAFGNVEDWAVQLLQLSEDDVFATLIQVLHYKLHNDPWRQQL 776

Query: 384  AHLKA-YGVPQEC 349
            AHLK  +G+ QEC
Sbjct: 777  AHLKKDFGLAQEC 789


>ref|XP_006493901.1| PREDICTED: uncharacterized protein LOC102626477 isoform X1 [Citrus
            sinensis]
          Length = 791

 Score = 1211 bits (3132), Expect = 0.0
 Identities = 553/793 (69%), Positives = 646/793 (81%), Gaps = 6/793 (0%)
 Frame = -1

Query: 2709 MLILQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQG-QTTCIPSNESTK 2533
            M+ ++K   SW L+A + SV L L+ VV   L P   + DYF  RQ  Q +C+P  ES +
Sbjct: 1    MISIEKWRFSWTLVATVASV-LTLVSVVHLFLFPLVPSFDYFTARQQIQNSCVPIKESAE 59

Query: 2532 GTEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEK 2353
            G  VT      SPP ++L  RFP DLH+AVVYR APWKAEIGRWLSGCD+V KEV++ E 
Sbjct: 60   G--VTNRVWENSPPQLNLDHRFPADLHNAVVYRNAPWKAEIGRWLSGCDSVAKEVDLVEM 117

Query: 2352 ISGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSIC 2173
            I GK CK+DCSGQGVCNHELGQCRCFHG+ G+ CS+ ++  CN+P TP+ PYGRWVVSIC
Sbjct: 118  IGGKSCKSDCSGQGVCNHELGQCRCFHGFRGKGCSERIHFQCNFPKTPELPYGRWVVSIC 177

Query: 2172 PGQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSS---TPPSTDWGKPDLD-VFTTNSS 2005
            P  CDTTRAMCFCG GTKYP RPVAE CGF+ N  S    P STDW K DLD +FTTN S
Sbjct: 178  PTHCDTTRAMCFCGEGTKYPNRPVAEACGFQVNLPSQPGAPKSTDWAKADLDNIFTTNGS 237

Query: 2004 VPGWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQC 1825
             PGWCNVDP E Y  K +FKEEC CKYDG +G FCE+ V STC+NQCSGHG+CRGGFCQC
Sbjct: 238  KPGWCNVDPEEAYALKVQFKEECDCKYDGLLGQFCEVPVSSTCVNQCSGHGHCRGGFCQC 297

Query: 1824 HEGWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYD 1645
              GWYG DCSIPSV SS+ +WPQWLRPAHID+P   +   ++ ++ A+V KKRPL+YVYD
Sbjct: 298  DNGWYGVDCSIPSVMSSMSEWPQWLRPAHIDIPINANITGNLVNLNAVVKKKRPLVYVYD 357

Query: 1644 LPPEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEA 1465
            LPPEF S LLEGRH+K +CVNR+Y++KN T WT+ LYG+QMA YES LASP+RTLNGEEA
Sbjct: 358  LPPEFNSLLLEGRHYKLECVNRIYNEKNETLWTDMLYGSQMAFYESILASPHRTLNGEEA 417

Query: 1464 DFFFVPILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKD 1285
            DFFFVP+LDSCIITR DDAPH++ Q+H G+RSS TLEFYKKAY+HI+E+YPYW R++G+D
Sbjct: 418  DFFFVPVLDSCIITRADDAPHLSAQEHRGLRSSLTLEFYKKAYEHIIEHYPYWNRTSGRD 477

Query: 1284 HIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCF 1105
            HIWFF+WDEGACYAPKEIWNSMMLVHWGNTNSKH +STTAYWADNW+ IS + RGNH CF
Sbjct: 478  HIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDRISSSRRGNHSCF 537

Query: 1104 DPAKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQ 925
            DP KDLVLP+WK PD    R+KLWA PR +R TLFYFNGNLG AY NGRPE++YSMG+RQ
Sbjct: 538  DPEKDLVLPAWKAPDAFVLRSKLWASPREKRKTLFYFNGNLGSAYPNGRPESSYSMGVRQ 597

Query: 924  KLAEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSI 745
            KLAEE+GS+P+K+GKLG+QH +DVIVT LRSE YHE+L+SSVFCGV PGDGWSGRMEDSI
Sbjct: 598  KLAEEYGSSPNKEGKLGKQHAEDVIVTSLRSENYHEDLSSSVFCGVLPGDGWSGRMEDSI 657

Query: 744  LQGCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRI 565
            LQGCIPV+IQDGIFLPYEN LNYESF VRI EDEIPNLI ILRG +ETEI+F+LANVQ++
Sbjct: 658  LQGCIPVVIQDGIFLPYENVLNYESFVVRISEDEIPNLINILRGLNETEIQFRLANVQKV 717

Query: 564  WQRFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRA 385
            WQRFLYRDSIL EA+RQ + FG  +DWA + L+  EDDVF TLIQ+LHYKLHNDPWR+  
Sbjct: 718  WQRFLYRDSILLEAKRQNAKFGRMNDWAVEFLKLREDDVFTTLIQILHYKLHNDPWRREL 777

Query: 384  AHLKA-YGVPQEC 349
             H K  +G+PQEC
Sbjct: 778  VHQKKDFGIPQEC 790


>gb|EOY09341.1| Exostosin family protein isoform 1 [Theobroma cacao]
            gi|508717445|gb|EOY09342.1| Exostosin family protein
            isoform 1 [Theobroma cacao]
          Length = 794

 Score = 1207 bits (3124), Expect = 0.0
 Identities = 557/793 (70%), Positives = 637/793 (80%), Gaps = 5/793 (0%)
 Frame = -1

Query: 2712 MMLILQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTK 2533
            MM  +QK  CSW L+A + SV++ +  VV   L P   + DYF   Q Q  C+P N S +
Sbjct: 1    MMFSVQKWKCSWSLVATVASVIVPVS-VVHLFLFPVVPSFDYFRAPQVQYKCVPINASVE 59

Query: 2532 GTEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEK 2353
              +V  H      PG+ L  RFP DLH+ VVY  APWKAEIG+WLS CD + +EVNI E 
Sbjct: 60   --KVADHVWENIQPGLDLDHRFPSDLHNGVVYHNAPWKAEIGQWLSSCDAIAREVNIVET 117

Query: 2352 ISGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSIC 2173
            I G+ CK DCSGQGVCNHE+GQCRCFHG+SGE+CS+ ++L+CNYP TP+ PYGRWVVSIC
Sbjct: 118  IGGRRCKADCSGQGVCNHEMGQCRCFHGFSGEECSERVHLSCNYPKTPELPYGRWVVSIC 177

Query: 2172 PGQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLD-VFTTNSS 2005
            P  CDTTRAMCFCG GTKYP RPVAE CGF+ N  S P     TDW K DLD +FTTN S
Sbjct: 178  PAHCDTTRAMCFCGEGTKYPNRPVAEACGFQMNLPSEPGGPKLTDWSKADLDNIFTTNGS 237

Query: 2004 VPGWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQC 1825
             PGWCNVDP   Y  K  FKEEC CKYDG  G FCE+ V S CINQCSGHG+CRGGFCQC
Sbjct: 238  KPGWCNVDPDAAYASKVLFKEECDCKYDGLWGRFCEVPVESVCINQCSGHGHCRGGFCQC 297

Query: 1824 HEGWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYD 1645
            + GWYG DCSIPSV S + +WP+WLRPA +D+P   H    +N + A V KKRPLIYVYD
Sbjct: 298  YNGWYGTDCSIPSVVSPMGEWPKWLRPAQVDIPSIEHTGSLVN-LDAAVKKKRPLIYVYD 356

Query: 1644 LPPEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEA 1465
            LPPEF S LLEGRHFKF+CVNR+YDD+NAT WT+QLYG+QMA+YES LASPYRTLNGEEA
Sbjct: 357  LPPEFNSLLLEGRHFKFECVNRIYDDRNATLWTDQLYGSQMALYESILASPYRTLNGEEA 416

Query: 1464 DFFFVPILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKD 1285
            DFFFVP+LDSCIITR DDAPH++M+ H+G+RSS TLEFY+KAYDHIVE Y YW RSAG+D
Sbjct: 417  DFFFVPVLDSCIITRADDAPHLSMENHTGLRSSLTLEFYRKAYDHIVEKYAYWNRSAGRD 476

Query: 1284 HIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCF 1105
            H+W F+WDEGACYAPKEIWNSMMLVHWGNTNSKH +STTAYWADNW+ I    RGNHPCF
Sbjct: 477  HVWSFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDKIPSDRRGNHPCF 536

Query: 1104 DPAKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQ 925
            DPAKDLVLP+WK PD  +  AKLW+RPR +R TLFYFNGNLGPA+ +GRPE TYSMGIRQ
Sbjct: 537  DPAKDLVLPAWKHPDVTALSAKLWSRPREKRKTLFYFNGNLGPAFTSGRPETTYSMGIRQ 596

Query: 924  KLAEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSI 745
            KLA+EFGS P+K+GKLG+QH +DVIVT LRS  YHE++A+S FCGV PGDGWSGRMEDS+
Sbjct: 597  KLADEFGSTPNKEGKLGKQHAEDVIVTSLRSNNYHEDIANSTFCGVLPGDGWSGRMEDSV 656

Query: 744  LQGCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRI 565
            LQGCIPV+IQDGIFLPYEN LNYESFAVRI EDEIPNLIKIL+G +E+EIEFKLANVQ+I
Sbjct: 657  LQGCIPVVIQDGIFLPYENVLNYESFAVRIREDEIPNLIKILQGINESEIEFKLANVQKI 716

Query: 564  WQRFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRA 385
             QRFLYR+SIL EAERQK+ FG   DWA Q LQ +EDDVF T +QVLHYKLHNDPWR++ 
Sbjct: 717  QQRFLYRNSILLEAERQKTLFGRLEDWAVQFLQQTEDDVFTTFLQVLHYKLHNDPWRRQL 776

Query: 384  AHL-KAYGVPQEC 349
            AHL K YGVP EC
Sbjct: 777  AHLKKEYGVPPEC 789


>ref|XP_004307567.1| PREDICTED: uncharacterized protein LOC101304329 [Fragaria vesca
            subsp. vesca]
          Length = 791

 Score = 1205 bits (3118), Expect = 0.0
 Identities = 560/796 (70%), Positives = 639/796 (80%), Gaps = 8/796 (1%)
 Frame = -1

Query: 2712 MMLILQKKGCSWLLIAAIGSVV----LLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSN 2545
            M  IL+ KG SW +IA I S+V    L L  +V     P   + +YF   Q Q +C+P N
Sbjct: 1    MFSILRWKG-SWSMIATIASIVGLISLALASIVHLFFFPLVPSFNYFS--QAQNSCVPIN 57

Query: 2544 ESTKGTEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVN 2365
             S +   +T H       G+ L  +FP DLH AVVYRGAPWKAEIGRWL+GC ++  EVN
Sbjct: 58   GSAEA--ITDHIK-----GIDLEYQFPSDLHKAVVYRGAPWKAEIGRWLAGCLSITNEVN 110

Query: 2364 ITEKISGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWV 2185
            I E I G  CKNDCSGQGVCN ELGQCRCFHGYSGE CS+TL L CNYP +PD+PYGRWV
Sbjct: 111  IVELIGGSGCKNDCSGQGVCNRELGQCRCFHGYSGEGCSETLQLECNYPGSPDQPYGRWV 170

Query: 2184 VSICPGQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLD-VFT 2017
            VSIC   CDT +AMCFCG GTKYP RPVAE CGF+    S P +   TDW K DLD + T
Sbjct: 171  VSICSAHCDTKKAMCFCGEGTKYPNRPVAEACGFQVKPPSKPGAPKLTDWEKADLDNLLT 230

Query: 2016 TNSSVPGWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGG 1837
            TNSS PGWCNVDP E Y  K +FK+EC CKYD  +G FCE+ V+ TCINQCSGHG+CRGG
Sbjct: 231  TNSSKPGWCNVDPAEAYALKVQFKQECDCKYDCLLGRFCEVPVLCTCINQCSGHGHCRGG 290

Query: 1836 FCQCHEGWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLI 1657
            FCQC+ GWYG DCSIPSV SS+R+WPQWLRPA +++P  +H    + ++ A+V KKRPLI
Sbjct: 291  FCQCNNGWYGIDCSIPSVASSVREWPQWLRPAQVNIPDNSHLTGKVVNLNAVVKKKRPLI 350

Query: 1656 YVYDLPPEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLN 1477
            YVYDLPP+F S LLEGRHFKF+CVNR+YDD N+T WT+ LYG+QMA+YES LASPYRTLN
Sbjct: 351  YVYDLPPDFNSLLLEGRHFKFECVNRIYDDLNSTVWTDMLYGSQMALYESILASPYRTLN 410

Query: 1476 GEEADFFFVPILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRS 1297
            GEEADFFFVP+LDSCIITR DDAPH++MQ+H G+RSS TLE+YKKAYDHIVE YP+W  S
Sbjct: 411  GEEADFFFVPVLDSCIITRADDAPHLSMQEHKGLRSSLTLEYYKKAYDHIVEQYPFWNHS 470

Query: 1296 AGKDHIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGN 1117
            +G+DHIWFF+WDEGACYAPKEIWNSMML+HWGNTNSKHK+STTAYW DNWNDIS   RGN
Sbjct: 471  SGRDHIWFFSWDEGACYAPKEIWNSMMLIHWGNTNSKHKHSTTAYWGDNWNDISSDRRGN 530

Query: 1116 HPCFDPAKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSM 937
            HPCFDP KDLVLP+WK PD NS  +KLWARP   R TLFYFNGNLGPAY NGRPE TYSM
Sbjct: 531  HPCFDPEKDLVLPAWKSPDVNSLSSKLWARPHEMRKTLFYFNGNLGPAYPNGRPENTYSM 590

Query: 936  GIRQKLAEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRM 757
            GIRQKLAEEFGS+P+K+GKLG+QH +DVIVT LRSE YHE++ASS+FCGVFPGDGWSGRM
Sbjct: 591  GIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSENYHEDIASSIFCGVFPGDGWSGRM 650

Query: 756  EDSILQGCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLAN 577
            EDSILQGCIPV+IQDGIFLPYEN LNYESFAVRI EDEI NLI ILR F+ETEI+F+LAN
Sbjct: 651  EDSILQGCIPVVIQDGIFLPYENVLNYESFAVRIREDEISNLINILRAFNETEIKFRLAN 710

Query: 576  VQRIWQRFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPW 397
            VQ+IWQRFLYRDSIL EAERQK++FG   DWA Q  Q  EDDVF T +QVLHYKLHNDPW
Sbjct: 711  VQQIWQRFLYRDSILLEAERQKTSFGRMGDWAVQFSQLIEDDVFQTFVQVLHYKLHNDPW 770

Query: 396  RKRAAHLKAYGVPQEC 349
            R+     K +G+PQEC
Sbjct: 771  RQHVRVKKEFGLPQEC 786


>ref|XP_006421449.1| hypothetical protein CICLE_v10004353mg [Citrus clementina]
            gi|557523322|gb|ESR34689.1| hypothetical protein
            CICLE_v10004353mg [Citrus clementina]
          Length = 791

 Score = 1203 bits (3112), Expect = 0.0
 Identities = 549/793 (69%), Positives = 645/793 (81%), Gaps = 6/793 (0%)
 Frame = -1

Query: 2709 MLILQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQG-QTTCIPSNESTK 2533
            M+ ++K   SW L+A + SV L L+ VV   L P   + DYF  RQ  Q +C+P  ES +
Sbjct: 1    MISIEKWRFSWTLVATVASV-LTLVSVVHLFLFPLVPSFDYFTARQQIQNSCVPIKESAE 59

Query: 2532 GTEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEK 2353
            G  VT      SPP ++L  RFP DLH+AVVYR APWKAEIGRWLSGCD+V KEV++ E 
Sbjct: 60   G--VTNRVWENSPPQLNLDHRFPADLHNAVVYRNAPWKAEIGRWLSGCDSVAKEVDLVEM 117

Query: 2352 ISGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSIC 2173
            I GK CK+DCSGQGVCNHELGQCRCFHG+ G+ CS+ ++  CN+P TP+ PYGRWVVSIC
Sbjct: 118  IGGKSCKSDCSGQGVCNHELGQCRCFHGFRGKGCSERIHFQCNFPKTPELPYGRWVVSIC 177

Query: 2172 PGQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLD-VFTTNSS 2005
            P  CDTTRAMCFCG GTKYP RPVAE CGF+ N  S P +   T+W K DLD +FTTN S
Sbjct: 178  PTHCDTTRAMCFCGEGTKYPNRPVAEACGFQVNLPSQPGAPKLTNWAKADLDNIFTTNGS 237

Query: 2004 VPGWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQC 1825
             PGWCN+DP+E Y  K +FKEEC CKYDG +G FCE+ V STC+NQCSGHG+CRGGFCQC
Sbjct: 238  KPGWCNIDPKEAYALKVQFKEECDCKYDGLLGQFCEVPVSSTCVNQCSGHGHCRGGFCQC 297

Query: 1824 HEGWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYD 1645
              GWYG DCSIPSV SS+ +WPQWLRPAHID+P   +   ++ ++ A+V KKRPL+YVYD
Sbjct: 298  DNGWYGVDCSIPSVMSSMSEWPQWLRPAHIDIPINANITGNLVNLNAVVKKKRPLLYVYD 357

Query: 1644 LPPEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEA 1465
            LPPEF S LLEGRH+K +CVNR+Y++KN T WT+ LYG+QMA YES LASP+RTLNGEEA
Sbjct: 358  LPPEFNSLLLEGRHYKLECVNRIYNEKNETLWTDMLYGSQMAFYESILASPHRTLNGEEA 417

Query: 1464 DFFFVPILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKD 1285
            DFFFVP+LDSCIITR DDAPH++ Q+H  +RSS TLEFYKKAY+HI+E+YPYW  ++G+D
Sbjct: 418  DFFFVPVLDSCIITRADDAPHLSAQEHRSLRSSLTLEFYKKAYEHIIEHYPYWNHTSGRD 477

Query: 1284 HIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCF 1105
            HIWFF+WDEGACYAPKEIWNSMMLVHWGNTNSKH +STTAYWADNW+ IS + RGNH CF
Sbjct: 478  HIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDRISSSRRGNHSCF 537

Query: 1104 DPAKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQ 925
            DP KDLVLP+WK PD    R+KLWA PR +R TLFYFNGNLG AY NGRPE++YSMGIRQ
Sbjct: 538  DPEKDLVLPAWKAPDAFVLRSKLWASPREKRKTLFYFNGNLGSAYPNGRPESSYSMGIRQ 597

Query: 924  KLAEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSI 745
            KLAEE+GS+P+K+GKLG+QH +DVIVT LRSE YHE+L+SSVFCGV PGDGWSGRMEDSI
Sbjct: 598  KLAEEYGSSPNKEGKLGKQHAEDVIVTSLRSENYHEDLSSSVFCGVLPGDGWSGRMEDSI 657

Query: 744  LQGCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRI 565
            LQGCIPV+IQDGIFLPYEN LNYESF VRI EDEIPNLI ILRG +ETEI+F+LANVQ++
Sbjct: 658  LQGCIPVVIQDGIFLPYENVLNYESFVVRISEDEIPNLINILRGLNETEIQFRLANVQKV 717

Query: 564  WQRFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRA 385
            WQRFLYRDSIL EA+RQ + FG  +DWA + L+  EDDVF TLIQ+LHYKLHNDPWR+  
Sbjct: 718  WQRFLYRDSILLEAKRQNATFGRMNDWAVEFLKLREDDVFTTLIQILHYKLHNDPWRREL 777

Query: 384  AHLKA-YGVPQEC 349
             H K  +G+PQEC
Sbjct: 778  VHQKKDFGIPQEC 790


>ref|XP_002308967.2| exostosin family protein [Populus trichocarpa]
            gi|550335517|gb|EEE92490.2| exostosin family protein
            [Populus trichocarpa]
          Length = 793

 Score = 1201 bits (3106), Expect = 0.0
 Identities = 555/791 (70%), Positives = 638/791 (80%), Gaps = 4/791 (0%)
 Frame = -1

Query: 2709 MLILQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTKG 2530
            M+ + K  CSW L+A I S+V L+  VV   L P   + D F + Q Q +C P+NES  G
Sbjct: 1    MITISKWKCSWSLMATIASIVALVS-VVHLFLFPVVPSFDPFSVWQVQDSCGPNNESVDG 59

Query: 2529 TEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEKI 2350
               T HD     P + L  +FP DLH AV YR APWKAEIGRWLSGCD V KEV++ E I
Sbjct: 60   R--TGHDPGNLQPVLDLEHKFPADLHRAVFYRNAPWKAEIGRWLSGCDAVTKEVSVVETI 117

Query: 2349 SGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSICP 2170
            SG+ CKNDCSGQGVCN+ELGQCRCFHG+SGE CS+ L+L CNYP +P+ PYGRWVVSIC 
Sbjct: 118  SGRSCKNDCSGQGVCNYELGQCRCFHGFSGEGCSERLHLECNYPKSPELPYGRWVVSICS 177

Query: 2169 GQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSS---TPPSTDWGKPDLDVFTTNSSVP 1999
              CD TRAMCFCG GTKYP RP AE CGF+ +  S    P   DW KPDLD++TTN S  
Sbjct: 178  AHCDPTRAMCFCGEGTKYPNRPAAETCGFQLSLPSEIGAPRQVDWAKPDLDIYTTNKSKL 237

Query: 1998 GWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCHE 1819
            GWCNVDP EGY  K KFKEEC CKYD   G FCE+ V  +CINQCSGHG+CRGGFCQC  
Sbjct: 238  GWCNVDPAEGYANKVKFKEECDCKYDCLSGRFCEVPVQCSCINQCSGHGHCRGGFCQCAN 297

Query: 1818 GWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDLP 1639
            GWYG DCSIPSV SS+R+WP+WLRPA +D+P   H    + D+ A+V KKRPLIY+YDLP
Sbjct: 298  GWYGTDCSIPSVTSSVREWPRWLRPAQLDVPDNAHLTGKLVDLNAVVKKKRPLIYIYDLP 357

Query: 1638 PEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEADF 1459
            P+F S LLEGRHFKF+CVNRLY+D NAT WT+QLYGAQMA+YES LASPYRTLNGEEADF
Sbjct: 358  PKFNSLLLEGRHFKFECVNRLYNDNNATIWTDQLYGAQMALYESILASPYRTLNGEEADF 417

Query: 1458 FFVPILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKDHI 1279
            FFVP+LDSCIITR DDAPH++M++H G+RSS TLEFY+KAYDHIVE+YP+W RS+G+DHI
Sbjct: 418  FFVPVLDSCIITRADDAPHLSMEQHLGLRSSLTLEFYRKAYDHIVEHYPFWNRSSGRDHI 477

Query: 1278 WFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCFDP 1099
            W F+WDEGACYAPKEIWNSMM+VHWGNTNSKH +STTAYWADNW+ IS   RG HPCFDP
Sbjct: 478  WSFSWDEGACYAPKEIWNSMMVVHWGNTNSKHNHSTTAYWADNWDKISSDRRGKHPCFDP 537

Query: 1098 AKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQKL 919
             KDLVLP+WKRPD N+   KLWARP  +R TLFYFNGNLGPAY NGRPEA YSMGIRQKL
Sbjct: 538  DKDLVLPAWKRPDVNALSTKLWARPLEKRKTLFYFNGNLGPAYLNGRPEALYSMGIRQKL 597

Query: 918  AEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSILQ 739
            AEEFGS P+KDG LG+QH ++VIV+ LRSE YHE+LASSVFCGV PGDGWSGRMEDSILQ
Sbjct: 598  AEEFGSTPNKDGNLGKQHAENVIVSPLRSESYHEDLASSVFCGVMPGDGWSGRMEDSILQ 657

Query: 738  GCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRIWQ 559
            GCIPV+IQDGI+LPYEN LNYESFAVRI EDEIPNLIKIL+GF+ETEIE KL +VQ+I Q
Sbjct: 658  GCIPVVIQDGIYLPYENVLNYESFAVRILEDEIPNLIKILQGFNETEIENKLTSVQKIGQ 717

Query: 558  RFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKR-AA 382
            RFLYRDS+L EAERQK+ FG+  DWA + L+ +EDDV AT +QVLHYKLHNDPWR++  +
Sbjct: 718  RFLYRDSMLLEAERQKTAFGYVEDWAVEFLRLTEDDVVATFVQVLHYKLHNDPWRRQLGS 777

Query: 381  HLKAYGVPQEC 349
              K +G+PQEC
Sbjct: 778  QKKDFGLPQEC 788


>gb|EMJ03138.1| hypothetical protein PRUPE_ppa001595mg [Prunus persica]
          Length = 795

 Score = 1188 bits (3074), Expect = 0.0
 Identities = 554/795 (69%), Positives = 633/795 (79%), Gaps = 8/795 (1%)
 Frame = -1

Query: 2709 MLILQKKGCSWLLIAAIGSVV----LLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNE 2542
            ML +QK  CSW  IA I S+V    ++L  +V     P   + +YF   Q Q +C+P N 
Sbjct: 1    MLSIQKWKCSWSQIATIASIVALASIILGSIVHLFWFPLVPSFNYFS--QAQNSCVPING 58

Query: 2541 STKGTEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNI 2362
            S +   +        PP + L  +FP DLH AVV+RGAPWKAEIGRWLSGCD +  EVNI
Sbjct: 59   SAEAV-IDNVKGNFKPP-IDLDRQFPSDLHKAVVFRGAPWKAEIGRWLSGCDPISDEVNI 116

Query: 2361 TEKISGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVV 2182
             E I G  CKNDCSGQGVCN ELGQCRC+HGYSGE CS+ L L CNYP +PD+PYGRWVV
Sbjct: 117  VEVIGGSGCKNDCSGQGVCNRELGQCRCYHGYSGEGCSERLQLECNYPGSPDQPYGRWVV 176

Query: 2181 SICPGQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLD-VFTT 2014
            SIC   CDTTRA CFCG GTKYP RPVAE CGF+    S P +   TDW K DLD VFT 
Sbjct: 177  SICSAHCDTTRAFCFCGEGTKYPNRPVAEACGFQVQLPSEPGAPKLTDWAKADLDNVFTK 236

Query: 2013 NSSVPGWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGF 1834
            N S PGWCNVDP E Y  K +FKEEC CKYD   G FCE+ V+ TCINQCSGHG+CRGGF
Sbjct: 237  NGSKPGWCNVDPAEVYAHKVQFKEECDCKYDCFWGRFCEVPVLCTCINQCSGHGHCRGGF 296

Query: 1833 CQCHEGWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIY 1654
            CQC  GWYG DCSIPSV SS+R+WPQWLRPA +D+P  +H    + ++ A+V KKRPLIY
Sbjct: 297  CQCDNGWYGIDCSIPSVTSSVREWPQWLRPAQVDVPDSSHLPGKVVNLNAVVKKKRPLIY 356

Query: 1653 VYDLPPEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNG 1474
            VYDLPP+F S LLEGRHF+ +CVNR+YD KN+T WT+QLYGAQ+A+YES LASPYRTLNG
Sbjct: 357  VYDLPPDFNSLLLEGRHFRLECVNRIYDGKNSTLWTDQLYGAQVALYESILASPYRTLNG 416

Query: 1473 EEADFFFVPILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSA 1294
            EEADFFFVP+LDSCIITR DDAPH++MQ H G+RSS TLE+Y+KAYDHIVE YP+W RS+
Sbjct: 417  EEADFFFVPVLDSCIITRADDAPHLSMQ-HKGLRSSLTLEYYRKAYDHIVEQYPFWNRSS 475

Query: 1293 GKDHIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNH 1114
            G+DHIWFF+WDEGACYAPKEIWNSMMLVHWGNTN KHK+STTAYWADNW+ I    RGNH
Sbjct: 476  GRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNLKHKHSTTAYWADNWDTIPSDKRGNH 535

Query: 1113 PCFDPAKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMG 934
            PCFDP KDLVLPSWK PD NS  +KLWAR  + R TLFYFNGNLGPAY NGRPEA+YSMG
Sbjct: 536  PCFDPDKDLVLPSWKSPDVNSLSSKLWARSHDTRKTLFYFNGNLGPAYPNGRPEASYSMG 595

Query: 933  IRQKLAEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRME 754
            IRQKLAEEFGS+P+K+GKLG+QH +DVIVT LRSE YH +LASS+FCGVFPGDGWSGRME
Sbjct: 596  IRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSENYHGDLASSIFCGVFPGDGWSGRME 655

Query: 753  DSILQGCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANV 574
            DSILQGCIPV+IQDGIFLPYEN LNY+S+AVRI EDEIP+LI ILR F+ETEI+F+L NV
Sbjct: 656  DSILQGCIPVVIQDGIFLPYENVLNYDSYAVRIREDEIPDLINILRAFNETEIKFRLENV 715

Query: 573  QRIWQRFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWR 394
            Q+IWQRFLYRDSI+ EAERQK++FGH  DWA Q  Q  EDDV AT +QVLHYKLHNDPWR
Sbjct: 716  QKIWQRFLYRDSIMLEAERQKTDFGHMEDWAAQFSQLIEDDVVATFVQVLHYKLHNDPWR 775

Query: 393  KRAAHLKAYGVPQEC 349
            +     K +G+PQEC
Sbjct: 776  QHVHVKKEFGLPQEC 790


>ref|XP_004516917.1| PREDICTED: uncharacterized protein LOC101503851 isoform X1 [Cicer
            arietinum] gi|502181977|ref|XP_004516918.1| PREDICTED:
            uncharacterized protein LOC101503851 isoform X2 [Cicer
            arietinum]
          Length = 796

 Score = 1187 bits (3072), Expect = 0.0
 Identities = 546/782 (69%), Positives = 627/782 (80%), Gaps = 3/782 (0%)
 Frame = -1

Query: 2685 CSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTKGTEVTQHDA 2506
            CSW L A+I SVV ++  VV   L P   + DYF +     +C+ +N S+    V+ H  
Sbjct: 16   CSWSLAASIASVVAMVS-VVHLFLFPLTPSFDYFKL--ASDSCVSNNVSSADL-VSNHGL 71

Query: 2505 RASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEKISGKVCKND 2326
                P + L  RFP DLH +V Y+GA WKAEIGRWLSGCD++ K+VNI+E I G  CKND
Sbjct: 72   EE--PAIDLKYRFPADLHSSVAYKGALWKAEIGRWLSGCDSITKDVNISEIIGGNDCKND 129

Query: 2325 CSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSICPGQCDTTRA 2146
            CSG GVCN ELGQCRCFHGY G+ C D   L CN+P +   P+GRWVVSICP  CD TRA
Sbjct: 130  CSGLGVCNRELGQCRCFHGYVGDGCVDIQELECNFPGSLHEPFGRWVVSICPANCDKTRA 189

Query: 2145 MCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLDVFTTNSSVPGWCNVDPR 1975
            MCFCG GTKYPYRP+AE CGF+ N  S P      +W K D DVFTTN S+PGWCNVDP 
Sbjct: 190  MCFCGEGTKYPYRPLAESCGFQYNQPSEPGGPKIVNWTKVDQDVFTTNGSIPGWCNVDPV 249

Query: 1974 EGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCHEGWYGADCS 1795
            + Y GK KFKEECHC YDG IG FCE+ V S CINQC+GHG CRGGFCQC  GWYGADCS
Sbjct: 250  DAYEGKVKFKEECHCPYDGFIGRFCEVPVQSICINQCNGHGQCRGGFCQCDNGWYGADCS 309

Query: 1794 IPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDLPPEFVSHLL 1615
            IPSV SSIR+WP WLRPA +D+P   H +E + ++ A+V KKRPLIY+YDLPPEF S LL
Sbjct: 310  IPSVISSIREWPSWLRPARVDVPDNIHVSEKLINLNAVVAKKRPLIYIYDLPPEFNSLLL 369

Query: 1614 EGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEADFFFVPILDS 1435
            EGRHFK +CVNR+YD  NAT WTEQLYGAQMAIYES LASP+RTLNGEEADFFFVPILDS
Sbjct: 370  EGRHFKLECVNRIYDGNNATIWTEQLYGAQMAIYESLLASPHRTLNGEEADFFFVPILDS 429

Query: 1434 CIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKDHIWFFAWDEG 1255
            CIITRGDDAPH+++Q+HSG+RSS TLE+ KKAY HIVE YPYW  S+G+DHIWFF+WDEG
Sbjct: 430  CIITRGDDAPHLSLQEHSGLRSSLTLEYSKKAYYHIVEQYPYWNHSSGRDHIWFFSWDEG 489

Query: 1254 ACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCFDPAKDLVLPS 1075
            ACYAPKEIWNSMMLVHWGNTN+KH +STTAYWADNW+ IS   RG HPCFDP KDLVLP+
Sbjct: 490  ACYAPKEIWNSMMLVHWGNTNTKHNHSTTAYWADNWDTISSDRRGIHPCFDPDKDLVLPA 549

Query: 1074 WKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQKLAEEFGSNP 895
            WK PD N    KLWARPR +R TLFYFNGNLGPAY +GRPE +YSMGIRQKL EEFGS+P
Sbjct: 550  WKVPDANMLTMKLWARPREKRKTLFYFNGNLGPAYPHGRPEYSYSMGIRQKLGEEFGSSP 609

Query: 894  DKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSILQGCIPVIIQ 715
            +KDGKLG+QH +DVIVT +RS+ YH ++A+SVFCGVFPGDGWSGRMEDS+LQGCIPV+IQ
Sbjct: 610  NKDGKLGKQHAEDVIVTPVRSDNYHADIANSVFCGVFPGDGWSGRMEDSVLQGCIPVVIQ 669

Query: 714  DGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRIWQRFLYRDSI 535
            DGIFLPYEN LNY+SFAVRI E+EIPN+IKILRGF++TEI  KLANVQ+IWQRFLYR+SI
Sbjct: 670  DGIFLPYENVLNYDSFAVRIPEEEIPNMIKILRGFNDTEINLKLANVQKIWQRFLYRNSI 729

Query: 534  LREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRAAHLKAYGVPQ 355
            L EAERQK+ FGH  DWA + L+ +EDDV ATLIQVLHYKLHNDPWRK   H K +G+P 
Sbjct: 730  LLEAERQKTAFGHVDDWAVEFLRLTEDDVTATLIQVLHYKLHNDPWRKLVGHNKKFGLPN 789

Query: 354  EC 349
            +C
Sbjct: 790  QC 791


>ref|XP_003519065.1| PREDICTED: uncharacterized protein LOC100783624 [Glycine max]
          Length = 795

 Score = 1185 bits (3065), Expect = 0.0
 Identities = 548/787 (69%), Positives = 626/787 (79%), Gaps = 3/787 (0%)
 Frame = -1

Query: 2700 LQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTKGTEV 2521
            + K  CSW L A I SVV L+  VV   L P   T +YF I   Q +C P+N S +    
Sbjct: 11   MNKWRCSWSLAATIASVVALVS-VVHLFLFPLTPTFNYFKI--AQDSCFPTNASAEFPSN 67

Query: 2520 TQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEKISGK 2341
               +     P V    +FP DLH A VY+GAPWKAEIG+WL+GCD+V+KEVNITE I G 
Sbjct: 68   RDQEW----PAVDFKRQFPADLHGAFVYQGAPWKAEIGQWLAGCDSVIKEVNITEIIGGN 123

Query: 2340 VCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSICPGQC 2161
             CK DCSGQGVCN ELGQCRCFHGYSG+ C++ L L CN+  +PD+P+GRWVVSICP  C
Sbjct: 124  NCKKDCSGQGVCNLELGQCRCFHGYSGDGCTEKLQLQCNFLGSPDQPFGRWVVSICPANC 183

Query: 2160 DTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLDVFTTNSSVPGWC 1990
            D TRAMCFCG GTKYP RP+AE CGF+ N  S P      +W K D DVFTTN S+PGWC
Sbjct: 184  DKTRAMCFCGEGTKYPNRPLAETCGFQFNPPSEPDGPRIVNWTKIDQDVFTTNRSIPGWC 243

Query: 1989 NVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCHEGWY 1810
            NVDP E Y GKAK KEEC CKYDG  G  CE+ V S CINQCSGHG+CRGGFCQC  GWY
Sbjct: 244  NVDPAEAYAGKAKIKEECDCKYDGLAGRLCEVPVESVCINQCSGHGHCRGGFCQCDNGWY 303

Query: 1809 GADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDLPPEF 1630
            G DCS+PSV SSI++WP WLRPA ID+   THA E + ++ A+V KKRPL+YVYDLPPEF
Sbjct: 304  GVDCSMPSVISSIKEWPSWLRPARIDIADDTHANEKMINLNAVVAKKRPLVYVYDLPPEF 363

Query: 1629 VSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEADFFFV 1450
             S LLEGRHFK +CVNR+YD  N T WT+QLYGAQ+A+YES LASP+RTLNGEEADFFFV
Sbjct: 364  NSLLLEGRHFKLECVNRIYDGNNITVWTDQLYGAQIALYESLLASPHRTLNGEEADFFFV 423

Query: 1449 PILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKDHIWFF 1270
            P+LDSCIITR DDAPH++MQ+H G+RSS TLE+YKKAY HIVE YPYW RS+G+DH+W F
Sbjct: 424  PVLDSCIITRADDAPHLSMQEHMGLRSSLTLEYYKKAYIHIVEQYPYWNRSSGRDHVWSF 483

Query: 1269 AWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCFDPAKD 1090
            +WDEGACYAPKEIWNSMMLVHWGNTN+KH +STTAYWADNW+ IS   RG HPCFDP KD
Sbjct: 484  SWDEGACYAPKEIWNSMMLVHWGNTNTKHNHSTTAYWADNWDKISSDKRGTHPCFDPDKD 543

Query: 1089 LVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQKLAEE 910
            LVLP+WK PD N   +KLWA    +R TLFYFNGNLGPAY +GRPE TYSMGIRQKLAEE
Sbjct: 544  LVLPAWKVPDANVLTSKLWAWSHEKRKTLFYFNGNLGPAYPHGRPEDTYSMGIRQKLAEE 603

Query: 909  FGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSILQGCI 730
            FGS+P+KDGKLG+QH KDVIVT  RSE YH +LASSVFCGVFPGDGWSGRMEDSILQGCI
Sbjct: 604  FGSSPNKDGKLGKQHAKDVIVTPERSENYHLDLASSVFCGVFPGDGWSGRMEDSILQGCI 663

Query: 729  PVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRIWQRFL 550
            PV+IQDGIFLPYEN LNY+SFAVRI E EIPNLIKILRGF++TEIEFKL NVQ+IWQRF+
Sbjct: 664  PVVIQDGIFLPYENVLNYDSFAVRIPEAEIPNLIKILRGFNDTEIEFKLENVQKIWQRFM 723

Query: 549  YRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRAAHLKA 370
            YRDS+L EAERQK+  GH  DWA + L+ +EDDVF TLIQ+LHYKLHNDPWRK+  H K 
Sbjct: 724  YRDSVLLEAERQKTAIGHVDDWAVEFLKLTEDDVFVTLIQILHYKLHNDPWRKQVRHNKH 783

Query: 369  YGVPQEC 349
            +G+P +C
Sbjct: 784  FGLPHQC 790


>ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807663 [Glycine max]
          Length = 795

 Score = 1169 bits (3024), Expect = 0.0
 Identities = 542/787 (68%), Positives = 621/787 (78%), Gaps = 3/787 (0%)
 Frame = -1

Query: 2700 LQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTKGTEV 2521
            + K  CSW L A I SVV L+  VV   L P   T +YF I   Q +C P+N S +    
Sbjct: 11   MNKWRCSWSLAATIASVVALVS-VVHLFLFPLTPTFNYFKI--AQDSCFPTNASAEFP-- 65

Query: 2520 TQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEKISGK 2341
            + HD     P V    +FP DLH A VY G PWKAEIG+WL+GCD+V+K+VNITE I G 
Sbjct: 66   SNHDQER--PAVDFKHQFPADLHGAFVYHGVPWKAEIGQWLAGCDSVIKDVNITEIIGGI 123

Query: 2340 VCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSICPGQC 2161
             CKNDCSGQG+CN +LGQCRCFHGYSG+ C+  L L CN+  +PD+P+GRWVVSICP  C
Sbjct: 124  NCKNDCSGQGICNRQLGQCRCFHGYSGDGCTKNLQLECNFLGSPDQPFGRWVVSICPANC 183

Query: 2160 DTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLDVFTTNSSVPGWC 1990
            D TRAMCFCG G KYP RP+AE CGF+ +  S P      +W K D DVFTTN S+PGWC
Sbjct: 184  DKTRAMCFCGEGAKYPNRPLAETCGFQFDPPSEPDGPRIVNWTKIDQDVFTTNRSIPGWC 243

Query: 1989 NVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCHEGWY 1810
            NVDP E Y GKAK KEEC CKYDG  G FCE+ V S CINQCSGHG+CRGGFCQ   GWY
Sbjct: 244  NVDPAEAYAGKAKVKEECDCKYDGLAGRFCEVPVESVCINQCSGHGHCRGGFCQVSAGWY 303

Query: 1809 GADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDLPPEF 1630
            G DCS+PSV SSI++WP WLRPA I +   THA E + ++ A+V KKRPL+YVYDLPPEF
Sbjct: 304  GVDCSMPSVISSIKEWPSWLRPARIHIADDTHANEKMINLNAVVAKKRPLVYVYDLPPEF 363

Query: 1629 VSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEADFFFV 1450
             S LLEGRH+K +CVNR+YDD N T WT+QLYGAQ+A+YES LASP+RTLNGEEADFFFV
Sbjct: 364  NSLLLEGRHYKLECVNRIYDDNNITVWTDQLYGAQIALYESLLASPHRTLNGEEADFFFV 423

Query: 1449 PILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKDHIWFF 1270
            P+LDSCIITR DDAPH++MQ+H G+RSS TLE+YK  Y HIVE YPYW  S+G+DHIW F
Sbjct: 424  PVLDSCIITRADDAPHLSMQEHMGLRSSLTLEYYKNTYTHIVEQYPYWSHSSGRDHIWSF 483

Query: 1269 AWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCFDPAKD 1090
            +WDEGACYAPKEIWNSMMLVHWGNTN+KH +STTAYWADNW+ IS   RG HPCFDP KD
Sbjct: 484  SWDEGACYAPKEIWNSMMLVHWGNTNTKHNHSTTAYWADNWDKISSDRRGIHPCFDPDKD 543

Query: 1089 LVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQKLAEE 910
            LVLP+WK PD     +KLWAR   +R TLFYFNGNLGPAY +GRPE TYSMGIRQKLAEE
Sbjct: 544  LVLPAWKVPDAYVLTSKLWARSHEKRKTLFYFNGNLGPAYPHGRPEDTYSMGIRQKLAEE 603

Query: 909  FGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSILQGCI 730
            FGS+P+KDGKLG+QH KDVIVT  RSE YH +LASSVFCGVFPGDGWSGRMEDSILQGCI
Sbjct: 604  FGSSPNKDGKLGKQHAKDVIVTPERSEDYHMDLASSVFCGVFPGDGWSGRMEDSILQGCI 663

Query: 729  PVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRIWQRFL 550
            PV+IQDGIFLPYEN LNY+SFAVRI E EIPNLIK LRGF++TEIEFKLANVQ+IWQRFL
Sbjct: 664  PVVIQDGIFLPYENVLNYDSFAVRIPEAEIPNLIKTLRGFNDTEIEFKLANVQKIWQRFL 723

Query: 549  YRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRAAHLKA 370
            YRDS+L EAERQK+  GH  DWA + L+ +EDD FATLIQ+LHYKLHND WRK+  H K 
Sbjct: 724  YRDSVLLEAERQKTAIGHVDDWAVEFLKLTEDDAFATLIQILHYKLHNDRWRKQVRHNKQ 783

Query: 369  YGVPQEC 349
            +G+P +C
Sbjct: 784  FGLPHQC 790


>ref|XP_006349551.1| PREDICTED: uncharacterized protein LOC102592127 [Solanum tuberosum]
          Length = 790

 Score = 1162 bits (3005), Expect = 0.0
 Identities = 538/794 (67%), Positives = 630/794 (79%), Gaps = 6/794 (0%)
 Frame = -1

Query: 2712 MMLILQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTK 2533
            MM   QK+ CSW  +  I S+V L+  VV   L P   +LDYF  RQ + +CIP N STK
Sbjct: 1    MMWFKQKRMCSWSSVTIIASIVTLVS-VVHLFLYPVVPSLDYF--RQYKNSCIPIN-STK 56

Query: 2532 GTEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEK 2353
             T+ T ++       +S   +FP+DLH+ VVYRGAPWK ++G+WL+GCD++   + + E 
Sbjct: 57   STQPTHNNII-----ISNQTKFPLDLHNGVVYRGAPWKNQVGQWLAGCDSITSPLKVIEH 111

Query: 2352 ISGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSIC 2173
            I GK C+NDCSGQG+CN ELGQCRCFHG++GE+C++   L+CNYP + ++P+G WVVSIC
Sbjct: 112  IGGKSCRNDCSGQGICNRELGQCRCFHGFTGEECAERQELSCNYPRSKEKPFGHWVVSIC 171

Query: 2172 PGQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTP---PSTDWGKPDLDVFTTNSSV 2002
            P  CDTTRAMCFCG GTKYP RPV E CGF  N  S P   P TD+ K DLDVFTTN S 
Sbjct: 172  PAYCDTTRAMCFCGEGTKYPNRPVPETCGFTINPPSKPGGAPVTDFTKADLDVFTTNGSK 231

Query: 2001 PGWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCH 1822
             GWCNVDP E Y  K  FKEEC CKYDG  G FCE+ V+STCINQCSGHG CRGGFCQC 
Sbjct: 232  RGWCNVDPEEAYASKVLFKEECDCKYDGLWGRFCEVSVLSTCINQCSGHGLCRGGFCQCD 291

Query: 1821 EGWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDL 1642
             GW+G DCS+PSV SSIR+WP WLRPA + +P   ++  ++ ++ A+V KKRPLIYVYDL
Sbjct: 292  SGWFGTDCSVPSVLSSIREWPLWLRPAQVTVPENVNSNGNLINLDAIVEKKRPLIYVYDL 351

Query: 1641 PPEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEAD 1462
            PP+F S LLEGRHFK +C+NR+YD +NAT WT+QLYGAQMA+YES LASP+RTLNGEEAD
Sbjct: 352  PPDFNSLLLEGRHFKLECINRIYDQRNATVWTDQLYGAQMALYESMLASPHRTLNGEEAD 411

Query: 1461 FFFVPILDSCIITRGDDAPHMNMQKH--SGIRSSFTLEFYKKAYDHIVENYPYWKRSAGK 1288
            FFFVP+LDSCIITR DDAPH++MQ+H   G+RSS TLEFYKKAYDHI+  YPYW RSAGK
Sbjct: 412  FFFVPVLDSCIITRADDAPHLSMQEHIHGGLRSSLTLEFYKKAYDHIITQYPYWSRSAGK 471

Query: 1287 DHIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPC 1108
            DHIWFF+WDEGACYAPKEIWNS+MLVHWGNTNSKH +STTAYW DNW+ IS   RGNH C
Sbjct: 472  DHIWFFSWDEGACYAPKEIWNSIMLVHWGNTNSKHNHSTTAYWGDNWDPISSDRRGNHTC 531

Query: 1107 FDPAKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIR 928
            FDP KDLVLP+WKRPD  S  AK W+R R +R T FYFNGNLGPAY+NGRPEATYSMGIR
Sbjct: 532  FDPDKDLVLPAWKRPDEGSLNAKHWSRVREERKTFFYFNGNLGPAYENGRPEATYSMGIR 591

Query: 927  QKLAEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDS 748
            QK+AEEFGS  +K+GKLG+QH +DVIVT LR+  YHEELASSVFCGV PGDGWSGRMEDS
Sbjct: 592  QKVAEEFGSTLNKEGKLGKQHAEDVIVTPLRAGNYHEELASSVFCGVMPGDGWSGRMEDS 651

Query: 747  ILQGCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQR 568
            ILQGCIPV+IQDGI+LPYENFLNYESFAVRI EDEIPNL+ ILR F+ETEIEFKL NV++
Sbjct: 652  ILQGCIPVVIQDGIYLPYENFLNYESFAVRIREDEIPNLLNILRSFNETEIEFKLENVKK 711

Query: 567  IWQRFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKR 388
            IWQRFLYRDS++ EAERQK+  G   DW  +  Q  EDDVFAT IQVLHYKLHND WR++
Sbjct: 712  IWQRFLYRDSVVLEAERQKAVRGSVEDWGLKFSQLKEDDVFATFIQVLHYKLHNDTWRQQ 771

Query: 387  -AAHLKAYGVPQEC 349
                 K +G+P+EC
Sbjct: 772  LILQKKEFGLPKEC 785


>gb|ESW17624.1| hypothetical protein PHAVU_007G255200g [Phaseolus vulgaris]
          Length = 795

 Score = 1161 bits (3003), Expect = 0.0
 Identities = 539/790 (68%), Positives = 621/790 (78%), Gaps = 3/790 (0%)
 Frame = -1

Query: 2709 MLILQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTKG 2530
            +L   K  CSW L   I SVV L+  VV   + P   T +YF I   + +CI +N S + 
Sbjct: 8    LLSKNKWRCSWSLAVTIASVVALVS-VVHLFMFPLTPTFNYFKI--AKDSCIQANASAEF 64

Query: 2529 TEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEKI 2350
                  +     P V    +FP DLH +VVY+GAPWKAEIG WL+ CD+V+KEVNITE I
Sbjct: 65   PSNRDQEQ----PAVDFKLQFPADLHGSVVYQGAPWKAEIGHWLAACDSVIKEVNITEII 120

Query: 2349 SGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSICP 2170
                CKNDCSGQGVCN ELGQCRCFHGYSG+ C++   L CNY  +PD  +GRWVVSICP
Sbjct: 121  GVNNCKNDCSGQGVCNRELGQCRCFHGYSGDGCTEQRQLECNYEGSPDLQFGRWVVSICP 180

Query: 2169 GQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLDVFTTNSSVP 1999
              CD TRAMCFCG GTKYP RP+AE CGF+    S P      +W K D DVFTTN S+ 
Sbjct: 181  ANCDKTRAMCFCGEGTKYPNRPLAETCGFQYIPPSEPDGPKIVNWTKIDQDVFTTNGSIR 240

Query: 1998 GWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCHE 1819
            GWCNVDP + Y GKAK KEEC CKYDG  G  CE+ V S CINQCS HG+CRGGFCQC +
Sbjct: 241  GWCNVDPADAYAGKAKIKEECDCKYDGLSGRLCEVPVESVCINQCSRHGHCRGGFCQCDK 300

Query: 1818 GWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDLP 1639
            GWYG DCS+PS  SSI +WP WLRPA ID+   THA   + ++ A+V KKRPLIYVYDLP
Sbjct: 301  GWYGVDCSMPSAISSIIEWPSWLRPARIDIVDDTHANGKMINLNAVVAKKRPLIYVYDLP 360

Query: 1638 PEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEADF 1459
            PEF S LLEGRHFK +CVNR+YDDKN T WT+QLYGAQMA+YES LASP+RT+NGEEADF
Sbjct: 361  PEFNSLLLEGRHFKLECVNRIYDDKNVTIWTDQLYGAQMALYESLLASPHRTVNGEEADF 420

Query: 1458 FFVPILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKDHI 1279
            FFVP+LDSCIITR DDAPH+++Q+H G+RSS TLE+YKKAY HIV+ YPYW  S+G+DHI
Sbjct: 421  FFVPVLDSCIITRADDAPHLSLQEHMGLRSSLTLEYYKKAYTHIVDQYPYWNHSSGRDHI 480

Query: 1278 WFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCFDP 1099
            WFF+WDEGACYAPKEIWNSMMLVHWGNTNSKH +STTAYWADNW+ I    RG HPCFDP
Sbjct: 481  WFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDTIPSDKRGIHPCFDP 540

Query: 1098 AKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQKL 919
             KDLVLP+WK PD N   +KLWAR   +R TLFYFNGNLGPAY +GRPE +YSMGIRQKL
Sbjct: 541  DKDLVLPAWKVPDANVLTSKLWARTHEERKTLFYFNGNLGPAYPHGRPEDSYSMGIRQKL 600

Query: 918  AEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSILQ 739
            AEEFGS+P+KDGKLG+QH KDVIVTQ R+E YH +LASSVFCGVFPGDGWSGRMEDSILQ
Sbjct: 601  AEEFGSSPNKDGKLGKQHAKDVIVTQERTENYHLDLASSVFCGVFPGDGWSGRMEDSILQ 660

Query: 738  GCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRIWQ 559
            GCIPV+IQDGIFLPYEN LNY+SFAVR+ E+EIPNL+KILRGF+ETEI+FKLANVQ+IWQ
Sbjct: 661  GCIPVVIQDGIFLPYENILNYDSFAVRLSEEEIPNLLKILRGFNETEIKFKLANVQKIWQ 720

Query: 558  RFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRAAH 379
            RFLYRDS+L EAERQK+  G+  DWA + L+  EDDV ATLIQVLHYKLHN+PWRK+  H
Sbjct: 721  RFLYRDSVLLEAERQKTAIGYVDDWAIEFLKLIEDDVSATLIQVLHYKLHNEPWRKQLRH 780

Query: 378  LKAYGVPQEC 349
             K +G+P +C
Sbjct: 781  NKQFGLPHQC 790


>ref|XP_002526728.1| catalytic, putative [Ricinus communis] gi|223533917|gb|EEF35642.1|
            catalytic, putative [Ricinus communis]
          Length = 728

 Score = 1159 bits (2998), Expect = 0.0
 Identities = 526/720 (73%), Positives = 595/720 (82%), Gaps = 5/720 (0%)
 Frame = -1

Query: 2493 PGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEKISGKVCKNDCSGQ 2314
            P V+L  RFPVD H  VVYR APWKAE+G+WLSGCD++ KEV + E I G+ CKNDCSGQ
Sbjct: 19   PTVALDHRFPVDSHKGVVYRDAPWKAEVGQWLSGCDSITKEVKVVEIIGGRTCKNDCSGQ 78

Query: 2313 GVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSICPGQCDTTRAMCFC 2134
            GVCNHELG+CRCFHG+SGE+CS+ L L CNYP TP+ PYGRWVVSICP  CDTTRAMCFC
Sbjct: 79   GVCNHELGECRCFHGFSGEECSEKLQLECNYPKTPELPYGRWVVSICPAYCDTTRAMCFC 138

Query: 2133 GAGTKYPYRPVAEQCGFKSNTSST---PPSTDWGKPDLD-VFTTNSSVPGWCNVDPREGY 1966
            G GTKYP RPVAE CGF+ N  S    P  TDWGK DLD +FTTN S  GWCNVDP E Y
Sbjct: 139  GEGTKYPNRPVAEACGFQVNLPSESGGPKLTDWGKADLDNIFTTNKSKLGWCNVDPHEAY 198

Query: 1965 TGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCHEGWYGADCSIPS 1786
              K KFKEEC CKYDG  G FCE+ V S CINQCSGHGYCRGGFCQC  GWYG DCSIPS
Sbjct: 199  ASKVKFKEECDCKYDGLFGRFCEVPVQSICINQCSGHGYCRGGFCQCDNGWYGTDCSIPS 258

Query: 1785 VFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDLPPEFVSHLLEGR 1606
            V SS+ +WPQWLRPA +D+P  +H  + + ++ A+V KKRPLIY              GR
Sbjct: 259  VVSSVSEWPQWLRPALLDVPDNSHVIQKLVNLNAVVEKKRPLIY--------------GR 304

Query: 1605 HFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEADFFFVPILDSCII 1426
            HFKF+CVNR+YD +NAT WT+ LYGAQMA+YES LASPYRTLNGEEADFFFVPILDSCII
Sbjct: 305  HFKFECVNRIYDGRNATIWTDHLYGAQMALYESLLASPYRTLNGEEADFFFVPILDSCII 364

Query: 1425 TRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKDHIWFFAWDEGACY 1246
            TR DDAPH++MQ H G+RSS TLE+Y+KAYDHIVE+YPYW R++G+DHIWFF+WDEGACY
Sbjct: 365  TRADDAPHLSMQDHMGLRSSLTLEYYRKAYDHIVEHYPYWNRTSGRDHIWFFSWDEGACY 424

Query: 1245 APKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCFDPAKDLVLPSWKR 1066
            APKEIWNSMMLVHWGNTNSKH +STTAYWADNW+ IS   RG HPCFDP KDLVLP+WKR
Sbjct: 425  APKEIWNSMMLVHWGNTNSKHNHSTTAYWADNWDKISSDRRGRHPCFDPDKDLVLPAWKR 484

Query: 1065 PDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQKLAEEFGSNPDKD 886
            PD ++   KLWARP  +R TLF+FNGNLGPAY NGRPE +YSMGIRQKLAEEFGS+P+KD
Sbjct: 485  PDVSALSTKLWARPLERRKTLFFFNGNLGPAYPNGRPELSYSMGIRQKLAEEFGSSPNKD 544

Query: 885  GKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSILQGCIPVIIQDGI 706
            GKLG+QH +DVIVT LRSE YHE+LASS+FCGV PGDGWSGRMEDSILQGCIPVIIQDGI
Sbjct: 545  GKLGKQHAEDVIVTPLRSENYHEDLASSIFCGVLPGDGWSGRMEDSILQGCIPVIIQDGI 604

Query: 705  FLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRIWQRFLYRDSILRE 526
            FLPYEN LNYESFAVRI EDEI NL+KILRGF+ETE EFKLANV++IWQRFLYRD++L E
Sbjct: 605  FLPYENVLNYESFAVRIREDEISNLLKILRGFNETEKEFKLANVRKIWQRFLYRDTVLLE 664

Query: 525  AERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRAAHLKA-YGVPQEC 349
            A+RQK+ FGH  DW  + LQ   DDVF T IQ+LHYKLHNDPWR++ +HLK  +G+PQEC
Sbjct: 665  AKRQKTAFGHEEDWEVEFLQLVNDDVFTTFIQILHYKLHNDPWRRQLSHLKKDFGLPQEC 724


>ref|XP_004234838.1| PREDICTED: uncharacterized protein LOC101249053 [Solanum
            lycopersicum]
          Length = 785

 Score = 1157 bits (2992), Expect = 0.0
 Identities = 535/793 (67%), Positives = 627/793 (79%), Gaps = 5/793 (0%)
 Frame = -1

Query: 2712 MMLILQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTK 2533
            MML  QK+  SW  +  I  +++ L+ VV     P+  + DYF  RQ Q +CIP N STK
Sbjct: 1    MMLFNQKRMFSWSTVTII-VLIVTLVSVVHLFFYPFVPSFDYF--RQYQNSCIPIN-STK 56

Query: 2532 GTEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEK 2353
             T             +S   +F VDLH+ VVYRGAPWK E+G+WL+GCD+V   V + E+
Sbjct: 57   STHNNI---------ISNQTKFAVDLHNGVVYRGAPWKNEVGQWLAGCDSVTSAVKVIEQ 107

Query: 2352 ISGKVCKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSIC 2173
            I GK C+NDCSGQG+CN ELGQCRCFHG++GE+C++   L+CNYP + ++P+G WVVSIC
Sbjct: 108  IGGKSCRNDCSGQGICNRELGQCRCFHGFTGEECAERQELSCNYPRSKEKPFGHWVVSIC 167

Query: 2172 PGQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTP---PSTDWGKPDLDVFTTNSSV 2002
            P  CDTTRAMCFCG GTKYP RP+AE CGF  N  S P   P TD+ K DLDVFTTN S 
Sbjct: 168  PAYCDTTRAMCFCGDGTKYPNRPLAETCGFTINPPSKPGGAPVTDFTKADLDVFTTNGSK 227

Query: 2001 PGWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCH 1822
             GWCNVDP E Y  K  FKEEC CKYDG  G FCE+ V+STCINQCSGHG CRGGFCQC 
Sbjct: 228  RGWCNVDPEEAYASKVLFKEECDCKYDGLWGRFCEVSVLSTCINQCSGHGLCRGGFCQCD 287

Query: 1821 EGWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDL 1642
             GW+G DCS+PSV SSIR+WP WLRPA + +P   ++  ++ ++ A+V KKRPL+YVYDL
Sbjct: 288  SGWFGTDCSVPSVLSSIREWPLWLRPAQVTVPENVNSKGNLVNLDAIVEKKRPLLYVYDL 347

Query: 1641 PPEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEAD 1462
            PP+F S LLEGRHFK +C+NR+YD +NAT WT+QLYGAQMAIYES LASP+RTLNGEEAD
Sbjct: 348  PPDFNSLLLEGRHFKLECINRIYDQRNATVWTDQLYGAQMAIYESMLASPHRTLNGEEAD 407

Query: 1461 FFFVPILDSCIITRGDDAPHMNMQKH--SGIRSSFTLEFYKKAYDHIVENYPYWKRSAGK 1288
            FFFVP+LDSCIITR DDAPH++MQ+H   G+RSS TLEFYKKAYDHI+  YPYW RSAGK
Sbjct: 408  FFFVPVLDSCIITRADDAPHLSMQEHIHGGLRSSLTLEFYKKAYDHIITKYPYWSRSAGK 467

Query: 1287 DHIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPC 1108
            DHIWFF+WDEGACYAPKEIWNS+MLVHWGNTNSKH +STTAYW DNW+ IS   RGNH C
Sbjct: 468  DHIWFFSWDEGACYAPKEIWNSIMLVHWGNTNSKHNHSTTAYWGDNWDPISSDRRGNHTC 527

Query: 1107 FDPAKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIR 928
            FDP KDLVLP+WKRPD +S  AK W+RPR +R T FYFNGNLGPAY+NGRPE TYSMGIR
Sbjct: 528  FDPDKDLVLPAWKRPDESSLSAKHWSRPREERKTFFYFNGNLGPAYENGRPEDTYSMGIR 587

Query: 927  QKLAEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDS 748
            QK+AEEFGS  +K+GKLG+QH +DVIVT LR+  YH+ELASSVFCGV PGDGWSGRMEDS
Sbjct: 588  QKVAEEFGSTLNKEGKLGKQHAEDVIVTPLRAGNYHDELASSVFCGVMPGDGWSGRMEDS 647

Query: 747  ILQGCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQR 568
            ILQGCIPV+IQDGI+LPYENFLNYESFAVRI EDEIP L+ ILR F+ETEI+FKL NV++
Sbjct: 648  ILQGCIPVVIQDGIYLPYENFLNYESFAVRIREDEIPYLLNILRSFNETEIKFKLENVKK 707

Query: 567  IWQRFLYRDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKR 388
            IWQRFLYRDS++ EAERQK+  G   DW  + LQ  EDDVFAT IQVLHYKLHND WR++
Sbjct: 708  IWQRFLYRDSVVLEAERQKAIRGSVEDWGLKFLQLEEDDVFATFIQVLHYKLHNDTWRQK 767

Query: 387  AAHLKAYGVPQEC 349
                K +G+P+EC
Sbjct: 768  LLQKKEFGLPKEC 780


>ref|XP_006402860.1| hypothetical protein EUTSA_v10005794mg [Eutrema salsugineum]
            gi|557103959|gb|ESQ44313.1| hypothetical protein
            EUTSA_v10005794mg [Eutrema salsugineum]
          Length = 791

 Score = 1155 bits (2987), Expect = 0.0
 Identities = 524/788 (66%), Positives = 619/788 (78%), Gaps = 5/788 (0%)
 Frame = -1

Query: 2697 QKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTKGTEVT 2518
            QK  CSW  IA + SV++L+  V  F L P   + D   +RQ Q     SN+S     + 
Sbjct: 5    QKWKCSWSQIATVASVIVLVSLVHIF-LGPVVPSFDSVSVRQAQNLSGTSNDS-----IR 58

Query: 2517 QHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEKISGKV 2338
            Q    +S   V+   RFP DLH AVVYR A WKAEIG+WLS CD V K+V+I E I G+ 
Sbjct: 59   QVSEDSSKTVVAFDRRFPADLHGAVVYRNASWKAEIGQWLSSCDAVAKDVDIIEPIGGRK 118

Query: 2337 CKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSICPGQCD 2158
            C NDCS QGVCNHE G CRCFHGY+GE CS  L L CNY  TP+ PYGRWVVSIC   CD
Sbjct: 119  CLNDCSSQGVCNHEFGICRCFHGYTGEDCSQKLRLECNYEKTPEMPYGRWVVSICSRHCD 178

Query: 2157 TTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLDVFTTNSSVPGWCN 1987
            TTRAMCFCG GTKYP RPV E CGF+ N+   P     TDW KPDLD+ TTNSS  GWCN
Sbjct: 179  TTRAMCFCGEGTKYPNRPVPESCGFQINSPVNPDEPKMTDWSKPDLDILTTNSSKQGWCN 238

Query: 1986 VDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCHEGWYG 1807
            VDP + Y  K + KEEC CKYD   G FCE+ V  TC+NQCSGHG CRGGFCQC +GW+G
Sbjct: 239  VDPEDAYALKVQIKEECDCKYDCLWGRFCEVPVQCTCVNQCSGHGKCRGGFCQCDKGWFG 298

Query: 1806 ADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDLPPEFV 1627
             DCSIPS  S++ +WPQWLRPAH+++P   +   +++++ A+V KKRPLIY+YDLPP+F 
Sbjct: 299  TDCSIPSTLSTVGEWPQWLRPAHLEVPSDKNVPGNLSNISAVVKKKRPLIYIYDLPPDFN 358

Query: 1626 SHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEADFFFVP 1447
            S LLEGRHFK +CVNR+YDD+NAT WT+ LYG+QMA YE+ LA+ +RTLNGEEADFFFVP
Sbjct: 359  SLLLEGRHFKLECVNRIYDDRNATIWTDYLYGSQMAFYENILATAHRTLNGEEADFFFVP 418

Query: 1446 ILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKDHIWFFA 1267
            +LDSCIITR DDAPH++MQ H+G+RSS TLEFYK+AY+HIVE YPYW RS+G+DHIWFF+
Sbjct: 419  VLDSCIITRADDAPHLSMQNHTGLRSSLTLEFYKRAYEHIVEKYPYWNRSSGRDHIWFFS 478

Query: 1266 WDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCFDPAKDL 1087
            WDEGACYAPKEIWNSMMLVHWGNTNSKH +STTAYW DNW++IS   RG+HPCFDP KDL
Sbjct: 479  WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDEISNERRGDHPCFDPRKDL 538

Query: 1086 VLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQKLAEEF 907
            V+P+WK PD  S RA  WARPR +R TLFYFNGNLGPAY+ GRPE +YSMGIRQKLAEEF
Sbjct: 539  VIPAWKVPDPFSMRANYWARPREKRKTLFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEF 598

Query: 906  GSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSILQGCIP 727
            GS+P+K+GKLG+QH  DV+VT LRS+ YH ++A+S+FCGVFPGDGWSGRMEDSILQGC+P
Sbjct: 599  GSSPNKEGKLGKQHADDVVVTPLRSDNYHNDIATSIFCGVFPGDGWSGRMEDSILQGCVP 658

Query: 726  VIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRIWQRFLY 547
            VIIQDGI+LPYEN LNYESFAVR+ ED+IP+LI  LRGF+ETEI+F+LANV++IWQRFL+
Sbjct: 659  VIIQDGIYLPYENMLNYESFAVRVSEDDIPSLINTLRGFNETEIQFRLANVKKIWQRFLF 718

Query: 546  RDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRAA--HLK 373
            RDSIL EAERQK++FGH  DWA Q  +   DD+FAT IQ LH+KLHNDPWR+       K
Sbjct: 719  RDSILLEAERQKASFGHEEDWAVQFSKLKHDDIFATFIQTLHFKLHNDPWRREQVVNRTK 778

Query: 372  AYGVPQEC 349
             YG+PQEC
Sbjct: 779  EYGLPQEC 786


>emb|CAN80640.1| hypothetical protein VITISV_016911 [Vitis vinifera]
          Length = 1363

 Score = 1155 bits (2987), Expect = 0.0
 Identities = 545/810 (67%), Positives = 627/810 (77%), Gaps = 30/810 (3%)
 Frame = -1

Query: 2712 MMLILQKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTK 2533
            M   LQK  CSW L+A + SVV L+  V    L P   +L+YF + QGQ TC P N S +
Sbjct: 1    MTFFLQKWKCSWSLLATVASVVALIS-VAHLFLFPLAPSLEYFSMGQGQKTCTPINASIR 59

Query: 2532 GTEVTQHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEK 2353
            G +   HD +   P + L  RFP D H +VVYRGAPWKAEIGRW SGCD++  EV+I E 
Sbjct: 60   GVD---HDGKNLQPSLDLDHRFPADSHKSVVYRGAPWKAEIGRWFSGCDSIAAEVSIIEV 116

Query: 2352 ISGKVCKNDCSGQGVCNHE-----------------LGQCRCFHGYSGEQCSDTLYLACN 2224
                          + + +                 +G  +   G +GE CS+ L+L CN
Sbjct: 117  ARTAKMTAVVKAFAIMSWDNAGAFMDFLFFIALVLYVGSIKSADG-TGEGCSERLHLDCN 175

Query: 2223 YPVTPDRPYGRWVVSICPGQCDTTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS--- 2053
            YP +P++PYG WVVSICP  CDTTRAMCFCG GTKYP+RPVAE CGF+ N  +TP     
Sbjct: 176  YPSSPEQPYGPWVVSICPASCDTTRAMCFCGEGTKYPHRPVAEACGFQMNLPTTPGDPKL 235

Query: 2052 TDWGKPDLD-VFTTNSSVPGWCNVDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTC 1876
             DW K DLD +FTTN S PGWCNVDP E Y  K ++KEEC CKYD  +G FCEI V+ TC
Sbjct: 236  VDWTKADLDNIFTTNDSKPGWCNVDPTEAYALKMQYKEECDCKYDCLLGRFCEIPVLCTC 295

Query: 1875 INQCSGHGYCRGGFCQCHEGWYGADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESIN 1696
            +NQCSGHG+CRGGFCQCH GWYG DCSIPSV SS+R+WP+WLRPAH+++P   H + S+ 
Sbjct: 296  VNQCSGHGHCRGGFCQCHRGWYGTDCSIPSVLSSVREWPRWLRPAHVEVPDDMHLSGSLV 355

Query: 1695 DVGALVTKKRPLIYVYDLPPEFVSHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAI 1516
            ++ A+V KKRPLIYVYDLPPEF S LLEGRHFKF+CVNR+YDD+NAT+WTEQLYGAQMAI
Sbjct: 356  NLDAVVKKKRPLIYVYDLPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYGAQMAI 415

Query: 1515 YESFLASPYRTLNGEEADFFFVPILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAY 1336
            YES LASP+RTL+GEEADFFFVP+LDSCII R DDAPH+NM  H G+RSS TLEFYK AY
Sbjct: 416  YESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTLEFYKTAY 475

Query: 1335 DHIVENYPYWKRSAGKDHIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWA 1156
            DHIVE YP+W RS+G+DHIWFF+WDEGACYAPKEIW+SMMLVHWGNTNSKH +STTAYWA
Sbjct: 476  DHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWGNTNSKHNHSTTAYWA 535

Query: 1155 DNWNDISPAIRGNHPCFDPAKDLVLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGP 976
            DNW+ +S   RGNHPCFDP KDLVLP+WKRPD  S  +KLW+RPR QR TLFYFNGNLGP
Sbjct: 536  DNWDSVSSDRRGNHPCFDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKTLFYFNGNLGP 595

Query: 975  AYDNGRPEATYSMGIRQKLAEEFGSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVF 796
            AY+ GRPE TYSMGIRQK+AEEFGS+P+K+GKLG+QH +DVIVT LRS  YHE LASSVF
Sbjct: 596  AYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGNYHESLASSVF 655

Query: 795  CGVFPGDGWSGRMEDSILQGCIPVIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILR 616
            CGV PGDGWSGR EDSILQGCIPV+IQDGIFLP+EN LNYESFAVRI EDEIPNLIKILR
Sbjct: 656  CGVMPGDGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIREDEIPNLIKILR 715

Query: 615  ---------GFSETEIEFKLANVQRIWQRFLYRDSILREAERQKSNFGHTSDWATQLLQF 463
                     G +ETEIEFKL NV++IWQRFLYRDSIL EAERQK+ FG+  DWA QLLQ 
Sbjct: 716  LSGDPYVLQGMNETEIEFKLENVRKIWQRFLYRDSILLEAERQKTAFGNVEDWAVQLLQL 775

Query: 462  SEDDVFATLIQVLHYKLHNDPWRKRAAHLK 373
            SEDDVFATLIQVLHYKLHNDPWR++ AHLK
Sbjct: 776  SEDDVFATLIQVLHYKLHNDPWRQQLAHLK 805


>ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]
            gi|44917463|gb|AAS49056.1| At3g57630 [Arabidopsis
            thaliana] gi|46931284|gb|AAT06446.1| At3g57630
            [Arabidopsis thaliana] gi|332646159|gb|AEE79680.1|
            exostosin family protein [Arabidopsis thaliana]
          Length = 793

 Score = 1150 bits (2975), Expect = 0.0
 Identities = 523/788 (66%), Positives = 622/788 (78%), Gaps = 5/788 (0%)
 Frame = -1

Query: 2697 QKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTKGTEVT 2518
            QK   SW  IA + SV++L+  V  F L P   + D   +RQ Q  C PSNES   ++VT
Sbjct: 5    QKWKFSWSQIATVASVIVLVSLVHLF-LGPVVPSFDSITVRQAQNLCGPSNESI--SQVT 61

Query: 2517 QHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEKISGKV 2338
            ++ ++ S   V+   RFP D H AVVYR A WKAEIG+WLS CD V KEV+I E I G+ 
Sbjct: 62   KNSSQ-SLVVVAFDRRFPADSHGAVVYRNASWKAEIGQWLSSCDAVAKEVDIIEPIGGRK 120

Query: 2337 CKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSICPGQCD 2158
            C +DCSGQGVCNHE G CRCFHG++GE CS  L L CNY  TP+ PYG+WVVSIC   CD
Sbjct: 121  CMSDCSGQGVCNHEFGLCRCFHGFTGEDCSQKLRLDCNYEKTPEMPYGKWVVSICSRHCD 180

Query: 2157 TTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLDVFTTNSSVPGWCN 1987
            TTRAMCFCG GTKYP RPV E CGF+ N+ + P     TDW KPDLD+ TTNSS  GWCN
Sbjct: 181  TTRAMCFCGEGTKYPNRPVPESCGFQINSPTNPDEPKMTDWSKPDLDILTTNSSKQGWCN 240

Query: 1986 VDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCHEGWYG 1807
            VDP + Y  K K KEEC CKYD   G FCEI V  TC+NQCSGHG CRGGFCQC +GW+G
Sbjct: 241  VDPEDAYAMKVKIKEECDCKYDCLWGRFCEIPVQCTCVNQCSGHGKCRGGFCQCDKGWFG 300

Query: 1806 ADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDLPPEFV 1627
             DCSIPS  S++ +WPQWLRPAH+++P   +   ++ ++ A+V KKRPLIY+YDLPP+F 
Sbjct: 301  TDCSIPSTLSTVGEWPQWLRPAHLEVPSEKNVPGNLINLSAVVKKKRPLIYIYDLPPDFN 360

Query: 1626 SHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEADFFFVP 1447
            S L+EGRHFKF+CVNR+YD++NAT WT+ LYG+QMA YE+ LA+ +RT+NGEEADFFFVP
Sbjct: 361  SLLIEGRHFKFECVNRIYDERNATVWTDYLYGSQMAFYENILATAHRTMNGEEADFFFVP 420

Query: 1446 ILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKDHIWFFA 1267
            +LDSCII R DDAPH+NMQ H+G+RSS TLEFYK+AY+HIVE YPYW RSAG+DHIWFF+
Sbjct: 421  VLDSCIINRADDAPHINMQNHTGLRSSLTLEFYKRAYEHIVEKYPYWNRSAGRDHIWFFS 480

Query: 1266 WDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCFDPAKDL 1087
            WDEGACYAPKEIWNSMMLVHWGNTNSKH +STTAY+ DNW+DIS   RG+HPCFDP KDL
Sbjct: 481  WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYFGDNWDDISDERRGDHPCFDPRKDL 540

Query: 1086 VLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQKLAEEF 907
            V+P+WK PD  S R   W RPR +R TLFYFNGNLGPAY+ GRPE +YSMGIRQKLAEEF
Sbjct: 541  VIPAWKVPDPYSMRKNYWERPREKRKTLFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEF 600

Query: 906  GSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSILQGCIP 727
            GS+P+K+GKLG+QH +DVIVT LRS+ YH+++A+S+FCG FPGDGWSGRMEDSILQGC+P
Sbjct: 601  GSSPNKEGKLGKQHAEDVIVTPLRSDNYHKDIANSIFCGAFPGDGWSGRMEDSILQGCVP 660

Query: 726  VIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRIWQRFLY 547
            VIIQDGI+LPYEN LNYESFAVR+ ED+IPNLI  LRGFSE EI+F+L NV+ +WQRFL+
Sbjct: 661  VIIQDGIYLPYENMLNYESFAVRVNEDDIPNLINTLRGFSEAEIQFRLGNVKELWQRFLF 720

Query: 546  RDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRAA--HLK 373
            RDSIL EAERQK+ +GH  DWA Q  +   DD+FAT+IQ LH+KLHNDPWR+  A    K
Sbjct: 721  RDSILLEAERQKATYGHEEDWAVQFSKLKHDDIFATIIQTLHFKLHNDPWRREQAVNRTK 780

Query: 372  AYGVPQEC 349
             YG+PQEC
Sbjct: 781  DYGLPQEC 788


>ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297324003|gb|EFH54424.1| exostosin family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 792

 Score = 1150 bits (2974), Expect = 0.0
 Identities = 521/788 (66%), Positives = 624/788 (79%), Gaps = 5/788 (0%)
 Frame = -1

Query: 2697 QKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTKGTEVT 2518
            QK   SW  IA + SV++L+  V  F L P   + D   +RQ Q    P+NES   T+VT
Sbjct: 5    QKWKFSWSQIATVASVIVLVSLVHLF-LGPVVPSFDSIIVRQAQNLSGPTNESI--TQVT 61

Query: 2517 QHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEKISGKV 2338
            +  +++    V+   RFP D H AVVYR A WKAEIG+WLS CD V KEV++ E I G+ 
Sbjct: 62   KDLSQSLV--VAFDRRFPADSHGAVVYRNASWKAEIGQWLSSCDAVAKEVDVIEPIGGRK 119

Query: 2337 CKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSICPGQCD 2158
            C NDCSGQGVCN+E G CRCFHG++G+ CS  L+L CNY  TP+ PYG+WVVSIC   CD
Sbjct: 120  CMNDCSGQGVCNYEFGLCRCFHGFTGDDCSQKLHLDCNYEKTPEMPYGKWVVSICSRHCD 179

Query: 2157 TTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLDVFTTNSSVPGWCN 1987
            TTRAMCFCG GTKYP RPV E CGF+ N+ + P     TDW KPDLD+ TTNSS  GWCN
Sbjct: 180  TTRAMCFCGEGTKYPNRPVPESCGFQINSPANPDEPKMTDWSKPDLDILTTNSSKQGWCN 239

Query: 1986 VDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCHEGWYG 1807
            VDP + Y  K + KEEC CKYD   G FCEI V  TC+NQCSGHG CRGGFCQC +GW+G
Sbjct: 240  VDPEDAYALKVQIKEECDCKYDCLWGRFCEIPVQCTCVNQCSGHGKCRGGFCQCDKGWFG 299

Query: 1806 ADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDLPPEFV 1627
             DCS PS  S++ +WPQWLRPAH+++P   +   ++ ++ A+V KKRPLIY+YDLPP+F 
Sbjct: 300  TDCSTPSTLSTVGEWPQWLRPAHLEVPSEKNVPGNLTNLSAVVKKKRPLIYIYDLPPDFN 359

Query: 1626 SHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEADFFFVP 1447
            S L+EGRHFK +CVNR+YD++NAT WT+ LYG+QMA YE+ LA+ +RTLNGEEADFFFVP
Sbjct: 360  SLLIEGRHFKLECVNRIYDERNATVWTDYLYGSQMAFYENILATAHRTLNGEEADFFFVP 419

Query: 1446 ILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKDHIWFFA 1267
            +LDSCII R DDAPH+NMQ H+G+RSSFTLEFYK+AY+HIVE YPYW RSAG+DHIWFF+
Sbjct: 420  VLDSCIINRADDAPHINMQNHTGLRSSFTLEFYKRAYEHIVEKYPYWNRSAGRDHIWFFS 479

Query: 1266 WDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCFDPAKDL 1087
            WDEGACYAPKEIWNSMMLVHWGNTNSKH +STTAYW DNW+DIS   RG+HPCFDP KDL
Sbjct: 480  WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDDISDERRGDHPCFDPRKDL 539

Query: 1086 VLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQKLAEEF 907
            V+P+WK PD  S RA  WARPR +R TLFYFNGNLGPAY+ GRPE +YSMGIRQKLAEEF
Sbjct: 540  VIPAWKVPDPYSMRANYWARPREKRKTLFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEF 599

Query: 906  GSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSILQGCIP 727
            GS+P+K+GKLG+QH +DVIVT LRS+ YH+++A+S+FCG FPGDGWSGRMEDSILQGC+P
Sbjct: 600  GSSPNKEGKLGKQHAEDVIVTPLRSDNYHKDIANSIFCGAFPGDGWSGRMEDSILQGCVP 659

Query: 726  VIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRIWQRFLY 547
            VIIQDGI+LPYEN LNYESFAVR+ ED+IPNLI  LRGFSETEI+F+LANV+++WQRFL+
Sbjct: 660  VIIQDGIYLPYENMLNYESFAVRVSEDDIPNLINTLRGFSETEIQFRLANVKKLWQRFLF 719

Query: 546  RDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRAA--HLK 373
            RDSIL EAERQK+++GH  +WA Q  +   DD+FAT IQ LH+KLHNDPWR+       K
Sbjct: 720  RDSILLEAERQKASYGHEEEWAVQFSKLKHDDIFATFIQTLHFKLHNDPWRREQVVNRTK 779

Query: 372  AYGVPQEC 349
             YG+PQEC
Sbjct: 780  DYGLPQEC 787


>ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]
            gi|110740929|dbj|BAE98560.1| hypothetical protein
            [Arabidopsis thaliana] gi|332646160|gb|AEE79681.1|
            exostosin family protein [Arabidopsis thaliana]
          Length = 791

 Score = 1141 bits (2951), Expect = 0.0
 Identities = 521/788 (66%), Positives = 620/788 (78%), Gaps = 5/788 (0%)
 Frame = -1

Query: 2697 QKKGCSWLLIAAIGSVVLLLLYVVQFSLSPWGSTLDYFGIRQGQTTCIPSNESTKGTEVT 2518
            QK   SW  IA + SV++L+  V  F L P   + D   +RQ Q  C PSNES   ++VT
Sbjct: 5    QKWKFSWSQIATVASVIVLVSLVHLF-LGPVVPSFDSITVRQAQNLCGPSNESI--SQVT 61

Query: 2517 QHDARASPPGVSLAARFPVDLHDAVVYRGAPWKAEIGRWLSGCDTVVKEVNITEKISGKV 2338
            ++ ++ S   V+   RFP D H AVVYR A WKAEIG+WLS CD V KEV+I E I G+ 
Sbjct: 62   KNSSQ-SLVVVAFDRRFPADSHGAVVYRNASWKAEIGQWLSSCDAVAKEVDIIEPIGGRK 120

Query: 2337 CKNDCSGQGVCNHELGQCRCFHGYSGEQCSDTLYLACNYPVTPDRPYGRWVVSICPGQCD 2158
            C +DCSGQGVCNHE G CRCFHG++   CS  L L CNY  TP+ PYG+WVVSIC   CD
Sbjct: 121  CMSDCSGQGVCNHEFGLCRCFHGFT--DCSQKLRLDCNYEKTPEMPYGKWVVSICSRHCD 178

Query: 2157 TTRAMCFCGAGTKYPYRPVAEQCGFKSNTSSTPPS---TDWGKPDLDVFTTNSSVPGWCN 1987
            TTRAMCFCG GTKYP RPV E CGF+ N+ + P     TDW KPDLD+ TTNSS  GWCN
Sbjct: 179  TTRAMCFCGEGTKYPNRPVPESCGFQINSPTNPDEPKMTDWSKPDLDILTTNSSKQGWCN 238

Query: 1986 VDPREGYTGKAKFKEECHCKYDGRIGIFCEIRVISTCINQCSGHGYCRGGFCQCHEGWYG 1807
            VDP + Y  K K KEEC CKYD   G FCEI V  TC+NQCSGHG CRGGFCQC +GW+G
Sbjct: 239  VDPEDAYAMKVKIKEECDCKYDCLWGRFCEIPVQCTCVNQCSGHGKCRGGFCQCDKGWFG 298

Query: 1806 ADCSIPSVFSSIRDWPQWLRPAHIDMPGYTHAAESINDVGALVTKKRPLIYVYDLPPEFV 1627
             DCSIPS  S++ +WPQWLRPAH+++P   +   ++ ++ A+V KKRPLIY+YDLPP+F 
Sbjct: 299  TDCSIPSTLSTVGEWPQWLRPAHLEVPSEKNVPGNLINLSAVVKKKRPLIYIYDLPPDFN 358

Query: 1626 SHLLEGRHFKFQCVNRLYDDKNATFWTEQLYGAQMAIYESFLASPYRTLNGEEADFFFVP 1447
            S L+EGRHFKF+CVNR+YD++NAT WT+ LYG+QMA YE+ LA+ +RT+NGEEADFFFVP
Sbjct: 359  SLLIEGRHFKFECVNRIYDERNATVWTDYLYGSQMAFYENILATAHRTMNGEEADFFFVP 418

Query: 1446 ILDSCIITRGDDAPHMNMQKHSGIRSSFTLEFYKKAYDHIVENYPYWKRSAGKDHIWFFA 1267
            +LDSCII R DDAPH+NMQ H+G+RSS TLEFYK+AY+HIVE YPYW RSAG+DHIWFF+
Sbjct: 419  VLDSCIINRADDAPHINMQNHTGLRSSLTLEFYKRAYEHIVEKYPYWNRSAGRDHIWFFS 478

Query: 1266 WDEGACYAPKEIWNSMMLVHWGNTNSKHKNSTTAYWADNWNDISPAIRGNHPCFDPAKDL 1087
            WDEGACYAPKEIWNSMMLVHWGNTNSKH +STTAY+ DNW+DIS   RG+HPCFDP KDL
Sbjct: 479  WDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYFGDNWDDISDERRGDHPCFDPRKDL 538

Query: 1086 VLPSWKRPDGNSYRAKLWARPRNQRTTLFYFNGNLGPAYDNGRPEATYSMGIRQKLAEEF 907
            V+P+WK PD  S R   W RPR +R TLFYFNGNLGPAY+ GRPE +YSMGIRQKLAEEF
Sbjct: 539  VIPAWKVPDPYSMRKNYWERPREKRKTLFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEF 598

Query: 906  GSNPDKDGKLGRQHQKDVIVTQLRSEMYHEELASSVFCGVFPGDGWSGRMEDSILQGCIP 727
            GS+P+K+GKLG+QH +DVIVT LRS+ YH+++A+S+FCG FPGDGWSGRMEDSILQGC+P
Sbjct: 599  GSSPNKEGKLGKQHAEDVIVTPLRSDNYHKDIANSIFCGAFPGDGWSGRMEDSILQGCVP 658

Query: 726  VIIQDGIFLPYENFLNYESFAVRIGEDEIPNLIKILRGFSETEIEFKLANVQRIWQRFLY 547
            VIIQDGI+LPYEN LNYESFAVR+ ED+IPNLI  LRGFSE EI+F+L NV+ +WQRFL+
Sbjct: 659  VIIQDGIYLPYENMLNYESFAVRVNEDDIPNLINTLRGFSEAEIQFRLGNVKELWQRFLF 718

Query: 546  RDSILREAERQKSNFGHTSDWATQLLQFSEDDVFATLIQVLHYKLHNDPWRKRAA--HLK 373
            RDSIL EAERQK+ +GH  DWA Q  +   DD+FAT+IQ LH+KLHNDPWR+  A    K
Sbjct: 719  RDSILLEAERQKATYGHEEDWAVQFSKLKHDDIFATIIQTLHFKLHNDPWRREQAVNRTK 778

Query: 372  AYGVPQEC 349
             YG+PQEC
Sbjct: 779  DYGLPQEC 786


Top