BLASTX nr result
ID: Sinomenium22_contig00011808
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00011808 (843 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EXC02106.1| hypothetical protein L484_024071 [Morus notabilis]    362   1e-97
ref|XP_002272057.2| PREDICTED: uncharacterized protein LOC100266...  355   1e-95
emb|CBI29499.3| unnamed protein product [Vitis vinifera]             355   1e-95
ref|XP_007039777.1| O-fucosyltransferase family protein, putativ...  353   6e-95
ref|XP_007151341.1| hypothetical protein PHAVU_004G038100g [Phas...  341   2e-91
ref|XP_006477145.1| PREDICTED: uncharacterized protein LOC102619...  340   4e-91
ref|XP_002304290.2| hypothetical protein POPTR_0003s07710g [Popu...  340   5e-91
ref|XP_003543971.1| PREDICTED: uncharacterized protein LOC100788...  339   7e-91
ref|XP_002521236.1| conserved hypothetical protein [Ricinus comm...  339   7e-91
ref|XP_002868069.1| hypothetical protein ARALYDRAFT_493135 [Arab...  337   3e-90
ref|XP_004299410.1| PREDICTED: uncharacterized protein LOC101295...  335   1e-89
ref|XP_006440263.1| hypothetical protein CICLE_v10019844mg [Citr...  334   2e-89
ref|NP_193473.1| O-fucosyltransferase family protein [Arabidopsi...  334   3e-89
ref|XP_004245784.1| PREDICTED: uncharacterized protein LOC101261...  333   5e-89
ref|XP_006355417.1| PREDICTED: uncharacterized protein LOC102586...  331   2e-88
ref|XP_006414231.1| hypothetical protein EUTSA_v10024957mg [Eutr...  331   2e-88
ref|XP_004503386.1| PREDICTED: uncharacterized protein LOC101504...  330   3e-88
ref|XP_003630908.1| CigA protein [Medicago truncatula] gi|355524...  330   4e-88
ref|XP_006285372.1| hypothetical protein CARUB_v10006763mg [Caps...
327 5e-87 gb|ABD65093.1| hypothetical protein 31.t00055 [Brassica oleracea] 321 2e-85 >gb|EXC02106.1| hypothetical protein L484_024071 [Morus notabilis] Length = 512 Score = 362 bits (929), Expect = 1e-97 Identities = 184/262 (70%), Positives = 213/262 (81%) Frame = +2 Query: 56 PKSLFFNSSKQPLNSRVFQCSELQRLGEKFLWFAPHSGFSNQFSELKNAVLMAAILNRTL 235 P SL +SSK P +S QC + EKFLW+APHSGFSNQ SE KNA+LMAAILNRTL Sbjct: 54 PTSLSLSSSKTP-HSLQNQC-RIPSPREKFLWYAPHSGFSNQLSEFKNALLMAAILNRTL 111 Query: 236 IVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDLSPLVSSS 415 IVPP+LDHHAVALGSCPKFRVS+P+E+R VWDH +ELI S RYVSM D++D+S LVSSS Sbjct: 112 IVPPILDHHAVALGSCPKFRVSAPAEIRASVWDHAVELIRSGRYVSMADIVDISSLVSSS 171 Query: 416 MLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDVHKCLYAP 595 ++ IDFR+FAS WC L + C + SD +SS LL+ LKQCGSLL DG+ V KCLYA Sbjct: 172 FIRAIDFRVFASQWCNLNLEGICVNESDKQSS-LLDSLKQCGSLLAGLDGS-VSKCLYAV 229 Query: 596 NEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSEAELATVL 775 NEDCRTTVWTY+ + EDG LDSFQPDE+L +KKKISYVRRRRDVYK LGP SEA+ AT+L Sbjct: 230 NEDCRTTVWTYKNDNEDGTLDSFQPDEQLKKKKKISYVRRRRDVYKNLGPDSEADSATLL 289 Query: 776 AFGSLFTAPYKGSELYIDIHES 841 AFGS+FT+PYKGSELYIDIHES Sbjct: 290 AFGSIFTSPYKGSELYIDIHES 311 >ref|XP_002272057.2| PREDICTED: uncharacterized protein LOC100266043 [Vitis vinifera] Length = 482 Score = 355 bits (911), Expect = 1e-95 Identities = 174/248 (70%), Positives = 207/248 (83%) Frame = +2 Query: 98 SRVFQCSELQRLGEKFLWFAPHSGFSNQFSELKNAVLMAAILNRTLIVPPVLDHHAVALG 277 SR QC+ G++FLW+APHSGFSNQ SE KNA+LMAAILNRTL+VPP+LDHHAVALG Sbjct: 50 SRTSQCTPQNLPGQRFLWYAPHSGFSNQVSEFKNAILMAAILNRTLVVPPILDHHAVALG 109 Query: 278 SCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDLSPLVSSSMLKIIDFRIFASLW 457 SCPKFRV P E+R VW+H+I+L+ SRRYVSM D+IDLS LVS S+++ IDFR F SLW Sbjct: 110 SCPKFRVLGPGEIRLSVWNHVIDLLRSRRYVSMADIIDLSSLVSISVIQAIDFRDFISLW 169 Query: 458 CGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDVHKCLYAPNEDCRTTVWTYEQN 637 CG+ D CF+ S+ +SS LL+ LKQCGS L DGN V KC+YA +EDCRTTVWTY+QN Sbjct: 170 
CGVNVDFDCFNESNDQSS-LLDSLKQCGSRLSGLDGN-VDKCIYALDEDCRTTVWTYQQN 227 Query: 638 VEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSEAELATVLAFGSLFTAPYKGSE 817 +D +LDSFQPDE+L +KKKISY+R+RRDVYK LGPGS+AE ATVLAFGSLFTAPYKGSE Sbjct: 228 -DDEVLDSFQPDEQLKKKKKISYIRKRRDVYKTLGPGSKAESATVLAFGSLFTAPYKGSE 286 Query: 818 LYIDIHES 841 LYIDI+E+ Sbjct: 287 LYIDINEA 294 >emb|CBI29499.3| unnamed protein product [Vitis vinifera] Length = 494 Score = 355 bits (911), Expect = 1e-95 Identities = 174/248 (70%), Positives = 207/248 (83%) Frame = +2 Query: 98 SRVFQCSELQRLGEKFLWFAPHSGFSNQFSELKNAVLMAAILNRTLIVPPVLDHHAVALG 277 SR QC+ G++FLW+APHSGFSNQ SE KNA+LMAAILNRTL+VPP+LDHHAVALG Sbjct: 50 SRTSQCTPQNLPGQRFLWYAPHSGFSNQVSEFKNAILMAAILNRTLVVPPILDHHAVALG 109 Query: 278 SCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDLSPLVSSSMLKIIDFRIFASLW 457 SCPKFRV P E+R VW+H+I+L+ SRRYVSM D+IDLS LVS S+++ IDFR F SLW Sbjct: 110 SCPKFRVLGPGEIRLSVWNHVIDLLRSRRYVSMADIIDLSSLVSISVIQAIDFRDFISLW 169 Query: 458 CGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDVHKCLYAPNEDCRTTVWTYEQN 637 CG+ D CF+ S+ +SS LL+ LKQCGS L DGN V KC+YA +EDCRTTVWTY+QN Sbjct: 170 CGVNVDFDCFNESNDQSS-LLDSLKQCGSRLSGLDGN-VDKCIYALDEDCRTTVWTYQQN 227 Query: 638 VEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSEAELATVLAFGSLFTAPYKGSE 817 +D +LDSFQPDE+L +KKKISY+R+RRDVYK LGPGS+AE ATVLAFGSLFTAPYKGSE Sbjct: 228 -DDEVLDSFQPDEQLKKKKKISYIRKRRDVYKTLGPGSKAESATVLAFGSLFTAPYKGSE 286 Query: 818 LYIDIHES 841 LYIDI+E+ Sbjct: 287 LYIDINEA 294 >ref|XP_007039777.1| O-fucosyltransferase family protein, putative isoform 2 [Theobroma cacao] gi|508777022|gb|EOY24278.1| O-fucosyltransferase family protein, putative isoform 2 [Theobroma cacao] Length = 514 Score = 353 bits (905), Expect = 6e-95 Identities = 178/270 (65%), Positives = 212/270 (78%), Gaps = 3/270 (1%) Frame = +2 Query: 41 SHTDIPKSLFFNSSKQ---PLNSRVFQCSELQRLGEKFLWFAPHSGFSNQFSELKNAVLM 211 ++ IPKSLF SSK L+ + C+ Q GEKFLW+APHSGFSNQ SE KNA+LM Sbjct: 47 TYISIPKSLFSTSSKTVNAALSPQYPHCTT-QIPGEKFLWYAPHSGFSNQLSEFKNAILM 105 Query: 212 
AAILNRTLIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVID 391 A ILNRTLIVPP+LDHHAV LGSCPKFRV S E+R VWDHI ELI S RYVSM D+ID Sbjct: 106 AGILNRTLIVPPILDHHAVVLGSCPKFRVQSAKEIRLSVWDHINELIRSERYVSMADIID 165 Query: 392 LSPLVSSSMLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGND 571 +S L+SSS+++ IDFR+F SLWCGL DL C + + + S ++ L+QCGSLL DGN Sbjct: 166 ISSLLSSSLVRAIDFRVFVSLWCGLNMDLVCSNELNAQQS-MVGSLRQCGSLLSGIDGN- 223 Query: 572 VHKCLYAPNEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGS 751 + +CL+A +EDCRTTVWTY+ + DG+LDSFQPDE+L KKKISYVRRRR+VYK LGPGS Sbjct: 224 IDRCLFAVDEDCRTTVWTYQNDEVDGVLDSFQPDEQLKNKKKISYVRRRRNVYKTLGPGS 283 Query: 752 EAELATVLAFGSLFTAPYKGSELYIDIHES 841 EAE ATVLAFGSLFTAPYKGS+LYIDI ++ Sbjct: 284 EAESATVLAFGSLFTAPYKGSDLYIDIQKA 313 >ref|XP_007151341.1| hypothetical protein PHAVU_004G038100g [Phaseolus vulgaris] gi|561024650|gb|ESW23335.1| hypothetical protein PHAVU_004G038100g [Phaseolus vulgaris] Length = 500 Score = 341 bits (875), Expect = 2e-91 Identities = 165/246 (67%), Positives = 202/246 (82%), Gaps = 2/246 (0%) Frame = +2 Query: 110 QCS-ELQRLGEKFLWFAPHSGFSNQFSELKNAVLMAAILNRTLIVPPVLDHHAVALGSCP 286 QCS + LGEKF+W+APHSGFSNQ SE KNAVLMA ILNRTL+VPP+LDHHAVALGSCP Sbjct: 63 QCSSQALTLGEKFMWYAPHSGFSNQLSEFKNAVLMAGILNRTLVVPPILDHHAVALGSCP 122 Query: 287 KFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDLSPLVSSSMLKIIDFRIFASLWCGL 466 KFRV P ++R VWDH+IEL++SRRY+S+ ++ID+S LVSSS++++IDFR F S+WCG+ Sbjct: 123 KFRVLDPKDIRISVWDHVIELVQSRRYISIAEIIDISSLVSSSLVRVIDFRDFVSIWCGI 182 Query: 467 GTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDVHKCLYAPNEDCRTTVWTY-EQNVE 643 DLAC + + SS + + LKQCGSLL G+ + KC+YA NEDCRTTVWTY E Sbjct: 183 SLDLACITDTKLHSS-VSKSLKQCGSLLAGLHGS-IEKCIYAVNEDCRTTVWTYHEDGHG 240 Query: 644 DGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSEAELATVLAFGSLFTAPYKGSELY 823 DG+LDSFQPDE+L KKKISYVRRR+DV+K LGPGSEA A++LAFG+LF+A YKGSELY Sbjct: 241 DGMLDSFQPDEQLKHKKKISYVRRRKDVFKTLGPGSEAGSASLLAFGTLFSATYKGSELY 300 Query: 824 IDIHES 841 +DIHES Sbjct: 301 VDIHES 306 >ref|XP_006477145.1| PREDICTED: uncharacterized protein LOC102619700 
[Citrus sinensis] Length = 496 Score = 340 bits (872), Expect = 4e-91 Identities = 171/267 (64%), Positives = 211/267 (79%), Gaps = 2/267 (0%) Frame = +2 Query: 41 SHTDIPKSLFFNSSKQPLNSRVFQCSELQRLG--EKFLWFAPHSGFSNQFSELKNAVLMA 214 ++ IP+SL SSK L+ + QC + + +KF +APHSGFSNQ E KNA+LMA Sbjct: 42 AYNHIPESLLSLSSKT-LDPKFSQCHTTKAISPDKKFFLYAPHSGFSNQLGEFKNAILMA 100 Query: 215 AILNRTLIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDL 394 ILNRTLIVPPVLDHHAVALGSCPKFRV SP+++R VWDH IEL+ S RYVSM D+ID+ Sbjct: 101 GILNRTLIVPPVLDHHAVALGSCPKFRVQSPNQMRISVWDHAIELLRSGRYVSMADIIDI 160 Query: 395 SPLVSSSMLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDV 574 S LVSSSM+K++DFR FASLWCGL DLAC + + S LL++L+QC S+L +GN V Sbjct: 161 SSLVSSSMVKVLDFRRFASLWCGLDVDLACLISLNTQPS-LLDRLRQCVSMLSGLNGN-V 218 Query: 575 HKCLYAPNEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSE 754 C +A ++DCRTTVWTY+ EDG+LD FQPDE+L +KKK+SYVRRRRDVYKALGPGS+ Sbjct: 219 DGCFFAVDDDCRTTVWTYQSGDEDGVLDPFQPDEQLKKKKKVSYVRRRRDVYKALGPGSK 278 Query: 755 AELATVLAFGSLFTAPYKGSELYIDIH 835 A+ AT+LAFG+LFTAPYKGS+LYIDI+ Sbjct: 279 ADSATILAFGTLFTAPYKGSQLYIDIN 305 >ref|XP_002304290.2| hypothetical protein POPTR_0003s07710g [Populus trichocarpa] gi|550342654|gb|EEE79269.2| hypothetical protein POPTR_0003s07710g [Populus trichocarpa] Length = 509 Score = 340 bits (871), Expect = 5e-91 Identities = 172/264 (65%), Positives = 205/264 (77%), Gaps = 2/264 (0%) Frame = +2 Query: 56 PKSLFFNSSKQPLNSRVFQCSELQRL--GEKFLWFAPHSGFSNQFSELKNAVLMAAILNR 229 P SLF + N + QC++ Q L GEKFLW+APHSGFSNQ SE KN +LMA ILNR Sbjct: 54 PNSLFSKTITN--NPLISQCTKFQTLALGEKFLWYAPHSGFSNQLSEFKNGILMAGILNR 111 Query: 230 TLIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDLSPLVS 409 TLIVPPVLDHHAVALGSCPKFRV P E+R VWDH+++L+++ RYVSM D+ID+S LV Sbjct: 112 TLIVPPVLDHHAVALGSCPKFRVLGPKEIRVSVWDHVLDLVKTGRYVSMADIIDISSLVP 171 Query: 410 SSMLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDVHKCLY 589 SS ++ IDFR+FAS WC + D C + + +SS L + L CGS+L DGN V KCLY Sbjct: 172 
SS-IQAIDFRVFASQWCNVKMDFTCSNDLNAQSS-LFDSLNLCGSILSGIDGN-VDKCLY 228 Query: 590 APNEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSEAELAT 769 A +EDCRTTVWTY+ ED + DSFQPDE+L +KKKISYVRRR+DVYK+LGPGSEA AT Sbjct: 229 AVDEDCRTTVWTYKNGDEDRVFDSFQPDEQLKKKKKISYVRRRQDVYKSLGPGSEAGSAT 288 Query: 770 VLAFGSLFTAPYKGSELYIDIHES 841 VLAFGSLFTAPYKGSEL+IDIHE+ Sbjct: 289 VLAFGSLFTAPYKGSELHIDIHEA 312 >ref|XP_003543971.1| PREDICTED: uncharacterized protein LOC100788337 [Glycine max] Length = 501 Score = 339 bits (870), Expect = 7e-91 Identities = 168/264 (63%), Positives = 206/264 (78%), Gaps = 2/264 (0%) Frame = +2 Query: 56 PKSLFFNSSKQPLNSRVFQCS-ELQRLGEKFLWFAPHSGFSNQFSELKNAVLMAAILNRT 232 P S F SK S QCS + LGEKF+W+APHSGFSNQ SE KNAVLMA ILNRT Sbjct: 46 PDSQFNGLSKTSTMSHAPQCSGQALALGEKFVWYAPHSGFSNQLSEFKNAVLMAGILNRT 105 Query: 233 LIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDLSPLVSS 412 L+VPP+LDHHAVALGSCPKFRV P ++R VWDH+IEL++SRRY+S+ ++ID+S LVS Sbjct: 106 LVVPPILDHHAVALGSCPKFRVVDPKDVRISVWDHVIELVQSRRYISIAEIIDVSSLVSP 165 Query: 413 SMLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDVHKCLYA 592 S++++ID R F S+WCG+ DLAC + ++SS + E LKQCGSLL G+ + KC+YA Sbjct: 166 SLVRVIDLRDFVSIWCGISLDLACVKDTKLQSS-VSESLKQCGSLLAGLHGS-IEKCIYA 223 Query: 593 PNEDCRTTVWTYE-QNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSEAELAT 769 NEDCRTT+WT+ EDG LDSFQ DE+L +KKKISYVRRR+DV+K LGPGSE E A+ Sbjct: 224 VNEDCRTTIWTFHTDGHEDGKLDSFQADEQLKQKKKISYVRRRKDVFKTLGPGSEVESAS 283 Query: 770 VLAFGSLFTAPYKGSELYIDIHES 841 +LAFGSLF+A YKGSELY+DIHES Sbjct: 284 LLAFGSLFSAAYKGSELYVDIHES 307 >ref|XP_002521236.1| conserved hypothetical protein [Ricinus communis] gi|223539504|gb|EEF41092.1| conserved hypothetical protein [Ricinus communis] Length = 506 Score = 339 bits (870), Expect = 7e-91 Identities = 172/270 (63%), Positives = 209/270 (77%), Gaps = 2/270 (0%) Frame = +2 Query: 38 SSHTDIPKSLFFNSSKQPLNSRVFQCSELQRL--GEKFLWFAPHSGFSNQFSELKNAVLM 211 +S+T K + N+ L+S++ QCS Q L GEKFLW+APHSGFSNQ SE KNA+LM Sbjct: 46 
TSYTKTSKPILQNT----LDSQISQCSRFQSLTGGEKFLWYAPHSGFSNQLSEFKNAILM 101 Query: 212 AAILNRTLIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVID 391 A ILNRTLIVPP+LDHHAVALGSCPK RV P ++R VW+H IEL+++ RYVSMVD+ID Sbjct: 102 AGILNRTLIVPPILDHHAVALGSCPKLRVLGPKDIRISVWNHAIELVKTGRYVSMVDIID 161 Query: 392 LSPLVSSSMLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGND 571 +S LV SS ++ IDFR+FASLWCG+ D C + + SS L + L QCGS+L GN Sbjct: 162 ISSLVPSS-IRAIDFRVFASLWCGVNKDFICTNNLNAESS-LFDSLGQCGSVLSGFTGN- 218 Query: 572 VHKCLYAPNEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGS 751 + KCLYA EDCRTTVWTY+ +DG+LDSFQPDE+L +KK ISY+RR +DVYK LG GS Sbjct: 219 IGKCLYAVVEDCRTTVWTYKNGEKDGVLDSFQPDEQLKKKKNISYIRRHQDVYKVLGTGS 278 Query: 752 EAELATVLAFGSLFTAPYKGSELYIDIHES 841 E+E A+VLAFGSLFTAPYKGSELYIDIHE+ Sbjct: 279 ESESASVLAFGSLFTAPYKGSELYIDIHEA 308 >ref|XP_002868069.1| hypothetical protein ARALYDRAFT_493135 [Arabidopsis lyrata subsp. lyrata] gi|297313905|gb|EFH44328.1| hypothetical protein ARALYDRAFT_493135 [Arabidopsis lyrata subsp. 
lyrata] Length = 508 Score = 337 bits (864), Expect = 3e-90 Identities = 175/271 (64%), Positives = 206/271 (76%), Gaps = 4/271 (1%) Frame = +2 Query: 41 SHTDIPKSLF----FNSSKQPLNSRVFQCSELQRLGEKFLWFAPHSGFSNQFSELKNAVL 208 +++++PKSLF F+ S Q R + LG+KFLW+APHSGFSNQ SE KNAVL Sbjct: 47 TYSEMPKSLFSISAFSGSVQFPQCRS-EILTRTLLGQKFLWYAPHSGFSNQLSEFKNAVL 105 Query: 209 MAAILNRTLIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVI 388 MA ILNRTLI+PP+LDHHAVALGSCPKFRV SPSE+R VW+H IEL+ + RYVSM D++ Sbjct: 106 MAGILNRTLIIPPILDHHAVALGSCPKFRVLSPSEIRISVWNHSIELLRTDRYVSMADIV 165 Query: 389 DLSPLVSSSMLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGN 568 D+S LVSSS +++IDFR FASL CG+ + C D S E LKQCG LL GN Sbjct: 166 DISSLVSSSAVRVIDFRYFASLLCGVDLETLCSD-DLAEQSQAYELLKQCGYLLSGVRGN 224 Query: 569 DVHKCLYAPNEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPG 748 V KCLYA +EDCRTTVWTY+ DG LDSFQPDE+L +KKK+SYVRRRRDVYK LG G Sbjct: 225 -VDKCLYAVDEDCRTTVWTYKNGDADGRLDSFQPDEKLKKKKKLSYVRRRRDVYKTLGHG 283 Query: 749 SEAELATVLAFGSLFTAPYKGSELYIDIHES 841 +EAE A +LAFGSLFTAPYKGSELYIDIH+S Sbjct: 284 TEAESAAILAFGSLFTAPYKGSELYIDIHKS 314 >ref|XP_004299410.1| PREDICTED: uncharacterized protein LOC101295132 [Fragaria vesca subsp. 
vesca] Length = 487 Score = 335 bits (860), Expect = 1e-89 Identities = 166/235 (70%), Positives = 196/235 (83%), Gaps = 1/235 (0%) Frame = +2 Query: 140 KFLWFAPHSGFSNQFSELKNAVLMAAILNRTLIVPPVLDHHAVALGSCPKFRVSSPSELR 319 +FLW+APHSGFSNQ ELKN +LMA ILNRTLIVPPVLDHHAVALGSCPKFRVS+P+E+R Sbjct: 63 RFLWYAPHSGFSNQLMELKNGILMAGILNRTLIVPPVLDHHAVALGSCPKFRVSAPNEIR 122 Query: 320 KKVWDHIIELIESRRYVSMVDVIDLSPLVSSSMLKIIDFRIFASLWCGLGTDLACFDVSD 499 +VW+H++ELI S RYVSM D++DLS LVSSS++++IDFR F SLWC + D AC ++ Sbjct: 123 GQVWEHVVELIRSGRYVSMADIVDLSSLVSSSLVRVIDFRDFMSLWCDVNLDFAC--PNE 180 Query: 500 VRSSP-LLEKLKQCGSLLFSTDGNDVHKCLYAPNEDCRTTVWTYEQNVEDGLLDSFQPDE 676 + P LL+KLK+CGS+L GN KCL+A NEDCRTTVWTY+ EDG LDSFQPDE Sbjct: 181 FNAQPHLLDKLKECGSVLTGVKGN--VKCLHAVNEDCRTTVWTYQNGNEDGALDSFQPDE 238 Query: 677 ELLRKKKISYVRRRRDVYKALGPGSEAELATVLAFGSLFTAPYKGSELYIDIHES 841 + L+KKKISYVR+RRDVYK LGPGSE+E A+VLAFGSLFTAPYKGSEL IDI ES Sbjct: 239 K-LKKKKISYVRKRRDVYKTLGPGSESESASVLAFGSLFTAPYKGSELLIDIRES 292 >ref|XP_006440263.1| hypothetical protein CICLE_v10019844mg [Citrus clementina] gi|557542525|gb|ESR53503.1| hypothetical protein CICLE_v10019844mg [Citrus clementina] Length = 496 Score = 334 bits (857), Expect = 2e-89 Identities = 169/267 (63%), Positives = 209/267 (78%), Gaps = 2/267 (0%) Frame = +2 Query: 41 SHTDIPKSLFFNSSKQPLNSRVFQCSELQRLG--EKFLWFAPHSGFSNQFSELKNAVLMA 214 ++ IP+SL SSK L+ + QC + + +KF +APHSGFSNQ E KNA+LMA Sbjct: 42 AYNHIPESLLSLSSKT-LDPKFSQCHTTKAISPDKKFFLYAPHSGFSNQLGEFKNAILMA 100 Query: 215 AILNRTLIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDL 394 ILNRTLIVPPVLDHHAVALGSCPKFRV SP+++R VW H IEL+ S RYVSM D+ID+ Sbjct: 101 GILNRTLIVPPVLDHHAVALGSCPKFRVQSPNQMRISVWHHAIELLRSGRYVSMADIIDI 160 Query: 395 SPLVSSSMLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDV 574 S LVSSSM+K++DFR FASLWCGL DLAC + + S LL++L+QC S+L +GN V Sbjct: 161 SSLVSSSMVKVLDFRRFASLWCGLDVDLACLISLNTQPS-LLDRLRQCVSMLSGLNGN-V 218 Query: 575 HKCLYAPNEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSE 754 C +A ++DCRTTVWTY+ EDG+LD FQPDE+L 
+KKK+SYVRRRRDVYKALG GS+ Sbjct: 219 DGCFFAVDDDCRTTVWTYQSGDEDGVLDPFQPDEQLKKKKKVSYVRRRRDVYKALGSGSK 278 Query: 755 AELATVLAFGSLFTAPYKGSELYIDIH 835 A+ AT+LAFG+LFTAPYKGS+LYIDI+ Sbjct: 279 ADSATILAFGTLFTAPYKGSQLYIDIN 305 >ref|NP_193473.1| O-fucosyltransferase family protein [Arabidopsis thaliana] gi|95147298|gb|ABF57284.1| At4g17430 [Arabidopsis thaliana] gi|332658490|gb|AEE83890.1| O-fucosyltransferase family protein [Arabidopsis thaliana] Length = 507 Score = 334 bits (856), Expect = 3e-89 Identities = 173/271 (63%), Positives = 206/271 (76%), Gaps = 4/271 (1%) Frame = +2 Query: 41 SHTDIPKSLF----FNSSKQPLNSRVFQCSELQRLGEKFLWFAPHSGFSNQFSELKNAVL 208 +++++PKSLF F+ S Q R + LG+KFLW+APHSGFSNQ SE KNA+L Sbjct: 46 TYSEMPKSLFSISAFSGSVQFPQCRS-EILTRTLLGQKFLWYAPHSGFSNQLSEFKNALL 104 Query: 209 MAAILNRTLIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVI 388 MA ILNRTLI+PP+LDHHAVALGSCPKFRV SPSE+R VW+H IEL+++ RYVSM D++ Sbjct: 105 MAGILNRTLIIPPILDHHAVALGSCPKFRVLSPSEIRISVWNHSIELLKTDRYVSMADIV 164 Query: 389 DLSPLVSSSMLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGN 568 D+S LVSSS +++IDFR FASL CG+ + C D S E LKQCG LL GN Sbjct: 165 DISSLVSSSAVRVIDFRYFASLQCGVDLETLCTD-DLAEQSQAYESLKQCGYLLSGVRGN 223 Query: 569 DVHKCLYAPNEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPG 748 V KCLYA +EDCRTTVWTY+ DG LDSFQPDE+L +KKK+S VRRRRDVYK LG G Sbjct: 224 -VDKCLYAVDEDCRTTVWTYKNGEADGRLDSFQPDEKLKKKKKLSNVRRRRDVYKTLGHG 282 Query: 749 SEAELATVLAFGSLFTAPYKGSELYIDIHES 841 +EAE A +LAFGSLFTAPYKGSELYIDIH+S Sbjct: 283 TEAESAAILAFGSLFTAPYKGSELYIDIHKS 313 >ref|XP_004245784.1| PREDICTED: uncharacterized protein LOC101261944 [Solanum lycopersicum] Length = 495 Score = 333 bits (854), Expect = 5e-89 Identities = 167/262 (63%), Positives = 199/262 (75%) Frame = +2 Query: 56 PKSLFFNSSKQPLNSRVFQCSELQRLGEKFLWFAPHSGFSNQFSELKNAVLMAAILNRTL 235 P L + P+ R QC+ RL EKF+W+APHSGFSNQ +E KNA+LMA ILNRTL Sbjct: 43 PSLLPLSKKSIPIIPRPQQCNPENRLQEKFMWYAPHSGFSNQLAEFKNAILMAKILNRTL 102 Query: 236 
IVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDLSPLVSSS 415 IVPPVLDHHAVALGSCPKFRV P+ELR VW+H I+L+ RYVSM D++DLSPL S S Sbjct: 103 IVPPVLDHHAVALGSCPKFRVLEPNELRYLVWNHSIQLLRDCRYVSMADIVDLSPLASYS 162 Query: 416 MLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDVHKCLYAP 595 ++ IDFR F S WCG+ D+ C ++ SS L E L+QCGSLL G+ CL A Sbjct: 163 TVRFIDFRAFVSSWCGVNLDVICSKNQNIPSS-LFESLRQCGSLLSGYYGS-FSGCLSAL 220 Query: 596 NEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSEAELATVL 775 EDCRTTVWTY+++ EDG LDSFQPD++L +KKKIS++RRR+DVYKALGPGS AE ATVL Sbjct: 221 KEDCRTTVWTYKKDDEDGALDSFQPDDQLRKKKKISFIRRRKDVYKALGPGSAAESATVL 280 Query: 776 AFGSLFTAPYKGSELYIDIHES 841 AFGSLFTAPYKGSE +IDIHE+ Sbjct: 281 AFGSLFTAPYKGSESHIDIHEA 302 >ref|XP_006355417.1| PREDICTED: uncharacterized protein LOC102586517 [Solanum tuberosum] Length = 495 Score = 331 bits (849), Expect = 2e-88 Identities = 164/262 (62%), Positives = 198/262 (75%) Frame = +2 Query: 56 PKSLFFNSSKQPLNSRVFQCSELQRLGEKFLWFAPHSGFSNQFSELKNAVLMAAILNRTL 235 P L + P+ R QC+ RL KF+W+APHSGFSNQ +E KNA+LMA ILNRTL Sbjct: 43 PSLLPLSKKSIPIIPRPQQCNPTNRLQGKFMWYAPHSGFSNQLAEFKNAILMAKILNRTL 102 Query: 236 IVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDLSPLVSSS 415 +VPPVLDHHAVALGSCPKFRV P+ELR VW+H I+L+ RYVSM D++DLSPL S S Sbjct: 103 VVPPVLDHHAVALGSCPKFRVLEPNELRYLVWNHSIQLLRDCRYVSMADIVDLSPLASYS 162 Query: 416 MLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDVHKCLYAP 595 ++ IDFR F S WCG+ D+ C ++ SPL E L+QCGSLL G+ CL A Sbjct: 163 TVRFIDFRAFVSSWCGVNLDVICSKDQNI-PSPLSESLRQCGSLLSGYYGS-FSGCLSAL 220 Query: 596 NEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSEAELATVL 775 EDCRTTVWTY+++ EDG LDSFQPD++L +KKKIS++RRR+DVYKALGPGS AE ATVL Sbjct: 221 KEDCRTTVWTYKKDNEDGALDSFQPDDQLRKKKKISFIRRRKDVYKALGPGSAAESATVL 280 Query: 776 AFGSLFTAPYKGSELYIDIHES 841 AFGSLFTAPYKGSE ++DIHE+ Sbjct: 281 AFGSLFTAPYKGSESHVDIHEA 302 >ref|XP_006414231.1| hypothetical protein EUTSA_v10024957mg [Eutrema salsugineum] gi|557115401|gb|ESQ55684.1| hypothetical protein EUTSA_v10024957mg [Eutrema 
salsugineum] Length = 509 Score = 331 bits (848), Expect = 2e-88 Identities = 174/272 (63%), Positives = 206/272 (75%), Gaps = 5/272 (1%) Frame = +2 Query: 41 SHTDIPKSLF----FNSSKQPLNSRVFQCSELQRLGEKFLWFAPHSGFSNQFSELKNAVL 208 ++++IPKSLF F+ S Q R S LG++FLW+APHSGFSNQ SE KNAVL Sbjct: 46 TYSEIPKSLFSISAFSGSVQFPQCRSEILSRTL-LGQRFLWYAPHSGFSNQLSEFKNAVL 104 Query: 209 MAAILNRTLIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVI 388 MA ILNRTLIVPPVLDHHAVALGSCPKFRV SPSE+R VW+H IEL+ + RYVS+ D++ Sbjct: 105 MAGILNRTLIVPPVLDHHAVALGSCPKFRVLSPSEIRVSVWNHSIELLRTGRYVSIADIV 164 Query: 389 DLSPLVSSSMLKIIDFRIFASLWCGLGTDLACF-DVSDVRSSPLLEKLKQCGSLLFSTDG 565 D+S LVSSS +++ID R FASL CG+ + C ++S+ S E LKQCG LL G Sbjct: 165 DISSLVSSSAVRVIDLRFFASLLCGVDLETLCSGELSE--QSQAYESLKQCGYLLSGVRG 222 Query: 566 NDVHKCLYAPNEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGP 745 N V KCLYA +EDCRTTVWTY DG LDSFQPDE+L +KKKISYVRRRRDVYK+LG Sbjct: 223 N-VDKCLYAVDEDCRTTVWTYRNGDSDGKLDSFQPDEKLKKKKKISYVRRRRDVYKSLGG 281 Query: 746 GSEAELATVLAFGSLFTAPYKGSELYIDIHES 841 G+EAE ++AFGSLFTAPYKGSELYID H+S Sbjct: 282 GTEAESVAIMAFGSLFTAPYKGSELYIDFHKS 313 >ref|XP_004503386.1| PREDICTED: uncharacterized protein LOC101504282 isoform X1 [Cicer arietinum] gi|502138386|ref|XP_004503387.1| PREDICTED: uncharacterized protein LOC101504282 isoform X2 [Cicer arietinum] Length = 490 Score = 330 bits (847), Expect = 3e-88 Identities = 161/263 (61%), Positives = 204/263 (77%), Gaps = 3/263 (1%) Frame = +2 Query: 62 SLFFNSSKQPLNSRVFQCS-ELQRLGEKFLWFAPHSGFSNQFSELKNAVLMAAILNRTLI 238 S F S+ S QCS + L EKF+W+APHSGFSNQF E KNAV +A ILNRTL+ Sbjct: 37 SHFAEFSQTLKTSHTLQCSSQALALSEKFMWYAPHSGFSNQFLEFKNAVSIAGILNRTLV 96 Query: 239 VPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDLSPLVSSSM 418 VPP+LDHHAVALGSCPKFRV P ++R VWDH++EL+ S RY+S+ ++ID+S LVSSS+ Sbjct: 97 VPPILDHHAVALGSCPKFRVIEPKDIRISVWDHVVELVRSGRYISIAEIIDISSLVSSSL 156 Query: 419 LKIIDFRIFASLWCGLGTDLACFDVSDVRS-SPLLEKLKQCGSLLFSTDGNDVHKCLYAP 595 +++ID R F S+WCG+ D AC ++D++S SP+ + LKQCGSLL GN + C+Y Sbjct: 157 
VRVIDLRDFVSIWCGISLDFAC--LNDLKSQSPVSKSLKQCGSLLAGLHGN-IENCIYGV 213 Query: 596 NEDCRTTVWTYE-QNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSEAELATV 772 +EDCRTTVWTY EDG+LDSFQPDE+L +KKKISYVRRR+DV++ LGPGS+ E A++ Sbjct: 214 DEDCRTTVWTYHVDGHEDGVLDSFQPDEQLKQKKKISYVRRRKDVFRTLGPGSDVESASM 273 Query: 773 LAFGSLFTAPYKGSELYIDIHES 841 LAFGSLF+APYKGSE Y+DIHES Sbjct: 274 LAFGSLFSAPYKGSESYLDIHES 296 >ref|XP_003630908.1| CigA protein [Medicago truncatula] gi|355524930|gb|AET05384.1| CigA protein [Medicago truncatula] Length = 486 Score = 330 bits (846), Expect = 4e-88 Identities = 162/250 (64%), Positives = 199/250 (79%), Gaps = 2/250 (0%) Frame = +2 Query: 98 SRVFQCS-ELQRLGEKFLWFAPHSGFSNQFSELKNAVLMAAILNRTLIVPPVLDHHAVAL 274 S QCS + L EKF+W+APHSGFSNQ SE K+AVL+A ILNRTL+VPP+LDHHAVAL Sbjct: 47 SHTLQCSSQALALSEKFMWYAPHSGFSNQLSEFKHAVLIAGILNRTLVVPPILDHHAVAL 106 Query: 275 GSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVIDLSPLVSSSMLKIIDFRIFASL 454 GSCPKFRV P+ +R VWDH+I+L+ RYVS+ ++ID+S LVSSS++++ID R F S+ Sbjct: 107 GSCPKFRVVEPNHIRFSVWDHVIQLLRGGRYVSIAEIIDISSLVSSSLVRVIDLRDFVSI 166 Query: 455 WCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGNDVHKCLYAPNEDCRTTVWTYE- 631 WCG+ DLAC + +SS + E LKQCGSLL GN + KC+YA NEDCRTTVWTY Sbjct: 167 WCGISLDLACNNDPKSQSS-VSESLKQCGSLLSGFHGN-IAKCIYAINEDCRTTVWTYHV 224 Query: 632 QNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPGSEAELATVLAFGSLFTAPYKG 811 EDG+LDSFQPDE+L ++KKISYVRRRRDV++ LGPGS+ E A++LAFGSLF+APYKG Sbjct: 225 DGHEDGMLDSFQPDEQLKQRKKISYVRRRRDVFRTLGPGSKVESASMLAFGSLFSAPYKG 284 Query: 812 SELYIDIHES 841 SE YIDIHES Sbjct: 285 SESYIDIHES 294 >ref|XP_006285372.1| hypothetical protein CARUB_v10006763mg [Capsella rubella] gi|482554077|gb|EOA18270.1| hypothetical protein CARUB_v10006763mg [Capsella rubella] Length = 507 Score = 327 bits (837), Expect = 5e-87 Identities = 171/272 (62%), Positives = 204/272 (75%), Gaps = 5/272 (1%) Frame = +2 Query: 41 SHTDIPKSLF----FNSSKQPLNSRVFQCSELQRLGEKFLWFAPHSGFSNQFSELKNAVL 208 +++++PKSLF F+ S Q R + LG+KFLW+APHSGFSNQ SE KNAVL Sbjct: 47 
TYSEMPKSLFSISAFSGSVQFPQCRS-EILTRTLLGQKFLWYAPHSGFSNQLSEFKNAVL 105 Query: 209 MAAILNRTLIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVI 388 MA ILNRTLIVPP+LDHHAVALGSCPKFRV +PS++R VW+H IEL+ + RYVSM D++ Sbjct: 106 MAGILNRTLIVPPILDHHAVALGSCPKFRVLNPSDIRLSVWNHSIELLMNNRYVSMADIV 165 Query: 389 DLSPLVSSSMLKIIDFRIFASLWCGLGTDLACFDVSDVRSSP-LLEKLKQCGSLLFSTDG 565 D+S LVSSS +++ID R FASL CG+ + C D ++ P E LKQCG LL G Sbjct: 166 DISSLVSSSAVRVIDLRYFASLLCGVDLEALCSD--ELAEKPQAYESLKQCGYLLSGVRG 223 Query: 566 NDVHKCLYAPNEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGP 745 N V KCLY +EDCRTTVWTY DG LDSFQPDE+L +KKK+SYVRRRRDVYK LG Sbjct: 224 N-VDKCLYGVDEDCRTTVWTYRNGDADGRLDSFQPDEKLKKKKKLSYVRRRRDVYKTLGH 282 Query: 746 GSEAELATVLAFGSLFTAPYKGSELYIDIHES 841 G+EAE A +LAFGSLFTAPYKGSELYIDI +S Sbjct: 283 GTEAESAAILAFGSLFTAPYKGSELYIDILKS 314 >gb|ABD65093.1| hypothetical protein 31.t00055 [Brassica oleracea] Length = 521 Score = 321 bits (823), Expect = 2e-85 Identities = 171/271 (63%), Positives = 201/271 (74%), Gaps = 4/271 (1%) Frame = +2 Query: 41 SHTDIPKSLFFNSSKQPLNSRVFQCSE--LQR--LGEKFLWFAPHSGFSNQFSELKNAVL 208 ++++IPKS+F SS + QC L R +G++FL +APHSGFSNQ SE KNAVL Sbjct: 57 TYSEIPKSIFSISSAFSGSVEFPQCRSETLSRTLIGQRFLLYAPHSGFSNQLSEFKNAVL 116 Query: 209 MAAILNRTLIVPPVLDHHAVALGSCPKFRVSSPSELRKKVWDHIIELIESRRYVSMVDVI 388 MA ILNRTL+VPPVLDHHAVALGSCPKFRV SPSE+R VW+H +EL+ S RYVSM DV+ Sbjct: 117 MAMILNRTLVVPPVLDHHAVALGSCPKFRVLSPSEVRVSVWNHSVELLRSGRYVSMGDVV 176 Query: 389 DLSPLVSSSMLKIIDFRIFASLWCGLGTDLACFDVSDVRSSPLLEKLKQCGSLLFSTDGN 568 D+S LVSSS +++IDFR FASL CG+ + C S E L+QCG LL GN Sbjct: 177 DISSLVSSSAVRVIDFRYFASLLCGVDLETLC-SGELAEQSQAYESLRQCGYLLSGVRGN 235 Query: 569 DVHKCLYAPNEDCRTTVWTYEQNVEDGLLDSFQPDEELLRKKKISYVRRRRDVYKALGPG 748 V CLY ++DCRTTVWTY DG LDSFQ DE+L +KKKI+YVRRRRDVYKALG G Sbjct: 236 -VDGCLYGVDDDCRTTVWTYRNGGSDGRLDSFQADEKLKKKKKITYVRRRRDVYKALGRG 294 Query: 749 SEAELATVLAFGSLFTAPYKGSELYIDIHES 841 SEAE A +LAFGSLFTAPYKGSELYIDI +S Sbjct: 295 SEAESAAILAFGSLFTAPYKGSELYIDIKKS 325