BLASTX nr result

ID: Cornus23_contig00031089

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00031089
         (945 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI29499.3| unnamed protein product [Vitis vinifera]              384   e-104
ref|XP_002272057.2| PREDICTED: uncharacterized protein LOC100266...   384   e-104
ref|XP_007039777.1| O-fucosyltransferase family protein, putativ...   370   e-100
ref|XP_010258826.1| PREDICTED: uncharacterized protein LOC104598...   368   3e-99
ref|XP_012486763.1| PREDICTED: uncharacterized protein LOC105800...   368   4e-99
ref|XP_011031926.1| PREDICTED: uncharacterized protein LOC105130...   365   4e-98
ref|XP_008238434.1| PREDICTED: uncharacterized protein LOC103337...   363   1e-97
ref|XP_002304290.2| hypothetical protein POPTR_0003s07710g [Popu...   363   1e-97
ref|XP_002868069.1| hypothetical protein ARALYDRAFT_493135 [Arab...   363   1e-97
gb|KHG18329.1| tRNA-splicing ligase RtcB [Gossypium arboreum]         362   2e-97
ref|XP_006477145.1| PREDICTED: uncharacterized protein LOC102619...   362   2e-97
ref|XP_012472055.1| PREDICTED: uncharacterized protein LOC105789...   361   4e-97
ref|NP_193473.1| O-fucosyltransferase family protein [Arabidopsi...   360   9e-97
ref|XP_004245784.1| PREDICTED: uncharacterized protein LOC101261...   360   9e-97
ref|XP_002521236.1| conserved hypothetical protein [Ricinus comm...   360   1e-96
gb|KDO61362.1| hypothetical protein CISIN_1g0110012mg [Citrus si...   359   2e-96
ref|XP_006440263.1| hypothetical protein CICLE_v10019844mg [Citr...   359   2e-96
ref|XP_010104870.1| hypothetical protein L484_024071 [Morus nota...   358   3e-96
gb|KHG18421.1| GDP-fucose O-fucosyltransferase 1 [Gossypium arbo...   358   4e-96
ref|XP_007151341.1| hypothetical protein PHAVU_004G038100g [Phas...   355   3e-95

>emb|CBI29499.3| unnamed protein product [Vitis vinifera]
          Length = 494

 Score =  384 bits (987), Expect = e-104
 Identities = 192/290 (66%), Positives = 225/290 (77%), Gaps = 1/290 (0%)
 Frame = -2

Query: 869 NFNRSNKWKKRPSHRXXXXXXXXXXXXXXXXXXXSYR-HISNSLLPISKNSQPAQFPQCR 693
           N  R+  WKK+  +                    SY  HI NSLLPIS+ SQ      C 
Sbjct: 3   NLGRNRVWKKKGFYNTTSLLSISLLIFIFIFIFVSYTSHIPNSLLPISRTSQ------CT 56

Query: 692 SKQTLGEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTLIVPPILDHHAVALGSCPKFRV 513
            +   G++FLWYAPHSGFSNQ+SEFKNAILMAAILNRTL+VPPILDHHAVALGSCPKFRV
Sbjct: 57  PQNLPGQRFLWYAPHSGFSNQVSEFKNAILMAAILNRTLVVPPILDHHAVALGSCPKFRV 116

Query: 512 STPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSASAIRVIDFRVFVSTWCGLNMDF 333
             P E+R+SVWNH+I+L++S RY+SMADIIDLSSLVS S I+ IDFR F+S WCG+N+DF
Sbjct: 117 LGPGEIRLSVWNHVIDLLRSRRYVSMADIIDLSSLVSISVIQAIDFRDFISLWCGVNVDF 176

Query: 332 ACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYALDEDCRTTVWTYQKGGEDGLPDSF 153
            C N+SN Q SL D+LKQCGS LSG +GN+D C+YALDEDCRTTVWTYQ+  +D + DSF
Sbjct: 177 DCFNESNDQSSLLDSLKQCGSRLSGLDGNVDKCIYALDEDCRTTVWTYQQ-NDDEVLDSF 235

Query: 152 QPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAFGSLFTAPYKGT 3
           QPDEQLKKKKK SY+R+R DVYKT+GPGS+AE ATVLAFGSLFTAPYKG+
Sbjct: 236 QPDEQLKKKKKISYIRKRRDVYKTLGPGSKAESATVLAFGSLFTAPYKGS 285


>ref|XP_002272057.2| PREDICTED: uncharacterized protein LOC100266043 [Vitis vinifera]
          Length = 482

 Score =  384 bits (987), Expect = e-104
 Identities = 192/290 (66%), Positives = 225/290 (77%), Gaps = 1/290 (0%)
 Frame = -2

Query: 869 NFNRSNKWKKRPSHRXXXXXXXXXXXXXXXXXXXSYR-HISNSLLPISKNSQPAQFPQCR 693
           N  R+  WKK+  +                    SY  HI NSLLPIS+ SQ      C 
Sbjct: 3   NLGRNRVWKKKGFYNTTSLLSISLLIFIFIFIFVSYTSHIPNSLLPISRTSQ------CT 56

Query: 692 SKQTLGEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTLIVPPILDHHAVALGSCPKFRV 513
            +   G++FLWYAPHSGFSNQ+SEFKNAILMAAILNRTL+VPPILDHHAVALGSCPKFRV
Sbjct: 57  PQNLPGQRFLWYAPHSGFSNQVSEFKNAILMAAILNRTLVVPPILDHHAVALGSCPKFRV 116

Query: 512 STPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSASAIRVIDFRVFVSTWCGLNMDF 333
             P E+R+SVWNH+I+L++S RY+SMADIIDLSSLVS S I+ IDFR F+S WCG+N+DF
Sbjct: 117 LGPGEIRLSVWNHVIDLLRSRRYVSMADIIDLSSLVSISVIQAIDFRDFISLWCGVNVDF 176

Query: 332 ACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYALDEDCRTTVWTYQKGGEDGLPDSF 153
            C N+SN Q SL D+LKQCGS LSG +GN+D C+YALDEDCRTTVWTYQ+  +D + DSF
Sbjct: 177 DCFNESNDQSSLLDSLKQCGSRLSGLDGNVDKCIYALDEDCRTTVWTYQQ-NDDEVLDSF 235

Query: 152 QPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAFGSLFTAPYKGT 3
           QPDEQLKKKKK SY+R+R DVYKT+GPGS+AE ATVLAFGSLFTAPYKG+
Sbjct: 236 QPDEQLKKKKKISYIRKRRDVYKTLGPGSKAESATVLAFGSLFTAPYKGS 285


>ref|XP_007039777.1| O-fucosyltransferase family protein, putative isoform 2 [Theobroma
           cacao] gi|508777022|gb|EOY24278.1| O-fucosyltransferase
           family protein, putative isoform 2 [Theobroma cacao]
          Length = 514

 Score =  370 bits (950), Expect = e-100
 Identities = 183/258 (70%), Positives = 209/258 (81%), Gaps = 4/258 (1%)
 Frame = -2

Query: 764 YRHISNSLLPISKNSQPA----QFPQCRSKQTLGEKFLWYAPHSGFSNQLSEFKNAILMA 597
           Y  I  SL   S  +  A    Q+P C + Q  GEKFLWYAPHSGFSNQLSEFKNAILMA
Sbjct: 48  YISIPKSLFSTSSKTVNAALSPQYPHCTT-QIPGEKFLWYAPHSGFSNQLSEFKNAILMA 106

Query: 596 AILNRTLIVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDL 417
            ILNRTLIVPPILDHHAV LGSCPKFRV + KE+R+SVW+HI EL++S RY+SMADIID+
Sbjct: 107 GILNRTLIVPPILDHHAVVLGSCPKFRVQSAKEIRLSVWDHINELIRSERYVSMADIIDI 166

Query: 416 SSLVSASAIRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDD 237
           SSL+S+S +R IDFRVFVS WCGLNMD  C N+ N Q S+  +L+QCGSLLSG +GNID 
Sbjct: 167 SSLLSSSLVRAIDFRVFVSLWCGLNMDLVCSNELNAQQSMVGSLRQCGSLLSGIDGNIDR 226

Query: 236 CLYALDEDCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAE 57
           CL+A+DEDCRTTVWTYQ    DG+ DSFQPDEQLK KKK SY+RRR +VYKT+GPGS AE
Sbjct: 227 CLFAVDEDCRTTVWTYQNDEVDGVLDSFQPDEQLKNKKKISYVRRRRNVYKTLGPGSEAE 286

Query: 56  LATVLAFGSLFTAPYKGT 3
            ATVLAFGSLFTAPYKG+
Sbjct: 287 SATVLAFGSLFTAPYKGS 304


>ref|XP_010258826.1| PREDICTED: uncharacterized protein LOC104598454 [Nelumbo nucifera]
          Length = 482

 Score =  368 bits (945), Expect = 3e-99
 Identities = 184/293 (62%), Positives = 224/293 (76%)
 Frame = -2

Query: 881 MEAFNFNRSNKWKKRPSHRXXXXXXXXXXXXXXXXXXXSYRHISNSLLPISKNSQPAQFP 702
           M   +F+RS +WKK+ S+R                    Y  +  S L  S+ +  A F 
Sbjct: 1   MNVLSFHRS-RWKKKTSNRPLFFSISLMIIFFILLLIS-YTEVPKSFLFNSRKALNAGFT 58

Query: 701 QCRSKQTLGEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTLIVPPILDHHAVALGSCPK 522
            C  + T GE+FLW+APHSGFSNQLSE KNAILMAAILNRTLIVPP+LDHH+V LGSCPK
Sbjct: 59  PCSGRNT-GERFLWFAPHSGFSNQLSELKNAILMAAILNRTLIVPPVLDHHSVVLGSCPK 117

Query: 521 FRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSASAIRVIDFRVFVSTWCGLN 342
           FRVS+P +LR+SVWNH+I+L+QSHRYISMADI+DLSSLVS+S ++ IDFRVF+S WCG++
Sbjct: 118 FRVSSPNDLRMSVWNHVIDLLQSHRYISMADIVDLSSLVSSSMVKTIDFRVFLSLWCGVD 177

Query: 341 MDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYALDEDCRTTVWTYQKGGEDGLP 162
            D +C  D     SLF+ LKQCGS+LSG NGNI+ CLYALDEDCRTTVWTY +  +DG+ 
Sbjct: 178 RDLSCF-DVLEMGSLFERLKQCGSVLSGLNGNINSCLYALDEDCRTTVWTYLQRDDDGIL 236

Query: 161 DSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAFGSLFTAPYKGT 3
           DSFQ DEQL+K+KK SY+R+R DVYK +GPGS+AE ATVLAFGSLFTAPYKG+
Sbjct: 237 DSFQADEQLRKRKKISYIRKRRDVYKALGPGSKAETATVLAFGSLFTAPYKGS 289


>ref|XP_012486763.1| PREDICTED: uncharacterized protein LOC105800283 [Gossypium
           raimondii] gi|763770429|gb|KJB37644.1| hypothetical
           protein B456_006G214000 [Gossypium raimondii]
          Length = 516

 Score =  368 bits (944), Expect = 4e-99
 Identities = 178/236 (75%), Positives = 205/236 (86%)
 Frame = -2

Query: 710 QFPQCRSKQTLGEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTLIVPPILDHHAVALGS 531
           Q+P C ++   GEKFLWYAPHSGFSNQLSEFK AILMA+ILNRTLIV PILDHHAVALGS
Sbjct: 70  QYPHCETRFH-GEKFLWYAPHSGFSNQLSEFKKAILMASILNRTLIVSPILDHHAVALGS 128

Query: 530 CPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSASAIRVIDFRVFVSTWC 351
           CPKFRV +PKE+RISVW+HIIEL++S RY+SMADIID+SSL+S+S +R IDFRVFVS+WC
Sbjct: 129 CPKFRVQSPKEIRISVWDHIIELLRSRRYVSMADIIDISSLLSSSLVRAIDFRVFVSSWC 188

Query: 350 GLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYALDEDCRTTVWTYQKGGED 171
            LN+D  C N  N+  SL ++LKQCGSLLSG  GNI+ CLYA+DEDCRTTVWTYQ  G D
Sbjct: 189 DLNVDLVCSNGLNVPPSLVESLKQCGSLLSGIGGNINQCLYAVDEDCRTTVWTYQ-NGVD 247

Query: 170 GLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAFGSLFTAPYKGT 3
           G+ DSFQPDEQL K+KK SY+RRR DVYKT+GPGS+AE ATVLAFGSLFTAPYKG+
Sbjct: 248 GMLDSFQPDEQLMKRKKISYVRRRRDVYKTLGPGSKAESATVLAFGSLFTAPYKGS 303


>ref|XP_011031926.1| PREDICTED: uncharacterized protein LOC105130896 [Populus
           euphratica]
          Length = 509

 Score =  365 bits (936), Expect = 4e-98
 Identities = 181/251 (72%), Positives = 206/251 (82%), Gaps = 2/251 (0%)
 Frame = -2

Query: 749 NSLLPISKNSQPAQFPQCRSKQTL--GEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTL 576
           NSL   +    P    QC   QTL  GEKFLWYAPHSGFSNQLSEFKN ILMA ILNRTL
Sbjct: 55  NSLFSKTIRDNPL-ISQCTRFQTLALGEKFLWYAPHSGFSNQLSEFKNGILMAGILNRTL 113

Query: 575 IVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSAS 396
           IVPP+LDHHAVALGSCPKFRV  PKE+RISVW+H+++LV++ RY+SMADIID+SSLV +S
Sbjct: 114 IVPPVLDHHAVALGSCPKFRVLGPKEIRISVWDHVLDLVKTGRYVSMADIIDISSLVPSS 173

Query: 395 AIRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYALDE 216
            I+ IDFRVF S WC +NMDF C ND N Q SLFD+L  CGS+LSG +GN+D CLYA+DE
Sbjct: 174 -IQAIDFRVFASLWCNVNMDFTCSNDLNSQSSLFDSLNLCGSILSGIDGNVDKCLYAVDE 232

Query: 215 DCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAF 36
           DCRTTVWTY+ G ED + DSFQPDEQLKKKKK SY+RRR DVYK++GPGS A  ATVLAF
Sbjct: 233 DCRTTVWTYKNGDEDRVFDSFQPDEQLKKKKKISYVRRRQDVYKSLGPGSEAGSATVLAF 292

Query: 35  GSLFTAPYKGT 3
           GSLFTAPYKG+
Sbjct: 293 GSLFTAPYKGS 303


>ref|XP_008238434.1| PREDICTED: uncharacterized protein LOC103337062 [Prunus mume]
          Length = 494

 Score =  363 bits (932), Expect = 1e-97
 Identities = 177/249 (71%), Positives = 208/249 (83%)
 Frame = -2

Query: 749 NSLLPISKNSQPAQFPQCRSKQTLGEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTLIV 570
           NSLL  S  +  +  P+C S    GEKFLWYAPHSGFSNQLSEFKNA+LMAAILNRTL+ 
Sbjct: 48  NSLL--SNKTSISDSPKCPSLG--GEKFLWYAPHSGFSNQLSEFKNAVLMAAILNRTLVA 103

Query: 569 PPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSASAI 390
           PP+LDHHAVALGSCPKFRV +  E+RISVW+HI+EL++S RY+SMADI+D+SSLVS+S +
Sbjct: 104 PPVLDHHAVALGSCPKFRVLSANEIRISVWDHIVELIRSGRYVSMADIVDISSLVSSSLV 163

Query: 389 RVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYALDEDC 210
           RVIDFRVF+S WC +N+DFAC N+ +   SL + LKQCGSLLSG NG++  CLYA++EDC
Sbjct: 164 RVIDFRVFISLWCNVNVDFACYNELDKHASLLERLKQCGSLLSGLNGDV-KCLYAVNEDC 222

Query: 209 RTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAFGS 30
           RTTVWTYQ G EDG  DSFQPDEQLKKKKK SY+R+R DVY T+GPGS AE ATVLAFGS
Sbjct: 223 RTTVWTYQSGNEDGALDSFQPDEQLKKKKKISYVRKRRDVYNTLGPGSEAESATVLAFGS 282

Query: 29  LFTAPYKGT 3
           LFT PYKG+
Sbjct: 283 LFTLPYKGS 291


>ref|XP_002304290.2| hypothetical protein POPTR_0003s07710g [Populus trichocarpa]
           gi|550342654|gb|EEE79269.2| hypothetical protein
           POPTR_0003s07710g [Populus trichocarpa]
          Length = 509

 Score =  363 bits (932), Expect = 1e-97
 Identities = 179/251 (71%), Positives = 206/251 (82%), Gaps = 2/251 (0%)
 Frame = -2

Query: 749 NSLLPISKNSQPAQFPQCRSKQTL--GEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTL 576
           NSL   +  + P    QC   QTL  GEKFLWYAPHSGFSNQLSEFKN ILMA ILNRTL
Sbjct: 55  NSLFSKTITNNPL-ISQCTKFQTLALGEKFLWYAPHSGFSNQLSEFKNGILMAGILNRTL 113

Query: 575 IVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSAS 396
           IVPP+LDHHAVALGSCPKFRV  PKE+R+SVW+H+++LV++ RY+SMADIID+SSLV +S
Sbjct: 114 IVPPVLDHHAVALGSCPKFRVLGPKEIRVSVWDHVLDLVKTGRYVSMADIIDISSLVPSS 173

Query: 395 AIRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYALDE 216
            I+ IDFRVF S WC + MDF C ND N Q SLFD+L  CGS+LSG +GN+D CLYA+DE
Sbjct: 174 -IQAIDFRVFASQWCNVKMDFTCSNDLNAQSSLFDSLNLCGSILSGIDGNVDKCLYAVDE 232

Query: 215 DCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAF 36
           DCRTTVWTY+ G ED + DSFQPDEQLKKKKK SY+RRR DVYK++GPGS A  ATVLAF
Sbjct: 233 DCRTTVWTYKNGDEDRVFDSFQPDEQLKKKKKISYVRRRQDVYKSLGPGSEAGSATVLAF 292

Query: 35  GSLFTAPYKGT 3
           GSLFTAPYKG+
Sbjct: 293 GSLFTAPYKGS 303


>ref|XP_002868069.1| hypothetical protein ARALYDRAFT_493135 [Arabidopsis lyrata subsp.
           lyrata] gi|297313905|gb|EFH44328.1| hypothetical protein
           ARALYDRAFT_493135 [Arabidopsis lyrata subsp. lyrata]
          Length = 508

 Score =  363 bits (932), Expect = 1e-97
 Identities = 176/258 (68%), Positives = 207/258 (80%), Gaps = 4/258 (1%)
 Frame = -2

Query: 764 YRHISNSLLPISKNSQPAQFPQCRS----KQTLGEKFLWYAPHSGFSNQLSEFKNAILMA 597
           Y  +  SL  IS  S   QFPQCRS    +  LG+KFLWYAPHSGFSNQLSEFKNA+LMA
Sbjct: 48  YSEMPKSLFSISAFSGSVQFPQCRSEILTRTLLGQKFLWYAPHSGFSNQLSEFKNAVLMA 107

Query: 596 AILNRTLIVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDL 417
            ILNRTLI+PPILDHHAVALGSCPKFRV +P E+RISVWNH IEL+++ RY+SMADI+D+
Sbjct: 108 GILNRTLIIPPILDHHAVALGSCPKFRVLSPSEIRISVWNHSIELLRTDRYVSMADIVDI 167

Query: 416 SSLVSASAIRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDD 237
           SSLVS+SA+RVIDFR F S  CG++++  C +D   Q   ++ LKQCG LLSG  GN+D 
Sbjct: 168 SSLVSSSAVRVIDFRYFASLLCGVDLETLCSDDLAEQSQAYELLKQCGYLLSGVRGNVDK 227

Query: 236 CLYALDEDCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAE 57
           CLYA+DEDCRTTVWTY+ G  DG  DSFQPDE+LKKKKK SY+RRR DVYKT+G G+ AE
Sbjct: 228 CLYAVDEDCRTTVWTYKNGDADGRLDSFQPDEKLKKKKKLSYVRRRRDVYKTLGHGTEAE 287

Query: 56  LATVLAFGSLFTAPYKGT 3
            A +LAFGSLFTAPYKG+
Sbjct: 288 SAAILAFGSLFTAPYKGS 305


>gb|KHG18329.1| tRNA-splicing ligase RtcB [Gossypium arboreum]
          Length = 516

 Score =  362 bits (930), Expect = 2e-97
 Identities = 172/237 (72%), Positives = 205/237 (86%), Gaps = 1/237 (0%)
 Frame = -2

Query: 710 QFPQCRSKQTLGEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTLIVPPILDHHAVALGS 531
           Q P C  + + GEKFLWYAPHSGFSNQLSEFKNA+LMA ILNRTLI+PPIL HHA+ALGS
Sbjct: 65  QSPHCEIRIS-GEKFLWYAPHSGFSNQLSEFKNALLMAGILNRTLIIPPILSHHAIALGS 123

Query: 530 CPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSASAIRVIDFRVFVSTWC 351
           CPKFRV +PKE+R+SVW+H+IEL+ S RY+SMADIID+SS++S+S +R IDFRVFVS+WC
Sbjct: 124 CPKFRVQSPKEIRVSVWDHVIELITSWRYVSMADIIDISSVLSSSHVRAIDFRVFVSSWC 183

Query: 350 GLNMDFACINDSNIQYS-LFDTLKQCGSLLSGYNGNIDDCLYALDEDCRTTVWTYQKGGE 174
           GL++D AC  +SN Q + L D+LKQCGSLLSG +GNID CL+A+D+DCRTTVWTY     
Sbjct: 184 GLDLDLACCKESNTQPTYLVDSLKQCGSLLSGVDGNIDRCLFAVDDDCRTTVWTYGNDEA 243

Query: 173 DGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAFGSLFTAPYKGT 3
           DG  DSFQPDEQLKKKKK SY+RRR DVYKT+GPGS+A+ ATVLAFG+LFTAPYKG+
Sbjct: 244 DGALDSFQPDEQLKKKKKISYVRRRRDVYKTLGPGSKADSATVLAFGTLFTAPYKGS 300


>ref|XP_006477145.1| PREDICTED: uncharacterized protein LOC102619700 [Citrus sinensis]
          Length = 496

 Score =  362 bits (930), Expect = 2e-97
 Identities = 171/256 (66%), Positives = 208/256 (81%), Gaps = 2/256 (0%)
 Frame = -2

Query: 764 YRHISNSLLPISKNSQPAQFPQCRSKQTLG--EKFLWYAPHSGFSNQLSEFKNAILMAAI 591
           Y HI  SLL +S  +   +F QC + + +   +KF  YAPHSGFSNQL EFKNAILMA I
Sbjct: 43  YNHIPESLLSLSSKTLDPKFSQCHTTKAISPDKKFFLYAPHSGFSNQLGEFKNAILMAGI 102

Query: 590 LNRTLIVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSS 411
           LNRTLIVPP+LDHHAVALGSCPKFRV +P ++RISVW+H IEL++S RY+SMADIID+SS
Sbjct: 103 LNRTLIVPPVLDHHAVALGSCPKFRVQSPNQMRISVWDHAIELLRSGRYVSMADIIDISS 162

Query: 410 LVSASAIRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCL 231
           LVS+S ++V+DFR F S WCGL++D AC+   N Q SL D L+QC S+LSG NGN+D C 
Sbjct: 163 LVSSSMVKVLDFRRFASLWCGLDVDLACLISLNTQPSLLDRLRQCVSMLSGLNGNVDGCF 222

Query: 230 YALDEDCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELA 51
           +A+D+DCRTTVWTYQ G EDG+ D FQPDEQLKKKKK SY+RRR DVYK +GPGS+A+ A
Sbjct: 223 FAVDDDCRTTVWTYQSGDEDGVLDPFQPDEQLKKKKKVSYVRRRRDVYKALGPGSKADSA 282

Query: 50  TVLAFGSLFTAPYKGT 3
           T+LAFG+LFTAPYKG+
Sbjct: 283 TILAFGTLFTAPYKGS 298


>ref|XP_012472055.1| PREDICTED: uncharacterized protein LOC105789281 [Gossypium
           raimondii] gi|763753583|gb|KJB20971.1| hypothetical
           protein B456_003G175200 [Gossypium raimondii]
          Length = 516

 Score =  361 bits (927), Expect = 4e-97
 Identities = 176/258 (68%), Positives = 212/258 (82%), Gaps = 4/258 (1%)
 Frame = -2

Query: 764 YRHISNSLLPISKNSQPA---QFPQCRSKQTLGEKFLWYAPHSGFSNQLSEFKNAILMAA 594
           +  I  SL   S +   A   QFP C  + + GEKFLWYAPHSGFSNQLSEFKNA+LMA 
Sbjct: 44  FTSIPKSLFSSSSSKTTALSLQFPHCEIRIS-GEKFLWYAPHSGFSNQLSEFKNALLMAG 102

Query: 593 ILNRTLIVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLS 414
           ILNRTLI+PPIL HHA+ALGSCPKFRV +PKE+R+SVW+H+IEL+ S RY+SMADIID+S
Sbjct: 103 ILNRTLIIPPILSHHAIALGSCPKFRVQSPKEIRVSVWDHVIELITSGRYVSMADIIDIS 162

Query: 413 SLVSASAIRVIDFRVFVSTWCGLNMDFACINDSNIQYS-LFDTLKQCGSLLSGYNGNIDD 237
           S++S+S +R IDFRVFVS+WCGL++D AC  + N Q + L D+LKQCGSLLSG +GNID 
Sbjct: 163 SVLSSSHVRAIDFRVFVSSWCGLDLDLACSKEPNTQPTYLVDSLKQCGSLLSGVDGNIDR 222

Query: 236 CLYALDEDCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAE 57
           CL+A+D+DCRTTVWTY     DG  DSFQP+EQLKKKKK SY+RRR DVYKT+GPGS+A+
Sbjct: 223 CLFAVDDDCRTTVWTYGNYEADGALDSFQPNEQLKKKKKISYVRRRRDVYKTLGPGSKAD 282

Query: 56  LATVLAFGSLFTAPYKGT 3
            ATVLAFG+LFTAPYKG+
Sbjct: 283 SATVLAFGTLFTAPYKGS 300


>ref|NP_193473.1| O-fucosyltransferase family protein [Arabidopsis thaliana]
           gi|95147298|gb|ABF57284.1| At4g17430 [Arabidopsis
           thaliana] gi|332658490|gb|AEE83890.1|
           O-fucosyltransferase family protein [Arabidopsis
           thaliana]
          Length = 507

 Score =  360 bits (924), Expect = 9e-97
 Identities = 175/258 (67%), Positives = 207/258 (80%), Gaps = 4/258 (1%)
 Frame = -2

Query: 764 YRHISNSLLPISKNSQPAQFPQCRS----KQTLGEKFLWYAPHSGFSNQLSEFKNAILMA 597
           Y  +  SL  IS  S   QFPQCRS    +  LG+KFLWYAPHSGFSNQLSEFKNA+LMA
Sbjct: 47  YSEMPKSLFSISAFSGSVQFPQCRSEILTRTLLGQKFLWYAPHSGFSNQLSEFKNALLMA 106

Query: 596 AILNRTLIVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDL 417
            ILNRTLI+PPILDHHAVALGSCPKFRV +P E+RISVWNH IEL+++ RY+SMADI+D+
Sbjct: 107 GILNRTLIIPPILDHHAVALGSCPKFRVLSPSEIRISVWNHSIELLKTDRYVSMADIVDI 166

Query: 416 SSLVSASAIRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDD 237
           SSLVS+SA+RVIDFR F S  CG++++  C +D   Q   +++LKQCG LLSG  GN+D 
Sbjct: 167 SSLVSSSAVRVIDFRYFASLQCGVDLETLCTDDLAEQSQAYESLKQCGYLLSGVRGNVDK 226

Query: 236 CLYALDEDCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAE 57
           CLYA+DEDCRTTVWTY+ G  DG  DSFQPDE+LKKKKK S +RRR DVYKT+G G+ AE
Sbjct: 227 CLYAVDEDCRTTVWTYKNGEADGRLDSFQPDEKLKKKKKLSNVRRRRDVYKTLGHGTEAE 286

Query: 56  LATVLAFGSLFTAPYKGT 3
            A +LAFGSLFTAPYKG+
Sbjct: 287 SAAILAFGSLFTAPYKGS 304


>ref|XP_004245784.1| PREDICTED: uncharacterized protein LOC101261944 [Solanum
           lycopersicum]
          Length = 495

 Score =  360 bits (924), Expect = 9e-97
 Identities = 176/253 (69%), Positives = 203/253 (80%), Gaps = 2/253 (0%)
 Frame = -2

Query: 755 ISNSLLPISKNSQPA--QFPQCRSKQTLGEKFLWYAPHSGFSNQLSEFKNAILMAAILNR 582
           IS SLLP+SK S P   +  QC  +  L EKF+WYAPHSGFSNQL+EFKNAILMA ILNR
Sbjct: 41  ISPSLLPLSKKSIPIIPRPQQCNPENRLQEKFMWYAPHSGFSNQLAEFKNAILMAKILNR 100

Query: 581 TLIVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSLVS 402
           TLIVPP+LDHHAVALGSCPKFRV  P ELR  VWNH I+L++  RY+SMADI+DLS L S
Sbjct: 101 TLIVPPVLDHHAVALGSCPKFRVLEPNELRYLVWNHSIQLLRDCRYVSMADIVDLSPLAS 160

Query: 401 ASAIRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYAL 222
            S +R IDFR FVS+WCG+N+D  C  + NI  SLF++L+QCGSLLSGY G+   CL AL
Sbjct: 161 YSTVRFIDFRAFVSSWCGVNLDVICSKNQNIPSSLFESLRQCGSLLSGYYGSFSGCLSAL 220

Query: 221 DEDCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVL 42
            EDCRTTVWTY+K  EDG  DSFQPD+QL+KKKK S++RRR DVYK +GPGS AE ATVL
Sbjct: 221 KEDCRTTVWTYKKDDEDGALDSFQPDDQLRKKKKISFIRRRKDVYKALGPGSAAESATVL 280

Query: 41  AFGSLFTAPYKGT 3
           AFGSLFTAPYKG+
Sbjct: 281 AFGSLFTAPYKGS 293


>ref|XP_002521236.1| conserved hypothetical protein [Ricinus communis]
           gi|223539504|gb|EEF41092.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 506

 Score =  360 bits (923), Expect = 1e-96
 Identities = 177/247 (71%), Positives = 203/247 (82%), Gaps = 2/247 (0%)
 Frame = -2

Query: 737 PISKNSQPAQFPQCRSKQTL--GEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTLIVPP 564
           PI +N+  +Q  QC   Q+L  GEKFLWYAPHSGFSNQLSEFKNAILMA ILNRTLIVPP
Sbjct: 54  PILQNTLDSQISQCSRFQSLTGGEKFLWYAPHSGFSNQLSEFKNAILMAGILNRTLIVPP 113

Query: 563 ILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSASAIRV 384
           ILDHHAVALGSCPK RV  PK++RISVWNH IELV++ RY+SM DIID+SSLV +S IR 
Sbjct: 114 ILDHHAVALGSCPKLRVLGPKDIRISVWNHAIELVKTGRYVSMVDIIDISSLVPSS-IRA 172

Query: 383 IDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYALDEDCRT 204
           IDFRVF S WCG+N DF C N+ N + SLFD+L QCGS+LSG+ GNI  CLYA+ EDCRT
Sbjct: 173 IDFRVFASLWCGVNKDFICTNNLNAESSLFDSLGQCGSVLSGFTGNIGKCLYAVVEDCRT 232

Query: 203 TVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAFGSLF 24
           TVWTY+ G +DG+ DSFQPDEQLKKKK  SY+RR  DVYK +G GS +E A+VLAFGSLF
Sbjct: 233 TVWTYKNGEKDGVLDSFQPDEQLKKKKNISYIRRHQDVYKVLGTGSESESASVLAFGSLF 292

Query: 23  TAPYKGT 3
           TAPYKG+
Sbjct: 293 TAPYKGS 299


>gb|KDO61362.1| hypothetical protein CISIN_1g0110012mg [Citrus sinensis]
          Length = 496

 Score =  359 bits (922), Expect = 2e-96
 Identities = 170/256 (66%), Positives = 207/256 (80%), Gaps = 2/256 (0%)
 Frame = -2

Query: 764 YRHISNSLLPISKNSQPAQFPQCRSKQTLG--EKFLWYAPHSGFSNQLSEFKNAILMAAI 591
           Y HI  SLL +S  +   +F QC + + +   +KF  YAPHSGFSNQL EFKNAILMA I
Sbjct: 43  YNHIPESLLSLSSKTLDPKFSQCHTTKAISPDKKFFLYAPHSGFSNQLGEFKNAILMAGI 102

Query: 590 LNRTLIVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSS 411
           LNRTLIVPP+LDHHAVALGSCPKFRV +P ++RISVW+H IEL++S RY+SMADIID+SS
Sbjct: 103 LNRTLIVPPVLDHHAVALGSCPKFRVQSPNQMRISVWHHAIELLRSGRYVSMADIIDISS 162

Query: 410 LVSASAIRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCL 231
           LVS+S ++V+DFR F S WCGL++D AC+   N Q SL D L+QC S+LSG NGN+D C 
Sbjct: 163 LVSSSMVKVLDFRRFASLWCGLDVDLACLISLNTQPSLLDRLRQCVSMLSGLNGNVDGCF 222

Query: 230 YALDEDCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELA 51
           +A+D+DCRTTVWTYQ G EDG+ D FQPDEQLKKKKK SY+RRR DVYK +G GS+A+ A
Sbjct: 223 FAVDDDCRTTVWTYQSGDEDGVLDPFQPDEQLKKKKKVSYVRRRRDVYKALGSGSKADSA 282

Query: 50  TVLAFGSLFTAPYKGT 3
           T+LAFG+LFTAPYKG+
Sbjct: 283 TILAFGTLFTAPYKGS 298


>ref|XP_006440263.1| hypothetical protein CICLE_v10019844mg [Citrus clementina]
           gi|557542525|gb|ESR53503.1| hypothetical protein
           CICLE_v10019844mg [Citrus clementina]
          Length = 496

 Score =  359 bits (922), Expect = 2e-96
 Identities = 170/256 (66%), Positives = 207/256 (80%), Gaps = 2/256 (0%)
 Frame = -2

Query: 764 YRHISNSLLPISKNSQPAQFPQCRSKQTLG--EKFLWYAPHSGFSNQLSEFKNAILMAAI 591
           Y HI  SLL +S  +   +F QC + + +   +KF  YAPHSGFSNQL EFKNAILMA I
Sbjct: 43  YNHIPESLLSLSSKTLDPKFSQCHTTKAISPDKKFFLYAPHSGFSNQLGEFKNAILMAGI 102

Query: 590 LNRTLIVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSS 411
           LNRTLIVPP+LDHHAVALGSCPKFRV +P ++RISVW+H IEL++S RY+SMADIID+SS
Sbjct: 103 LNRTLIVPPVLDHHAVALGSCPKFRVQSPNQMRISVWHHAIELLRSGRYVSMADIIDISS 162

Query: 410 LVSASAIRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCL 231
           LVS+S ++V+DFR F S WCGL++D AC+   N Q SL D L+QC S+LSG NGN+D C 
Sbjct: 163 LVSSSMVKVLDFRRFASLWCGLDVDLACLISLNTQPSLLDRLRQCVSMLSGLNGNVDGCF 222

Query: 230 YALDEDCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELA 51
           +A+D+DCRTTVWTYQ G EDG+ D FQPDEQLKKKKK SY+RRR DVYK +G GS+A+ A
Sbjct: 223 FAVDDDCRTTVWTYQSGDEDGVLDPFQPDEQLKKKKKVSYVRRRRDVYKALGSGSKADSA 282

Query: 50  TVLAFGSLFTAPYKGT 3
           T+LAFG+LFTAPYKG+
Sbjct: 283 TILAFGTLFTAPYKGS 298


>ref|XP_010104870.1| hypothetical protein L484_024071 [Morus notabilis]
           gi|587914327|gb|EXC02106.1| hypothetical protein
           L484_024071 [Morus notabilis]
          Length = 512

 Score =  358 bits (920), Expect = 3e-96
 Identities = 173/255 (67%), Positives = 208/255 (81%), Gaps = 1/255 (0%)
 Frame = -2

Query: 764 YRHISNSLLPISKNSQPAQFP-QCRSKQTLGEKFLWYAPHSGFSNQLSEFKNAILMAAIL 588
           Y+ +S + L +S +  P     QCR      EKFLWYAPHSGFSNQLSEFKNA+LMAAIL
Sbjct: 49  YQSLSPTSLSLSSSKTPHSLQNQCRIPSPR-EKFLWYAPHSGFSNQLSEFKNALLMAAIL 107

Query: 587 NRTLIVPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSL 408
           NRTLIVPPILDHHAVALGSCPKFRVS P E+R SVW+H +EL++S RY+SMADI+D+SSL
Sbjct: 108 NRTLIVPPILDHHAVALGSCPKFRVSAPAEIRASVWDHAVELIRSGRYVSMADIVDISSL 167

Query: 407 VSASAIRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLY 228
           VS+S IR IDFRVF S WC LN++  C+N+S+ Q SL D+LKQCGSLL+G +G++  CLY
Sbjct: 168 VSSSFIRAIDFRVFASQWCNLNLEGICVNESDKQSSLLDSLKQCGSLLAGLDGSVSKCLY 227

Query: 227 ALDEDCRTTVWTYQKGGEDGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELAT 48
           A++EDCRTTVWTY+   EDG  DSFQPDEQLKKKKK SY+RRR DVYK +GP S A+ AT
Sbjct: 228 AVNEDCRTTVWTYKNDNEDGTLDSFQPDEQLKKKKKISYVRRRRDVYKNLGPDSEADSAT 287

Query: 47  VLAFGSLFTAPYKGT 3
           +LAFGS+FT+PYKG+
Sbjct: 288 LLAFGSIFTSPYKGS 302


>gb|KHG18421.1| GDP-fucose O-fucosyltransferase 1 [Gossypium arboreum]
          Length = 515

 Score =  358 bits (918), Expect = 4e-96
 Identities = 175/236 (74%), Positives = 202/236 (85%)
 Frame = -2

Query: 710 QFPQCRSKQTLGEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTLIVPPILDHHAVALGS 531
           Q+P C ++   GEKFLWYAPHSGFSNQLSEFKNAILMA+ILNRTLIVPPILDHHAVALGS
Sbjct: 69  QYPHCETRFH-GEKFLWYAPHSGFSNQLSEFKNAILMASILNRTLIVPPILDHHAVALGS 127

Query: 530 CPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSASAIRVIDFRVFVSTWC 351
           CPKFRV +PKE+RISV +HIIEL++S RY+SMADIID+SSL+S S +R IDFRVFV  WC
Sbjct: 128 CPKFRVQSPKEIRISVLDHIIELLRSRRYVSMADIIDISSLLSFSLVRAIDFRVFVLLWC 187

Query: 350 GLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYALDEDCRTTVWTYQKGGED 171
            LN+D  C N  N   SL ++LK CGSLLSG +GNI+ CLYA+D+DCRTTVWTYQ  G D
Sbjct: 188 DLNVDLVCSNGLNAPPSLVESLKLCGSLLSGIDGNINQCLYAVDDDCRTTVWTYQ-NGMD 246

Query: 170 GLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAFGSLFTAPYKGT 3
           G+ DSFQPDEQL K+KK SY+RRR DVY+T+GPGS+AE ATVLAFGSLFTAPYKG+
Sbjct: 247 GMLDSFQPDEQLMKRKKISYVRRRRDVYRTLGPGSKAESATVLAFGSLFTAPYKGS 302


>ref|XP_007151341.1| hypothetical protein PHAVU_004G038100g [Phaseolus vulgaris]
           gi|561024650|gb|ESW23335.1| hypothetical protein
           PHAVU_004G038100g [Phaseolus vulgaris]
          Length = 500

 Score =  355 bits (911), Expect = 3e-95
 Identities = 171/251 (68%), Positives = 209/251 (83%), Gaps = 2/251 (0%)
 Frame = -2

Query: 749 NSLLPISKNSQPAQFPQCRSKQ-TLGEKFLWYAPHSGFSNQLSEFKNAILMAAILNRTLI 573
           NS      N   +  PQC S+  TLGEKF+WYAPHSGFSNQLSEFKNA+LMA ILNRTL+
Sbjct: 47  NSQFSRLANISTSHTPQCSSQALTLGEKFMWYAPHSGFSNQLSEFKNAVLMAGILNRTLV 106

Query: 572 VPPILDHHAVALGSCPKFRVSTPKELRISVWNHIIELVQSHRYISMADIIDLSSLVSASA 393
           VPPILDHHAVALGSCPKFRV  PK++RISVW+H+IELVQS RYIS+A+IID+SSLVS+S 
Sbjct: 107 VPPILDHHAVALGSCPKFRVLDPKDIRISVWDHVIELVQSRRYISIAEIIDISSLVSSSL 166

Query: 392 IRVIDFRVFVSTWCGLNMDFACINDSNIQYSLFDTLKQCGSLLSGYNGNIDDCLYALDED 213
           +RVIDFR FVS WCG+++D ACI D+ +  S+  +LKQCGSLL+G +G+I+ C+YA++ED
Sbjct: 167 VRVIDFRDFVSIWCGISLDLACITDTKLHSSVSKSLKQCGSLLAGLHGSIEKCIYAVNED 226

Query: 212 CRTTVWTYQKGGE-DGLPDSFQPDEQLKKKKKFSYLRRRLDVYKTIGPGSRAELATVLAF 36
           CRTTVWTY + G  DG+ DSFQPDEQLK KKK SY+RRR DV+KT+GPGS A  A++LAF
Sbjct: 227 CRTTVWTYHEDGHGDGMLDSFQPDEQLKHKKKISYVRRRKDVFKTLGPGSEAGSASLLAF 286

Query: 35  GSLFTAPYKGT 3
           G+LF+A YKG+
Sbjct: 287 GTLFSATYKGS 297

