BLASTX nr result

ID: Catharanthus23_contig00015564 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00015564
         (1466 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006342369.1| PREDICTED: uncharacterized protein At1g04910...   504   e-140
gb|EMJ05803.1| hypothetical protein PRUPE_ppa002708mg [Prunus pe...   502   e-139
ref|XP_004243713.1| PREDICTED: uncharacterized protein At1g04910...   499   e-138
ref|XP_002279041.1| PREDICTED: DUF246 domain-containing protein ...   497   e-138
ref|XP_006342370.1| PREDICTED: uncharacterized protein At1g04910...   495   e-137
gb|EXB38940.1| hypothetical protein L484_027375 [Morus notabilis]     481   e-133
ref|XP_004288979.1| PREDICTED: uncharacterized protein At1g04910...   461   e-127
ref|XP_002870435.1| hypothetical protein ARALYDRAFT_493618 [Arab...   454   e-125
ref|XP_003550617.1| PREDICTED: uncharacterized protein At1g04910...   450   e-124
ref|NP_568528.2| O-fucosyltransferase family protein [Arabidopsi...   449   e-123
ref|XP_006395968.1| hypothetical protein EUTSA_v10003786mg [Eutr...   448   e-123
ref|XP_006395967.1| hypothetical protein EUTSA_v10003786mg [Eutr...   448   e-123
ref|XP_003542359.1| PREDICTED: uncharacterized protein At1g04910...   444   e-122
gb|EOY30276.1| O-fucosyltransferase family protein isoform 2 [Th...   444   e-122
gb|EOY30275.1| O-fucosyltransferase family protein isoform 1 [Th...   444   e-122
ref|XP_006283281.1| hypothetical protein CARUB_v10004322mg [Caps...   440   e-121
gb|ESW26581.1| hypothetical protein PHAVU_003G131300g [Phaseolus...   437   e-120
ref|XP_002326282.1| predicted protein [Populus trichocarpa]           437   e-120
ref|XP_006381630.1| hypothetical protein POPTR_0006s14490g [Popu...   435   e-119
ref|XP_004508243.1| PREDICTED: uncharacterized protein At1g04910...   435   e-119

>ref|XP_006342369.1| PREDICTED: uncharacterized protein At1g04910-like isoform X1 [Solanum
            tuberosum]
          Length = 648

 Score =  504 bits (1298), Expect = e-140
 Identities = 261/419 (62%), Positives = 304/419 (72%), Gaps = 1/419 (0%)
 Frame = -2

Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076
            TATDGV  QRV+SPRFSGPMTRRAHSFKR                               
Sbjct: 15   TATDGVP-QRVNSPRFSGPMTRRAHSFKRTNNTNQNAQNTGSSSSSTASLNTHH------ 67

Query: 1075 EINV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVHLKKNIGSFNVDXXXXXXX 899
            EI+V LNSPRSE N+N    D ++   EKK +HLS + QRVHL+K + S  VD       
Sbjct: 68   EIDVPLNSPRSETNANIA--DEYEILGEKKHTHLSNVIQRVHLRKKLESLTVDFGFGLEL 125

Query: 898  XXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESS 719
                  GHWMF VFCG C F+GVLKFC  GWFGSAIE+  Y Q+ Y+S I  LS  ++S+
Sbjct: 126  KGRKKLGHWMFLVFCGFCLFIGVLKFCAYGWFGSAIERVAYSQDSYDSLISQLSLRDQST 185

Query: 718  RDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQFK 539
                H   +       + +E+TL MVASGVVG+QN++ D S IW KP+S N+TQCI++ K
Sbjct: 186  HAYRHMEGDTKHSGERNHLEQTLSMVASGVVGNQNSMLDFSEIWLKPNSENFTQCIERTK 245

Query: 538  RHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESGFK 359
              K ++  TNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLP LDHTSYWADESGFK
Sbjct: 246  SQKLVDAKTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPLLDHTSYWADESGFK 305

Query: 358  DLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKVIY 179
            DLF+WQHF+ETLKDDIHIV+ LPPE+AG EPFNKTPISWSKVSYYK+EVLPLLKQHKV+Y
Sbjct: 306  DLFNWQHFIETLKDDIHIVETLPPEFAGTEPFNKTPISWSKVSYYKSEVLPLLKQHKVMY 365

Query: 178  FTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHLR 2
             THTDSR+ANNG+P SIQKLRC+VNY ALKYSA IE LG+ LVSRM+ +GNPYLALHLR
Sbjct: 366  ITHTDSRIANNGIPNSIQKLRCRVNYQALKYSAPIETLGRILVSRMRQDGNPYLALHLR 424


>gb|EMJ05803.1| hypothetical protein PRUPE_ppa002708mg [Prunus persica]
          Length = 642

 Score =  502 bits (1292), Expect = e-139
 Identities = 267/422 (63%), Positives = 311/422 (73%), Gaps = 6/422 (1%)
 Frame = -2

Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070
            +DGVS QRV+SPRFSGPMTRRAHSFKR                              +EI
Sbjct: 11   SDGVS-QRVNSPRFSGPMTRRAHSFKRNPNTSANNGSSHGNSNSNNSSGSVGFGSGEYEI 69

Query: 1069 NV-LNSPRSEVNSNSVSDDGFDSFVEKKQSH--LSKLTQRVHLKKNIGSFNVDXXXXXXX 899
            ++ LNSPRSE+  NSV  DGFDS +E+KQ+H    ++  R  L+K IGS  VD       
Sbjct: 70   DLPLNSPRSEIGGNSVPGDGFDSVLERKQTHHVSQRVAVRGFLRKPIGSVVVDLGLREKK 129

Query: 898  XXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESS 719
                   HWMF+ FCG C FLG+LK C  GWFGSAIE    +Q+    PI  +++M++SS
Sbjct: 130  QLG----HWMFFAFCGVCLFLGILKICATGWFGSAIESSRSNQD-GSDPITLMNRMDQSS 184

Query: 718  RDDPHRASENDDGDRGSDVERTLMMVASGV---VGSQNAVADHSGIWSKPDSGNYTQCID 548
             D  HR       D GSDVERTLMM ASGV   VG +N+V +++GIWS+P+S N++QCI+
Sbjct: 185  HDYGHR-------DGGSDVERTLMM-ASGVNRVVGEENSV-EYTGIWSRPNSENFSQCIE 235

Query: 547  QFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADES 368
              K HKKL+  TNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWAD+S
Sbjct: 236  LPKIHKKLDAKTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADDS 295

Query: 367  GFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHK 188
            GFKDLFDWQHF+ETLKDDIHIV+ LPP YAGIEPFNKTPISWSK SYYK+EVL LLKQHK
Sbjct: 296  GFKDLFDWQHFIETLKDDIHIVETLPPAYAGIEPFNKTPISWSKASYYKSEVLSLLKQHK 355

Query: 187  VIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALH 8
            VIYFTHTDSR++NNG+P SIQ+LRC+VNY ALKYSA IEELGKTLVSRM+ NG PYLALH
Sbjct: 356  VIYFTHTDSRISNNGIPSSIQRLRCRVNYRALKYSAPIEELGKTLVSRMRQNGGPYLALH 415

Query: 7    LR 2
            LR
Sbjct: 416  LR 417


>ref|XP_004243713.1| PREDICTED: uncharacterized protein At1g04910-like [Solanum
            lycopersicum]
          Length = 646

 Score =  499 bits (1285), Expect = e-138
 Identities = 260/419 (62%), Positives = 303/419 (72%), Gaps = 1/419 (0%)
 Frame = -2

Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076
            TATDGV  QRV+SPRFSGPMTRRAHSFKR                               
Sbjct: 15   TATDGVP-QRVNSPRFSGPMTRRAHSFKRTNNTNQNAQNTGGGSSNSTATLNTHH----- 68

Query: 1075 EINV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVHLKKNIGSFNVDXXXXXXX 899
            EI+V LNSPRSE N+N    D ++   EKK +HLS + QRVHL+K + S  VD       
Sbjct: 69   EIDVPLNSPRSETNANIA--DEYEILGEKKHTHLSNVIQRVHLRKKLESLTVDFGFGLEL 126

Query: 898  XXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESS 719
                  GHWMF VFCG C F+GVLKFC  GWFGSAIE+  Y Q+ Y+S +   S  ++S+
Sbjct: 127  KGRKKLGHWMFLVFCGFCLFMGVLKFCAYGWFGSAIERVAYSQDSYDSLV---SLRDQST 183

Query: 718  RDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQFK 539
                H   +       + +E+TL MVASGVVG+QN + D+S IW  P+S N+TQCI++ K
Sbjct: 184  HTYRHMDGDTKHSGERNHLEQTLSMVASGVVGNQNNMLDYSEIWLHPNSENFTQCIERTK 243

Query: 538  RHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESGFK 359
              K ++  TNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWADESGFK
Sbjct: 244  SQKLVDAKTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADESGFK 303

Query: 358  DLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKVIY 179
            DLFDWQHF+ETLKDDIHIV+ LPPE+AG EPFNKTPISWSKVSYYK+EVLPLLKQHKV+Y
Sbjct: 304  DLFDWQHFIETLKDDIHIVETLPPEFAGTEPFNKTPISWSKVSYYKSEVLPLLKQHKVMY 363

Query: 178  FTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHLR 2
             THTDSR+ANNG+P SIQKLRC+VNY ALKYSA IE LG+ LVSRM+ +GNPYLALHLR
Sbjct: 364  ITHTDSRIANNGIPNSIQKLRCRVNYQALKYSAPIETLGRILVSRMRQDGNPYLALHLR 422


>ref|XP_002279041.1| PREDICTED: DUF246 domain-containing protein At1g04910 [Vitis
            vinifera] gi|297738571|emb|CBI27816.3| unnamed protein
            product [Vitis vinifera]
          Length = 634

 Score =  497 bits (1279), Expect = e-138
 Identities = 260/425 (61%), Positives = 312/425 (73%), Gaps = 8/425 (1%)
 Frame = -2

Query: 1252 ATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHE 1073
            A+DGVS QRV+SPRFSGPMTRRAHSFKR                               E
Sbjct: 7    ASDGVS-QRVNSPRFSGPMTRRAHSFKRGNSSGNAHNNGSSKGGGGFDPHY--------E 57

Query: 1072 INV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVH-------LKKNIGSFNVDX 917
            I+V LNSPRSE+  + VS DGFD  +E+KQ+H   + QRVH        KK++GS  +D 
Sbjct: 58   IDVHLNSPRSEICGSPVSGDGFDVVLERKQTH--HVNQRVHGGVLKNQPKKHVGSAVLDL 115

Query: 916  XXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLS 737
                         HWMF+VFCG C FLGVLK C  GWFGSAI++    Q+  +    +L+
Sbjct: 116  GLRERKKLG----HWMFFVFCGVCLFLGVLKICATGWFGSAIDRIGSHQDFSDPLNTHLN 171

Query: 736  KMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQ 557
            +M++SS D  +R       + GSDVERTLMMVASGVV  Q ++A++S IWSKP+S N+TQ
Sbjct: 172  EMDKSSHDYVYR-------EGGSDVERTLMMVASGVVNRQKSMAENSDIWSKPNSENFTQ 224

Query: 556  CIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWA 377
            C++Q + HKKL+  TNG++++NANGGLNQMRFGICDMVA+AK MKATLVLPSLDHTSYWA
Sbjct: 225  CVNQPRIHKKLDAKTNGYIIINANGGLNQMRFGICDMVAIAKVMKATLVLPSLDHTSYWA 284

Query: 376  DESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLK 197
            D+S FKDLFDWQHF++ LKDD+HIV+ LPP+YAGIEPF KTPISWSKVSYYKTE+LPLLK
Sbjct: 285  DDSDFKDLFDWQHFIKALKDDVHIVETLPPDYAGIEPFTKTPISWSKVSYYKTEILPLLK 344

Query: 196  QHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYL 17
            Q+KVIYFTHTDSRLANNG+P SIQKLRC+VNY ALKYS+ IEELG TLVSRM+  GNPY+
Sbjct: 345  QYKVIYFTHTDSRLANNGIPSSIQKLRCRVNYKALKYSSLIEELGNTLVSRMREGGNPYI 404

Query: 16   ALHLR 2
            ALHLR
Sbjct: 405  ALHLR 409


>ref|XP_006342370.1| PREDICTED: uncharacterized protein At1g04910-like isoform X2 [Solanum
            tuberosum]
          Length = 643

 Score =  495 bits (1275), Expect = e-137
 Identities = 260/419 (62%), Positives = 301/419 (71%), Gaps = 1/419 (0%)
 Frame = -2

Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076
            TATDGV  QRV+SPRFSGPMTRRAHSFKR                               
Sbjct: 15   TATDGVP-QRVNSPRFSGPMTRRAHSFKRTNNTNQNAQNTGSSSSSTASLNTHH------ 67

Query: 1075 EINV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVHLKKNIGSFNVDXXXXXXX 899
            EI+V LNSPRSE N+N    D ++   EKK +HLS + QRVHL+K + S  VD       
Sbjct: 68   EIDVPLNSPRSETNANIA--DEYEILGEKKHTHLSNVIQRVHLRKKLESLTVDFGFGLEL 125

Query: 898  XXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESS 719
                  GHWMF VFCG C F+GVLKFC  GWFGSAIE+  YD     S I  LS  ++S+
Sbjct: 126  KGRKKLGHWMFLVFCGFCLFIGVLKFCAYGWFGSAIERDSYD-----SLISQLSLRDQST 180

Query: 718  RDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQFK 539
                H   +       + +E+TL MVASGVVG+QN++ D S IW KP+S N+TQCI++ K
Sbjct: 181  HAYRHMEGDTKHSGERNHLEQTLSMVASGVVGNQNSMLDFSEIWLKPNSENFTQCIERTK 240

Query: 538  RHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESGFK 359
              K ++  TNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLP LDHTSYWADESGFK
Sbjct: 241  SQKLVDAKTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPLLDHTSYWADESGFK 300

Query: 358  DLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKVIY 179
            DLF+WQHF+ETLKDDIHIV+ LPPE+AG EPFNKTPISWSKVSYYK+EVLPLLKQHKV+Y
Sbjct: 301  DLFNWQHFIETLKDDIHIVETLPPEFAGTEPFNKTPISWSKVSYYKSEVLPLLKQHKVMY 360

Query: 178  FTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHLR 2
             THTDSR+ANNG+P SIQKLRC+VNY ALKYSA IE LG+ LVSRM+ +GNPYLALHLR
Sbjct: 361  ITHTDSRIANNGIPNSIQKLRCRVNYQALKYSAPIETLGRILVSRMRQDGNPYLALHLR 419


>gb|EXB38940.1| hypothetical protein L484_027375 [Morus notabilis]
          Length = 641

 Score =  481 bits (1239), Expect = e-133
 Identities = 259/418 (61%), Positives = 299/418 (71%), Gaps = 2/418 (0%)
 Frame = -2

Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070
            +DGVS QRV+SPRFSGPMTRRAHSFKR                              HEI
Sbjct: 16   SDGVS-QRVNSPRFSGPMTRRAHSFKRNANSSSQSGTNTGNNGGGGGGNNGSGLSPHHEI 74

Query: 1069 NV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVHLKKNIGSFNVDXXXXXXXXX 893
             + LNSPRSE+  N  S DGFDS +E++         R  L+K IGS  VD         
Sbjct: 75   ELQLNSPRSEIGGNLSSVDGFDSVLERRH--------RFALRKKIGSVVVDLGLREKKKL 126

Query: 892  XXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESSRD 713
                 HWMF VFCG C FLGVLK C  GWFGSAIE+   D++  + P+  L  M++SS+D
Sbjct: 127  G----HWMFLVFCGLCLFLGVLKICATGWFGSAIERASSDRDSTD-PMSGLLVMDQSSKD 181

Query: 712  DPHRASENDDGDRGSDVERTLMMVASGV-VGSQNAVADHSGIWSKPDSGNYTQCIDQFKR 536
              +R        +G+DVERTLMMV++GV V +Q +  ++SGIWS+P+S N+TQCIDQ   
Sbjct: 182  YVYREK------KGTDVERTLMMVSTGVRVDNQKSKDEYSGIWSRPNSENFTQCIDQPNN 235

Query: 535  HKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESGFKD 356
             KKL+  TNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWADESGFKD
Sbjct: 236  KKKLDLKTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADESGFKD 295

Query: 355  LFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKVIYF 176
            LFDW+HF+ETLKDD+HIV+ LPP YA IEP  KTPISWSK  YYKTEVLP LKQHKV+YF
Sbjct: 296  LFDWRHFIETLKDDVHIVETLPPAYADIEPLMKTPISWSKAGYYKTEVLPPLKQHKVVYF 355

Query: 175  THTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHLR 2
            THTDSRLANNG+P SIQKLRC+VNY ALKYSAQIEEL  TLVSRM+ +GNPYLALHLR
Sbjct: 356  THTDSRLANNGIPNSIQKLRCRVNYRALKYSAQIEELATTLVSRMRCDGNPYLALHLR 413


>ref|XP_004288979.1| PREDICTED: uncharacterized protein At1g04910-like [Fragaria vesca
            subsp. vesca]
          Length = 634

 Score =  461 bits (1187), Expect = e-127
 Identities = 249/422 (59%), Positives = 293/422 (69%), Gaps = 4/422 (0%)
 Frame = -2

Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076
            T+ DG  SQRV+SPRFSG MTRRAHSFKR                               
Sbjct: 11   TSADGGVSQRVNSPRFSGAMTRRAHSFKRNPFSSSSSAAAAANNDDGGIAGGGFSTQYEV 70

Query: 1075 EINVLNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRV----HLKKNIGSFNVDXXXX 908
            ++  +NSPRSE+     + +GF +     QS    +TQR      L+K I +  V+    
Sbjct: 71   DLQ-MNSPRSEIGG---AGEGFVT-----QSGGGHVTQRAAVRGFLRKPIEAVVVE---- 117

Query: 907  XXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKME 728
                     GHWMF+ FCG C FLG+LK C  GWFGSAIE    +Q+     + + ++++
Sbjct: 118  MGLRERKRLGHWMFFAFCGVCLFLGILKICATGWFGSAIETASSNQD-NSGSMTHSNRID 176

Query: 727  ESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCID 548
            ESS D  +R       D GSDVERTL MVASGVVG +N  A+ +GIWS+P+S NY+QCID
Sbjct: 177  ESSHDYGYR-------DGGSDVERTLKMVASGVVGRENR-AEWTGIWSRPNSANYSQCID 228

Query: 547  QFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADES 368
              K HKK +  TNG++L+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWAD+S
Sbjct: 229  HPKSHKKPDPKTNGYILINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADDS 288

Query: 367  GFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHK 188
            GFKDLFDWQHF+ETLKDDIHIV+ LPPEYAGIEPFNKTPISWSK SYYK+EVLPLLKQH 
Sbjct: 289  GFKDLFDWQHFIETLKDDIHIVEALPPEYAGIEPFNKTPISWSKASYYKSEVLPLLKQHT 348

Query: 187  VIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALH 8
             +Y THTDSRL+NN LP SIQ+LRC+VNY ALKYSA IE+LGKTLVS M+ NG PYLALH
Sbjct: 349  AVYLTHTDSRLSNNDLPSSIQRLRCRVNYRALKYSAPIEQLGKTLVSGMRQNGGPYLALH 408

Query: 7    LR 2
            LR
Sbjct: 409  LR 410


>ref|XP_002870435.1| hypothetical protein ARALYDRAFT_493618 [Arabidopsis lyrata subsp.
            lyrata] gi|297316271|gb|EFH46694.1| hypothetical protein
            ARALYDRAFT_493618 [Arabidopsis lyrata subsp. lyrata]
          Length = 653

 Score =  454 bits (1168), Expect = e-125
 Identities = 246/422 (58%), Positives = 288/422 (68%), Gaps = 7/422 (1%)
 Frame = -2

Query: 1246 DGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEIN 1067
            DGV    V+SPRFSGPMTRRA SFKR                               EI+
Sbjct: 9    DGVPQHHVNSPRFSGPMTRRAQSFKRGGSGGSSSNTHVGDGNNTSTLRVHH------EID 62

Query: 1066 V-LNSPRSEVNSNSVSDD---GFDSFVEKKQSHLSKLTQRVH---LKKNIGSFNVDXXXX 908
            + LNSPRSE+ S S   D   GFDS + +K     +L +RV    L+K +GS   D    
Sbjct: 63   LPLNSPRSEIVSGSSGSDPSGGFDSALNRKHQTYGQLRERVVKGLLRKPMGSVVSDFSLR 122

Query: 907  XXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKME 728
                      HWMF+ FCG C FLGV K C  GW GSAI+     Q+L  S I  ++ ++
Sbjct: 123  ERKKLG----HWMFFAFCGVCLFLGVFKICATGWLGSAIDGAASHQDLSNS-IPRVNLLD 177

Query: 727  ESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCID 548
             SS D  ++       D G+DV+ TL+MVAS VVG QN+V ++SG+W+KP+SGN++QCID
Sbjct: 178  HSSHDYIYK-------DGGNDVDPTLVMVASDVVGDQNSVVEYSGVWAKPESGNFSQCID 230

Query: 547  QFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADES 368
              +  KKL  NTNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH+SYWAD+S
Sbjct: 231  SPRSRKKLGVNTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHSSYWADDS 290

Query: 367  GFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHK 188
            GFKDLFDWQHF+E LKDDIHIV++LP E AGIEPF KTPISWSKV YYK EVLPLLKQH 
Sbjct: 291  GFKDLFDWQHFIEELKDDIHIVEMLPSELAGIEPFVKTPISWSKVGYYKREVLPLLKQHI 350

Query: 187  VIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALH 8
            V+Y THTDSRLANN LP S+QKLRC+VNY ALKYSA IEELG  LVSRM+ N  PYLALH
Sbjct: 351  VMYLTHTDSRLANNDLPDSVQKLRCRVNYRALKYSAPIEELGNVLVSRMRQNRGPYLALH 410

Query: 7    LR 2
            LR
Sbjct: 411  LR 412


>ref|XP_003550617.1| PREDICTED: uncharacterized protein At1g04910-like [Glycine max]
          Length = 628

 Score =  450 bits (1158), Expect = e-124
 Identities = 251/421 (59%), Positives = 285/421 (67%), Gaps = 5/421 (1%)
 Frame = -2

Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070
            +DGVS QRV+SPRFSGPMTRRAHSFKR                               EI
Sbjct: 14   SDGVS-QRVNSPRFSGPMTRRAHSFKRNNSSNNSNNTATTTSHGGGGGSGGV------EI 66

Query: 1069 NV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVH----LKKNIGSFNVDXXXXX 905
             + +NSPRSE  S  V        V K   H   +TQRVH    LKK + S   D     
Sbjct: 67   ELQINSPRSEEASEGVP-------VGKHSHH--HVTQRVHVRGLLKKPLASIVEDLGLRE 117

Query: 904  XXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEE 725
                     HWMF VFCG C F+GVLK C  GW GSAIE    ++ L +S I  L+ M++
Sbjct: 118  RKKIG----HWMFLVFCGVCLFMGVLKICATGWLGSAIEITQSNKELSDS-IPSLTLMDK 172

Query: 724  SSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQ 545
            SS    +R          SDVERTL  VA+GV GS  A+ + SGIWSKP+S N+T+CID 
Sbjct: 173  SSLGYAYRGG-------ASDVERTLKTVATGVDGSHTAMTEDSGIWSKPNSDNFTKCIDL 225

Query: 544  FKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESG 365
               HKKL+  TNG++ +NANGGLNQMRFGICDMVAVAK +KATLVLPSLDHTSYWAD+SG
Sbjct: 226  PSNHKKLDAKTNGYIFVNANGGLNQMRFGICDMVAVAKIVKATLVLPSLDHTSYWADDSG 285

Query: 364  FKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKV 185
            FKDLFDW+HF+  LKDD+HIV+ LPP YAGIEPF KTPISWSKV YYKTEVLPLLKQHKV
Sbjct: 286  FKDLFDWKHFINMLKDDVHIVEKLPPAYAGIEPFPKTPISWSKVHYYKTEVLPLLKQHKV 345

Query: 184  IYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHL 5
            +YFTHTDSRL NN +P SIQKLRC+VNY ALKYSA IEELG TLVSRM+ NGNPYLALHL
Sbjct: 346  MYFTHTDSRLDNNDIPRSIQKLRCRVNYRALKYSAPIEELGNTLVSRMQQNGNPYLALHL 405

Query: 4    R 2
            R
Sbjct: 406  R 406


>ref|NP_568528.2| O-fucosyltransferase family protein [Arabidopsis thaliana]
            gi|14517444|gb|AAK62612.1| AT5g35570/K2K18_1 [Arabidopsis
            thaliana] gi|21360449|gb|AAM47340.1| AT5g35570/K2K18_1
            [Arabidopsis thaliana] gi|332006599|gb|AED93982.1|
            O-fucosyltransferase family protein [Arabidopsis
            thaliana]
          Length = 652

 Score =  449 bits (1156), Expect = e-123
 Identities = 244/427 (57%), Positives = 287/427 (67%), Gaps = 12/427 (2%)
 Frame = -2

Query: 1246 DGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH--- 1076
            DGV    V+SPRFSGPMTRRA SFKR                                  
Sbjct: 9    DGVPQHHVNSPRFSGPMTRRAQSFKRGGSAGSSSNNNNTHVGVSGGDGNNNNNTSSTLRV 68

Query: 1075 --EINV-LNSPRSEVNSNSVSDD---GFDSFVEKKQSHLSKLTQRVH---LKKNIGSFNV 923
              EI++ LNSPRSE+ S S   D   GFDS + +K     +L +RV    L+K +GS   
Sbjct: 69   HHEIDLPLNSPRSEIVSGSSGSDPSGGFDSALNRKHQTYGQLRERVVKGLLRKPMGSVVS 128

Query: 922  DXXXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDY 743
            D              HWMF+ FCG C FLGV K C  GW GSAI+    DQ+L    I  
Sbjct: 129  DFSLRERKKLG----HWMFFAFCGVCLFLGVFKICATGWLGSAIDGAASDQDL---SIPR 181

Query: 742  LSKMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNY 563
            ++ ++ SS D  ++       D G+DV+ TL+MVAS VVG QN+V + SG+W+KP+SGN+
Sbjct: 182  VNLLDHSSHDYIYK-------DGGNDVDPTLVMVASDVVGDQNSVVEFSGVWAKPESGNF 234

Query: 562  TQCIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSY 383
            ++CID  +  KKL  NTNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH+SY
Sbjct: 235  SRCIDSSRSRKKLGANTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHSSY 294

Query: 382  WADESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPL 203
            WAD+SGFKDLFDWQHF+E LKDDIHIV++LP E AGIEPF KTPISWSKV YYK EVLPL
Sbjct: 295  WADDSGFKDLFDWQHFIEELKDDIHIVEMLPSELAGIEPFVKTPISWSKVGYYKKEVLPL 354

Query: 202  LKQHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNP 23
            LKQH V+Y THTDSRLANN LP S+QKLRC+VNY ALKYSA IEELG  LVSRM+ +  P
Sbjct: 355  LKQHIVMYLTHTDSRLANNDLPDSVQKLRCRVNYRALKYSAPIEELGNVLVSRMRQDRGP 414

Query: 22   YLALHLR 2
            YLALHLR
Sbjct: 415  YLALHLR 421


>ref|XP_006395968.1| hypothetical protein EUTSA_v10003786mg [Eutrema salsugineum]
            gi|557092607|gb|ESQ33254.1| hypothetical protein
            EUTSA_v10003786mg [Eutrema salsugineum]
          Length = 654

 Score =  448 bits (1152), Expect = e-123
 Identities = 241/425 (56%), Positives = 287/425 (67%), Gaps = 10/425 (2%)
 Frame = -2

Query: 1246 DGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEIN 1067
            DGV  Q V+SPRFSGPMTRRA SFKR                                ++
Sbjct: 12   DGVP-QHVNSPRFSGPMTRRAQSFKRGGSGGSSSNNTHAGGSISAGDNSTGTNHSTLRVH 70

Query: 1066 -----VLNSPRSEVNSNSVSD--DGFDSFVEKKQSHLSKLTQRVH---LKKNIGSFNVDX 917
                  LNSPRSE+ S S  D    F+S + +K     +L +RV    L+K +GS   + 
Sbjct: 71   HEIDLQLNSPRSEIASGSGLDPSSAFESAINRKHQTYGQLRERVVKGLLRKPMGSVVSEL 130

Query: 916  XXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLS 737
                         HWMF+ FCG C F+GVLK C  GW GSAI+    DQ+L +S I  ++
Sbjct: 131  SLRERKKLG----HWMFFAFCGVCLFMGVLKICATGWLGSAIDGAASDQDLSDS-IPRVN 185

Query: 736  KMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQ 557
             ++ SS D  ++       D G+ ++ TL MVASGVVG QN+V ++SG+W+KP+SGN++Q
Sbjct: 186  LLDHSSHDYIYK-------DGGNGIDPTLAMVASGVVGDQNSVVEYSGVWAKPESGNHSQ 238

Query: 556  CIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWA 377
            CI+  +  KKL  NTNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH+SYWA
Sbjct: 239  CIETLRTRKKLGANTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHSSYWA 298

Query: 376  DESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLK 197
            D+SGFKDLFDWQHF+E LKDDIHIV+ LP E AGIEPF KTPISWSKV YYK EVLPLLK
Sbjct: 299  DDSGFKDLFDWQHFIEELKDDIHIVETLPSELAGIEPFVKTPISWSKVGYYKKEVLPLLK 358

Query: 196  QHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYL 17
            QH V+Y THTDSRLANN LP S+QKLRC+VNY ALKYSA IEELG  LVSRM+ N  PYL
Sbjct: 359  QHIVMYLTHTDSRLANNDLPDSVQKLRCRVNYRALKYSAPIEELGNVLVSRMRQNRGPYL 418

Query: 16   ALHLR 2
            ALHLR
Sbjct: 419  ALHLR 423


>ref|XP_006395967.1| hypothetical protein EUTSA_v10003786mg [Eutrema salsugineum]
            gi|557092606|gb|ESQ33253.1| hypothetical protein
            EUTSA_v10003786mg [Eutrema salsugineum]
          Length = 460

 Score =  448 bits (1152), Expect = e-123
 Identities = 241/425 (56%), Positives = 287/425 (67%), Gaps = 10/425 (2%)
 Frame = -2

Query: 1246 DGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEIN 1067
            DGV  Q V+SPRFSGPMTRRA SFKR                                ++
Sbjct: 12   DGVP-QHVNSPRFSGPMTRRAQSFKRGGSGGSSSNNTHAGGSISAGDNSTGTNHSTLRVH 70

Query: 1066 -----VLNSPRSEVNSNSVSD--DGFDSFVEKKQSHLSKLTQRVH---LKKNIGSFNVDX 917
                  LNSPRSE+ S S  D    F+S + +K     +L +RV    L+K +GS   + 
Sbjct: 71   HEIDLQLNSPRSEIASGSGLDPSSAFESAINRKHQTYGQLRERVVKGLLRKPMGSVVSEL 130

Query: 916  XXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLS 737
                         HWMF+ FCG C F+GVLK C  GW GSAI+    DQ+L +S I  ++
Sbjct: 131  SLRERKKLG----HWMFFAFCGVCLFMGVLKICATGWLGSAIDGAASDQDLSDS-IPRVN 185

Query: 736  KMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQ 557
             ++ SS D  ++       D G+ ++ TL MVASGVVG QN+V ++SG+W+KP+SGN++Q
Sbjct: 186  LLDHSSHDYIYK-------DGGNGIDPTLAMVASGVVGDQNSVVEYSGVWAKPESGNHSQ 238

Query: 556  CIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWA 377
            CI+  +  KKL  NTNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH+SYWA
Sbjct: 239  CIETLRTRKKLGANTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHSSYWA 298

Query: 376  DESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLK 197
            D+SGFKDLFDWQHF+E LKDDIHIV+ LP E AGIEPF KTPISWSKV YYK EVLPLLK
Sbjct: 299  DDSGFKDLFDWQHFIEELKDDIHIVETLPSELAGIEPFVKTPISWSKVGYYKKEVLPLLK 358

Query: 196  QHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYL 17
            QH V+Y THTDSRLANN LP S+QKLRC+VNY ALKYSA IEELG  LVSRM+ N  PYL
Sbjct: 359  QHIVMYLTHTDSRLANNDLPDSVQKLRCRVNYRALKYSAPIEELGNVLVSRMRQNRGPYL 418

Query: 16   ALHLR 2
            ALHLR
Sbjct: 419  ALHLR 423


>ref|XP_003542359.1| PREDICTED: uncharacterized protein At1g04910-like [Glycine max]
          Length = 626

 Score =  444 bits (1143), Expect = e-122
 Identities = 248/421 (58%), Positives = 285/421 (67%), Gaps = 5/421 (1%)
 Frame = -2

Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070
            +DGVS QRV+SPRFSGPMTRRAHSFKR                               E+
Sbjct: 14   SDGVS-QRVNSPRFSGPMTRRAHSFKRNNNNIAANTAATTSHGGAGGSGAG-------EV 65

Query: 1069 NV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVH----LKKNIGSFNVDXXXXX 905
             + +NSPRSE  S  V        V K   H   +TQRVH    LKK + S   D     
Sbjct: 66   ELQINSPRSEEASEGVP-------VGKHSHH--HVTQRVHVRGLLKKPLASIVEDLGLRE 116

Query: 904  XXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEE 725
                     HWMF VFCG C F+GVLK C  GW GSAIE+   ++ L +S I  L+ M++
Sbjct: 117  RKKIG----HWMFLVFCGVCLFMGVLKICATGWLGSAIERTQSNKELSDS-IASLNLMDK 171

Query: 724  SSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQ 545
            SS    +R          SDVERTL  VA+G  GS  A+ + SGIWSKP+S N+T+CID 
Sbjct: 172  SSLGYAYRGG-------ASDVERTLKTVATGD-GSHTAMTEDSGIWSKPNSDNFTKCIDL 223

Query: 544  FKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESG 365
               HKKL+  TNG++L+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWAD+SG
Sbjct: 224  PSNHKKLDAKTNGYILVNANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADDSG 283

Query: 364  FKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKV 185
            FKDLFDW+HF+  LK+D+HIV+ LPP YAGIEPF KTPISWSKV YYKTEVLPLLKQHKV
Sbjct: 284  FKDLFDWKHFINMLKNDVHIVEKLPPAYAGIEPFPKTPISWSKVPYYKTEVLPLLKQHKV 343

Query: 184  IYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHL 5
            +YFTHTDSRL NN +P SIQKLRC+ NY ALKYSA +EELG TLVSRM+ NGNPYLALHL
Sbjct: 344  MYFTHTDSRLDNNDIPRSIQKLRCRANYRALKYSAPVEELGNTLVSRMQQNGNPYLALHL 403

Query: 4    R 2
            R
Sbjct: 404  R 404


>gb|EOY30276.1| O-fucosyltransferase family protein isoform 2 [Theobroma cacao]
          Length = 564

 Score =  444 bits (1142), Expect = e-122
 Identities = 243/421 (57%), Positives = 279/421 (66%), Gaps = 5/421 (1%)
 Frame = -2

Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070
            +DGVS QRV+SPRFSGPMTRRA SFKR                                 
Sbjct: 13   SDGVS-QRVNSPRFSGPMTRRASSFKRGNGNSQTTNSNNALGSGNGNNNGSNGNNLSVHH 71

Query: 1069 NV---LNSPRSEVNS-NSVSDDGFDSFVEKKQSHLSKLTQRVHLKK-NIGSFNVDXXXXX 905
             +   +NSPRSE  +  SVS DG                +R  L+K ++GS  +D     
Sbjct: 72   EIDLPINSPRSETGAAGSVSIDGLSQ-------------RRGFLRKPSVGSMVLDFGLKE 118

Query: 904  XXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEE 725
                     HWMF VFCG C FLGV K C  GWFGSAIE    +Q L +  I+   ++++
Sbjct: 119  RKKLG----HWMFLVFCGVCLFLGVFKICATGWFGSAIETVTSNQGLSDISINRPKRIDQ 174

Query: 724  SSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQ 545
             S D  +R       + GSD +RTLM V S V        + SGIWS P+S N+T+CID 
Sbjct: 175  GSHDYGYR-------EEGSDSDRTLMTVPSDVT-------EDSGIWSLPNSENFTKCIDH 220

Query: 544  FKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESG 365
             K  KKL+  TNG++L+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWADESG
Sbjct: 221  SKNQKKLDAKTNGYILVNANGGLNQMRFGICDMVAVAKVMKATLVLPSLDHTSYWADESG 280

Query: 364  FKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKV 185
            FKDLFDW HFMETLKDD+HIV+ +PP YAGIEPFNKTPISWSKVSYY  EVLPLLKQHKV
Sbjct: 281  FKDLFDWHHFMETLKDDVHIVERIPPAYAGIEPFNKTPISWSKVSYYNAEVLPLLKQHKV 340

Query: 184  IYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHL 5
            IYFTHTDSRLANN +P SIQKLRC+VNY ALKYSA IEELG TL+SRM+ NG+PYLALHL
Sbjct: 341  IYFTHTDSRLANNDIPSSIQKLRCRVNYRALKYSAPIEELGNTLISRMRQNGSPYLALHL 400

Query: 4    R 2
            R
Sbjct: 401  R 401


>gb|EOY30275.1| O-fucosyltransferase family protein isoform 1 [Theobroma cacao]
          Length = 626

 Score =  444 bits (1142), Expect = e-122
 Identities = 243/421 (57%), Positives = 279/421 (66%), Gaps = 5/421 (1%)
 Frame = -2

Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070
            +DGVS QRV+SPRFSGPMTRRA SFKR                                 
Sbjct: 13   SDGVS-QRVNSPRFSGPMTRRASSFKRGNGNSQTTNSNNALGSGNGNNNGSNGNNLSVHH 71

Query: 1069 NV---LNSPRSEVNS-NSVSDDGFDSFVEKKQSHLSKLTQRVHLKK-NIGSFNVDXXXXX 905
             +   +NSPRSE  +  SVS DG                +R  L+K ++GS  +D     
Sbjct: 72   EIDLPINSPRSETGAAGSVSIDGLSQ-------------RRGFLRKPSVGSMVLDFGLKE 118

Query: 904  XXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEE 725
                     HWMF VFCG C FLGV K C  GWFGSAIE    +Q L +  I+   ++++
Sbjct: 119  RKKLG----HWMFLVFCGVCLFLGVFKICATGWFGSAIETVTSNQGLSDISINRPKRIDQ 174

Query: 724  SSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQ 545
             S D  +R       + GSD +RTLM V S V        + SGIWS P+S N+T+CID 
Sbjct: 175  GSHDYGYR-------EEGSDSDRTLMTVPSDVT-------EDSGIWSLPNSENFTKCIDH 220

Query: 544  FKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESG 365
             K  KKL+  TNG++L+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWADESG
Sbjct: 221  SKNQKKLDAKTNGYILVNANGGLNQMRFGICDMVAVAKVMKATLVLPSLDHTSYWADESG 280

Query: 364  FKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKV 185
            FKDLFDW HFMETLKDD+HIV+ +PP YAGIEPFNKTPISWSKVSYY  EVLPLLKQHKV
Sbjct: 281  FKDLFDWHHFMETLKDDVHIVERIPPAYAGIEPFNKTPISWSKVSYYNAEVLPLLKQHKV 340

Query: 184  IYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHL 5
            IYFTHTDSRLANN +P SIQKLRC+VNY ALKYSA IEELG TL+SRM+ NG+PYLALHL
Sbjct: 341  IYFTHTDSRLANNDIPSSIQKLRCRVNYRALKYSAPIEELGNTLISRMRQNGSPYLALHL 400

Query: 4    R 2
            R
Sbjct: 401  R 401


>ref|XP_006283281.1| hypothetical protein CARUB_v10004322mg [Capsella rubella]
            gi|482551986|gb|EOA16179.1| hypothetical protein
            CARUB_v10004322mg [Capsella rubella]
          Length = 659

 Score =  440 bits (1131), Expect = e-121
 Identities = 242/430 (56%), Positives = 285/430 (66%), Gaps = 15/430 (3%)
 Frame = -2

Query: 1246 DGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH--- 1076
            DGV  Q V+SPRFSGPMTRRA SFKR                                  
Sbjct: 12   DGVP-QHVNSPRFSGPMTRRAQSFKRGGSGGGGTSSNSHVGVSDNIGINNNNNTSSSSST 70

Query: 1075 -----EINV-LNSPRSEVNSNSVSDD---GFDSFVEKKQSHLSKLTQRVH---LKKNIGS 932
                 EI++ LNSPRSE+ S     D   GFDS V +K     +L +RV    L+K +GS
Sbjct: 71   LRVHHEIDLPLNSPRSEIVSGGSGSDPSGGFDSAVNRKHQTYGQLRERVVKGLLRKPMGS 130

Query: 931  FNVDXXXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESP 752
               D              HWMF+ FCG C F+GV K C  GW GSAI+    DQ+L  S 
Sbjct: 131  VVSDFSLKERKKLG----HWMFFAFCGVCLFMGVFKICATGWLGSAIDSAASDQDLSNS- 185

Query: 751  IDYLSKMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDS 572
            I  ++ ++ SS D  ++       D G+DV+ TL+MVAS VVG QN+V +++G+W+KP+S
Sbjct: 186  IPRVNLLDHSSHDYIYK-------DGGNDVDPTLVMVASDVVGDQNSVVEYTGVWAKPES 238

Query: 571  GNYTQCIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDH 392
             N++QCID  +  KKL  NTNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH
Sbjct: 239  ANFSQCIDSSRSRKKLNANTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDH 298

Query: 391  TSYWADESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEV 212
            +SYWAD+SGFKDLFDWQHF+E LKDDIHIV+ LP E A  EPF KTPISWSKV YYK EV
Sbjct: 299  SSYWADDSGFKDLFDWQHFIEELKDDIHIVESLPSELALTEPFVKTPISWSKVGYYKKEV 358

Query: 211  LPLLKQHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHN 32
            LPLLKQH V+Y THTDSRLANN LP S+QKLRC+VNY ALKYSA IEELG  LVSRM+ +
Sbjct: 359  LPLLKQHIVMYLTHTDSRLANNDLPDSVQKLRCRVNYKALKYSAPIEELGNILVSRMRED 418

Query: 31   GNPYLALHLR 2
              PYLALHLR
Sbjct: 419  RGPYLALHLR 428


>gb|ESW26581.1| hypothetical protein PHAVU_003G131300g [Phaseolus vulgaris]
          Length = 617

 Score =  437 bits (1124), Expect = e-120
 Identities = 239/417 (57%), Positives = 280/417 (67%), Gaps = 1/417 (0%)
 Frame = -2

Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070
            +DGVS QRV+SPRFSGPMTRRAHSFKR                               E+
Sbjct: 14   SDGVS-QRVNSPRFSGPMTRRAHSFKRNTDGTNSNGGSG-------------------EV 53

Query: 1069 NV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVHLKKNIGSFNVDXXXXXXXXX 893
             + +NSPRSE     +        V +   + + +TQRVH++  +               
Sbjct: 54   ELQINSPRSEEALEGIP-------VGRHSHNHNHVTQRVHVRSLLKKPLASIVEDLGFRE 106

Query: 892  XXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESSRD 713
                GH MF VFCG C F+GVLK C  GW GSAIE+   D+ L +S I  L+ M++SS  
Sbjct: 107  RKKIGHLMFLVFCGVCIFIGVLKICATGWLGSAIERAQSDKELPDS-IASLNLMDKSSLG 165

Query: 712  DPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQFKRH 533
              +R          SDVERTL  +A+GV  S  A+A+ SG WSKP+S N+TQCID     
Sbjct: 166  YAYRGG-------ASDVERTLKTLATGVGDSHTAMAEDSGTWSKPNSDNFTQCIDLPSNR 218

Query: 532  KKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESGFKDL 353
            KKL+   NG++++NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWAD+SGFKDL
Sbjct: 219  KKLDAKINGYIVVNANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADDSGFKDL 278

Query: 352  FDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKVIYFT 173
            FDW+HF+  LKDD+HIV+ LPP YAGIEPF KTPISWSKV YYKTEVLPLLKQHKVIYFT
Sbjct: 279  FDWKHFIHMLKDDVHIVEKLPPAYAGIEPFPKTPISWSKVPYYKTEVLPLLKQHKVIYFT 338

Query: 172  HTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHLR 2
            HTDSRLANN +P SIQKLRC+VNY ALKYSA IEE G TLVSRM+ NG+ YLALHLR
Sbjct: 339  HTDSRLANNDIPHSIQKLRCRVNYRALKYSAPIEEFGNTLVSRMQQNGSSYLALHLR 395


>ref|XP_002326282.1| predicted protein [Populus trichocarpa]
          Length = 648

 Score =  437 bits (1123), Expect = e-120
 Identities = 247/436 (56%), Positives = 287/436 (65%), Gaps = 18/436 (4%)
 Frame = -2

Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076
            +A+DGVS QRV+SPRFSGPMTRRAHSFKR                              +
Sbjct: 12   SASDGVS-QRVNSPRFSGPMTRRAHSFKRNNTSSNNNSNAGNANSSNNGSNNVSNGNSNN 70

Query: 1075 -------EINV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVH---------LK 947
                   EI++ LNSPRSE      + DGF+     +Q+    L+QRVH          K
Sbjct: 71   SILSPHLEIDLPLNSPRSE------TVDGFERESHSRQN----LSQRVHGGVVRILTNKK 120

Query: 946  KNIGSFNVDXXXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQN 767
             +IGS  +D              HWMF+ FCG C FLGV K C+ GWFGS +E+   +Q 
Sbjct: 121  GSIGSVILDFGFKERKKLG----HWMFFFFCGLCLFLGVFKICLYGWFGSTLERAASNQV 176

Query: 766  LYESPIDYLSKMEESSRDD-PHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGI 590
            L+   ID    +    +D   +  SEND        +R ++ V S VV   N  A+ SGI
Sbjct: 177  LHL--IDVFGSITRQEQDSYRYMGSENDQ-------KRMIIEVGSDVVDRLNKKAEFSGI 227

Query: 589  WSKPDSGNYTQCIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLV 410
            WSKP+S N+TQCIDQ   HKKL   TNG++L+NANGGLNQMRFGICDMVAVAK MKATLV
Sbjct: 228  WSKPNSENFTQCIDQPGNHKKLGARTNGYILINANGGLNQMRFGICDMVAVAKIMKATLV 287

Query: 409  LPSLDHTSYWADESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVS 230
            LPSLDHTSYWAD+SGFKDLF+WQHF++TLKDD+HIV+ LPP Y GIEPFNKT ISWSKV 
Sbjct: 288  LPSLDHTSYWADDSGFKDLFNWQHFIDTLKDDVHIVEKLPPAYDGIEPFNKTLISWSKVH 347

Query: 229  YYKTEVLPLLKQHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLV 50
            YYKTEVLPLLKQHKVIYFTHTDSRLANNGL  SIQKLRC+ NY ALKYS  IEELG TLV
Sbjct: 348  YYKTEVLPLLKQHKVIYFTHTDSRLANNGLSDSIQKLRCRANYRALKYSKPIEELGNTLV 407

Query: 49   SRMKHNGNPYLALHLR 2
            SRM+ NG+ YLALHLR
Sbjct: 408  SRMRENGSRYLALHLR 423


>ref|XP_006381630.1| hypothetical protein POPTR_0006s14490g [Populus trichocarpa]
            gi|550336338|gb|ERP59427.1| hypothetical protein
            POPTR_0006s14490g [Populus trichocarpa]
          Length = 648

 Score =  435 bits (1118), Expect = e-119
 Identities = 246/436 (56%), Positives = 286/436 (65%), Gaps = 18/436 (4%)
 Frame = -2

Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076
            +A+DGVS QRV+SPRFSGPMTRRAHSFKR                              +
Sbjct: 12   SASDGVS-QRVNSPRFSGPMTRRAHSFKRNNTSSNNNSNAGNANSSNNGSNNVSNGNSNN 70

Query: 1075 -------EINV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVH---------LK 947
                   EI++ LNSPRSE      + DGF+     +Q+    L+QRVH          K
Sbjct: 71   SILSPHLEIDLPLNSPRSE------TVDGFERESHSRQN----LSQRVHGGVVRILTNKK 120

Query: 946  KNIGSFNVDXXXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQN 767
             +IGS  +D              HWMF+ FCG C FLGV K C+ GWFGS +E+   +Q 
Sbjct: 121  GSIGSVILDFGFKERKKLG----HWMFFFFCGLCLFLGVFKICLYGWFGSTLERAASNQV 176

Query: 766  LYESPIDYLSKMEESSRDD-PHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGI 590
             +   ID    +    +D   +  SEND        +R ++ V S VV   N  A+ SGI
Sbjct: 177  THL--IDVFGSITRQEQDSYRYMGSENDQ-------KRMIIEVGSDVVDRLNKKAEFSGI 227

Query: 589  WSKPDSGNYTQCIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLV 410
            WSKP+S N+TQCIDQ   HKKL   TNG++L+NANGGLNQMRFGICDMVAVAK MKATLV
Sbjct: 228  WSKPNSENFTQCIDQPGNHKKLGARTNGYILINANGGLNQMRFGICDMVAVAKIMKATLV 287

Query: 409  LPSLDHTSYWADESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVS 230
            LPSLDHTSYWAD+SGFKDLF+WQHF++TLKDD+HIV+ LPP Y GIEPFNKT ISWSKV 
Sbjct: 288  LPSLDHTSYWADDSGFKDLFNWQHFIDTLKDDVHIVEKLPPAYDGIEPFNKTLISWSKVH 347

Query: 229  YYKTEVLPLLKQHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLV 50
            YYKTEVLPLLKQHKVIYFTHTDSRLANNGL  SIQKLRC+ NY ALKYS  IEELG TLV
Sbjct: 348  YYKTEVLPLLKQHKVIYFTHTDSRLANNGLSDSIQKLRCRANYRALKYSKPIEELGNTLV 407

Query: 49   SRMKHNGNPYLALHLR 2
            SRM+ NG+ YLALHLR
Sbjct: 408  SRMRENGSRYLALHLR 423


>ref|XP_004508243.1| PREDICTED: uncharacterized protein At1g04910-like [Cicer arietinum]
          Length = 630

 Score =  435 bits (1118), Expect = e-119
 Identities = 242/430 (56%), Positives = 280/430 (65%), Gaps = 12/430 (2%)
 Frame = -2

Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076
            T++DGVS QRV+SPRFSGPMTRRAHSFKR                               
Sbjct: 14   TSSDGVS-QRVNSPRFSGPMTRRAHSFKRNNTHNAAANNAVGGGGGAL------------ 60

Query: 1075 EINVLNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKL----TQRVH-------LKKNIGSF 929
                  S  SEV        G +  +E+K  H   L    +QRVH       LK+ + S 
Sbjct: 61   ------STHSEVELQK----GLEPALERKHGHHHHLHPHVSQRVHGGVVKAFLKRPLESI 110

Query: 928  NVDXXXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESP- 752
              D              HWMF VFCG C F+GVLK C  GW GSAIEK    + L +S  
Sbjct: 111  VDDLGFRERKKIG----HWMFLVFCGVCLFMGVLKICATGWLGSAIEKAQSSKELSDSNG 166

Query: 751  IDYLSKMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDS 572
            ID L+ M++SS    +R+          DVERTL  V + VV   +     S +WSKP+S
Sbjct: 167  IDNLNLMDQSSLGYAYRSG-------AGDVERTLKTVQTRVV---SFFIQESDVWSKPNS 216

Query: 571  GNYTQCIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDH 392
             N+TQCID  + HKKL+  TNG++L+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH
Sbjct: 217  ENFTQCIDLPRNHKKLDTKTNGYILINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDH 276

Query: 391  TSYWADESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEV 212
            TSYWAD+SGFKDLFDW+HF++TLKDDIHIV+ LPP Y GIEPF+KTPISWSKV YYKTE+
Sbjct: 277  TSYWADQSGFKDLFDWKHFIDTLKDDIHIVETLPPAYPGIEPFSKTPISWSKVPYYKTEI 336

Query: 211  LPLLKQHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHN 32
            LPLL  HKVIYFTHTDSRLANNG+P SIQKLRC+VNY AL+YSA IEE G  LVSRM+ N
Sbjct: 337  LPLLNHHKVIYFTHTDSRLANNGIPKSIQKLRCRVNYRALRYSAPIEEFGNILVSRMQQN 396

Query: 31   GNPYLALHLR 2
            GNPYLALHLR
Sbjct: 397  GNPYLALHLR 406

