BLASTX nr result
ID: Catharanthus23_contig00015564
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00015564
         (1466 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006342369.1| PREDICTED: uncharacterized protein At1g04910...  504 e-140
gb|EMJ05803.1| hypothetical protein PRUPE_ppa002708mg [Prunus pe...  502 e-139
ref|XP_004243713.1| PREDICTED: uncharacterized protein At1g04910...  499 e-138
ref|XP_002279041.1| PREDICTED: DUF246 domain-containing protein ...  497 e-138
ref|XP_006342370.1| PREDICTED: uncharacterized protein At1g04910...  495 e-137
gb|EXB38940.1| hypothetical protein L484_027375 [Morus notabilis]    481 e-133
ref|XP_004288979.1| PREDICTED: uncharacterized protein At1g04910...  461 e-127
ref|XP_002870435.1| hypothetical protein ARALYDRAFT_493618 [Arab...  454 e-125
ref|XP_003550617.1| PREDICTED: uncharacterized protein At1g04910...  450 e-124
ref|NP_568528.2| O-fucosyltransferase family protein [Arabidopsi...  449 e-123
ref|XP_006395968.1| hypothetical protein EUTSA_v10003786mg [Eutr...  448 e-123
ref|XP_006395967.1| hypothetical protein EUTSA_v10003786mg [Eutr...  448 e-123
ref|XP_003542359.1| PREDICTED: uncharacterized protein At1g04910...  444 e-122
gb|EOY30276.1| O-fucosyltransferase family protein isoform 2 [Th...  444 e-122
gb|EOY30275.1| O-fucosyltransferase family protein isoform 1 [Th...  444 e-122
ref|XP_006283281.1| hypothetical protein CARUB_v10004322mg [Caps...  440 e-121
gb|ESW26581.1| hypothetical protein PHAVU_003G131300g [Phaseolus...  437 e-120
ref|XP_002326282.1| predicted protein [Populus trichocarpa]          437 e-120
ref|XP_006381630.1| hypothetical protein POPTR_0006s14490g [Popu... 
435 e-119 ref|XP_004508243.1| PREDICTED: uncharacterized protein At1g04910... 435 e-119 >ref|XP_006342369.1| PREDICTED: uncharacterized protein At1g04910-like isoform X1 [Solanum tuberosum] Length = 648 Score = 504 bits (1298), Expect = e-140 Identities = 261/419 (62%), Positives = 304/419 (72%), Gaps = 1/419 (0%) Frame = -2 Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076 TATDGV QRV+SPRFSGPMTRRAHSFKR Sbjct: 15 TATDGVP-QRVNSPRFSGPMTRRAHSFKRTNNTNQNAQNTGSSSSSTASLNTHH------ 67 Query: 1075 EINV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVHLKKNIGSFNVDXXXXXXX 899 EI+V LNSPRSE N+N D ++ EKK +HLS + QRVHL+K + S VD Sbjct: 68 EIDVPLNSPRSETNANIA--DEYEILGEKKHTHLSNVIQRVHLRKKLESLTVDFGFGLEL 125 Query: 898 XXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESS 719 GHWMF VFCG C F+GVLKFC GWFGSAIE+ Y Q+ Y+S I LS ++S+ Sbjct: 126 KGRKKLGHWMFLVFCGFCLFIGVLKFCAYGWFGSAIERVAYSQDSYDSLISQLSLRDQST 185 Query: 718 RDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQFK 539 H + + +E+TL MVASGVVG+QN++ D S IW KP+S N+TQCI++ K Sbjct: 186 HAYRHMEGDTKHSGERNHLEQTLSMVASGVVGNQNSMLDFSEIWLKPNSENFTQCIERTK 245 Query: 538 RHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESGFK 359 K ++ TNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLP LDHTSYWADESGFK Sbjct: 246 SQKLVDAKTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPLLDHTSYWADESGFK 305 Query: 358 DLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKVIY 179 DLF+WQHF+ETLKDDIHIV+ LPPE+AG EPFNKTPISWSKVSYYK+EVLPLLKQHKV+Y Sbjct: 306 DLFNWQHFIETLKDDIHIVETLPPEFAGTEPFNKTPISWSKVSYYKSEVLPLLKQHKVMY 365 Query: 178 FTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHLR 2 THTDSR+ANNG+P SIQKLRC+VNY ALKYSA IE LG+ LVSRM+ +GNPYLALHLR Sbjct: 366 ITHTDSRIANNGIPNSIQKLRCRVNYQALKYSAPIETLGRILVSRMRQDGNPYLALHLR 424 >gb|EMJ05803.1| hypothetical protein PRUPE_ppa002708mg [Prunus persica] Length = 642 Score = 502 bits (1292), Expect = e-139 Identities = 267/422 (63%), Positives = 311/422 (73%), Gaps = 6/422 (1%) Frame = -2 Query: 1249 
TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070 +DGVS QRV+SPRFSGPMTRRAHSFKR +EI Sbjct: 11 SDGVS-QRVNSPRFSGPMTRRAHSFKRNPNTSANNGSSHGNSNSNNSSGSVGFGSGEYEI 69 Query: 1069 NV-LNSPRSEVNSNSVSDDGFDSFVEKKQSH--LSKLTQRVHLKKNIGSFNVDXXXXXXX 899 ++ LNSPRSE+ NSV DGFDS +E+KQ+H ++ R L+K IGS VD Sbjct: 70 DLPLNSPRSEIGGNSVPGDGFDSVLERKQTHHVSQRVAVRGFLRKPIGSVVVDLGLREKK 129 Query: 898 XXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESS 719 HWMF+ FCG C FLG+LK C GWFGSAIE +Q+ PI +++M++SS Sbjct: 130 QLG----HWMFFAFCGVCLFLGILKICATGWFGSAIESSRSNQD-GSDPITLMNRMDQSS 184 Query: 718 RDDPHRASENDDGDRGSDVERTLMMVASGV---VGSQNAVADHSGIWSKPDSGNYTQCID 548 D HR D GSDVERTLMM ASGV VG +N+V +++GIWS+P+S N++QCI+ Sbjct: 185 HDYGHR-------DGGSDVERTLMM-ASGVNRVVGEENSV-EYTGIWSRPNSENFSQCIE 235 Query: 547 QFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADES 368 K HKKL+ TNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWAD+S Sbjct: 236 LPKIHKKLDAKTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADDS 295 Query: 367 GFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHK 188 GFKDLFDWQHF+ETLKDDIHIV+ LPP YAGIEPFNKTPISWSK SYYK+EVL LLKQHK Sbjct: 296 GFKDLFDWQHFIETLKDDIHIVETLPPAYAGIEPFNKTPISWSKASYYKSEVLSLLKQHK 355 Query: 187 VIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALH 8 VIYFTHTDSR++NNG+P SIQ+LRC+VNY ALKYSA IEELGKTLVSRM+ NG PYLALH Sbjct: 356 VIYFTHTDSRISNNGIPSSIQRLRCRVNYRALKYSAPIEELGKTLVSRMRQNGGPYLALH 415 Query: 7 LR 2 LR Sbjct: 416 LR 417 >ref|XP_004243713.1| PREDICTED: uncharacterized protein At1g04910-like [Solanum lycopersicum] Length = 646 Score = 499 bits (1285), Expect = e-138 Identities = 260/419 (62%), Positives = 303/419 (72%), Gaps = 1/419 (0%) Frame = -2 Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076 TATDGV QRV+SPRFSGPMTRRAHSFKR Sbjct: 15 TATDGVP-QRVNSPRFSGPMTRRAHSFKRTNNTNQNAQNTGGGSSNSTATLNTHH----- 68 Query: 1075 EINV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVHLKKNIGSFNVDXXXXXXX 899 EI+V LNSPRSE N+N D ++ EKK +HLS + QRVHL+K + S VD Sbjct: 69 
EIDVPLNSPRSETNANIA--DEYEILGEKKHTHLSNVIQRVHLRKKLESLTVDFGFGLEL 126 Query: 898 XXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESS 719 GHWMF VFCG C F+GVLKFC GWFGSAIE+ Y Q+ Y+S + S ++S+ Sbjct: 127 KGRKKLGHWMFLVFCGFCLFMGVLKFCAYGWFGSAIERVAYSQDSYDSLV---SLRDQST 183 Query: 718 RDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQFK 539 H + + +E+TL MVASGVVG+QN + D+S IW P+S N+TQCI++ K Sbjct: 184 HTYRHMDGDTKHSGERNHLEQTLSMVASGVVGNQNNMLDYSEIWLHPNSENFTQCIERTK 243 Query: 538 RHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESGFK 359 K ++ TNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWADESGFK Sbjct: 244 SQKLVDAKTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADESGFK 303 Query: 358 DLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKVIY 179 DLFDWQHF+ETLKDDIHIV+ LPPE+AG EPFNKTPISWSKVSYYK+EVLPLLKQHKV+Y Sbjct: 304 DLFDWQHFIETLKDDIHIVETLPPEFAGTEPFNKTPISWSKVSYYKSEVLPLLKQHKVMY 363 Query: 178 FTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHLR 2 THTDSR+ANNG+P SIQKLRC+VNY ALKYSA IE LG+ LVSRM+ +GNPYLALHLR Sbjct: 364 ITHTDSRIANNGIPNSIQKLRCRVNYQALKYSAPIETLGRILVSRMRQDGNPYLALHLR 422 >ref|XP_002279041.1| PREDICTED: DUF246 domain-containing protein At1g04910 [Vitis vinifera] gi|297738571|emb|CBI27816.3| unnamed protein product [Vitis vinifera] Length = 634 Score = 497 bits (1279), Expect = e-138 Identities = 260/425 (61%), Positives = 312/425 (73%), Gaps = 8/425 (1%) Frame = -2 Query: 1252 ATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHE 1073 A+DGVS QRV+SPRFSGPMTRRAHSFKR E Sbjct: 7 ASDGVS-QRVNSPRFSGPMTRRAHSFKRGNSSGNAHNNGSSKGGGGFDPHY--------E 57 Query: 1072 INV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVH-------LKKNIGSFNVDX 917 I+V LNSPRSE+ + VS DGFD +E+KQ+H + QRVH KK++GS +D Sbjct: 58 IDVHLNSPRSEICGSPVSGDGFDVVLERKQTH--HVNQRVHGGVLKNQPKKHVGSAVLDL 115 Query: 916 XXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLS 737 HWMF+VFCG C FLGVLK C GWFGSAI++ Q+ + +L+ Sbjct: 116 GLRERKKLG----HWMFFVFCGVCLFLGVLKICATGWFGSAIDRIGSHQDFSDPLNTHLN 171 Query: 736 
KMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQ 557 +M++SS D +R + GSDVERTLMMVASGVV Q ++A++S IWSKP+S N+TQ Sbjct: 172 EMDKSSHDYVYR-------EGGSDVERTLMMVASGVVNRQKSMAENSDIWSKPNSENFTQ 224 Query: 556 CIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWA 377 C++Q + HKKL+ TNG++++NANGGLNQMRFGICDMVA+AK MKATLVLPSLDHTSYWA Sbjct: 225 CVNQPRIHKKLDAKTNGYIIINANGGLNQMRFGICDMVAIAKVMKATLVLPSLDHTSYWA 284 Query: 376 DESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLK 197 D+S FKDLFDWQHF++ LKDD+HIV+ LPP+YAGIEPF KTPISWSKVSYYKTE+LPLLK Sbjct: 285 DDSDFKDLFDWQHFIKALKDDVHIVETLPPDYAGIEPFTKTPISWSKVSYYKTEILPLLK 344 Query: 196 QHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYL 17 Q+KVIYFTHTDSRLANNG+P SIQKLRC+VNY ALKYS+ IEELG TLVSRM+ GNPY+ Sbjct: 345 QYKVIYFTHTDSRLANNGIPSSIQKLRCRVNYKALKYSSLIEELGNTLVSRMREGGNPYI 404 Query: 16 ALHLR 2 ALHLR Sbjct: 405 ALHLR 409 >ref|XP_006342370.1| PREDICTED: uncharacterized protein At1g04910-like isoform X2 [Solanum tuberosum] Length = 643 Score = 495 bits (1275), Expect = e-137 Identities = 260/419 (62%), Positives = 301/419 (71%), Gaps = 1/419 (0%) Frame = -2 Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076 TATDGV QRV+SPRFSGPMTRRAHSFKR Sbjct: 15 TATDGVP-QRVNSPRFSGPMTRRAHSFKRTNNTNQNAQNTGSSSSSTASLNTHH------ 67 Query: 1075 EINV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVHLKKNIGSFNVDXXXXXXX 899 EI+V LNSPRSE N+N D ++ EKK +HLS + QRVHL+K + S VD Sbjct: 68 EIDVPLNSPRSETNANIA--DEYEILGEKKHTHLSNVIQRVHLRKKLESLTVDFGFGLEL 125 Query: 898 XXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESS 719 GHWMF VFCG C F+GVLKFC GWFGSAIE+ YD S I LS ++S+ Sbjct: 126 KGRKKLGHWMFLVFCGFCLFIGVLKFCAYGWFGSAIERDSYD-----SLISQLSLRDQST 180 Query: 718 RDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQFK 539 H + + +E+TL MVASGVVG+QN++ D S IW KP+S N+TQCI++ K Sbjct: 181 HAYRHMEGDTKHSGERNHLEQTLSMVASGVVGNQNSMLDFSEIWLKPNSENFTQCIERTK 240 Query: 538 RHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESGFK 359 K ++ 
TNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLP LDHTSYWADESGFK Sbjct: 241 SQKLVDAKTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPLLDHTSYWADESGFK 300 Query: 358 DLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKVIY 179 DLF+WQHF+ETLKDDIHIV+ LPPE+AG EPFNKTPISWSKVSYYK+EVLPLLKQHKV+Y Sbjct: 301 DLFNWQHFIETLKDDIHIVETLPPEFAGTEPFNKTPISWSKVSYYKSEVLPLLKQHKVMY 360 Query: 178 FTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHLR 2 THTDSR+ANNG+P SIQKLRC+VNY ALKYSA IE LG+ LVSRM+ +GNPYLALHLR Sbjct: 361 ITHTDSRIANNGIPNSIQKLRCRVNYQALKYSAPIETLGRILVSRMRQDGNPYLALHLR 419 >gb|EXB38940.1| hypothetical protein L484_027375 [Morus notabilis] Length = 641 Score = 481 bits (1239), Expect = e-133 Identities = 259/418 (61%), Positives = 299/418 (71%), Gaps = 2/418 (0%) Frame = -2 Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070 +DGVS QRV+SPRFSGPMTRRAHSFKR HEI Sbjct: 16 SDGVS-QRVNSPRFSGPMTRRAHSFKRNANSSSQSGTNTGNNGGGGGGNNGSGLSPHHEI 74 Query: 1069 NV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVHLKKNIGSFNVDXXXXXXXXX 893 + LNSPRSE+ N S DGFDS +E++ R L+K IGS VD Sbjct: 75 ELQLNSPRSEIGGNLSSVDGFDSVLERRH--------RFALRKKIGSVVVDLGLREKKKL 126 Query: 892 XXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESSRD 713 HWMF VFCG C FLGVLK C GWFGSAIE+ D++ + P+ L M++SS+D Sbjct: 127 G----HWMFLVFCGLCLFLGVLKICATGWFGSAIERASSDRDSTD-PMSGLLVMDQSSKD 181 Query: 712 DPHRASENDDGDRGSDVERTLMMVASGV-VGSQNAVADHSGIWSKPDSGNYTQCIDQFKR 536 +R +G+DVERTLMMV++GV V +Q + ++SGIWS+P+S N+TQCIDQ Sbjct: 182 YVYREK------KGTDVERTLMMVSTGVRVDNQKSKDEYSGIWSRPNSENFTQCIDQPNN 235 Query: 535 HKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESGFKD 356 KKL+ TNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWADESGFKD Sbjct: 236 KKKLDLKTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADESGFKD 295 Query: 355 LFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKVIYF 176 LFDW+HF+ETLKDD+HIV+ LPP YA IEP KTPISWSK YYKTEVLP LKQHKV+YF Sbjct: 296 LFDWRHFIETLKDDVHIVETLPPAYADIEPLMKTPISWSKAGYYKTEVLPPLKQHKVVYF 355 Query: 175 
THTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHLR 2 THTDSRLANNG+P SIQKLRC+VNY ALKYSAQIEEL TLVSRM+ +GNPYLALHLR Sbjct: 356 THTDSRLANNGIPNSIQKLRCRVNYRALKYSAQIEELATTLVSRMRCDGNPYLALHLR 413 >ref|XP_004288979.1| PREDICTED: uncharacterized protein At1g04910-like [Fragaria vesca subsp. vesca] Length = 634 Score = 461 bits (1187), Expect = e-127 Identities = 249/422 (59%), Positives = 293/422 (69%), Gaps = 4/422 (0%) Frame = -2 Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076 T+ DG SQRV+SPRFSG MTRRAHSFKR Sbjct: 11 TSADGGVSQRVNSPRFSGAMTRRAHSFKRNPFSSSSSAAAAANNDDGGIAGGGFSTQYEV 70 Query: 1075 EINVLNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRV----HLKKNIGSFNVDXXXX 908 ++ +NSPRSE+ + +GF + QS +TQR L+K I + V+ Sbjct: 71 DLQ-MNSPRSEIGG---AGEGFVT-----QSGGGHVTQRAAVRGFLRKPIEAVVVE---- 117 Query: 907 XXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKME 728 GHWMF+ FCG C FLG+LK C GWFGSAIE +Q+ + + ++++ Sbjct: 118 MGLRERKRLGHWMFFAFCGVCLFLGILKICATGWFGSAIETASSNQD-NSGSMTHSNRID 176 Query: 727 ESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCID 548 ESS D +R D GSDVERTL MVASGVVG +N A+ +GIWS+P+S NY+QCID Sbjct: 177 ESSHDYGYR-------DGGSDVERTLKMVASGVVGRENR-AEWTGIWSRPNSANYSQCID 228 Query: 547 QFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADES 368 K HKK + TNG++L+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWAD+S Sbjct: 229 HPKSHKKPDPKTNGYILINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADDS 288 Query: 367 GFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHK 188 GFKDLFDWQHF+ETLKDDIHIV+ LPPEYAGIEPFNKTPISWSK SYYK+EVLPLLKQH Sbjct: 289 GFKDLFDWQHFIETLKDDIHIVEALPPEYAGIEPFNKTPISWSKASYYKSEVLPLLKQHT 348 Query: 187 VIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALH 8 +Y THTDSRL+NN LP SIQ+LRC+VNY ALKYSA IE+LGKTLVS M+ NG PYLALH Sbjct: 349 AVYLTHTDSRLSNNDLPSSIQRLRCRVNYRALKYSAPIEQLGKTLVSGMRQNGGPYLALH 408 Query: 7 LR 2 LR Sbjct: 409 LR 410 >ref|XP_002870435.1| hypothetical protein ARALYDRAFT_493618 [Arabidopsis lyrata subsp. 
lyrata] gi|297316271|gb|EFH46694.1| hypothetical protein ARALYDRAFT_493618 [Arabidopsis lyrata subsp. lyrata] Length = 653 Score = 454 bits (1168), Expect = e-125 Identities = 246/422 (58%), Positives = 288/422 (68%), Gaps = 7/422 (1%) Frame = -2 Query: 1246 DGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEIN 1067 DGV V+SPRFSGPMTRRA SFKR EI+ Sbjct: 9 DGVPQHHVNSPRFSGPMTRRAQSFKRGGSGGSSSNTHVGDGNNTSTLRVHH------EID 62 Query: 1066 V-LNSPRSEVNSNSVSDD---GFDSFVEKKQSHLSKLTQRVH---LKKNIGSFNVDXXXX 908 + LNSPRSE+ S S D GFDS + +K +L +RV L+K +GS D Sbjct: 63 LPLNSPRSEIVSGSSGSDPSGGFDSALNRKHQTYGQLRERVVKGLLRKPMGSVVSDFSLR 122 Query: 907 XXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKME 728 HWMF+ FCG C FLGV K C GW GSAI+ Q+L S I ++ ++ Sbjct: 123 ERKKLG----HWMFFAFCGVCLFLGVFKICATGWLGSAIDGAASHQDLSNS-IPRVNLLD 177 Query: 727 ESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCID 548 SS D ++ D G+DV+ TL+MVAS VVG QN+V ++SG+W+KP+SGN++QCID Sbjct: 178 HSSHDYIYK-------DGGNDVDPTLVMVASDVVGDQNSVVEYSGVWAKPESGNFSQCID 230 Query: 547 QFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADES 368 + KKL NTNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH+SYWAD+S Sbjct: 231 SPRSRKKLGVNTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHSSYWADDS 290 Query: 367 GFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHK 188 GFKDLFDWQHF+E LKDDIHIV++LP E AGIEPF KTPISWSKV YYK EVLPLLKQH Sbjct: 291 GFKDLFDWQHFIEELKDDIHIVEMLPSELAGIEPFVKTPISWSKVGYYKREVLPLLKQHI 350 Query: 187 VIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALH 8 V+Y THTDSRLANN LP S+QKLRC+VNY ALKYSA IEELG LVSRM+ N PYLALH Sbjct: 351 VMYLTHTDSRLANNDLPDSVQKLRCRVNYRALKYSAPIEELGNVLVSRMRQNRGPYLALH 410 Query: 7 LR 2 LR Sbjct: 411 LR 412 >ref|XP_003550617.1| PREDICTED: uncharacterized protein At1g04910-like [Glycine max] Length = 628 Score = 450 bits (1158), Expect = e-124 Identities = 251/421 (59%), Positives = 285/421 (67%), Gaps = 5/421 (1%) Frame = -2 Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070 +DGVS 
QRV+SPRFSGPMTRRAHSFKR EI Sbjct: 14 SDGVS-QRVNSPRFSGPMTRRAHSFKRNNSSNNSNNTATTTSHGGGGGSGGV------EI 66 Query: 1069 NV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVH----LKKNIGSFNVDXXXXX 905 + +NSPRSE S V V K H +TQRVH LKK + S D Sbjct: 67 ELQINSPRSEEASEGVP-------VGKHSHH--HVTQRVHVRGLLKKPLASIVEDLGLRE 117 Query: 904 XXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEE 725 HWMF VFCG C F+GVLK C GW GSAIE ++ L +S I L+ M++ Sbjct: 118 RKKIG----HWMFLVFCGVCLFMGVLKICATGWLGSAIEITQSNKELSDS-IPSLTLMDK 172 Query: 724 SSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQ 545 SS +R SDVERTL VA+GV GS A+ + SGIWSKP+S N+T+CID Sbjct: 173 SSLGYAYRGG-------ASDVERTLKTVATGVDGSHTAMTEDSGIWSKPNSDNFTKCIDL 225 Query: 544 FKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESG 365 HKKL+ TNG++ +NANGGLNQMRFGICDMVAVAK +KATLVLPSLDHTSYWAD+SG Sbjct: 226 PSNHKKLDAKTNGYIFVNANGGLNQMRFGICDMVAVAKIVKATLVLPSLDHTSYWADDSG 285 Query: 364 FKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKV 185 FKDLFDW+HF+ LKDD+HIV+ LPP YAGIEPF KTPISWSKV YYKTEVLPLLKQHKV Sbjct: 286 FKDLFDWKHFINMLKDDVHIVEKLPPAYAGIEPFPKTPISWSKVHYYKTEVLPLLKQHKV 345 Query: 184 IYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHL 5 +YFTHTDSRL NN +P SIQKLRC+VNY ALKYSA IEELG TLVSRM+ NGNPYLALHL Sbjct: 346 MYFTHTDSRLDNNDIPRSIQKLRCRVNYRALKYSAPIEELGNTLVSRMQQNGNPYLALHL 405 Query: 4 R 2 R Sbjct: 406 R 406 >ref|NP_568528.2| O-fucosyltransferase family protein [Arabidopsis thaliana] gi|14517444|gb|AAK62612.1| AT5g35570/K2K18_1 [Arabidopsis thaliana] gi|21360449|gb|AAM47340.1| AT5g35570/K2K18_1 [Arabidopsis thaliana] gi|332006599|gb|AED93982.1| O-fucosyltransferase family protein [Arabidopsis thaliana] Length = 652 Score = 449 bits (1156), Expect = e-123 Identities = 244/427 (57%), Positives = 287/427 (67%), Gaps = 12/427 (2%) Frame = -2 Query: 1246 DGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH--- 1076 DGV V+SPRFSGPMTRRA SFKR Sbjct: 9 DGVPQHHVNSPRFSGPMTRRAQSFKRGGSAGSSSNNNNTHVGVSGGDGNNNNNTSSTLRV 68 Query: 1075 
--EINV-LNSPRSEVNSNSVSDD---GFDSFVEKKQSHLSKLTQRVH---LKKNIGSFNV 923 EI++ LNSPRSE+ S S D GFDS + +K +L +RV L+K +GS Sbjct: 69 HHEIDLPLNSPRSEIVSGSSGSDPSGGFDSALNRKHQTYGQLRERVVKGLLRKPMGSVVS 128 Query: 922 DXXXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDY 743 D HWMF+ FCG C FLGV K C GW GSAI+ DQ+L I Sbjct: 129 DFSLRERKKLG----HWMFFAFCGVCLFLGVFKICATGWLGSAIDGAASDQDL---SIPR 181 Query: 742 LSKMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNY 563 ++ ++ SS D ++ D G+DV+ TL+MVAS VVG QN+V + SG+W+KP+SGN+ Sbjct: 182 VNLLDHSSHDYIYK-------DGGNDVDPTLVMVASDVVGDQNSVVEFSGVWAKPESGNF 234 Query: 562 TQCIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSY 383 ++CID + KKL NTNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH+SY Sbjct: 235 SRCIDSSRSRKKLGANTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHSSY 294 Query: 382 WADESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPL 203 WAD+SGFKDLFDWQHF+E LKDDIHIV++LP E AGIEPF KTPISWSKV YYK EVLPL Sbjct: 295 WADDSGFKDLFDWQHFIEELKDDIHIVEMLPSELAGIEPFVKTPISWSKVGYYKKEVLPL 354 Query: 202 LKQHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNP 23 LKQH V+Y THTDSRLANN LP S+QKLRC+VNY ALKYSA IEELG LVSRM+ + P Sbjct: 355 LKQHIVMYLTHTDSRLANNDLPDSVQKLRCRVNYRALKYSAPIEELGNVLVSRMRQDRGP 414 Query: 22 YLALHLR 2 YLALHLR Sbjct: 415 YLALHLR 421 >ref|XP_006395968.1| hypothetical protein EUTSA_v10003786mg [Eutrema salsugineum] gi|557092607|gb|ESQ33254.1| hypothetical protein EUTSA_v10003786mg [Eutrema salsugineum] Length = 654 Score = 448 bits (1152), Expect = e-123 Identities = 241/425 (56%), Positives = 287/425 (67%), Gaps = 10/425 (2%) Frame = -2 Query: 1246 DGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEIN 1067 DGV Q V+SPRFSGPMTRRA SFKR ++ Sbjct: 12 DGVP-QHVNSPRFSGPMTRRAQSFKRGGSGGSSSNNTHAGGSISAGDNSTGTNHSTLRVH 70 Query: 1066 -----VLNSPRSEVNSNSVSD--DGFDSFVEKKQSHLSKLTQRVH---LKKNIGSFNVDX 917 LNSPRSE+ S S D F+S + +K +L +RV L+K +GS + Sbjct: 71 HEIDLQLNSPRSEIASGSGLDPSSAFESAINRKHQTYGQLRERVVKGLLRKPMGSVVSEL 130 Query: 916 
XXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLS 737 HWMF+ FCG C F+GVLK C GW GSAI+ DQ+L +S I ++ Sbjct: 131 SLRERKKLG----HWMFFAFCGVCLFMGVLKICATGWLGSAIDGAASDQDLSDS-IPRVN 185 Query: 736 KMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQ 557 ++ SS D ++ D G+ ++ TL MVASGVVG QN+V ++SG+W+KP+SGN++Q Sbjct: 186 LLDHSSHDYIYK-------DGGNGIDPTLAMVASGVVGDQNSVVEYSGVWAKPESGNHSQ 238 Query: 556 CIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWA 377 CI+ + KKL NTNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH+SYWA Sbjct: 239 CIETLRTRKKLGANTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHSSYWA 298 Query: 376 DESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLK 197 D+SGFKDLFDWQHF+E LKDDIHIV+ LP E AGIEPF KTPISWSKV YYK EVLPLLK Sbjct: 299 DDSGFKDLFDWQHFIEELKDDIHIVETLPSELAGIEPFVKTPISWSKVGYYKKEVLPLLK 358 Query: 196 QHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYL 17 QH V+Y THTDSRLANN LP S+QKLRC+VNY ALKYSA IEELG LVSRM+ N PYL Sbjct: 359 QHIVMYLTHTDSRLANNDLPDSVQKLRCRVNYRALKYSAPIEELGNVLVSRMRQNRGPYL 418 Query: 16 ALHLR 2 ALHLR Sbjct: 419 ALHLR 423 >ref|XP_006395967.1| hypothetical protein EUTSA_v10003786mg [Eutrema salsugineum] gi|557092606|gb|ESQ33253.1| hypothetical protein EUTSA_v10003786mg [Eutrema salsugineum] Length = 460 Score = 448 bits (1152), Expect = e-123 Identities = 241/425 (56%), Positives = 287/425 (67%), Gaps = 10/425 (2%) Frame = -2 Query: 1246 DGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEIN 1067 DGV Q V+SPRFSGPMTRRA SFKR ++ Sbjct: 12 DGVP-QHVNSPRFSGPMTRRAQSFKRGGSGGSSSNNTHAGGSISAGDNSTGTNHSTLRVH 70 Query: 1066 -----VLNSPRSEVNSNSVSD--DGFDSFVEKKQSHLSKLTQRVH---LKKNIGSFNVDX 917 LNSPRSE+ S S D F+S + +K +L +RV L+K +GS + Sbjct: 71 HEIDLQLNSPRSEIASGSGLDPSSAFESAINRKHQTYGQLRERVVKGLLRKPMGSVVSEL 130 Query: 916 XXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLS 737 HWMF+ FCG C F+GVLK C GW GSAI+ DQ+L +S I ++ Sbjct: 131 SLRERKKLG----HWMFFAFCGVCLFMGVLKICATGWLGSAIDGAASDQDLSDS-IPRVN 185 Query: 736 
KMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQ 557 ++ SS D ++ D G+ ++ TL MVASGVVG QN+V ++SG+W+KP+SGN++Q Sbjct: 186 LLDHSSHDYIYK-------DGGNGIDPTLAMVASGVVGDQNSVVEYSGVWAKPESGNHSQ 238 Query: 556 CIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWA 377 CI+ + KKL NTNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH+SYWA Sbjct: 239 CIETLRTRKKLGANTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHSSYWA 298 Query: 376 DESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLK 197 D+SGFKDLFDWQHF+E LKDDIHIV+ LP E AGIEPF KTPISWSKV YYK EVLPLLK Sbjct: 299 DDSGFKDLFDWQHFIEELKDDIHIVETLPSELAGIEPFVKTPISWSKVGYYKKEVLPLLK 358 Query: 196 QHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYL 17 QH V+Y THTDSRLANN LP S+QKLRC+VNY ALKYSA IEELG LVSRM+ N PYL Sbjct: 359 QHIVMYLTHTDSRLANNDLPDSVQKLRCRVNYRALKYSAPIEELGNVLVSRMRQNRGPYL 418 Query: 16 ALHLR 2 ALHLR Sbjct: 419 ALHLR 423 >ref|XP_003542359.1| PREDICTED: uncharacterized protein At1g04910-like [Glycine max] Length = 626 Score = 444 bits (1143), Expect = e-122 Identities = 248/421 (58%), Positives = 285/421 (67%), Gaps = 5/421 (1%) Frame = -2 Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070 +DGVS QRV+SPRFSGPMTRRAHSFKR E+ Sbjct: 14 SDGVS-QRVNSPRFSGPMTRRAHSFKRNNNNIAANTAATTSHGGAGGSGAG-------EV 65 Query: 1069 NV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVH----LKKNIGSFNVDXXXXX 905 + +NSPRSE S V V K H +TQRVH LKK + S D Sbjct: 66 ELQINSPRSEEASEGVP-------VGKHSHH--HVTQRVHVRGLLKKPLASIVEDLGLRE 116 Query: 904 XXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEE 725 HWMF VFCG C F+GVLK C GW GSAIE+ ++ L +S I L+ M++ Sbjct: 117 RKKIG----HWMFLVFCGVCLFMGVLKICATGWLGSAIERTQSNKELSDS-IASLNLMDK 171 Query: 724 SSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQ 545 SS +R SDVERTL VA+G GS A+ + SGIWSKP+S N+T+CID Sbjct: 172 SSLGYAYRGG-------ASDVERTLKTVATGD-GSHTAMTEDSGIWSKPNSDNFTKCIDL 223 Query: 544 FKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESG 365 HKKL+ TNG++L+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWAD+SG 
Sbjct: 224 PSNHKKLDAKTNGYILVNANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADDSG 283 Query: 364 FKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKV 185 FKDLFDW+HF+ LK+D+HIV+ LPP YAGIEPF KTPISWSKV YYKTEVLPLLKQHKV Sbjct: 284 FKDLFDWKHFINMLKNDVHIVEKLPPAYAGIEPFPKTPISWSKVPYYKTEVLPLLKQHKV 343 Query: 184 IYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHL 5 +YFTHTDSRL NN +P SIQKLRC+ NY ALKYSA +EELG TLVSRM+ NGNPYLALHL Sbjct: 344 MYFTHTDSRLDNNDIPRSIQKLRCRANYRALKYSAPVEELGNTLVSRMQQNGNPYLALHL 403 Query: 4 R 2 R Sbjct: 404 R 404 >gb|EOY30276.1| O-fucosyltransferase family protein isoform 2 [Theobroma cacao] Length = 564 Score = 444 bits (1142), Expect = e-122 Identities = 243/421 (57%), Positives = 279/421 (66%), Gaps = 5/421 (1%) Frame = -2 Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070 +DGVS QRV+SPRFSGPMTRRA SFKR Sbjct: 13 SDGVS-QRVNSPRFSGPMTRRASSFKRGNGNSQTTNSNNALGSGNGNNNGSNGNNLSVHH 71 Query: 1069 NV---LNSPRSEVNS-NSVSDDGFDSFVEKKQSHLSKLTQRVHLKK-NIGSFNVDXXXXX 905 + +NSPRSE + SVS DG +R L+K ++GS +D Sbjct: 72 EIDLPINSPRSETGAAGSVSIDGLSQ-------------RRGFLRKPSVGSMVLDFGLKE 118 Query: 904 XXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEE 725 HWMF VFCG C FLGV K C GWFGSAIE +Q L + I+ ++++ Sbjct: 119 RKKLG----HWMFLVFCGVCLFLGVFKICATGWFGSAIETVTSNQGLSDISINRPKRIDQ 174 Query: 724 SSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQ 545 S D +R + GSD +RTLM V S V + SGIWS P+S N+T+CID Sbjct: 175 GSHDYGYR-------EEGSDSDRTLMTVPSDVT-------EDSGIWSLPNSENFTKCIDH 220 Query: 544 FKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESG 365 K KKL+ TNG++L+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWADESG Sbjct: 221 SKNQKKLDAKTNGYILVNANGGLNQMRFGICDMVAVAKVMKATLVLPSLDHTSYWADESG 280 Query: 364 FKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKV 185 FKDLFDW HFMETLKDD+HIV+ +PP YAGIEPFNKTPISWSKVSYY EVLPLLKQHKV Sbjct: 281 FKDLFDWHHFMETLKDDVHIVERIPPAYAGIEPFNKTPISWSKVSYYNAEVLPLLKQHKV 340 Query: 184 
IYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHL 5 IYFTHTDSRLANN +P SIQKLRC+VNY ALKYSA IEELG TL+SRM+ NG+PYLALHL Sbjct: 341 IYFTHTDSRLANNDIPSSIQKLRCRVNYRALKYSAPIEELGNTLISRMRQNGSPYLALHL 400 Query: 4 R 2 R Sbjct: 401 R 401 >gb|EOY30275.1| O-fucosyltransferase family protein isoform 1 [Theobroma cacao] Length = 626 Score = 444 bits (1142), Expect = e-122 Identities = 243/421 (57%), Positives = 279/421 (66%), Gaps = 5/421 (1%) Frame = -2 Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070 +DGVS QRV+SPRFSGPMTRRA SFKR Sbjct: 13 SDGVS-QRVNSPRFSGPMTRRASSFKRGNGNSQTTNSNNALGSGNGNNNGSNGNNLSVHH 71 Query: 1069 NV---LNSPRSEVNS-NSVSDDGFDSFVEKKQSHLSKLTQRVHLKK-NIGSFNVDXXXXX 905 + +NSPRSE + SVS DG +R L+K ++GS +D Sbjct: 72 EIDLPINSPRSETGAAGSVSIDGLSQ-------------RRGFLRKPSVGSMVLDFGLKE 118 Query: 904 XXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEE 725 HWMF VFCG C FLGV K C GWFGSAIE +Q L + I+ ++++ Sbjct: 119 RKKLG----HWMFLVFCGVCLFLGVFKICATGWFGSAIETVTSNQGLSDISINRPKRIDQ 174 Query: 724 SSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQ 545 S D +R + GSD +RTLM V S V + SGIWS P+S N+T+CID Sbjct: 175 GSHDYGYR-------EEGSDSDRTLMTVPSDVT-------EDSGIWSLPNSENFTKCIDH 220 Query: 544 FKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESG 365 K KKL+ TNG++L+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWADESG Sbjct: 221 SKNQKKLDAKTNGYILVNANGGLNQMRFGICDMVAVAKVMKATLVLPSLDHTSYWADESG 280 Query: 364 FKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKV 185 FKDLFDW HFMETLKDD+HIV+ +PP YAGIEPFNKTPISWSKVSYY EVLPLLKQHKV Sbjct: 281 FKDLFDWHHFMETLKDDVHIVERIPPAYAGIEPFNKTPISWSKVSYYNAEVLPLLKQHKV 340 Query: 184 IYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHL 5 IYFTHTDSRLANN +P SIQKLRC+VNY ALKYSA IEELG TL+SRM+ NG+PYLALHL Sbjct: 341 IYFTHTDSRLANNDIPSSIQKLRCRVNYRALKYSAPIEELGNTLISRMRQNGSPYLALHL 400 Query: 4 R 2 R Sbjct: 401 R 401 >ref|XP_006283281.1| hypothetical protein CARUB_v10004322mg [Capsella rubella] gi|482551986|gb|EOA16179.1| hypothetical 
protein CARUB_v10004322mg [Capsella rubella] Length = 659 Score = 440 bits (1131), Expect = e-121 Identities = 242/430 (56%), Positives = 285/430 (66%), Gaps = 15/430 (3%) Frame = -2 Query: 1246 DGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH--- 1076 DGV Q V+SPRFSGPMTRRA SFKR Sbjct: 12 DGVP-QHVNSPRFSGPMTRRAQSFKRGGSGGGGTSSNSHVGVSDNIGINNNNNTSSSSST 70 Query: 1075 -----EINV-LNSPRSEVNSNSVSDD---GFDSFVEKKQSHLSKLTQRVH---LKKNIGS 932 EI++ LNSPRSE+ S D GFDS V +K +L +RV L+K +GS Sbjct: 71 LRVHHEIDLPLNSPRSEIVSGGSGSDPSGGFDSAVNRKHQTYGQLRERVVKGLLRKPMGS 130 Query: 931 FNVDXXXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESP 752 D HWMF+ FCG C F+GV K C GW GSAI+ DQ+L S Sbjct: 131 VVSDFSLKERKKLG----HWMFFAFCGVCLFMGVFKICATGWLGSAIDSAASDQDLSNS- 185 Query: 751 IDYLSKMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDS 572 I ++ ++ SS D ++ D G+DV+ TL+MVAS VVG QN+V +++G+W+KP+S Sbjct: 186 IPRVNLLDHSSHDYIYK-------DGGNDVDPTLVMVASDVVGDQNSVVEYTGVWAKPES 238 Query: 571 GNYTQCIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDH 392 N++QCID + KKL NTNG+LL+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH Sbjct: 239 ANFSQCIDSSRSRKKLNANTNGYLLINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDH 298 Query: 391 TSYWADESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEV 212 +SYWAD+SGFKDLFDWQHF+E LKDDIHIV+ LP E A EPF KTPISWSKV YYK EV Sbjct: 299 SSYWADDSGFKDLFDWQHFIEELKDDIHIVESLPSELALTEPFVKTPISWSKVGYYKKEV 358 Query: 211 LPLLKQHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHN 32 LPLLKQH V+Y THTDSRLANN LP S+QKLRC+VNY ALKYSA IEELG LVSRM+ + Sbjct: 359 LPLLKQHIVMYLTHTDSRLANNDLPDSVQKLRCRVNYKALKYSAPIEELGNILVSRMRED 418 Query: 31 GNPYLALHLR 2 PYLALHLR Sbjct: 419 RGPYLALHLR 428 >gb|ESW26581.1| hypothetical protein PHAVU_003G131300g [Phaseolus vulgaris] Length = 617 Score = 437 bits (1124), Expect = e-120 Identities = 239/417 (57%), Positives = 280/417 (67%), Gaps = 1/417 (0%) Frame = -2 Query: 1249 TDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHEI 1070 +DGVS QRV+SPRFSGPMTRRAHSFKR E+ Sbjct: 14 
SDGVS-QRVNSPRFSGPMTRRAHSFKRNTDGTNSNGGSG-------------------EV 53 Query: 1069 NV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVHLKKNIGSFNVDXXXXXXXXX 893 + +NSPRSE + V + + + +TQRVH++ + Sbjct: 54 ELQINSPRSEEALEGIP-------VGRHSHNHNHVTQRVHVRSLLKKPLASIVEDLGFRE 106 Query: 892 XXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESPIDYLSKMEESSRD 713 GH MF VFCG C F+GVLK C GW GSAIE+ D+ L +S I L+ M++SS Sbjct: 107 RKKIGHLMFLVFCGVCIFIGVLKICATGWLGSAIERAQSDKELPDS-IASLNLMDKSSLG 165 Query: 712 DPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDSGNYTQCIDQFKRH 533 +R SDVERTL +A+GV S A+A+ SG WSKP+S N+TQCID Sbjct: 166 YAYRGG-------ASDVERTLKTLATGVGDSHTAMAEDSGTWSKPNSDNFTQCIDLPSNR 218 Query: 532 KKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDHTSYWADESGFKDL 353 KKL+ NG++++NANGGLNQMRFGICDMVAVAK MKATLVLPSLDHTSYWAD+SGFKDL Sbjct: 219 KKLDAKINGYIVVNANGGLNQMRFGICDMVAVAKIMKATLVLPSLDHTSYWADDSGFKDL 278 Query: 352 FDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEVLPLLKQHKVIYFT 173 FDW+HF+ LKDD+HIV+ LPP YAGIEPF KTPISWSKV YYKTEVLPLLKQHKVIYFT Sbjct: 279 FDWKHFIHMLKDDVHIVEKLPPAYAGIEPFPKTPISWSKVPYYKTEVLPLLKQHKVIYFT 338 Query: 172 HTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHNGNPYLALHLR 2 HTDSRLANN +P SIQKLRC+VNY ALKYSA IEE G TLVSRM+ NG+ YLALHLR Sbjct: 339 HTDSRLANNDIPHSIQKLRCRVNYRALKYSAPIEEFGNTLVSRMQQNGSSYLALHLR 395 >ref|XP_002326282.1| predicted protein [Populus trichocarpa] Length = 648 Score = 437 bits (1123), Expect = e-120 Identities = 247/436 (56%), Positives = 287/436 (65%), Gaps = 18/436 (4%) Frame = -2 Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076 +A+DGVS QRV+SPRFSGPMTRRAHSFKR + Sbjct: 12 SASDGVS-QRVNSPRFSGPMTRRAHSFKRNNTSSNNNSNAGNANSSNNGSNNVSNGNSNN 70 Query: 1075 -------EINV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVH---------LK 947 EI++ LNSPRSE + DGF+ +Q+ L+QRVH K Sbjct: 71 SILSPHLEIDLPLNSPRSE------TVDGFERESHSRQN----LSQRVHGGVVRILTNKK 120 Query: 946 KNIGSFNVDXXXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQN 767 +IGS +D HWMF+ FCG C FLGV K C+ GWFGS +E+ +Q Sbjct: 121 
GSIGSVILDFGFKERKKLG----HWMFFFFCGLCLFLGVFKICLYGWFGSTLERAASNQV 176 Query: 766 LYESPIDYLSKMEESSRDD-PHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGI 590 L+ ID + +D + SEND +R ++ V S VV N A+ SGI Sbjct: 177 LHL--IDVFGSITRQEQDSYRYMGSENDQ-------KRMIIEVGSDVVDRLNKKAEFSGI 227 Query: 589 WSKPDSGNYTQCIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLV 410 WSKP+S N+TQCIDQ HKKL TNG++L+NANGGLNQMRFGICDMVAVAK MKATLV Sbjct: 228 WSKPNSENFTQCIDQPGNHKKLGARTNGYILINANGGLNQMRFGICDMVAVAKIMKATLV 287 Query: 409 LPSLDHTSYWADESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVS 230 LPSLDHTSYWAD+SGFKDLF+WQHF++TLKDD+HIV+ LPP Y GIEPFNKT ISWSKV Sbjct: 288 LPSLDHTSYWADDSGFKDLFNWQHFIDTLKDDVHIVEKLPPAYDGIEPFNKTLISWSKVH 347 Query: 229 YYKTEVLPLLKQHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLV 50 YYKTEVLPLLKQHKVIYFTHTDSRLANNGL SIQKLRC+ NY ALKYS IEELG TLV Sbjct: 348 YYKTEVLPLLKQHKVIYFTHTDSRLANNGLSDSIQKLRCRANYRALKYSKPIEELGNTLV 407 Query: 49 SRMKHNGNPYLALHLR 2 SRM+ NG+ YLALHLR Sbjct: 408 SRMRENGSRYLALHLR 423 >ref|XP_006381630.1| hypothetical protein POPTR_0006s14490g [Populus trichocarpa] gi|550336338|gb|ERP59427.1| hypothetical protein POPTR_0006s14490g [Populus trichocarpa] Length = 648 Score = 435 bits (1118), Expect = e-119 Identities = 246/436 (56%), Positives = 286/436 (65%), Gaps = 18/436 (4%) Frame = -2 Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076 +A+DGVS QRV+SPRFSGPMTRRAHSFKR + Sbjct: 12 SASDGVS-QRVNSPRFSGPMTRRAHSFKRNNTSSNNNSNAGNANSSNNGSNNVSNGNSNN 70 Query: 1075 -------EINV-LNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKLTQRVH---------LK 947 EI++ LNSPRSE + DGF+ +Q+ L+QRVH K Sbjct: 71 SILSPHLEIDLPLNSPRSE------TVDGFERESHSRQN----LSQRVHGGVVRILTNKK 120 Query: 946 KNIGSFNVDXXXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQN 767 +IGS +D HWMF+ FCG C FLGV K C+ GWFGS +E+ +Q Sbjct: 121 GSIGSVILDFGFKERKKLG----HWMFFFFCGLCLFLGVFKICLYGWFGSTLERAASNQV 176 Query: 766 LYESPIDYLSKMEESSRDD-PHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGI 590 + ID + +D + SEND +R ++ V S VV N A+ SGI Sbjct: 177 
THL--IDVFGSITRQEQDSYRYMGSENDQ-------KRMIIEVGSDVVDRLNKKAEFSGI 227 Query: 589 WSKPDSGNYTQCIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLV 410 WSKP+S N+TQCIDQ HKKL TNG++L+NANGGLNQMRFGICDMVAVAK MKATLV Sbjct: 228 WSKPNSENFTQCIDQPGNHKKLGARTNGYILINANGGLNQMRFGICDMVAVAKIMKATLV 287 Query: 409 LPSLDHTSYWADESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVS 230 LPSLDHTSYWAD+SGFKDLF+WQHF++TLKDD+HIV+ LPP Y GIEPFNKT ISWSKV Sbjct: 288 LPSLDHTSYWADDSGFKDLFNWQHFIDTLKDDVHIVEKLPPAYDGIEPFNKTLISWSKVH 347 Query: 229 YYKTEVLPLLKQHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLV 50 YYKTEVLPLLKQHKVIYFTHTDSRLANNGL SIQKLRC+ NY ALKYS IEELG TLV Sbjct: 348 YYKTEVLPLLKQHKVIYFTHTDSRLANNGLSDSIQKLRCRANYRALKYSKPIEELGNTLV 407 Query: 49 SRMKHNGNPYLALHLR 2 SRM+ NG+ YLALHLR Sbjct: 408 SRMRENGSRYLALHLR 423 >ref|XP_004508243.1| PREDICTED: uncharacterized protein At1g04910-like [Cicer arietinum] Length = 630 Score = 435 bits (1118), Expect = e-119 Identities = 242/430 (56%), Positives = 280/430 (65%), Gaps = 12/430 (2%) Frame = -2 Query: 1255 TATDGVSSQRVSSPRFSGPMTRRAHSFKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH 1076 T++DGVS QRV+SPRFSGPMTRRAHSFKR Sbjct: 14 TSSDGVS-QRVNSPRFSGPMTRRAHSFKRNNTHNAAANNAVGGGGGAL------------ 60 Query: 1075 EINVLNSPRSEVNSNSVSDDGFDSFVEKKQSHLSKL----TQRVH-------LKKNIGSF 929 S SEV G + +E+K H L +QRVH LK+ + S Sbjct: 61 ------STHSEVELQK----GLEPALERKHGHHHHLHPHVSQRVHGGVVKAFLKRPLESI 110 Query: 928 NVDXXXXXXXXXXXXXGHWMFWVFCGACFFLGVLKFCINGWFGSAIEKGVYDQNLYESP- 752 D HWMF VFCG C F+GVLK C GW GSAIEK + L +S Sbjct: 111 VDDLGFRERKKIG----HWMFLVFCGVCLFMGVLKICATGWLGSAIEKAQSSKELSDSNG 166 Query: 751 IDYLSKMEESSRDDPHRASENDDGDRGSDVERTLMMVASGVVGSQNAVADHSGIWSKPDS 572 ID L+ M++SS +R+ DVERTL V + VV + S +WSKP+S Sbjct: 167 IDNLNLMDQSSLGYAYRSG-------AGDVERTLKTVQTRVV---SFFIQESDVWSKPNS 216 Query: 571 GNYTQCIDQFKRHKKLEENTNGFLLMNANGGLNQMRFGICDMVAVAKTMKATLVLPSLDH 392 N+TQCID + HKKL+ TNG++L+NANGGLNQMRFGICDMVAVAK MKATLVLPSLDH Sbjct: 217 ENFTQCIDLPRNHKKLDTKTNGYILINANGGLNQMRFGICDMVAVAKIMKATLVLPSLDH 276 Query: 391 
TSYWADESGFKDLFDWQHFMETLKDDIHIVDVLPPEYAGIEPFNKTPISWSKVSYYKTEV 212 TSYWAD+SGFKDLFDW+HF++TLKDDIHIV+ LPP Y GIEPF+KTPISWSKV YYKTE+ Sbjct: 277 TSYWADQSGFKDLFDWKHFIDTLKDDIHIVETLPPAYPGIEPFSKTPISWSKVPYYKTEI 336 Query: 211 LPLLKQHKVIYFTHTDSRLANNGLPPSIQKLRCKVNYIALKYSAQIEELGKTLVSRMKHN 32 LPLL HKVIYFTHTDSRLANNG+P SIQKLRC+VNY AL+YSA IEE G LVSRM+ N Sbjct: 337 LPLLNHHKVIYFTHTDSRLANNGIPKSIQKLRCRVNYRALRYSAPIEEFGNILVSRMQQN 396 Query: 31 GNPYLALHLR 2 GNPYLALHLR Sbjct: 397 GNPYLALHLR 406