BLASTX nr result
ID: Catharanthus22_contig00028449
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00028449 (943 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score   E
Sequences producing significant alignments:                       (bits) Value

gb|EOY29850.1| Core-2/I-branching beta-1,6-N-acetylglucosaminylt...  377 e-102
ref|XP_006343019.1| PREDICTED: uncharacterized protein LOC102599...  377 e-102
ref|XP_004510336.1| PREDICTED: uncharacterized protein LOC101496...  374 e-101
ref|XP_004236394.1| PREDICTED: uncharacterized protein LOC101257...  374 e-101
gb|ESW07591.1| hypothetical protein PHAVU_010G142800g [Phaseolus...  373 e-101
ref|XP_003529742.1| PREDICTED: uncharacterized protein LOC100793...  370 e-100
ref|XP_002276490.1| PREDICTED: uncharacterized protein LOC100266...  365 1e-98
ref|XP_003533024.1| PREDICTED: uncharacterized protein LOC100819...  364 2e-98
gb|EXC07668.1| hypothetical protein L484_003112 [Morus notabilis]    362 1e-97
ref|XP_002515894.1| conserved hypothetical protein [Ricinus comm...  359 7e-97
ref|XP_004141671.1| PREDICTED: uncharacterized protein LOC101216...  359 1e-96
gb|EOY27013.1| Core-2/I-branching beta-1,6-N-acetylglucosaminylt...  355 1e-95
ref|XP_002516477.1| conserved hypothetical protein [Ricinus comm...  355 1e-95
gb|EMJ16713.1| hypothetical protein PRUPE_ppa007018mg [Prunus pe...  350 6e-94
ref|XP_006450782.1| hypothetical protein CICLE_v10008613mg [Citr...  349 8e-94
ref|XP_006355313.1| PREDICTED: uncharacterized protein LOC102601...  348 2e-93
ref|XP_004297621.1| PREDICTED: uncharacterized protein LOC101304...
347 3e-93 gb|EXB66899.1| hypothetical protein L484_019537 [Morus notabilis] 345 1e-92 ref|XP_006466159.1| PREDICTED: uncharacterized protein LOC102619... 345 1e-92 ref|XP_002309520.1| hypothetical protein POPTR_0006s24970g [Popu... 345 2e-92 >gb|EOY29850.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein [Theobroma cacao] Length = 392 Score = 377 bits (969), Expect = e-102 Identities = 178/275 (64%), Positives = 213/275 (77%), Gaps = 3/275 (1%) Frame = -3 Query: 818 MQLRVGSHGSLEECKDPGTKVPNXXXXXXXXXXXXXXXXXXXXLMC---SIFSMYMIRYV 648 MQ RVGS GSL+E KDP V +C SI S+Y IR Sbjct: 1 MQTRVGSAGSLDEVKDP--VVTTRTSQSKTLPLRLFQLFGLFLALCIAFSIVSIYTIRRF 58 Query: 647 GSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFVSRIKDYPFKRV 468 G +VV+ ++ PC+ +PNSL W KPPS++LH MND EL WRA+FV RIK YPF R+ Sbjct: 59 GIYSVVTTVKSNFVPCVEEPNSLNRWIKPPSNLLHTMNDKELLWRASFVPRIKKYPFNRL 118 Query: 467 PKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSVFYGREIPSQMV 288 PKIAFMFLT+GPLP++PLWE+F G+E LYSIY+HSLP+Y ++FPPSSVFYGR+IPSQ+ Sbjct: 119 PKIAFMFLTKGPLPLSPLWERFLKGHEGLYSIYIHSLPSYNAEFPPSSVFYGRQIPSQVS 178 Query: 287 EWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSRSRYSFMGAFDE 108 EWGRMSMCDAERRLLANALLDISNEWF+LLSE+CIPL+NFSV+Y Y+ +S+YSF+GAFD+ Sbjct: 179 EWGRMSMCDAERRLLANALLDISNEWFILLSESCIPLYNFSVIYHYIKKSKYSFIGAFDD 238 Query: 107 PGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 PGPYGRGRY+ENMAPEV +TQWRKGSQWFEINR L Sbjct: 239 PGPYGRGRYNENMAPEVNITQWRKGSQWFEINRRL 273 >ref|XP_006343019.1| PREDICTED: uncharacterized protein LOC102599337 [Solanum tuberosum] Length = 386 Score = 377 bits (968), Expect = e-102 Identities = 177/272 (65%), Positives = 213/272 (78%) Frame = -3 Query: 818 MQLRVGSHGSLEECKDPGTKVPNXXXXXXXXXXXXXXXXXXXXLMCSIFSMYMIRYVGSQ 639 M+ RVGS GSLEE KD TKV ++ S+ SM M+RY Q Sbjct: 1 MKSRVGSQGSLEEGKDSVTKVVTQYKPLSIRLLQFLLLFLGIGIVFSLLSMCMVRYSVMQ 60 Query: 638 NVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFVSRIKDYPFKRVPKI 459 NV+ + ++ +PC + SLESW +PPSS+LH MNDT+LFWRA+ V +IK+YPF R KI Sbjct: 61 
NVIPMVQSRFQPCFQQL-SLESWIRPPSSLLHSMNDTQLFWRASIVPQIKEYPFNRTRKI 119 Query: 458 AFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSVFYGREIPSQMVEWG 279 AFMFLTRGP+P+APLWE+FF GNE+LYSIY+HSLP+YK DFPPSSVF GR++PSQ +WG Sbjct: 120 AFMFLTRGPIPLAPLWERFFKGNEKLYSIYIHSLPSYKPDFPPSSVFQGRQVPSQEAKWG 179 Query: 278 RMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSRSRYSFMGAFDEPGP 99 RMSMCDAERRLLANALLDISNEWF+LLSEACIPLHNF +Y Y+S+SRYSFMGA DEPGP Sbjct: 180 RMSMCDAERRLLANALLDISNEWFILLSEACIPLHNFKAIYHYISKSRYSFMGAVDEPGP 239 Query: 98 YGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 +GRGRY+ NMAPEV +T+WRKGSQWFE+NR+L Sbjct: 240 FGRGRYNPNMAPEVNITEWRKGSQWFEVNRKL 271 >ref|XP_004510336.1| PREDICTED: uncharacterized protein LOC101496152 [Cicer arietinum] Length = 381 Score = 374 bits (959), Expect = e-101 Identities = 174/226 (76%), Positives = 196/226 (86%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFV 501 S SMYMIR+ G NV V ++ +PC KP +E+W KPPSS+LH MND ELFWRA+FV Sbjct: 41 SFLSMYMIRHFGIHNVAFV-QSSFKPCFQKPAIIENWFKPPSSLLHTMNDVELFWRASFV 99 Query: 500 SRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSV 321 RIK YPFKR PKIAFMFLT+GPLPMAPLWEKFF+G+E+LYSIYVHSLP+Y +DF SSV Sbjct: 100 PRIKSYPFKRTPKIAFMFLTKGPLPMAPLWEKFFNGHEKLYSIYVHSLPSYNADFSLSSV 159 Query: 320 FYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSR 141 FY R+IPSQ+ EWG MSMCDAERRLLANALLDISNEWFVLLSE+CIPL NFS+VY YLSR Sbjct: 160 FYQRQIPSQVAEWGMMSMCDAERRLLANALLDISNEWFVLLSESCIPLQNFSIVYRYLSR 219 Query: 140 SRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 SRYSFMGAFDEPGPYGRGRY+ENMAPE+ ++ WRKGSQWFEINREL Sbjct: 220 SRYSFMGAFDEPGPYGRGRYNENMAPEINMSDWRKGSQWFEINREL 265 >ref|XP_004236394.1| PREDICTED: uncharacterized protein LOC101257059 [Solanum lycopersicum] Length = 385 Score = 374 bits (959), Expect = e-101 Identities = 177/272 (65%), Positives = 213/272 (78%) Frame = -3 Query: 818 MQLRVGSHGSLEECKDPGTKVPNXXXXXXXXXXXXXXXXXXXXLMCSIFSMYMIRYVGSQ 639 M+ RVGS GSLEE KD TKV ++ S+ SMYM+RY Q Sbjct: 1 
MKSRVGSQGSLEEGKDSVTKVVTQYKPLSIRLLQFLLLFLGIGIVFSLLSMYMVRYSVMQ 60 Query: 638 NVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFVSRIKDYPFKRVPKI 459 NV+ + ++ +PC + SLESW +PPSS+LH MNDT+LFWRA+ V +IK+YPF R KI Sbjct: 61 NVIPMVQSRFQPCFQQL-SLESWIRPPSSLLHSMNDTQLFWRASIVPQIKEYPFNRTRKI 119 Query: 458 AFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSVFYGREIPSQMVEWG 279 AFMFLTRGP+P+APLWE+FF GNE LYSIY+HSLP+Y+ DFPPSSVF+GR+IPSQ +WG Sbjct: 120 AFMFLTRGPIPLAPLWERFFKGNE-LYSIYIHSLPSYRPDFPPSSVFHGRQIPSQEAKWG 178 Query: 278 RMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSRSRYSFMGAFDEPGP 99 RMSMCDAERRLLANALLDISNEWF+LLSEACIPLHNF +Y Y+S+SRYSFMGA DEPGP Sbjct: 179 RMSMCDAERRLLANALLDISNEWFILLSEACIPLHNFKAIYHYISKSRYSFMGAADEPGP 238 Query: 98 YGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 +GRGRY+ NM PEV +T+WRKGSQWFE+NR+L Sbjct: 239 FGRGRYNPNMVPEVNITEWRKGSQWFEVNRKL 270 >gb|ESW07591.1| hypothetical protein PHAVU_010G142800g [Phaseolus vulgaris] Length = 380 Score = 373 bits (958), Expect = e-101 Identities = 171/226 (75%), Positives = 196/226 (86%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFV 501 S SMYMIR+ G NV V T ++PC P ++E+W +P S +LH MNDTELFWRATFV Sbjct: 40 SFLSMYMIRHFGIHNVALVQST-IKPCFELPVTIENWIRPSSGLLHSMNDTELFWRATFV 98 Query: 500 SRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSV 321 R+K+YPF+R PKIAFMFLT+GPLPMAPLWEKFF GNERLYSIYVHSLP+Y +DFPPSSV Sbjct: 99 PRVKNYPFERTPKIAFMFLTKGPLPMAPLWEKFFKGNERLYSIYVHSLPSYSADFPPSSV 158 Query: 320 FYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSR 141 F+ R+IPSQ+ EWG MSMCDAERRLLANALLDISNEWF+L+SE+CIPL NFS+VY Y+SR Sbjct: 159 FHRRQIPSQVAEWGMMSMCDAERRLLANALLDISNEWFILVSESCIPLQNFSIVYRYISR 218 Query: 140 SRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 SRYSFMGA DEPGPYGRGRYDENMAPE+ ++ WRKGSQWFEINREL Sbjct: 219 SRYSFMGAVDEPGPYGRGRYDENMAPEINMSDWRKGSQWFEINREL 264 >ref|XP_003529742.1| PREDICTED: uncharacterized protein LOC100793448 [Glycine max] Length = 387 Score = 370 bits (951), Expect = e-100 Identities = 171/226 (75%), Positives = 195/226 
(86%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFV 501 S SMYMIR+ G NV V ++ +PC +P +LESW +P SS+LH MNDTELFWRA+FV Sbjct: 41 SFLSMYMIRHFGIHNVALV-QSSFKPCFEQPATLESWIRPRSSLLHTMNDTELFWRASFV 99 Query: 500 SRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSV 321 RIK YPFKR PKIAFMFLT+GPLPMAPLWEKFF G+E LYSIYVHSLP+Y +DF PSSV Sbjct: 100 PRIKSYPFKRTPKIAFMFLTKGPLPMAPLWEKFFRGHEGLYSIYVHSLPSYNADFSPSSV 159 Query: 320 FYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSR 141 FY R+IPSQ+ EWG MSMCDAERRLLANALLDISNEWF+LLSE+CIPL NFS+VY+Y++R Sbjct: 160 FYRRQIPSQVAEWGMMSMCDAERRLLANALLDISNEWFILLSESCIPLQNFSIVYLYIAR 219 Query: 140 SRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 SRYSFMGA DEPGPYGRGRYD NMAPE+ ++ WRKGSQWFEINREL Sbjct: 220 SRYSFMGAVDEPGPYGRGRYDGNMAPEINMSDWRKGSQWFEINREL 265 >ref|XP_002276490.1| PREDICTED: uncharacterized protein LOC100266878 [Vitis vinifera] Length = 380 Score = 365 bits (937), Expect = 1e-98 Identities = 163/225 (72%), Positives = 199/225 (88%) Frame = -3 Query: 677 IFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFVS 498 IFSM MIRY G ++VV VAR+ +PC + N+LE+W +PPS +LH MND+ELFWRA+FV Sbjct: 41 IFSMNMIRYFGVESVVPVARSHFQPCFEEANTLETWIRPPSKLLHSMNDSELFWRASFVP 100 Query: 497 RIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSVF 318 IK+YPF+R+PKIAFMF+T+GPLP++PLWE+FF G++ LYSIYVHSLP+Y +DFP SSVF Sbjct: 101 GIKNYPFRRIPKIAFMFMTKGPLPLSPLWERFFKGHKGLYSIYVHSLPSYDADFPASSVF 160 Query: 317 YGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSRS 138 Y R+IPSQ+VEWG MSMCDAERRLLANALLDI NEWF+LLSE+CIPLHNFS+VY YLSRS Sbjct: 161 YKRQIPSQVVEWGMMSMCDAERRLLANALLDIDNEWFILLSESCIPLHNFSIVYRYLSRS 220 Query: 137 RYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 RYSF+GAFDE P+GRGRY+ N+AP+V LT+WRKGSQWFE+NR+L Sbjct: 221 RYSFIGAFDEDSPFGRGRYNPNLAPQVNLTEWRKGSQWFEVNRKL 265 >ref|XP_003533024.1| PREDICTED: uncharacterized protein LOC100819579 [Glycine max] Length = 387 Score = 364 bits (935), Expect = 2e-98 Identities = 167/226 (73%), Positives = 
193/226 (85%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFV 501 S SMYMIR+ G NV ++ ++ +PC +P ++ESWT P S++LH MNDTELFWRA+FV Sbjct: 41 SFLSMYMIRHFGIHNV-ALLQSSFKPCFEQPANIESWTMPRSNLLHAMNDTELFWRASFV 99 Query: 500 SRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSV 321 RIK YPFKR PKIAFMFLT+GPLPMAPLWEKFF G+ RLYSIYVH LP+Y +DFPPSSV Sbjct: 100 PRIKSYPFKRTPKIAFMFLTKGPLPMAPLWEKFFKGHARLYSIYVHLLPSYNADFPPSSV 159 Query: 320 FYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSR 141 FY R+IPSQ+ EWG MSMCDAERRLLANALLDISNEWF+LLSE+CIPL NFS+VY Y++ Sbjct: 160 FYRRQIPSQVAEWGMMSMCDAERRLLANALLDISNEWFILLSESCIPLQNFSIVYRYIAH 219 Query: 140 SRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 SRYSFMGA DEPGPYGRGRYD NMAPE+ ++ WRKGSQWFEI REL Sbjct: 220 SRYSFMGAVDEPGPYGRGRYDGNMAPEINVSDWRKGSQWFEIKREL 265 >gb|EXC07668.1| hypothetical protein L484_003112 [Morus notabilis] Length = 384 Score = 362 bits (929), Expect = 1e-97 Identities = 163/231 (70%), Positives = 195/231 (84%), Gaps = 5/231 (2%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSV-----ARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFW 516 S+FSMY +Y QNV + A + +PC +P+S E W +PP+ +LH MND+ELFW Sbjct: 39 SVFSMYTTKYYKVQNVPPITQSVTAESVTKPCFEEPSSFEGWIRPPTKLLHTMNDSELFW 98 Query: 515 RATFVSRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDF 336 RA+ V RIK YPFKR+PKIAFMFLT+GPLP+APLWE+FF G+E LYSIY+HS+P+Y++DF Sbjct: 99 RASLVPRIKHYPFKRIPKIAFMFLTKGPLPLAPLWERFFKGHEGLYSIYIHSMPSYEADF 158 Query: 335 PPSSVFYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVY 156 PPSSVFY R+IPSQ+ EWGRMSMCDAERRLLANALLD+SNEWF+LLSE+CIPL NFS+VY Sbjct: 159 PPSSVFYRRQIPSQIAEWGRMSMCDAERRLLANALLDVSNEWFILLSESCIPLSNFSIVY 218 Query: 155 IYLSRSRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 Y+SRSRYSF+G+FDEPGPYGRGRY+ NMAPEV LT WRKG QWFEINR L Sbjct: 219 RYISRSRYSFLGSFDEPGPYGRGRYNRNMAPEVNLTDWRKGPQWFEINRVL 269 >ref|XP_002515894.1| conserved hypothetical protein [Ricinus communis] gi|223544799|gb|EEF46314.1| conserved hypothetical protein [Ricinus communis] Length = 385 
Score = 359 bits (922), Expect = 7e-97 Identities = 167/227 (73%), Positives = 194/227 (85%), Gaps = 3/227 (1%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNV--VSVARTGME-PCISKPNSLESWTKPPSSILHIMNDTELFWRA 510 SI SMYM RY G Q++ +V R+ + PC +PN+LE+W KPPS +LH MNDTELFWRA Sbjct: 42 SIISMYMFRYFGIQSIPAAAVERSNIIFPCFEEPNTLENWIKPPSDLLHKMNDTELFWRA 101 Query: 509 TFVSRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPP 330 +FV RIK+YPFKRVPKIAFMFLT+GPLP PLWE+FF G+E LYSIY+HSLP+Y +F Sbjct: 102 SFVPRIKEYPFKRVPKIAFMFLTKGPLPFVPLWERFFKGHEGLYSIYIHSLPSYVGNFSQ 161 Query: 329 SSVFYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIY 150 SSVFY R+IPSQ+VEWGRMSMCD ERRLLANALLDISNEWF+LLSEACIPLHNFS++Y Y Sbjct: 162 SSVFYRRQIPSQIVEWGRMSMCDGERRLLANALLDISNEWFILLSEACIPLHNFSIIYRY 221 Query: 149 LSRSRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINR 9 +SRSR+SFMG+FDE PYGRGRY+ NM PEVTL QWRKGSQWFE+NR Sbjct: 222 ISRSRHSFMGSFDENSPYGRGRYNWNMQPEVTLEQWRKGSQWFEVNR 268 >ref|XP_004141671.1| PREDICTED: uncharacterized protein LOC101216165 [Cucumis sativus] gi|449480606|ref|XP_004155943.1| PREDICTED: uncharacterized protein LOC101228844 [Cucumis sativus] Length = 392 Score = 359 bits (921), Expect = 1e-96 Identities = 156/226 (69%), Positives = 196/226 (86%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFV 501 S+ S++ ++Y G NVV VA++ + PC+ +P S+E W +PPSS++H MND EL WRA+F+ Sbjct: 52 SLASLHTVKYFGGPNVVPVAQSIIRPCLEEPASIERWIEPPSSLMHTMNDAELLWRASFI 111 Query: 500 SRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSV 321 R+K+YPFKRV KIAFMFLT+GPLP+APLWE+F G+E+ YSIY+H +P Y +DFPPSSV Sbjct: 112 PRVKNYPFKRVRKIAFMFLTKGPLPLAPLWERFLKGHEKFYSIYIHPMPHYVADFPPSSV 171 Query: 320 FYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSR 141 FYGR+IPS++ EWG+MSMCDAERRLLANALLDI+NEWF+LLSE+CIPLHNFS++Y Y+SR Sbjct: 172 FYGRQIPSKIAEWGKMSMCDAERRLLANALLDIANEWFILLSESCIPLHNFSIIYHYISR 231 Query: 140 SRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 SRYSFM +FDEPGP GRGRY+E+MAP V LT WRKGSQWFE+NREL Sbjct: 232 
SRYSFMSSFDEPGPIGRGRYNESMAPMVNLTNWRKGSQWFELNREL 277 >gb|EOY27013.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein [Theobroma cacao] Length = 386 Score = 355 bits (911), Expect = 1e-95 Identities = 157/226 (69%), Positives = 196/226 (86%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFV 501 SI SMY +R+ Q++ SVA + +P + NS+ESW +PPS++LH MNDTELFWRA+F Sbjct: 46 SIMSMYSVRFFVVQHIASVAPSTSQPLFQEANSIESWIRPPSNLLHTMNDTELFWRASFF 105 Query: 500 SRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSV 321 +IK+YPFKRVPKIAFMFLT+GPLP+APLW++FF G+E +SIYVH+LP+Y + +PPS+ Sbjct: 106 PQIKEYPFKRVPKIAFMFLTKGPLPLAPLWDRFFKGHEGRFSIYVHALPSYVAGYPPSAA 165 Query: 320 FYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSR 141 FY R+IPSQMVEWG+MSMC+AERRL+ANALLDISNEWF+LLSE+CIPLHNFS++Y Y+SR Sbjct: 166 FYRRQIPSQMVEWGKMSMCEAERRLIANALLDISNEWFILLSESCIPLHNFSIIYRYISR 225 Query: 140 SRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 SR+SFMG+FDE GPYGRGRY+ M PEVTL+QWRKGSQWFE+NR L Sbjct: 226 SRHSFMGSFDEAGPYGRGRYNPRMEPEVTLSQWRKGSQWFEVNRRL 271 >ref|XP_002516477.1| conserved hypothetical protein [Ricinus communis] gi|223544297|gb|EEF45818.1| conserved hypothetical protein [Ricinus communis] Length = 371 Score = 355 bits (911), Expect = 1e-95 Identities = 160/226 (70%), Positives = 195/226 (86%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFV 501 S S+Y I++ G NV++ + G +PC+ +PN L+ W KPPS+I+H MND EL WRATFV Sbjct: 31 STTSIYTIKHFGVYNVITTIKPGFQPCLEEPNDLDRWIKPPSNIVHKMNDEELLWRATFV 90 Query: 500 SRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSV 321 +IK YPF+RVPKIAFMFLT+GPLP+APLWE+F G+E LYSIYVHSLPT+++ FPPSSV Sbjct: 91 PKIKKYPFERVPKIAFMFLTKGPLPLAPLWERFLKGHEGLYSIYVHSLPTFEAKFPPSSV 150 Query: 320 FYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSR 141 F+ R+IPSQ+ EWG+MSMCDAERRLLANALLDISNE F+LLSE+CIPL+NFSV+Y Y+ + Sbjct: 151 FHRRQIPSQISEWGKMSMCDAERRLLANALLDISNERFILLSESCIPLYNFSVIYHYIMK 210 Query: 140 
SRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 SRYSF+GAFD+ GPYGRGRY+ENMAPEV +TQWRKGSQWFEINR L Sbjct: 211 SRYSFIGAFDDHGPYGRGRYNENMAPEVNITQWRKGSQWFEINRRL 256 >gb|EMJ16713.1| hypothetical protein PRUPE_ppa007018mg [Prunus persica] Length = 385 Score = 350 bits (897), Expect = 6e-94 Identities = 164/226 (72%), Positives = 190/226 (84%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFV 501 SI SM+ IR+ G Q+V A + + PC +PNSLES +PPS++LH MNDTEL W A+ Sbjct: 45 SILSMHTIRFFGVQHVAPTAPSTIRPCFEEPNSLESRIRPPSNLLHAMNDTELLWLASSA 104 Query: 500 SRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSV 321 +I DYPFKRVPKIAFMFLT+GPLPM PLWE+FF G++ LYSIYVHSLP+Y ++F PSSV Sbjct: 105 PQINDYPFKRVPKIAFMFLTKGPLPMEPLWERFFKGHKGLYSIYVHSLPSYNANFSPSSV 164 Query: 320 FYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSR 141 F+ R+IPSQ+ EWG MSMCDAERRLLANALLDISNEWFVLLSEACIPL+N S+VY YLSR Sbjct: 165 FHKRQIPSQVAEWGEMSMCDAERRLLANALLDISNEWFVLLSEACIPLYNLSIVYHYLSR 224 Query: 140 SRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 SRYSFMG+FDE GPYGRGRYD MAP V LT+WRKGSQWFEINR+L Sbjct: 225 SRYSFMGSFDEIGPYGRGRYDGRMAPLVNLTEWRKGSQWFEINRKL 270 >ref|XP_006450782.1| hypothetical protein CICLE_v10008613mg [Citrus clementina] gi|567917554|ref|XP_006450783.1| hypothetical protein CICLE_v10008613mg [Citrus clementina] gi|568844179|ref|XP_006475972.1| PREDICTED: uncharacterized protein LOC102614541 isoform X1 [Citrus sinensis] gi|568844181|ref|XP_006475973.1| PREDICTED: uncharacterized protein LOC102614541 isoform X2 [Citrus sinensis] gi|557554008|gb|ESR64022.1| hypothetical protein CICLE_v10008613mg [Citrus clementina] gi|557554009|gb|ESR64023.1| hypothetical protein CICLE_v10008613mg [Citrus clementina] Length = 385 Score = 349 bits (896), Expect = 8e-94 Identities = 154/226 (68%), Positives = 192/226 (84%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFV 501 S+ S+Y +R G Q+VV+ ++ C PN L+ W PPS+++H M+D ELFWRA+FV Sbjct: 45 
SLISIYSVRRFGVQSVVTTVKSSFVACPELPNGLDYWINPPSNLMHTMSDKELFWRASFV 104 Query: 500 SRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSV 321 R+K+YPFKRVPKIAFMFLT+GPLP+ PLWEKFF G+E LYSIYVHSLPT+++ FP SSV Sbjct: 105 PRVKEYPFKRVPKIAFMFLTKGPLPLGPLWEKFFKGHEGLYSIYVHSLPTFENKFPSSSV 164 Query: 320 FYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSR 141 FY R+IPSQ+ EWG+MSMCDAERRLLANALLDISNEWF+L+SE+CIPL+NFS++Y Y+ + Sbjct: 165 FYNRQIPSQISEWGKMSMCDAERRLLANALLDISNEWFILVSESCIPLYNFSLIYHYIKK 224 Query: 140 SRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 S++SFMG+FD+PGPYGRGRY+ NMAP V +TQWRKGSQWFEINR L Sbjct: 225 SKHSFMGSFDDPGPYGRGRYNANMAPVVNITQWRKGSQWFEINRRL 270 >ref|XP_006355313.1| PREDICTED: uncharacterized protein LOC102601627 [Solanum tuberosum] Length = 388 Score = 348 bits (893), Expect = 2e-93 Identities = 154/227 (67%), Positives = 197/227 (86%), Gaps = 1/227 (0%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPN-SLESWTKPPSSILHIMNDTELFWRATF 504 S+ S+YMI+Y G ++V R + PCI + + +LESW PPS++LH M+D EL WRA+ Sbjct: 47 SVASIYMIKYFGFHSIVPTIRPSLLPCIEEESKNLESWINPPSNLLHTMSDKELLWRASM 106 Query: 503 VSRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSS 324 V R+K YPFKRVPKIAFMFLT+GPLP+AP+WE+FF G++ YSIY+HSLP++++DFP SS Sbjct: 107 VPRVKKYPFKRVPKIAFMFLTKGPLPLAPIWERFFKGHQGFYSIYIHSLPSFEADFPASS 166 Query: 323 VFYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLS 144 VFY R+IPSQ+ EWG+MSMCDAERRLLANALLDISNEWFVLLSE+CIPL+NF+V+Y Y+S Sbjct: 167 VFYKRQIPSQVTEWGKMSMCDAERRLLANALLDISNEWFVLLSESCIPLYNFNVIYKYIS 226 Query: 143 RSRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 +S++SF+GAFD+PGPYGRGRYDENM PEV ++QWRKGSQWFE++R+L Sbjct: 227 QSKHSFVGAFDDPGPYGRGRYDENMLPEVNISQWRKGSQWFEMSRKL 273 >ref|XP_004297621.1| PREDICTED: uncharacterized protein LOC101304278 [Fragaria vesca subsp. 
vesca] Length = 383 Score = 347 bits (891), Expect = 3e-93 Identities = 170/272 (62%), Positives = 203/272 (74%) Frame = -3 Query: 818 MQLRVGSHGSLEECKDPGTKVPNXXXXXXXXXXXXXXXXXXXXLMCSIFSMYMIRYVGSQ 639 MQ RVG LEE KDPG + + SI SM+ IRY G Q Sbjct: 1 MQSRVGG---LEEGKDPGGSLLKTRALPFRLIQFLLMFVVLGLGV-SILSMHTIRYFGVQ 56 Query: 638 NVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFVSRIKDYPFKRVPKI 459 ++ + M C ++PN+LESW +PPSS+LH MNDTEL W A+ ++K+YPFKRVPKI Sbjct: 57 HMAPAEPSTMRSCFAEPNTLESWIRPPSSLLHNMNDTELLWLASMAPQVKEYPFKRVPKI 116 Query: 458 AFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSVFYGREIPSQMVEWG 279 AFMFLT+GPLPM PLWE+FF G+E LYSIYVHSLP+Y +F +SVFY R+IPS++ EWG Sbjct: 117 AFMFLTKGPLPMEPLWERFFKGHEGLYSIYVHSLPSYTPNFSATSVFYKRQIPSKLAEWG 176 Query: 278 RMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSRSRYSFMGAFDEPGP 99 M+MC+AERRLLANALLDISNEWF+LLSE+CIPL N S+VY YLSRSRYSFMG+FDE GP Sbjct: 177 EMNMCEAERRLLANALLDISNEWFILLSESCIPLSNLSIVYHYLSRSRYSFMGSFDEIGP 236 Query: 98 YGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 YGRGRY+E+MAP V L+ WRKGSQWFEINREL Sbjct: 237 YGRGRYNEHMAPLVNLSNWRKGSQWFEINREL 268 >gb|EXB66899.1| hypothetical protein L484_019537 [Morus notabilis] Length = 387 Score = 345 bits (886), Expect = 1e-92 Identities = 163/276 (59%), Positives = 211/276 (76%), Gaps = 4/276 (1%) Frame = -3 Query: 818 MQLRVGSHGSLEECKDPGTKVPNXXXXXXXXXXXXXXXXXXXXLMC---SIFSMYMIRYV 648 MQ RV LEE KDPG + + +C ++ S+Y +R+ Sbjct: 1 MQQRVSP---LEEGKDPGAII-SRTSQYKALPLRLLRLFVLFLALCVTFAVISIYTVRHF 56 Query: 647 GSQNVVSVARTGMEPCISKPNS-LESWTKPPSSILHIMNDTELFWRATFVSRIKDYPFKR 471 G V++ ++G +PC ++P S L+ W +PPS+++H MNDTEL WRA+F RI++YPFKR Sbjct: 57 GISTVMTTVKSGFQPCYAEPQSDLDQWIRPPSNLIHRMNDTELLWRASFAPRIRNYPFKR 116 Query: 470 VPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSVFYGREIPSQM 291 VPKIAFMFLT+GP+P+APLWE+FF G+E YSIYVHSLP++ FP +SVFYGR IPSQ+ Sbjct: 117 VPKIAFMFLTKGPMPLAPLWERFFKGHEGFYSIYVHSLPSFVPMFPSTSVFYGRHIPSQV 176 Query: 290 VEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSRSRYSFMGAFD 111 EWGRMSMCDAERRLLANALLDISNE FVLLSE+CIPL+NFS++Y Y+S+S++SFMGAFD 
Sbjct: 177 SEWGRMSMCDAERRLLANALLDISNERFVLLSESCIPLYNFSIIYHYISKSKFSFMGAFD 236 Query: 110 EPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 +PGPYGRGRY++NM PEV +++WRKGSQWFE+NR+L Sbjct: 237 DPGPYGRGRYNDNMLPEVNISRWRKGSQWFEVNRKL 272 >ref|XP_006466159.1| PREDICTED: uncharacterized protein LOC102619633 [Citrus sinensis] Length = 385 Score = 345 bits (885), Expect = 1e-92 Identities = 155/226 (68%), Positives = 188/226 (83%) Frame = -3 Query: 680 SIFSMYMIRYVGSQNVVSVARTGMEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFV 501 SI +Y ++ +N++ +A + + C+ + ++SW KPP+++LH MNDTELFWRA+FV Sbjct: 45 SIIGLYTTKFNAVRNIMPIAPSIINVCLEDVSDIKSWIKPPTNLLHNMNDTELFWRASFV 104 Query: 500 SRIKDYPFKRVPKIAFMFLTRGPLPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSV 321 RIK YPFKRVPKIAFMFLT+GPLP+APLWEKFF G+E LYSIYVH P Y F PSSV Sbjct: 105 PRIKKYPFKRVPKIAFMFLTKGPLPLAPLWEKFFKGHEGLYSIYVHPHPAYNGKFSPSSV 164 Query: 320 FYGREIPSQMVEWGRMSMCDAERRLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSR 141 FY R+IPSQ EWG MSMC+AERRLLANALLD+SNEWF+LLSE+CIPLHNFS+VY Y+S+ Sbjct: 165 FYRRQIPSQPAEWGEMSMCEAERRLLANALLDVSNEWFILLSESCIPLHNFSIVYYYISK 224 Query: 140 SRYSFMGAFDEPGPYGRGRYDENMAPEVTLTQWRKGSQWFEINREL 3 SRYSFM ++D+PGPYGRGRY+ NM PEVTL+QWRKGSQWFEINR L Sbjct: 225 SRYSFMESYDDPGPYGRGRYNGNMEPEVTLSQWRKGSQWFEINRRL 270 >ref|XP_002309520.1| hypothetical protein POPTR_0006s24970g [Populus trichocarpa] gi|222855496|gb|EEE93043.1| hypothetical protein POPTR_0006s24970g [Populus trichocarpa] Length = 385 Score = 345 bits (884), Expect = 2e-92 Identities = 162/263 (61%), Positives = 199/263 (75%), Gaps = 1/263 (0%) Frame = -3 Query: 788 LEECKDPGTKVPNXXXXXXXXXXXXXXXXXXXXLMC-SIFSMYMIRYVGSQNVVSVARTG 612 LEE KDP + M SI SMY I+ G Q + + Sbjct: 8 LEEGKDPAVSIKASQSKPFPIRLLQLFLLFLALCMAFSIISMYTIKRFGVQTARTTVKPA 67 Query: 611 MEPCISKPNSLESWTKPPSSILHIMNDTELFWRATFVSRIKDYPFKRVPKIAFMFLTRGP 432 EPC +P++L+ W +PPS++LH M+D ELFWRA+FV IK YPFKR+PKIAFMFLT+GP Sbjct: 68 FEPCFDEPDTLDRWIRPPSNLLHKMSDKELFWRASFVPGIKKYPFKRIPKIAFMFLTKGP 127 Query: 431 LPMAPLWEKFFSGNERLYSIYVHSLPTYKSDFPPSSVFYGREIPSQMVEWGRMSMCDAER 252 LP+APLWE+F G+E 
LYS+Y+H LPT+++ FP SSVF+ R+IPSQ+ EWGRMSMCDAER Sbjct: 128 LPLAPLWERFLKGHEGLYSVYIHPLPTFEAKFPSSSVFHRRQIPSQVAEWGRMSMCDAER 187 Query: 251 RLLANALLDISNEWFVLLSEACIPLHNFSVVYIYLSRSRYSFMGAFDEPGPYGRGRYDEN 72 RLLANALLDISNE FVL+SE+CIPL+NFSV+Y Y+ RS+YSF+GAFD+ GPYGRGRY+EN Sbjct: 188 RLLANALLDISNERFVLVSESCIPLYNFSVIYDYMMRSKYSFIGAFDDHGPYGRGRYNEN 247 Query: 71 MAPEVTLTQWRKGSQWFEINREL 3 MAPEV +TQWRKGSQWFEINR+L Sbjct: 248 MAPEVNITQWRKGSQWFEINRKL 270