BLASTX nr result
ID: Catharanthus23_contig00024694
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00024694
         (943 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_004243452.1| PREDICTED: uncharacterized protein LOC101248...   243   8e-62
ref|XP_006360583.1| PREDICTED: uncharacterized protein LOC102586...   242   1e-61
ref|XP_006469726.1| PREDICTED: uncharacterized protein LOC102620...   230   5e-58
ref|XP_006447478.1| hypothetical protein CICLE_v10015015mg [Citr...   230   5e-58
ref|XP_006447476.1| hypothetical protein CICLE_v10015015mg [Citr...   230   5e-58
emb|CAN76207.1| hypothetical protein VITISV_043112 [Vitis vinifera]   227   6e-57
ref|XP_006372975.1| hypothetical protein POPTR_0017s06680g [Popu...   225   2e-56
ref|XP_002327928.1| predicted protein [Populus trichocarpa]           225   2e-56
ref|XP_002274405.2| PREDICTED: uncharacterized protein LOC100246...   224   3e-56
gb|EXC23146.1| hypothetical protein L484_018277 [Morus notabilis]     222   2e-55
ref|XP_006414552.1| hypothetical protein EUTSA_v10025038mg [Eutr...   222   2e-55
ref|XP_006414551.1| hypothetical protein EUTSA_v10025038mg [Eutr...   222   2e-55
gb|EOX99163.1| Uncharacterized protein isoform 5 [Theobroma cacao]    220   7e-55
gb|EOX99161.1| Uncharacterized protein isoform 3 [Theobroma cacao]    220   7e-55
gb|EOX99159.1| Uncharacterized protein isoform 1 [Theobroma caca...   220   7e-55
ref|XP_006283625.1| hypothetical protein CARUB_v10004682mg [Caps...   219   9e-55
ref|NP_193259.2| uncharacterized protein [Arabidopsis thaliana] ...   219   1e-54
emb|CAB10303.1| hypothetical protein [Arabidopsis thaliana] gi|7...   219   1e-54
ref|XP_002868218.1| hypothetical protein ARALYDRAFT_493365 [Arab...   219   2e-54
ref|XP_002515723.1| transferase, transferring glycosyl groups, p...   216   8e-54

>ref|XP_004243452.1| PREDICTED: uncharacterized protein LOC101248314 isoform 1 [Solanum lycopersicum]
 gi|460395768|ref|XP_004243453.1| PREDICTED: uncharacterized protein LOC101248314 isoform 2 [Solanum lycopersicum]
 Length = 483

 Score = 243 bits (620), Expect = 8e-62
 Identities = 122/225 (54%), Positives = 154/225 (68%), Gaps = 2/225 (0%)
 Frame = -1

Query: 670 SRAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAIR 491
+R +I LWY PNST VF+D P+ ++ P +++SSDTS+FPY+FP G SAIR
Sbjct: 86 NRLSYINLWYKPNSTN-AVVFLDDPISISTLSVSSPPILVSSDTSKFPYSFPAGRRSAIR 144

Query: 490 IARVVKDVFTLANPPPH--IRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYE 317
IAR+VKD F L RWFVFGDDDTVFF NL+ VLSKYD +KW+Y+G+ SES+E
Sbjct: 145 IARIVKDTFDLVKNVNFNDTRWFVFGDDDTVFFTDNLVRVLSKYDCEKWYYVGYNSESFE 204

Query: 316 QNEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTR 137
QNEK A+VLA VLDSCL+RYPHLYGSDSRIF+C++ELGV LT
Sbjct: 205 QNEKYSFDMAFGGGGFALSAPLAKVLARVLDSCLIRYPHLYGSDSRIFSCVAELGVHLTH 264

Query: 136 EPGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
EPGFHQ+DVRG+LFG+ H++ +EP+FPGM++IQ
Sbjct: 265 EPGFHQVDVRGNLFGILAAHPLSPLLSLHHLDVVEPLFPGMTRIQ 309

>ref|XP_006360583.1| PREDICTED: uncharacterized protein LOC102586004 [Solanum tuberosum]
 Length = 483

 Score = 242 bits (618), Expect = 1e-61
 Identities = 122/225 (54%), Positives = 154/225 (68%), Gaps = 2/225 (0%)
 Frame = -1

Query: 670 SRAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAIR 491
+R +I LWY PNST VF+D P+ ++ P +++SSDTS+FPY+FP G SAIR
Sbjct: 86 NRLSYINLWYKPNSTN-AVVFLDDPISISTLSVSSPPILVSSDTSKFPYSFPAGRRSAIR 144

Query: 490 IARVVKDVFTLANPPPH--IRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYE 317
IAR+VKD F L RWFVFGDDDTVFF NL+ VLSKYD +KW+Y+G+ SES+E
Sbjct: 145 IARIVKDTFDLVKNVNFNDTRWFVFGDDDTVFFTDNLVRVLSKYDCEKWYYVGYNSESFE 204

Query: 316 QNEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTR 137
QNEK A+ LA VLDSCL+RYPHLYGSDSRIF+C++ELGV LTR
Sbjct: 205 QNEKYSFDMAFGGGGFALSAPLAKGLARVLDSCLIRYPHLYGSDSRIFSCVAELGVHLTR 264

Query: 136 EPGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
EPGFHQ+DVRG+LFG+ H++ +EP+FPGM++IQ
Sbjct: 265 EPGFHQVDVRGNLFGILAAHPLSPLLSLHHLDVVEPLFPGMTRIQ 309

>ref|XP_006469726.1| PREDICTED: uncharacterized protein LOC102620781 isoform X4 [Citrus sinensis]
 Length = 392

 Score = 230 bits (587), Expect = 5e-58
 Identities = 114/225 (50%), Positives = 147/225 (65%), Gaps = 1/225 (0%)
 Frame = -1

Query: 673 PSRAQFIRLWYSPNSTPYTYVFIDKPLQKPMS-TLHFPSVILSSDTSRFPYTFPRGNPSA 497
P R ++RLWYSPNST F+D+ + P +++S+DTS+FP+TFP+G SA
Sbjct: 97 PRRRSYVRLWYSPNSTR-ALTFLDRAADSSSAGDPSLPRIVISADTSKFPFTFPKGLRSA 155

Query: 496 IRIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYE 317
+R+ARVVK+ L + +RWFVFGDDDTVFF NL+ LSKYD +WFY+G SE YE
Sbjct: 156 VRVARVVKEAVDLTDEKAGVRWFVFGDDDTVFFVDNLVKTLSKYDDDRWFYVGSNSEGYE 215

Query: 316 QNEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTR 137
QN K ARVLAG LDSCL+RY HLYGSD+R+F+CL ELGV LT
Sbjct: 216 QNAKHSFGMAFGGGGFAISHSLARVLAGALDSCLMRYAHLYGSDARVFSCLVELGVGLTP 275

Query: 136 EPGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
EPGFHQ+D+RGD+FGM H++A++PIFP M++ Q
Sbjct: 276 EPGFHQLDMRGDMFGMLSAHPLSPLLSLHHLDAIDPIFPNMNRTQ 320

>ref|XP_006447478.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|567910331|ref|XP_006447479.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|567910335|ref|XP_006447481.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|567910341|ref|XP_006447484.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|557550089|gb|ESR60718.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|557550090|gb|ESR60719.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|557550092|gb|ESR60721.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|557550095|gb|ESR60724.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 Length = 496

 Score = 230 bits (587), Expect = 5e-58
 Identities = 114/225 (50%), Positives = 147/225 (65%), Gaps = 1/225 (0%)
 Frame = -1

Query: 673 PSRAQFIRLWYSPNSTPYTYVFIDKPLQKPMS-TLHFPSVILSSDTSRFPYTFPRGNPSA 497
P R ++RLWYSPNST F+D+ + P +++S+DTS+FP+TFP+G SA
Sbjct: 97 PRRRSYVRLWYSPNSTR-ALTFLDRAADSSSAGDPSLPRIVISADTSKFPFTFPKGLRSA 155

Query: 496 IRIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYE 317
+R+ARVVK+ L + +RWFVFGDDDTVFF NL+ LSKYD +WFY+G SE YE
Sbjct: 156 VRVARVVKEAVDLTDEKAGVRWFVFGDDDTVFFVDNLVKTLSKYDDDRWFYVGSNSEGYE 215

Query: 316 QNEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTR 137
QN K ARVLAG LDSCL+RY HLYGSD+R+F+CL ELGV LT
Sbjct: 216 QNAKHSFGMAFGGGGFAISHSLARVLAGALDSCLMRYAHLYGSDARVFSCLVELGVGLTP 275

Query: 136 EPGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
EPGFHQ+D+RGD+FGM H++A++PIFP M++ Q
Sbjct: 276 EPGFHQLDMRGDMFGMLSAHPLSPLLSLHHLDAIDPIFPNMNRTQ 320

>ref|XP_006447476.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|567910327|ref|XP_006447477.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|567910333|ref|XP_006447480.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|567910337|ref|XP_006447482.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|567910339|ref|XP_006447483.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|567910343|ref|XP_006447485.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|568830904|ref|XP_006469723.1| PREDICTED: uncharacterized protein LOC102620781 isoform X1 [Citrus sinensis]
 gi|568830906|ref|XP_006469724.1| PREDICTED: uncharacterized protein LOC102620781 isoform X2 [Citrus sinensis]
 gi|568830908|ref|XP_006469725.1| PREDICTED: uncharacterized protein LOC102620781 isoform X3 [Citrus sinensis]
 gi|557550087|gb|ESR60716.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|557550088|gb|ESR60717.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|557550091|gb|ESR60720.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|557550093|gb|ESR60722.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|557550094|gb|ESR60723.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 gi|557550096|gb|ESR60725.1| hypothetical protein CICLE_v10015015mg [Citrus clementina]
 Length = 494

 Score = 230 bits (587), Expect = 5e-58
 Identities = 114/225 (50%), Positives = 147/225 (65%), Gaps = 1/225 (0%)
 Frame = -1

Query: 673 PSRAQFIRLWYSPNSTPYTYVFIDKPLQKPMS-TLHFPSVILSSDTSRFPYTFPRGNPSA 497
P R ++RLWYSPNST F+D+ + P +++S+DTS+FP+TFP+G SA
Sbjct: 97 PRRRSYVRLWYSPNSTR-ALTFLDRAADSSSAGDPSLPRIVISADTSKFPFTFPKGLRSA 155

Query: 496 IRIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYE 317
+R+ARVVK+ L + +RWFVFGDDDTVFF NL+ LSKYD +WFY+G SE YE
Sbjct: 156 VRVARVVKEAVDLTDEKAGVRWFVFGDDDTVFFVDNLVKTLSKYDDDRWFYVGSNSEGYE 215

Query: 316 QNEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTR 137
QN K ARVLAG LDSCL+RY HLYGSD+R+F+CL ELGV LT
Sbjct: 216 QNAKHSFGMAFGGGGFAISHSLARVLAGALDSCLMRYAHLYGSDARVFSCLVELGVGLTP 275

Query: 136 EPGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
EPGFHQ+D+RGD+FGM H++A++PIFP M++ Q
Sbjct: 276 EPGFHQLDMRGDMFGMLSAHPLSPLLSLHHLDAIDPIFPNMNRTQ 320

>emb|CAN76207.1| hypothetical protein VITISV_043112 [Vitis vinifera]
 Length = 1587

 Score = 227 bits (578), Expect = 6e-57
 Identities = 115/225 (51%), Positives = 147/225 (65%)
 Frame = -1

Query: 676 LPSRAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSA 497
L RA ++RLW +++ +F+D P S P ++LS DTSRFPYTF RG PSA
Sbjct: 61 LGRRAPYLRLW---SNSARAILFLDSPPPPDPSFAALPPIVLSGDTSRFPYTFRRGLPSA 117

Query: 496 IRIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYE 317
+R+AR++K+ + IRWFVFGDDDTVFF NL+ LSKYDH +WFYIG SESYE
Sbjct: 118 VRVARIIKEA--VDRNESDIRWFVFGDDDTVFFVDNLVRTLSKYDHDQWFYIGSSSESYE 175

Query: 316 QNEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTR 137
QNE AR LAGV DSCL+RYPHL+GSD+RIF+CL+ELGV LT
Sbjct: 176 QNESNSFDMAFGGGGFALSHSLARALAGVFDSCLMRYPHLFGSDARIFSCLAELGVGLTH 235

Query: 136 EPGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
EPGFHQ+D+RG+LFGM H+++++PIFP M++ Q
Sbjct: 236 EPGFHQVDIRGNLFGMLSAHPLSPLVSLHHLDSVDPIFPNMNRTQ 280

 Score = 196 bits (499), Expect = 8e-48
 Identities = 106/225 (47%), Positives = 133/225 (59%), Gaps = 3/225 (1%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFIDKPLQKPMS---TLHFPSVILSSDTSRFPYTFPRGNPSA 497
+ +++ W+ P VF+D S P V +S DTSRF YT+ G PSA
Sbjct: 613 KKNYVKHWWKPQQMRGC-VFVDSMPGNESSYNDNSSLPPVCISEDTSRFRYTYRHGLPSA 671

Query: 496 IRIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYE 317
IR+A VV + T+A +RWFVFGDDDT+FF NL+ LSKYDH+ W+YIG SE YE
Sbjct: 672 IRVAHVVSE--TVALNHSGVRWFVFGDDDTIFFPENLVKTLSKYDHELWYYIGTNSEIYE 729

Query: 316 QNEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTR 137
QN A+VLA V DSCL RYPHLYGSDSR++ CL+ELGV LTR
Sbjct: 730 QNRVFSFDMAFGGAGFAISYPLAKVLAKVFDSCLERYPHLYGSDSRVYTCLAELGVGLTR 789

Query: 136 EPGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
EPGFHQ+DVRGD FG+ H++ ++PIFP M+ Q
Sbjct: 790 EPGFHQVDVRGDTFGLLAAHPLAPLVSFHHLDHIDPIFPNMTANQ 834

>ref|XP_006372975.1| hypothetical protein POPTR_0017s06680g [Populus trichocarpa]
 gi|550319623|gb|ERP50772.1| hypothetical protein POPTR_0017s06680g [Populus trichocarpa]
 Length = 506

 Score = 225 bits (573), Expect = 2e-56
 Identities = 121/229 (52%), Positives = 145/229 (63%), Gaps = 7/229 (3%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLH-------FPSVILSSDTSRFPYTFPRG 509
R +IRLWY+P +T + F+D+ + P + P VI+S DTS FPYTF G
Sbjct: 106 RQPYIRLWYNPTTTR-AFAFLDREVVDPTGNNNRSVIDPTLPPVIISKDTSSFPYTFKGG 164

Query: 508 NPSAIRIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGS 329
SAIR+ARVVK+V L P + WFVFGDDDTVFF NL+ VLSKYDH WFY+G S
Sbjct: 165 LKSAIRVARVVKEVVELNEPD--VDWFVFGDDDTVFFVENLVTVLSKYDHNGWFYVGSNS 222

Query: 328 ESYEQNEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGV 149
ESY QN K A+VLA VLDSCLVRY HLYGSD+RIF+CL+ELGV
Sbjct: 223 ESYSQNVKNSFEMGFGGGGFAISYSLAKVLARVLDSCLVRYAHLYGSDARIFSCLAELGV 282

Query: 148 ELTREPGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
L+ EPGFHQ+D+RGDLFGM H++A+ PIFP MSK Q
Sbjct: 283 GLSHEPGFHQVDMRGDLFGMLSAHPLSPLVSLHHLDAVNPIFPKMSKTQ 331

>ref|XP_002327928.1| predicted protein [Populus trichocarpa]
 Length = 479

 Score = 225 bits (573), Expect = 2e-56
 Identities = 121/229 (52%), Positives = 145/229 (63%), Gaps = 7/229 (3%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLH-------FPSVILSSDTSRFPYTFPRG 509
R +IRLWY+P +T + F+D+ + P + P VI+S DTS FPYTF G
Sbjct: 79 RQPYIRLWYNPTTTR-AFAFLDREVVDPTGNNNRSVIDPTLPPVIISKDTSSFPYTFKGG 137

Query: 508 NPSAIRIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGS 329
SAIR+ARVVK+V L P + WFVFGDDDTVFF NL+ VLSKYDH WFY+G S
Sbjct: 138 LKSAIRVARVVKEVVELNEPD--VDWFVFGDDDTVFFVENLVTVLSKYDHNGWFYVGSNS 195

Query: 328 ESYEQNEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGV 149
ESY QN K A+VLA VLDSCLVRY HLYGSD+RIF+CL+ELGV
Sbjct: 196 ESYSQNVKNSFEMGFGGGGFAISYSLAKVLARVLDSCLVRYAHLYGSDARIFSCLAELGV 255

Query: 148 ELTREPGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
L+ EPGFHQ+D+RGDLFGM H++A+ PIFP MSK Q
Sbjct: 256 GLSHEPGFHQVDMRGDLFGMLSAHPLSPLVSLHHLDAVNPIFPKMSKTQ 304

>ref|XP_002274405.2| PREDICTED: uncharacterized protein LOC100246569 [Vitis vinifera]
 Length = 455

 Score = 224 bits (572), Expect = 3e-56
 Identities = 114/225 (50%), Positives = 146/225 (64%)
 Frame = -1

Query: 676 LPSRAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSA 497
L RA ++RLW +++ +F+D P S P ++LS DTSRFPYTF RG PSA
Sbjct: 61 LGRRAPYLRLW---SNSARAILFLDSPPPPDPSFAALPPIVLSGDTSRFPYTFRRGLPSA 117

Query: 496 IRIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYE 317
+R+AR++K+ + IRWFVFGDDDTVFF NL+ LSKYDH +WFYIG SESYE
Sbjct: 118 VRVARIIKEA--VDRNESDIRWFVFGDDDTVFFVDNLVRTLSKYDHDQWFYIGSSSESYE 175

Query: 316 QNEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTR 137
QNE AR LAGV DSCL+RYPHL+GSD+RIF+CL+ELGV LT
Sbjct: 176 QNESNSFDMAFGGGGFALSHSLARALAGVFDSCLMRYPHLFGSDARIFSCLAELGVGLTH 235

Query: 136 EPGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
EPGFHQ+D+RG+LFGM H+++++PIFP ++ Q
Sbjct: 236 EPGFHQVDIRGNLFGMLSAHPLSPLVSLHHLDSVDPIFPNRNRTQ 280

>gb|EXC23146.1| hypothetical protein L484_018277 [Morus notabilis]
 Length = 456

 Score = 222 bits (565), Expect = 2e-55
 Identities = 117/224 (52%), Positives = 146/224 (65%), Gaps = 2/224 (0%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFID--KPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAI 494
R ++RLWY+P ST +VF+D +PL P +L P I+S D SRFPYTF G SAI
Sbjct: 82 RKPYVRLWYNPKSTR-AFVFLDSSEPLSDPDPSL--PPAIVSEDASRFPYTFRGGLRSAI 138

Query: 493 RIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQ 314
R+ARVVK+V P +RWFVFGDDDTVFF NL+ LSKYDH++WFY+G SE YEQ
Sbjct: 139 RVARVVKEVVDRGEPG--VRWFVFGDDDTVFFVDNLVRTLSKYDHERWFYVGSNSEGYEQ 196

Query: 313 NEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTRE 134
N K AR LA V DSCLVRY HLYGSD+R+F+C++ELGV LT E
Sbjct: 197 NAKNSFDMAFGGGGFAISSSLARALAKVFDSCLVRYAHLYGSDARVFSCVAELGVGLTHE 256

Query: 133 PGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
PGFHQ+DVRG+LFG+ H++A++PIFP ++ Q
Sbjct: 257 PGFHQVDVRGNLFGLLSAHPLSPLLSLHHLDAVDPIFPNTNRTQ 300

>ref|XP_006414552.1| hypothetical protein EUTSA_v10025038mg [Eutrema salsugineum]
 gi|557115722|gb|ESQ56005.1| hypothetical protein EUTSA_v10025038mg [Eutrema salsugineum]
 Length = 487

 Score = 222 bits (565), Expect = 2e-55
 Identities = 115/222 (51%), Positives = 146/222 (65%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAIRI 488
R+ ++RLWY+P ST VF+D+ P L P VI+S D SRFPY FP G SAIR+
Sbjct: 96 RSSYVRLWYTPESTR-AVVFLDRGGFDP--DLSLPQVIVSKDVSRFPYNFPGGLRSAIRV 152

Query: 487 ARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQNE 308
ARVVK+ T+ +RWFVFGDDDTVFF NL+ VLSKYDH+KW+Y+G SE Y+QN
Sbjct: 153 ARVVKE--TVDRGDKDVRWFVFGDDDTVFFVDNLVTVLSKYDHRKWYYVGSNSEFYDQNV 210

Query: 307 KXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTREPG 128
+ +VLA VLDSCL+RY H+YGSDSRIF+CL+ELGV LT EPG
Sbjct: 211 RYSFDMAFGGGGFAISASLGKVLAKVLDSCLMRYSHMYGSDSRIFSCLAELGVTLTHEPG 270

Query: 127 FHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
FHQIDVRG+LFG+ H++A++P FP M++ +
Sbjct: 271 FHQIDVRGNLFGLLCAHPLSPLVSLHHLDAVDPFFPKMNRTE 312

>ref|XP_006414551.1| hypothetical protein EUTSA_v10025038mg [Eutrema salsugineum]
 gi|557115721|gb|ESQ56004.1| hypothetical protein EUTSA_v10025038mg [Eutrema salsugineum]
 Length = 462

 Score = 222 bits (565), Expect = 2e-55
 Identities = 115/222 (51%), Positives = 146/222 (65%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAIRI 488
R+ ++RLWY+P ST VF+D+ P L P VI+S D SRFPY FP G SAIR+
Sbjct: 96 RSSYVRLWYTPESTR-AVVFLDRGGFDP--DLSLPQVIVSKDVSRFPYNFPGGLRSAIRV 152

Query: 487 ARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQNE 308
ARVVK+ T+ +RWFVFGDDDTVFF NL+ VLSKYDH+KW+Y+G SE Y+QN
Sbjct: 153 ARVVKE--TVDRGDKDVRWFVFGDDDTVFFVDNLVTVLSKYDHRKWYYVGSNSEFYDQNV 210

Query: 307 KXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTREPG 128
+ +VLA VLDSCL+RY H+YGSDSRIF+CL+ELGV LT EPG
Sbjct: 211 RYSFDMAFGGGGFAISASLGKVLAKVLDSCLMRYSHMYGSDSRIFSCLAELGVTLTHEPG 270

Query: 127 FHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
FHQIDVRG+LFG+ H++A++P FP M++ +
Sbjct: 271 FHQIDVRGNLFGLLCAHPLSPLVSLHHLDAVDPFFPKMNRTE 312

>gb|EOX99163.1| Uncharacterized protein isoform 5 [Theobroma cacao]
 Length = 452

 Score = 220 bits (560), Expect = 7e-55
 Identities = 115/224 (51%), Positives = 145/224 (64%)
 Frame = -1

Query: 673 PSRAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAI 494
P R+ +IRLWY+P +T F+D+P+ + P V++S DT FPYTF G SAI
Sbjct: 91 PRRSSYIRLWYTPRATR-AVAFLDQPVSSLVDPT-LPPVMVSGDTKSFPYTFKGGLRSAI 148

Query: 493 RIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQ 314
R+ARVVK+ + IRWFVFGDDDTVF NL+ VLSKYDH+KWFY+G SESYEQ
Sbjct: 149 RVARVVKEA--VERNETGIRWFVFGDDDTVFIVDNLVKVLSKYDHEKWFYVGSNSESYEQ 206

Query: 313 NEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTRE 134
N K +VLA VLDSCL+RY HLYGSD+R+++CL+ELGV LT E
Sbjct: 207 NLKYSFDMAFGGGGFAISYSLGKVLARVLDSCLMRYAHLYGSDARVWSCLAELGVGLTHE 266

Query: 133 PGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
GFHQ+D+RG+LFGM H++A+EP+FP MSK Q
Sbjct: 267 RGFHQVDMRGNLFGMLTAHPLSPLVSLHHLDAMEPVFPNMSKTQ 310

>gb|EOX99161.1| Uncharacterized protein isoform 3 [Theobroma cacao]
 Length = 485

 Score = 220 bits (560), Expect = 7e-55
 Identities = 115/224 (51%), Positives = 145/224 (64%)
 Frame = -1

Query: 673 PSRAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAI 494
P R+ +IRLWY+P +T F+D+P+ + P V++S DT FPYTF G SAI
Sbjct: 91 PRRSSYIRLWYTPRATR-AVAFLDQPVSSLVDPT-LPPVMVSGDTKSFPYTFKGGLRSAI 148

Query: 493 RIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQ 314
R+ARVVK+ + IRWFVFGDDDTVF NL+ VLSKYDH+KWFY+G SESYEQ
Sbjct: 149 RVARVVKEA--VERNETGIRWFVFGDDDTVFIVDNLVKVLSKYDHEKWFYVGSNSESYEQ 206

Query: 313 NEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTRE 134
N K +VLA VLDSCL+RY HLYGSD+R+++CL+ELGV LT E
Sbjct: 207 NLKYSFDMAFGGGGFAISYSLGKVLARVLDSCLMRYAHLYGSDARVWSCLAELGVGLTHE 266

Query: 133 PGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
GFHQ+D+RG+LFGM H++A+EP+FP MSK Q
Sbjct: 267 RGFHQVDMRGNLFGMLTAHPLSPLVSLHHLDAMEPVFPNMSKTQ 310

>gb|EOX99159.1| Uncharacterized protein isoform 1 [Theobroma cacao]
 gi|508707264|gb|EOX99160.1| Uncharacterized protein isoform 1 [Theobroma cacao]
 gi|508707266|gb|EOX99162.1| Uncharacterized protein isoform 1 [Theobroma cacao]
 Length = 484

 Score = 220 bits (560), Expect = 7e-55
 Identities = 115/224 (51%), Positives = 145/224 (64%)
 Frame = -1

Query: 673 PSRAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAI 494
P R+ +IRLWY+P +T F+D+P+ + P V++S DT FPYTF G SAI
Sbjct: 91 PRRSSYIRLWYTPRATR-AVAFLDQPVSSLVDPT-LPPVMVSGDTKSFPYTFKGGLRSAI 148

Query: 493 RIARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQ 314
R+ARVVK+ + IRWFVFGDDDTVF NL+ VLSKYDH+KWFY+G SESYEQ
Sbjct: 149 RVARVVKEA--VERNETGIRWFVFGDDDTVFIVDNLVKVLSKYDHEKWFYVGSNSESYEQ 206

Query: 313 NEKXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTRE 134
N K +VLA VLDSCL+RY HLYGSD+R+++CL+ELGV LT E
Sbjct: 207 NLKYSFDMAFGGGGFAISYSLGKVLARVLDSCLMRYAHLYGSDARVWSCLAELGVGLTHE 266

Query: 133 PGFHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
GFHQ+D+RG+LFGM H++A+EP+FP MSK Q
Sbjct: 267 RGFHQVDMRGNLFGMLTAHPLSPLVSLHHLDAMEPVFPNMSKTQ 310

>ref|XP_006283625.1| hypothetical protein CARUB_v10004682mg [Capsella rubella]
 gi|482552330|gb|EOA16523.1| hypothetical protein CARUB_v10004682mg [Capsella rubella]
 Length = 487

 Score = 219 bits (559), Expect = 9e-55
 Identities = 114/222 (51%), Positives = 148/222 (66%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAIRI 488
R+ ++RLWY+P ST VF+D+ + S L P V++S D SRFPY FP G SAIR+
Sbjct: 96 RSSYVRLWYTPESTR-AVVFLDRGGFE--SDLTLPPVVVSKDVSRFPYNFPGGLRSAIRV 152

Query: 487 ARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQNE 308
ARVVK+ T+ +RWFVFGDDDTVFF NL+ VLSKYDH+KW+Y+G SE Y+QN
Sbjct: 153 ARVVKE--TVDQGDKDVRWFVFGDDDTVFFVDNLVTVLSKYDHRKWYYVGSNSEFYDQNV 210

Query: 307 KXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTREPG 128
+ A+VLA VLDSCL+RY H+YGSDSRIF+CL+ELGV LT EPG
Sbjct: 211 RYSFDMAFGGGGFAISVSLAKVLAKVLDSCLMRYSHMYGSDSRIFSCLAELGVTLTHEPG 270

Query: 127 FHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
FHQIDVRG++FG+ H++A++P FP M++ +
Sbjct: 271 FHQIDVRGNIFGLLCAHPLSPLVSLHHLDAVDPFFPKMNRTE 312

>ref|NP_193259.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332658175|gb|AEE83575.1| uncharacterized protein AT4G15240 [Arabidopsis thaliana]
 Length = 488

 Score = 219 bits (558), Expect = 1e-54
 Identities = 115/222 (51%), Positives = 147/222 (66%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAIRI 488
R+ ++RLWYSP ST VF+D+ + S L P VI+S D SRFPY FP G SAIR+
Sbjct: 97 RSSYVRLWYSPESTR-AVVFLDRGGLE--SDLTLPPVIVSKDVSRFPYNFPGGLRSAIRV 153

Query: 487 ARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQNE 308
ARVVK+ T+ +RWFVFGDDDTVFF NL+ VLSKYDH+KWFY+G SE Y+QN
Sbjct: 154 ARVVKE--TVDRGDKDVRWFVFGDDDTVFFVDNLVTVLSKYDHRKWFYVGSNSEFYDQNV 211

Query: 307 KXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTREPG 128
+ A+VLA VLDSCL+RY H+YGSDSRIF+C++ELGV LT EPG
Sbjct: 212 RYSFDMAFGGGGFAISASLAKVLAKVLDSCLMRYSHMYGSDSRIFSCVAELGVTLTHEPG 271

Query: 127 FHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
FHQIDVRG++FG+ H++A++P FP ++ +
Sbjct: 272 FHQIDVRGNIFGLLCAHPLSPLVSLHHLDAVDPFFPKRNRTE 313

>emb|CAB10303.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268271|emb|CAB78566.1| hypothetical protein [Arabidopsis thaliana]
 Length = 520

 Score = 219 bits (558), Expect = 1e-54
 Identities = 115/222 (51%), Positives = 147/222 (66%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAIRI 488
R+ ++RLWYSP ST VF+D+ + S L P VI+S D SRFPY FP G SAIR+
Sbjct: 97 RSSYVRLWYSPESTR-AVVFLDRGGLE--SDLTLPPVIVSKDVSRFPYNFPGGLRSAIRV 153

Query: 487 ARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQNE 308
ARVVK+ T+ +RWFVFGDDDTVFF NL+ VLSKYDH+KWFY+G SE Y+QN
Sbjct: 154 ARVVKE--TVDRGDKDVRWFVFGDDDTVFFVDNLVTVLSKYDHRKWFYVGSNSEFYDQNV 211

Query: 307 KXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTREPG 128
+ A+VLA VLDSCL+RY H+YGSDSRIF+C++ELGV LT EPG
Sbjct: 212 RYSFDMAFGGGGFAISASLAKVLAKVLDSCLMRYSHMYGSDSRIFSCVAELGVTLTHEPG 271

Query: 127 FHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
FHQIDVRG++FG+ H++A++P FP ++ +
Sbjct: 272 FHQIDVRGNIFGLLCAHPLSPLVSLHHLDAVDPFFPKRNRTE 313

>ref|XP_002868218.1| hypothetical protein ARALYDRAFT_493365 [Arabidopsis lyrata subsp. lyrata]
 gi|297314054|gb|EFH44477.1| hypothetical protein ARALYDRAFT_493365 [Arabidopsis lyrata subsp. lyrata]
 Length = 488

 Score = 219 bits (557), Expect = 2e-54
 Identities = 114/222 (51%), Positives = 147/222 (66%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAIRI 488
R+ ++RLWYSP ST VF+D+ + S L P VI+S D SRFPY FP G SAIR+
Sbjct: 97 RSSYVRLWYSPESTR-AVVFLDRGGLE--SDLTLPPVIVSKDVSRFPYNFPGGLRSAIRV 153

Query: 487 ARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQNE 308
ARVVK+ L + +RWFVFGDDDTVFF NL+ VLSKYDH+KW+Y+G SE Y+QN
Sbjct: 154 ARVVKETVDLGDKD--VRWFVFGDDDTVFFVDNLVTVLSKYDHRKWYYVGSNSEFYDQNV 211

Query: 307 KXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTREPG 128
+ A+VLA VLDSCL+RY H+YGSDSRIF+C++ELGV LT EPG
Sbjct: 212 RYSFDMAFGGGGFAISASLAKVLAKVLDSCLMRYSHMYGSDSRIFSCVAELGVTLTHEPG 271

Query: 127 FHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFPGMSKIQ 2
FHQIDVRG++FG+ H++A++P FP ++ +
Sbjct: 272 FHQIDVRGNIFGLLCAHPLSPLVSLHHLDAVDPFFPKRNRTE 313

>ref|XP_002515723.1| transferase, transferring glycosyl groups, putative [Ricinus communis]
 gi|223545160|gb|EEF46670.1| transferase, transferring glycosyl groups, putative [Ricinus communis]
 Length = 308

 Score = 216 bits (551), Expect = 8e-54
 Identities = 115/216 (53%), Positives = 139/216 (64%)
 Frame = -1

Query: 667 RAQFIRLWYSPNSTPYTYVFIDKPLQKPMSTLHFPSVILSSDTSRFPYTFPRGNPSAIRI 488
R ++RLWY+PNST + F+D P VI+S DTSRFPYTF G SAIR+
Sbjct: 95 REPYLRLWYNPNSTR-AFAFLDVNTSSLSVDPTLPPVIISKDTSRFPYTFKGGLRSAIRV 153

Query: 487 ARVVKDVFTLANPPPHIRWFVFGDDDTVFFKRNLLLVLSKYDHKKWFYIGFGSESYEQNE 308
ARVVK+ + P IRWFVFGDDDTVFF +L+ LS YDH KW+YIG SESYEQN
Sbjct: 154 ARVVKEA--VDKNVPDIRWFVFGDDDTVFFVDSLVKTLSFYDHNKWYYIGSNSESYEQNM 211

Query: 307 KXXXXXXXXXXXXXXXXXXARVLAGVLDSCLVRYPHLYGSDSRIFACLSELGVELTREPG 128
K A+VLA VLDSCLVRY HLYGSD+R+F+CL+ELGV LT EPG
Sbjct: 212 KYSFDMGFGGGGFVISYSLAKVLARVLDSCLVRYGHLYGSDARVFSCLAELGVGLTHEPG 271

Query: 127 FHQIDVRGDLFGMXXXXXXXXXXXXXHMEALEPIFP 20
FHQ+D+RG+LFGM H++A +P+FP
Sbjct: 272 FHQVDMRGNLFGMLSAHPLSPLLSLHHLDAADPLFP 307