BLASTX nr result
ID: Catharanthus22_contig00000372
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00000372 (1366 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004245789.1| PREDICTED: uncharacterized protein LOC101263... 110 1e-21 ref|XP_006359254.1| PREDICTED: uncharacterized protein LOC102604... 107 9e-21 ref|XP_004152391.1| PREDICTED: uncharacterized protein LOC101222... 102 5e-19 ref|XP_003549926.1| PREDICTED: uncharacterized protein LOC100812... 100 2e-18 gb|EXC02099.1| hypothetical protein L484_024064 [Morus notabilis] 99 4e-18 ref|XP_006477140.1| PREDICTED: uncharacterized protein LOC102618... 99 5e-18 ref|XP_003608674.1| hypothetical protein MTR_4g100570 [Medicago ... 97 1e-17 gb|ESW27685.1| hypothetical protein PHAVU_003G223000g, partial [... 97 2e-17 ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citr... 96 3e-17 ref|XP_004299406.1| PREDICTED: uncharacterized protein LOC101293... 96 5e-17 ref|XP_004508940.1| PREDICTED: uncharacterized protein LOC101492... 94 1e-16 gb|EOY24267.1| Uncharacterized protein isoform 4 [Theobroma cacao] 94 2e-16 gb|EOY24264.1| Uncharacterized protein isoform 1 [Theobroma caca... 94 2e-16 gb|EMJ10735.1| hypothetical protein PRUPE_ppa010718mg [Prunus pe... 93 2e-16 ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferas... 93 3e-16 ref|XP_003635203.1| PREDICTED: uncharacterized protein LOC100853... 91 1e-15 ref|XP_002531462.1| conserved hypothetical protein [Ricinus comm... 84 1e-13 gb|ABD65177.1| hypothetical protein 40.t00065 [Brassica oleracea] 82 5e-13 ref|XP_006440253.1| hypothetical protein CICLE_v10022000mg [Citr... 75 8e-11 ref|XP_006285401.1| hypothetical protein CARUB_v10006806mg, part... 74 2e-10 >ref|XP_004245789.1| PREDICTED: uncharacterized protein LOC101263341 isoform 1 [Solanum lycopersicum] gi|460400536|ref|XP_004245790.1| PREDICTED: uncharacterized protein LOC101263341 isoform 2 [Solanum lycopersicum] Length = 219 Score = 110 bits (276), Expect = 1e-21 Identities = 86/272 (31%), Positives = 114/272 (41%), Gaps = 4/272 (1%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAPVKSQPLH F LP L+W RFRRR SPP Sbjct: 1 MATAPVKSQPLHYFSLPQLKWGNKSNTNANH----RFRRRDSPP---------------S 41 Query: 993 XSRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 814 T A+ D +D P+ P ++ + + K Sbjct: 42 NGDNPTQTADVDGGSDSEKVQPRS-----EAEADPNGVSSLQGREEHEEKVKEEEEEEVG 96 Query: 813 XXXXXGKPWNLRPRRFVTLPAASFKKGEKM----SDEIVLYQRNDNSSSAGGCGPSSKPI 646 K WNLRPRR VT + K +M S+ + QR +++ G G K Sbjct: 97 CEEGEVKLWNLRPRRGVTKVETTSLKNVEMRVESSNHMQRSQRLKDNADGNGVGSGKK-- 154 Query: 645 RFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXXXXXXXX 466 G ++ LWISLSREEIEED+YS+TGS Sbjct: 155 ----------------GKKK-----------LWISLSREEIEEDVYSMTGSRPARRPKKR 187 Query: 465 XKTVQKQMDTVFPGLFLVGMNVDSYRVHESLR 370 KT+QKQ+D VFPGL+LVG+ DS+RV+++ + Sbjct: 188 SKTIQKQLDNVFPGLYLVGVTADSFRVNDTTK 219 >ref|XP_006359254.1| PREDICTED: uncharacterized protein LOC102604791 [Solanum tuberosum] Length = 220 Score = 107 bits (268), Expect = 9e-21 Identities = 89/272 (32%), Positives = 113/272 (41%), Gaps = 4/272 (1%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M APVKSQPLH F LP L+W RFRRR SPP Sbjct: 1 MAAAPVKSQPLHYFSLPQLKWGNKSHTNANH----RFRRRDSPP---------------- 40 Query: 993 XSRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 814 S +D D S Q + P S KE + Sbjct: 41 -SNGDNPPQTADVDGGSDSEKVQPRSEAEADPNGVSSLQGEDEHEKEVKEEEEEEEVGCE 99 Query: 813 XXXXXGKPWNLRPRRFVT-LPAASFKKGE---KMSDEIVLYQRNDNSSSAGGCGPSSKPI 646 K WNLRPRR VT + AS K E + S+ + QR +++ G G K Sbjct: 100 EGEV--KLWNLRPRRGVTKVETASLKNVEMRVESSNHMQRSQRLKDNADGNGVGSGKK-- 155 Query: 645 RFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXXXXXXXX 466 G ++ LWISLSREEIEED+YS+TGS Sbjct: 156 ----------------GKKK-----------LWISLSREEIEEDVYSMTGSRPARRPKKR 188 Query: 465 XKTVQKQMDTVFPGLFLVGMNVDSYRVHESLR 370 KT+QKQ+D VFPGL+LVG+ DS+RV+++ + Sbjct: 189 SKTIQKQLDNVFPGLYLVGLTADSFRVNDTTK 220 >ref|XP_004152391.1| PREDICTED: uncharacterized protein LOC101222282 [Cucumis sativus] gi|449488652|ref|XP_004158130.1| PREDICTED: uncharacterized LOC101222282 [Cucumis sativus] Length = 246 Score = 102 bits (253), Expect = 5e-19 Identities = 87/277 (31%), Positives = 114/277 (41%), Gaps = 11/277 (3%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M T PVKSQPLHNF LP L+W R RR Sbjct: 1 MATGPVKSQPLHNFALPFLKWGGKNQTNSNH----RIRRA----------------IGGG 40 Query: 993 XSRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRK-----------EAS 847 ++ AV S+P+++ S PQ+ + RT R +F+ CS K E Sbjct: 41 GGDSSPAVDHSEPESEADSK-PQL-RVGSRTVRNRLAFSPCSLGDKFAKHSEGEVGDEVV 98 Query: 846 KXXXXXXXXXXXXXXXGKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGC 667 K KPWNLRPR+ +L K E+ + +S G Sbjct: 99 KEQKREGEEVEGEEIVQKPWNLRPRKGTSLRGYGDLKNGGDLQEMDGAVSSAAGASQQGE 158 Query: 666 GPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXX 487 P K +R + G K WI+LSR+EIEEDI+ +TGS Sbjct: 159 NPQPKSLRLR-------------GFTESHRIEKKDKRKFWIALSRDEIEEDIFIMTGSRP 205 Query: 486 XXXXXXXXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 K VQKQ+DTVFPGL+LVG+ DSYR+ +S Sbjct: 206 SRRPKKRPKNVQKQLDTVFPGLWLVGVTADSYRLADS 242 >ref|XP_003549926.1| PREDICTED: uncharacterized protein LOC100812835 isoform X1 [Glycine max] gi|571536516|ref|XP_006600845.1| PREDICTED: uncharacterized protein LOC100812835 isoform X2 [Glycine max] Length = 237 Score = 100 bits (248), Expect = 2e-18 Identities = 88/270 (32%), Positives = 109/270 (40%), Gaps = 7/270 (2%) Frame = -2 Query: 1164 APVKSQPLHNFDLPHLRWXXXXXXXXXXXXXH--RFRRRGSPPVLHYLVNNXXXXXXXXX 991 APVKSQPLHNF LP L+W H RFRR P H Sbjct: 8 APVKSQPLHNFALPFLKWGASGKNNTTTTAAHHHRFRR----PSDH-------------- 49 Query: 990 SRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXXX 811 S+PD+ D + P + RT R FS + Sbjct: 50 --------ASEPDSSDPDSRPH--RLGSRTARNRFSLPL---KPPPPPPPQLHEAEHDDA 96 Query: 810 XXXXGKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCG-----PSSKPI 646 KPWNLRPR+ LP A+ + G S N GG G P+ K + Sbjct: 97 DDAVQKPWNLRPRKPALLPKAALEIGTGPSRNHHHATNNGEFHDGGGGGGDNNNPAPKSL 156 Query: 645 RFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXXXXXXXX 466 R + G K WI+LSREEIEEDI+ +TGS Sbjct: 157 RLR-------------GFSDTPCSVKKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKR 203 Query: 465 XKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 K VQKQMD+VFPGL+LVG+ D+YRV ++ Sbjct: 204 PKNVQKQMDSVFPGLWLVGITADAYRVADT 233 >gb|EXC02099.1| hypothetical protein L484_024064 [Morus notabilis] Length = 268 Score = 99.0 bits (245), Expect = 4e-18 Identities = 86/279 (30%), Positives = 118/279 (42%), Gaps = 16/279 (5%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAPVKS PLHNF LP L+W R S PV + Sbjct: 1 MATAPVKS-PLHNFPLPFLKWGGGKNHASGSHRCRRTISADSSPVADHC----------- 48 Query: 993 XSRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFS--FAACS--SQRKEASKXXXXXX 826 AE + + + + + RT R F+ FA+CS S++KE+ + Sbjct: 49 ------DAAEQERNESSEAEPNRFHRVGSRTVRNRFAAPFASCSLVSEKKESDEVAAGEG 102 Query: 825 XXXXXXXXXG----------KPWNLRPRRFVTLPAAS--FKKGEKMSDEIVLYQRNDNSS 682 KPWNLRPR+ + AA+ K GE E + Sbjct: 103 KEGDDREVEAAAGEEEMMVQKPWNLRPRKALFSKAATNGAKSGELPEQE----------N 152 Query: 681 SAGGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSL 502 + G G S+ + + P + + G L +Q WI+LSREEIEEDI+ + Sbjct: 153 AVAGGGHQSENLNQQPPKSMRLRG---LSESQQSSEKEKRK--FWIALSREEIEEDIFVM 207 Query: 501 TGSXXXXXXXXXXKTVQKQMDTVFPGLFLVGMNVDSYRV 385 TGS K VQKQ+D VFPGL+LVG+ D+YR+ Sbjct: 208 TGSRPARRPRKRPKNVQKQLDAVFPGLWLVGITADAYRI 246 >ref|XP_006477140.1| PREDICTED: uncharacterized protein LOC102618144 isoform X1 [Citrus sinensis] Length = 216 Score = 98.6 bits (244), Expect = 5e-18 Identities = 88/274 (32%), Positives = 110/274 (40%), Gaps = 8/274 (2%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAP+KSQPLHNF L L+W R R PP Sbjct: 1 MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHN------RTRTPPPT--------------- 39 Query: 993 XSRAATAVAESDPDNDDTSNAPQVVKTSIRTPR--------KPFSFAACSSQRKEASKXX 838 E D +D T + V S R R KP A SQR+ A Sbjct: 40 ---------EPDTTDDSTRHHRVVGSRSSRAQRLSFPCSTSKPHQDAGDRSQRQTADTEE 90 Query: 837 XXXXXXXXXXXXXGKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPS 658 +PWNLRPR K E + D V R DN+++ P Sbjct: 91 EEEDEVG-------RPWNLRPR----------KVQETLVDVAVFQNRGDNNANTKA--PK 131 Query: 657 SKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXXXX 478 S +R V E G E+ W++LSREEIEEDI+ +TGS Sbjct: 132 STRLREMV----ESRGSNGDKKEKNK---------FWVTLSREEIEEDIFIMTGSRPARR 178 Query: 477 XXXXXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 K VQKQ+D VFPGL+LVG+ VD+YRV ++ Sbjct: 179 PRKRPKNVQKQLDNVFPGLWLVGLTVDAYRVSDA 212 >ref|XP_003608674.1| hypothetical protein MTR_4g100570 [Medicago truncatula] gi|355509729|gb|AES90871.1| hypothetical protein MTR_4g100570 [Medicago truncatula] Length = 243 Score = 97.4 bits (241), Expect = 1e-17 Identities = 90/289 (31%), Positives = 119/289 (41%), Gaps = 23/289 (7%) Frame = -2 Query: 1173 MGTAP--VKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXX 1000 M T P VKSQPLHNF LP L+W HR RR P H Sbjct: 1 MATTPASVKSQPLHNFSLPFLKWGGTGKNNTNATNHHRSRR----PPDH----------- 45 Query: 999 XXXSRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRK------------ 856 S+PD++ S ++ RT R F FA+ SSQR+ Sbjct: 46 -----------ASEPDSEPDSRPHRL---GSRTARNRFGFASSSSQRQAPPTPSSNNETD 91 Query: 855 -----EASKXXXXXXXXXXXXXXXGKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRND 691 KPWNLRPR+ + +P F+ G S RN+ Sbjct: 92 DNAGDRKRDAEDDAEAGGGAEEIVQKPWNLRPRKPM-IPRGGFEIGAGGS-------RNN 143 Query: 690 NSSS----AGGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEI 523 N G P+ K +R + A N G +++ WI+LS++EI Sbjct: 144 NGGELQEGVNGENPAPKSLRLRGFADT------NCGEKKEKRK-------FWIALSKDEI 190 Query: 522 EEDIYSLTGSXXXXXXXXXXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 EEDI+ +TGS K VQKQMD VFPGL+LVG+ D+YRV ++ Sbjct: 191 EEDIFVMTGSRPNRRPRKRAKNVQKQMDNVFPGLWLVGITADAYRVADT 239 >gb|ESW27685.1| hypothetical protein PHAVU_003G223000g, partial [Phaseolus vulgaris] gi|561029046|gb|ESW27686.1| hypothetical protein PHAVU_003G223000g, partial [Phaseolus vulgaris] Length = 306 Score = 97.1 bits (240), Expect = 2e-17 Identities = 91/294 (30%), Positives = 121/294 (41%), Gaps = 26/294 (8%) Frame = -2 Query: 1179 FLMGTAP----VKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXX 1012 F M TAP VKSQPLHNF LP L+W HR RR S H Sbjct: 55 FSMATAPAQPPVKSQPLHNFALPFLKWGASGKNHTNAAHHHRCRRPSSLSSDH------- 107 Query: 1011 XXXXXXXSRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQR--------- 859 S+PD+D S +V RT R F+ CS + Sbjct: 108 ---------------ASEPDSDPDSRPHRV---GSRTTRNRFALPTCSLKPLPPPPEPPQ 149 Query: 858 ----KEASKXXXXXXXXXXXXXXXGKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRND 691 + + KPWNLRPR+ LP ++ + G S RN Sbjct: 150 PPSCNDETDDEAAKRDIEDAEEAVQKPWNLRPRK-PALPKSALEIGTGPS-------RNH 201 Query: 690 NSSSAG---------GCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISL 538 ++ G G P+ K +R + A + +E++ WI+L Sbjct: 202 ANNGVGEFHDGVSHHGENPAPKSLRLRGFADTQC-------AEKKEKRK------FWIAL 248 Query: 537 SREEIEEDIYSLTGSXXXXXXXXXXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 SREEIEEDI+ +TGS K VQKQMD+VFPGL+LVG+ D+YRV ++ Sbjct: 249 SREEIEEDIFVMTGSRPARRPRKRPKNVQKQMDSVFPGLWLVGITADAYRVPDT 302 >ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citrus clementina] gi|557542514|gb|ESR53492.1| hypothetical protein CICLE_v10022000mg [Citrus clementina] Length = 216 Score = 96.3 bits (238), Expect = 3e-17 Identities = 82/267 (30%), Positives = 106/267 (39%), Gaps = 1/267 (0%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAP+KSQPLHNF L L+W R R PP Sbjct: 1 MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHN------RTRTPPPT--------------- 39 Query: 993 XSRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 814 E D +D T + V S R R F + Q+ + Sbjct: 40 ---------EPDTTDDSTRHHRVVGSRSSRAQRLSFPSSTSKPQQDAVERPQRQTADTEE 90 Query: 813 XXXXXG-KPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPSSKPIRFK 637 +PWNLRPR K E + D V R DN+++ P S +R Sbjct: 91 EEEDEVGRPWNLRPR----------KVQETLVDVAVFQNRGDNNANTKA--PKSTRLREM 138 Query: 636 VPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXXXXXXXXXKT 457 V E G E+ W++LSREEIEEDI+ +TGS K Sbjct: 139 V----ESRGSNGDKKEKNK---------FWVTLSREEIEEDIFIMTGSRPARRPRKRPKN 185 Query: 456 VQKQMDTVFPGLFLVGMNVDSYRVHES 376 VQKQ+D VFPGL+LVG+ D+YRV ++ Sbjct: 186 VQKQLDNVFPGLWLVGLTADAYRVSDA 212 >ref|XP_004299406.1| PREDICTED: uncharacterized protein LOC101293977 [Fragaria vesca subsp. vesca] Length = 239 Score = 95.5 bits (236), Expect = 5e-17 Identities = 87/282 (30%), Positives = 119/282 (42%), Gaps = 16/282 (5%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAPVK PLHNF L L+W R+RR PV Sbjct: 1 MATAPVKP-PLHNFPLSFLKWGSKNHTNTNH----RYRR----PV--------------- 36 Query: 993 XSRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSS---QRKEAS-------- 847 +A +D D +D+ + PQ + RT R FS A+CS QR E + Sbjct: 37 ---SAEPEPSADDDRNDSESPPQHHRVGSRTARHRFSLASCSEKLPQRNEKASEESDDDV 93 Query: 846 ----KXXXXXXXXXXXXXXXGKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSS 679 K KPWNLRPRR A + GE +++ S Sbjct: 94 DDDAKAAAVAAVAAAEEAEVQKPWNLRPRRAPVTKANNNTGGE-------VHEAEGTKQS 146 Query: 678 AGGCGPSSKPIRFK-VPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSL 502 P+ K +R + + A E GP++ +++ WI+LS++EIEEDI+ + Sbjct: 147 EQ---PAPKSMRLRGLAAAAE---GPSMEKKKEKRK-------FWIALSKDEIEEDIFIM 193 Query: 501 TGSXXXXXXXXXXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 TGS K VQKQ+D FPGL+LVG D+YR +S Sbjct: 194 TGSRPARRPKKRPKNVQKQLDNCFPGLWLVGFTADAYRGSDS 235 >ref|XP_004508940.1| PREDICTED: uncharacterized protein LOC101492028 [Cicer arietinum] Length = 242 Score = 94.0 bits (232), Expect = 1e-16 Identities = 84/282 (29%), Positives = 114/282 (40%), Gaps = 19/282 (6%) Frame = -2 Query: 1164 APVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXXXSR 985 APVKSQPLHNF LP L+W R RR P H Sbjct: 6 APVKSQPLHNFSLPFLKWGGTGKNHTNSNNHQRSRR----PPDH---------------- 45 Query: 984 AATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRK------------EASKX 841 A +PD++ S ++ RT R F + SS + +A Sbjct: 46 -----ASPEPDSEPDSRPHRL---GSRTARNRFGLPSSSSSHRHATVSSNHETDDDAGDR 97 Query: 840 XXXXXXXXXXXXXXGKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSA----- 676 KPWNLRPR+ + +P +F+ G S RN+++ Sbjct: 98 KREGEDEAGAEEIVQKPWNLRPRKPM-IPRGAFEIGAGGS-------RNNHNGGELVEAV 149 Query: 675 --GGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSL 502 G P+ K +R + G K WI+LS+EEIEEDI+ + Sbjct: 150 NNNGDNPTPKSLRLR-------------GFADTSCTEKKEKRKFWIALSKEEIEEDIFVM 196 Query: 501 TGSXXXXXXXXXXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 TGS K VQKQMD+VFPGL+LVG+ D+YRV ++ Sbjct: 197 TGSRPNRRPRKRPKNVQKQMDSVFPGLWLVGITADAYRVADT 238 >gb|EOY24267.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 227 Score = 93.6 bits (231), Expect = 2e-16 Identities = 85/277 (30%), Positives = 111/277 (40%), Gaps = 11/277 (3%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAPVKSQPLHNF+ P L+W R SP Sbjct: 1 MATAPVKSQPLHNFNFPFLKWGTHGGGGSSTSSADH---RRSP----------------- 40 Query: 993 XSRAATAVAESDPDNDDT------SNAPQVVKTSIRTPRKPF--SFAACSSQRKEAS--K 844 ESD D+D S + ++ + S P KP S Q++E K Sbjct: 41 ---------ESDSDHDRLRPTRVGSRSTRIQRLSFLPPPKPIKQSHGEDEEQQQEEQPLK 91 Query: 843 XXXXXXXXXXXXXXXGKPWNLRPRRFVTLPAASFKKG-EKMSDEIVLYQRNDNSSSAGGC 667 +PWNLRPR+ V A EK+S+ Sbjct: 92 PHKNEAEEEEEEETVQRPWNLRPRKVVVETTAVVTTAMEKVSET---------------A 136 Query: 666 GPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXX 487 P S +R G+ NGG E++ WI+LSREEIEEDI+ +TGS Sbjct: 137 APKSMRLR-----GLAENGGIVEKKEKRK---------FWIALSREEIEEDIFVMTGSRP 182 Query: 486 XXXXXXXXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 K +QKQ+D VFPGL+LVG D+YRV ++ Sbjct: 183 ARRPKKRPKNIQKQLDAVFPGLWLVGTTADAYRVADA 219 >gb|EOY24264.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508777009|gb|EOY24265.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508777010|gb|EOY24266.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508777012|gb|EOY24268.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 223 Score = 93.6 bits (231), Expect = 2e-16 Identities = 85/277 (30%), Positives = 111/277 (40%), Gaps = 11/277 (3%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAPVKSQPLHNF+ P L+W R SP Sbjct: 1 MATAPVKSQPLHNFNFPFLKWGTHGGGGSSTSSADH---RRSP----------------- 40 Query: 993 XSRAATAVAESDPDNDDT------SNAPQVVKTSIRTPRKPF--SFAACSSQRKEAS--K 844 ESD D+D S + ++ + S P KP S Q++E K Sbjct: 41 ---------ESDSDHDRLRPTRVGSRSTRIQRLSFLPPPKPIKQSHGEDEEQQQEEQPLK 91 Query: 843 XXXXXXXXXXXXXXXGKPWNLRPRRFVTLPAASFKKG-EKMSDEIVLYQRNDNSSSAGGC 667 +PWNLRPR+ V A EK+S+ Sbjct: 92 PHKNEAEEEEEEETVQRPWNLRPRKVVVETTAVVTTAMEKVSET---------------A 136 Query: 666 GPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXX 487 P S +R G+ NGG E++ WI+LSREEIEEDI+ +TGS Sbjct: 137 APKSMRLR-----GLAENGGIVEKKEKRK---------FWIALSREEIEEDIFVMTGSRP 182 Query: 486 XXXXXXXXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 K +QKQ+D VFPGL+LVG D+YRV ++ Sbjct: 183 ARRPKKRPKNIQKQLDAVFPGLWLVGTTADAYRVADA 219 >gb|EMJ10735.1| hypothetical protein PRUPE_ppa010718mg [Prunus persica] Length = 238 Score = 93.2 bits (230), Expect = 2e-16 Identities = 85/271 (31%), Positives = 107/271 (39%), Gaps = 5/271 (1%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAPVK PLHNF L L+W NN Sbjct: 1 MATAPVKP-PLHNFPLAFLKWGAK--------------------------NNSTTNNNHR 33 Query: 993 XSRAATAVAESDPDNDDTSNAPQVVKT-SIRTPRKPFSFAACSS---QRKEASKXXXXXX 826 R +A S+PD++ + S R R +S C+ +R E + Sbjct: 34 YRRPVSAEPASEPDSESERTHYNNSRVGSSRASRHRYSLIPCAGDKRRRSEERESDQEEG 93 Query: 825 XXXXXXXXXGKPWNLRPRRFVTLPAA-SFKKGEKMSDEIVLYQRNDNSSSAGGCGPSSKP 649 KPWNLRPRR PA SF KG + L N N S P S Sbjct: 94 EEADKAEVVHKPWNLRPRR---APATTSFSKGGANGEPHELESPNPNQSELQQ--PKSMR 148 Query: 648 IRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXXXXXXX 469 +R G V N K WI+LS+EEIEEDI+ +TGS Sbjct: 149 LRGLAAEGQNVEKKEN--------------RKFWIALSKEEIEEDIFVMTGSRPARRPKK 194 Query: 468 XXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 K VQKQ+D FPGL+LVG+ D+Y+V +S Sbjct: 195 RPKNVQKQLDITFPGLWLVGVTADAYKVADS 225 >ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferase 2E-like [Glycine max] Length = 241 Score = 92.8 bits (229), Expect = 3e-16 Identities = 84/269 (31%), Positives = 110/269 (40%), Gaps = 6/269 (2%) Frame = -2 Query: 1164 APVKSQPLHNFDLPHLRWXXXXXXXXXXXXXH-RFRRRGSPPVLHYLVNNXXXXXXXXXS 988 APVKSQPLHNF LP L+W H RFRR P H Sbjct: 14 APVKSQPLHNFALPFLKWGASGKNNTTNAAHHHRFRR----PSDH--------------- 54 Query: 987 RAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXXXX 808 S+PD+ D + P + RT R FS + Sbjct: 55 -------ASEPDSSDPDSRPH--RLGSRTARNRFSLPL----KPPPPPPPPQPPHDDDAD 101 Query: 807 XXXGKPWNLRPRRFVTLP---AASFKKGEKMSDEIVLYQRNDNSS--SAGGCGPSSKPIR 643 KPW LRPR+ LP A G + + +N G P+ K +R Sbjct: 102 DSVQKPWKLRPRKPALLPNKTALEIGTGPSRNHHHHHHHATNNGEFLDGGDNNPAPKSLR 161 Query: 642 FKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXXXXXXXXX 463 + + + SE++ WI+LSREEIEEDI+ +TGS Sbjct: 162 LRGFSDTQC-------SEKKEKRK------FWIALSREEIEEDIFVMTGSRPARRPRKRP 208 Query: 462 KTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 K VQKQMD+VFPGL+LVG+ D+YRV ++ Sbjct: 209 KNVQKQMDSVFPGLWLVGITADAYRVADT 237 >ref|XP_003635203.1| PREDICTED: uncharacterized protein LOC100853295 [Vitis vinifera] gi|296085701|emb|CBI29500.3| unnamed protein product [Vitis vinifera] Length = 240 Score = 90.5 bits (223), Expect = 1e-15 Identities = 83/276 (30%), Positives = 113/276 (40%), Gaps = 10/276 (3%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAPVKSQPLHNF L L+W R P Sbjct: 1 MATAPVKSQPLHNFPLSFLKWGKNQMNNHRCRKPVDALRESPPD---------------- 44 Query: 993 XSRAATAVAESDPDND-----DTSNAPQVVKTSIRTPRKPFSFAACS----SQRKEAS-K 844 ES+PD+D ++ + + + RT R + A+ S +Q+ +A + Sbjct: 45 -----GRKNESEPDSDGGSKNESDSENRKLPLGSRTARSRHAVASPSPVEKAQKNQALVE 99 Query: 843 XXXXXXXXXXXXXXXGKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCG 664 KPWNLRPR+ V+ K EI + +N A Sbjct: 100 REGGEVDEGEGEESVQKPWNLRPRKAVS----------KSPIEIGVAPKNGELQEAVPGV 149 Query: 663 PSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXX 484 P S+ P + + G S + WISLSREEIEEDI+ +TGS Sbjct: 150 PHSE----NQPKSLRLRGFAESHSSEKKEKRK-----FWISLSREEIEEDIFVMTGSKPA 200 Query: 483 XXXXXXXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 K VQKQ+D VFPGL+LVG+ DSYR+ ++ Sbjct: 201 RRPKKRAKNVQKQLDNVFPGLWLVGVTPDSYRLPDA 236 >ref|XP_002531462.1| conserved hypothetical protein [Ricinus communis] gi|223528916|gb|EEF30912.1| conserved hypothetical protein [Ricinus communis] Length = 265 Score = 84.0 bits (206), Expect = 1e-13 Identities = 89/290 (30%), Positives = 119/290 (41%), Gaps = 27/290 (9%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAPVK Q LHNF + L+W S NN Sbjct: 1 MATAPVKPQQLHNFPIS-LKWGQTTTTTTISANHQHHHHNRSSSS-----NNQ------- 47 Query: 993 XSRAATAV----AESDPD-NDDTSNAPQVVKTSIRTPRKPFSFAACSS----------QR 859 R AT V ESDPD + T P+V S R R +SFA+CS+ Q+ Sbjct: 48 --RLATPVHESETESDPDQSQSTIRHPRVGSRSARVHR--YSFASCSTLLPKAKTEIPQK 103 Query: 858 KEASKXXXXXXXXXXXXXXXG------------KPWNLRPRRFVTLPAASFKKGEKMSDE 715 EA++ +PW LRPR+ + L +S + + +E Sbjct: 104 PEATEKPQQKNLAVLENNNKNEAEEIEEEDSSSRPWKLRPRKGI-LTGSSKETATLLGNE 162 Query: 714 IVLYQRNDNSSSAGGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLS 535 QR+ + P S +R V + + G +G K W++LS Sbjct: 163 ----QRDSTT-------PKSMRLRGLVDS---TSSGLGVGLGNGVSLEKKEKRKFWVALS 208 Query: 534 REEIEEDIYSLTGSXXXXXXXXXXKTVQKQMDTVFPGLFLVGMNVDSYRV 385 REEIEED++ LTGS K VQK +D+VFPGL+LVG DSYRV Sbjct: 209 REEIEEDVFVLTGSRPARRPKKRPKNVQKILDSVFPGLWLVGTTADSYRV 258 >gb|ABD65177.1| hypothetical protein 40.t00065 [Brassica oleracea] Length = 237 Score = 82.0 bits (201), Expect = 5e-13 Identities = 64/213 (30%), Positives = 92/213 (43%), Gaps = 12/213 (5%) Frame = -2 Query: 978 TAVAESDPDNDDTSNAPQVVKTSI-----RTPRKPFSFAACSSQR-------KEASKXXX 835 +AV + DP +D + P V ++ R PR FS A SS+R E + Sbjct: 26 SAVTDVDPKSDPSPETPPVSNRTVASRSSRQPRLSFSSLAPSSERDHQKKVKSEENPPRR 85 Query: 834 XXXXXXXXXXXXGKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPSS 655 + WNLRPR+ AS K +K + + N P+ Sbjct: 86 EEVPVSAEEDEEKRKWNLRPRKACGGGGASEAKNQKPVAAVAEAKSNRQRGI-----PAE 140 Query: 654 KPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXXXXX 475 P G+ GG +E LW++LSR+EIEED++S++G+ Sbjct: 141 SP-------GLGGGGGVEAKNENHR---------LWVALSRDEIEEDVFSMSGNRPSRRP 184 Query: 474 XXXXKTVQKQMDTVFPGLFLVGMNVDSYRVHES 376 KT+QK +D +FPGL LVGMN D +RV S Sbjct: 185 RKRTKTLQKHLDVIFPGLCLVGMNADCFRVSTS 217 >ref|XP_006440253.1| hypothetical protein CICLE_v10022000mg [Citrus clementina] gi|557542515|gb|ESR53493.1| hypothetical protein CICLE_v10022000mg [Citrus clementina] Length = 236 Score = 74.7 bits (182), Expect = 8e-11 Identities = 75/258 (29%), Positives = 95/258 (36%), Gaps = 4/258 (1%) Frame = -2 Query: 1173 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXHRFRRRGSPPVLHYLVNNXXXXXXXX 994 M TAP+KSQPLHNF L L+W R R PP Sbjct: 1 MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHN------RTRTPPPT--------------- 39 Query: 993 XSRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 814 E D +D T + V S R R F + Q+ + Sbjct: 40 ---------EPDTTDDSTRHHRVVGSRSSRAQRLSFPSSTSKPQQDAVERPQRQTADTEE 90 Query: 813 XXXXXG-KPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPSSKPIRFK 637 +PWNLRPR K E + D V R DN+++ P S +R Sbjct: 91 EEEDEVGRPWNLRPR----------KVQETLVDVAVFQNRGDNNANTKA--PKSTRLREM 138 Query: 636 VPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXXXXXXXXXKT 457 V E G E+ W++LSREEIEEDI+ +TGS K Sbjct: 139 V----ESRGSNGDKKEKNK---------FWVTLSREEIEEDIFIMTGSRPARRPRKRPKN 185 Query: 456 VQKQMDTVF---PGLFLV 412 VQKQ+D + PG FLV Sbjct: 186 VQKQLDVRYFCSPGFFLV 203 >ref|XP_006285401.1| hypothetical protein CARUB_v10006806mg, partial [Capsella rubella] gi|482554106|gb|EOA18299.1| hypothetical protein CARUB_v10006806mg, partial [Capsella rubella] Length = 256 Score = 73.6 bits (179), Expect = 2e-10 Identities = 51/148 (34%), Positives = 68/148 (45%), Gaps = 8/148 (5%) Frame = -2 Query: 795 KPWNLRPRRFVTLPAASFKKGEKMSDEIVLY--------QRNDNSSSAGGCGPSSKPIRF 640 + WNLRPR+ KKG + N S GG P S R Sbjct: 117 RTWNLRPRKAY---GGGLKKGNGVFTAEACVGVGGGGGASEVKNQKSGGGMEPKSNRQR- 172 Query: 639 KVPAGVEVNGGPNLGSERQXXXXXXXXXKLWISLSREEIEEDIYSLTGSXXXXXXXXXXK 460 +PA GG + +E LW++LSR+EIEED++S+ GS K Sbjct: 173 GIPAESPGLGGGEVANENHR---------LWVALSRDEIEEDLFSMCGSRPSRRPRKRTK 223 Query: 459 TVQKQMDTVFPGLFLVGMNVDSYRVHES 376 T+QK +D +FPGL LVGMN D ++V S Sbjct: 224 TLQKYLDVIFPGLCLVGMNADCFKVSNS 251