BLASTX nr result
ID: Catharanthus23_contig00018938
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00018938
         (1387 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004245789.1| PREDICTED: uncharacterized protein LOC101263...  110   1e-21
ref|XP_006359254.1| PREDICTED: uncharacterized protein LOC102604...  107   9e-21
ref|XP_004152391.1| PREDICTED: uncharacterized protein LOC101222...  102   5e-19
ref|XP_003549926.1| PREDICTED: uncharacterized protein LOC100812...  100   2e-18
gb|EXC02099.1| hypothetical protein L484_024064 [Morus notabilis]     99   4e-18
ref|XP_006477140.1| PREDICTED: uncharacterized protein LOC102618...   99   5e-18
ref|XP_003608674.1| hypothetical protein MTR_4g100570 [Medicago ...   97   1e-17
gb|ESW27685.1| hypothetical protein PHAVU_003G223000g, partial [...   97   2e-17
ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citr...   96   3e-17
ref|XP_004299406.1| PREDICTED: uncharacterized protein LOC101293...   96   5e-17
ref|XP_004508940.1| PREDICTED: uncharacterized protein LOC101492...   94   1e-16
gb|EOY24267.1| Uncharacterized protein isoform 4 [Theobroma cacao]    94   2e-16
gb|EOY24264.1| Uncharacterized protein isoform 1 [Theobroma caca...   94   2e-16
gb|EMJ10735.1| hypothetical protein PRUPE_ppa010718mg [Prunus pe...   93   2e-16
ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferas...   93   3e-16
ref|XP_003635203.1| PREDICTED: uncharacterized protein LOC100853...   91   1e-15
ref|XP_002531462.1| conserved hypothetical protein [Ricinus comm...   84   1e-13
gb|ABD65177.1| hypothetical protein 40.t00065 [Brassica oleracea]     82   5e-13
ref|XP_006440253.1| hypothetical protein CICLE_v10022000mg [Citr...
75 9e-11 ref|XP_006285401.1| hypothetical protein CARUB_v10006806mg, part... 74 2e-10 >ref|XP_004245789.1| PREDICTED: uncharacterized protein LOC101263341 isoform 1 [Solanum lycopersicum] gi|460400536|ref|XP_004245790.1| PREDICTED: uncharacterized protein LOC101263341 isoform 2 [Solanum lycopersicum] Length = 219 Score = 110 bits (276), Expect = 1e-21 Identities = 85/272 (31%), Positives = 113/272 (41%), Gaps = 4/272 (1%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAPVKSQPLH F LP L+W RFRRR SPP Sbjct: 1 MATAPVKSQPLHYFSLPQLKWGNKSNTNANH----RFRRRDSPP---------------S 41 Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 569 T A+ D +D P+ P ++ + + K Sbjct: 42 NGDNPTQTADVDGGSDSEKVQPRS-----EAEADPNGVSSLQGREEHEEKVKEEEEEEVG 96 Query: 570 XXXXXXKPWNLRPRRFVTLPAASFKKGEKM----SDEIVLYQRNDNSSSAGGCGPSSKPI 737 K WNLRPRR VT + K +M S+ + QR +++ G G K Sbjct: 97 CEEGEVKLWNLRPRRGVTKVETTSLKNVEMRVESSNHMQRSQRLKDNADGNGVGSGKK-- 154 Query: 738 RFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXX 917 G ++ LWISLSREEIEED+YS+TGS Sbjct: 155 ----------------GKKK-----------LWISLSREEIEEDVYSMTGSRPARRPKKR 187 Query: 918 XXTVQKQMDTVFPGLFLVGMNVDSYRVHESLR 1013 T+QKQ+D VFPGL+LVG+ DS+RV+++ + Sbjct: 188 SKTIQKQLDNVFPGLYLVGVTADSFRVNDTTK 219 >ref|XP_006359254.1| PREDICTED: uncharacterized protein LOC102604791 [Solanum tuberosum] Length = 220 Score = 107 bits (268), Expect = 9e-21 Identities = 87/272 (31%), Positives = 111/272 (40%), Gaps = 4/272 (1%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M APVKSQPLH F LP L+W RFRRR SPP Sbjct: 1 MAAAPVKSQPLHYFSLPQLKWGNKSHTNANH----RFRRRDSPP---------------- 40 Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 569 +D D S Q + P S KE + Sbjct: 41 -SNGDNPPQTADVDGGSDSEKVQPRSEAEADPNGVSSLQGEDEHEKEVKEEEEEEEVGCE 99 Query: 570 XXXXXXKPWNLRPRRFVT-LPAASFKKGE---KMSDEIVLYQRNDNSSSAGGCGPSSKPI 737 K WNLRPRR VT + AS K E + S+ + QR +++ G G K Sbjct: 100 
EGEV--KLWNLRPRRGVTKVETASLKNVEMRVESSNHMQRSQRLKDNADGNGVGSGKK-- 155 Query: 738 RFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXX 917 G ++ LWISLSREEIEED+YS+TGS Sbjct: 156 ----------------GKKK-----------LWISLSREEIEEDVYSMTGSRPARRPKKR 188 Query: 918 XXTVQKQMDTVFPGLFLVGMNVDSYRVHESLR 1013 T+QKQ+D VFPGL+LVG+ DS+RV+++ + Sbjct: 189 SKTIQKQLDNVFPGLYLVGLTADSFRVNDTTK 220 >ref|XP_004152391.1| PREDICTED: uncharacterized protein LOC101222282 [Cucumis sativus] gi|449488652|ref|XP_004158130.1| PREDICTED: uncharacterized LOC101222282 [Cucumis sativus] Length = 246 Score = 102 bits (253), Expect = 5e-19 Identities = 85/277 (30%), Positives = 112/277 (40%), Gaps = 11/277 (3%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M T PVKSQPLHNF LP L+W R RR Sbjct: 1 MATGPVKSQPLHNFALPFLKWGGKNQTNSNH----RIRRA----------------IGGG 40 Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRK-----------EAS 536 ++ AV S+P+++ S PQ+ + RT R +F+ CS K E Sbjct: 41 GGDSSPAVDHSEPESEADSK-PQL-RVGSRTVRNRLAFSPCSLGDKFAKHSEGEVGDEVV 98 Query: 537 KXXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGC 716 K KPWNLRPR+ +L K E+ + +S G Sbjct: 99 KEQKREGEEVEGEEIVQKPWNLRPRKGTSLRGYGDLKNGGDLQEMDGAVSSAAGASQQGE 158 Query: 717 GPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXX 896 P K +R + G WI+LSR+EIEEDI+ +TGS Sbjct: 159 NPQPKSLRLR-------------GFTESHRIEKKDKRKFWIALSRDEIEEDIFIMTGSRP 205 Query: 897 XXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 VQKQ+DTVFPGL+LVG+ DSYR+ +S Sbjct: 206 SRRPKKRPKNVQKQLDTVFPGLWLVGVTADSYRLADS 242 >ref|XP_003549926.1| PREDICTED: uncharacterized protein LOC100812835 isoform X1 [Glycine max] gi|571536516|ref|XP_006600845.1| PREDICTED: uncharacterized protein LOC100812835 isoform X2 [Glycine max] Length = 237 Score = 100 bits (248), Expect = 2e-18 Identities = 85/270 (31%), Positives = 106/270 (39%), Gaps = 7/270 (2%) Frame = +3 Query: 219 APVKSQPLHNFDLPHLRWXXXXXXXXXXXXXX--RFRRRGSPPVLHYLVNNXXXXXXXXX 392 APVKSQPLHNF LP L+W RFRR P H Sbjct: 8 
APVKSQPLHNFALPFLKWGASGKNNTTTTAAHHHRFRR----PSDH-------------- 49 Query: 393 XRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXXX 572 S+PD+ D + P + RT R FS + Sbjct: 50 --------ASEPDSSDPDSRPH--RLGSRTARNRFSLPL---KPPPPPPPQLHEAEHDDA 96 Query: 573 XXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCG-----PSSKPI 737 KPWNLRPR+ LP A+ + G S N GG G P+ K + Sbjct: 97 DDAVQKPWNLRPRKPALLPKAALEIGTGPSRNHHHATNNGEFHDGGGGGGDNNNPAPKSL 156 Query: 738 RFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXX 917 R + G WI+LSREEIEEDI+ +TGS Sbjct: 157 RLR-------------GFSDTPCSVKKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKR 203 Query: 918 XXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 VQKQMD+VFPGL+LVG+ D+YRV ++ Sbjct: 204 PKNVQKQMDSVFPGLWLVGITADAYRVADT 233 >gb|EXC02099.1| hypothetical protein L484_024064 [Morus notabilis] Length = 268 Score = 99.0 bits (245), Expect = 4e-18 Identities = 85/279 (30%), Positives = 117/279 (41%), Gaps = 16/279 (5%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAPVKS PLHNF LP L+W R S PV + Sbjct: 1 MATAPVKS-PLHNFPLPFLKWGGGKNHASGSHRCRRTISADSSPVADHC----------- 48 Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFS--FAACS--SQRKEASKXXXXXX 557 AE + + + + + RT R F+ FA+CS S++KE+ + Sbjct: 49 ------DAAEQERNESSEAEPNRFHRVGSRTVRNRFAAPFASCSLVSEKKESDEVAAGEG 102 Query: 558 XXXXXXXXXX----------KPWNLRPRRFVTLPAAS--FKKGEKMSDEIVLYQRNDNSS 701 KPWNLRPR+ + AA+ K GE E + Sbjct: 103 KEGDDREVEAAAGEEEMMVQKPWNLRPRKALFSKAATNGAKSGELPEQE----------N 152 Query: 702 SAGGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSL 881 + G G S+ + + P + + G L +Q WI+LSREEIEEDI+ + Sbjct: 153 AVAGGGHQSENLNQQPPKSMRLRG---LSESQQSSEKEKRK--FWIALSREEIEEDIFVM 207 Query: 882 TGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRV 998 TGS VQKQ+D VFPGL+LVG+ D+YR+ Sbjct: 208 TGSRPARRPRKRPKNVQKQLDAVFPGLWLVGITADAYRI 246 >ref|XP_006477140.1| PREDICTED: uncharacterized protein LOC102618144 isoform X1 [Citrus sinensis] Length = 216 Score = 98.6 bits (244), Expect = 5e-18 Identities = 87/274 (31%), Positives = 
109/274 (39%), Gaps = 8/274 (2%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAP+KSQPLHNF L L+W R R PP Sbjct: 1 MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHN------RTRTPPPT--------------- 39 Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPR--------KPFSFAACSSQRKEASKXX 545 E D +D T + V S R R KP A SQR+ A Sbjct: 40 ---------EPDTTDDSTRHHRVVGSRSSRAQRLSFPCSTSKPHQDAGDRSQRQTADTEE 90 Query: 546 XXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPS 725 +PWNLRPR K E + D V R DN+++ P Sbjct: 91 EEEDEVG-------RPWNLRPR----------KVQETLVDVAVFQNRGDNNANTKA--PK 131 Query: 726 SKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXX 905 S +R V E G E+ W++LSREEIEEDI+ +TGS Sbjct: 132 STRLREMV----ESRGSNGDKKEKNK---------FWVTLSREEIEEDIFIMTGSRPARR 178 Query: 906 XXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 VQKQ+D VFPGL+LVG+ VD+YRV ++ Sbjct: 179 PRKRPKNVQKQLDNVFPGLWLVGLTVDAYRVSDA 212 >ref|XP_003608674.1| hypothetical protein MTR_4g100570 [Medicago truncatula] gi|355509729|gb|AES90871.1| hypothetical protein MTR_4g100570 [Medicago truncatula] Length = 243 Score = 97.4 bits (241), Expect = 1e-17 Identities = 88/289 (30%), Positives = 117/289 (40%), Gaps = 23/289 (7%) Frame = +3 Query: 210 MGTAP--VKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXX 383 M T P VKSQPLHNF LP L+W R RR P H Sbjct: 1 MATTPASVKSQPLHNFSLPFLKWGGTGKNNTNATNHHRSRR----PPDH----------- 45 Query: 384 XXXXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRK------------ 527 S+PD++ S ++ RT R F FA+ SSQR+ Sbjct: 46 -----------ASEPDSEPDSRPHRL---GSRTARNRFGFASSSSQRQAPPTPSSNNETD 91 Query: 528 -----EASKXXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRND 692 KPWNLRPR+ + +P F+ G S RN+ Sbjct: 92 DNAGDRKRDAEDDAEAGGGAEEIVQKPWNLRPRKPM-IPRGGFEIGAGGS-------RNN 143 Query: 693 NSSS----AGGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEI 860 N G P+ K +R + A N G +++ WI+LS++EI Sbjct: 144 NGGELQEGVNGENPAPKSLRLRGFADT------NCGEKKEKRK-------FWIALSKDEI 190 Query: 861 EEDIYSLTGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 EEDI+ +TGS 
VQKQMD VFPGL+LVG+ D+YRV ++ Sbjct: 191 EEDIFVMTGSRPNRRPRKRAKNVQKQMDNVFPGLWLVGITADAYRVADT 239 >gb|ESW27685.1| hypothetical protein PHAVU_003G223000g, partial [Phaseolus vulgaris] gi|561029046|gb|ESW27686.1| hypothetical protein PHAVU_003G223000g, partial [Phaseolus vulgaris] Length = 306 Score = 97.1 bits (240), Expect = 2e-17 Identities = 89/294 (30%), Positives = 119/294 (40%), Gaps = 26/294 (8%) Frame = +3 Query: 204 FLMGTAP----VKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXX 371 F M TAP VKSQPLHNF LP L+W R RR S H Sbjct: 55 FSMATAPAQPPVKSQPLHNFALPFLKWGASGKNHTNAAHHHRCRRPSSLSSDH------- 107 Query: 372 XXXXXXXXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQR--------- 524 S+PD+D S +V RT R F+ CS + Sbjct: 108 ---------------ASEPDSDPDSRPHRV---GSRTTRNRFALPTCSLKPLPPPPEPPQ 149 Query: 525 ----KEASKXXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRND 692 + + KPWNLRPR+ LP ++ + G S RN Sbjct: 150 PPSCNDETDDEAAKRDIEDAEEAVQKPWNLRPRK-PALPKSALEIGTGPS-------RNH 201 Query: 693 NSSSAG---------GCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISL 845 ++ G G P+ K +R + A + +E++ WI+L Sbjct: 202 ANNGVGEFHDGVSHHGENPAPKSLRLRGFADTQC-------AEKKEKRK------FWIAL 248 Query: 846 SREEIEEDIYSLTGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 SREEIEEDI+ +TGS VQKQMD+VFPGL+LVG+ D+YRV ++ Sbjct: 249 SREEIEEDIFVMTGSRPARRPRKRPKNVQKQMDSVFPGLWLVGITADAYRVPDT 302 >ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citrus clementina] gi|557542514|gb|ESR53492.1| hypothetical protein CICLE_v10022000mg [Citrus clementina] Length = 216 Score = 96.3 bits (238), Expect = 3e-17 Identities = 81/267 (30%), Positives = 105/267 (39%), Gaps = 1/267 (0%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAP+KSQPLHNF L L+W R R PP Sbjct: 1 MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHN------RTRTPPPT--------------- 39 Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 569 E D +D T + V S R R F + Q+ + Sbjct: 40 ---------EPDTTDDSTRHHRVVGSRSSRAQRLSFPSSTSKPQQDAVERPQRQTADTEE 90 Query: 570 
XXXXXX-KPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPSSKPIRFK 746 +PWNLRPR K E + D V R DN+++ P S +R Sbjct: 91 EEEDEVGRPWNLRPR----------KVQETLVDVAVFQNRGDNNANTKA--PKSTRLREM 138 Query: 747 VPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXXXXT 926 V E G E+ W++LSREEIEEDI+ +TGS Sbjct: 139 V----ESRGSNGDKKEKNK---------FWVTLSREEIEEDIFIMTGSRPARRPRKRPKN 185 Query: 927 VQKQMDTVFPGLFLVGMNVDSYRVHES 1007 VQKQ+D VFPGL+LVG+ D+YRV ++ Sbjct: 186 VQKQLDNVFPGLWLVGLTADAYRVSDA 212 >ref|XP_004299406.1| PREDICTED: uncharacterized protein LOC101293977 [Fragaria vesca subsp. vesca] Length = 239 Score = 95.5 bits (236), Expect = 5e-17 Identities = 86/282 (30%), Positives = 118/282 (41%), Gaps = 16/282 (5%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAPVK PLHNF L L+W R+RR PV Sbjct: 1 MATAPVKP-PLHNFPLSFLKWGSKNHTNTNH----RYRR----PV--------------- 36 Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSS---QRKEAS-------- 536 +A +D D +D+ + PQ + RT R FS A+CS QR E + Sbjct: 37 ---SAEPEPSADDDRNDSESPPQHHRVGSRTARHRFSLASCSEKLPQRNEKASEESDDDV 93 Query: 537 ----KXXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSS 704 K KPWNLRPRR A + GE +++ S Sbjct: 94 DDDAKAAAVAAVAAAEEAEVQKPWNLRPRRAPVTKANNNTGGE-------VHEAEGTKQS 146 Query: 705 AGGCGPSSKPIRFK-VPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSL 881 P+ K +R + + A E GP++ +++ WI+LS++EIEEDI+ + Sbjct: 147 EQ---PAPKSMRLRGLAAAAE---GPSMEKKKEKRK-------FWIALSKDEIEEDIFIM 193 Query: 882 TGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 TGS VQKQ+D FPGL+LVG D+YR +S Sbjct: 194 TGSRPARRPKKRPKNVQKQLDNCFPGLWLVGFTADAYRGSDS 235 >ref|XP_004508940.1| PREDICTED: uncharacterized protein LOC101492028 [Cicer arietinum] Length = 242 Score = 94.0 bits (232), Expect = 1e-16 Identities = 82/282 (29%), Positives = 112/282 (39%), Gaps = 19/282 (6%) Frame = +3 Query: 219 APVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXXXXR 398 APVKSQPLHNF LP L+W R RR P H Sbjct: 6 APVKSQPLHNFSLPFLKWGGTGKNHTNSNNHQRSRR----PPDH---------------- 45 Query: 399 
AATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRK------------EASKX 542 A +PD++ S ++ RT R F + SS + +A Sbjct: 46 -----ASPEPDSEPDSRPHRL---GSRTARNRFGLPSSSSSHRHATVSSNHETDDDAGDR 97 Query: 543 XXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSA----- 707 KPWNLRPR+ + +P +F+ G S RN+++ Sbjct: 98 KREGEDEAGAEEIVQKPWNLRPRKPM-IPRGAFEIGAGGS-------RNNHNGGELVEAV 149 Query: 708 --GGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSL 881 G P+ K +R + G WI+LS+EEIEEDI+ + Sbjct: 150 NNNGDNPTPKSLRLR-------------GFADTSCTEKKEKRKFWIALSKEEIEEDIFVM 196 Query: 882 TGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 TGS VQKQMD+VFPGL+LVG+ D+YRV ++ Sbjct: 197 TGSRPNRRPRKRPKNVQKQMDSVFPGLWLVGITADAYRVADT 238 >gb|EOY24267.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 227 Score = 93.6 bits (231), Expect = 2e-16 Identities = 84/277 (30%), Positives = 110/277 (39%), Gaps = 11/277 (3%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAPVKSQPLHNF+ P L+W R SP Sbjct: 1 MATAPVKSQPLHNFNFPFLKWGTHGGGGSSTSSADH---RRSP----------------- 40 Query: 390 XXRAATAVAESDPDNDDT------SNAPQVVKTSIRTPRKPF--SFAACSSQRKEAS--K 539 ESD D+D S + ++ + S P KP S Q++E K Sbjct: 41 ---------ESDSDHDRLRPTRVGSRSTRIQRLSFLPPPKPIKQSHGEDEEQQQEEQPLK 91 Query: 540 XXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKG-EKMSDEIVLYQRNDNSSSAGGC 716 +PWNLRPR+ V A EK+S+ Sbjct: 92 PHKNEAEEEEEEETVQRPWNLRPRKVVVETTAVVTTAMEKVSET---------------A 136 Query: 717 GPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXX 896 P S +R G+ NGG E++ WI+LSREEIEEDI+ +TGS Sbjct: 137 APKSMRLR-----GLAENGGIVEKKEKRK---------FWIALSREEIEEDIFVMTGSRP 182 Query: 897 XXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 +QKQ+D VFPGL+LVG D+YRV ++ Sbjct: 183 ARRPKKRPKNIQKQLDAVFPGLWLVGTTADAYRVADA 219 >gb|EOY24264.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508777009|gb|EOY24265.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508777010|gb|EOY24266.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508777012|gb|EOY24268.1| 
Uncharacterized protein isoform 1 [Theobroma cacao] Length = 223 Score = 93.6 bits (231), Expect = 2e-16 Identities = 84/277 (30%), Positives = 110/277 (39%), Gaps = 11/277 (3%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAPVKSQPLHNF+ P L+W R SP Sbjct: 1 MATAPVKSQPLHNFNFPFLKWGTHGGGGSSTSSADH---RRSP----------------- 40 Query: 390 XXRAATAVAESDPDNDDT------SNAPQVVKTSIRTPRKPF--SFAACSSQRKEAS--K 539 ESD D+D S + ++ + S P KP S Q++E K Sbjct: 41 ---------ESDSDHDRLRPTRVGSRSTRIQRLSFLPPPKPIKQSHGEDEEQQQEEQPLK 91 Query: 540 XXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKG-EKMSDEIVLYQRNDNSSSAGGC 716 +PWNLRPR+ V A EK+S+ Sbjct: 92 PHKNEAEEEEEEETVQRPWNLRPRKVVVETTAVVTTAMEKVSET---------------A 136 Query: 717 GPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXX 896 P S +R G+ NGG E++ WI+LSREEIEEDI+ +TGS Sbjct: 137 APKSMRLR-----GLAENGGIVEKKEKRK---------FWIALSREEIEEDIFVMTGSRP 182 Query: 897 XXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 +QKQ+D VFPGL+LVG D+YRV ++ Sbjct: 183 ARRPKKRPKNIQKQLDAVFPGLWLVGTTADAYRVADA 219 >gb|EMJ10735.1| hypothetical protein PRUPE_ppa010718mg [Prunus persica] Length = 238 Score = 93.2 bits (230), Expect = 2e-16 Identities = 83/271 (30%), Positives = 105/271 (38%), Gaps = 5/271 (1%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAPVK PLHNF L L+W NN Sbjct: 1 MATAPVKP-PLHNFPLAFLKWGAK--------------------------NNSTTNNNHR 33 Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKT-SIRTPRKPFSFAACSS---QRKEASKXXXXXX 557 R +A S+PD++ + S R R +S C+ +R E + Sbjct: 34 YRRPVSAEPASEPDSESERTHYNNSRVGSSRASRHRYSLIPCAGDKRRRSEERESDQEEG 93 Query: 558 XXXXXXXXXXKPWNLRPRRFVTLPAA-SFKKGEKMSDEIVLYQRNDNSSSAGGCGPSSKP 734 KPWNLRPRR PA SF KG + L N N S P S Sbjct: 94 EEADKAEVVHKPWNLRPRR---APATTSFSKGGANGEPHELESPNPNQSELQQ--PKSMR 148 Query: 735 IRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXX 914 +R G V N WI+LS+EEIEEDI+ +TGS Sbjct: 149 LRGLAAEGQNVEKKEN--------------RKFWIALSKEEIEEDIFVMTGSRPARRPKK 194 Query: 915 
XXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 VQKQ+D FPGL+LVG+ D+Y+V +S Sbjct: 195 RPKNVQKQLDITFPGLWLVGVTADAYKVADS 225 >ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferase 2E-like [Glycine max] Length = 241 Score = 92.8 bits (229), Expect = 3e-16 Identities = 82/269 (30%), Positives = 108/269 (40%), Gaps = 6/269 (2%) Frame = +3 Query: 219 APVKSQPLHNFDLPHLRWXXXXXXXXXXXXXX-RFRRRGSPPVLHYLVNNXXXXXXXXXX 395 APVKSQPLHNF LP L+W RFRR P H Sbjct: 14 APVKSQPLHNFALPFLKWGASGKNNTTNAAHHHRFRR----PSDH--------------- 54 Query: 396 RAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXXXX 575 S+PD+ D + P + RT R FS + Sbjct: 55 -------ASEPDSSDPDSRPH--RLGSRTARNRFSLPL----KPPPPPPPPQPPHDDDAD 101 Query: 576 XXXXKPWNLRPRRFVTLP---AASFKKGEKMSDEIVLYQRNDNSS--SAGGCGPSSKPIR 740 KPW LRPR+ LP A G + + +N G P+ K +R Sbjct: 102 DSVQKPWKLRPRKPALLPNKTALEIGTGPSRNHHHHHHHATNNGEFLDGGDNNPAPKSLR 161 Query: 741 FKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXXX 920 + + + SE++ WI+LSREEIEEDI+ +TGS Sbjct: 162 LRGFSDTQC-------SEKKEKRK------FWIALSREEIEEDIFVMTGSRPARRPRKRP 208 Query: 921 XTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 VQKQMD+VFPGL+LVG+ D+YRV ++ Sbjct: 209 KNVQKQMDSVFPGLWLVGITADAYRVADT 237 >ref|XP_003635203.1| PREDICTED: uncharacterized protein LOC100853295 [Vitis vinifera] gi|296085701|emb|CBI29500.3| unnamed protein product [Vitis vinifera] Length = 240 Score = 90.5 bits (223), Expect = 1e-15 Identities = 82/276 (29%), Positives = 112/276 (40%), Gaps = 10/276 (3%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAPVKSQPLHNF L L+W R P Sbjct: 1 MATAPVKSQPLHNFPLSFLKWGKNQMNNHRCRKPVDALRESPPD---------------- 44 Query: 390 XXRAATAVAESDPDND-----DTSNAPQVVKTSIRTPRKPFSFAACS----SQRKEAS-K 539 ES+PD+D ++ + + + RT R + A+ S +Q+ +A + Sbjct: 45 -----GRKNESEPDSDGGSKNESDSENRKLPLGSRTARSRHAVASPSPVEKAQKNQALVE 99 Query: 540 XXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCG 719 KPWNLRPR+ V+ K EI + +N A Sbjct: 100 
REGGEVDEGEGEESVQKPWNLRPRKAVS----------KSPIEIGVAPKNGELQEAVPGV 149 Query: 720 PSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXX 899 P S+ P + + G S + WISLSREEIEEDI+ +TGS Sbjct: 150 PHSE----NQPKSLRLRGFAESHSSEKKEKRK-----FWISLSREEIEEDIFVMTGSKPA 200 Query: 900 XXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 VQKQ+D VFPGL+LVG+ DSYR+ ++ Sbjct: 201 RRPKKRAKNVQKQLDNVFPGLWLVGVTPDSYRLPDA 236 >ref|XP_002531462.1| conserved hypothetical protein [Ricinus communis] gi|223528916|gb|EEF30912.1| conserved hypothetical protein [Ricinus communis] Length = 265 Score = 84.0 bits (206), Expect = 1e-13 Identities = 87/290 (30%), Positives = 117/290 (40%), Gaps = 27/290 (9%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAPVK Q LHNF + L+W S NN Sbjct: 1 MATAPVKPQQLHNFPIS-LKWGQTTTTTTISANHQHHHHNRSSSS-----NNQ------- 47 Query: 390 XXRAATAV----AESDPD-NDDTSNAPQVVKTSIRTPRKPFSFAACSS----------QR 524 R AT V ESDPD + T P+V S R R +SFA+CS+ Q+ Sbjct: 48 --RLATPVHESETESDPDQSQSTIRHPRVGSRSARVHR--YSFASCSTLLPKAKTEIPQK 103 Query: 525 KEASKXXXXXXXXXXXXXXXX------------KPWNLRPRRFVTLPAASFKKGEKMSDE 668 EA++ +PW LRPR+ + L +S + + +E Sbjct: 104 PEATEKPQQKNLAVLENNNKNEAEEIEEEDSSSRPWKLRPRKGI-LTGSSKETATLLGNE 162 Query: 669 IVLYQRNDNSSSAGGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLS 848 QR+ + P S +R V + + G +G W++LS Sbjct: 163 ----QRDSTT-------PKSMRLRGLVDS---TSSGLGVGLGNGVSLEKKEKRKFWVALS 208 Query: 849 REEIEEDIYSLTGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRV 998 REEIEED++ LTGS VQK +D+VFPGL+LVG DSYRV Sbjct: 209 REEIEEDVFVLTGSRPARRPKKRPKNVQKILDSVFPGLWLVGTTADSYRV 258 >gb|ABD65177.1| hypothetical protein 40.t00065 [Brassica oleracea] Length = 237 Score = 82.0 bits (201), Expect = 5e-13 Identities = 63/213 (29%), Positives = 91/213 (42%), Gaps = 12/213 (5%) Frame = +3 Query: 405 TAVAESDPDNDDTSNAPQVVKTSI-----RTPRKPFSFAACSSQR-------KEASKXXX 548 +AV + DP +D + P V ++ R PR FS A SS+R E + Sbjct: 26 SAVTDVDPKSDPSPETPPVSNRTVASRSSRQPRLSFSSLAPSSERDHQKKVKSEENPPRR 85 Query: 549 
XXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPSS 728 + WNLRPR+ AS K +K + + N P+ Sbjct: 86 EEVPVSAEEDEEKRKWNLRPRKACGGGGASEAKNQKPVAAVAEAKSNRQRGI-----PAE 140 Query: 729 KPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXX 908 P G+ GG +E LW++LSR+EIEED++S++G+ Sbjct: 141 SP-------GLGGGGGVEAKNENHR---------LWVALSRDEIEEDVFSMSGNRPSRRP 184 Query: 909 XXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 T+QK +D +FPGL LVGMN D +RV S Sbjct: 185 RKRTKTLQKHLDVIFPGLCLVGMNADCFRVSTS 217 >ref|XP_006440253.1| hypothetical protein CICLE_v10022000mg [Citrus clementina] gi|557542515|gb|ESR53493.1| hypothetical protein CICLE_v10022000mg [Citrus clementina] Length = 236 Score = 74.7 bits (182), Expect = 9e-11 Identities = 74/258 (28%), Positives = 94/258 (36%), Gaps = 4/258 (1%) Frame = +3 Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389 M TAP+KSQPLHNF L L+W R R PP Sbjct: 1 MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHN------RTRTPPPT--------------- 39 Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 569 E D +D T + V S R R F + Q+ + Sbjct: 40 ---------EPDTTDDSTRHHRVVGSRSSRAQRLSFPSSTSKPQQDAVERPQRQTADTEE 90 Query: 570 XXXXXX-KPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPSSKPIRFK 746 +PWNLRPR K E + D V R DN+++ P S +R Sbjct: 91 EEEDEVGRPWNLRPR----------KVQETLVDVAVFQNRGDNNANTKA--PKSTRLREM 138 Query: 747 VPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXXXXT 926 V E G E+ W++LSREEIEEDI+ +TGS Sbjct: 139 V----ESRGSNGDKKEKNK---------FWVTLSREEIEEDIFIMTGSRPARRPRKRPKN 185 Query: 927 VQKQMDTVF---PGLFLV 971 VQKQ+D + PG FLV Sbjct: 186 VQKQLDVRYFCSPGFFLV 203 >ref|XP_006285401.1| hypothetical protein CARUB_v10006806mg, partial [Capsella rubella] gi|482554106|gb|EOA18299.1| hypothetical protein CARUB_v10006806mg, partial [Capsella rubella] Length = 256 Score = 73.6 bits (179), Expect = 2e-10 Identities = 50/148 (33%), Positives = 67/148 (45%), Gaps = 8/148 (5%) Frame = +3 Query: 588 KPWNLRPRRFVTLPAASFKKGEKMSDEIVLY--------QRNDNSSSAGGCGPSSKPIRF 743 + WNLRPR+ KKG + N S 
GG P S R Sbjct: 117 RTWNLRPRKAY---GGGLKKGNGVFTAEACVGVGGGGGASEVKNQKSGGGMEPKSNRQR- 172 Query: 744 KVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXXXX 923 +PA GG + +E LW++LSR+EIEED++S+ GS Sbjct: 173 GIPAESPGLGGGEVANENHR---------LWVALSRDEIEEDLFSMCGSRPSRRPRKRTK 223 Query: 924 TVQKQMDTVFPGLFLVGMNVDSYRVHES 1007 T+QK +D +FPGL LVGMN D ++V S Sbjct: 224 TLQKYLDVIFPGLCLVGMNADCFKVSNS 251