BLASTX nr result
ID: Sinomenium21_contig00018601
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00018601 (995 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

ref|XP_003635203.1| PREDICTED: uncharacterized protein LOC100853...  234  5e-59
ref|XP_004152391.1| PREDICTED: uncharacterized protein LOC101222...  165  2e-38
ref|XP_003608674.1| hypothetical protein MTR_4g100570 [Medicago ...  165  2e-38
ref|XP_007155691.1| hypothetical protein PHAVU_003G223000g, part...  162  2e-37
gb|EXC02099.1| hypothetical protein L484_024064 [Morus notabilis]    159  1e-36
ref|XP_004508940.1| PREDICTED: uncharacterized protein LOC101492...  158  3e-36
ref|XP_004299406.1| PREDICTED: uncharacterized protein LOC101293...  152  2e-34
ref|XP_007209536.1| hypothetical protein PRUPE_ppa010718mg [Prun...  143  1e-31
ref|XP_007052276.1| Uncharacterized protein TCM_005685 [Theobrom...  139  1e-30
ref|XP_007039763.1| Uncharacterized protein isoform 1 [Theobroma...  134  6e-29
ref|XP_002531462.1| conserved hypothetical protein [Ricinus comm...  133  1e-28
ref|XP_007039766.1| Uncharacterized protein isoform 4 [Theobroma...  132  2e-28
ref|XP_006477140.1| PREDICTED: uncharacterized protein LOC102618...  130  8e-28
ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citr...  128  3e-27
ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferas...  120  9e-25
ref|XP_003549926.1| PREDICTED: uncharacterized protein LOC100812...  120  1e-24
ref|XP_006445388.1| hypothetical protein CICLE_v10021338mg [Citr...  106  2e-20
ref|XP_006445387.1| hypothetical protein CICLE_v10021338mg [Citr...
106 2e-20 ref|XP_002279484.2| PREDICTED: uncharacterized protein LOC100244... 105 3e-20 ref|XP_007044281.1| Uncharacterized protein TCM_009632 [Theobrom... 99 2e-18 >ref|XP_003635203.1| PREDICTED: uncharacterized protein LOC100853295 [Vitis vinifera] gi|296085701|emb|CBI29500.3| unnamed protein product [Vitis vinifera] Length = 240 Score = 234 bits (596), Expect = 5e-59 Identities = 139/277 (50%), Positives = 161/277 (58%), Gaps = 6/277 (2%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKWGKNQMNNHRCRKLVDTSRESPPRDHRXXXXXXXXXXXXX 341 MATAPVKSQPLHNF L FLKWGKNQMNNHRCRK VD RESPP D R Sbjct: 1 MATAPVKSQPLHNFPLSFLKWGKNQMNNHRCRKPVDALRESPP-DGRKNESEPDSDGGSK 59 Query: 342 XXXXXXXAKNPVGSRSSKNRFSFANCSSSTTGMVAKVEKNLTVSESSAAGEEARRNLMAR 521 K P+GSR++++R + A+ S V K +KN + E Sbjct: 60 NESDSENRKLPLGSRTARSRHAVASPSP-----VEKAQKNQALVEREG------------ 102 Query: 522 LQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRGVSKAPNEIGGPCKNGELQ 701 G DEGEG E SVQKPWNLRPR+ VSK+P EIG KNGELQ Sbjct: 103 ----------------GEVDEGEGEE---SVQKPWNLRPRKAVSKSPIEIGVAPKNGELQ 143 Query: 702 EKL------ENLPKSCRLRSFAEAQNVEKKEKRRLSIALSREEIDEDFFVMTGXXXXXXX 863 E + EN PKS RLR FAE+ + EKKEKR+ I+LSREEI+ED FVMTG Sbjct: 144 EAVPGVPHSENQPKSLRLRGFAESHSSEKKEKRKFWISLSREEIEEDIFVMTGSKPARRP 203 Query: 864 XXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAKK 974 N+QK +DNVFPGLWLVG+T DSY +PD PAK+ Sbjct: 204 KKRAKNVQKQLDNVFPGLWLVGVTPDSYRLPDAPAKR 240 >ref|XP_004152391.1| PREDICTED: uncharacterized protein LOC101222282 [Cucumis sativus] gi|449488652|ref|XP_004158130.1| PREDICTED: uncharacterized LOC101222282 [Cucumis sativus] Length = 246 Score = 165 bits (418), Expect = 2e-38 Identities = 118/290 (40%), Positives = 143/290 (49%), Gaps = 19/290 (6%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKW-GKNQMN-NHRCRKLV--DTSRESPPRDHRXXXXXXXXX 329 MAT PVKSQPLHNF+LPFLKW GKNQ N NHR R+ + SP DH Sbjct: 1 MATGPVKSQPLHNFALPFLKWGGKNQTNSNHRIRRAIGGGGGDSSPAVDHSEPESEADSK 60 Query: 330 XXXXXXXXXXXAKNPVGSRSSKNRFSFANCSSSTTGMVAKVEKNLTVSESSAAGEEARRN 509 VGSR+ +NR +F+ CS +K SE E + Sbjct: 61 
PQLR-----------VGSRTVRNRLAFSPCSLG--------DKFAKHSEGEVGDEVVKEQ 101 Query: 510 LMARLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRGVSKAPNEIGGPCKN 689 EGE E + VQKPWNLRPR+G S G KN Sbjct: 102 ----------------------KREGEEVEGEEIVQKPWNLRPRKGTSLRGY---GDLKN 136 Query: 690 -GELQE-------------KLEN-LPKSCRLRSFAEAQNVEKKEKRRLSIALSREEIDED 824 G+LQE + EN PKS RLR F E+ +EKK+KR+ IALSR+EI+ED Sbjct: 137 GGDLQEMDGAVSSAAGASQQGENPQPKSLRLRGFTESHRIEKKDKRKFWIALSRDEIEED 196 Query: 825 FFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAKK 974 F+MTG N+QK +D VFPGLWLVG+TADSY + D PAK+ Sbjct: 197 IFIMTGSRPSRRPKKRPKNVQKQLDTVFPGLWLVGVTADSYRLADSPAKR 246 >ref|XP_003608674.1| hypothetical protein MTR_4g100570 [Medicago truncatula] gi|355509729|gb|AES90871.1| hypothetical protein MTR_4g100570 [Medicago truncatula] Length = 243 Score = 165 bits (418), Expect = 2e-38 Identities = 119/291 (40%), Positives = 146/291 (50%), Gaps = 20/291 (6%) Frame = +3 Query: 162 MATAP--VKSQPLHNFSLPFLKWG------KNQMNNHRCRKLVDTSRE--SPP--RDHRX 305 MAT P VKSQPLHNFSLPFLKWG N N+HR R+ D + E S P R HR Sbjct: 1 MATTPASVKSQPLHNFSLPFLKWGGTGKNNTNATNHHRSRRPPDHASEPDSEPDSRPHR- 59 Query: 306 XXXXXXXXXXXXXXXXXXXAKNPVGSRSSKNRFSFANCSSSTTGMVAKVEKNLTVSESSA 485 +GSR+++NRF FA+ SS N T Sbjct: 60 -----------------------LGSRTARNRFGFASSSSQRQAPPTPSSNNET---DDN 93 Query: 486 AGEEARRNLMARLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRG-VSKAP 662 AG+ R + D GG + VQKPWNLRPR+ + + Sbjct: 94 AGDRKR---------------------DAEDDAEAGGGAEEIVQKPWNLRPRKPMIPRGG 132 Query: 663 NEIG-GPCKN---GELQEKLEN---LPKSCRLRSFAEAQNVEKKEKRRLSIALSREEIDE 821 EIG G +N GELQE + PKS RLR FA+ EKKEKR+ IALS++EI+E Sbjct: 133 FEIGAGGSRNNNGGELQEGVNGENPAPKSLRLRGFADTNCGEKKEKRKFWIALSKDEIEE 192 Query: 822 DFFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAKK 974 D FVMTG N+QK +DNVFPGLWLVG+TAD+Y + D P K+ Sbjct: 193 DIFVMTGSRPNRRPRKRAKNVQKQMDNVFPGLWLVGITADAYRVADTPTKR 243 >ref|XP_007155691.1| hypothetical protein PHAVU_003G223000g, partial [Phaseolus vulgaris] gi|593785303|ref|XP_007155692.1| hypothetical protein 
PHAVU_003G223000g, partial [Phaseolus vulgaris] gi|561029045|gb|ESW27685.1| hypothetical protein PHAVU_003G223000g, partial [Phaseolus vulgaris] gi|561029046|gb|ESW27686.1| hypothetical protein PHAVU_003G223000g, partial [Phaseolus vulgaris] Length = 306 Score = 162 bits (411), Expect = 2e-37 Identities = 115/304 (37%), Positives = 153/304 (50%), Gaps = 27/304 (8%) Frame = +3 Query: 144 FERKSAMATAPVKSQPLHNFSLPFLKWG---KNQMN---NHRCRKLVDTSRE------SP 287 F +A A PVKSQPLHNF+LPFLKWG KN N +HRCR+ S + S Sbjct: 55 FSMATAPAQPPVKSQPLHNFALPFLKWGASGKNHTNAAHHHRCRRPSSLSSDHASEPDSD 114 Query: 288 P--RDHRXXXXXXXXXXXXXXXXXXXXAKNPVGSRSSKNRFSFANCSSSTTGMVAKVEKN 461 P R HR VGSR+++NRF+ CS + + Sbjct: 115 PDSRPHR------------------------VGSRTTRNRFALPTCSLKPLPPPPEPPQP 150 Query: 462 LTVSESSAAGEEARRNLMARLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPR 641 + ++ + E A+R++ + + +VQKPWNLRPR Sbjct: 151 PSCNDETD-DEAAKRDIE---------------------------DAEEAVQKPWNLRPR 182 Query: 642 R-GVSKAPNEIG-GPCKN------GELQEKLEN-----LPKSCRLRSFAEAQNVEKKEKR 782 + + K+ EIG GP +N GE + + + PKS RLR FA+ Q EKKEKR Sbjct: 183 KPALPKSALEIGTGPSRNHANNGVGEFHDGVSHHGENPAPKSLRLRGFADTQCAEKKEKR 242 Query: 783 RLSIALSREEIDEDFFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDV 962 + IALSREEI+ED FVMTG N+QK +D+VFPGLWLVG+TAD+Y +PD Sbjct: 243 KFWIALSREEIEEDIFVMTGSRPARRPRKRPKNVQKQMDSVFPGLWLVGITADAYRVPDT 302 Query: 963 PAKK 974 P K+ Sbjct: 303 PTKR 306 >gb|EXC02099.1| hypothetical protein L484_024064 [Morus notabilis] Length = 268 Score = 159 bits (403), Expect = 1e-36 Identities = 120/291 (41%), Positives = 152/291 (52%), Gaps = 20/291 (6%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKWG--KNQMN-NHRCRKLVDTSRESPPRDHRXXXXXXXXXX 332 MATAPVKS PLHNF LPFLKWG KN + +HRCR+ + SP DH Sbjct: 1 MATAPVKS-PLHNFPLPFLKWGGGKNHASGSHRCRRTISAD-SSPVADHCDAAEQERNES 58 Query: 333 XXXXXXXXXXAKNPVGSRSSKNRFS--FANCSSSTTGMVAKVEKNLTVSESSAAGEEARR 506 + VGSR+ +NRF+ FA+CS +V++ ++ S+ AAGE Sbjct: 59 SEAEPNRF----HRVGSRTVRNRFAAPFASCS-----LVSEKKE----SDEVAAGEGKEG 105 Query: 507 
NLMARLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRGV-SKAPNEIGGPC 683 + E GE ++ VQKPWNLRPR+ + SKA Sbjct: 106 D--------------------DREVEAAAGEEEMMVQKPWNLRPRKALFSKAATN---GA 142 Query: 684 KNGELQEK----------LENL----PKSCRLRSFAEAQNVEKKEKRRLSIALSREEIDE 821 K+GEL E+ ENL PKS RLR +E+Q +KEKR+ IALSREEI+E Sbjct: 143 KSGELPEQENAVAGGGHQSENLNQQPPKSMRLRGLSESQQSSEKEKRKFWIALSREEIEE 202 Query: 822 DFFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAKK 974 D FVMTG N+QK +D VFPGLWLVG+TAD+Y I D PAK+ Sbjct: 203 DIFVMTGSRPARRPRKRPKNVQKQLDAVFPGLWLVGITADAYRIVDAPAKE 253 >ref|XP_004508940.1| PREDICTED: uncharacterized protein LOC101492028 [Cicer arietinum] Length = 242 Score = 158 bits (400), Expect = 3e-36 Identities = 113/292 (38%), Positives = 144/292 (49%), Gaps = 22/292 (7%) Frame = +3 Query: 165 ATAPVKSQPLHNFSLPFLKWG------KNQMNNHRCRKLVDTSRESPP-----RDHRXXX 311 A APVKSQPLHNFSLPFLKWG N N+ R R+ D + P R HR Sbjct: 4 APAPVKSQPLHNFSLPFLKWGGTGKNHTNSNNHQRSRRPPDHASPEPDSEPDSRPHR--- 60 Query: 312 XXXXXXXXXXXXXXXXXAKNPVGSRSSKNRFSFANCSSSTTGMVAKVEKNLTVSESSAAG 491 +GSR+++NRF + SSS ++ TVS + Sbjct: 61 ---------------------LGSRTARNRFGLPSSSSS--------HRHATVSSNHETD 91 Query: 492 EEARRNLMARLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRG-VSKAPNE 668 ++A E E G ++ VQKPWNLRPR+ + + E Sbjct: 92 DDAGDRKRE--------------------GEDEAGAEEI-VQKPWNLRPRKPMIPRGAFE 130 Query: 669 IG-GPCKN----GELQEKLEN-----LPKSCRLRSFAEAQNVEKKEKRRLSIALSREEID 818 IG G +N GEL E + N PKS RLR FA+ EKKEKR+ IALS+EEI+ Sbjct: 131 IGAGGSRNNHNGGELVEAVNNNGDNPTPKSLRLRGFADTSCTEKKEKRKFWIALSKEEIE 190 Query: 819 EDFFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAKK 974 ED FVMTG N+QK +D+VFPGLWLVG+TAD+Y + D P K+ Sbjct: 191 EDIFVMTGSRPNRRPRKRPKNVQKQMDSVFPGLWLVGITADAYRVADTPTKR 242 >ref|XP_004299406.1| PREDICTED: uncharacterized protein LOC101293977 [Fragaria vesca subsp. 
vesca] Length = 239 Score = 152 bits (384), Expect = 2e-34 Identities = 115/292 (39%), Positives = 142/292 (48%), Gaps = 21/292 (7%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKWG-KNQMN-NHRCRKLVD-----------TSRESPPRDHR 302 MATAPVK PLHNF L FLKWG KN N NHR R+ V ESPP+ HR Sbjct: 1 MATAPVKP-PLHNFPLSFLKWGSKNHTNTNHRYRRPVSAEPEPSADDDRNDSESPPQHHR 59 Query: 303 XXXXXXXXXXXXXXXXXXXXAKNPVGSRSSKNRFSFANCSSSTTGMVAKVEKNLTVSESS 482 VGSR++++RFS A+CS ++N SE S Sbjct: 60 ------------------------VGSRTARHRFSLASCSEKLP------QRNEKASEES 89 Query: 483 --AAGEEARRNLMARLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRG-VS 653 ++A+ +A + +E E VQKPWNLRPRR V+ Sbjct: 90 DDDVDDDAKAAAVAAV---------------AAAEEAE-------VQKPWNLRPRRAPVT 127 Query: 654 KAPNEIGGPCKNGELQEKLEN-LPKSCRLRSFAEAQN----VEKKEKRRLSIALSREEID 818 KA N GG E ++ E PKS RLR A A +KKEKR+ IALS++EI+ Sbjct: 128 KANNNTGGEVHEAEGTKQSEQPAPKSMRLRGLAAAAEGPSMEKKKEKRKFWIALSKDEIE 187 Query: 819 EDFFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAKK 974 ED F+MTG N+QK +DN FPGLWLVG TAD+Y D P KK Sbjct: 188 EDIFIMTGSRPARRPKKRPKNVQKQLDNCFPGLWLVGFTADAYRGSDSPTKK 239 >ref|XP_007209536.1| hypothetical protein PRUPE_ppa010718mg [Prunus persica] gi|462405271|gb|EMJ10735.1| hypothetical protein PRUPE_ppa010718mg [Prunus persica] Length = 238 Score = 143 bits (360), Expect = 1e-31 Identities = 112/289 (38%), Positives = 137/289 (47%), Gaps = 19/289 (6%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKWG-KNQM---NNHRCRKLVDTSRESPPRDHRXXXXXXXXX 329 MATAPVK PLHNF L FLKWG KN NNHR R+ V S P Sbjct: 1 MATAPVKP-PLHNFPLAFLKWGAKNNSTTNNNHRYRRPVSAEPASEPDSESERTHYN--- 56 Query: 330 XXXXXXXXXXXAKNPVGS-RSSKNRFSFANCSSSTTGMVAKVEKNLTVSESSAAGEEARR 506 + VGS R+S++R+S C AG++ RR Sbjct: 57 ------------NSRVGSSRASRHRYSLIPC----------------------AGDKRRR 82 Query: 507 NLMARLQXXXXXXXXXXXXLNGVTDEGEGGEIDVS--VQKPWNLRPRR----------GV 650 + +D+ EG E D + V KPWNLRPRR G Sbjct: 83 SEERE------------------SDQEEGEEADKAEVVHKPWNLRPRRAPATTSFSKGGA 124 Query: 651 SKAPNEIGGPCKN-GELQEKLENLPKSCRLRSFA-EAQNVEKKEKRRLSIALSREEIDED 824 + P+E+ P N 
ELQ+ PKS RLR A E QNVEKKE R+ IALS+EEI+ED Sbjct: 125 NGEPHELESPNPNQSELQQ-----PKSMRLRGLAAEGQNVEKKENRKFWIALSKEEIEED 179 Query: 825 FFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAK 971 FVMTG N+QK +D FPGLWLVG+TAD+Y + D P+K Sbjct: 180 IFVMTGSRPARRPKKRPKNVQKQLDITFPGLWLVGVTADAYKVADSPSK 228 >ref|XP_007052276.1| Uncharacterized protein TCM_005685 [Theobroma cacao] gi|508704537|gb|EOX96433.1| Uncharacterized protein TCM_005685 [Theobroma cacao] Length = 287 Score = 139 bits (351), Expect = 1e-30 Identities = 99/282 (35%), Positives = 133/282 (47%), Gaps = 7/282 (2%) Frame = +3 Query: 147 ERKSAMATAP-VKSQPLHNFSLPFLKWGKNQMNNHRCRKLVDTSRESPPRDHRXXXXXXX 323 E ++ MA++ +KS PLHNF L LKW N NNHR RKL D+S +SP R Sbjct: 17 EPETVMASSSTLKSHPLHNFQLHDLKWAMNHSNNHRLRKLSDSSHKSP---QRGDSDSDS 73 Query: 324 XXXXXXXXXXXXXAKNPVGSRSSKNRFSFANCSSSTTGMVAKVEKNLTVSESSAAGEEAR 503 KN S SS + S + G V+ N SE A + R Sbjct: 74 DDNRKGNPVREAAPKNGASSGSSADHRSEKSEKKVINGSDVLVDNN---SEKKATPSDGR 130 Query: 504 RNLMARLQXXXXXXXXXXXXLNGVTDEGEGG----EIDVSVQKPWNLRPRRGVSKAPNEI 671 + R + + V D G+ ++ V K WNLRPR+ ++K N+ Sbjct: 131 SKIYIRFRTKNQKPA------DEVADAGDQNLDAEYVEELVPKTWNLRPRKPITKPRNQN 184 Query: 672 GGPCKNG-ELQEKLENLPKSCRLRSFAEAQNVEKKEKRR-LSIALSREEIDEDFFVMTGX 845 G + G E + P+S R R+ E + EKKEK++ SI+LSREEID+D F MTG Sbjct: 185 GAAPRIGASAHENKIHRPESTRSRNVTEPKAAEKKEKKKKFSISLSREEIDDDIFAMTGS 244 Query: 846 XXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAK 971 N+QK +D VFPGLWL +T D Y + D PAK Sbjct: 245 KPSRRPKKRAKNVQKQLDCVFPGLWLSSITPDCYRVSDAPAK 286 >ref|XP_007039763.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590676536|ref|XP_007039764.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590676539|ref|XP_007039765.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590676547|ref|XP_007039767.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508777008|gb|EOY24264.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508777009|gb|EOY24265.1| Uncharacterized protein isoform 1 [Theobroma cacao] 
gi|508777010|gb|EOY24266.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508777012|gb|EOY24268.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 223 Score = 134 bits (337), Expect = 6e-29 Identities = 103/275 (37%), Positives = 125/275 (45%), Gaps = 4/275 (1%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKWGKNQMNNHRCRKLVDTSRESPPRDHRXXXXXXXXXXXXX 341 MATAPVKSQPLHNF+ PFLKWG + R SP D Sbjct: 1 MATAPVKSQPLHNFNFPFLKWGTHGGGGSSTSSA--DHRRSPESDS-------------- 44 Query: 342 XXXXXXXAKNPVGSRSSK-NRFSFANCSSSTTGMVAKVEKNLTVSESSAAGEEARRNLMA 518 VGSRS++ R SF + + +S EE ++ Sbjct: 45 --DHDRLRPTRVGSRSTRIQRLSF-------------LPPPKPIKQSHGEDEEQQQE--- 86 Query: 519 RLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRGVSKAPNEIGGPCKNGEL 698 L +E E E + +VQ+PWNLRPR+ V + + Sbjct: 87 ------------EQPLKPHKNEAEEEEEEETVQRPWNLRPRKVVVETTAVVT------TA 128 Query: 699 QEKLENL--PKSCRLRSFAEAQN-VEKKEKRRLSIALSREEIDEDFFVMTGXXXXXXXXX 869 EK+ PKS RLR AE VEKKEKR+ IALSREEI+ED FVMTG Sbjct: 129 MEKVSETAAPKSMRLRGLAENGGIVEKKEKRKFWIALSREEIEEDIFVMTGSRPARRPKK 188 Query: 870 XXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAKK 974 NIQK +D VFPGLWLVG TAD+Y + D P KK Sbjct: 189 RPKNIQKQLDAVFPGLWLVGTTADAYRVADAPVKK 223 >ref|XP_002531462.1| conserved hypothetical protein [Ricinus communis] gi|223528916|gb|EEF30912.1| conserved hypothetical protein [Ricinus communis] Length = 265 Score = 133 bits (335), Expect = 1e-28 Identities = 100/294 (34%), Positives = 143/294 (48%), Gaps = 23/294 (7%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKWGK---------NQMNNHRCRKLVDTSRESPPRDHRXXXX 314 MATAPVK Q LHNF + LKWG+ N ++H +R S + R Sbjct: 1 MATAPVKPQQLHNFPIS-LKWGQTTTTTTISANHQHHHH-------NRSSSSNNQRLATP 52 Query: 315 XXXXXXXXXXXXXXXXAKNP-VGSRSSK-NRFSFANCSSSTTGMVAKVEKNLTVSESSAA 488 ++P VGSRS++ +R+SFA+CS T AK E + + A Sbjct: 53 VHESETESDPDQSQSTIRHPRVGSRSARVHRYSFASCS--TLLPKAKTE----IPQKPEA 106 Query: 489 GEEARRNLMARLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRGVSKAPNE 668 E+ ++ +A L+ N +E E E + S +PW LRPR+G+ ++ Sbjct: 107 
TEKPQQKNLAVLE-------------NNNKNEAEEIEEEDSSSRPWKLRPRKGILTGSSK 153 Query: 669 IGGPCKNGELQEKLENLPKSCRLRSFAEAQN------------VEKKEKRRLSIALSREE 812 E ++ PKS RLR ++ + +EKKEKR+ +ALSREE Sbjct: 154 ETATLLGNEQRDS--TTPKSMRLRGLVDSTSSGLGVGLGNGVSLEKKEKRKFWVALSREE 211 Query: 813 IDEDFFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAKK 974 I+ED FV+TG N+QK++D+VFPGLWLVG TADSY + D P K+ Sbjct: 212 IEEDVFVLTGSRPARRPKKRPKNVQKILDSVFPGLWLVGTTADSYRVADPPVKR 265 >ref|XP_007039766.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508777011|gb|EOY24267.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 227 Score = 132 bits (332), Expect = 2e-28 Identities = 102/274 (37%), Positives = 124/274 (45%), Gaps = 4/274 (1%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKWGKNQMNNHRCRKLVDTSRESPPRDHRXXXXXXXXXXXXX 341 MATAPVKSQPLHNF+ PFLKWG + R SP D Sbjct: 1 MATAPVKSQPLHNFNFPFLKWGTHGGGGSSTSSA--DHRRSPESDS-------------- 44 Query: 342 XXXXXXXAKNPVGSRSSK-NRFSFANCSSSTTGMVAKVEKNLTVSESSAAGEEARRNLMA 518 VGSRS++ R SF + + +S EE ++ Sbjct: 45 --DHDRLRPTRVGSRSTRIQRLSF-------------LPPPKPIKQSHGEDEEQQQE--- 86 Query: 519 RLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRGVSKAPNEIGGPCKNGEL 698 L +E E E + +VQ+PWNLRPR+ V + + Sbjct: 87 ------------EQPLKPHKNEAEEEEEEETVQRPWNLRPRKVVVETTAVVT------TA 128 Query: 699 QEKLENL--PKSCRLRSFAEAQN-VEKKEKRRLSIALSREEIDEDFFVMTGXXXXXXXXX 869 EK+ PKS RLR AE VEKKEKR+ IALSREEI+ED FVMTG Sbjct: 129 MEKVSETAAPKSMRLRGLAENGGIVEKKEKRKFWIALSREEIEEDIFVMTGSRPARRPKK 188 Query: 870 XXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAK 971 NIQK +D VFPGLWLVG TAD+Y + D P K Sbjct: 189 RPKNIQKQLDAVFPGLWLVGTTADAYRVADAPVK 222 >ref|XP_006477140.1| PREDICTED: uncharacterized protein LOC102618144 isoform X1 [Citrus sinensis] Length = 216 Score = 130 bits (327), Expect = 8e-28 Identities = 101/280 (36%), Positives = 131/280 (46%), Gaps = 9/280 (3%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKWGKNQMNNHRCRKLV------DTSRESPPRDHRXXXXXXX 323 M TAP+KSQPLHNFSL FLKWG + N + R DT+ +S R HR Sbjct: 1 
MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHNRTRTPPPTEPDTTDDST-RHHRV------ 53 Query: 324 XXXXXXXXXXXXXAKNPVGSRSSK-NRFSFANCSSSTTGMVAKVEKNLTVSESSAAGEEA 500 VGSRSS+ R SF CS+S AG+ + Sbjct: 54 -----------------VGSRSSRAQRLSFP-CSTS--------------KPHQDAGDRS 81 Query: 501 RRNLMARLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRGVSKAPNEIGGP 680 +R T+E E E+ +PWNLRPR+ V + ++ Sbjct: 82 QRQ-------------------TADTEEEEEDEVG----RPWNLRPRK-VQETLVDVAVF 117 Query: 681 CKNGELQEKLENLPKSCRLRSFAEAQ--NVEKKEKRRLSIALSREEIDEDFFVMTGXXXX 854 G+ + PKS RLR E++ N +KKEK + + LSREEI+ED F+MTG Sbjct: 118 QNRGDNNANTK-APKSTRLREMVESRGSNGDKKEKNKFWVTLSREEIEEDIFIMTGSRPA 176 Query: 855 XXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAKK 974 N+QK +DNVFPGLWLVGLT D+Y + D P KK Sbjct: 177 RRPRKRPKNVQKQLDNVFPGLWLVGLTVDAYRVSDAPMKK 216 >ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citrus clementina] gi|557542514|gb|ESR53492.1| hypothetical protein CICLE_v10022000mg [Citrus clementina] Length = 216 Score = 128 bits (322), Expect = 3e-27 Identities = 98/280 (35%), Positives = 128/280 (45%), Gaps = 9/280 (3%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKWGKNQMNNHRCRKLV------DTSRESPPRDHRXXXXXXX 323 M TAP+KSQPLHNFSL FLKWG + N + R DT+ +S R HR Sbjct: 1 MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHNRTRTPPPTEPDTTDDST-RHHRV------ 53 Query: 324 XXXXXXXXXXXXXAKNPVGSRSSK-NRFSFANCSSSTTGMVAKVEKNLTVSESSAAGEEA 500 VGSRSS+ R SF + +S + + T Sbjct: 54 -----------------VGSRSSRAQRLSFPSSTSKPQQDAVERPQRQTAD--------- 87 Query: 501 RRNLMARLQXXXXXXXXXXXXLNGVTDEGEGGEIDVSVQKPWNLRPRRGVSKAPNEIGGP 680 T+E E E+ +PWNLRPR+ V + ++ Sbjct: 88 -------------------------TEEEEEDEVG----RPWNLRPRK-VQETLVDVAVF 117 Query: 681 CKNGELQEKLENLPKSCRLRSFAEAQ--NVEKKEKRRLSIALSREEIDEDFFVMTGXXXX 854 G+ + PKS RLR E++ N +KKEK + + LSREEI+ED F+MTG Sbjct: 118 QNRGDNNANTK-APKSTRLREMVESRGSNGDKKEKNKFWVTLSREEIEEDIFIMTGSRPA 176 Query: 855 XXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVPAKK 974 N+QK +DNVFPGLWLVGLTAD+Y + D P KK Sbjct: 177 RRPRKRPKNVQKQLDNVFPGLWLVGLTADAYRVSDAPMKK 216 >ref|XP_003525577.1| PREDICTED: histone-lysine 
N-methyltransferase 2E-like [Glycine max] Length = 241 Score = 120 bits (301), Expect = 9e-25 Identities = 70/143 (48%), Positives = 87/143 (60%), Gaps = 17/143 (11%) Frame = +3 Query: 597 EIDVSVQKPWNLRPRRGV---SKAPNEIG-GPCKN-----------GELQEKLEN--LPK 725 + D SVQKPW LRPR+ +K EIG GP +N GE + +N PK Sbjct: 99 DADDSVQKPWKLRPRKPALLPNKTALEIGTGPSRNHHHHHHHATNNGEFLDGGDNNPAPK 158 Query: 726 SCRLRSFAEAQNVEKKEKRRLSIALSREEIDEDFFVMTGXXXXXXXXXXXXNIQKLVDNV 905 S RLR F++ Q EKKEKR+ IALSREEI+ED FVMTG N+QK +D+V Sbjct: 159 SLRLRGFSDTQCSEKKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKRPKNVQKQMDSV 218 Query: 906 FPGLWLVGLTADSYSIPDVPAKK 974 FPGLWLVG+TAD+Y + D P K+ Sbjct: 219 FPGLWLVGITADAYRVADTPTKR 241 >ref|XP_003549926.1| PREDICTED: uncharacterized protein LOC100812835 isoform X1 [Glycine max] gi|571536516|ref|XP_006600845.1| PREDICTED: uncharacterized protein LOC100812835 isoform X2 [Glycine max] Length = 237 Score = 120 bits (300), Expect = 1e-24 Identities = 71/148 (47%), Positives = 88/148 (59%), Gaps = 17/148 (11%) Frame = +3 Query: 582 EGEGGEIDVSVQKPWNLRPRRG--VSKAPNEIG-GPCKN-------GELQE-------KL 710 E E + D +VQKPWNLRPR+ + KA EIG GP +N GE + Sbjct: 90 EAEHDDADDAVQKPWNLRPRKPALLPKAALEIGTGPSRNHHHATNNGEFHDGGGGGGDNN 149 Query: 711 ENLPKSCRLRSFAEAQNVEKKEKRRLSIALSREEIDEDFFVMTGXXXXXXXXXXXXNIQK 890 PKS RLR F++ KKEKR+ IALSREEI+ED FVMTG N+QK Sbjct: 150 NPAPKSLRLRGFSDTPCSVKKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKRPKNVQK 209 Query: 891 LVDNVFPGLWLVGLTADSYSIPDVPAKK 974 +D+VFPGLWLVG+TAD+Y + D PAK+ Sbjct: 210 QMDSVFPGLWLVGITADAYRVADTPAKR 237 >ref|XP_006445388.1| hypothetical protein CICLE_v10021338mg [Citrus clementina] gi|568819838|ref|XP_006464451.1| PREDICTED: uncharacterized protein LOC102609123 isoform X1 [Citrus sinensis] gi|557547650|gb|ESR58628.1| hypothetical protein CICLE_v10021338mg [Citrus clementina] Length = 302 Score = 106 bits (264), Expect = 2e-20 Identities = 89/288 (30%), Positives = 125/288 (43%), Gaps = 22/288 (7%) Frame = +3 Query: 156 SAMATAPVKSQPLHNFSLPFLKWGKNQMNNHRCRKLVDTSRESPPRDHRXXXXXXXXXXX 335 
S A QPLHNFSL LKW N N +R RK D+S +SP D Sbjct: 29 SLTAVKSQSQQPLHNFSLTDLKWAMNHTNTNRFRKPSDSSHKSPHYDAAVSDKH------ 82 Query: 336 XXXXXXXXXAKNPV----GSRSSKNRFSFANCSSSTTGMVAKVEKNLTVSESSAAGEEAR 503 K P+ G +S+ A+ ST+G +NL + A + Sbjct: 83 ----------KRPLLQVQGVKSALGVEKLAD-GKSTSGHGHDAAENLVNEPAPALSSDGS 131 Query: 504 RN-LMARLQXXXXXXXXXXXXLNGVTDEGEGGEI-------DVSVQKPWNLRPRRGVSKA 659 R+ + R++ + V D G+ + D+ V K WNLRPRR ++K Sbjct: 132 RSKIFIRIKTKTTKVA------DEVADAGDHNAVVPDDDSDDLLVPKTWNLRPRRLITKV 185 Query: 660 PNEI-------GGPCK--NGELQE-KLENLPKSCRLRSFAEAQNVEKKEKRRLSIALSRE 809 N GG K G QE K + + + + + EKKEK + SI+L +E Sbjct: 186 NNNNIVNVKGGGGALKIGGGAAQEIKPPEKKDTDKDKEREKEKEKEKKEKMKFSISLKKE 245 Query: 810 EIDEDFFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSI 953 EI++DFF MTG N+QK +D VFPGLWL +T +SY + Sbjct: 246 EIEDDFFAMTGAKPSRRPKKRAKNVQKQLDYVFPGLWLASITPESYKV 293 >ref|XP_006445387.1| hypothetical protein CICLE_v10021338mg [Citrus clementina] gi|568819841|ref|XP_006464452.1| PREDICTED: uncharacterized protein LOC102609123 isoform X2 [Citrus sinensis] gi|557547649|gb|ESR58627.1| hypothetical protein CICLE_v10021338mg [Citrus clementina] Length = 300 Score = 106 bits (264), Expect = 2e-20 Identities = 89/288 (30%), Positives = 125/288 (43%), Gaps = 22/288 (7%) Frame = +3 Query: 156 SAMATAPVKSQPLHNFSLPFLKWGKNQMNNHRCRKLVDTSRESPPRDHRXXXXXXXXXXX 335 S A QPLHNFSL LKW N N +R RK D+S +SP D Sbjct: 29 SLTAVKSQSQQPLHNFSLTDLKWAMNHTNTNRFRKPSDSSHKSPHYDAAVSDKH------ 82 Query: 336 XXXXXXXXXAKNPV----GSRSSKNRFSFANCSSSTTGMVAKVEKNLTVSESSAAGEEAR 503 K P+ G +S+ A+ ST+G +NL + A + Sbjct: 83 ----------KRPLLQVQGVKSALGVEKLAD-GKSTSGHGHDAAENLVNEPAPALSSDGS 131 Query: 504 RN-LMARLQXXXXXXXXXXXXLNGVTDEGEGGEI-------DVSVQKPWNLRPRRGVSKA 659 R+ + R++ + V D G+ + D+ V K WNLRPRR ++K Sbjct: 132 RSKIFIRIKTKTTKVA------DEVADAGDHNAVVPDDDSDDLLVPKTWNLRPRRLITKV 185 Query: 660 PNEI-------GGPCK--NGELQE-KLENLPKSCRLRSFAEAQNVEKKEKRRLSIALSRE 809 N GG K G QE K + + + + + EKKEK + SI+L +E Sbjct: 186 NNNNIVNVKGGGGALKIGGGAAQEIKPPEKKDTDKDKEREKEKEKEKKEKMKFSISLKKE 
245 Query: 810 EIDEDFFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSI 953 EI++DFF MTG N+QK +D VFPGLWL +T +SY + Sbjct: 246 EIEDDFFAMTGAKPSRRPKKRAKNVQKQLDYVFPGLWLASITPESYKV 293 >ref|XP_002279484.2| PREDICTED: uncharacterized protein LOC100244117 [Vitis vinifera] Length = 259 Score = 105 bits (262), Expect = 3e-20 Identities = 90/288 (31%), Positives = 131/288 (45%), Gaps = 7/288 (2%) Frame = +3 Query: 123 VISKAEAFERKSAMAT-APVKSQPLHNFSLPFLKWGKNQMNNHRCRKLVDTSRESPPRDH 299 VI+ +E ER S + T P +S+PLHNF++P LKWG + RC K V+++ E D Sbjct: 2 VITGSEG-ERVSELKTMGPERSKPLHNFAMPSLKWGNQRFL--RCMK-VNSNGEVAADDG 57 Query: 300 RXXXXXXXXXXXXXXXXXXXXAKNPVGSRSSKNRFSFANCSSSTTGMVAKVEKNLTVSES 479 R + S S K R CS S +E++ T S Sbjct: 58 RSSDL----------------VRGRRESESEKRRS--LTCSES-------LEESPT--RS 90 Query: 480 SAAGEEARRNLMARLQXXXXXXXXXXXXLNGVTDE------GEGGEIDVSVQKPWNLRPR 641 S G + + + + L D+ +G E D S +PWNLR R Sbjct: 91 SPIGGKGKGDEIDGDDGIEAVRAKLMFDLQAAADKMKVAIFKDGEEEDSS--RPWNLRTR 148 Query: 642 RGVSKAPNEIGGPCKNGELQEKLENLPKSCRLRSFAEAQNVEKKEKRRLSIALSREEIDE 821 R KAP+ GG K+ ++ + S + + EKKE+ + S++LSR+EI+E Sbjct: 149 RAACKAPSPSGGGGKSLTIERRKPGTSPS--RTDVSAPRRGEKKERAKFSVSLSRQEIEE 206 Query: 822 DFFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPDVP 965 DF +TG N+QK +D +FPGLWL +T DSY +PD P Sbjct: 207 DFMAITGHRPARRPKKRAKNVQKQLDTLFPGLWLTEVTPDSYKVPDFP 254 >ref|XP_007044281.1| Uncharacterized protein TCM_009632 [Theobroma cacao] gi|508708216|gb|EOY00113.1| Uncharacterized protein TCM_009632 [Theobroma cacao] Length = 312 Score = 99.4 bits (246), Expect = 2e-18 Identities = 84/301 (27%), Positives = 132/301 (43%), Gaps = 34/301 (11%) Frame = +3 Query: 162 MATAPVKSQPLHNFSLPFLKWGKNQMNNHRCRKLVD----TSRESPPRDHRXXXXXXXXX 329 MA P +S+PLHNF LP LKWG + RC KL D T S DH Sbjct: 12 MAMGPERSKPLHNFKLPCLKWGNQRYL--RCVKLDDASTATDSSSAAVDHHRRHRHRHVF 69 Query: 330 XXXXXXXXXXXAKNPVGSRSSKNRFSFANCSSSTTGMVAKVEKNLTVSESSAA-GEEARR 506 + +R ++ S +N ++ G E+ L +SE AA G +A R Sbjct: 70 QRRRSPPSKFESMIVGATRLRESESSPSNDKNNDYGR----ERRLRISEGEAAEGIKAVR 125 Query: 507 
NLMARLQXXXXXXXXXXXXLNGVTDEGEGGEID--------------------VSVQ-KP 623 + + + V+D+ + + + V+V+ +P Sbjct: 126 EKIMKDLKTAADKIKDEIFRDEVSDDDDVDDDEDEFEEPKRKMKEKEIEESPAVAVEARP 185 Query: 624 WNLRPRRGVSKAPNEIGGPCKN--GELQEKLENLPK------SCRLRSFAEAQNVEKKEK 779 WNLR RR KAP + GG N ++ ++ N P+ S + A A +K+ + Sbjct: 186 WNLRTRRAACKAPIDGGGTNNNYNSPMKNEVINSPRVRDRGSSVASATVAAAAAEKKRPR 245 Query: 780 RRLSIALSREEIDEDFFVMTGXXXXXXXXXXXXNIQKLVDNVFPGLWLVGLTADSYSIPD 959 + S++LS++EI+EDF VM G +Q +D++FPGLWL +T DSY +P+ Sbjct: 246 PKFSVSLSKKEIEEDFMVMAGHRPLRRPKKRPRYVQNQLDSLFPGLWLTEVTVDSYKVPE 305 Query: 960 V 962 + Sbjct: 306 L 306