BLASTX nr result
ID: Mentha29_contig00012106
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00012106
         (1280 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002272570.1| PREDICTED: uncharacterized protein LOC100253...   265   3e-68
ref|XP_007039374.1| Branchless trichome [Theobroma cacao] gi|508...   259   2e-66
ref|XP_006343777.1| PREDICTED: protein BRANCHLESS TRICHOME-like ...   250   1e-63
ref|XP_007208768.1| hypothetical protein PRUPE_ppa022759mg [Prun...   245   3e-62
ref|XP_006439213.1| hypothetical protein CICLE_v10020958mg [Citr...   235   3e-59
ref|XP_002297942.1| hypothetical protein POPTR_0001s11490g [Popu...   235   4e-59
ref|XP_006477505.1| PREDICTED: protein BRANCHLESS TRICHOME-like ...   232   2e-58
ref|XP_004309576.1| PREDICTED: uncharacterized protein LOC101291...   220   9e-55
gb|EXB81881.1| hypothetical protein L484_015357 [Morus notabilis]     214   5e-53
ref|XP_002518920.1| hypothetical protein RCOM_1313640 [Ricinus c...   207   1e-50
ref|XP_004245452.1| PREDICTED: uncharacterized protein LOC101262...   199   3e-48
emb|CBI34596.3| unnamed protein product [Vitis vinifera]              196   2e-47
ref|XP_002303697.1| hypothetical protein POPTR_0003s14770g [Popu...   174   7e-41
ref|XP_006391709.1| hypothetical protein EUTSA_v10023631mg [Eutr...   172   3e-40
gb|AFK13153.1| branchless trichomes [Gossypium arboreum]              158   6e-36
ref|XP_002887867.1| hypothetical protein ARALYDRAFT_892933 [Arab...   151   5e-34
ref|XP_006301742.1| hypothetical protein CARUB_v10022203mg, part...   151   7e-34
gb|AAD38245.1|AC006193_1 Hypothetical Protein [Arabidopsis thali...   149   3e-33
ref|NP_176650.1| protein branchless trichome [Arabidopsis thalia...
147 1e-32 gb|AAR20780.1| At1g64690 [Arabidopsis thaliana] gi|38604052|gb|A... 141 7e-31 >ref|XP_002272570.1| PREDICTED: uncharacterized protein LOC100253094 [Vitis vinifera] Length = 342 Score = 265 bits (677), Expect = 3e-68 Identities = 168/360 (46%), Positives = 207/360 (57%), Gaps = 33/360 (9%) Frame = +3 Query: 3 MMKMKMNKMRSQEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXX 182 M KM M +RS E PS N T +P I TTCP WKLYENPFY Sbjct: 1 MEKMMMMMIRSPENPS----NETISEEP--------ITATTCPSWKLYENPFYYSQHQQQ 48 Query: 183 XXXXXXXXXXX----------STTDKQIHRLHLPIPAKKIAASFWDLTFIKPFMESDLES 332 T +K HRL LP+ A+KIAASFWDLTF +P M+S+L+ Sbjct: 49 QQQQQQHHHYHHRHHQHQQQKQTRNKHHHRLQLPLSARKIAASFWDLTFFRPIMDSELDI 108 Query: 333 ARTQITEARAELESERKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSE 512 AR QI E +AELE ERKARKK+ES+NKRLAK+++EERRGREA+ERVCE LA EI++ K+E Sbjct: 109 ARAQIIELKAELEFERKARKKVESMNKRLAKDLNEERRGREAMERVCEVLAKEISSDKAE 168 Query: 513 ISRMRREIEDERKMLRTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAKQ------ 674 IS M+REIE+ERKMLR AE +REERVQMKL++AK ++ Sbjct: 169 ISWMKREIEEERKMLRMAEVLREERVQMKLADAKFLLEEKLLELEVTKQTQVERPSSKME 228 Query: 675 --NQER--------------SCFLGEKLRLVLGEKVSGGCVNEGEEGARIPMALMKRKSG 806 NQE SC GE R + GEK + C + A +P ++R++ Sbjct: 229 HKNQEDNGRGITTASAPAAISC--GESARFIPGEKPT--C---NDSNAVVPSMAIQRRAS 281 Query: 807 GETENPHIRRGIKGFVEFPKVVRAIGCRSSKHLG-KLECXXXXXXXXXXXXGAVRFNGLL 983 ETENPHI+RGIKGFVEFP+VVRAIG R S+HLG KLEC +R N L+ Sbjct: 282 PETENPHIKRGIKGFVEFPRVVRAIGSR-SRHLGTKLECQKAQLRILLKQKNPIRSNSLI 340 >ref|XP_007039374.1| Branchless trichome [Theobroma cacao] gi|508776619|gb|EOY23875.1| Branchless trichome [Theobroma cacao] Length = 354 Score = 259 bits (661), Expect = 2e-66 Identities = 154/334 (46%), Positives = 195/334 (58%), Gaps = 36/334 (10%) Frame = +3 Query: 27 MRSQEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXXXXXXX 206 M+ ++ +++ ++ D+ N I I TTTCP WKLYENPFY + Sbjct: 1 MKGEDMEEAMMMISSPDNPCNVTIPQEHITTTTCPSWKLYENPFY-YSHHHHHHHHHHQQ 59 Query: 207 XXXSTTDKQIHRLHLPIPAKKIAASFWDLTFIKPFMESDLESARTQITEARAELESERKA 
386 + K + +++LP+ A+KIAASFWDLTF KP MES+L+ AR QI E +AE+E ERKA Sbjct: 60 QQLCQSSKHLRQVNLPLSARKIAASFWDLTFFKPVMESELDIARAQIIELKAEVEYERKA 119 Query: 387 RKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIEDERKMLRTA 566 RKK+ESLNKRLAKE++EERRGREALERVCEELA EI+ HK+EI RM++E+E+ERKMLR A Sbjct: 120 RKKVESLNKRLAKELAEERRGREALERVCEELAREISMHKAEIDRMKKEVEEERKMLRMA 179 Query: 567 ETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAKQN----------------------- 677 E +REERVQMKL+EAKI + Sbjct: 180 EVLREERVQMKLAEAKILFEEKLLELEETKRAQPHTSISRIEQKNKEFKTPALSANLSGK 239 Query: 678 ------QERSCFLG-------EKLRLVLGEKVSGGCVNEGEEGARIPMALMKRKSGGETE 818 E+SC E R L EK S C + MA+ +RK+ E E Sbjct: 240 FARLVFSEKSCDYSNIGVDSRESTRFALSEK-SSSCYDNISSAVSSSMAI-QRKASPEPE 297 Query: 819 NPHIRRGIKGFVEFPKVVRAIGCRSSKHLGKLEC 920 NPHI+RGIKGFVEFP+VVRAIG +S KLEC Sbjct: 298 NPHIKRGIKGFVEFPRVVRAIGSKSRHWGTKLEC 331 >ref|XP_006343777.1| PREDICTED: protein BRANCHLESS TRICHOME-like [Solanum tuberosum] Length = 315 Score = 250 bits (638), Expect = 1e-63 Identities = 147/327 (44%), Positives = 195/327 (59%), Gaps = 5/327 (1%) Frame = +3 Query: 18 MNKMRSQEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXXXX 197 M +RSQE + +DDH II T+TCP WKLYENPFY Sbjct: 2 MMMIRSQENLRNKSIGISDDH---------IIPTSTCPTWKLYENPFY--NSQNPLQQYT 50 Query: 198 XXXXXXSTTDKQIHRLHLPIPAKKIAASFWDLTFIKPFMESDLESARTQITEARAELESE 377 + +KQIHRL+LPI A+KIAASFWDLTFI+PFM+S+LE AR Q+ E +A++E E Sbjct: 51 PLRIQHNNNNKQIHRLNLPISARKIAASFWDLTFIRPFMDSELEIARAQVAELKAKVEHE 110 Query: 378 RKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIEDERKML 557 RKARKKLE +NK++A+E+SEE++GREALERVCEELA I++ K+E++R+R+++E+ERKM+ Sbjct: 111 RKARKKLEWMNKKIARELSEEKKGREALERVCEELANHISSDKAEMNRLRKDMEEERKMM 170 Query: 558 RTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAKQNQERSCFLGEKLRLVLGEKVS 737 R AE +REERVQMKL+EAK K+ + + E + + Sbjct: 171 RVAEVMREERVQMKLTEAK-----YLLEDKLLELEATKKMLQTEQKIEENINNNYNADET 225 Query: 738 GGCVNE-----GEEGARIPMALMKRKSGGETENPHIRRGIKGFVEFPKVVRAIGCRSSKH 902 G + G + + + +R E ENPHI+RGIKGFVEFPKVVRAI +S Sbjct: 226 
GTSRPDHKSICGSNFHQYQVTIHRRNHSQEAENPHIKRGIKGFVEFPKVVRAISSKSRHW 285 Query: 903 LGKLECXXXXXXXXXXXXGAVRFNGLL 983 KLEC +R N LL Sbjct: 286 GTKLECQKAQLRILLKQKCPIRSNVLL 312 >ref|XP_007208768.1| hypothetical protein PRUPE_ppa022759mg [Prunus persica] gi|462404503|gb|EMJ09967.1| hypothetical protein PRUPE_ppa022759mg [Prunus persica] Length = 399 Score = 245 bits (625), Expect = 3e-62 Identities = 165/379 (43%), Positives = 204/379 (53%), Gaps = 58/379 (15%) Frame = +3 Query: 21 NKMRSQEEPSSIISNTTDDHKPNHNI---KIPIIATTTCPVWKLYENPFYIFXXXXXXXX 191 +K EE S+ T + + N I I T+TCP WKLYENPFY Sbjct: 20 SKREDMEEMMMRTSSATSSPETSRNETINSIEPITTSTCPSWKLYENPFYNSHLRHQNQP 79 Query: 192 XXXXXXXXSTTD--KQIHR-LHLPIPAKKIAASFWDLTFIKPFMESDLESARTQITEARA 362 S++ +Q+H LHLPI A+K+AA+FWDLTF KP MES+++ R QI E +A Sbjct: 80 QQQQQQCQSSSSNKQQVHHCLHLPISARKLAATFWDLTFFKPVMESEMDYTRAQIIELKA 139 Query: 363 ELESERKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIED 542 ELE ERKARKKLES NKRLAKE+ EERRGREA+ERVCEELA EI+ KSEISRM++EIE+ Sbjct: 140 ELEYERKARKKLESSNKRLAKELGEERRGREAIERVCEELAREISFGKSEISRMKKEIEE 199 Query: 543 ERKMLRTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAKQN-----QERS------ 689 ERKMLR AE +REERVQMKL+EA+I + +N +++S Sbjct: 200 ERKMLRMAEVLREERVQMKLAEARILFEEKLLELEGCKQMQSTENSHFKIKDKSEDVNAA 259 Query: 690 ----------------CFLGEKLRLVLGEK---VSGGCVNE---------------GEE- 764 C +VLGEK SG N+ GE+ Sbjct: 260 SFSGKSASNDKNIGVDCSRDSMSLVVLGEKSSAFSGDHNNDYVSSVSATTSRSVLLGEKA 319 Query: 765 ------GARIPMALMKRKSGGETENPHIRRGIKGFVEFPKVVRAIGCRSSKHLGKLECXX 926 G MA+ KR S E ENPHI+RGIKGFVEFP+VVRAIG +S KLEC Sbjct: 320 ACSDNSGGFSSMAIQKRAS-PEPENPHIKRGIKGFVEFPRVVRAIGSKSRHWGTKLECQK 378 Query: 927 XXXXXXXXXXGAVRFNGLL 983 +R N + Sbjct: 379 AQLRILLKQKSPIRSNSFI 397 >ref|XP_006439213.1| hypothetical protein CICLE_v10020958mg [Citrus clementina] gi|557541475|gb|ESR52453.1| hypothetical protein CICLE_v10020958mg [Citrus clementina] Length = 344 Score = 235 bits (600), Expect = 3e-59 Identities = 150/333 (45%), Positives = 192/333 (57%), Gaps = 32/333 (9%) Frame = +3 
Query: 18 MNKMRSQEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXXXX 197 M + S E S I NT +P I TTC WKLYENPFY Sbjct: 1 MMMITSPENSSKI--NTISSQEP--------ITPTTCSSWKLYENPFY---NSQKHRQNH 47 Query: 198 XXXXXXSTTDKQIHRLHLP-IPAKKIAASFWDLTFIKPFMESDLESARTQITEARAELES 374 + K IH+ HLP + A+KIAASFWDLTF +P ME++L+ A+ QI E +AEL+ Sbjct: 48 HDHHQCQSNTKHIHQFHLPAVSARKIAASFWDLTFFRPMMETELDIAQAQIMELKAELDY 107 Query: 375 ERKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIEDERKM 554 ERKARK+ ES+NKRLA+E++EERRGRE +ER+CEELA +I++ + EI R++RE+E+ERKM Sbjct: 108 ERKARKRAESMNKRLARELAEERRGRETMERLCEELARDISSDREEIDRIKREMEEERKM 167 Query: 555 LRTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXA----------KQNQERSC--FL 698 LR A+ +REERVQMKL+EAKI ++NQE FL Sbjct: 168 LRMAQVLREERVQMKLTEAKILFEEKLQELEESKRTQIDTSSTRTEKLRRNQEDKFPGFL 227 Query: 699 -------GEKLRLVLGEKVS--GGCVNE--GEEGARIPMAL--------MKRKSGGETEN 821 G+ RL L + S G C NE E R + ++R++ E EN Sbjct: 228 TASHSLSGKFARLALSSEKSSIGACSNEIDSRESTRAVSSASNDTASIAIQRRAPSEPEN 287 Query: 822 PHIRRGIKGFVEFPKVVRAIGCRSSKHLGKLEC 920 PHI+RGIKGFVEFP+VVRAIG RS KLEC Sbjct: 288 PHIKRGIKGFVEFPRVVRAIGSRSRHWDTKLEC 320 >ref|XP_002297942.1| hypothetical protein POPTR_0001s11490g [Populus trichocarpa] gi|222845200|gb|EEE82747.1| hypothetical protein POPTR_0001s11490g [Populus trichocarpa] Length = 349 Score = 235 bits (599), Expect = 4e-59 Identities = 146/344 (42%), Positives = 191/344 (55%), Gaps = 42/344 (12%) Frame = +3 Query: 78 HKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXXXXXXXXXXSTTDKQIHRLHLPI 257 H N + I TTTC WKLYENPFY ++K +H HLP+ Sbjct: 14 HSMNESSPQDPITTTTCTSWKLYENPFY-----NSQHNIQHQHQQHCQSNKHLH--HLPL 66 Query: 258 PAKKIAASFWDLTFIKPFMESDLESARTQITEARAELESERKARKKLESLNKRLAKEVSE 437 A+KIAASFWDLTF +P M+++L+ AR QI + +AELE ERKARKKLE+++KRLAKE++E Sbjct: 67 SARKIAASFWDLTFFRPIMDTELDFARAQILDLKAELEYERKARKKLETMSKRLAKELAE 126 Query: 438 ERRGREALERVCEELAMEITAHKSEISRMRREIEDERKMLRTAETIREERVQMKLSEA-- 611 ERRGREAL RVCEELA EI+ K EI M+RE+E+ERKMLR AE +REERVQMKL+EA Sbjct: 127 
ERRGREALARVCEELAREISCDKEEIDHMKREMEEERKMLRMAEVLREERVQMKLAEARM 186 Query: 612 ----KIXXXXXXXXXXXXXXXXAKQNQER---------------SCFLGEKL-RLVLGEK 731 K+ A + +++ + L KL RLVL EK Sbjct: 187 LFEEKLLELGGTTTQTELHHNSASRMEQKYQEDKEAEISTPFKAAAILSSKLNRLVLSEK 246 Query: 732 VSGGCVN----EGEEGARI----------------PMALMKRKSGGETENPHIRRGIKGF 851 C + + +E R+ M + + ++ E ENPHI+RGIKGF Sbjct: 247 ---SCYDNSGADSKESTRVILSEMSSFNDKSRSISSMVIQRSRASPEPENPHIKRGIKGF 303 Query: 852 VEFPKVVRAIGCRSSKHLGKLECXXXXXXXXXXXXGAVRFNGLL 983 VEFP+V+RAIG ++ KLEC +R N L+ Sbjct: 304 VEFPRVIRAIGSKNRHWGTKLECQKAQLRILLKQKSPIRSNNLI 347 >ref|XP_006477505.1| PREDICTED: protein BRANCHLESS TRICHOME-like [Citrus sinensis] Length = 344 Score = 232 bits (592), Expect = 2e-58 Identities = 141/302 (46%), Positives = 181/302 (59%), Gaps = 32/302 (10%) Frame = +3 Query: 111 IATTTCPVWKLYENPFYIFXXXXXXXXXXXXXXXXSTTDKQIHRLHLP-IPAKKIAASFW 287 I TTC WKLYENPFY + K IH+ HLP + A+KIAASFW Sbjct: 22 ITPTTCSSWKLYENPFY---NSQKHRQNHHDHHQCQSNTKHIHQFHLPAVSARKIAASFW 78 Query: 288 DLTFIKPFMESDLESARTQITEARAELESERKARKKLESLNKRLAKEVSEERRGREALER 467 DLTF +P ME++L+ A+ QI E +AEL+ ERKARK+ ES+NKRLA+E++EERRGRE +ER Sbjct: 79 DLTFFRPMMETELDIAQAQIMELKAELDYERKARKRAESMNKRLARELAEERRGRETMER 138 Query: 468 VCEELAMEITAHKSEISRMRREIEDERKMLRTAETIREERVQMKLSEAKIXXXXXXXXXX 647 +CEELA +I++ + EI R++RE+E+ERKMLR A+ +REERVQMKL+EAKI Sbjct: 139 LCEELARDISSDREEIDRIKREMEEERKMLRMAQVLREERVQMKLTEAKILFEEKLQELE 198 Query: 648 XXXXXXA----------KQNQERSC--FL-------GEKLRLVLGEKVS--GGCVNE--G 758 ++NQE FL G+ RL L + S G C NE Sbjct: 199 ESKRTQIDTSSTRTEKLRRNQEDKFPGFLTASHSLSGKFARLALSSEKSSIGACSNEIDS 258 Query: 759 EEGARIPMAL--------MKRKSGGETENPHIRRGIKGFVEFPKVVRAIGCRSSKHLGKL 914 E R + ++R++ E ENPHI+RGIKGFVEFP+VVRA G RS KL Sbjct: 259 RESTRAVSSASNDTASIAIQRRAPSEPENPHIKRGIKGFVEFPRVVRATGSRSRHWDTKL 318 Query: 915 EC 920 EC Sbjct: 319 EC 320 >ref|XP_004309576.1| PREDICTED: uncharacterized protein LOC101291885 [Fragaria vesca subsp. 
vesca] Length = 287 Score = 220 bits (561), Expect = 9e-55 Identities = 145/330 (43%), Positives = 176/330 (53%), Gaps = 5/330 (1%) Frame = +3 Query: 9 KMKMNKMRSQEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXX 188 KM M M S P + + T T+ P WKLYENPFY Sbjct: 4 KMMMMMMTSSSSPETSTNEAT--------------TTSAFPSWKLYENPFY--------- 40 Query: 189 XXXXXXXXXSTTDKQIHRLHLPIPAKKIAASFWDLTFIKPFMESDLESARTQITEARAEL 368 + H+ H P K ASFWDLTF KP ME+++E R QI E +AEL Sbjct: 41 -----------NSQPQHQNH---PEK---ASFWDLTFFKPVMETEMEYTRAQIMELKAEL 83 Query: 369 ESERKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIEDER 548 E ERK RKK E +NK+LAKE+SEERR REA+E VCEELA EI++ SEI+RMR+E+E+ER Sbjct: 84 EYERKTRKKFEVINKKLAKELSEERRAREAIESVCEELAREISSRNSEINRMRKEMEEER 143 Query: 549 KMLRTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAK-----QNQERSCFLGEKLR 713 KMLR AE IREERVQMKLSEAK K +N + C Sbjct: 144 KMLRVAEVIREERVQMKLSEAKFLFEEKVFQLSCKPVQTEKSPTCVENDDPVC-----SD 198 Query: 714 LVLGEKVSGGCVNEGEEGARIPMALMKRKSGGETENPHIRRGIKGFVEFPKVVRAIGCRS 893 +V SG V+ G MA+ +R S E+ENPHI+RGIKGFVEFP+VVRAIG +S Sbjct: 199 VVWTNSKSG--VSRENSGGFSTMAIQRRAS-PESENPHIKRGIKGFVEFPRVVRAIGSKS 255 Query: 894 SKHLGKLECXXXXXXXXXXXXGAVRFNGLL 983 KLEC +R N L+ Sbjct: 256 RHWGTKLECQKAQLRILLKQKSPIRSNSLI 285 >gb|EXB81881.1| hypothetical protein L484_015357 [Morus notabilis] Length = 391 Score = 214 bits (546), Expect = 5e-53 Identities = 143/388 (36%), Positives = 193/388 (49%), Gaps = 72/388 (18%) Frame = +3 Query: 36 QEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXXXXXXXXXX 215 +E+ +++ P + I T+TCP WKLYENPFY Sbjct: 2 EEKKKTMMMTLNSPENPRNESSGEPITTSTCPTWKLYENPFYNSHNQPHHNHHNHNQHQN 61 Query: 216 STTDKQIHR--------LHLPIPAKKIAASFWDLTFI-KPFMESDLESARTQITEARAEL 368 T +++ LHLP+ A+K+AASFWDLTF +P MES+L+ ARTQI E + EL Sbjct: 62 LTAQNCVYQRNTSLHQCLHLPLSARKLAASFWDLTFFTRPAMESELDMARTQIMELKTEL 121 Query: 369 ESERKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIEDER 548 E ERKARKKLES+NKRL KE +EERRGREA+E++CEELA EI+ K+EIS M+REIE+ER Sbjct: 122 
EYERKARKKLESINKRLVKEAAEERRGREAMEKMCEELAKEISFDKAEISTMKREIEEER 181 Query: 549 KMLRTAETIR----------------EERVQMKLSEAKIXXXXXXXXXXXXXXXXA---- 668 KMLR +E +R E+ ++++ S+ I + Sbjct: 182 KMLRMSEVLREERVQMKLAEAKIVLEEKLLELECSKRIISECTNTSTTTSTTTTSSALKS 241 Query: 669 KQN---QERSCFLGEKLRLVLGEK---------------------------------VSG 740 K+N + S F G+ V+GEK S Sbjct: 242 KENVASRAVSSFSGKFRHFVMGEKWISNDEINNRVDSMASTRSVSSHDHNHNNNIKYSSS 301 Query: 741 GCVNEGEEGA------RIPMALMKRKSGGETENPHIRRGIKGFVEFPKVVRAIGCRSSKH 902 C + E + + RK ETENPHI+RGIKGFVEFPKVVRA+G ++ ++ Sbjct: 302 ACTEDSSESVLSENCNNKSVPIFPRKLLAETENPHIKRGIKGFVEFPKVVRAVGSKNRQY 361 Query: 903 LG-KLECXXXXXXXXXXXXGAVRFNGLL 983 G KLEC +R N L+ Sbjct: 362 WGTKLECQKAQLRILLKQKSPIRSNSLI 389 >ref|XP_002518920.1| hypothetical protein RCOM_1313640 [Ricinus communis] gi|223541907|gb|EEF43453.1| hypothetical protein RCOM_1313640 [Ricinus communis] Length = 357 Score = 207 bits (526), Expect = 1e-50 Identities = 109/177 (61%), Positives = 132/177 (74%) Frame = +3 Query: 87 NHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXXXXXXXXXXSTTDKQIHRLHLPIPAK 266 N + + I TTTCP WKLYENPFY S T+K HLP+ A+ Sbjct: 18 NESFPLEPITTTTCPTWKLYENPFY-----NSHNTKQHQHLQHSQTNK-----HLPLSAR 67 Query: 267 KIAASFWDLTFIKPFMESDLESARTQITEARAELESERKARKKLESLNKRLAKEVSEERR 446 KIAASFWDLTF++P ME++L+ AR QI E +AELE ERKARKK E++NKRLAKE++EERR Sbjct: 68 KIAASFWDLTFLRPIMETELDFARAQIIELKAELEYERKARKKGETMNKRLAKELAEERR 127 Query: 447 GREALERVCEELAMEITAHKSEISRMRREIEDERKMLRTAETIREERVQMKLSEAKI 617 GREALERVCE+LA EI+ K+EI RM+REIE+ER+MLR AE +REERVQMKL+EAKI Sbjct: 128 GREALERVCEQLAKEISFDKAEIDRMKREIEEERRMLRMAEVLREERVQMKLAEAKI 184 >ref|XP_004245452.1| PREDICTED: uncharacterized protein LOC101262444 [Solanum lycopersicum] Length = 252 Score = 199 bits (505), Expect = 3e-48 Identities = 124/298 (41%), Positives = 160/298 (53%) Frame = +3 Query: 27 MRSQEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXXXXXXX 206 MR QE SI +DH II CP WKLYENPFY Sbjct: 1 MRRQENNKSI--GILEDH---------IIPPCICPTWKLYENPFY-------------NS 36 Query: 207 
XXXSTTDKQIHRLHLPIPAKKIAASFWDLTFIKPFMESDLESARTQITEARAELESERKA 386 +IHRL+LPI A+KIA+SFWDLTF+ ++ E +A++E ERKA Sbjct: 37 LNNPLPQHKIHRLNLPISARKIASSFWDLTFMD-----------CEVAELKAKVEDERKA 85 Query: 387 RKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIEDERKMLRTA 566 RKKLE++NK++A+E+ EE++GR+ALERVC++LA I+ K+EI +R+++E+ERKM+R A Sbjct: 86 RKKLETVNKKIARELCEEKKGRQALERVCKQLANHISLDKAEIDGLRKDMEEERKMVRVA 145 Query: 567 ETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAKQNQERSCFLGEKLRLVLGEKVSGGC 746 E +REERVQMKL+EAK K Q +SC + Sbjct: 146 EVMREERVQMKLTEAK-----YLLEDKLLELERLKTEQSKSCNFHD-------------- 186 Query: 747 VNEGEEGARIPMALMKRKSGGETENPHIRRGIKGFVEFPKVVRAIGCRSSKHLGKLEC 920 R E ENPHI+RGIKGFVEFPKVVRAI +S KLEC Sbjct: 187 ---------------GRNHSKEAENPHIKRGIKGFVEFPKVVRAISSKSRHWSTKLEC 229 >emb|CBI34596.3| unnamed protein product [Vitis vinifera] Length = 200 Score = 196 bits (498), Expect = 2e-47 Identities = 114/225 (50%), Positives = 145/225 (64%), Gaps = 1/225 (0%) Frame = +3 Query: 312 MESDLESARTQITEARAELESERKARKKLESLNKRLAKEVSEERRGREALERVCEELAME 491 M+S+L+ AR QI E +AELE ERKARKK+ES+NKRLAK+++EERRGREA+ERVCE LA E Sbjct: 1 MDSELDIARAQIIELKAELEFERKARKKVESMNKRLAKDLNEERRGREAMERVCEVLAKE 60 Query: 492 ITAHKSEISRMRREIEDERKMLRTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAK 671 I++ K+EIS M+REIE+ERKMLR AE +REERVQMKL++AK K Sbjct: 61 ISSDKAEISWMKREIEEERKMLRMAEVLREERVQMKLADAKF-----LLEEKLLELEVTK 115 Query: 672 QNQERSCFLGEKLRLVLGEKVSGGCVNEGEEGARIPMALMKRKSGGETENPHIRRGIKGF 851 Q Q+ +C + A +P ++R++ ETENPHI+RGIKGF Sbjct: 116 QTQKPTC---------------------NDSNAVVPSMAIQRRASPETENPHIKRGIKGF 154 Query: 852 VEFPKVVRAIGCRSSKHLG-KLECXXXXXXXXXXXXGAVRFNGLL 983 VEFP+VVRAIG R S+HLG KLEC +R N L+ Sbjct: 155 VEFPRVVRAIGSR-SRHLGTKLECQKAQLRILLKQKNPIRSNSLI 198 >ref|XP_002303697.1| hypothetical protein POPTR_0003s14770g [Populus trichocarpa] gi|222841129|gb|EEE78676.1| hypothetical protein POPTR_0003s14770g [Populus trichocarpa] Length = 248 Score = 174 bits (441), Expect = 7e-41 Identities = 104/247 (42%), Positives = 147/247 (59%), Gaps = 23/247 (9%) Frame = 
+3 Query: 312 MESDLESARTQITEARAELESERKARKKLESLNKRLAKEVSEERRGREALERVCEELAME 491 M+++L+ AR +I E +AELE ERKARKKLE+++KRLAKE++EERRGREALERVCEELA E Sbjct: 1 MDTELDFARARILELKAELEYERKARKKLETMSKRLAKELAEERRGREALERVCEELARE 60 Query: 492 ITAHKSEISRMRREIEDERKMLRTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAK 671 I++ K EI M+RE+ +ER+M+R AE +REERVQMKL+EAK+ A+ Sbjct: 61 ISSDKEEIDHMKREMGEEREMIRMAEVLREERVQMKLAEAKM-LFEEKLLELVGTTTQAE 119 Query: 672 QNQERSCFLGEKLRLVLGEKVS---------GGCVNEGEEGARI--------------PM 782 +Q + + +K + +++ G +N GA + M Sbjct: 120 PHQNSTSRMEQKSQEHKEPEIATPLKTTAILSGQLNSESTGAILSEKPSFNDNTSSISSM 179 Query: 783 ALMKRKSGGETENPHIRRGIKGFVEFPKVVRAIGCRSSKHLGKLECXXXXXXXXXXXXGA 962 + + ++ E ENPHI+RG+KGFVEFP+VVRAIG ++ KLEC Sbjct: 180 VIQRSRASPEPENPHIKRGMKGFVEFPRVVRAIGSKNKHRGTKLECQKAQLRILLKQKSP 239 Query: 963 VRFNGLL 983 +R N L+ Sbjct: 240 IRSNNLI 246 >ref|XP_006391709.1| hypothetical protein EUTSA_v10023631mg [Eutrema salsugineum] gi|557088215|gb|ESQ28995.1| hypothetical protein EUTSA_v10023631mg [Eutrema salsugineum] Length = 283 Score = 172 bits (436), Expect = 3e-40 Identities = 111/276 (40%), Positives = 149/276 (53%), Gaps = 6/276 (2%) Frame = +3 Query: 111 IATTTCPVWKLYENPFYIFXXXXXXXXXXXXXXXXSTTDKQIHRLHLPIPAKKIAASFWD 290 I + C WKLY+NP+Y Q H+ + A WD Sbjct: 36 IMDSVCKTWKLYDNPYYF------------------CLQPQQHQ-------HQRKAFIWD 70 Query: 291 LTFIKPFMESDLESARTQITEARAELESERKARKKLESLNKRLAKEVSEERRGREALERV 470 L F++ FMES+L+ AR +I E +AEL+ ER+ARK+ E LNK+LAK+V EER REA E Sbjct: 71 LNFVRVFMESELDKARAEIKELKAELDYERRARKRAELLNKKLAKDVEEERLCREAEEMQ 130 Query: 471 CEELAMEITAHKSEISRMRREIEDERKMLRTAETIREERVQMKLSEAKIXXXXXXXXXXX 650 E L E ++ KSE+ R+R+E+E+ER+M R AE +REERVQMKLS+A++ Sbjct: 131 YERLLKERSSEKSEMVRLRQEVEEERQMHRLAEVLREERVQMKLSDARL----------- 179 Query: 651 XXXXXAKQNQERSCFLGEKLRLVLGEKVSGGCVNEGEEGARIPMALMKR------KSGGE 812 FL EKL + G G +E P +++R S Sbjct: 180 --------------FLEEKLSELEGSNGEG-----DKEMVMKPKKILERARSSPAASRRS 220 Query: 813 TENPHIRRGIKGFVEFPKVVRAIGCRSSKHLGKLEC 920 +ENPHI+RGIKGFVEFPKV+RAI +S + KLEC Sbjct: 
221 SENPHIKRGIKGFVEFPKVMRAIRSKSEQWGSKLEC 256 >gb|AFK13153.1| branchless trichomes [Gossypium arboreum] Length = 232 Score = 158 bits (399), Expect = 6e-36 Identities = 107/262 (40%), Positives = 137/262 (52%) Frame = +3 Query: 135 WKLYENPFYIFXXXXXXXXXXXXXXXXSTTDKQIHRLHLPIPAKKIAASFWDLTFIKPFM 314 WKLYENPFY + K +H +P+ M Sbjct: 16 WKLYENPFY----------YSNHQQLQCQSSKHLHHQLIPV------------------M 47 Query: 315 ESDLESARTQITEARAELESERKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEI 494 +S+L AR QI + +AE+E ERKARKK+ESLNK+L KEV+EERRG+EALE VCE+LA EI Sbjct: 48 DSELGIARAQIIDLKAEVEYERKARKKVESLNKKLGKEVAEERRGKEALESVCEKLAREI 107 Query: 495 TAHKSEISRMRREIEDERKMLRTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAKQ 674 T+ KSE+ RM+RE E+ERKML+ AE +REERVQMKL+EAKI Sbjct: 108 TSKKSEMDRMKREFEEERKMLKIAEVLREERVQMKLAEAKILF----------------- 150 Query: 675 NQERSCFLGEKLRLVLGEKVSGGCVNEGEEGARIPMALMKRKSGGETENPHIRRGIKGFV 854 E+ L E R + + V++ E R +E P + Sbjct: 151 -HEKLKELEETKRKQSDIENNSKAVSDSRESTRF----------ASSEQP--SSCYQNIN 197 Query: 855 EFPKVVRAIGCRSSKHLGKLEC 920 EFPKVVRAIG +S + KLEC Sbjct: 198 EFPKVVRAIGSKSRRWGSKLEC 219 >ref|XP_002887867.1| hypothetical protein ARALYDRAFT_892933 [Arabidopsis lyrata subsp. lyrata] gi|297333708|gb|EFH64126.1| hypothetical protein ARALYDRAFT_892933 [Arabidopsis lyrata subsp. 
lyrata] Length = 274 Score = 151 bits (382), Expect = 5e-34 Identities = 112/307 (36%), Positives = 151/307 (49%), Gaps = 4/307 (1%) Frame = +3 Query: 12 MKMNKMRSQEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXX 191 MK KM+S E T D H N ++ + T C WKLYENP+Y Sbjct: 1 MKDMKMQSSSETMMTRIPTPDPH--NTGVREDAM-DTVCKPWKLYENPYYC--------- 48 Query: 192 XXXXXXXXSTTDKQIHRLHLPIPAKKIAASFWDLTFIKPFMESDLESARTQITEARAELE 371 S + + H+ A WDL FIK FMES+L A+ +I E +AEL+ Sbjct: 49 -------SSLSQQHQHQRK---------AFIWDLNFIKIFMESELGKAQDEIQELKAELD 92 Query: 372 SERKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIEDERK 551 ERKAR++ E +NKRLAK+V EER R A E + L E+++ KSE+ RM+R++E+ER+ Sbjct: 93 YERKARRRAELMNKRLAKDVEEERMARVAEEMQNKRLFKELSSEKSEMVRMKRDLEEERQ 152 Query: 552 MLRTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAKQNQERSCFLGEKLRLVLGEK 731 M R AE +REERVQMKL +A++ FL EK L E Sbjct: 153 MHRLAEVLREERVQMKLMDARL-------------------------FLEEK----LSEL 183 Query: 732 VSGGCVNEGEEGARIPMALMKRKSGG----ETENPHIRRGIKGFVEFPKVVRAIGCRSSK 899 E E + +++R ENP I+RGI FP+V+RAI +S K Sbjct: 184 EEANRQGERERNRMMKPKILERACSSPARRSCENPQIKRGIN---PFPRVMRAIRSKSEK 240 Query: 900 HLGKLEC 920 KLEC Sbjct: 241 WGSKLEC 247 >ref|XP_006301742.1| hypothetical protein CARUB_v10022203mg, partial [Capsella rubella] gi|482570452|gb|EOA34640.1| hypothetical protein CARUB_v10022203mg, partial [Capsella rubella] Length = 269 Score = 151 bits (381), Expect = 7e-34 Identities = 103/267 (38%), Positives = 136/267 (50%) Frame = +3 Query: 120 TTCPVWKLYENPFYIFXXXXXXXXXXXXXXXXSTTDKQIHRLHLPIPAKKIAASFWDLTF 299 + C WK Y+NP+Y S + + H+ A WDL F Sbjct: 32 SVCKTWKRYDNPYYC----------------SSQSQQNQHQRK---------AFIWDLNF 66 Query: 300 IKPFMESDLESARTQITEARAELESERKARKKLESLNKRLAKEVSEERRGREALERVCEE 479 IK FMES+L AR +I E +AEL+ ERKAR++ E NKRLAK+V EER GREA E ++ Sbjct: 67 IKVFMESELGKARAEIKELKAELDYERKARRRAELTNKRLAKDVEEERMGREAEELQNKQ 126 Query: 480 LAMEITAHKSEISRMRREIEDERKMLRTAETIREERVQMKLSEAKIXXXXXXXXXXXXXX 659 L EI+ KSE+ RM+R++E+ER+M R AE +REERVQMKL++A++ Sbjct: 127 
LLKEISFDKSEMVRMKRDLEEERQMHRLAEVLREERVQMKLADARL-------------- 172 Query: 660 XXAKQNQERSCFLGEKLRLVLGEKVSGGCVNEGEEGARIPMALMKRKSGGETENPHIRRG 839 FL EKL E E E +I + ENPHI R Sbjct: 173 -----------FLEEKLT----ELEEANRQGEREMKPKILKRACSSPARRRWENPHIMR- 216 Query: 840 IKGFVEFPKVVRAIGCRSSKHLGKLEC 920 +G FP+V+RAI +S K KLEC Sbjct: 217 -RGINPFPRVIRAIRSKSEKWGSKLEC 242 >gb|AAD38245.1|AC006193_1 Hypothetical Protein [Arabidopsis thaliana] gi|6633823|gb|AAF19682.1|AC009519_16 F1N19.26 [Arabidopsis thaliana] Length = 323 Score = 149 bits (376), Expect = 3e-33 Identities = 105/306 (34%), Positives = 152/306 (49%), Gaps = 4/306 (1%) Frame = +3 Query: 15 KMNKMRSQEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXXX 194 KM M+ Q P ++++ + ++ + + C WKLYENP+Y Sbjct: 50 KMKDMKMQSSPETMMTRIPTPDPHSTGVREDAM-DSVCKPWKLYENPYYC---------- 98 Query: 195 XXXXXXXSTTDKQIHRLHLPIPAKKIAASFWDLTFIKPFMESDLESARTQITEARAELES 374 ++ Q H+ + A WDL FIK FMES+L A+ +I E +AEL+ Sbjct: 99 --------SSQSQQHQ-------HQRKAFIWDLNFIKVFMESELGKAQDEIKELKAELDY 143 Query: 375 ERKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIEDERKM 554 ERKAR++ E + K+LAK+V EER REA E + L E+++ KSE+ RM+R++E+ER+M Sbjct: 144 ERKARRRAELMIKKLAKDVEEERMAREAEEMQNKRLFKELSSEKSEMVRMKRDLEEERQM 203 Query: 555 LRTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAKQNQERSCFLGEKLRLVLGEKV 734 R AE +REERVQMKL +A++ FL EK L E Sbjct: 204 HRLAEVLREERVQMKLMDARL-------------------------FLEEK----LSELE 234 Query: 735 SGGCVNEGEEGARIPMALMKRKSGG----ETENPHIRRGIKGFVEFPKVVRAIGCRSSKH 902 E E + +++R ENP I+RGI FP+V+RAI +S K Sbjct: 235 EANRQGERERNRMMKPKILERACSSPARRRCENPQIKRGIN---PFPRVMRAIRSKSEKW 291 Query: 903 LGKLEC 920 KLEC Sbjct: 292 GSKLEC 297 >ref|NP_176650.1| protein branchless trichome [Arabidopsis thaliana] gi|527525132|sp|F4I878.1|BLT_ARATH RecName: Full=Protein BRANCHLESS TRICHOME gi|332196153|gb|AEE34274.1| protein branchless trichome [Arabidopsis thaliana] Length = 273 Score = 147 bits (371), Expect = 1e-32 Identities = 104/305 (34%), Positives = 151/305 (49%), Gaps = 4/305 (1%) Frame = +3 Query: 
18 MNKMRSQEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXXXX 197 M M+ Q P ++++ + ++ + + C WKLYENP+Y Sbjct: 1 MKDMKMQSSPETMMTRIPTPDPHSTGVREDAM-DSVCKPWKLYENPYYC----------- 48 Query: 198 XXXXXXSTTDKQIHRLHLPIPAKKIAASFWDLTFIKPFMESDLESARTQITEARAELESE 377 ++ Q H+ + A WDL FIK FMES+L A+ +I E +AEL+ E Sbjct: 49 -------SSQSQQHQ-------HQRKAFIWDLNFIKVFMESELGKAQDEIKELKAELDYE 94 Query: 378 RKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIEDERKML 557 RKAR++ E + K+LAK+V EER REA E + L E+++ KSE+ RM+R++E+ER+M Sbjct: 95 RKARRRAELMIKKLAKDVEEERMAREAEEMQNKRLFKELSSEKSEMVRMKRDLEEERQMH 154 Query: 558 RTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAKQNQERSCFLGEKLRLVLGEKVS 737 R AE +REERVQMKL +A++ FL EK L E Sbjct: 155 RLAEVLREERVQMKLMDARL-------------------------FLEEK----LSELEE 185 Query: 738 GGCVNEGEEGARIPMALMKRKSGG----ETENPHIRRGIKGFVEFPKVVRAIGCRSSKHL 905 E E + +++R ENP I+RGI FP+V+RAI +S K Sbjct: 186 ANRQGERERNRMMKPKILERACSSPARRRCENPQIKRGIN---PFPRVMRAIRSKSEKWG 242 Query: 906 GKLEC 920 KLEC Sbjct: 243 SKLEC 247 >gb|AAR20780.1| At1g64690 [Arabidopsis thaliana] gi|38604052|gb|AAR24769.1| At1g64690 [Arabidopsis thaliana] Length = 287 Score = 141 bits (355), Expect = 7e-31 Identities = 100/298 (33%), Positives = 147/298 (49%), Gaps = 4/298 (1%) Frame = +3 Query: 18 MNKMRSQEEPSSIISNTTDDHKPNHNIKIPIIATTTCPVWKLYENPFYIFXXXXXXXXXX 197 M M+ Q P ++++ + ++ + + C WKLYENP+Y Sbjct: 1 MKDMKMQSSPETMMTRIPTPDPHSTGVREDAM-DSVCKPWKLYENPYYC----------- 48 Query: 198 XXXXXXSTTDKQIHRLHLPIPAKKIAASFWDLTFIKPFMESDLESARTQITEARAELESE 377 ++ Q H+ + A WDL FIK FMES+L A+ +I E +AEL+ E Sbjct: 49 -------SSQSQQHQ-------HQRKAFIWDLNFIKVFMESELGKAQDEIKELKAELDYE 94 Query: 378 RKARKKLESLNKRLAKEVSEERRGREALERVCEELAMEITAHKSEISRMRREIEDERKML 557 RKAR++ E + K+LAK+V EER REA E + L E+++ KSE+ RM+R++E+ER+M Sbjct: 95 RKARRRAEPMIKKLAKDVEEERMAREAEEMQNKRLFKELSSEKSEMVRMKRDLEEERQMH 154 Query: 558 RTAETIREERVQMKLSEAKIXXXXXXXXXXXXXXXXAKQNQERSCFLGEKLRLVLGEKVS 737 R AE +REERVQMKL +A++ FL EK L E Sbjct: 155 RLAEVLREERVQMKLMDARL-------------------------FLEEK----LSELEE 
185 Query: 738 GGCVNEGEEGARIPMALMKRKSGG----ETENPHIRRGIKGFVEFPKVVRAIGCRSSK 899 E E + +++R ENP I+RGI FP+V+RAI +S K Sbjct: 186 ANRQGERERNRMMKPKILERACSSPARRRCENPQIKRGIN---PFPRVMRAIRSKSEK 240