BLASTX nr result
ID: Mentha22_contig00045870
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00045870 (1245 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

ref|XP_007200556.1| hypothetical protein PRUPE_ppa009673mg [Prun...  194   5e-47
ref|XP_004292660.1| PREDICTED: uncharacterized protein LOC101313...  191   7e-46
ref|XP_006444350.1| hypothetical protein CICLE_v10021537mg [Citr...  186   2e-44
ref|XP_002268658.1| PREDICTED: uncharacterized protein LOC100241...  185   3e-44
ref|XP_006355807.1| PREDICTED: uncharacterized protein LOC102585...  182   3e-43
ref|XP_004240545.1| PREDICTED: uncharacterized protein LOC101258...  182   3e-43
gb|EPS68606.1| hypothetical protein M569_06163, partial [Genlise...  179   2e-42
gb|EXC10295.1| hypothetical protein L484_006190 [Morus notabilis]    178   5e-42
ref|XP_007050920.1| LYR motif-containing protein 7 isoform 2 [Th...  177   6e-42
ref|XP_003519006.1| PREDICTED: uncharacterized protein LOC100795...  177   6e-42
ref|XP_002523082.1| conserved hypothetical protein [Ricinus comm...  175   3e-41
ref|XP_007050919.1| LYR motif-containing protein 7 isoform 1 [Th...  174   5e-41
ref|XP_006588589.1| PREDICTED: uncharacterized protein LOC100798...  174   9e-41
ref|XP_007144318.1| hypothetical protein PHAVU_007G146100g [Phas...  172   3e-40
ref|XP_004152251.1| PREDICTED: uncharacterized protein LOC101206...  166   3e-38
gb|EYU46668.1| hypothetical protein MIMGU_mgv1a016256mg [Mimulus...  155   3e-35
ref|XP_004495959.1| PREDICTED: uncharacterized protein LOC101493...  155   3e-35
ref|XP_003591530.1| hypothetical protein MTR_1g088580 [Medicago ...  148   5e-33
ref|XP_002321007.2| hypothetical protein POPTR_0014s12400g [Popu...  137   1e-29
ref|XP_006396451.1| hypothetical protein EUTSA_v10028893mg [Eutr...  135   5e-29

>ref|XP_007200556.1| hypothetical protein PRUPE_ppa009673mg [Prunus persica] gi|462395956|gb|EMJ01755.1| hypothetical protein PRUPE_ppa009673mg [Prunus persica] Length = 282 Score = 194 bits (494), Expect = 5e-47 Identities = 129/279 (46%), Positives = 154/279 (55%), Gaps = 30/279 (10%) Frame = +1 Query: 58 LESIHAIESCAFQLLSWRPF--------SAKALDSDSSKPC---YGGP------HSKRPC 186 LE H I+SCAFQL SWRPF ++K LDSD S P Y H+KRPC Sbjct: 6 LEHRHPIDSCAFQLHSWRPFHLHQQTTPTSKTLDSDPSLPNPKPYNSSSNGLVVHTKRPC 65 Query: 187 RADRSTSSFSIDAILDMSKLSLFDDDRALPLSAARKHWF------AXXXXXXXXXXXXXX 348 ++R+TS FSIDAI DMS+L+L DDDR + +H Sbjct: 66 LSNRATS-FSIDAI-DMSRLTLVDDDRTISGGHHNRHGSFRFIAKKRRRHGSRSVSGRSS 123 Query: 349 XXXXXXXXXXVGASAANGTCSDFPMVAGGTDSSGELFGG--ARWASEVSE-RSLRREKEG 519 VGASAA GTCSDFP VA GTDSSGELFG A WAS+VSE R+ R+E++G Sbjct: 124 DRSGTRRCCSVGASAAYGTCSDFP-VAVGTDSSGELFGNGDANWASDVSEARNSRKERDG 182 Query: 520 NVGGERECVIAGHGLFGNCDIQGNESGYGSEPGYKXXXXXXXXXXXXXXXR--RAFFWGE 693 GE+E + G G G D+QGNESGYGSEPGY+ R FWG+ Sbjct: 183 GGSGEKENLGIGFGPIGGFDVQGNESGYGSEPGYRGDAEFGYGDELDEEEEDTRLLFWGD 242 Query: 694 ECGENTSQSEMVSENSL--QKGHHRCRRKKHDLRMADAL 804 + G+ S E+V EN+ QK HHRCRRKKHD RM D L Sbjct: 243 QFGDADSMMEIVGENTFVDQKSHHRCRRKKHDCRMVDTL 281
>ref|XP_004292660.1| PREDICTED: uncharacterized protein LOC101313678 [Fragaria vesca subsp.
vesca] Length = 271 Score = 191 bits (484), Expect = 7e-46 Identities = 124/264 (46%), Positives = 149/264 (56%), Gaps = 19/264 (7%) Frame = +1 Query: 58 LESIHAIESCAFQLLSWRPFS-----AKALDSDSSKPCYGGPHSKRPCRADRSTSSFSID 222 L+S +++SC FQL SWRPF K LDSD + P H+KRPC ++R+TSSFSID Sbjct: 6 LDSRPSLDSCTFQLHSWRPFQLQQQPTKTLDSDPANP--KPYHTKRPCLSNRATSSFSID 63 Query: 223 AILDMSKLSLFDDDRALPLSAARKHWF------AXXXXXXXXXXXXXXXXXXXXXXXXVG 384 AI DMS+L+L DDDR + KH VG Sbjct: 64 AI-DMSRLTLVDDDRTISGGHHHKHGSFRFLARKRRRHGSRSVSGRSSDRSGTRRCCSVG 122 Query: 385 ASAANGTCSDFPMVAGGTDSSGELFGG--ARWASEVSE-RSLRREKEGNVGGERECVIA- 552 ASAA+GTCSDFP VA GTDSSGELFG A WAS+VSE R+LR+E++G GE+E Sbjct: 123 ASAAHGTCSDFP-VAIGTDSSGELFGNGDANWASDVSEARNLRKERDGVGSGEKETTPGV 181 Query: 553 GHGLFGNCDIQGNESGYGSEPGYKXXXXXXXXXXXXXXXR--RAFFWGEECGENTSQSEM 726 G G G D QGNESGYGSEPGY+ R FWG G++ + E+ Sbjct: 182 GFGPGGGFDAQGNESGYGSEPGYRGDAEFGYGDELDEEEEDARLLFWGNRFGDSDTMMEV 241 Query: 727 VSENSL--QKGHHRCRRKKHDLRM 792 V EN+ QK HHRCRRKKHD RM Sbjct: 242 VGENTFTDQKSHHRCRRKKHDCRM 265 >ref|XP_006444350.1| hypothetical protein CICLE_v10021537mg [Citrus clementina] gi|568852594|ref|XP_006479957.1| PREDICTED: uncharacterized protein LOC102627953 [Citrus sinensis] gi|557546612|gb|ESR57590.1| hypothetical protein CICLE_v10021537mg [Citrus clementina] Length = 283 Score = 186 bits (472), Expect = 2e-44 Identities = 131/276 (47%), Positives = 155/276 (56%), Gaps = 27/276 (9%) Frame = +1 Query: 58 LESIHAIESCAFQLLSWRPFSAK-ALDS-DSSKPCYGGP---HSKRPCRADRSTSSFSID 222 L+S H+I+SCA QL +WRPF + LDS DS+KP Y H+KRPC +DR+TS ID Sbjct: 8 LDSRHSIDSCALQLHNWRPFHLQNPLDSSDSTKPSYSPSSWVHTKRPCLSDRATSFSIID 67 Query: 223 AI-LDMSKLSLFDDDRAL-PLSAA---------RKHWFAXXXXXXXXXXXXXXXXXXXXX 369 A +D+SKLSLFDDD + P++AA R Sbjct: 68 AAAIDLSKLSLFDDDNVIKPMTAATAPQSRGGYRLIARKRRRRGSRSVSGRSSDRSGTRR 127 Query: 370 XXXVGASAANGTCSDFPMVAGGTDSSGELFGG--ARWASEVSE-RSLRREKE-GNVGGER 537 VGASAA GTCSDFP VA GTDSSGELFG A WAS+VSE R+ RRE++ GN GE+ Sbjct: 128 CCSVGASAAYGTCSDFP-VAVGTDSSGELFGNGEANWASDVSEARNSRRERDNGNGSGEK 186 
Query: 538 ECVIAGHGLFGNC---DIQGNESGYGSEPGYKXXXXXXXXXXXXXXXRRA--FFWGEECG 702 E G G C + GNESGYGSEPGY+ A FWG G Sbjct: 187 ENSGTGFGGQVGCLEAQVLGNESGYGSEPGYRGDAEFGYGDELDEEEEDAKLLFWGNRFG 246 Query: 703 ENTSQSEMVSENSL--QKGHHRCRRKKHDLRMADAL 804 + S+ EMV EN+ QK HHRCRRKKHD RM DAL Sbjct: 247 DVDSKMEMVGENTFTDQKSHHRCRRKKHDCRMVDAL 282 >ref|XP_002268658.1| PREDICTED: uncharacterized protein LOC100241933 [Vitis vinifera] Length = 269 Score = 185 bits (470), Expect = 3e-44 Identities = 124/266 (46%), Positives = 145/266 (54%), Gaps = 22/266 (8%) Frame = +1 Query: 73 AIESCAFQLLSWRPF----SAKALDSDS--SKP-----CYGGPHSKRPCRADRSTSSFSI 219 +IESC FQL SWRPF + K L+ DS SKP G HSKRPC +DR TS F I Sbjct: 6 SIESCTFQLHSWRPFQLPTTPKTLEPDSHNSKPYSITTSSNGLHSKRPCLSDRKTS-FPI 64 Query: 220 DAILDMSKLSLFDDDR---ALPLSAARKHWF--AXXXXXXXXXXXXXXXXXXXXXXXXVG 384 DA LD+SKLSL +DD+ + P + W VG Sbjct: 65 DA-LDISKLSLLEDDKPASSAPRNRGNVRWIDRKRRRRGSRSVSGRSSDRSGTRRCCSVG 123 Query: 385 ASAANGTCSDFPMVAGGTDSSGELF--GGARWASEVSERSLRREKEGNVGGERECVIAGH 558 ASAA TCSDFP VA GTDSSGELF G + W+S+VSE R+ GE+E + +G Sbjct: 124 ASAAYATCSDFP-VAAGTDSSGELFVNGDSNWSSDVSEAKNSRKDRDGGSGEKENLGSGF 182 Query: 559 GLFGNCDIQGNESGYGSEPGYK--XXXXXXXXXXXXXXXRRAFFWGEECGENTSQSEMVS 732 G G + QGNESGYGSEPGY+ R FWGE+ G+N + EMV Sbjct: 183 GHIGIFETQGNESGYGSEPGYRGDAEFGYGDELDEEEDDARLLFWGEQLGDNDTNMEMVG 242 Query: 733 EN--SLQKGHHRCRRKKHDLRMADAL 804 EN S QK HHRCRRKKHD RM DAL Sbjct: 243 ENTFSEQKAHHRCRRKKHDYRMIDAL 268 >ref|XP_006355807.1| PREDICTED: uncharacterized protein LOC102585515 [Solanum tuberosum] Length = 269 Score = 182 bits (461), Expect = 3e-43 Identities = 127/268 (47%), Positives = 152/268 (56%), Gaps = 22/268 (8%) Frame = +1 Query: 55 TLESIHAIESCAFQLLSWRPF-----SAKALDSDSSKP----CYGGPHSKRPCRADRSTS 207 TL+S HAIESC + L SW+PF ++K LD DS K +GG H+KR CRADR+TS Sbjct: 5 TLDSRHAIESCTYHLHSWKPFQFPTPNSKTLDLDSPKTYSPSTHGGVHTKRQCRADRTTS 64 Query: 208 SFSIDAILDMSKLSLFDDDRALPLSAARKHWFAXXXXXXXXXXXXXXXXXXXXXXXX--- 378 I+A LDMSKLSLF++DR PLS ++ Sbjct: 65 
-IPIEA-LDMSKLSLFEEDR--PLSVHKRENLRLIAGKRRRRGSRSVSGRSSDRSGTRRR 120 Query: 379 ---VGASAANGTCSDFPMVAGGTDSSGELF--GGARWASEVSE--RSLRREKEGNVGGER 537 VGASAA GTCSDFP VA GTDSSGELF G W +VSE +SLR+EKEG GER Sbjct: 121 CCSVGASAAYGTCSDFP-VAVGTDSSGELFVNGDMHWTLDVSEVTKSLRKEKEGGGVGER 179 Query: 538 ECVIAGHGL-FGNCDIQGNESGYGSEPGYKXXXXXXXXXXXXXXX--RRAFFWGEECGEN 708 E + G + GN + GNESGYGSEPGY+ +R FWG+E G Sbjct: 180 ESNLNGLSVQSGNFEGLGNESGYGSEPGYRGDAEFGYGDEFDEEEDDQRLSFWGDEFGA- 238 Query: 709 TSQSEMVSENSLQKGHHRCRRKKHDLRM 792 S+ E V EN+LQK HHRCRR+K D RM Sbjct: 239 LSRMEKVGENTLQKVHHRCRRRKQDCRM 266 >ref|XP_004240545.1| PREDICTED: uncharacterized protein LOC101258757 [Solanum lycopersicum] Length = 269 Score = 182 bits (461), Expect = 3e-43 Identities = 127/268 (47%), Positives = 152/268 (56%), Gaps = 22/268 (8%) Frame = +1 Query: 55 TLESIHAIESCAFQLLSWRPF-----SAKALDSDSSKP----CYGGPHSKRPCRADRSTS 207 TL+S HAIESC + L SW+PF ++K LD DS K +GG H+KR CRADR+TS Sbjct: 5 TLDSRHAIESCTYHLHSWKPFQFPSPNSKTLDLDSPKTYSPSTHGGLHTKRQCRADRTTS 64 Query: 208 SFSIDAILDMSKLSLFDDDRALPLSAARKHWFAXXXXXXXXXXXXXXXXXXXXXXXX--- 378 I+A LDMSKLSLF++D+ PLS ++ Sbjct: 65 -IPIEA-LDMSKLSLFEEDK--PLSVHKRENLRLIAGKRRRRGSRSVSGRSSDRSGTRRR 120 Query: 379 ---VGASAANGTCSDFPMVAGGTDSSGELF--GGARWASEVSE--RSLRREKEGNVGGER 537 VGASAA GTCSDFP VA GTDSSGELF G W +VSE +SLR+EKEG GER Sbjct: 121 CCSVGASAAYGTCSDFP-VAAGTDSSGELFVNGDMHWTLDVSEVTKSLRKEKEGGGVGER 179 Query: 538 ECVIAGHGL-FGNCDIQGNESGYGSEPGYKXXXXXXXXXXXXXXX--RRAFFWGEECGEN 708 E + G + GN + GNESGYGSEPGY+ +R FWG+E G Sbjct: 180 ENNLNGLSVQSGNFEGLGNESGYGSEPGYRGDAEFGYGDEFDEEEDDQRLSFWGDEFGA- 238 Query: 709 TSQSEMVSENSLQKGHHRCRRKKHDLRM 792 S+ E V ENSLQK HHRCRR+K D RM Sbjct: 239 LSRMEKVGENSLQKVHHRCRRRKQDCRM 266 >gb|EPS68606.1| hypothetical protein M569_06163, partial [Genlisea aurea] Length = 212 Score = 179 bits (454), Expect = 2e-42 Identities = 112/216 (51%), Positives = 134/216 (62%), Gaps = 6/216 (2%) Frame = +1 Query: 73 AIESCAFQLLSWRPFSAKALDSDSSKPCYGGPH-SKRPCRADRSTSSFSIDAILDMSKLS 249 A+ESCA Q+L WRPF K LD DS+ 
PH SKR C ADR+TSSFSIDAILDMSK+S Sbjct: 6 AVESCALQILGWRPFGKK-LDRDSAAV----PHTSKRFCGADRATSSFSIDAILDMSKIS 60 Query: 250 LFDDD--RALPLSAARKH-WFAXXXXXXXXXXXXXXXXXXXXXXXXVGASAANGTCSDFP 420 LFDDD RA+ + +R + WFA VGASAANGTCSDFP Sbjct: 61 LFDDDTSRAVSIPFSRNNRWFARKRRRRAGSRSVSGRSSDRRGRS-VGASAANGTCSDFP 119 Query: 421 MVAGGTDSSGELFGGARWASEVSERSLRREKEG-NVGGERECVIAGHGLFGNCDIQ-GNE 594 MVAGGTDSSGELFG + WAS+VS+R+ RR++E GG+RE + + G NC+ GNE Sbjct: 120 MVAGGTDSSGELFGESNWASDVSDRNSRRDREAVGCGGDRENLTSQFG--NNCESSLGNE 177 Query: 595 SGYGSEPGYKXXXXXXXXXXXXXXXRRAFFWGEECG 702 SGYGSEPGY+ + FWG+E G Sbjct: 178 SGYGSEPGYRGDGELEYDDEEEDDP-KILFWGDEFG 212 >gb|EXC10295.1| hypothetical protein L484_006190 [Morus notabilis] Length = 275 Score = 178 bits (451), Expect = 5e-42 Identities = 128/277 (46%), Positives = 151/277 (54%), Gaps = 28/277 (10%) Frame = +1 Query: 58 LESIHAIESCAFQLLSWRPFS------AKALDSDSSKPCY---GGPHS--KRPCRADRST 204 L+S H+I+SCAFQL SWRPF K LD+ ++ Y GG H+ KRPC +DR+T Sbjct: 6 LDSRHSIDSCAFQLHSWRPFQQHSTPPTKTLDAANNPRHYRSNGGAHAITKRPCLSDRAT 65 Query: 205 SSFSIDAILDMSKLSLFDDDRALPLS---------AARKHWFAXXXXXXXXXXXXXXXXX 357 S F IDAI DMS+LSL DDD A P ARK Sbjct: 66 S-FPIDAI-DMSRLSLVDDDTARPHHHQYRGSLRLLARKR----RRRGSRSVSGRSSDRS 119 Query: 358 XXXXXXXVGASAANGTCSDFPMVAGGTDSSGELF---GGARWASEVSE-RSLRREKEGNV 525 VGASAA GTCSDFP VA GTDSSGELF G A W+S+VSE R+ RRE++G Sbjct: 120 GTRRCCSVGASAAYGTCSDFP-VAVGTDSSGELFLNTGDANWSSDVSEARNSRRERDGAG 178 Query: 526 GGERECVIAGHGLFGNCDIQGNESGYGSEPGYKXXXXXXXXXXXXXXX--RRAFFWGEEC 699 GG E G G+ G D QG ESGYGSEPGY+ R FWG Sbjct: 179 GGSGEKESFG-GVIGGFDSQGAESGYGSEPGYRGDAEFGYGDEHDEEEDDARLLFWGNRF 237 Query: 700 GENTSQSEMVSENSL--QKGHHRCRRKKHDLRMADAL 804 + S +E+V EN+ QK HHRCRRKKHD RM D++ Sbjct: 238 EDTDSMTEIVGENTFSDQKVHHRCRRKKHDCRMVDSV 274 >ref|XP_007050920.1| LYR motif-containing protein 7 isoform 2 [Theobroma cacao] gi|508703181|gb|EOX95077.1| LYR motif-containing protein 7 isoform 2 [Theobroma cacao] Length = 270 Score = 177 bits (450), Expect = 6e-42 Identities = 127/274 (46%), Positives = 147/274 
(53%), Gaps = 25/274 (9%) Frame = +1 Query: 58 LESIHAIESCAFQLLSWRPFSAKALDSDSSKPCYGGP--------HSKRPCRADRSTSSF 213 LE H+I+SC FQL SWRPF + DSS P P HSKRPC +DR+TS F Sbjct: 6 LEPRHSIDSCTFQLHSWRPFQLQQT-LDSSDPQQTPPKRASTNCFHSKRPCLSDRTTS-F 63 Query: 214 SIDAILDMSKLSLFDDDRAL---PLSAARKHW----FAXXXXXXXXXXXXXXXXXXXXXX 372 SID +SKL+L DDD P++A K FA Sbjct: 64 SID----LSKLTLLDDDNNSSYNPIAANPKRGSFRLFARKRRRRGSRSVSGRSSDRSGTR 119 Query: 373 XX--VGASAANGTCSDFPMVAGGTDSSGELFGG---ARWASEVSE-RSLRREKEGNVGGE 534 VGASAA GTCSDFP VA GTDSSGELFG A WAS+VSE R+ RRE+ GE Sbjct: 120 RCCSVGASAAYGTCSDFP-VAVGTDSSGELFGNGADAYWASDVSEARNSRRERGDGGSGE 178 Query: 535 RECVIAGHGLFGNCDIQGNESGYGSEPGYKXXXXXXXXXXXXXXXR--RAFFWGEECGEN 708 +E + G FG D QGNESGYGSEPGY+ R FWG G+ Sbjct: 179 KESL---GGQFGGFDAQGNESGYGSEPGYRGDGEFGYGDEVDEEEEDARLLFWGHHFGDT 235 Query: 709 TSQSEMVSENSL--QKGHHRCRRKKHDLRMADAL 804 S+ EMV EN+ QK HHRCRRKKHD RM D++ Sbjct: 236 DSKMEMVGENTFSDQKAHHRCRRKKHDYRMVDSV 269 >ref|XP_003519006.1| PREDICTED: uncharacterized protein LOC100795813 [Glycine max] Length = 260 Score = 177 bits (450), Expect = 6e-42 Identities = 117/260 (45%), Positives = 144/260 (55%), Gaps = 11/260 (4%) Frame = +1 Query: 58 LESIHAIESCAFQLLSWRPFSAKALDSDSSKPCYGGPHSKRPCRADRSTSSFSIDAILDM 237 L+S H +SC QL +W+PF + D KP Y KRPC +DR+T+SFS LDM Sbjct: 10 LDSRHTTDSCLLQLRTWKPFKLQQ-DGPHPKPYY----HKRPCLSDRTTTSFS----LDM 60 Query: 238 SKLSLFDDDRALPLSAARKHWFAXXXXXXXXXXXXXXXXXXXXXXXX---VGASAANGTC 408 SKL+L DDD P + A + VGASAA GTC Sbjct: 61 SKLTLADDDNHNPNNRATNYRLVARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAYGTC 120 Query: 409 SDFPMVAGGTDSSGELFGGA--RWASEVSE-RSLRREKEGNVG-GERECVIAGHGLFGNC 576 SDFP VA GTDSSGELFG W+S+VSE ++ RRE+E + G GE+E + G G+ G Sbjct: 121 SDFP-VAMGTDSSGELFGNGDPNWSSDVSEAKNSRRERERDGGSGEKENLGVGFGVSGCS 179 Query: 577 DIQGNESGYGSEPGYK--XXXXXXXXXXXXXXXRRAFFWGEECGENTSQSEMVSENSL-- 744 + GNESGYGSEPGY+ R FWG++ G S+ EMV EN+L Sbjct: 180 EANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGDQLGAVDSKMEMVGENTLLD 239 Query: 745 QKGHHRCRRKKHDLRMADAL 804 QK HHRCRR+KHD RM DAL Sbjct: 240 
QKSHHRCRRRKHDCRMVDAL 259 >ref|XP_002523082.1| conserved hypothetical protein [Ricinus communis] gi|223537644|gb|EEF39267.1| conserved hypothetical protein [Ricinus communis] Length = 261 Score = 175 bits (444), Expect = 3e-41 Identities = 115/264 (43%), Positives = 145/264 (54%), Gaps = 14/264 (5%) Frame = +1 Query: 55 TLESIHAIESCAFQLLSWRPFSAKALDSDSSKPCYGGPHSKRPCRADRSTSSFSIDAILD 234 +L+S H+I+SC FQL SWRPF + LDSD KP +KRPC +DR+TS F ID+I D Sbjct: 5 SLDSRHSIDSCTFQLHSWRPFHLQTLDSDPPKPY--SSTTKRPCLSDRTTS-FPIDSI-D 60 Query: 235 MSKLSLFDDDRALPLSAARKHWFAXXXXXXXXXXXXXXXXXXXXXXXXVGAS------AA 396 +SKLS+ DDD+ + +SAA + + A Sbjct: 61 ISKLSIIDDDKPISVSAATAYNSRGSLRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGA 120 Query: 397 NGTCSDFPMVAGGTDSSGELFGG--ARWASEVSE--RSLRREKEGNVGGERECVIAGHGL 564 +GTCSDFP VA GTDSSGELFG + W S+VSE S++REK+ E G+G Sbjct: 121 HGTCSDFP-VAVGTDSSGELFGNGDSNWGSDVSEAKNSIKREKDRE---REEKENMGYGQ 176 Query: 565 FGNCDIQGNESGYGSEPGYK--XXXXXXXXXXXXXXXRRAFFWGEECGENTSQSEMVSEN 738 FG + QGNESGYGSEPGY+ + FWG+ G + EMV EN Sbjct: 177 FGTFENQGNESGYGSEPGYRGDAEFGYEDEIDEEEDDAKLLFWGDHFGGTGPKMEMVGEN 236 Query: 739 SL--QKGHHRCRRKKHDLRMADAL 804 S QK HHRCRRKKHD RM D++ Sbjct: 237 SFSDQKSHHRCRRKKHDNRMLDSV 260 >ref|XP_007050919.1| LYR motif-containing protein 7 isoform 1 [Theobroma cacao] gi|508703180|gb|EOX95076.1| LYR motif-containing protein 7 isoform 1 [Theobroma cacao] Length = 271 Score = 174 bits (442), Expect = 5e-41 Identities = 128/275 (46%), Positives = 148/275 (53%), Gaps = 26/275 (9%) Frame = +1 Query: 58 LESIHAIESCAFQLLSWRPFSAKALDSDSSKPCYGGP--------HSKRPCRADRSTSSF 213 LE H+I+SC FQL SWRPF + DSS P P HSKRPC +DR+TS F Sbjct: 6 LEPRHSIDSCTFQLHSWRPFQLQQT-LDSSDPQQTPPKRASTNCFHSKRPCLSDRTTS-F 63 Query: 214 SIDAILDMSKLSLFDDDRAL---PLSAARKHW----FAXXXXXXXXXXXXXXXXXXXXXX 372 SID +SKL+L DDD P++A K FA Sbjct: 64 SID----LSKLTLLDDDNNSSYNPIAANPKRGSFRLFARKRRRRGSRSVSGRSSDRSGTR 119 Query: 373 XX--VGASAANGTCSDFPMVAGGTDSSGELFGG---ARWASEVSE-RSLRREKEGNVGGE 534 VGASAA GTCSDFP VA GTDSSGELFG A WAS+VSE R+ RRE+ GE Sbjct: 120 
RCCSVGASAAYGTCSDFP-VAVGTDSSGELFGNGADAYWASDVSEARNSRRERGDGGSGE 178 Query: 535 RECVIAGHGLFGNCDIQGNESGYGSEPGYKXXXXXXXXXXXXXXXR--RAFFWGEECGEN 708 +E + G FG D QGNESGYGSEPGY+ R FWG G + Sbjct: 179 KESL---GGQFGGFDAQGNESGYGSEPGYRGDGEFGYGDEVDEEEEDARLLFWGHHFGAD 235 Query: 709 T-SQSEMVSENSL--QKGHHRCRRKKHDLRMADAL 804 T S+ EMV EN+ QK HHRCRRKKHD RM D++ Sbjct: 236 TDSKMEMVGENTFSDQKAHHRCRRKKHDYRMVDSV 270 >ref|XP_006588589.1| PREDICTED: uncharacterized protein LOC100798288 [Glycine max] Length = 260 Score = 174 bits (440), Expect = 9e-41 Identities = 120/266 (45%), Positives = 144/266 (54%), Gaps = 17/266 (6%) Frame = +1 Query: 58 LESIHAIESCAFQLLSWRPFSAKALDSDSSKPCYGGPHSKRPCRADRSTSSFSIDAILDM 237 L+S H+I+SC QL SW+PF + D KP Y KRPC +DR+T+SFS LDM Sbjct: 10 LDSRHSIDSCLLQLRSWKPFKLQQ-DGPHPKPYY----HKRPCLSDRTTTSFS----LDM 60 Query: 238 SKLSLFDDDRALPLS----------AARKHWFAXXXXXXXXXXXXXXXXXXXXXXXXVGA 387 SKL+L DD + ARK VGA Sbjct: 61 SKLTLAADDDTIHNPNNNRATNYRLVARKR----RRRGSRSLSGRSSDRSGTRRCCSVGA 116 Query: 388 SAANGTCSDFPMVAGGTDSSGELFGGA--RWASEVSE-RSLRREKEGNVGGERECVIAGH 558 SAA GTCSDFP VA GTDSSGELFG W+S+VSE ++ RRE+E + GE+E V G Sbjct: 117 SAAYGTCSDFP-VAMGTDSSGELFGNGDPNWSSDVSEAKNSRRERERD--GEKENVGVGF 173 Query: 559 GLFGNCDIQGNESGYGSEPGYK--XXXXXXXXXXXXXXXRRAFFWGEECGENTSQSEMVS 732 G+ G D GNESGYGSEPGY+ R FWG++ G S+ EMV Sbjct: 174 GVSGCSDANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGDQLGAVDSKREMVG 233 Query: 733 ENSL--QKGHHRCRRKKHDLRMADAL 804 EN+L QK HHRCRR+KHD RM DAL Sbjct: 234 ENTLLDQKSHHRCRRRKHDCRMVDAL 259 >ref|XP_007144318.1| hypothetical protein PHAVU_007G146100g [Phaseolus vulgaris] gi|561017508|gb|ESW16312.1| hypothetical protein PHAVU_007G146100g [Phaseolus vulgaris] Length = 261 Score = 172 bits (436), Expect = 3e-40 Identities = 117/264 (44%), Positives = 143/264 (54%), Gaps = 15/264 (5%) Frame = +1 Query: 58 LESIHAIESCAFQLLSWRPFSAKALDSDSSKPCYGGPHSKRPCRADRSTSSFSIDAILDM 237 L+S H+I+SC QL SW+PF K D KP Y KRPC +DR+T+SFS LD+ Sbjct: 10 LDSRHSIDSCMLQLRSWKPF--KLQDGPHPKPYY----YKRPCLSDRATTSFS----LDI 59 Query: 238 
SKLSLFDDDRALPLSAARKHWFAXXXXXXXXXXXXXXXXXXXXXXXX--------VGASA 393 +KL+L D D ++ H VGASA Sbjct: 60 AKLTLADADDTTTIANNPNHRATNYRLVARKRRRRGSRSVSGRSSDRSGTRRCCSVGASA 119 Query: 394 ANGTCSDFPMVAGGTDSSGELFGGA--RWASEVSE-RSLRREKEGNVGGERECVIAGHGL 564 A GTCSDFP VA GTDSSGELFG W+S+VSE ++ RRE+E + GERE V G G+ Sbjct: 120 AYGTCSDFP-VAMGTDSSGELFGNGDPNWSSDVSEAKNSRRERERD--GERENVGVGFGV 176 Query: 565 FGNCDIQGNESGYGSEPGYK--XXXXXXXXXXXXXXXRRAFFWGEECGENTSQSEMVSEN 738 G + GNESGYGSEPGY+ R FWG++ G S+ EMV EN Sbjct: 177 SGCSEANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGDQFGAVDSKREMVGEN 236 Query: 739 SL--QKGHHRCRRKKHDLRMADAL 804 +L QK HHRCRR+KHD RM DAL Sbjct: 237 TLLDQKSHHRCRRRKHDCRMVDAL 260 >ref|XP_004152251.1| PREDICTED: uncharacterized protein LOC101206482 [Cucumis sativus] Length = 266 Score = 166 bits (419), Expect = 3e-38 Identities = 119/270 (44%), Positives = 148/270 (54%), Gaps = 21/270 (7%) Frame = +1 Query: 58 LESIHAIESCAFQLLSWRPFSA-KALDSD--------SSKPCYGGP--HSKRPCRADRST 204 L+S H+I+SC + W PF K LDSD +SKP Y H+KRPC +DR+T Sbjct: 6 LDSRHSIDSCTLKFHGWTPFHLPKTLDSDPHNTSAPTNSKPYYSSTPLHTKRPCLSDRTT 65 Query: 205 SSFSIDAILDMSKLSLFDDDRAL--PLSAARKHWFAXXXXXXXXXXXXXXXXXXXXXXXX 378 S F++DAI DMS LSL DDD+ P + R Sbjct: 66 S-FNVDAI-DMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGTRRCCS 123 Query: 379 VGASAANGTCSDFPMVAGGTDSSGELF--GGARWASEVSE-RSLRREKEGNVGGERECVI 549 VGASAA+GTCSDFP +A GTDSSGELF G A W+S+VSE ++ RRE+E E++ + Sbjct: 124 VGASAAHGTCSDFP-IAVGTDSSGELFVNGDANWSSDVSEAKNSRRERE-----EKDHLG 177 Query: 550 AGH-GLFGNCDIQGNESGYGSEPGYKXXXXXXXXXXXXXXXR--RAFFWGEECGENTSQS 720 +G G D QGNESGYGSEPGY+ R WGE G+ S+ Sbjct: 178 SGFVSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGD--SRM 235 Query: 721 EMVSENSL--QKGHHRCRRKKHDLRMADAL 804 E+V EN+ QK HHRCRRKKH+ RM DAL Sbjct: 236 EIVGENTFADQKSHHRCRRKKHECRMVDAL 265 >gb|EYU46668.1| hypothetical protein MIMGU_mgv1a016256mg [Mimulus guttatus] Length = 128 Score = 155 bits (393), Expect = 3e-35 Identities = 77/127 (60%), Positives = 90/127 (70%) Frame = +1 Query: 421 
MVAGGTDSSGELFGGARWASEVSERSLRREKEGNVGGERECVIAGHGLFGNCDIQGNESG 600 M AGGTDSSGELFG A WAS+VS+R+ RRE+EG+ GERE V AG+ FGNCD QGNESG Sbjct: 1 MAAGGTDSSGELFGDANWASDVSDRNSRREREGSCAGEREHVNAGYVQFGNCDAQGNESG 60 Query: 601 YGSEPGYKXXXXXXXXXXXXXXXRRAFFWGEECGENTSQSEMVSENSLQKGHHRCRRKKH 780 YGSEPGY+ R FWG+E G+N S+ E V ENSLQK HHR RRKKH Sbjct: 61 YGSEPGYRGDAEFGYDDEEEDDP-RVLFWGDEFGDNASKLERVGENSLQKAHHRGRRKKH 119 Query: 781 DLRMADA 801 ++RM D+ Sbjct: 120 EMRMMDS 126 >ref|XP_004495959.1| PREDICTED: uncharacterized protein LOC101493408 [Cicer arietinum] Length = 261 Score = 155 bits (392), Expect = 3e-35 Identities = 113/260 (43%), Positives = 137/260 (52%), Gaps = 17/260 (6%) Frame = +1 Query: 76 IESCAFQLLSWRPFSAKALDSDSSKPCYGGPH----SKRPCRADRSTSSFSIDAILDMSK 243 I+SC QL +WRPF + SS P +KRPC +DR+T+SFS LD+SK Sbjct: 11 IDSCVLQLRTWRPFHHLHPQTTSSLDGSHNPTKPSLNKRPCLSDRTTTSFS----LDLSK 66 Query: 244 LSLFDDDRALPLSA-----ARKHWFAXXXXXXXXXXXXXXXXXXXXXXXXVGASAANGTC 408 L+L DDDR + +A ARK VGASAA GTC Sbjct: 67 LTLADDDRPINNTANHRLIARKR----RRRCSRSVSGRSSDRSATRRCCSVGASAAYGTC 122 Query: 409 SDFPMVAGGTDSSGELFGG--ARWASEVSERSLRREKEGNVGGERECVIAGHGLFGNCDI 582 SDFP VA GTDSSGELFG A W+S+VSE R+ G+ E+E V G G+ G + Sbjct: 123 SDFP-VAMGTDSSGELFGNGDANWSSDVSEAKNSRDG-GSGEKEKENVALGFGVNGCSEA 180 Query: 583 QGNESGYGSEPGYK--XXXXXXXXXXXXXXXRRAFFWGEECGENT--SQSEMVSENSL-- 744 GNESGYGSEPGY+ R FWG + G S+ EMV EN+L Sbjct: 181 NGNESGYGSEPGYRGDAEFGYGDEFDEEEDDHRVLFWGNQLGGAAVDSKMEMVGENTLLD 240 Query: 745 QKGHHRCRRKKHDLRMADAL 804 QK HHR RR+K+D RM DAL Sbjct: 241 QKSHHRLRRRKNDCRMIDAL 260 >ref|XP_003591530.1| hypothetical protein MTR_1g088580 [Medicago truncatula] gi|355480578|gb|AES61781.1| hypothetical protein MTR_1g088580 [Medicago truncatula] Length = 249 Score = 148 bits (373), Expect = 5e-33 Identities = 105/252 (41%), Positives = 131/252 (51%), Gaps = 9/252 (3%) Frame = +1 Query: 76 IESCAFQLLSWRPFSAKALDSDSSKPCYGGPHSKRPCRADRSTSSFSIDAILDMSKLSLF 255 +++C QL +W+PF + S +KRPC +DR+T+SFS LD+SKL+L Sbjct: 7 
LDTCVLQLRTWKPFHQ--IHDHGSHSHNNNNINKRPCLSDRTTTSFS----LDLSKLTLT 60 Query: 256 DDDRALPLSAARKHWFAXXXXXXXXXXXXXXXXXXXXXXXXVGASAANGTCSDFPMVAGG 435 D++ P + R VGASAA GTCSDFP VA G Sbjct: 61 DNN---PPANYRLIARKRRRRGSRSVSGRSSDRSATRRCCSVGASAAYGTCSDFP-VAMG 116 Query: 436 TDSSGELFGG--ARWASEVSERSLRRE--KEGNVGGERECVIAGHGLFGNCDIQGNESGY 603 TDSSGELFG A W+S+VSE R+ G E+E V G G+ G D GNESGY Sbjct: 117 TDSSGELFGNGDANWSSDVSEAKNSRDCGGSGEKEKEKENVGVGFGVNGCSDANGNESGY 176 Query: 604 GSEPGYK--XXXXXXXXXXXXXXXRRAFFWGEE-CGENTSQSEMVSENSL--QKGHHRCR 768 GSEPGY+ R FWG + G S+ EMV EN+L QK HHRCR Sbjct: 177 GSEPGYRGDAEFGYGDEFDEEEDDHRLLFWGNQLVGAVDSKMEMVGENTLLDQKSHHRCR 236 Query: 769 RKKHDLRMADAL 804 R+K+D RM DAL Sbjct: 237 RRKNDCRMIDAL 248 >ref|XP_002321007.2| hypothetical protein POPTR_0014s12400g [Populus trichocarpa] gi|550324059|gb|EEE99322.2| hypothetical protein POPTR_0014s12400g [Populus trichocarpa] Length = 279 Score = 137 bits (345), Expect = 1e-29 Identities = 109/272 (40%), Positives = 137/272 (50%), Gaps = 25/272 (9%) Frame = +1 Query: 64 SIHAIESCAFQLLSWRPFSAKALDSD---SSKPCYGGPHS--KRPCRADRSTSSFSIDAI 228 S H+I+SC QL SWRPF LDSD +SKP Y + KRPC +DR+TS S Sbjct: 23 SRHSIDSCTLQLHSWRPF----LDSDPPTNSKP-YASSRTLPKRPCLSDRATSFPSNIDS 77 Query: 229 LDMSKLSLFDDD-----RALPLSAARKH---------WFAXXXXXXXXXXXXXXXXXXXX 366 +D+SKLSL DD + +P + A + Sbjct: 78 IDISKLSLLQDDDNNNNKPIPATPAVTNSPYKRGTLRLIERKRRRRGSRSVSGRSSDRSG 137 Query: 367 XXXXVGASAANGTCSDFPMVAGGTDSSGELF--GGARWASEVSE-RSLRREKEGNVGGER 537 AA+GTCSDFP VA GTDSSGELF G A WAS+VSE ++ +E+E E+ Sbjct: 138 TWRCCSVGAAHGTCSDFP-VAVGTDSSGELFVNGDANWASDVSEAKNSIKERE-----EK 191 Query: 538 ECVIAGHGLFGNCDIQGNESGYGSEPGYKXXXXXXXXXXXXXXX--RRAFFWGEECGENT 711 E ++ FGN D +ESGYGSEPGY+ R FWG + Sbjct: 192 ENLLGVGSAFGNLD---SESGYGSEPGYRGDAEFGYGDEVDEEEDDARLLFWGHHFQD-- 246 Query: 712 SQSEMVSENSLQ-KGHHRCRRKKHDLRMADAL 804 S+ EMV EN+ K HHRCRR+KHD RM D+L Sbjct: 247 SKMEMVGENTFDPKTHHRCRRRKHDYRMVDSL 278 >ref|XP_006396451.1| hypothetical protein EUTSA_v10028893mg [Eutrema salsugineum] 
gi|557097468|gb|ESQ37904.1| hypothetical protein EUTSA_v10028893mg [Eutrema salsugineum] Length = 259 Score = 135 bits (339), Expect = 5e-29 Identities = 110/273 (40%), Positives = 136/273 (49%), Gaps = 18/273 (6%) Frame = +1 Query: 40 MSRGGTLESIHAIESCAFQLLSWRPFS-AKALDSD----SSKPCYGGPHSKRPCRADRST 204 MS+ S +IESC QLLSWRPF +K LDS S KP YG +KRPC +DRST Sbjct: 1 MSQKHLESSRSSIESCTLQLLSWRPFHRSKTLDSSDQSQSHKP-YGSISTKRPCFSDRST 59 Query: 205 SSFSIDAILDMSKLSLFDDDR---ALPLSAARKHWFAXXXXXXXXXXXXXXXXXXXXXXX 375 S FSI+A MS+LSL DDD LSA+ + Sbjct: 60 S-FSIEA---MSRLSLADDDNNNNGGKLSASNYNSKGSFRLVARKRRRRNSRSVSGRSSD 115 Query: 376 XVGAS-----AANGTCSDFPMVAGGTDSSGELFGGARWASEVSERSLRREKEGNVGGERE 540 G A+GTCSDFP A GTDSSGELF A WAS+VSE RRE+ + GGE+E Sbjct: 116 RSGTRRCCSIGAHGTCSDFPF-AVGTDSSGELFSEANWASDVSE--ARRERRDS-GGEKE 171 Query: 541 CVIAGHGLFGNCDIQGNESGYGSEPGYKXXXXXXXXXXXXXXXR--RAFFWGEECGENTS 714 +G G D+ GNESGYGSEPGY+ + FW G+ S Sbjct: 172 A--SGFGFAVGIDLMGNESGYGSEPGYRGDAEFGYGDEFDDEEEDVKPLFW----GDTGS 225 Query: 715 QSEMVSENSLQKGHH--RCRRKK-HDLRMADAL 804 EM + + H RCRR++ HD + D++ Sbjct: 226 TMEMSGDTKFTESKHQFRCRRRRQHDYKTVDSM 258