BLASTX nr result
ID: Mentha22_contig00029482
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00029482 (829 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gb|EYU46216.1| hypothetical protein MIMGU_mgv1a023243mg, partial...     382   e-103
gb|EYU35929.1| hypothetical protein MIMGU_mgv1a024138mg, partial...     311   2e-82
gb|EPS62528.1| hypothetical protein M569_12262, partial [Genlise...     300   3e-79
ref|XP_007015166.1| DNA binding,zinc ion binding,DNA binding, pu...     282   9e-74
ref|XP_007015165.1| DNA binding,zinc ion binding,DNA binding, pu...     282   9e-74
ref|XP_007015163.1| DNA binding,zinc ion binding,DNA binding, pu...     282   9e-74
ref|XP_007015162.1| DNA binding,zinc ion binding,DNA binding, pu...     282   9e-74
ref|XP_007015161.1| DNA binding,zinc ion binding,DNA binding, pu...     282   9e-74
ref|XP_002513535.1| hypothetical protein RCOM_1578820 [Ricinus c...     267   4e-69
ref|XP_002274937.2| PREDICTED: uncharacterized protein LOC100260...     266   9e-69
emb|CBI24209.3| unnamed protein product [Vitis vinifera]                266   9e-69
ref|XP_007214563.1| hypothetical protein PRUPE_ppa000168mg [Prun...     264   3e-68
ref|XP_006362316.1| PREDICTED: uncharacterized protein LOC102579...     264   3e-68
ref|XP_004251353.1| PREDICTED: uncharacterized protein LOC101256...     264   3e-68
gb|EXC04604.1| Nucleosome-remodeling factor subunit BPTF [Morus ...     256   9e-66
ref|XP_006590775.1| PREDICTED: uncharacterized protein LOC100800...     254   4e-65
ref|XP_002299794.2| hypothetical protein POPTR_0001s26130g, part...     254   4e-65
ref|XP_003538947.1| PREDICTED: uncharacterized protein LOC100800...     254   4e-65
ref|XP_002313363.2| hypothetical protein POPTR_0009s05370g [Popu...
253 6e-65 ref|XP_004291756.1| PREDICTED: uncharacterized protein LOC101311... 252 1e-64 >gb|EYU46216.1| hypothetical protein MIMGU_mgv1a023243mg, partial [Mimulus guttatus] Length = 1772 Score = 382 bits (980), Expect = e-103 Identities = 187/279 (67%), Positives = 215/279 (77%), Gaps = 4/279 (1%) Frame = +1 Query: 4 EKGKNLRPKSYSHKPNAINSK---VEVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTST 174 EKGK+L +++S+ P S +VHCG NYVNCY+ AR AS FYEE+ K+SDKTS Sbjct: 906 EKGKDLNLENHSYAPYTTKSTGILPQVHCGMNYVNCYDSARPASSFYEEWNGKSSDKTSE 965 Query: 175 DAPRSAEEILAGQLKIVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSM 354 +AP S E+ + QLK+VL+RF FSWSNIQ N+ SRKE CGWC YCRVPE ++DCLF M Sbjct: 966 NAPISVEQFVGRQLKVVLDRFAHFSWSNIQISNINSRKEGCGWCFYCRVPEEDKDCLFIM 1025 Query: 355 NDSTPDVEKFSHEVLGIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHA 534 NDS P V+ F+ ++LGIQ K KNHLIDVMCHIICIEDHLQGLL+GPWLNP YSMLW Sbjct: 1026 NDSIPAVQNFTSDILGIQSRKHRKNHLIDVMCHIICIEDHLQGLLLGPWLNPHYSMLWRK 1085 Query: 535 DLCGATDIASLKNFLLQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKHGI 714 + G DIA LKN LL+LESNLH LALSADW+KHVD V T+GSA+HIVSSSAR SSKHGI Sbjct: 1086 AVLGVDDIAPLKNLLLKLESNLHQLALSADWQKHVDFVATMGSASHIVSSSARVSSKHGI 1145 Query: 715 SRKRAKSSDVS-GPSSNAATGLSLFWWRGGRGSRSLFNW 828 RK K+SDV PSSNAA GLSLFWWRGG SR LFNW Sbjct: 1146 GRKSIKNSDVERTPSSNAAKGLSLFWWRGGTSSRKLFNW 1184 >gb|EYU35929.1| hypothetical protein MIMGU_mgv1a024138mg, partial [Mimulus guttatus] Length = 936 Score = 311 bits (797), Expect = 2e-82 Identities = 155/277 (55%), Positives = 193/277 (69%), Gaps = 4/277 (1%) Frame = +1 Query: 7 KGKNLRPKSYSHKPNAINSKV---EVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTD 177 +GKN+ +Y + INS +V CGT+YVNCYEFA+TAS + E T K++DKT Sbjct: 204 EGKNISCANYVCASSTINSTAIGSQVPCGTHYVNCYEFAQTASSIFRELTAKSTDKTIEG 263 Query: 178 APRSAEEILAGQLKIVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMN 357 A RSAEE ++GQLK++ NRF QFSWSN++N N+ S KE+CGWC YC+VPE E DC F MN Sbjct: 264 AKRSAEENVSGQLKLIFNRFAQFSWSNMRNSNVTSGKEKCGWCSYCKVPEDEMDCSFVMN 323 Query: 358 DSTPDVEKFSHEVLGIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHAD 
537 D+ P +E F+ E L I K KNHLIDVMCHIIC+EDHLQGLLVGPWLNP+YS LW Sbjct: 324 DNFPALENFTTESLDIGSTK-RKNHLIDVMCHIICMEDHLQGLLVGPWLNPNYSQLWRKS 382 Query: 538 LCGATDIASLKNFLLQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKHGIS 717 + A D+ S+K LL+LESNLHHLA++ADW+K VDS T+GSA I SS R S + Sbjct: 383 VLVAADLGSIKTLLLELESNLHHLAVTADWKKSVDSASTMGSACLIAKSSRRVSLNNETK 442 Query: 718 RKRAKSSDVS-GPSSNAATGLSLFWWRGGRGSRSLFN 825 R RAK S + + +A GL L WW+G + SR LFN Sbjct: 443 RTRAKCSKLEITQTPKSACGLRLLWWKGDKASRELFN 479 >gb|EPS62528.1| hypothetical protein M569_12262, partial [Genlisea aurea] Length = 799 Score = 300 bits (769), Expect = 3e-79 Identities = 148/248 (59%), Positives = 182/248 (73%), Gaps = 1/248 (0%) Frame = +1 Query: 88 NYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLKIVLNRFVQFSWSNIQN 267 NYVN YEF+RTAS + F SDK+S D P SAEEI+A QLK++ NRF +F+WS+ Sbjct: 359 NYVNHYEFSRTASSY---FGGLASDKSSDDLPLSAEEIIARQLKVITNRFSEFAWSSTPL 415 Query: 268 LNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVLGIQPEKIAKNHLIDVM 447 N S KERCGWC +C+ PE RDCLF + ++ P VEKF + GIQ HLIDV+ Sbjct: 416 SNSTSGKERCGWCFFCKTPEDGRDCLFVLKNNIPSVEKFPYGASGIQTRNTGNRHLIDVI 475 Query: 448 CHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFLLQLESNLHHLALSADW 627 +IIC+ED+L GLL GPWLN YS +W L A+ I+SLK+ LL+LESNLHHLALS++W Sbjct: 476 HYIICLEDYLLGLLSGPWLNLQYSTIWRQALLTASSISSLKDLLLKLESNLHHLALSSEW 535 Query: 628 RKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSDVSG-PSSNAATGLSLFWWRGGR 804 KHVDSV T+GS+ HIV SS RASS++GISR+RA+SSD PSS A+GLSLFWWRGGR Sbjct: 536 SKHVDSVATMGSSCHIVMSSIRASSRNGISRRRAQSSDFGATPSSKEASGLSLFWWRGGR 595 Query: 805 GSRSLFNW 828 GSR +FNW Sbjct: 596 GSRKIFNW 603 >ref|XP_007015166.1| DNA binding,zinc ion binding,DNA binding, putative isoform 6, partial [Theobroma cacao] gi|508785529|gb|EOY32785.1| DNA binding,zinc ion binding,DNA binding, putative isoform 6, partial [Theobroma cacao] Length = 1345 Score = 282 bits (722), Expect = 9e-74 Identities = 139/264 (52%), Positives = 184/264 (69%), Gaps = 5/264 (1%) Frame = +1 Query: 52 
AINSK----VEVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLK 219 AIN+K + GT Y+N Y FA+TASL EE K S+KT+ D+ +S EEI+A Q+K Sbjct: 912 AINAKRGDASQTQPGTGYLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMK 971 Query: 220 IVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVL 399 ++L + +F W +I NL + +RKE CGWC CR P + DCLF + E E++ Sbjct: 972 VILKKSNRFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQ-EVSKSEMV 1030 Query: 400 GIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFL 579 G+Q + K H+IDV+CH IE+ L GLL GPWLNP Y +WH + A+D+ASLK+FL Sbjct: 1031 GLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASDVASLKHFL 1090 Query: 580 LQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSD-VSGPS 756 L LE+NLHHLALSA+W KHVDS T+GSA+H+V++S+RAS+KHGI+RKR +S+D S P+ Sbjct: 1091 LMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRSNDGESNPT 1150 Query: 757 SNAATGLSLFWWRGGRGSRSLFNW 828 SN A G S+ WWRGGR SR LFNW Sbjct: 1151 SNPAAGPSICWWRGGRVSRQLFNW 1174 >ref|XP_007015165.1| DNA binding,zinc ion binding,DNA binding, putative isoform 5, partial [Theobroma cacao] gi|508785528|gb|EOY32784.1| DNA binding,zinc ion binding,DNA binding, putative isoform 5, partial [Theobroma cacao] Length = 1357 Score = 282 bits (722), Expect = 9e-74 Identities = 139/264 (52%), Positives = 184/264 (69%), Gaps = 5/264 (1%) Frame = +1 Query: 52 AINSK----VEVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLK 219 AIN+K + GT Y+N Y FA+TASL EE K S+KT+ D+ +S EEI+A Q+K Sbjct: 924 AINAKRGDASQTQPGTGYLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMK 983 Query: 220 IVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVL 399 ++L + +F W +I NL + +RKE CGWC CR P + DCLF + E E++ Sbjct: 984 VILKKSNRFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQ-EVSKSEMV 1042 Query: 400 GIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFL 579 G+Q + K H+IDV+CH IE+ L GLL GPWLNP Y +WH + A+D+ASLK+FL Sbjct: 1043 GLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASDVASLKHFL 1102 Query: 580 LQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSD-VSGPS 756 L LE+NLHHLALSA+W KHVDS 
T+GSA+H+V++S+RAS+KHGI+RKR +S+D S P+ Sbjct: 1103 LMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRSNDGESNPT 1162 Query: 757 SNAATGLSLFWWRGGRGSRSLFNW 828 SN A G S+ WWRGGR SR LFNW Sbjct: 1163 SNPAAGPSICWWRGGRVSRQLFNW 1186 >ref|XP_007015163.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] gi|590584387|ref|XP_007015164.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] gi|508785526|gb|EOY32782.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] gi|508785527|gb|EOY32783.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] Length = 1859 Score = 282 bits (722), Expect = 9e-74 Identities = 139/264 (52%), Positives = 184/264 (69%), Gaps = 5/264 (1%) Frame = +1 Query: 52 AINSK----VEVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLK 219 AIN+K + GT Y+N Y FA+TASL EE K S+KT+ D+ +S EEI+A Q+K Sbjct: 924 AINAKRGDASQTQPGTGYLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMK 983 Query: 220 IVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVL 399 ++L + +F W +I NL + +RKE CGWC CR P + DCLF + E E++ Sbjct: 984 VILKKSNRFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQ-EVSKSEMV 1042 Query: 400 GIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFL 579 G+Q + K H+IDV+CH IE+ L GLL GPWLNP Y +WH + A+D+ASLK+FL Sbjct: 1043 GLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASDVASLKHFL 1102 Query: 580 LQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSD-VSGPS 756 L LE+NLHHLALSA+W KHVDS T+GSA+H+V++S+RAS+KHGI+RKR +S+D S P+ Sbjct: 1103 LMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRSNDGESNPT 1162 Query: 757 SNAATGLSLFWWRGGRGSRSLFNW 828 SN A G S+ WWRGGR SR LFNW Sbjct: 1163 SNPAAGPSICWWRGGRVSRQLFNW 1186 >ref|XP_007015162.1| DNA binding,zinc ion binding,DNA binding, putative isoform 2 [Theobroma cacao] gi|508785525|gb|EOY32781.1| DNA binding,zinc ion binding,DNA binding, putative isoform 2 [Theobroma cacao] Length = 1647 Score = 282 bits (722), Expect = 9e-74 Identities = 139/264 
(52%), Positives = 184/264 (69%), Gaps = 5/264 (1%) Frame = +1 Query: 52 AINSK----VEVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLK 219 AIN+K + GT Y+N Y FA+TASL EE K S+KT+ D+ +S EEI+A Q+K Sbjct: 924 AINAKRGDASQTQPGTGYLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMK 983 Query: 220 IVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVL 399 ++L + +F W +I NL + +RKE CGWC CR P + DCLF + E E++ Sbjct: 984 VILKKSNRFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQ-EVSKSEMV 1042 Query: 400 GIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFL 579 G+Q + K H+IDV+CH IE+ L GLL GPWLNP Y +WH + A+D+ASLK+FL Sbjct: 1043 GLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASDVASLKHFL 1102 Query: 580 LQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSD-VSGPS 756 L LE+NLHHLALSA+W KHVDS T+GSA+H+V++S+RAS+KHGI+RKR +S+D S P+ Sbjct: 1103 LMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRSNDGESNPT 1162 Query: 757 SNAATGLSLFWWRGGRGSRSLFNW 828 SN A G S+ WWRGGR SR LFNW Sbjct: 1163 SNPAAGPSICWWRGGRVSRQLFNW 1186 >ref|XP_007015161.1| DNA binding,zinc ion binding,DNA binding, putative isoform 1 [Theobroma cacao] gi|508785524|gb|EOY32780.1| DNA binding,zinc ion binding,DNA binding, putative isoform 1 [Theobroma cacao] Length = 1931 Score = 282 bits (722), Expect = 9e-74 Identities = 139/264 (52%), Positives = 184/264 (69%), Gaps = 5/264 (1%) Frame = +1 Query: 52 AINSK----VEVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLK 219 AIN+K + GT Y+N Y FA+TASL EE K S+KT+ D+ +S EEI+A Q+K Sbjct: 924 AINAKRGDASQTQPGTGYLNYYSFAQTASLVVEELMGKPSEKTNEDSLKSVEEIIAMQMK 983 Query: 220 IVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVL 399 ++L + +F W +I NL + +RKE CGWC CR P + DCLF + E E++ Sbjct: 984 VILKKSNRFHWPDINNLFVDARKENCGWCFCCRYPMDDTDCLFKITSRCVQ-EVSKSEMV 1042 Query: 400 GIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFL 579 G+Q + K H+IDV+CH IE+ L GLL GPWLNP Y +WH + A+D+ASLK+FL Sbjct: 1043 GLQSKWNKKGHVIDVICHAFSIENRLHGLLSGPWLNPQYIKIWHKSILKASDVASLKHFL 1102 Query: 580 
LQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSD-VSGPS 756 L LE+NLHHLALSA+W KHVDS T+GSA+H+V++S+RAS+KHGI+RKR +S+D S P+ Sbjct: 1103 LMLEANLHHLALSAEWMKHVDSAVTMGSASHVVTASSRASAKHGIARKRGRSNDGESNPT 1162 Query: 757 SNAATGLSLFWWRGGRGSRSLFNW 828 SN A G S+ WWRGGR SR LFNW Sbjct: 1163 SNPAAGPSICWWRGGRVSRQLFNW 1186 >ref|XP_002513535.1| hypothetical protein RCOM_1578820 [Ricinus communis] gi|223547443|gb|EEF48938.1| hypothetical protein RCOM_1578820 [Ricinus communis] Length = 1915 Score = 267 bits (682), Expect = 4e-69 Identities = 137/280 (48%), Positives = 187/280 (66%), Gaps = 5/280 (1%) Frame = +1 Query: 4 EKGKNLRPKSYSHKPNAINSK----VEVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTS 171 +K +R + S+ A+N K ++ T+Y+N Y F AS E+ K+SDKT Sbjct: 977 KKANVIRSAANSYPSFALNGKNGDASQIQPETSYLNYYNFGHIASSVAEDLLHKSSDKTI 1036 Query: 172 TDAPRSAEEILAGQLKIVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFS 351 D+ +S EEI++ Q+KI+ R +F WS+I LN+ +KE+CGWC CR + CLF+ Sbjct: 1037 EDSIKSEEEIISAQMKILSKRCPKFHWSSIPRLNVDVQKEKCGWCFSCRASSDDPGCLFN 1096 Query: 352 MNDSTPDVEKFSHEVLGIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWH 531 M S+ E + E G+Q + K HL D++ H++ IED LQGLL+GPWLNP+YS LW Sbjct: 1097 MTLSSVGGEGSAIESAGLQAKGNKKGHLTDIISHVLVIEDRLQGLLLGPWLNPNYSKLWR 1156 Query: 532 ADLCGATDIASLKNFLLQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKHG 711 + A+DI SLK+ LL LESNL LALSA+W KHVDS P +GSA+HIV +S RASSK+G Sbjct: 1157 KSVLKASDIVSLKHLLLTLESNLSRLALSAEWLKHVDSSPRMGSASHIVMASLRASSKNG 1216 Query: 712 ISRKRAKSSDV-SGPSSNAATGLSLFWWRGGRGSRSLFNW 828 IS+KRA+ S+ S PSSN+++GLS+ WWRGGR SR LF+W Sbjct: 1217 ISKKRARFSEFDSNPSSNSSSGLSMLWWRGGRLSRQLFSW 1256 >ref|XP_002274937.2| PREDICTED: uncharacterized protein LOC100260139 [Vitis vinifera] Length = 1976 Score = 266 bits (679), Expect = 9e-69 Identities = 139/281 (49%), Positives = 184/281 (65%), Gaps = 5/281 (1%) Frame = +1 Query: 1 VEKGKNLRPKSYSHKPNAINSKVE----VHCGTNYVNCYEFARTASLFYEEFTRKTSDKT 168 VE+ K + H + I+++ E V CG +Y N Y FA+TAS EE K+SDK+ Sbjct: 869 VEQEKKIESAVDGHTSSPIHTRKEDVSQVQCGIDYTNYYSFAQTASSVAEELMHKSSDKS 928 Query: 
169 STDAPRSAEEILAGQLKIVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLF 348 + SAEEI++ Q+K + F +F W N Q+L M + KE CGWC C+ +++CLF Sbjct: 929 KEHSTTSAEEIISAQIKAISKNFTKFCWPNAQSLTMDAEKENCGWCFSCKDSTGDKNCLF 988 Query: 349 SMNDSTPDVEKFSHEVLGIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLW 528 N P E E +G+Q +K K HL+DV+ +I+ IE L+GLL+GPW+NP ++ LW Sbjct: 989 KTNFMVPVQEGSKSEGVGLQSKKNRKGHLVDVINYILSIEVRLRGLLMGPWMNPHHAKLW 1048 Query: 529 HADLCGATDIASLKNFLLQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKH 708 + A+D+AS+K+ LL LESNL LALSADW K +DS T+GSA+HIV SS RASSK Sbjct: 1049 CKNALKASDVASVKHLLLTLESNLRRLALSADWLKQMDSFITMGSASHIVISS-RASSKL 1107 Query: 709 GISRKRAKSSD-VSGPSSNAATGLSLFWWRGGRGSRSLFNW 828 G+ +KR + S VS PSSNAATGLSLFWWRGGR SR LFNW Sbjct: 1108 GVGKKRTRCSGFVSKPSSNAATGLSLFWWRGGRLSRKLFNW 1148 >emb|CBI24209.3| unnamed protein product [Vitis vinifera] Length = 1805 Score = 266 bits (679), Expect = 9e-69 Identities = 139/281 (49%), Positives = 184/281 (65%), Gaps = 5/281 (1%) Frame = +1 Query: 1 VEKGKNLRPKSYSHKPNAINSKVE----VHCGTNYVNCYEFARTASLFYEEFTRKTSDKT 168 VE+ K + H + I+++ E V CG +Y N Y FA+TAS EE K+SDK+ Sbjct: 824 VEQEKKIESAVDGHTSSPIHTRKEDVSQVQCGIDYTNYYSFAQTASSVAEELMHKSSDKS 883 Query: 169 STDAPRSAEEILAGQLKIVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLF 348 + SAEEI++ Q+K + F +F W N Q+L M + KE CGWC C+ +++CLF Sbjct: 884 KEHSTTSAEEIISAQIKAISKNFTKFCWPNAQSLTMDAEKENCGWCFSCKDSTGDKNCLF 943 Query: 349 SMNDSTPDVEKFSHEVLGIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLW 528 N P E E +G+Q +K K HL+DV+ +I+ IE L+GLL+GPW+NP ++ LW Sbjct: 944 KTNFMVPVQEGSKSEGVGLQSKKNRKGHLVDVINYILSIEVRLRGLLMGPWMNPHHAKLW 1003 Query: 529 HADLCGATDIASLKNFLLQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKH 708 + A+D+AS+K+ LL LESNL LALSADW K +DS T+GSA+HIV SS RASSK Sbjct: 1004 CKNALKASDVASVKHLLLTLESNLRRLALSADWLKQMDSFITMGSASHIVISS-RASSKL 1062 Query: 709 GISRKRAKSSD-VSGPSSNAATGLSLFWWRGGRGSRSLFNW 828 G+ +KR + S VS PSSNAATGLSLFWWRGGR SR LFNW Sbjct: 1063 GVGKKRTRCSGFVSKPSSNAATGLSLFWWRGGRLSRKLFNW 1103 >ref|XP_007214563.1| hypothetical protein PRUPE_ppa000168mg [Prunus 
persica] gi|462410428|gb|EMJ15762.1| hypothetical protein PRUPE_ppa000168mg [Prunus persica] Length = 1545 Score = 264 bits (675), Expect = 3e-68 Identities = 132/254 (51%), Positives = 174/254 (68%), Gaps = 1/254 (0%) Frame = +1 Query: 70 EVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLKIVLNRFVQFS 249 EVHCG Y+NCY F + AS EE TRK+SDK D + EEI++ Q+K +L + +FS Sbjct: 794 EVHCGIGYMNCYSFGQIASSVAEELTRKSSDKIKEDTIITEEEIISAQMKTILKKSSKFS 853 Query: 250 WSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVLGIQPEKIAKN 429 N+ NLN+ ++KE+CGWC C+ P DCLF M+ +S+ + G Q ++ Sbjct: 854 GPNVGNLNLDAQKEKCGWCFSCKAPANYGDCLFIMSMGPVQDVSYSN-ITGFQSKRNKDG 912 Query: 430 HLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFLLQLESNLHHL 609 HL DV C I+ I D LQGLL+GP LNP + LW L A+D+AS+K+ LL LE+NLHHL Sbjct: 913 HLNDVRCQILSIHDRLQGLLLGPLLNPHHRELWRKSLLKASDLASIKHLLLMLEANLHHL 972 Query: 610 ALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSDVS-GPSSNAATGLSLF 786 ALSADW KHVDSV T+GSA+H+V +S RA SK+ I+RKR K SD+ P+SNAA+GL +F Sbjct: 973 ALSADWLKHVDSVVTMGSASHVV-TSLRAYSKNFINRKRPKCSDIEPTPTSNAASGLGMF 1031 Query: 787 WWRGGRGSRSLFNW 828 WWRGGR SR +F+W Sbjct: 1032 WWRGGRLSRQVFSW 1045 >ref|XP_006362316.1| PREDICTED: uncharacterized protein LOC102579382 [Solanum tuberosum] Length = 1718 Score = 264 bits (674), Expect = 3e-68 Identities = 123/248 (49%), Positives = 172/248 (69%), Gaps = 1/248 (0%) Frame = +1 Query: 88 NYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLKIVLNRFVQFSWSNIQN 267 +YVN Y FAR AS EE T+K+ KT DA ++ +EI++ QLK + ++ + F W N+QN Sbjct: 865 SYVNFYSFARIASSVVEELTKKSPGKTGEDAKKTVDEIISAQLKAISSKSIDFCWPNVQN 924 Query: 268 LNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVLGIQPEKIAKNHLIDVM 447 + + +RKE CGWC+ C+VPE E+DCLF+ N + P E FS + LG+ + ++HL++V+ Sbjct: 925 MKIDARKEDCGWCISCKVPECEKDCLFTQNSTGPAPESFSSDALGVHSRRNRESHLVNVL 984 Query: 448 CHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFLLQLESNLHHLALSADW 627 C+I+ ED L GLL GPWLNP +S W D+ A +I +L+ FLL LESNL LAL+ DW Sbjct: 985 CYILSTEDRLHGLLSGPWLNPHHSQNWRKDVTEAHEIDTLRAFLLTLESNLRPLALTPDW 1044 Query: 628 
RKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSDVS-GPSSNAATGLSLFWWRGGR 804 KHVDS+ +GS HI+ +S+R +HGI +K+++ + PSSNA +GLSLFWWRGGR Sbjct: 1045 LKHVDSLAKMGSGHHIIINSSRV--RHGIGKKKSRHLEPEVNPSSNAGSGLSLFWWRGGR 1102 Query: 805 GSRSLFNW 828 SR LFNW Sbjct: 1103 LSRRLFNW 1110 >ref|XP_004251353.1| PREDICTED: uncharacterized protein LOC101256352 [Solanum lycopersicum] Length = 1884 Score = 264 bits (674), Expect = 3e-68 Identities = 124/248 (50%), Positives = 170/248 (68%), Gaps = 1/248 (0%) Frame = +1 Query: 88 NYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLKIVLNRFVQFSWSNIQN 267 +YVN Y FAR AS EE T+K+ KT DA ++ +EI++ QLK + ++ + F W N+QN Sbjct: 1039 SYVNFYSFARIASSVVEELTKKSPGKTGQDAKKTVDEIISAQLKAISSKSIDFCWPNVQN 1098 Query: 268 LNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVLGIQPEKIAKNHLIDVM 447 + + +RKE CGWC+ C+VPE E+DCLF N + P E FS + LG+ + ++HL++V+ Sbjct: 1099 MKIDARKEDCGWCISCKVPECEKDCLFIQNSTGPAPESFSSDALGVHSRRNRESHLVNVL 1158 Query: 448 CHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFLLQLESNLHHLALSADW 627 C I+ ED L GLL GPWLNP +S W D+ A D+ +L+ FLL LESNL LAL+ DW Sbjct: 1159 CSILSTEDRLHGLLSGPWLNPHHSQNWRKDVTEAHDVDTLRAFLLTLESNLRPLALTPDW 1218 Query: 628 RKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSDVS-GPSSNAATGLSLFWWRGGR 804 KHVDS+ +GS HI+ +S+R +HGI +K+A+ + PSSNA +GLSLFWWRGGR Sbjct: 1219 LKHVDSLAKMGSGHHIIINSSRV--RHGIGKKKARHLEPEVNPSSNAGSGLSLFWWRGGR 1276 Query: 805 GSRSLFNW 828 SR LFNW Sbjct: 1277 LSRRLFNW 1284 >gb|EXC04604.1| Nucleosome-remodeling factor subunit BPTF [Morus notabilis] Length = 1761 Score = 256 bits (653), Expect = 9e-66 Identities = 129/253 (50%), Positives = 174/253 (68%) Frame = +1 Query: 70 EVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLKIVLNRFVQFS 249 EV G YVN Y F + AS E+ TRK+SDK D EEI++ Q++++L ++ +F Sbjct: 944 EVQYGNGYVNYYSFGQIASSIAEDLTRKSSDKIKQDVVILEEEIISRQMRVILKKYSKFC 1003 Query: 250 WSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVLGIQPEKIAKN 429 WS+I+ N+ +KE+CGWC CR +R+CLFSMN P E S + L +Q ++ K+ Sbjct: 1004 WSSIKTFNVDVQKEKCGWCFSCRAATDDRECLFSMNVG-PVREFPSSDDLSLQSKRNRKS 1062 Query: 430 
HLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFLLQLESNLHHL 609 HL D++ I+ IE+ L+GLL+GPWLNP+++ LW A+DIAS+K+FLL LESNL L Sbjct: 1063 HLTDIIYQILSIENRLRGLLLGPWLNPNHTKLWRKSALKASDIASVKHFLLTLESNLGRL 1122 Query: 610 ALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSDVSGPSSNAATGLSLFW 789 ALSADW KHVDS +VGSA+HIV+SSAR S K+ I RKR + SGP+ N A+GL +FW Sbjct: 1123 ALSADWLKHVDSDVSVGSASHIVTSSARGSLKNVIGRKRPITE--SGPTLNTASGLGIFW 1180 Query: 790 WRGGRGSRSLFNW 828 WRGGR SR +FNW Sbjct: 1181 WRGGRLSRKVFNW 1193 >ref|XP_006590775.1| PREDICTED: uncharacterized protein LOC100800973 isoform X2 [Glycine max] Length = 1738 Score = 254 bits (648), Expect = 4e-65 Identities = 132/272 (48%), Positives = 177/272 (65%), Gaps = 2/272 (0%) Frame = +1 Query: 16 NLRPKSYSHKPNAINSKV-EVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSA 192 NLR S P+ N EV G +Y+N Y FARTAS +E K+ +K + S Sbjct: 949 NLRSVGASITPSTDNKDTSEVPSGIDYINYYSFARTASFVAQELMCKSPEKMNKIFAMSE 1008 Query: 193 EEILAGQLKIVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPD 372 EEI++ Q K+++ + F W +IQ+LN + KE+CGWC C+ +RDCLF+ + P Sbjct: 1009 EEIMSDQAKVIMKKSTNFCWPSIQDLNAAAHKEKCGWCFTCKGENEDRDCLFN-SVVKPI 1067 Query: 373 VEKFSHEVLGIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGAT 552 E ++ ++G+QP KI L D++C I +E L+GLL+GPWLN + LWH DL A+ Sbjct: 1068 WEVPNNTLVGLQPRKIQNGRLRDIICLIFSLEVRLRGLLLGPWLNLHQTDLWHKDLLKAS 1127 Query: 553 DIASLKNFLLQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAK 732 D +K LL LESNL LALSADW KHVDSV T+GSATHIV SS+R SS+HGI RKRA+ Sbjct: 1128 DFLPVKRLLLLLESNLRLLALSADWLKHVDSVATMGSATHIVVSSSRTSSRHGIGRKRAR 1187 Query: 733 SSDV-SGPSSNAATGLSLFWWRGGRGSRSLFN 825 ++D+ + SSN A+GL ++WWRGGR SR LFN Sbjct: 1188 NTDIETSSSSNTASGLGMYWWRGGRLSRKLFN 1219 >ref|XP_002299794.2| hypothetical protein POPTR_0001s26130g, partial [Populus trichocarpa] gi|550348214|gb|EEE84599.2| hypothetical protein POPTR_0001s26130g, partial [Populus trichocarpa] Length = 1815 Score = 254 bits (648), Expect = 4e-65 Identities = 131/281 (46%), Positives = 190/281 (67%), Gaps = 5/281 (1%) Frame = +1 Query: 1 
VEKGKNLRPKSYSHKPNAINSKVEVHC----GTNYVNCYEFARTASLFYEEFTRKTSDKT 168 +++ KN P +A N+K EV GT Y+N Y F T++ + K S+KT Sbjct: 965 IKREKNPCPPPTRCPSSAGNAKAEVTLQVQPGTEYMNYYCFGHTSASIADVLLSKPSEKT 1024 Query: 169 STDAPRSAEEILAGQLKIVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLF 348 + ++ +S EE+ Q+K++L + +F WS+I LN +K +CGWC CR E DCLF Sbjct: 1025 TENSIKSDEEMALAQMKVILKKSNKFRWSSIPCLNAEVQKGKCGWCFSCRATTDEPDCLF 1084 Query: 349 SMNDSTPDVEKFSHEVLGIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLW 528 + + P E E +G+Q ++I K +LID++ HI+ IE LQGLL+GPWLNP Y+ LW Sbjct: 1085 NKSLG-PIQEGTESEAIGLQSKRIRKGYLIDLIYHILLIEHRLQGLLLGPWLNPHYTKLW 1143 Query: 529 HADLCGATDIASLKNFLLQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKH 708 + A+DIAS+K+FLL+LE+N+ LALSADW K+VDS T+GS++H+V++S+RASSK+ Sbjct: 1144 RKSILKASDIASVKHFLLKLEANVRRLALSADWVKYVDSGVTMGSSSHVVTTSSRASSKN 1203 Query: 709 GISRKRAKSSDV-SGPSSNAATGLSLFWWRGGRGSRSLFNW 828 GI RKRA+S++ S P +N+A+GLS+FWWRGGR SR LF+W Sbjct: 1204 GIGRKRARSTEFESKPCANSASGLSMFWWRGGRLSRRLFSW 1244 >ref|XP_003538947.1| PREDICTED: uncharacterized protein LOC100800973 isoform X1 [Glycine max] Length = 1735 Score = 254 bits (648), Expect = 4e-65 Identities = 132/272 (48%), Positives = 177/272 (65%), Gaps = 2/272 (0%) Frame = +1 Query: 16 NLRPKSYSHKPNAINSKV-EVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSA 192 NLR S P+ N EV G +Y+N Y FARTAS +E K+ +K + S Sbjct: 949 NLRSVGASITPSTDNKDTSEVPSGIDYINYYSFARTASFVAQELMCKSPEKMNKIFAMSE 1008 Query: 193 EEILAGQLKIVLNRFVQFSWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPD 372 EEI++ Q K+++ + F W +IQ+LN + KE+CGWC C+ +RDCLF+ + P Sbjct: 1009 EEIMSDQAKVIMKKSTNFCWPSIQDLNAAAHKEKCGWCFTCKGENEDRDCLFN-SVVKPI 1067 Query: 373 VEKFSHEVLGIQPEKIAKNHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGAT 552 E ++ ++G+QP KI L D++C I +E L+GLL+GPWLN + LWH DL A+ Sbjct: 1068 WEVPNNTLVGLQPRKIQNGRLRDIICLIFSLEVRLRGLLLGPWLNLHQTDLWHKDLLKAS 1127 Query: 553 DIASLKNFLLQLESNLHHLALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAK 732 D +K LL LESNL LALSADW KHVDSV T+GSATHIV SS+R SS+HGI RKRA+ Sbjct: 1128 
DFLPVKRLLLLLESNLRLLALSADWLKHVDSVATMGSATHIVVSSSRTSSRHGIGRKRAR 1187 Query: 733 SSDV-SGPSSNAATGLSLFWWRGGRGSRSLFN 825 ++D+ + SSN A+GL ++WWRGGR SR LFN Sbjct: 1188 NTDIETSSSSNTASGLGMYWWRGGRLSRKLFN 1219 >ref|XP_002313363.2| hypothetical protein POPTR_0009s05370g [Populus trichocarpa] gi|550331079|gb|EEE87318.2| hypothetical protein POPTR_0009s05370g [Populus trichocarpa] Length = 1934 Score = 253 bits (646), Expect = 6e-65 Identities = 122/255 (47%), Positives = 178/255 (69%), Gaps = 1/255 (0%) Frame = +1 Query: 67 VEVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLKIVLNRFVQF 246 ++V T Y+N Y F T++ E K+SDKT+ ++ +S EE+ Q+K++L + +F Sbjct: 1031 LQVQPRTEYMNYYSFGYTSASIAEVLLSKSSDKTTENSIKSDEEMALAQMKVILKKSNRF 1090 Query: 247 SWSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDSTPDVEKFSHEVLGIQPEKIAK 426 WS+I +LN +KE+CGWC CR E DCLF+M+ P E EV+ ++ ++ K Sbjct: 1091 RWSSIPSLNAEVQKEKCGWCFSCRATTDEPDCLFNMSLG-PVQEGSESEVISLKTKRNRK 1149 Query: 427 NHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFLLQLESNLHH 606 +L+D++CHI+ IED LQGLL+GPWLNP Y+ LW + A+DIA++K+ LL+LE+N+ Sbjct: 1150 GYLVDLICHILLIEDRLQGLLLGPWLNPHYTKLWRKSILKASDIATVKHLLLKLEANVRR 1209 Query: 607 LALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSDV-SGPSSNAATGLSL 783 LALSADW KHVDS T+GS++H V++S+RAS K+GI RKR +S++ S P +N A+GL + Sbjct: 1210 LALSADWVKHVDSGVTMGSSSHFVTASSRASLKNGIGRKRVRSTECQSNPCANPASGLGM 1269 Query: 784 FWWRGGRGSRSLFNW 828 FWWRGGR SR LF+W Sbjct: 1270 FWWRGGRLSRRLFSW 1284 >ref|XP_004291756.1| PREDICTED: uncharacterized protein LOC101311539 [Fragaria vesca subsp. 
vesca] Length = 1773 Score = 252 bits (644), Expect = 1e-64 Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 2/255 (0%) Frame = +1 Query: 70 EVHCGTNYVNCYEFARTASLFYEEFTRKTSDKTSTDAPRSAEEILAGQLKIVLNRFVQFS 249 EV T+Y+N Y F + AS EEF K S+K A + EEI++ Q+K ++ + +FS Sbjct: 982 EVQIATDYINYYSFGKIASSIAEEFMSKASEKNREGAVITEEEIVSAQMKTIIKKSSKFS 1041 Query: 250 WSNIQNLNMVSRKERCGWCLYCRVPEYERDCLFSMNDST-PDVEKFSHEVLGIQPEKIAK 426 W NI+NLN+ +KE+CGWC C+ P +RDCL+ M+ DV K +V+G+ +K K Sbjct: 1042 WPNIENLNIDVQKEKCGWCFSCKYPADDRDCLYIMSKQPLQDVSKT--DVVGLGLKKTPK 1099 Query: 427 NHLIDVMCHIICIEDHLQGLLVGPWLNPDYSMLWHADLCGATDIASLKNFLLQLESNLHH 606 +HL DV C I+ I D + GLL+GPWLNP ++ W L A D+AS+K+ LL L NLH+ Sbjct: 1100 DHLSDVSCQILSIHDRMLGLLLGPWLNPHHTECWRNSLLNACDLASVKHLLLLLVENLHY 1159 Query: 607 LALSADWRKHVDSVPTVGSATHIVSSSARASSKHGISRKRAKSSDV-SGPSSNAATGLSL 783 ALSADW KHVDSV T+GSA+H+V+S RA SK+ SRKR K SD+ S PSSNA +GL + Sbjct: 1160 RALSADWLKHVDSVVTMGSASHVVTS-LRACSKNMNSRKRPKFSDIDSNPSSNAGSGLGM 1218 Query: 784 FWWRGGRGSRSLFNW 828 FWWRGGR SR +F+W Sbjct: 1219 FWWRGGRLSRQVFSW 1233