BLASTX nr result
ID: Scutellaria22_contig00016171
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00016171 (2200 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gb|ADY38784.1| sequence-specific DNA-binding transcription facto...   563   e-158
gb|ADZ55295.1| sequence-specific DNA binding protein [Coffea ara...   554   e-155
gb|ABZ89177.1| putative protein [Coffea canephora]                    554   e-155
ref|XP_002524572.1| hypothetical protein RCOM_1211540 [Ricinus c...   544   e-152
ref|XP_002321223.1| predicted protein [Populus trichocarpa] gi|2...   539   e-150

>gb|ADY38784.1| sequence-specific DNA-binding transcription factor [Coffea arabica]
Length = 1116

Score = 563 bits (1450), Expect = e-158
Identities = 299/535 (55%), Positives = 372/535 (69%), Gaps = 6/535 (1%)
Frame = +1

Query: 64   GGNDANDSEYDSRDSGLSLTVVDKT--NSNILPVYNEIDESHPGESWLLGLMEGDYSDLS 237
            G + A DSE ++R S + K ++N+L V EIDESHPGE WLLGLMEG+YSDLS
Sbjct: 577  GHSSAEDSECETRSSRSNKLRRRKNYMSNNMLTVSTEIDESHPGEVWLLGLMEGEYSDLS 636

Query: 238  IEEKLIALATLIDLLHAGSSIRMEDPLTSIAECLPGTNHHGSGAKIKRSMVKQCN---SI 408
            IEEKL AL LIDL+ +GSS+R+EDP+ +I +P H +GAKIKRS KQ N
Sbjct: 637  IEEKLCALLALIDLVSSGSSVRLEDPVAAITTFVPNMTQHSTGAKIKRSTAKQYNFPRQA 696

Query: 409  GILESCGGQTSSRCDMTPEPIDSLVIMSKIDEEEKHINMKKIAKQMEVEDLLHPMQSIFL 588
            G G+ +S + PIDSLV+MSK E E+ +M+K ++ME + LHPMQSI+L
Sbjct: 697  GGYCGANGRDASSTSVL-NPIDSLVLMSKTSERERSCSMRKDNREMEASEDLHPMQSIYL 755

Query: 589  GSDRRYNRYWLFLGPCDELDPGHRRIYFESSEDGHWEMIDTKEAXXXXXXXXXRRGAREA 768
            GSDRRYNRYWLFLGPC+ DPGH+RIYFESSEDG+WE ID +EA RRG REA
Sbjct: 756  GSDRRYNRYWLFLGPCNGSDPGHKRIYFESSEDGNWEFIDNEEALCSLVSSLDRRGQREA 815

Query: 769  RLLESLEKREAILSGMMSNTPNDIENRQPTQSYQSELNTSREESSSPVSDVDNRLNLSEM 948
            LL SLEKRE L MSN ND Q S QS+ NTSRE+S S VSDVDN L+L E+
Sbjct: 816  FLLSSLEKRELYLCRAMSNVVNDAGIGQLNHSDQSDQNTSREDSLSAVSDVDNNLSLIEV 875

Query: 949  QSENPNSISAKAVGVGKQGEKVAEKSDNSQVFATWIWKSFYSELNTVKNGKKSYLDSLRR 1128
            Q + P S V ++ E+ + + +Q F WIWKSFYS LN VK+GK+SY+DSL R
Sbjct: 876  QKDVP---SGAVVFEMRKAEQQRHRWNLTQAFDRWIWKSFYSNLNAVKHGKRSYVDSLTR 932

Query: 1129 CDQCQDLYWRDEKHCRICHTTFELDFDLEEKYAVHSAICRSNIGVNKCRKQRILTSQLQA 1308
            C+ C DLYWRDEKHC++CHTTFELDFDLEE+YAVH+A CR N+ VNK + ++L+SQLQ+
Sbjct: 933  CEHCHDLYWRDEKHCKVCHTTFELDFDLEERYAVHTATCRGNLDVNKFPRHKVLSSQLQS 992

Query: 1309 LKAAIYAIESAIPEDALFGSWKKSSHNLWVNRLRRVSSLREFLQVLADLMNAINEDWFYQ 1488
            LKAAI AIES +P D L SW KS+HNLWV RLRR S+L E LQV+ D ++AINED FYQ
Sbjct: 993  LKAAICAIESVMPGDLLVDSWAKSAHNLWVKRLRRASTLAECLQVIGDFVSAINEDSFYQ 1052

Query: 1489 NNVS-DSYFASEEIISNFSTTPQTYSAIALWLMKLDLLVASHVETGGSQDKAIIV 1650
            + S +S E+I+S+F T PQT SA A WL+KLD L+A H+E SQ+K ++
Sbjct: 1053 CDDSVESNCVMEDILSSFPTMPQTSSAFAFWLVKLDELIAPHLERVKSQNKLEVI 1107

>gb|ADZ55295.1| sequence-specific DNA binding protein [Coffea arabica]
Length = 1156

Score = 554 bits (1427), Expect = e-155
Identities = 298/545 (54%), Positives = 372/545 (68%), Gaps = 16/545 (2%)
Frame = +1

Query: 64   GGNDANDSEYDSRDSGLSLTVVDKT--NSNILPVYNEIDESHPGESWLLGLMEGDYSDLS 237
            G + A DSE ++R S + K ++N+L V EIDESHPGE WLLGLMEG+YSDLS
Sbjct: 607  GHSSAEDSECETRSSHSNKLRRRKNYMSNNMLTVSTEIDESHPGEVWLLGLMEGEYSDLS 666

Query: 238  IEEKLIALATLIDLLHAGSSIRME----------DPLTSIAECLPGTNHHGSGAKIKRSM 387
            IEEKL AL LIDL+ +GSS+R+E DP+ +I +P H +GAKIKRS
Sbjct: 667  IEEKLCALLALIDLVSSGSSVRLEVVHLSFRRYKDPVAAITTFVPNMTQHSTGAKIKRST 726

Query: 388  VKQCN---SIGILESCGGQTSSRCDMTPEPIDSLVIMSKIDEEEKHINMKKIAKQMEVED 558
            KQ N G G+ ++ + PIDSLV+MSK E E+ +M+K ++ME +
Sbjct: 727  AKQYNFPRQAGGYCGANGRDATSTSVL-NPIDSLVLMSKTSERERSCSMRKDNREMEASE 785

Query: 559  LLHPMQSIFLGSDRRYNRYWLFLGPCDELDPGHRRIYFESSEDGHWEMIDTKEAXXXXXX 738
            LHPMQSI+LGSDRRYNRYWLFLGPC+ DPGH+RIYFESSEDG+WE ID +EA
Sbjct: 786  DLHPMQSIYLGSDRRYNRYWLFLGPCNGSDPGHKRIYFESSEDGNWEFIDNEEALCSLVS 845

Query: 739  XXXRRGAREARLLESLEKREAILSGMMSNTPNDIENRQPTQSYQSELNTSREESSSPVSD 918
            RRG REA LL SLEKRE L MSN ND Q S QS+ NTSRE+S S VSD
Sbjct: 846  SLDRRGQREAFLLSSLEKRELYLCRAMSNVVNDAGIGQLNHSDQSDQNTSREDSLSAVSD 905

Query: 919  VDNRLNLSEMQSENPNSISAKAVGVGKQGEKVAEKSDNSQVFATWIWKSFYSELNTVKNG 1098
            VDN L+L E+Q + P S V ++ E+ + + +Q F WIWKSFYS LN VK+G
Sbjct: 906  VDNNLSLIEVQKDVP---SGAVVFEMRKAEQQRHRWNLTQAFDRWIWKSFYSNLNAVKHG 962

Query: 1099 KKSYLDSLRRCDQCQDLYWRDEKHCRICHTTFELDFDLEEKYAVHSAICRSNIGVNKCRK 1278
            K+SY+DSL RC+ C DLYWRDEKHC++CHTTFELDFDLEE+YAVH+A CR N+ VNK +
Sbjct: 963  KRSYVDSLTRCEHCHDLYWRDEKHCKVCHTTFELDFDLEERYAVHTATCRGNLDVNKFPR 1022

Query: 1279 QRILTSQLQALKAAIYAIESAIPEDALFGSWKKSSHNLWVNRLRRVSSLREFLQVLADLM 1458
            ++L+SQLQ+LKAAI AIES +P D L SW KS+HNLWV RLRR S+L E LQV+ D +
Sbjct: 1023 HKVLSSQLQSLKAAICAIESVMPGDLLVDSWAKSAHNLWVKRLRRASTLAECLQVIGDFV 1082

Query: 1459 NAINEDWFYQNNVS-DSYFASEEIISNFSTTPQTYSAIALWLMKLDLLVASHVETGGSQD 1635
            +AINED FYQ + S +S E+I+S+F T PQT SA A WL+KLD L+A H+E SQ+
Sbjct: 1083 SAINEDCFYQCDDSVESNCVMEDILSSFPTMPQTSSAFAFWLVKLDELIAPHLERVKSQN 1142

Query: 1636 KAIIV 1650
            K ++
Sbjct: 1143 KLEVI 1147

>gb|ABZ89177.1| putative protein [Coffea canephora]
Length = 1156

Score = 554 bits (1427), Expect = e-155
Identities = 298/545 (54%), Positives = 372/545 (68%), Gaps = 16/545 (2%)
Frame = +1

Query: 64   GGNDANDSEYDSRDSGLSLTVVDKT--NSNILPVYNEIDESHPGESWLLGLMEGDYSDLS 237
            G + A DSE ++R S + K ++N+L V EIDESHPGE WLLGLMEG+YSDLS
Sbjct: 607  GHSSAEDSECETRSSHSNKLRRRKNYMSNNMLTVSTEIDESHPGEVWLLGLMEGEYSDLS 666

Query: 238  IEEKLIALATLIDLLHAGSSIRME----------DPLTSIAECLPGTNHHGSGAKIKRSM 387
            IEEKL AL LIDL+ +GSS+R+E DP+ +I +P H +GAKIKRS
Sbjct: 667  IEEKLCALLALIDLVSSGSSVRLEVVHLSFRRYKDPVAAITTFVPNMTQHSTGAKIKRST 726

Query: 388  VKQCN---SIGILESCGGQTSSRCDMTPEPIDSLVIMSKIDEEEKHINMKKIAKQMEVED 558
            KQ N G G+ ++ + PIDSLV+MSK E E+ +M+K ++ME +
Sbjct: 727  AKQYNFPRQAGGYCGANGRDATSTSVL-NPIDSLVLMSKTSERERSCSMRKDNREMEASE 785

Query: 559  LLHPMQSIFLGSDRRYNRYWLFLGPCDELDPGHRRIYFESSEDGHWEMIDTKEAXXXXXX 738
            LHPMQSI+LGSDRRYNRYWLFLGPC+ DPGH+RIYFESSEDG+WE ID +EA
Sbjct: 786  DLHPMQSIYLGSDRRYNRYWLFLGPCNGSDPGHKRIYFESSEDGNWEFIDNEEALCSLVS 845

Query: 739  XXXRRGAREARLLESLEKREAILSGMMSNTPNDIENRQPTQSYQSELNTSREESSSPVSD 918
            RRG REA LL SLEKRE L MSN ND Q S QS+ NTSRE+S S VSD
Sbjct: 846  SLDRRGQREAFLLSSLEKRELYLCRAMSNVVNDAGIGQLNHSDQSDQNTSREDSLSAVSD 905

Query: 919  VDNRLNLSEMQSENPNSISAKAVGVGKQGEKVAEKSDNSQVFATWIWKSFYSELNTVKNG 1098
            VDN L+L E+Q + P S V ++ E+ + + +Q F WIWKSFYS LN VK+G
Sbjct: 906  VDNNLSLIEVQKDVP---SGAVVFEMRKAEQQRHRWNLTQAFDRWIWKSFYSNLNAVKHG 962

Query: 1099 KKSYLDSLRRCDQCQDLYWRDEKHCRICHTTFELDFDLEEKYAVHSAICRSNIGVNKCRK 1278
            K+SY+DSL RC+ C DLYWRDEKHC++CHTTFELDFDLEE+YAVH+A CR N+ VNK +
Sbjct: 963  KRSYVDSLTRCEHCHDLYWRDEKHCKVCHTTFELDFDLEERYAVHTATCRGNLDVNKFPR 1022

Query: 1279 QRILTSQLQALKAAIYAIESAIPEDALFGSWKKSSHNLWVNRLRRVSSLREFLQVLADLM 1458
            ++L+SQLQ+LKAAI AIES +P D L SW KS+HNLWV RLRR S+L E LQV+ D +
Sbjct: 1023 HKVLSSQLQSLKAAICAIESVMPGDLLVDSWAKSAHNLWVKRLRRASTLAECLQVIGDFV 1082

Query: 1459 NAINEDWFYQNNVS-DSYFASEEIISNFSTTPQTYSAIALWLMKLDLLVASHVETGGSQD 1635
            +AINED FYQ + S +S E+I+S+F T PQT SA A WL+KLD L+A H+E SQ+
Sbjct: 1083 SAINEDCFYQCDDSVESNCVMEDILSSFPTMPQTSSAFAFWLVKLDELIAPHLERVKSQN 1142

Query: 1636 KAIIV 1650
            K ++
Sbjct: 1143 KLEVI 1147

>ref|XP_002524572.1| hypothetical protein RCOM_1211540 [Ricinus communis] gi|223536125|gb|EEF37780.1| hypothetical protein RCOM_1211540 [Ricinus communis]
Length = 1120

Score = 544 bits (1402), Expect = e-152
Identities = 286/503 (56%), Positives = 345/503 (68%), Gaps = 8/503 (1%)
Frame = +1

Query: 133  KTNSNILPVYNEIDESHPGESWLLGLMEGDYSDLSIEEKLIALATLIDLLHAGSSIRMED 312
            K S++L VYNEIDESHPGE WLLGL+EG+Y+DL IEEKL AL LIDLL AGSSIRMED
Sbjct: 585  KNKSHMLTVYNEIDESHPGEVWLLGLVEGEYADLCIEEKLNALVALIDLLSAGSSIRMED 644

Query: 313  PLTSIAECLPGTNHHGSGAKIKRSMVKQCN-------SIGILESCGGQTSSRCDMTPEPI 471
            E +P T H+GSGAKIKRS KQ N +G + + T T PI
Sbjct: 645  STRPTTESVPNTLHYGSGAKIKRSSSKQHNLPRPSWIHVGQINNA---TELHTSSTSRPI 701

Query: 472  DSLVIMSKIDEEEKHINMKKIAKQMEVEDLLHPMQSIFLGSDRRYNRYWLFLGPCDELDP 651
            DS V + K +E EK + ++ E+ LHPMQSIFLGSDRRYNRYWLFLGPC+ DP
Sbjct: 702  DSSVSILKFNEREKSSSKGNDTQETELGVNLHPMQSIFLGSDRRYNRYWLFLGPCNSHDP 761

Query: 652  GHRRIYFESSEDGHWEMIDTKEAXXXXXXXXXRRGAREARLLESLEKREAILS-GMMSNT 828
            GH+R+YFESSEDGHWE+IDT EA RG REA L+ESLEKRE L M S+
Sbjct: 762  GHKRVYFESSEDGHWEVIDTAEALRALLSVLDDRGTREALLIESLEKREGFLCLEMSSSI 821

Query: 829  PNDIENRQPTQSYQSELNTSREESSSPVSDVDNRLNLSEMQSENPNSISAKAVGVGKQGE 1008
            ND ENR T SEL RE+S+SPVSDVDN L+L+E+ +++ A + GK+ E
Sbjct: 822  ANDSENRHLTLPDHSELEIVREDSTSPVSDVDNNLSLNEVTNDSSPLCGAIILAAGKKEE 881

Query: 1009 KVAEKSDNSQVFATWIWKSFYSELNTVKNGKKSYLDSLRRCDQCQDLYWRDEKHCRICHT 1188
            +K Q F WIW FY +LN+VK K+SY +SL RC+ C DLYWRDEKHCR CHT
Sbjct: 882  DENQKWCRLQEFDAWIWNYFYCDLNSVKRSKRSYFESLARCETCHDLYWRDEKHCRFCHT 941

Query: 1189 TFELDFDLEEKYAVHSAICRSNIGVNKCRKQRILTSQLQALKAAIYAIESAIPEDALFGS 1368
            TFELDFDLEE+YA+HSA CR RK ++L+SQLQALKAA++AIESA+PEDAL G+
Sbjct: 942  TFELDFDLEERYAIHSATCRHKGDHEMLRKHKVLSSQLQALKAAVHAIESAMPEDALRGA 1001

Query: 1369 WKKSSHNLWVNRLRRVSSLREFLQVLADLMNAINEDWFYQNNVSDSYFASEEIISNFSTT 1548
            W KS+H LWV RLRR SS+ E LQV+AD + AINE+W QN+ DS EEII+ F T
Sbjct: 1002 WTKSAHRLWVKRLRRTSSVAELLQVVADFVAAINENWLCQNSAQDSNNYLEEIIACFPTM 1061

Query: 1549 PQTYSAIALWLMKLDLLVASHVE 1617
            PQT SA+ALWL+KLD L+ ++E
Sbjct: 1062 PQTSSALALWLVKLDDLICPYLE 1084

>ref|XP_002321223.1| predicted protein [Populus trichocarpa] gi|222861996|gb|EEE99538.1| predicted protein [Populus trichocarpa]
Length = 1152

Score = 539 bits (1388), Expect = e-150
Identities = 289/527 (54%), Positives = 359/527 (68%), Gaps = 9/527 (1%)
Frame = +1

Query: 64   GGNDANDSEYDSRDSG---LSLTVVDKTNSNILPVYNEIDESHPGESWLLGLMEGDYSDL 234
            G + +++S+ DS +S L L K + +L NEIDES PGE WLLGLMEG+YSDL
Sbjct: 624  GASSSSNSDCDSENSSPRNLKLIDYPKRKNKMLTFENEIDESRPGEVWLLGLMEGEYSDL 683

Query: 235  SIEEKLIALATLIDLLHAGSSIRMEDPLTSIAECLPGTNHHGSGAKIKRSMVKQCNSIGI 414
            SIEEKL L LIDL+ AGSSIR+ED E +P HH SGAKIKRS + N
Sbjct: 684  SIEEKLNGLVALIDLVSAGSSIRLEDLAKPTVESVPNIYHHCSGAKIKRSSSTKDNVPRP 743

Query: 415  LESCGGQTSSRCDMTPE----PIDSLVIMSKIDEEEKHINMKKIAKQMEVEDLLHPMQSI 582
            GQ + + P+DS V+ SK D ++K +K + M +E LHPMQSI
Sbjct: 744  SWVHAGQINVTKEAYTSSKFFPVDSSVLFSKFDGKDKLSGKEKETEGMGLEINLHPMQSI 803

Query: 583  FLGSDRRYNRYWLFLGPCDELDPGHRRIYFESSEDGHWEMIDTKEAXXXXXXXXXRRGAR 762
            FLGSDRRYNRYWLFLGPC+ DPGH+R+YFESSEDGHWE+IDT+EA RG R
Sbjct: 804  FLGSDRRYNRYWLFLGPCNSYDPGHKRVYFESSEDGHWEVIDTEEALRALLSVLDDRGRR 863

Query: 763  EARLLESLEKREAIL-SGMMSNTPNDIENRQPTQSYQSELNTSREESSSPVSDVDNRLNL 939
            EA L+ESLEKRE L M S ND TQS QSEL T RE+SSSPVSDVDN L L
Sbjct: 864  EALLIESLEKRETFLCQEMSSKMVNDSGVGYFTQSDQSELETVREDSSSPVSDVDNNLTL 923

Query: 940  SEMQSENPNSISAKAVGVGKQGEKVAEKSDNSQVFATWIWKSFYSELNTVKNGKKSYLDS 1119
            +++ +++ +SA + GK+G++ +K + + F TWIW FY +LN VK K+SYL+S
Sbjct: 924  TDIANDSLPPMSAIVLETGKKGKEENQKWNRLRQFDTWIWNCFYCDLNAVKRSKRSYLES 983

Query: 1120 LRRCDQCQDLYWRDEKHCRICHTTFELDFDLEEKYAVHSAICRSNIGVNKCRKQRILTSQ 1299
            LRRC+ C DLYWRDEKHC+ICHTTFELDFDLEE+YA+HSA CR C K ++L+S+
Sbjct: 984  LRRCETCHDLYWRDEKHCKICHTTFELDFDLEERYAIHSATCRQKEDNVMCPKHKVLSSK 1043

Query: 1300 LQALKAAIYAIESAIPEDALFGSWKKSSHNLWVNRLRRVSSLREFLQVLADLMNAINEDW 1479
            LQ+LKAA+YAIE+ +PEDAL G+W KS+H LWV RLRR SSL E LQV+AD + AINEDW
Sbjct: 1044 LQSLKAAVYAIETVMPEDALVGAWTKSAHRLWVRRLRRTSSLAELLQVVADFVAAINEDW 1103

Query: 1480 FYQNNVSD-SYFASEEIISNFSTTPQTYSAIALWLMKLDLLVASHVE 1617
            Q N++ S EEII+ F T PQT SA+ALWLMKLD L++ ++E
Sbjct: 1104 LCQCNLAQGSSTYMEEIITCFPTMPQTSSALALWLMKLDELISPYLE 1150