BLASTX nr result

ID: Scutellaria22_contig00016171 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00016171
         (2200 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ADY38784.1| sequence-specific DNA-binding transcription facto...   563   e-158
gb|ADZ55295.1| sequence-specific DNA binding protein [Coffea ara...   554   e-155
gb|ABZ89177.1| putative protein [Coffea canephora]                    554   e-155
ref|XP_002524572.1| hypothetical protein RCOM_1211540 [Ricinus c...   544   e-152
ref|XP_002321223.1| predicted protein [Populus trichocarpa] gi|2...   539   e-150

>gb|ADY38784.1| sequence-specific DNA-binding transcription factor [Coffea arabica]
          Length = 1116

 Score =  563 bits (1450), Expect = e-158
 Identities = 299/535 (55%), Positives = 372/535 (69%), Gaps = 6/535 (1%)
 Frame = +1

Query: 64   GGNDANDSEYDSRDSGLSLTVVDKT--NSNILPVYNEIDESHPGESWLLGLMEGDYSDLS 237
            G + A DSE ++R S  +     K   ++N+L V  EIDESHPGE WLLGLMEG+YSDLS
Sbjct: 577  GHSSAEDSECETRSSRSNKLRRRKNYMSNNMLTVSTEIDESHPGEVWLLGLMEGEYSDLS 636

Query: 238  IEEKLIALATLIDLLHAGSSIRMEDPLTSIAECLPGTNHHGSGAKIKRSMVKQCN---SI 408
            IEEKL AL  LIDL+ +GSS+R+EDP+ +I   +P    H +GAKIKRS  KQ N     
Sbjct: 637  IEEKLCALLALIDLVSSGSSVRLEDPVAAITTFVPNMTQHSTGAKIKRSTAKQYNFPRQA 696

Query: 409  GILESCGGQTSSRCDMTPEPIDSLVIMSKIDEEEKHINMKKIAKQMEVEDLLHPMQSIFL 588
            G      G+ +S   +   PIDSLV+MSK  E E+  +M+K  ++ME  + LHPMQSI+L
Sbjct: 697  GGYCGANGRDASSTSVL-NPIDSLVLMSKTSERERSCSMRKDNREMEASEDLHPMQSIYL 755

Query: 589  GSDRRYNRYWLFLGPCDELDPGHRRIYFESSEDGHWEMIDTKEAXXXXXXXXXRRGAREA 768
            GSDRRYNRYWLFLGPC+  DPGH+RIYFESSEDG+WE ID +EA         RRG REA
Sbjct: 756  GSDRRYNRYWLFLGPCNGSDPGHKRIYFESSEDGNWEFIDNEEALCSLVSSLDRRGQREA 815

Query: 769  RLLESLEKREAILSGMMSNTPNDIENRQPTQSYQSELNTSREESSSPVSDVDNRLNLSEM 948
             LL SLEKRE  L   MSN  ND    Q   S QS+ NTSRE+S S VSDVDN L+L E+
Sbjct: 816  FLLSSLEKRELYLCRAMSNVVNDAGIGQLNHSDQSDQNTSREDSLSAVSDVDNNLSLIEV 875

Query: 949  QSENPNSISAKAVGVGKQGEKVAEKSDNSQVFATWIWKSFYSELNTVKNGKKSYLDSLRR 1128
            Q + P   S   V   ++ E+   + + +Q F  WIWKSFYS LN VK+GK+SY+DSL R
Sbjct: 876  QKDVP---SGAVVFEMRKAEQQRHRWNLTQAFDRWIWKSFYSNLNAVKHGKRSYVDSLTR 932

Query: 1129 CDQCQDLYWRDEKHCRICHTTFELDFDLEEKYAVHSAICRSNIGVNKCRKQRILTSQLQA 1308
            C+ C DLYWRDEKHC++CHTTFELDFDLEE+YAVH+A CR N+ VNK  + ++L+SQLQ+
Sbjct: 933  CEHCHDLYWRDEKHCKVCHTTFELDFDLEERYAVHTATCRGNLDVNKFPRHKVLSSQLQS 992

Query: 1309 LKAAIYAIESAIPEDALFGSWKKSSHNLWVNRLRRVSSLREFLQVLADLMNAINEDWFYQ 1488
            LKAAI AIES +P D L  SW KS+HNLWV RLRR S+L E LQV+ D ++AINED FYQ
Sbjct: 993  LKAAICAIESVMPGDLLVDSWAKSAHNLWVKRLRRASTLAECLQVIGDFVSAINEDSFYQ 1052

Query: 1489 NNVS-DSYFASEEIISNFSTTPQTYSAIALWLMKLDLLVASHVETGGSQDKAIIV 1650
             + S +S    E+I+S+F T PQT SA A WL+KLD L+A H+E   SQ+K  ++
Sbjct: 1053 CDDSVESNCVMEDILSSFPTMPQTSSAFAFWLVKLDELIAPHLERVKSQNKLEVI 1107


>gb|ADZ55295.1| sequence-specific DNA binding protein [Coffea arabica]
          Length = 1156

 Score =  554 bits (1427), Expect = e-155
 Identities = 298/545 (54%), Positives = 372/545 (68%), Gaps = 16/545 (2%)
 Frame = +1

Query: 64   GGNDANDSEYDSRDSGLSLTVVDKT--NSNILPVYNEIDESHPGESWLLGLMEGDYSDLS 237
            G + A DSE ++R S  +     K   ++N+L V  EIDESHPGE WLLGLMEG+YSDLS
Sbjct: 607  GHSSAEDSECETRSSHSNKLRRRKNYMSNNMLTVSTEIDESHPGEVWLLGLMEGEYSDLS 666

Query: 238  IEEKLIALATLIDLLHAGSSIRME----------DPLTSIAECLPGTNHHGSGAKIKRSM 387
            IEEKL AL  LIDL+ +GSS+R+E          DP+ +I   +P    H +GAKIKRS 
Sbjct: 667  IEEKLCALLALIDLVSSGSSVRLEVVHLSFRRYKDPVAAITTFVPNMTQHSTGAKIKRST 726

Query: 388  VKQCN---SIGILESCGGQTSSRCDMTPEPIDSLVIMSKIDEEEKHINMKKIAKQMEVED 558
             KQ N     G      G+ ++   +   PIDSLV+MSK  E E+  +M+K  ++ME  +
Sbjct: 727  AKQYNFPRQAGGYCGANGRDATSTSVL-NPIDSLVLMSKTSERERSCSMRKDNREMEASE 785

Query: 559  LLHPMQSIFLGSDRRYNRYWLFLGPCDELDPGHRRIYFESSEDGHWEMIDTKEAXXXXXX 738
             LHPMQSI+LGSDRRYNRYWLFLGPC+  DPGH+RIYFESSEDG+WE ID +EA      
Sbjct: 786  DLHPMQSIYLGSDRRYNRYWLFLGPCNGSDPGHKRIYFESSEDGNWEFIDNEEALCSLVS 845

Query: 739  XXXRRGAREARLLESLEKREAILSGMMSNTPNDIENRQPTQSYQSELNTSREESSSPVSD 918
               RRG REA LL SLEKRE  L   MSN  ND    Q   S QS+ NTSRE+S S VSD
Sbjct: 846  SLDRRGQREAFLLSSLEKRELYLCRAMSNVVNDAGIGQLNHSDQSDQNTSREDSLSAVSD 905

Query: 919  VDNRLNLSEMQSENPNSISAKAVGVGKQGEKVAEKSDNSQVFATWIWKSFYSELNTVKNG 1098
            VDN L+L E+Q + P   S   V   ++ E+   + + +Q F  WIWKSFYS LN VK+G
Sbjct: 906  VDNNLSLIEVQKDVP---SGAVVFEMRKAEQQRHRWNLTQAFDRWIWKSFYSNLNAVKHG 962

Query: 1099 KKSYLDSLRRCDQCQDLYWRDEKHCRICHTTFELDFDLEEKYAVHSAICRSNIGVNKCRK 1278
            K+SY+DSL RC+ C DLYWRDEKHC++CHTTFELDFDLEE+YAVH+A CR N+ VNK  +
Sbjct: 963  KRSYVDSLTRCEHCHDLYWRDEKHCKVCHTTFELDFDLEERYAVHTATCRGNLDVNKFPR 1022

Query: 1279 QRILTSQLQALKAAIYAIESAIPEDALFGSWKKSSHNLWVNRLRRVSSLREFLQVLADLM 1458
             ++L+SQLQ+LKAAI AIES +P D L  SW KS+HNLWV RLRR S+L E LQV+ D +
Sbjct: 1023 HKVLSSQLQSLKAAICAIESVMPGDLLVDSWAKSAHNLWVKRLRRASTLAECLQVIGDFV 1082

Query: 1459 NAINEDWFYQNNVS-DSYFASEEIISNFSTTPQTYSAIALWLMKLDLLVASHVETGGSQD 1635
            +AINED FYQ + S +S    E+I+S+F T PQT SA A WL+KLD L+A H+E   SQ+
Sbjct: 1083 SAINEDCFYQCDDSVESNCVMEDILSSFPTMPQTSSAFAFWLVKLDELIAPHLERVKSQN 1142

Query: 1636 KAIIV 1650
            K  ++
Sbjct: 1143 KLEVI 1147


>gb|ABZ89177.1| putative protein [Coffea canephora]
          Length = 1156

 Score =  554 bits (1427), Expect = e-155
 Identities = 298/545 (54%), Positives = 372/545 (68%), Gaps = 16/545 (2%)
 Frame = +1

Query: 64   GGNDANDSEYDSRDSGLSLTVVDKT--NSNILPVYNEIDESHPGESWLLGLMEGDYSDLS 237
            G + A DSE ++R S  +     K   ++N+L V  EIDESHPGE WLLGLMEG+YSDLS
Sbjct: 607  GHSSAEDSECETRSSHSNKLRRRKNYMSNNMLTVSTEIDESHPGEVWLLGLMEGEYSDLS 666

Query: 238  IEEKLIALATLIDLLHAGSSIRME----------DPLTSIAECLPGTNHHGSGAKIKRSM 387
            IEEKL AL  LIDL+ +GSS+R+E          DP+ +I   +P    H +GAKIKRS 
Sbjct: 667  IEEKLCALLALIDLVSSGSSVRLEVVHLSFRRYKDPVAAITTFVPNMTQHSTGAKIKRST 726

Query: 388  VKQCN---SIGILESCGGQTSSRCDMTPEPIDSLVIMSKIDEEEKHINMKKIAKQMEVED 558
             KQ N     G      G+ ++   +   PIDSLV+MSK  E E+  +M+K  ++ME  +
Sbjct: 727  AKQYNFPRQAGGYCGANGRDATSTSVL-NPIDSLVLMSKTSERERSCSMRKDNREMEASE 785

Query: 559  LLHPMQSIFLGSDRRYNRYWLFLGPCDELDPGHRRIYFESSEDGHWEMIDTKEAXXXXXX 738
             LHPMQSI+LGSDRRYNRYWLFLGPC+  DPGH+RIYFESSEDG+WE ID +EA      
Sbjct: 786  DLHPMQSIYLGSDRRYNRYWLFLGPCNGSDPGHKRIYFESSEDGNWEFIDNEEALCSLVS 845

Query: 739  XXXRRGAREARLLESLEKREAILSGMMSNTPNDIENRQPTQSYQSELNTSREESSSPVSD 918
               RRG REA LL SLEKRE  L   MSN  ND    Q   S QS+ NTSRE+S S VSD
Sbjct: 846  SLDRRGQREAFLLSSLEKRELYLCRAMSNVVNDAGIGQLNHSDQSDQNTSREDSLSAVSD 905

Query: 919  VDNRLNLSEMQSENPNSISAKAVGVGKQGEKVAEKSDNSQVFATWIWKSFYSELNTVKNG 1098
            VDN L+L E+Q + P   S   V   ++ E+   + + +Q F  WIWKSFYS LN VK+G
Sbjct: 906  VDNNLSLIEVQKDVP---SGAVVFEMRKAEQQRHRWNLTQAFDRWIWKSFYSNLNAVKHG 962

Query: 1099 KKSYLDSLRRCDQCQDLYWRDEKHCRICHTTFELDFDLEEKYAVHSAICRSNIGVNKCRK 1278
            K+SY+DSL RC+ C DLYWRDEKHC++CHTTFELDFDLEE+YAVH+A CR N+ VNK  +
Sbjct: 963  KRSYVDSLTRCEHCHDLYWRDEKHCKVCHTTFELDFDLEERYAVHTATCRGNLDVNKFPR 1022

Query: 1279 QRILTSQLQALKAAIYAIESAIPEDALFGSWKKSSHNLWVNRLRRVSSLREFLQVLADLM 1458
             ++L+SQLQ+LKAAI AIES +P D L  SW KS+HNLWV RLRR S+L E LQV+ D +
Sbjct: 1023 HKVLSSQLQSLKAAICAIESVMPGDLLVDSWAKSAHNLWVKRLRRASTLAECLQVIGDFV 1082

Query: 1459 NAINEDWFYQNNVS-DSYFASEEIISNFSTTPQTYSAIALWLMKLDLLVASHVETGGSQD 1635
            +AINED FYQ + S +S    E+I+S+F T PQT SA A WL+KLD L+A H+E   SQ+
Sbjct: 1083 SAINEDCFYQCDDSVESNCVMEDILSSFPTMPQTSSAFAFWLVKLDELIAPHLERVKSQN 1142

Query: 1636 KAIIV 1650
            K  ++
Sbjct: 1143 KLEVI 1147


>ref|XP_002524572.1| hypothetical protein RCOM_1211540 [Ricinus communis]
            gi|223536125|gb|EEF37780.1| hypothetical protein
            RCOM_1211540 [Ricinus communis]
          Length = 1120

 Score =  544 bits (1402), Expect = e-152
 Identities = 286/503 (56%), Positives = 345/503 (68%), Gaps = 8/503 (1%)
 Frame = +1

Query: 133  KTNSNILPVYNEIDESHPGESWLLGLMEGDYSDLSIEEKLIALATLIDLLHAGSSIRMED 312
            K  S++L VYNEIDESHPGE WLLGL+EG+Y+DL IEEKL AL  LIDLL AGSSIRMED
Sbjct: 585  KNKSHMLTVYNEIDESHPGEVWLLGLVEGEYADLCIEEKLNALVALIDLLSAGSSIRMED 644

Query: 313  PLTSIAECLPGTNHHGSGAKIKRSMVKQCN-------SIGILESCGGQTSSRCDMTPEPI 471
                  E +P T H+GSGAKIKRS  KQ N        +G + +    T      T  PI
Sbjct: 645  STRPTTESVPNTLHYGSGAKIKRSSSKQHNLPRPSWIHVGQINNA---TELHTSSTSRPI 701

Query: 472  DSLVIMSKIDEEEKHINMKKIAKQMEVEDLLHPMQSIFLGSDRRYNRYWLFLGPCDELDP 651
            DS V + K +E EK  +     ++ E+   LHPMQSIFLGSDRRYNRYWLFLGPC+  DP
Sbjct: 702  DSSVSILKFNEREKSSSKGNDTQETELGVNLHPMQSIFLGSDRRYNRYWLFLGPCNSHDP 761

Query: 652  GHRRIYFESSEDGHWEMIDTKEAXXXXXXXXXRRGAREARLLESLEKREAILS-GMMSNT 828
            GH+R+YFESSEDGHWE+IDT EA          RG REA L+ESLEKRE  L   M S+ 
Sbjct: 762  GHKRVYFESSEDGHWEVIDTAEALRALLSVLDDRGTREALLIESLEKREGFLCLEMSSSI 821

Query: 829  PNDIENRQPTQSYQSELNTSREESSSPVSDVDNRLNLSEMQSENPNSISAKAVGVGKQGE 1008
             ND ENR  T    SEL   RE+S+SPVSDVDN L+L+E+ +++     A  +  GK+ E
Sbjct: 822  ANDSENRHLTLPDHSELEIVREDSTSPVSDVDNNLSLNEVTNDSSPLCGAIILAAGKKEE 881

Query: 1009 KVAEKSDNSQVFATWIWKSFYSELNTVKNGKKSYLDSLRRCDQCQDLYWRDEKHCRICHT 1188
               +K    Q F  WIW  FY +LN+VK  K+SY +SL RC+ C DLYWRDEKHCR CHT
Sbjct: 882  DENQKWCRLQEFDAWIWNYFYCDLNSVKRSKRSYFESLARCETCHDLYWRDEKHCRFCHT 941

Query: 1189 TFELDFDLEEKYAVHSAICRSNIGVNKCRKQRILTSQLQALKAAIYAIESAIPEDALFGS 1368
            TFELDFDLEE+YA+HSA CR        RK ++L+SQLQALKAA++AIESA+PEDAL G+
Sbjct: 942  TFELDFDLEERYAIHSATCRHKGDHEMLRKHKVLSSQLQALKAAVHAIESAMPEDALRGA 1001

Query: 1369 WKKSSHNLWVNRLRRVSSLREFLQVLADLMNAINEDWFYQNNVSDSYFASEEIISNFSTT 1548
            W KS+H LWV RLRR SS+ E LQV+AD + AINE+W  QN+  DS    EEII+ F T 
Sbjct: 1002 WTKSAHRLWVKRLRRTSSVAELLQVVADFVAAINENWLCQNSAQDSNNYLEEIIACFPTM 1061

Query: 1549 PQTYSAIALWLMKLDLLVASHVE 1617
            PQT SA+ALWL+KLD L+  ++E
Sbjct: 1062 PQTSSALALWLVKLDDLICPYLE 1084


>ref|XP_002321223.1| predicted protein [Populus trichocarpa] gi|222861996|gb|EEE99538.1|
            predicted protein [Populus trichocarpa]
          Length = 1152

 Score =  539 bits (1388), Expect = e-150
 Identities = 289/527 (54%), Positives = 359/527 (68%), Gaps = 9/527 (1%)
 Frame = +1

Query: 64   GGNDANDSEYDSRDSG---LSLTVVDKTNSNILPVYNEIDESHPGESWLLGLMEGDYSDL 234
            G + +++S+ DS +S    L L    K  + +L   NEIDES PGE WLLGLMEG+YSDL
Sbjct: 624  GASSSSNSDCDSENSSPRNLKLIDYPKRKNKMLTFENEIDESRPGEVWLLGLMEGEYSDL 683

Query: 235  SIEEKLIALATLIDLLHAGSSIRMEDPLTSIAECLPGTNHHGSGAKIKRSMVKQCNSIGI 414
            SIEEKL  L  LIDL+ AGSSIR+ED      E +P   HH SGAKIKRS   + N    
Sbjct: 684  SIEEKLNGLVALIDLVSAGSSIRLEDLAKPTVESVPNIYHHCSGAKIKRSSSTKDNVPRP 743

Query: 415  LESCGGQTSSRCDMTPE----PIDSLVIMSKIDEEEKHINMKKIAKQMEVEDLLHPMQSI 582
                 GQ +   +        P+DS V+ SK D ++K    +K  + M +E  LHPMQSI
Sbjct: 744  SWVHAGQINVTKEAYTSSKFFPVDSSVLFSKFDGKDKLSGKEKETEGMGLEINLHPMQSI 803

Query: 583  FLGSDRRYNRYWLFLGPCDELDPGHRRIYFESSEDGHWEMIDTKEAXXXXXXXXXRRGAR 762
            FLGSDRRYNRYWLFLGPC+  DPGH+R+YFESSEDGHWE+IDT+EA          RG R
Sbjct: 804  FLGSDRRYNRYWLFLGPCNSYDPGHKRVYFESSEDGHWEVIDTEEALRALLSVLDDRGRR 863

Query: 763  EARLLESLEKREAIL-SGMMSNTPNDIENRQPTQSYQSELNTSREESSSPVSDVDNRLNL 939
            EA L+ESLEKRE  L   M S   ND      TQS QSEL T RE+SSSPVSDVDN L L
Sbjct: 864  EALLIESLEKRETFLCQEMSSKMVNDSGVGYFTQSDQSELETVREDSSSPVSDVDNNLTL 923

Query: 940  SEMQSENPNSISAKAVGVGKQGEKVAEKSDNSQVFATWIWKSFYSELNTVKNGKKSYLDS 1119
            +++ +++   +SA  +  GK+G++  +K +  + F TWIW  FY +LN VK  K+SYL+S
Sbjct: 924  TDIANDSLPPMSAIVLETGKKGKEENQKWNRLRQFDTWIWNCFYCDLNAVKRSKRSYLES 983

Query: 1120 LRRCDQCQDLYWRDEKHCRICHTTFELDFDLEEKYAVHSAICRSNIGVNKCRKQRILTSQ 1299
            LRRC+ C DLYWRDEKHC+ICHTTFELDFDLEE+YA+HSA CR       C K ++L+S+
Sbjct: 984  LRRCETCHDLYWRDEKHCKICHTTFELDFDLEERYAIHSATCRQKEDNVMCPKHKVLSSK 1043

Query: 1300 LQALKAAIYAIESAIPEDALFGSWKKSSHNLWVNRLRRVSSLREFLQVLADLMNAINEDW 1479
            LQ+LKAA+YAIE+ +PEDAL G+W KS+H LWV RLRR SSL E LQV+AD + AINEDW
Sbjct: 1044 LQSLKAAVYAIETVMPEDALVGAWTKSAHRLWVRRLRRTSSLAELLQVVADFVAAINEDW 1103

Query: 1480 FYQNNVSD-SYFASEEIISNFSTTPQTYSAIALWLMKLDLLVASHVE 1617
              Q N++  S    EEII+ F T PQT SA+ALWLMKLD L++ ++E
Sbjct: 1104 LCQCNLAQGSSTYMEEIITCFPTMPQTSSALALWLMKLDELISPYLE 1150


Top