BLASTX nr result

ID: Akebia24_contig00029046 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00029046
         (355 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002312257.1| SET domain-containing family protein [Populu...   171   1e-40
ref|XP_007046030.1| SET domain protein [Theobroma cacao] gi|5087...   169   4e-40
ref|XP_004135447.1| PREDICTED: uncharacterized protein LOC101202...   164   9e-39
gb|EXC20007.1| SET and MYND domain-containing protein [Morus not...   161   8e-38
ref|XP_007222619.1| hypothetical protein PRUPE_ppa004170mg [Prun...   161   1e-37
ref|XP_002522468.1| protein with unknown function [Ricinus commu...   159   4e-37
emb|CAN74411.1| hypothetical protein VITISV_025097 [Vitis vinifera]   159   4e-37
ref|XP_002265832.1| PREDICTED: uncharacterized protein LOC100253...   158   9e-37
ref|XP_002890643.1| predicted protein [Arabidopsis lyrata subsp....   157   1e-36
ref|XP_004505410.1| PREDICTED: uncharacterized protein LOC101509...   157   1e-36
ref|XP_006303626.1| hypothetical protein CARUB_v10011502mg [Caps...   155   7e-36
ref|XP_006438729.1| hypothetical protein CICLE_v10031048mg [Citr...   154   1e-35
ref|XP_006483694.1| PREDICTED: uncharacterized protein LOC102616...   154   1e-35
ref|XP_006415954.1| hypothetical protein EUTSA_v10007288mg [Eutr...   154   1e-35
gb|AAF87042.1|AC006535_20 T24P13.14 [Arabidopsis thaliana]            152   5e-35
ref|NP_173998.2| SET domain protein 35 [Arabidopsis thaliana] gi...   152   5e-35
ref|XP_004298417.1| PREDICTED: uncharacterized protein LOC101301...   151   1e-34
ref|XP_006355157.1| PREDICTED: uncharacterized protein LOC102591...   144   1e-32
ref|XP_003607750.1| SET domain protein [Medicago truncatula] gi|...   144   1e-32
ref|XP_004238809.1| PREDICTED: uncharacterized protein LOC101244...   142   5e-32

>ref|XP_002312257.1| SET domain-containing family protein [Populus trichocarpa]
           gi|222852077|gb|EEE89624.1| SET domain-containing family
           protein [Populus trichocarpa]
          Length = 542

 Score =  171 bits (432), Expect = 1e-40
 Identities = 77/102 (75%), Positives = 96/102 (94%)
 Frame = -3

Query: 350 SRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGNS 171
           SRLR+ + AL+DCD+ALK++STHFK+L+CKGKILL+LN+Y+MALDCFK A+ DPQA+GN 
Sbjct: 84  SRLRDLTGALKDCDQALKIESTHFKSLVCKGKILLSLNRYSMALDCFKTAVLDPQASGNL 143

Query: 170 ETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
           ETL+GY+ +CKKLEFQSRTG+FDLSDW+L+GFRGKSPELAEY
Sbjct: 144 ETLNGYVQKCKKLEFQSRTGAFDLSDWILSGFRGKSPELAEY 185


>ref|XP_007046030.1| SET domain protein [Theobroma cacao] gi|508709965|gb|EOY01862.1|
           SET domain protein [Theobroma cacao]
          Length = 539

 Score =  169 bits (428), Expect = 4e-40
 Identities = 77/102 (75%), Positives = 92/102 (90%)
 Frame = -3

Query: 350 SRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGNS 171
           SRL++++EAL+DCD AL++++THFKTLLCKGKILL+LN+Y  ALDCFK AL DPQ NG  
Sbjct: 81  SRLQDFTEALQDCDRALQIEATHFKTLLCKGKILLSLNRYAHALDCFKAALFDPQGNGKL 140

Query: 170 ETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
           E L+GYL++CKKLEFQSRTGSFDLSDWVLNGFRGK PEL+EY
Sbjct: 141 EILNGYLEKCKKLEFQSRTGSFDLSDWVLNGFRGKPPELSEY 182


>ref|XP_004135447.1| PREDICTED: uncharacterized protein LOC101202892 [Cucumis sativus]
           gi|449522881|ref|XP_004168454.1| PREDICTED:
           uncharacterized protein LOC101228219 [Cucumis sativus]
          Length = 540

 Score =  164 bits (416), Expect = 9e-39
 Identities = 75/103 (72%), Positives = 92/103 (89%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +S+LR + EAL DC+EALK++STHFKTLLCKGKILLNLN+Y+ AL+CFK AL DPQ +GN
Sbjct: 80  RSKLRIFEEALRDCEEALKIESTHFKTLLCKGKILLNLNRYSSALECFKTALFDPQVSGN 139

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
           SE L+GY+++CKKLE  S+TG+FDLSDWVLNGFRGKSP LAE+
Sbjct: 140 SENLNGYVEKCKKLEHLSKTGAFDLSDWVLNGFRGKSPGLAEF 182


>gb|EXC20007.1| SET and MYND domain-containing protein [Morus notabilis]
          Length = 536

 Score =  161 bits (408), Expect = 8e-38
 Identities = 76/102 (74%), Positives = 89/102 (87%), Gaps = 1/102 (0%)
 Frame = -3

Query: 347 RLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQ-ANGNS 171
           RLR +  ALEDCDEALK++STHFKTLLCKGKIL+NLN+Y+MALDCF+ A  DPQ  NG S
Sbjct: 74  RLREFGAALEDCDEALKIESTHFKTLLCKGKILMNLNRYSMALDCFRTAHLDPQVCNGGS 133

Query: 170 ETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
           E+L+GYL+RCKK+EF SRTG+FDLSDWVL+ F GK PELAEY
Sbjct: 134 ESLNGYLERCKKMEFLSRTGAFDLSDWVLSRFHGKPPELAEY 175


>ref|XP_007222619.1| hypothetical protein PRUPE_ppa004170mg [Prunus persica]
           gi|462419555|gb|EMJ23818.1| hypothetical protein
           PRUPE_ppa004170mg [Prunus persica]
          Length = 525

 Score =  161 bits (407), Expect = 1e-37
 Identities = 74/103 (71%), Positives = 92/103 (89%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +SRLR+++EAL DCD+ALK++STHFKTLLCKGKILLNL++Y+MAL+CFK A  DPQANG+
Sbjct: 68  RSRLRDFAEALRDCDQALKIESTHFKTLLCKGKILLNLSRYSMALECFKTAQLDPQANGS 127

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
           S +L+GYL +CKKLE  SRTG+FDLS+WV+NGFRGK  E AEY
Sbjct: 128 SVSLNGYLQKCKKLELMSRTGAFDLSEWVVNGFRGKPLEPAEY 170


>ref|XP_002522468.1| protein with unknown function [Ricinus communis]
           gi|223538353|gb|EEF39960.1| protein with unknown
           function [Ricinus communis]
          Length = 538

 Score =  159 bits (402), Expect = 4e-37
 Identities = 72/103 (69%), Positives = 91/103 (88%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +SRL+++ +AL+DCD+ALK++STHFK+L+CKGKILL LN+Y++ALDCFK AL D Q NGN
Sbjct: 82  RSRLKDFDKALQDCDQALKIESTHFKSLICKGKILLCLNRYSVALDCFKTALLDQQDNGN 141

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
            E ++GY+++CKKLEFQSRTG  DLSDWV NGFRGK PELAEY
Sbjct: 142 LEIVNGYVEKCKKLEFQSRTGVLDLSDWVQNGFRGKLPELAEY 184


>emb|CAN74411.1| hypothetical protein VITISV_025097 [Vitis vinifera]
          Length = 588

 Score =  159 bits (402), Expect = 4e-37
 Identities = 75/103 (72%), Positives = 87/103 (84%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +SRLR+ + AL+DCD ALK++ THFKTLLCKGKILL LN+Y++ALDCFK AL DPQA   
Sbjct: 130 RSRLRDLANALQDCDGALKIECTHFKTLLCKGKILLGLNRYSLALDCFKAALLDPQAGLK 189

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
              L GYL+RCKKLE QSRTG+FDLSDWV+NGFRGK PELAEY
Sbjct: 190 CGALEGYLERCKKLEHQSRTGAFDLSDWVVNGFRGKFPELAEY 232


>ref|XP_002265832.1| PREDICTED: uncharacterized protein LOC100253788 [Vitis vinifera]
          Length = 550

 Score =  158 bits (399), Expect = 9e-37
 Identities = 74/103 (71%), Positives = 87/103 (84%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +SRLR+ + AL+DCD AL+++ THFKTLLCKGKILL LN+Y++ALDCFK AL DPQA   
Sbjct: 92  RSRLRDLANALQDCDGALEIEGTHFKTLLCKGKILLGLNRYSLALDCFKAALLDPQAGLK 151

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
              L GYL+RCKKLE QSRTG+FDLSDWV+NGFRGK PELAEY
Sbjct: 152 CGALEGYLERCKKLEHQSRTGAFDLSDWVVNGFRGKFPELAEY 194


>ref|XP_002890643.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297336485|gb|EFH66902.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 976

 Score =  157 bits (398), Expect = 1e-36
 Identities = 67/103 (65%), Positives = 93/103 (90%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           ++RLR++ EA+ DCD+AL+++ THFKTLLCKGK+LL L+KY++AL+CFK AL DPQA+ N
Sbjct: 515 RARLRDFLEAMRDCDQALEIEKTHFKTLLCKGKVLLGLSKYSLALECFKTALLDPQASDN 574

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
            ET+ GY+++CKKLEFQ++TG+FDLSDW+L+GFRG+ PELAE+
Sbjct: 575 FETVTGYMEKCKKLEFQAKTGAFDLSDWILSGFRGRCPELAEF 617


>ref|XP_004505410.1| PREDICTED: uncharacterized protein LOC101509103 [Cicer arietinum]
          Length = 539

 Score =  157 bits (397), Expect = 1e-36
 Identities = 70/103 (67%), Positives = 88/103 (85%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +SRLR+++ AL+DCD AL++ +THFKTL+CKGK+LL+LN+Y+MAL CFK AL DPQANGN
Sbjct: 71  KSRLRDFNSALQDCDHALQIDATHFKTLVCKGKVLLSLNRYSMALHCFKTALLDPQANGN 130

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
            E + GY ++CKK EF SRTGS DLSDWVLNGF  K+PELAE+
Sbjct: 131 CEFIDGYFEKCKKFEFLSRTGSLDLSDWVLNGFSAKAPELAEF 173


>ref|XP_006303626.1| hypothetical protein CARUB_v10011502mg [Capsella rubella]
           gi|482572337|gb|EOA36524.1| hypothetical protein
           CARUB_v10011502mg [Capsella rubella]
          Length = 546

 Score =  155 bits (391), Expect = 7e-36
 Identities = 67/103 (65%), Positives = 92/103 (89%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           ++RLR+Y EA++DCD+AL+++ THFKTLLCKGK+LL L+KY+ AL+CFK AL DPQA+ N
Sbjct: 84  RARLRHYLEAMKDCDQALEIEKTHFKTLLCKGKVLLGLSKYSSALECFKTALLDPQASDN 143

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
            ET+  Y+++CKKLEFQ++TG+FDLSDW+L+GFRG+ PELAE+
Sbjct: 144 LETVTVYMEKCKKLEFQAKTGAFDLSDWILSGFRGRCPELAEF 186


>ref|XP_006438729.1| hypothetical protein CICLE_v10031048mg [Citrus clementina]
           gi|557540925|gb|ESR51969.1| hypothetical protein
           CICLE_v10031048mg [Citrus clementina]
          Length = 586

 Score =  154 bits (390), Expect = 1e-35
 Identities = 69/103 (66%), Positives = 91/103 (88%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +SRLR++ +AL DC++ALK++S+HFK LLCKGKILL+LN+Y+MALDCFK  L D QA+G+
Sbjct: 133 RSRLRDFDDALRDCEQALKIESSHFKALLCKGKILLSLNRYSMALDCFKETLVDAQASGS 192

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
            ET++G+L++ KKLE+QSRTG+ DLSDW+LNG RGK PELAEY
Sbjct: 193 LETVNGFLEKSKKLEYQSRTGALDLSDWILNGLRGKCPELAEY 235


>ref|XP_006483694.1| PREDICTED: uncharacterized protein LOC102616313 [Citrus sinensis]
          Length = 531

 Score =  154 bits (389), Expect = 1e-35
 Identities = 68/103 (66%), Positives = 91/103 (88%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +SRLR++ +AL DC++ALK++S+HFK LLCKGK+LL+LN+Y+MALDCFK  L D QA+G+
Sbjct: 78  RSRLRDFDDALRDCEQALKIESSHFKALLCKGKVLLSLNRYSMALDCFKETLVDAQASGS 137

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
            ET++G+L++ KKLE+QSRTG+ DLSDW+LNG RGK PELAEY
Sbjct: 138 LETVNGFLEKSKKLEYQSRTGALDLSDWILNGLRGKCPELAEY 180


>ref|XP_006415954.1| hypothetical protein EUTSA_v10007288mg [Eutrema salsugineum]
           gi|557093725|gb|ESQ34307.1| hypothetical protein
           EUTSA_v10007288mg [Eutrema salsugineum]
          Length = 546

 Score =  154 bits (389), Expect = 1e-35
 Identities = 66/103 (64%), Positives = 92/103 (89%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +++LR++ EA+ DCD+AL+++STHFKTLLCKGK+LL L+KY  AL+CFK AL DPQA+ N
Sbjct: 86  RAKLRDFVEAMMDCDQALEIESTHFKTLLCKGKVLLGLSKYASALECFKTALLDPQASDN 145

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
            ET+ GY+++CK+LE Q++TG+FDLSDW+L+GFRG+SPELAE+
Sbjct: 146 LETVTGYMEKCKRLELQAKTGAFDLSDWILSGFRGRSPELAEF 188


>gb|AAF87042.1|AC006535_20 T24P13.14 [Arabidopsis thaliana]
          Length = 969

 Score =  152 bits (384), Expect = 5e-35
 Identities = 66/103 (64%), Positives = 91/103 (88%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           ++RLR++ EA+ DCD+AL+++ THFKTLLCKGK+LL L+KY++AL+CFK AL DPQA+ N
Sbjct: 508 RARLRDFLEAMRDCDQALEIEKTHFKTLLCKGKVLLGLSKYSLALECFKTALLDPQASDN 567

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
            ET+  Y+++CKKLEFQ++TG+FDLSDW+L+ FRGK PELAE+
Sbjct: 568 LETVTVYIEKCKKLEFQAKTGAFDLSDWILSEFRGKCPELAEF 610


>ref|NP_173998.2| SET domain protein 35 [Arabidopsis thaliana]
           gi|332192607|gb|AEE30728.1| SET domain protein 35
           [Arabidopsis thaliana]
          Length = 545

 Score =  152 bits (384), Expect = 5e-35
 Identities = 66/103 (64%), Positives = 91/103 (88%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           ++RLR++ EA+ DCD+AL+++ THFKTLLCKGK+LL L+KY++AL+CFK AL DPQA+ N
Sbjct: 84  RARLRDFLEAMRDCDQALEIEKTHFKTLLCKGKVLLGLSKYSLALECFKTALLDPQASDN 143

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
            ET+  Y+++CKKLEFQ++TG+FDLSDW+L+ FRGK PELAE+
Sbjct: 144 LETVTVYIEKCKKLEFQAKTGAFDLSDWILSEFRGKCPELAEF 186


>ref|XP_004298417.1| PREDICTED: uncharacterized protein LOC101301002 [Fragaria vesca
           subsp. vesca]
          Length = 521

 Score =  151 bits (381), Expect = 1e-34
 Identities = 69/103 (66%), Positives = 87/103 (84%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +SRLR+Y+ AL+DCDEALK++S HFKTL+CKGKILLNLN+Y+MAL CF+ A  DPQANG+
Sbjct: 61  RSRLRDYANALKDCDEALKIESAHFKTLVCKGKILLNLNRYSMALSCFRAAQLDPQANGS 120

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
           S  L+ YL +CKKLE  S+TG +DLS+WV++GFR K PE AEY
Sbjct: 121 SVGLNEYLQKCKKLELMSKTGVYDLSEWVVSGFRAKPPEPAEY 163


>ref|XP_006355157.1| PREDICTED: uncharacterized protein LOC102591692 [Solanum tuberosum]
          Length = 536

 Score =  144 bits (364), Expect = 1e-32
 Identities = 67/100 (67%), Positives = 83/100 (83%)
 Frame = -3

Query: 344 LRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGNSET 165
           L++Y +AL+DC+EA ++ +THFKTLLCKGKILL+LN+Y +ALDCFK A  DP    NSE 
Sbjct: 75  LQDYPQALQDCNEASQIGNTHFKTLLCKGKILLSLNQYGLALDCFKKASLDPNELENSEM 134

Query: 164 LHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
           L GYL++CKK EF SRTG+FD+SDWVLN F+GK PELAEY
Sbjct: 135 LDGYLEKCKKFEFLSRTGAFDISDWVLNKFQGKPPELAEY 174


>ref|XP_003607750.1| SET domain protein [Medicago truncatula]
           gi|355508805|gb|AES89947.1| SET domain protein [Medicago
           truncatula]
          Length = 540

 Score =  144 bits (364), Expect = 1e-32
 Identities = 65/103 (63%), Positives = 87/103 (84%)
 Frame = -3

Query: 353 QSRLRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGN 174
           +S+LR ++ ALEDCD AL++ +THFK+L+CKGKILL LN+Y+MAL+CFK A+   QA+GN
Sbjct: 75  KSKLREFNSALEDCDHALQIDATHFKSLVCKGKILLCLNRYSMALNCFKTAMLGNQASGN 134

Query: 173 SETLHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
            E L G++++CKK EF SR+G+ DLSDWVLNGF GK+PELAE+
Sbjct: 135 CEMLVGFVEKCKKFEFLSRSGTMDLSDWVLNGFPGKAPELAEF 177


>ref|XP_004238809.1| PREDICTED: uncharacterized protein LOC101244286 [Solanum
           lycopersicum]
          Length = 532

 Score =  142 bits (358), Expect = 5e-32
 Identities = 66/100 (66%), Positives = 83/100 (83%)
 Frame = -3

Query: 344 LRNYSEALEDCDEALKLQSTHFKTLLCKGKILLNLNKYTMALDCFKLALSDPQANGNSET 165
           L++Y +AL DC+EA ++ +THFKTLLCKGKILL+LN+Y +ALDCFK A  DP    NSE 
Sbjct: 75  LQDYPQALLDCNEASQIGNTHFKTLLCKGKILLSLNQYGLALDCFKKASLDPNELENSEM 134

Query: 164 LHGYLDRCKKLEFQSRTGSFDLSDWVLNGFRGKSPELAEY 45
           L+GYL++C+K EF SRTG+FD+SDWVLN F+GK PELAEY
Sbjct: 135 LNGYLEKCRKFEFLSRTGAFDISDWVLNKFQGKPPELAEY 174


Top