BLASTX nr result

ID: Ephedra28_contig00003050 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00003050
         (826 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABR16297.1| unknown [Picea sitchensis]                             121   3e-25
ref|XP_006847162.1| hypothetical protein AMTR_s00017p00243480 [A...   100   5e-19
gb|EXC03893.1| hypothetical protein L484_016097 [Morus notabilis]      97   5e-18
gb|ESW30273.1| hypothetical protein PHAVU_002G139100g [Phaseolus...    97   9e-18
ref|XP_006855453.1| hypothetical protein AMTR_s00057p00175710 [A...    93   1e-16
ref|XP_002265425.2| PREDICTED: uncharacterized protein LOC100241...    93   1e-16
ref|XP_004172356.1| PREDICTED: uncharacterized LOC101205743 [Cuc...    92   3e-16
ref|XP_004143538.1| PREDICTED: uncharacterized protein LOC101205...    92   3e-16
ref|XP_006296087.1| hypothetical protein CARUB_v10025237mg [Caps...    89   2e-15
gb|EMJ08612.1| hypothetical protein PRUPE_ppa014640mg [Prunus pe...    88   3e-15
ref|NP_973512.1| SPT2 chromatin protein [Arabidopsis thaliana] g...    87   5e-15
ref|NP_973513.1| SPT2 chromatin protein [Arabidopsis thaliana] g...    87   5e-15
gb|ESW28425.1| hypothetical protein PHAVU_003G285300g [Phaseolus...    87   7e-15
ref|XP_004288305.1| PREDICTED: uncharacterized protein LOC101303...    87   9e-15
gb|EOX91834.1| SPT2 chromatin protein, putative isoform 2 [Theob...    86   2e-14
gb|EOX91833.1| SPT2 chromatin protein, putative isoform 1 [Theob...    86   2e-14
gb|AFK39413.1| unknown [Lotus japonicus]                               85   3e-14
ref|XP_004512539.1| PREDICTED: protein SPT2 homolog [Cicer ariet...    85   4e-14
gb|EOX91835.1| Mitochondrial isoform 3 [Theobroma cacao]               84   5e-14
ref|XP_006380812.1| hypothetical protein POPTR_0007s14450g [Popu...    84   6e-14

>gb|ABR16297.1| unknown [Picea sitchensis]
          Length = 513

 Score =  121 bits (303), Expect = 3e-25
 Identities = 102/289 (35%), Positives = 148/289 (51%), Gaps = 15/289 (5%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVRKLP-NNNFGSFFGPSQPVVARRIIEENRARQEASHLAKK 648
           RQKL+++L +       N  +KLP ++++GSFFGPS+ VVARR+I E RAR EAS +A K
Sbjct: 81  RQKLKQKLSDQTLVEKDNR-KKLPYDDDYGSFFGPSEIVVARRVIVETRARNEASCIAPK 139

Query: 647 TFKSS-KQPDSANVDSTVKDKEPVRRKVPVDTAKLKAQKLKEIRDYSFLFSEGAD----- 486
             K   K+      D+  + KEP  RKV VD  K KAQ+LK IRDYSFLFS+ A+     
Sbjct: 140 VSKEDPKRVSEPESDAKFEVKEPPPRKV-VDETKTKAQQLKAIRDYSFLFSDNAEIPVPD 198

Query: 485 -SSNADNKPSTSSVQNGGNSRERQVNGKSGVPAKSRPLVNKQNCKPQPSIKKPYSAVVPS 309
             S +D K  T S+   GN +++Q  GK+   A  +P+V+    KP  S K+    V PS
Sbjct: 199 KVSTSDPKSVTKSIP--GNVQQKQTIGKNSAGALKKPVVSS---KPISSSKETKGIVKPS 253

Query: 308 -----KQASNGVAGHRPVQNGTLKNGLKKNVMGLKNGITGTNSLSNGLSKDRNAQALKIS 144
                K  S  ++ H P  NG+  +  K  V  L +G+  +   +NG  +       K  
Sbjct: 254 GHLSAKDGSRAISPH-PNLNGS--SNPKNQVSKLGSGLGRSVIQANGSLRHPGINGTKAV 310

Query: 143 GCPSKGLSQNGISSDQKRPPLISNGNIKSAQKSNTNILTK--TQMNKSV 3
              S G+    +   +K+ PL+   N+    K +  +     T M K+V
Sbjct: 311 NPGSNGVKGGNLDVVKKKDPLL---NVSKVGKDHQRLSGSHPTNMQKTV 356


>ref|XP_006847162.1| hypothetical protein AMTR_s00017p00243480 [Amborella trichopoda]
           gi|548850191|gb|ERN08743.1| hypothetical protein
           AMTR_s00017p00243480 [Amborella trichopoda]
          Length = 488

 Score =  100 bits (250), Expect = 5e-19
 Identities = 104/302 (34%), Positives = 142/302 (47%), Gaps = 35/302 (11%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVRKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKT 645
           RQ+++KE  N     S      LP +NFGSFFGPS+PV+ARR+IEENRA+ EA  +A K 
Sbjct: 70  RQQMKKET-NATLGLSQEKKITLPKDNFGSFFGPSKPVIARRVIEENRAKLEAEQMAAKL 128

Query: 644 FK-SSKQPDSANVDSTVKDKEPVRRKVPVDTAKLKAQKLKEIRDYSFLFSEGAD------ 486
            K SS    S +  ++ +  E  RR   V+  KLK QKLK+ RDYSFL S  A+      
Sbjct: 129 PKASSVNKKSLSSATSERKSERDRRPKVVNEMKLKVQKLKDARDYSFLMSGDAELPSPPK 188

Query: 485 ----------SSNADNKPSTSSVQNGGNSRERQVNGKSGVPAKSRPLVNKQNCKPQPSIK 336
                     SS+A +K S     N  N +++ V+      AK+RP       K  P +K
Sbjct: 189 EHEPRNVRVPSSDAQSKQSPPR-SNPDNRKKKMVSVNQQRQAKARP------PKVDPGLK 241

Query: 335 KPYSAVV--PSKQASNGVAGHRPVQNGTLKNGLKKNVMGLK----NGITGTNSLSNGLSK 174
              S  +  PSK A +G    RP+ +    NG K    G K    NG  G     NG+  
Sbjct: 242 SSSSDPMKRPSKNAGSGPG--RPLVSKLPPNGAKLPPNGTKLPSSNG--GKVPQPNGVKL 297

Query: 173 DRNA----QALKISGCPS-----KGLSQNGISSD---QKRPPLISNGNIKSAQKSNTNIL 30
              A    +AL ++  PS     K  S  G S+    QK P L    +++   K + + L
Sbjct: 298 PPKAPVATKALVVTKAPSAPAAEKKKSSMGPSNSGAIQKAPLLKKLSSVEKLVKKSPHGL 357

Query: 29  TK 24
            K
Sbjct: 358 DK 359


>gb|EXC03893.1| hypothetical protein L484_016097 [Morus notabilis]
          Length = 1137

 Score = 97.4 bits (241), Expect = 5e-18
 Identities = 72/211 (34%), Positives = 104/211 (49%), Gaps = 26/211 (12%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVRKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKT 645
           R+KL+ + G+ P     +  RKLPN+NFGSFFGPSQP++++R+I+E ++  E  HL  K 
Sbjct: 79  RKKLKNDGGSTPNNS-SDRKRKLPNDNFGSFFGPSQPIISQRVIQETKSLLETQHLTSKV 137

Query: 644 FKSSKQPDSANVDSTVKDKEPVRRKVP--VDTAKLKAQKLKEIRDYSFLFSEGADSSNAD 471
             S+     ++  ST   K  V  K    ++  K K QKLK+ RDYSFL SE A      
Sbjct: 138 SNSAHPKKRSSSSSTAGSKPGVHDKKARVLNEVKSKVQKLKDTRDYSFLLSEDAQLPAPA 197

Query: 470 NKPSTSSV-----------------QNGGNSRERQVNGKSG----VPAKSRPLVNKQNCK 354
            +P   +V                 Q  GN+  RQV+G  G    VP    P  +K    
Sbjct: 198 KEPPQRNVPLQSADTRLAQVPVKSKQALGNN-GRQVHGNHGERKSVPMNGHP-SSKVGSN 255

Query: 353 PQPSIKKPYSAVVPSKQ---ASNGVAGHRPV 270
             PS  +P S+   S++   ++NG    RP+
Sbjct: 256 KLPSASRPNSSQTDSRKQHSSNNGTGPGRPL 286


>gb|ESW30273.1| hypothetical protein PHAVU_002G139100g [Phaseolus vulgaris]
          Length = 469

 Score = 96.7 bits (239), Expect = 9e-18
 Identities = 94/326 (28%), Positives = 145/326 (44%), Gaps = 54/326 (16%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVRKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKT 645
           R++++KE  N           KLPN+N+GSFFGPSQPV+A+R+I+E+++  E  HL+ + 
Sbjct: 49  RKQMKKE--NSSFLSDSTRKNKLPNDNYGSFFGPSQPVIAQRVIQESKSLLENQHLSSRL 106

Query: 644 FKSSKQPDSANVDSTVKDKEPVRRKVP--VDTAKLKAQKLKEIRDYSFLFSEGAD-SSNA 474
             SS    SAN  S       V R +P  V   ++KAQKLK  RDYSFL S+  +  +  
Sbjct: 107 SNSSNINKSANKVSNGVLNSSVHRNIPPKVSEKQVKAQKLKVTRDYSFLLSDDVELPAPK 166

Query: 473 DNKPSTSSVQNGGNSRERQVNGKS--------GVPAKSRPLVNKQNCKPQ-------PSI 339
              PS  ++ +    R  +V GK+        G     + +    +  P+        SI
Sbjct: 167 KEPPSHRTLVHNSEKRSGEVAGKTVNGGKIVRGSDENRKHVTGPGHLAPKSGSNYKISSI 226

Query: 338 KKPYSAVVPS-KQASN--GVAGHRPVQNGTLKNGLKKNVMGLKNGITGT--------NSL 192
            K   A V S KQ SN  G    RP++     +G  K++   KN + G         +S+
Sbjct: 227 NKYSKASVDSRKQLSNNSGNGPGRPLKMPISTSG-NKSLSDAKNPVNGVHKPLSSKMHSV 285

Query: 191 S-------------NGLSKDRNAQALKISG------------CPSKGLSQNGISSDQKRP 87
           S             NG+ K  +++   ++G             P + + Q  I  +Q +P
Sbjct: 286 SGAQKPLSSKFHSVNGVQKPLSSKLHSVNGVQKPLSSKLHLSLPKQSVEQRNILREQNKP 345

Query: 86  PLISNGNIKSAQKSNTNILTKTQMNK 9
            +IS   + S         TK Q+NK
Sbjct: 346 KMISKQPVTS---------TKAQINK 362


>ref|XP_006855453.1| hypothetical protein AMTR_s00057p00175710 [Amborella trichopoda]
           gi|548859219|gb|ERN16920.1| hypothetical protein
           AMTR_s00057p00175710 [Amborella trichopoda]
          Length = 480

 Score = 93.2 bits (230), Expect = 1e-16
 Identities = 56/111 (50%), Positives = 71/111 (63%), Gaps = 2/111 (1%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVRKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKT 645
           RQ+++KE  N     S    R LP +NFGSFFGPSQPV+ARR+IEE RA+ EA H+A K 
Sbjct: 70  RQQMKKET-NAAVGLSQEMKRTLPKDNFGSFFGPSQPVIARRVIEETRAKLEAEHMAAKL 128

Query: 644 FKSSKQPDSA--NVDSTVKDKEPVRRKVPVDTAKLKAQKLKEIRDYSFLFS 498
            K+S +   A  +  S  K +E  +R   V+  KLK QKLK+ RDYSFL S
Sbjct: 129 PKASSENKKALSSTTSERKFRERDQRLKVVNEMKLKVQKLKDARDYSFLMS 179


>ref|XP_002265425.2| PREDICTED: uncharacterized protein LOC100241361 [Vitis vinifera]
           gi|297735862|emb|CBI18616.3| unnamed protein product
           [Vitis vinifera]
          Length = 460

 Score = 93.2 bits (230), Expect = 1e-16
 Identities = 79/261 (30%), Positives = 126/261 (48%), Gaps = 14/261 (5%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGV-----RKLPNNNFGSFFGPSQPVVARRIIEENRARQEASH 660
           R++L+K+ G+      G+ +     +KLP +N+GSFFGPSQPV+A+R+I+E+++  E  H
Sbjct: 60  RKQLKKDTGS------GHNIYQEKRKKLPYDNYGSFFGPSQPVIAQRVIQESKSLLETQH 113

Query: 659 LAKKTFKS--SKQPDSANVDSTVKDKEPVRRKVPVDTAKLKAQKLKEIRDYSFLFSEGAD 486
           LA     S  + + +S + ++  + +E   R   ++  K+KAQKLK  RDYSFL S+ A+
Sbjct: 114 LASLVTNSHHNNKKNSTSTNAGSRPREQGHRPKVINELKVKAQKLKNTRDYSFLLSDDAE 173

Query: 485 --SSNADNKPSTSSVQNGGNSRERQVNGKSGVPAKSRPLVNKQNCKPQPSIKKPYSAVVP 312
             +   +  P  + V N   SR  Q+  KS  P    PL N     P    ++   ++  
Sbjct: 174 FPAPRKEPPPRKAPVPN-SESRSVQLPQKSIPPKSKPPLSNTGRQAPSSREERKPVSMNG 232

Query: 311 SKQASNGVAGHRPVQNGTLKNGLKKNVMGLKNGI-----TGTNSLSNGLSKDRNAQALKI 147
             QA  G           L +   +  +G  NG       G  SL    SK   + A K 
Sbjct: 233 QIQAKAGSQKLVSASKPNLMSVDSRKQLGTNNGAGPGRPVGPKSLP---SKMPVSSAEKK 289

Query: 146 SGCPSKGLSQNGISSDQKRPP 84
           +  P    +++ +SS  K PP
Sbjct: 290 ASAPG---ARSSMSSLHKAPP 307


>ref|XP_004172356.1| PREDICTED: uncharacterized LOC101205743 [Cucumis sativus]
          Length = 511

 Score = 91.7 bits (226), Expect = 3e-16
 Identities = 84/260 (32%), Positives = 120/260 (46%), Gaps = 14/260 (5%)
 Frame = -3

Query: 773 NGVRKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKTFKSSKQPDSANVDSTVK 594
           N  +KLP +NFGSFFGPSQPV+++R+I+E+++  E  HLA +          +   ++V 
Sbjct: 71  NDRKKLPYDNFGSFFGPSQPVISQRVIQESKSLLENQHLASRVSDHDHGNKKSQGSNSVA 130

Query: 593 DKEPVRRKVPVDTAKLKAQKLKEIRDYSFLFSEGADSSNADNKPSTSSVQNGGNSRERQV 414
            K  V  KV V   + K QKLK+ RDYSFLFSE A+      + S S       +R  QV
Sbjct: 131 SKPRVLPKV-VSEKQTKVQKLKDTRDYSFLFSEDANVPAPSKESSRSVYAPSTEARSAQV 189

Query: 413 NGKSGVPAKSRPLVNKQNCKPQPSIKK--PYSAVVPS--KQASNG------VAGHRPVQN 264
             KS    K  P   +QN       KK  P + ++ S  K AS+G      +   + + N
Sbjct: 190 PMKS----KHPPSNPRQNIHVDHKEKKSVPMNGLMQSKNKSASSGNSNLSMMKAKKQLVN 245

Query: 263 GTLKNGLKKNVMGLKNGITGTNSLSNGLSKDRNAQALKIS---GCPSKGLSQNGISSDQK 93
               NG     MG  N       +SN  S +R+ + L  S     P + L  +   +   
Sbjct: 246 SCSGNG-PGRPMGNNNESGPGRPMSNSNSGNRSGRPLGNSNNGNGPGRPLGNSNNGNGPG 304

Query: 92  RP-PLISNGNIKSAQKSNTN 36
           RP    +NGN       N+N
Sbjct: 305 RPLGNSNNGNGPGRPLGNSN 324


>ref|XP_004143538.1| PREDICTED: uncharacterized protein LOC101205743 [Cucumis sativus]
          Length = 524

 Score = 91.7 bits (226), Expect = 3e-16
 Identities = 84/260 (32%), Positives = 120/260 (46%), Gaps = 14/260 (5%)
 Frame = -3

Query: 773 NGVRKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKTFKSSKQPDSANVDSTVK 594
           N  +KLP +NFGSFFGPSQPV+++R+I+E+++  E  HLA +          +   ++V 
Sbjct: 71  NDRKKLPYDNFGSFFGPSQPVISQRVIQESKSLLENQHLASRVSDHDHGNKKSQGSNSVA 130

Query: 593 DKEPVRRKVPVDTAKLKAQKLKEIRDYSFLFSEGADSSNADNKPSTSSVQNGGNSRERQV 414
            K  V  KV V   + K QKLK+ RDYSFLFSE A+      + S S       +R  QV
Sbjct: 131 SKPRVLPKV-VSEKQTKVQKLKDTRDYSFLFSEDANVPAPSKESSRSVYAPSTEARSAQV 189

Query: 413 NGKSGVPAKSRPLVNKQNCKPQPSIKK--PYSAVVPS--KQASNG------VAGHRPVQN 264
             KS    K  P   +QN       KK  P + ++ S  K AS+G      +   + + N
Sbjct: 190 PMKS----KHPPSNPRQNIHVDHKEKKSVPMNGLMQSKNKSASSGNSNLSMMKAKKQLVN 245

Query: 263 GTLKNGLKKNVMGLKNGITGTNSLSNGLSKDRNAQALKIS---GCPSKGLSQNGISSDQK 93
               NG     MG  N       +SN  S +R+ + L  S     P + L  +   +   
Sbjct: 246 SCSGNG-PGRPMGNNNESGPGRPMSNSNSGNRSGRPLGNSNNGNGPGRPLGNSNNGNGPG 304

Query: 92  RP-PLISNGNIKSAQKSNTN 36
           RP    +NGN       N+N
Sbjct: 305 RPLGNSNNGNGPGRPLGNSN 324


>ref|XP_006296087.1| hypothetical protein CARUB_v10025237mg [Capsella rubella]
           gi|482564795|gb|EOA28985.1| hypothetical protein
           CARUB_v10025237mg [Capsella rubella]
          Length = 657

 Score = 88.6 bits (218), Expect = 2e-15
 Identities = 81/259 (31%), Positives = 124/259 (47%), Gaps = 23/259 (8%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGV------RKLPNNNFGSFFGPSQPVVARRIIEENRARQEAS 663
           RQKL++         S NG       RKLP N+FGSFFGPSQPV++ R+I+E+++  E  
Sbjct: 136 RQKLKETYKKNMGKASANGQSSQERRRKLPYNDFGSFFGPSQPVISSRVIQESKSLLENE 195

Query: 662 HLAKKTFKSS---KQPDSANVDSTVKDKEPVRRKVPVDTAKLKAQKLKEIRDYSFLFSEG 492
               K   SS   K+P S +  S  K+    +R   V+  + K + LK+ RDYSFLFS+ 
Sbjct: 196 IRTAKMSNSSQTKKRPVSTS-GSVAKNVSQEKRPRVVNEVRRKVEALKDTRDYSFLFSDD 254

Query: 491 AD--------SSNADNKPSTSSVQNGGNSRERQVNGKSGVPAKSRPLVNKQ----NCKPQ 348
           A+         S   + P+  +     ++R +Q +G +G  A   P   K+    N   +
Sbjct: 255 AELPVPKKESLSRTGSFPNPEARSAQLSARPKQSSGANGRTAHGPPREEKRPVSANGHSR 314

Query: 347 PSIKKPYSAVVPSKQASNG--VAGHRPVQNGTLKNGLKKNVMGLKNGITGTNSLSNGLSK 174
           PS     S +  S+ +S+G  +   RP  +G+  N LK++  G +     + S     S 
Sbjct: 315 PS--SSGSQMNHSRPSSSGSQMNHSRPSSSGSQMNHLKQSSSGSQMRPASSGSQMRPASS 372

Query: 173 DRNAQALKISGCPSKGLSQ 117
               Q+  +SG PS   SQ
Sbjct: 373 GSQMQSRAVSGRPSSIGSQ 391


>gb|EMJ08612.1| hypothetical protein PRUPE_ppa014640mg [Prunus persica]
          Length = 509

 Score = 88.2 bits (217), Expect = 3e-15
 Identities = 81/268 (30%), Positives = 128/268 (47%), Gaps = 5/268 (1%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVRKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKT 645
           R++++KE G      S +  +KLP +N+GSFFGPSQP+++ R+I+E+++  E  HLA + 
Sbjct: 65  RKQMKKE-GGSSLANSSDKKKKLPYDNYGSFFGPSQPIISERVIQESKSLLETQHLASRV 123

Query: 644 FKSSKQPDSANVDSTVKDKEPV---RRKVPVDTAKLKAQKLKEIRDYSFLFSEGAD--SS 480
             SS      +  ST    +PV   ++   ++ AK K QKLK+ RDYSFL S+  +  +S
Sbjct: 124 -SSSLHSSKKSSGSTSAGSKPVAYNQKPRVINEAKNKVQKLKDTRDYSFLLSDDVELPAS 182

Query: 479 NADNKPSTSSVQNGGNSRERQVNGKSGVPAKSRPLVNKQNCKPQPSIKKPYSAVVPSKQA 300
             D  P + SV N    R  Q+  KS +P  +    N ++       +KP S    +  +
Sbjct: 183 ANDRPPRSVSVPN-SEVRSSQMAPKSKLPMAN----NGRHAHGGRDERKPASM---NGHS 234

Query: 299 SNGVAGHRPVQNGTLKNGLKKNVMGLKNGITGTNSLSNGLSKDRNAQALKISGCPSKGLS 120
             G    RPV      NG +      +      N  ++G   +R   ++      SKG  
Sbjct: 235 HGGRDERRPVSMNGPSNGGRD-----ERRPVSMNGHAHGGRDERRPVSMN-GQVHSKG-G 287

Query: 119 QNGISSDQKRPPLISNGNIKSAQKSNTN 36
            N +SS  +RP   S  + K    +N N
Sbjct: 288 PNKLSSASRRPDSTSVDSRKQFGSNNGN 315


>ref|NP_973512.1| SPT2 chromatin protein [Arabidopsis thaliana]
           gi|330252249|gb|AEC07343.1| SPT2 chromatin protein
           [Arabidopsis thaliana]
          Length = 672

 Score = 87.4 bits (215), Expect = 5e-15
 Identities = 78/267 (29%), Positives = 127/267 (47%), Gaps = 6/267 (2%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVR--KLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAK 651
           ++ +RK++GN       +  R  KLP N+FGSFFGPS+PV++ R+I+E+++  E + L K
Sbjct: 153 KESIRKKMGNGSANAQSSQERRRKLPYNDFGSFFGPSRPVISSRVIQESKSLLE-NELRK 211

Query: 650 --KTFKSSKQPDSANVDSTVKDKEPVRRKVPVDTAKLKAQKLKEIRDYSFLFSEGADSSN 477
              + ++ K+P   N   +    +  R KV V+  + K + LK+ RDYSFLFS+ A+   
Sbjct: 212 MSNSSQTKKRPVPTNGSGSKNVSQEKRPKV-VNEVRRKVETLKDTRDYSFLFSDDAELP- 269

Query: 476 ADNKPSTSSVQNGGNSRERQVNGKSGVPAKSRPLVNKQNCKPQPSIKKPYSAVVPSKQAS 297
              K S S   +  NS  R     S  P +S  +  +    P    K+P SA   S+ +S
Sbjct: 270 VPKKESLSRSGSFPNSEARSAQ-LSSRPKQSSGINGRTAHSPHREEKRPVSANGHSRPSS 328

Query: 296 NG--VAGHRPVQNGTLKNGLKKNVMGLKNGITGTNSLSNGLSKDRNAQALKISGCPSKGL 123
           +G  +   RP  +G+  N  +    G +      NS          ++A+  SG P+   
Sbjct: 329 SGSQMNHSRPSSSGSKMNHSRPATSGSQM----PNSRPASSGSQMQSRAVSGSGRPASSG 384

Query: 122 SQNGISSDQKRPPLISNGNIKSAQKSN 42
           SQ   S  Q   P  +   ++    S+
Sbjct: 385 SQMQNSRPQNSRPASAGSQMQQRPASS 411


>ref|NP_973513.1| SPT2 chromatin protein [Arabidopsis thaliana]
           gi|110741100|dbj|BAE98644.1| hypothetical protein
           [Arabidopsis thaliana] gi|330252251|gb|AEC07345.1| SPT2
           chromatin protein [Arabidopsis thaliana]
          Length = 569

 Score = 87.4 bits (215), Expect = 5e-15
 Identities = 78/267 (29%), Positives = 127/267 (47%), Gaps = 6/267 (2%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVR--KLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAK 651
           ++ +RK++GN       +  R  KLP N+FGSFFGPS+PV++ R+I+E+++  E + L K
Sbjct: 50  KESIRKKMGNGSANAQSSQERRRKLPYNDFGSFFGPSRPVISSRVIQESKSLLE-NELRK 108

Query: 650 --KTFKSSKQPDSANVDSTVKDKEPVRRKVPVDTAKLKAQKLKEIRDYSFLFSEGADSSN 477
              + ++ K+P   N   +    +  R KV V+  + K + LK+ RDYSFLFS+ A+   
Sbjct: 109 MSNSSQTKKRPVPTNGSGSKNVSQEKRPKV-VNEVRRKVETLKDTRDYSFLFSDDAELP- 166

Query: 476 ADNKPSTSSVQNGGNSRERQVNGKSGVPAKSRPLVNKQNCKPQPSIKKPYSAVVPSKQAS 297
              K S S   +  NS  R     S  P +S  +  +    P    K+P SA   S+ +S
Sbjct: 167 VPKKESLSRSGSFPNSEARSAQ-LSSRPKQSSGINGRTAHSPHREEKRPVSANGHSRPSS 225

Query: 296 NG--VAGHRPVQNGTLKNGLKKNVMGLKNGITGTNSLSNGLSKDRNAQALKISGCPSKGL 123
           +G  +   RP  +G+  N  +    G +      NS          ++A+  SG P+   
Sbjct: 226 SGSQMNHSRPSSSGSKMNHSRPATSGSQM----PNSRPASSGSQMQSRAVSGSGRPASSG 281

Query: 122 SQNGISSDQKRPPLISNGNIKSAQKSN 42
           SQ   S  Q   P  +   ++    S+
Sbjct: 282 SQMQNSRPQNSRPASAGSQMQQRPASS 308


>gb|ESW28425.1| hypothetical protein PHAVU_003G285300g [Phaseolus vulgaris]
          Length = 545

 Score = 87.0 bits (214), Expect = 7e-15
 Identities = 67/220 (30%), Positives = 98/220 (44%), Gaps = 24/220 (10%)
 Frame = -3

Query: 761 KLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKTFKSSKQPDSANVDSTVKDKEP 582
           KLP +N+GSFFGPSQPV+A+R+I+E+++  E  HLA K         + N   +   K  
Sbjct: 84  KLPYDNYGSFFGPSQPVIAQRVIQESKSLLENQHLASKVPNPHHAKKNQNKAPSGGSKSS 143

Query: 581 VRRKVP-VDTAKLKAQKLKEIRDYSFLFSEGADSSNADNKPSTSSVQ-NGGNSRERQVNG 408
                P V   ++KAQK K+ RDYSFL S+ A+   A   P   ++       R  QV  
Sbjct: 144 SHNPPPKVSEVQVKAQKRKDTRDYSFLLSDDAELPAASKAPPPQNMHIRNSEGRPAQVPA 203

Query: 407 KSGVPA-----------KSRPLVNKQNCKP--------QPSIKKPYSAVVPSKQASNGVA 285
           +S VP            + R L +     P          S  KP  A   S++     +
Sbjct: 204 RSKVPLSNGSKHVRTSHEERNLGSGAGRMPPKSGSGYKTSSTSKPSMASADSRKQLGNNS 263

Query: 284 GH---RPVQNGTLKNGLKKNVMGLKNGITGTNSLSNGLSK 174
           GH   RPV +  + + +     G K+   G  +  NG+ K
Sbjct: 264 GHGPGRPVGSNGMSSKMSVGSTGNKSSTPGIKNPVNGMPK 303


>ref|XP_004288305.1| PREDICTED: uncharacterized protein LOC101303938 [Fragaria vesca
           subsp. vesca]
          Length = 576

 Score = 86.7 bits (213), Expect = 9e-15
 Identities = 87/283 (30%), Positives = 136/283 (48%), Gaps = 20/283 (7%)
 Frame = -3

Query: 824 RQKLRKELG-NLPXXXSGNGV----RKLP-NNNFGSFFGPSQPVVARRIIEENRARQEAS 663
           RQ+L++++  ++     G G     +KLP + +FGSFFGPSQPV+A R+I+E+++  E  
Sbjct: 65  RQRLKEQIRRDMKRKEGGRGNSDDRKKLPYDKSFGSFFGPSQPVIADRVIQESKSLLETR 124

Query: 662 HLAKKTFKSSKQPDSANVDSTVKDKEPV---RRKVPVDTAKLKAQKLKEIRDYSFLFSEG 492
           HLA +   +S  P+  N  ST    +PV   ++   ++  K   QK K+ RDY+FLFSE 
Sbjct: 125 HLASRA-TNSVHPNKKNSGSTSSGSKPVAHTQKPNVINEQKNIVQKRKDTRDYAFLFSED 183

Query: 491 AD-SSNADNKPSTSSVQNGGNSRERQVNGKSGVPAKSRPLVNKQNCKPQPSIKKPYSAVV 315
           A+  + A ++P  S      ++   +V     V    +PLVN           +P     
Sbjct: 184 AELPAPAKDRPPRSD-----SAPTSEVRSSQTVMKSKQPLVNN---------SRPLHGGQ 229

Query: 314 PSKQASNGVAGHRPVQNGTLKNGLKKNVMGLKNGITGTNSLSNGL----SKD--RNAQAL 153
            + ++ +G    RPVQ G+  NG  + V G ++      S  NG     S+D  R  Q  
Sbjct: 230 DNGRSIHGRDSGRPVQ-GSRDNG--RPVHGSRDNRAVHGSHDNGRPVHGSRDSGRPVQGS 286

Query: 152 KISGCPSKGLSQNGI----SSDQKRPPLISNGNIKSAQKSNTN 36
           + SG P +G   +G     S D  RP   S  N +    S+ N
Sbjct: 287 RDSGRPVQGSRDSGRPVQGSRDSGRPVQGSRENGRPVHGSHDN 329


>gb|EOX91834.1| SPT2 chromatin protein, putative isoform 2 [Theobroma cacao]
          Length = 456

 Score = 85.5 bits (210), Expect = 2e-14
 Identities = 67/208 (32%), Positives = 105/208 (50%), Gaps = 9/208 (4%)
 Frame = -3

Query: 764 RKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKTFKSSKQPDSANVDSTVKDKE 585
           ++LP +NFGSFFGPSQPV+A+R+I+E+++  E  HL  K   S++     +V ++   K 
Sbjct: 78  KRLPYDNFGSFFGPSQPVIAQRVIQESKSLLENQHLVSKMLSSNQSGKKNSVSNSAGSKL 137

Query: 584 PVRRKVPVDTAKL--KAQKLKEIRDYSFLFSEGADSSNADNKPSTSSVQNGGNSRERQVN 411
             R  VP  T++L  K +KLK  RDYSFL S+ A+      +P   +V N   S  R   
Sbjct: 138 GQRGLVPKATSELKKKVEKLKVARDYSFL-SDDAEVPAPPREPPPRNV-NVPTSEARSAQ 195

Query: 410 GKSGVPAKSRPLVNKQNCKPQPSIKK-----PYSAVVPSKQASNGVAGHRPVQNGTLKNG 246
               +  KS+PL+   N +    I++     P +  + SK  S   +  +P     + + 
Sbjct: 196 ----MLPKSKPLLGSNNGRNVQGIREERKPVPLNGQMHSKAGSYKSSASKP----NVMSM 247

Query: 245 LKKNVMGLKNGITGTN--SLSNGLSKDR 168
             K  +G+ NGI       +SNG+   R
Sbjct: 248 DSKKQLGVNNGIGPGRPVGVSNGVGPGR 275


>gb|EOX91833.1| SPT2 chromatin protein, putative isoform 1 [Theobroma cacao]
          Length = 464

 Score = 85.5 bits (210), Expect = 2e-14
 Identities = 66/211 (31%), Positives = 107/211 (50%), Gaps = 12/211 (5%)
 Frame = -3

Query: 764 RKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKTFKSSKQPDSANVDSTVKDKE 585
           ++LP +NFGSFFGPSQPV+A+R+I+E+++  E  HL  K   S++     +V ++   K 
Sbjct: 78  KRLPYDNFGSFFGPSQPVIAQRVIQESKSLLENQHLVSKMLSSNQSGKKNSVSNSAGSKL 137

Query: 584 PVRRKVPVDTAKL--KAQKLKEIRDYSFLFSEGADSSNADNKPSTSSVQ---NGGNSRER 420
             R  VP  T++L  K +KLK  RDYSFL S+ A+      +P   +V    +G    + 
Sbjct: 138 GQRGLVPKATSELKKKVEKLKVARDYSFL-SDDAEVPAPPREPPPRNVNVPTSGRVFADF 196

Query: 419 QVNGKSGVPAKSRPLVNKQNCKPQPSIKK-----PYSAVVPSKQASNGVAGHRPVQNGTL 255
           Q    + +  KS+PL+   N +    I++     P +  + SK  S   +  +P     +
Sbjct: 197 QEARSAQMLPKSKPLLGSNNGRNVQGIREERKPVPLNGQMHSKAGSYKSSASKP----NV 252

Query: 254 KNGLKKNVMGLKNGITGTN--SLSNGLSKDR 168
            +   K  +G+ NGI       +SNG+   R
Sbjct: 253 MSMDSKKQLGVNNGIGPGRPVGVSNGVGPGR 283


>gb|AFK39413.1| unknown [Lotus japonicus]
          Length = 204

 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 55/146 (37%), Positives = 84/146 (57%), Gaps = 8/146 (5%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVRKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKT 645
           R++LRKE  + P        +KLPN+NFGSFFGPSQPV+A R+I+E+++  E  HL    
Sbjct: 66  RKQLRKE-NSAPLKDCSATKKKLPNDNFGSFFGPSQPVIAPRVIQESKSLLENQHL---- 120

Query: 644 FKSSKQPDSANVDSTVKD-----KEPVRRKVP--VDTAKLKAQKLKEIRDYSFLFSEGAD 486
              S+  ++++V+  VK       +P   K P  V   K++AQ +K+ RDYSFL S+ A+
Sbjct: 121 --QSRLSNTSHVNKNVKKVSNGVMKPSSHKQPPKVSETKIRAQTVKDTRDYSFLMSDDAE 178

Query: 485 -SSNADNKPSTSSVQNGGNSRERQVN 411
             + A   PS ++  +    R  QV+
Sbjct: 179 LPAPAKELPSRNTSVHNSEGRPAQVD 204


>ref|XP_004512539.1| PREDICTED: protein SPT2 homolog [Cicer arietinum]
          Length = 465

 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 78/292 (26%), Positives = 135/292 (46%), Gaps = 34/292 (11%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVRKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKT 645
           R+K++KE  ++    S    +++ ++N+GSFFGPSQPV+A+R+I+E+++  E  HL  K 
Sbjct: 63  RKKMKKE-NSISVADSSVRKKQIRHDNYGSFFGPSQPVIAQRVIQESKSLLENRHLVPKP 121

Query: 644 FKSSKQPDSANVDSTVKDKEPVRRKVP-VDTAKLKAQKLKEIRDYSFLFSEGADSSNADN 468
             + +   S N  S    K     + P V+  ++KA+KLK  RDYSFL S+ A+      
Sbjct: 122 SNTPQTNKSTNKVSNGVLKPSAHNQPPKVNEKQVKAEKLKVTRDYSFLLSDDAELPAPSK 181

Query: 467 KPSTSSV----------QNGGNSRERQVNG-----KSGVPAKSRPLVNKQNCKPQPSIK- 336
           +P + ++          Q  G S++   NG      SG   K     +    KPQ + K 
Sbjct: 182 EPPSRNISVRSSVGQAAQVAGRSKQSMSNGGKLVRSSGEDRKLVARASHLAPKPQSNYKL 241

Query: 335 ----KPYSAVVPSKQ--ASNGVAGH-----------RPVQNGTLKNGLKKNVMGLKNGIT 207
               +   A V S++   SNG  G            RPV    L + +  + MG K+   
Sbjct: 242 SSASQASKASVDSRKQIGSNGRNGQQLGSNSGNGPGRPVGPKGLPSKMPVHSMGNKSVTP 301

Query: 206 GTNSLSNGLSKDRNAQALKISGCPSKGLSQNGISSDQKRPPLISNGNIKSAQ 51
           G  + +NG+ +   +   ++     K + Q     +Q +P ++    + S++
Sbjct: 302 GMRNPANGVQRPPTS---RVPSSVPKHVEQRRDVREQNKPRILPKQPVSSSK 350


>gb|EOX91835.1| Mitochondrial isoform 3 [Theobroma cacao]
          Length = 361

 Score = 84.3 bits (207), Expect = 5e-14
 Identities = 66/206 (32%), Positives = 104/206 (50%), Gaps = 7/206 (3%)
 Frame = -3

Query: 764 RKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKTFKSSKQPDSANVDSTVKDKE 585
           ++LP +NFGSFFGPSQPV+A+R+I+E+++  E  HL  K   S++     +V ++   K 
Sbjct: 78  KRLPYDNFGSFFGPSQPVIAQRVIQESKSLLENQHLVSKMLSSNQSGKKNSVSNSAGSKL 137

Query: 584 PVRRKVPVDTAKLKAQKLKEIRDYSFLFSEGADSSNADNKPSTSSVQNGGNSRERQVNGK 405
             R  VP  T++ K +KLK  RDYSFL S+ A+      +P   +V N   S  R     
Sbjct: 138 GQRGLVPKATSE-KVEKLKVARDYSFL-SDDAEVPAPPREPPPRNV-NVPTSEARSAQ-- 192

Query: 404 SGVPAKSRPLVNKQNCKPQPSIKK-----PYSAVVPSKQASNGVAGHRPVQNGTLKNGLK 240
             +  KS+PL+   N +    I++     P +  + SK  S   +  +P     + +   
Sbjct: 193 --MLPKSKPLLGSNNGRNVQGIREERKPVPLNGQMHSKAGSYKSSASKP----NVMSMDS 246

Query: 239 KNVMGLKNGITGTN--SLSNGLSKDR 168
           K  +G+ NGI       +SNG+   R
Sbjct: 247 KKQLGVNNGIGPGRPVGVSNGVGPGR 272


>ref|XP_006380812.1| hypothetical protein POPTR_0007s14450g [Populus trichocarpa]
           gi|550334873|gb|ERP58609.1| hypothetical protein
           POPTR_0007s14450g [Populus trichocarpa]
          Length = 336

 Score = 84.0 bits (206), Expect = 6e-14
 Identities = 58/192 (30%), Positives = 94/192 (48%), Gaps = 15/192 (7%)
 Frame = -3

Query: 824 RQKLRKELGNLPXXXSGNGVRKLPNNNFGSFFGPSQPVVARRIIEENRARQEASHLAKKT 645
           R+K+RKE G+          +KLP++N+GSFFGPSQPV+++R+I+E+++  E  HLA + 
Sbjct: 54  RKKMRKESGST--LSKSQEKKKLPSDNYGSFFGPSQPVISQRVIQESKSILENQHLALRV 111

Query: 644 FKSSKQPDSANVDSTVKDKEPVRRKVP--VDTAKLKAQKLKEIRDYSFLFSEGADSSNAD 471
             +      ++  +    K  V   VP   +  K K QKLK+ RDYSFL ++ A+     
Sbjct: 112 PNAQHTNKKSSSSTATGLKNRVHGLVPKVKNEVKTKVQKLKDTRDYSFLLTDDAELPAPT 171

Query: 470 NKPSTSSVQ-NGGNSRERQVNGK------------SGVPAKSRPLVNKQNCKPQPSIKKP 330
            +P+  +V      +R  QV  K             G+  + +P+        +   +KP
Sbjct: 172 KEPAPRNVSAPNSEARSAQVPQKIKQASSNSGRNIHGIREERKPVFRNGQMHSKVGSQKP 231

Query: 329 YSAVVPSKQASN 294
            SA  P   + N
Sbjct: 232 TSANKPDATSIN 243

