BLASTX nr result

ID: Ephedra26_contig00010521 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00010521
         (788 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006404302.1| hypothetical protein EUTSA_v10010055mg [Eutr...   116   1e-23
ref|XP_006404300.1| hypothetical protein EUTSA_v10010055mg [Eutr...   116   1e-23
ref|XP_002511444.1| conserved hypothetical protein [Ricinus comm...   114   5e-23
ref|XP_002877594.1| hypothetical protein ARALYDRAFT_485166 [Arab...   110   4e-22
ref|NP_190388.2| BAH and TFIIS domain-containing protein [Arabid...   110   6e-22
emb|CAB41134.1| putative protein [Arabidopsis thaliana]               110   6e-22
ref|XP_003634295.1| PREDICTED: uncharacterized protein LOC100248...   110   7e-22
emb|CAN60153.1| hypothetical protein VITISV_021504 [Vitis vinifera]   110   7e-22
gb|ABR16880.1| unknown [Picea sitchensis]                             109   9e-22
ref|XP_006290492.1| hypothetical protein CARUB_v10016566mg [Caps...   109   1e-21
ref|XP_002511441.1| DNA binding protein, putative [Ricinus commu...   108   2e-21
gb|EOY20638.1| BAH domain,TFIIS helical bundle-like domain isofo...   108   2e-21
gb|EOY20637.1| BAH domain,TFIIS helical bundle-like domain isofo...   108   2e-21
gb|EOY20634.1| BAH domain,TFIIS helical bundle-like domain isofo...   108   2e-21
ref|XP_006290494.1| hypothetical protein CARUB_v10016567mg [Caps...   107   5e-21
ref|XP_006290493.1| hypothetical protein CARUB_v10016567mg [Caps...   107   5e-21
ref|NP_190389.1| BAH and TFIIS domain-containing protein [Arabid...   106   1e-20
gb|EXC31170.1| hypothetical protein L484_004936 [Morus notabilis]     105   1e-20
ref|XP_002321576.2| hypothetical protein POPTR_0015s08410g [Popu...   105   2e-20
ref|XP_002866520.1| hypothetical protein ARALYDRAFT_919569 [Arab...   104   4e-20

>ref|XP_006404302.1| hypothetical protein EUTSA_v10010055mg [Eutrema salsugineum]
            gi|557105421|gb|ESQ45755.1| hypothetical protein
            EUTSA_v10010055mg [Eutrema salsugineum]
          Length = 1616

 Score =  116 bits (290), Expect = 1e-23
 Identities = 91/265 (34%), Positives = 132/265 (49%), Gaps = 8/265 (3%)
 Frame = -1

Query: 782  LDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTGKL 603
            LD DLNVPDE  +ED      SQ S   T+      +  + V S    SA + S+  G L
Sbjct: 1255 LDFDLNVPDERLLED----LASQRSGNATNCTPAVTNSFDQVRSGVMGSALDHSS--GGL 1308

Query: 602  DLDLNKAEDYEDAGQ--VSSYNEVSNS--AVKTFADGFSNGGQSQGARGFDLNDGPSSED 435
            DLDLNK +D  D     +SS + + +S   VK  + G        G R FDLNDGP+ +D
Sbjct: 1309 DLDLNKVDDSTDMNNYTMSSGHRLDSSFQQVKLSSPG--------GRRDFDLNDGPAGDD 1360

Query: 434  AAMEHNPWISSAKGKANSYVPSLSGWPMNGE-LLNVSPWLPPVTSHPSLSMHPIPSDRAD 258
            A +E +  +S          PSLSG  +NGE + + S W P   ++ ++S+ PI  +R D
Sbjct: 1361 AVVESSMGLSQHSRSGLPSQPSLSGIRVNGENMASFSTWFPAANAYSAVSIPPIMPERGD 1420

Query: 257  RTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYPNVTQPTFPYGAYXXXXXXX 81
            + +PM+A+   Q +L  ++   +++ D YRGPVL+S+ A P     TF Y  +       
Sbjct: 1421 QPFPMIANRGPQRMLGPTTGVSSFAPDGYRGPVLSSSPAMP-FQNTTFQYPVFPFGNNFP 1479

Query: 80   XXXXXXSGGISSFRD--PIGATCFP 12
                  SGG ++  D    G  CFP
Sbjct: 1480 IASANFSGGSTTHMDSSSSGRACFP 1504


>ref|XP_006404300.1| hypothetical protein EUTSA_v10010055mg [Eutrema salsugineum]
            gi|567189127|ref|XP_006404301.1| hypothetical protein
            EUTSA_v10010055mg [Eutrema salsugineum]
            gi|557105419|gb|ESQ45753.1| hypothetical protein
            EUTSA_v10010055mg [Eutrema salsugineum]
            gi|557105420|gb|ESQ45754.1| hypothetical protein
            EUTSA_v10010055mg [Eutrema salsugineum]
          Length = 1615

 Score =  116 bits (290), Expect = 1e-23
 Identities = 91/265 (34%), Positives = 132/265 (49%), Gaps = 8/265 (3%)
 Frame = -1

Query: 782  LDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTGKL 603
            LD DLNVPDE  +ED      SQ S   T+      +  + V S    SA + S+  G L
Sbjct: 1254 LDFDLNVPDERLLED----LASQRSGNATNCTPAVTNSFDQVRSGVMGSALDHSS--GGL 1307

Query: 602  DLDLNKAEDYEDAGQ--VSSYNEVSNS--AVKTFADGFSNGGQSQGARGFDLNDGPSSED 435
            DLDLNK +D  D     +SS + + +S   VK  + G        G R FDLNDGP+ +D
Sbjct: 1308 DLDLNKVDDSTDMNNYTMSSGHRLDSSFQQVKLSSPG--------GRRDFDLNDGPAGDD 1359

Query: 434  AAMEHNPWISSAKGKANSYVPSLSGWPMNGE-LLNVSPWLPPVTSHPSLSMHPIPSDRAD 258
            A +E +  +S          PSLSG  +NGE + + S W P   ++ ++S+ PI  +R D
Sbjct: 1360 AVVESSMGLSQHSRSGLPSQPSLSGIRVNGENMASFSTWFPAANAYSAVSIPPIMPERGD 1419

Query: 257  RTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYPNVTQPTFPYGAYXXXXXXX 81
            + +PM+A+   Q +L  ++   +++ D YRGPVL+S+ A P     TF Y  +       
Sbjct: 1420 QPFPMIANRGPQRMLGPTTGVSSFAPDGYRGPVLSSSPAMP-FQNTTFQYPVFPFGNNFP 1478

Query: 80   XXXXXXSGGISSFRD--PIGATCFP 12
                  SGG ++  D    G  CFP
Sbjct: 1479 IASANFSGGSTTHMDSSSSGRACFP 1503


>ref|XP_002511444.1| conserved hypothetical protein [Ricinus communis]
            gi|223550559|gb|EEF52046.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1651

 Score =  114 bits (284), Expect = 5e-23
 Identities = 89/266 (33%), Positives = 125/266 (46%), Gaps = 7/266 (2%)
 Frame = -1

Query: 788  PHLDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTG 609
            P LD DLNVPDE  +ED      S+ S   T S     +  N       +S P     +G
Sbjct: 1283 PPLDFDLNVPDERILED----MASRGSVHGTVSVANLSNNLNLQHDEIVVSEP--VRGSG 1336

Query: 608  KLDLDLNKAEDYEDAGQVSSYN----EVSNSAVKTFADGFSNGGQSQGARGFDLNDGPSS 441
             LDLDLN+ E+  D G   + N    +     VK+ +    NG +S   R FDLNDGP  
Sbjct: 1337 GLDLDLNRVEEPNDVGNHLTSNGRRIDAHLQGVKSSSGAVLNG-ESTVRRDFDLNDGPLL 1395

Query: 440  EDAAMEHNPWISSAKGKANSYVPSLSGWPMNG-ELLNVSPWLPPVTSHPSLSMHPIPSDR 264
            ++   E +P+    +    S  PS+SG  +N  E+ N S W   V S+P++++  I  +R
Sbjct: 1396 DEVNAEVSPFSQHIRNNTPSQ-PSVSGLRLNNTEMGNFSSWFSQVNSYPAVAIQSILPER 1454

Query: 263  ADRTYPMVASSVAQHILSSSSAPAYSGDAYRGPVLASTVAYPNVTQPTFPYGAYXXXXXX 84
             ++ +PMV     Q IL  S +  ++ D YRGPVL+S  A P    P F Y  +      
Sbjct: 1455 GEQPFPMVTPGGPQRILPPSGSTPFNPDVYRGPVLSSAPAVPFPASP-FQYPVFPFGTNL 1513

Query: 83   XXXXXXXSGGISSFRDPI--GATCFP 12
                   SGG S++ D    G  CFP
Sbjct: 1514 PLPSATFSGGSSTYVDSSSGGRLCFP 1539


>ref|XP_002877594.1| hypothetical protein ARALYDRAFT_485166 [Arabidopsis lyrata subsp.
            lyrata] gi|297323432|gb|EFH53853.1| hypothetical protein
            ARALYDRAFT_485166 [Arabidopsis lyrata subsp. lyrata]
          Length = 1613

 Score =  110 bits (276), Expect = 4e-22
 Identities = 82/239 (34%), Positives = 127/239 (53%), Gaps = 14/239 (5%)
 Frame = -1

Query: 782  LDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDL--SAPNGST--- 618
            LD DLNVPDE  +ED        ++++ T       SG   + +S D   S   GS    
Sbjct: 1253 LDFDLNVPDERVLED--------LASQRTGIATNCTSG---ITNSFDQVRSGVMGSALDH 1301

Query: 617  STGKLDLDLNKAEDYEDAGQVSSYNEVSNSAVKTFADGF---SNGGQSQGARGFDLNDGP 447
            S+G LDLDLNK +D  D   +++YN  S+  + +        S GG+    R FDLNDGP
Sbjct: 1302 SSGGLDLDLNKVDDSTD---MNNYNMSSSHRLDSSFQHVKLPSTGGR----RDFDLNDGP 1354

Query: 446  SSEDAAMEHNPWISSAKGKANSYVPSLSGWPMNGE-LLNVSPWLPPVTSHPSLSMHPIPS 270
            + +DAA+E +  ++          PSLSG  +NGE + + S W P   ++ ++S+ PI  
Sbjct: 1355 AGDDAAVEPSMVLNQHSRSGLPSQPSLSGIRVNGENMASFSTWFPAANAYSAVSIPPIMP 1414

Query: 269  DRADRTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYP----NVTQPTFPYG 108
            +R D+ +PM+A+   Q +L  ++   +++ + YRGPVL+S+ A P        P FP+G
Sbjct: 1415 ERGDQPFPMIANRGPQRMLGPTTGVSSFAPEGYRGPVLSSSPAMPFQSTTFQYPVFPFG 1473


>ref|NP_190388.2| BAH and TFIIS domain-containing protein [Arabidopsis thaliana]
            gi|186510770|ref|NP_850669.2| BAH and TFIIS
            domain-containing protein [Arabidopsis thaliana]
            gi|332644839|gb|AEE78360.1| BAH and TFIIS
            domain-containing protein [Arabidopsis thaliana]
            gi|332644840|gb|AEE78361.1| BAH and TFIIS
            domain-containing protein [Arabidopsis thaliana]
          Length = 1613

 Score =  110 bits (275), Expect = 6e-22
 Identities = 83/235 (35%), Positives = 124/235 (52%), Gaps = 10/235 (4%)
 Frame = -1

Query: 782  LDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTGKL 603
            LD DLNVPDE  +ED      SQ S   T+      +  + V S    SA + S+  G L
Sbjct: 1253 LDFDLNVPDERVLED----LASQRSGNPTNCTSDITNSFDQVRSGVMGSALDHSS--GGL 1306

Query: 602  DLDLNKAEDYED--AGQVSSYNEVSNS--AVKTFADGFSNGGQSQGARGFDLNDGPSSED 435
            DLDLNK +D  D  +  ++S + + +S   VK  + G        G R FDLNDGP  +D
Sbjct: 1307 DLDLNKVDDSTDMISYTMNSSHRLDSSFQQVKLPSTG--------GRRDFDLNDGPVGDD 1358

Query: 434  AAMEHNPWISSAKGKANSYVPSLSGWPMNGE-LLNVSPWLPPVTSHPSLSMHPIPSDRAD 258
            AA+E +  ++          PSLSG  +NGE + + S W P   ++ ++SM PI  +R D
Sbjct: 1359 AAVEPSMVLNQHSRSGLPSQPSLSGIRVNGENMASFSTWFPAANAYSAVSMPPIMPERGD 1418

Query: 257  RTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYP----NVTQPTFPYG 108
            + +PM+A+   Q +L  ++   +++ + YRGPVL+S+ A P        P FP+G
Sbjct: 1419 QPFPMIATRGPQRMLGPTTGVSSFTPEGYRGPVLSSSPAMPFQSTTFQYPVFPFG 1473


>emb|CAB41134.1| putative protein [Arabidopsis thaliana]
          Length = 1613

 Score =  110 bits (275), Expect = 6e-22
 Identities = 83/235 (35%), Positives = 124/235 (52%), Gaps = 10/235 (4%)
 Frame = -1

Query: 782  LDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTGKL 603
            LD DLNVPDE  +ED      SQ S   T+      +  + V S    SA + S+  G L
Sbjct: 1253 LDFDLNVPDERVLED----LASQRSGNPTNCTSDITNSFDQVRSGVMGSALDHSS--GGL 1306

Query: 602  DLDLNKAEDYED--AGQVSSYNEVSNS--AVKTFADGFSNGGQSQGARGFDLNDGPSSED 435
            DLDLNK +D  D  +  ++S + + +S   VK  + G        G R FDLNDGP  +D
Sbjct: 1307 DLDLNKVDDSTDMISYTMNSSHRLDSSFQQVKLPSTG--------GRRDFDLNDGPVGDD 1358

Query: 434  AAMEHNPWISSAKGKANSYVPSLSGWPMNGE-LLNVSPWLPPVTSHPSLSMHPIPSDRAD 258
            AA+E +  ++          PSLSG  +NGE + + S W P   ++ ++SM PI  +R D
Sbjct: 1359 AAVEPSMVLNQHSRSGLPSQPSLSGIRVNGENMASFSTWFPAANAYSAVSMPPIMPERGD 1418

Query: 257  RTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYP----NVTQPTFPYG 108
            + +PM+A+   Q +L  ++   +++ + YRGPVL+S+ A P        P FP+G
Sbjct: 1419 QPFPMIATRGPQRMLGPTTGVSSFTPEGYRGPVLSSSPAMPFQSTTFQYPVFPFG 1473


>ref|XP_003634295.1| PREDICTED: uncharacterized protein LOC100248456 [Vitis vinifera]
          Length = 1631

 Score =  110 bits (274), Expect = 7e-22
 Identities = 93/272 (34%), Positives = 137/272 (50%), Gaps = 13/272 (4%)
 Frame = -1

Query: 788  PHLDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLS--APNGSTS 615
            P LD DLN+PDE  +ED      S+ SA+ TSS    VS       SRDL+   P GS  
Sbjct: 1258 PLLDFDLNMPDERILED----MTSRSSAQETSSTCDLVS-------SRDLAHDRPMGSAP 1306

Query: 614  ---TGKLDLDLNKAEDYEDAGQVSSYNE----VSNSAVKTFAD-GFSNGGQSQGARGFDL 459
               +G LDLDLN++++  D GQ S+ N     V    VK+ +  GF NG +    R FDL
Sbjct: 1307 IRCSGGLDLDLNQSDEVTDMGQHSASNSHRLVVPLLPVKSSSSVGFPNG-EVVVRRDFDL 1365

Query: 458  NDGPSSEDAAMEHNPWISSAKGKANSYVPSLSGWPMNGELLNVSPWLPPVTSHPSLSMHP 279
            N+GP  ++ + E + +   A+    S  P       N ++ N S W PP  ++ ++++  
Sbjct: 1366 NNGPVLDEVSAEPSSFSQHARSSMASQPPVACLRMNNTDIGNFSSWFPPANNYSAVTIPS 1425

Query: 278  IPSDRADRTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYPNVTQPTFPYGAY 102
            I  DR ++ +P+VA++  Q I+  S+    ++ D YRGPVL+S+ A P  + P F Y  +
Sbjct: 1426 IMPDR-EQPFPIVATNGPQRIMGLSTGGTPFNPDVYRGPVLSSSPAVPFPSTP-FQYPVF 1483

Query: 101  XXXXXXXXXXXXXSGGISSFRD--PIGATCFP 12
                         SG  +SF D    G  CFP
Sbjct: 1484 PFGTNFPLPPATFSGSSTSFTDSSSAGRLCFP 1515


>emb|CAN60153.1| hypothetical protein VITISV_021504 [Vitis vinifera]
          Length = 1688

 Score =  110 bits (274), Expect = 7e-22
 Identities = 93/272 (34%), Positives = 137/272 (50%), Gaps = 13/272 (4%)
 Frame = -1

Query: 788  PHLDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLS--APNGSTS 615
            P LD DLN+PDE  +ED      S+ SA+ TSS    VS       SRDL+   P GS  
Sbjct: 1315 PLLDFDLNMPDERILED----MTSRSSAQETSSTCDLVS-------SRDLAHDRPMGSAP 1363

Query: 614  ---TGKLDLDLNKAEDYEDAGQVSSYNE----VSNSAVKTFAD-GFSNGGQSQGARGFDL 459
               +G LDLDLN++++  D GQ S+ N     V    VK+ +  GF NG +    R FDL
Sbjct: 1364 IRCSGGLDLDLNQSDEVTDMGQHSASNSHRLVVPLLPVKSSSSVGFPNG-EVVVRRDFDL 1422

Query: 458  NDGPSSEDAAMEHNPWISSAKGKANSYVPSLSGWPMNGELLNVSPWLPPVTSHPSLSMHP 279
            N+GP  ++ + E + +   A+    S  P       N ++ N S W PP  ++ ++++  
Sbjct: 1423 NNGPVLDEVSAEPSSFSQHARSSMASQPPVACLRMNNTDIGNFSSWFPPANNYSAVTIPS 1482

Query: 278  IPSDRADRTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYPNVTQPTFPYGAY 102
            I  DR ++ +P+VA++  Q I+  S+    ++ D YRGPVL+S+ A P  + P F Y  +
Sbjct: 1483 IMPDR-EQPFPIVATNGPQRIMGLSTGGTPFNPDVYRGPVLSSSPAVPFPSTP-FQYPVF 1540

Query: 101  XXXXXXXXXXXXXSGGISSFRD--PIGATCFP 12
                         SG  +SF D    G  CFP
Sbjct: 1541 PFGTNFPLPPATFSGSSTSFTDSSSAGRLCFP 1572


>gb|ABR16880.1| unknown [Picea sitchensis]
          Length = 443

 Score =  109 bits (273), Expect = 9e-22
 Identities = 91/273 (33%), Positives = 125/273 (45%), Gaps = 16/273 (5%)
 Frame = -1

Query: 782 LDIDLNVPDEGAVEDPPVTC--VSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTG 609
           LDIDLNV  E   ED  +T    SQ     TSS    +SG +F+ S  +  AP G+ S  
Sbjct: 74  LDIDLNVAYERTSEDGVITVHLSSQTCEPSTSSGCRDMSGQDFISSIAEPFAPTGACSPV 133

Query: 608 KLDLDLNKAEDYEDAGQVSSYNEVSNSAVKTFADGF----------SNGGQSQGARGFDL 459
           K DLDLN+ +D       S  NE++   + T A+ F          S+ G S   RGFDL
Sbjct: 134 KSDLDLNRIDD-------SGENELTKMPLGTSAENFGLTLKSPTSASSLGASCVLRGFDL 186

Query: 458 NDGPSSEDAAMEHNPWISSAKGKANSYVPSLSGWPMNGELLNVSPWLPPVTSHPSLSMHP 279
           NDGP+ +D   E  P   S+  +    VP L    M GEL N S W  P  +  +L+M  
Sbjct: 187 NDGPTFDDGEDELLPQNFSSSSQP---VPDLR---MKGELFNSSSWFSPGNAFQALTMPL 240

Query: 278 IPSDRADRTYPMVASSVAQHILSSSSAP-AYSGDAYRGPVLAS---TVAYPNVTQPTFPY 111
             + R D      A+S  Q   SS S P  +SGD Y+G    S    +++ N    ++P+
Sbjct: 241 HFNARTDHQVITTAASAPQSNRSSLSGPNFFSGDIYKGQTSFSPDPIISFSNTMSTSYPF 300

Query: 110 GAYXXXXXXXXXXXXXSGGISSFRDPIGATCFP 12
             +             SGG  S+ + +G  CFP
Sbjct: 301 TGFPFGSSFPLNSASFSGGSLSYPESLGPGCFP 333


>ref|XP_006290492.1| hypothetical protein CARUB_v10016566mg [Capsella rubella]
            gi|482559199|gb|EOA23390.1| hypothetical protein
            CARUB_v10016566mg [Capsella rubella]
          Length = 1604

 Score =  109 bits (272), Expect = 1e-21
 Identities = 82/235 (34%), Positives = 119/235 (50%), Gaps = 10/235 (4%)
 Frame = -1

Query: 782  LDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTGKL 603
            LD DLNV DE  +ED      SQ S   T+      S  + V S     A + S+  G L
Sbjct: 1244 LDFDLNVADERVLED----LASQRSGNATNCTSGITSSFDRVRSGVIGLALDHSS--GGL 1297

Query: 602  DLDLNKAEDYEDAGQVSSYN----EVSNSAVKTFADGFSNGGQSQGARGFDLNDGPSSED 435
            DLDLNK +D  D    +  +    E S   VK  + G          R FDLNDGP+ +D
Sbjct: 1298 DLDLNKVDDSTDMNNYTMNSSHRLEPSFQQVKLSSTG--------SRRDFDLNDGPAGDD 1349

Query: 434  AAMEHNPWISSAKGKANSYVPSLSGWPMNGE-LLNVSPWLPPVTSHPSLSMHPIPSDRAD 258
            AA+E +  ++     A    PSLSG  MNGE + + S W P   ++ ++S+ PI  +R D
Sbjct: 1350 AAVESSVILNQHSRSALPSQPSLSGIRMNGENMASFSTWFPAANAYSAVSIPPIMPERGD 1409

Query: 257  RTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYP----NVTQPTFPYG 108
            + +PM+A+   Q +L  ++   +++ + YRGPVL+S+ A P        P FP+G
Sbjct: 1410 QPFPMIANRGPQRMLGPTTGVSSFTPEGYRGPVLSSSPAMPFQSTTFQYPVFPFG 1464


>ref|XP_002511441.1| DNA binding protein, putative [Ricinus communis]
            gi|223550556|gb|EEF52043.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 1712

 Score =  108 bits (271), Expect = 2e-21
 Identities = 82/266 (30%), Positives = 127/266 (47%), Gaps = 7/266 (2%)
 Frame = -1

Query: 788  PHLDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTG 609
            P LDIDLNVPDE   ED      +Q +  ++  E                SAP    S+G
Sbjct: 1353 PPLDIDLNVPDERIFEDMACQSTAQGNCDLSHDEPLG-------------SAP--VRSSG 1397

Query: 608  KLDLDLNKAEDYEDAGQVSSYN----EVSNSAVKTFADGFSNGGQSQGARGFDLNDGPSS 441
             LDLDLN+ ++  D G   + N    +V    VK+ + G  NG  S   R FDLNDGP  
Sbjct: 1398 GLDLDLNRVDELADIGNHLTSNGRRLDVQLHPVKSPSSGILNGEVSV-RRNFDLNDGPLV 1456

Query: 440  EDAAMEHNPWISSAKGKANSYVPSLSGWPMNG-ELLNVSPWLPPVTSHPSLSMHPIPSDR 264
            ++ + E + +    +    S++P +S   +N  E+ N S W  P   +P++++ PI   R
Sbjct: 1457 DEVSGEPSSFGQHTRNSVPSHLPPVSALRINNVEMGNFSSWFSPGHPYPAVTIQPILPGR 1516

Query: 263  ADRTYPMVASSVAQHILSSSSAPAYSGDAYRGPVLASTVAYPNVTQPTFPYGAYXXXXXX 84
             ++ +P+VA    Q +L+ ++   +S D +RG VL+S+ A P  + P F Y  +      
Sbjct: 1517 GEQPFPVVAPGGPQRMLTPTANTPFSPDIFRGSVLSSSPAVPFTSTP-FQYPVFPFGTSF 1575

Query: 83   XXXXXXXSGGISSFRDPIGAT--CFP 12
                    GG +S+ D    +  CFP
Sbjct: 1576 PLPSATFPGGSTSYVDASAGSRLCFP 1601


>gb|EOY20638.1| BAH domain,TFIIS helical bundle-like domain isoform 5 [Theobroma
            cacao]
          Length = 1583

 Score =  108 bits (270), Expect = 2e-21
 Identities = 90/278 (32%), Positives = 142/278 (51%), Gaps = 16/278 (5%)
 Frame = -1

Query: 788  PHLDIDLNVPDEGAVEDPPVTCVSQISARMTSS--ELTSVSGHNFVPSSRDL------SA 633
            P LDIDLNVPDE  +ED      S+ SA+ T S  +LT+         +RDL      SA
Sbjct: 1216 PPLDIDLNVPDERVLED----LASRSSAQGTDSAPDLTN---------NRDLTCGLMGSA 1262

Query: 632  PNGSTSTGKLDLDLNKAEDYEDAGQVSSYN----EVSNSAVKTFADGFSNGGQSQGARGF 465
            P    S+G LDLDLN+ ++  D G  S+ +    +V    +K+ + G  NG ++   R F
Sbjct: 1263 P--IRSSGGLDLDLNRVDEPIDLGNHSTGSSRRLDVPMQPLKSSSGGILNG-EASVRRDF 1319

Query: 464  DLNDGPSSEDAAMEHNPWISSAKGKANSYVPSLSGWPMNG-ELLNVSPWLPPVTSHPSLS 288
            DLN+GP+ ++ + E + +    +       P +S   +N  E+ N S W P   ++ +++
Sbjct: 1320 DLNNGPAVDEVSAEPSLFSQHNRSSNVPSQPPVSSLRINNTEMANFSSWFPTGNTYSAVT 1379

Query: 287  MHPIPSDRADRTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYPNVTQPTFPY 111
            +  I  DR ++ +P+VA+     +L   ++A  ++ D YRGPVL+S+ A P  + P F Y
Sbjct: 1380 IPSILPDRGEQPFPIVATGGPPRVLGPPTAATPFNPDVYRGPVLSSSPAVPFPSAP-FQY 1438

Query: 110  GAYXXXXXXXXXXXXXSGGISSFRD--PIGATCFPPGS 3
              +             SGG +++ D  P G  CFPP S
Sbjct: 1439 PVFPFGTTFPLPSTSFSGGSTTYVDSSPSGRLCFPPVS 1476


>gb|EOY20637.1| BAH domain,TFIIS helical bundle-like domain isoform 4 [Theobroma
            cacao]
          Length = 1442

 Score =  108 bits (270), Expect = 2e-21
 Identities = 90/278 (32%), Positives = 142/278 (51%), Gaps = 16/278 (5%)
 Frame = -1

Query: 788  PHLDIDLNVPDEGAVEDPPVTCVSQISARMTSS--ELTSVSGHNFVPSSRDL------SA 633
            P LDIDLNVPDE  +ED      S+ SA+ T S  +LT+         +RDL      SA
Sbjct: 1075 PPLDIDLNVPDERVLED----LASRSSAQGTDSAPDLTN---------NRDLTCGLMGSA 1121

Query: 632  PNGSTSTGKLDLDLNKAEDYEDAGQVSSYN----EVSNSAVKTFADGFSNGGQSQGARGF 465
            P    S+G LDLDLN+ ++  D G  S+ +    +V    +K+ + G  NG ++   R F
Sbjct: 1122 P--IRSSGGLDLDLNRVDEPIDLGNHSTGSSRRLDVPMQPLKSSSGGILNG-EASVRRDF 1178

Query: 464  DLNDGPSSEDAAMEHNPWISSAKGKANSYVPSLSGWPMNG-ELLNVSPWLPPVTSHPSLS 288
            DLN+GP+ ++ + E + +    +       P +S   +N  E+ N S W P   ++ +++
Sbjct: 1179 DLNNGPAVDEVSAEPSLFSQHNRSSNVPSQPPVSSLRINNTEMANFSSWFPTGNTYSAVT 1238

Query: 287  MHPIPSDRADRTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYPNVTQPTFPY 111
            +  I  DR ++ +P+VA+     +L   ++A  ++ D YRGPVL+S+ A P  + P F Y
Sbjct: 1239 IPSILPDRGEQPFPIVATGGPPRVLGPPTAATPFNPDVYRGPVLSSSPAVPFPSAP-FQY 1297

Query: 110  GAYXXXXXXXXXXXXXSGGISSFRD--PIGATCFPPGS 3
              +             SGG +++ D  P G  CFPP S
Sbjct: 1298 PVFPFGTTFPLPSTSFSGGSTTYVDSSPSGRLCFPPVS 1335


>gb|EOY20634.1| BAH domain,TFIIS helical bundle-like domain isoform 1 [Theobroma
            cacao] gi|508773379|gb|EOY20635.1| BAH domain,TFIIS
            helical bundle-like domain isoform 1 [Theobroma cacao]
            gi|508773380|gb|EOY20636.1| BAH domain,TFIIS helical
            bundle-like domain isoform 1 [Theobroma cacao]
            gi|508773383|gb|EOY20639.1| BAH domain,TFIIS helical
            bundle-like domain isoform 1 [Theobroma cacao]
          Length = 1630

 Score =  108 bits (270), Expect = 2e-21
 Identities = 90/278 (32%), Positives = 142/278 (51%), Gaps = 16/278 (5%)
 Frame = -1

Query: 788  PHLDIDLNVPDEGAVEDPPVTCVSQISARMTSS--ELTSVSGHNFVPSSRDL------SA 633
            P LDIDLNVPDE  +ED      S+ SA+ T S  +LT+         +RDL      SA
Sbjct: 1263 PPLDIDLNVPDERVLED----LASRSSAQGTDSAPDLTN---------NRDLTCGLMGSA 1309

Query: 632  PNGSTSTGKLDLDLNKAEDYEDAGQVSSYN----EVSNSAVKTFADGFSNGGQSQGARGF 465
            P    S+G LDLDLN+ ++  D G  S+ +    +V    +K+ + G  NG ++   R F
Sbjct: 1310 P--IRSSGGLDLDLNRVDEPIDLGNHSTGSSRRLDVPMQPLKSSSGGILNG-EASVRRDF 1366

Query: 464  DLNDGPSSEDAAMEHNPWISSAKGKANSYVPSLSGWPMNG-ELLNVSPWLPPVTSHPSLS 288
            DLN+GP+ ++ + E + +    +       P +S   +N  E+ N S W P   ++ +++
Sbjct: 1367 DLNNGPAVDEVSAEPSLFSQHNRSSNVPSQPPVSSLRINNTEMANFSSWFPTGNTYSAVT 1426

Query: 287  MHPIPSDRADRTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYPNVTQPTFPY 111
            +  I  DR ++ +P+VA+     +L   ++A  ++ D YRGPVL+S+ A P  + P F Y
Sbjct: 1427 IPSILPDRGEQPFPIVATGGPPRVLGPPTAATPFNPDVYRGPVLSSSPAVPFPSAP-FQY 1485

Query: 110  GAYXXXXXXXXXXXXXSGGISSFRD--PIGATCFPPGS 3
              +             SGG +++ D  P G  CFPP S
Sbjct: 1486 PVFPFGTTFPLPSTSFSGGSTTYVDSSPSGRLCFPPVS 1523


>ref|XP_006290494.1| hypothetical protein CARUB_v10016567mg [Capsella rubella]
            gi|482559201|gb|EOA23392.1| hypothetical protein
            CARUB_v10016567mg [Capsella rubella]
          Length = 1598

 Score =  107 bits (267), Expect = 5e-21
 Identities = 78/235 (33%), Positives = 117/235 (49%), Gaps = 10/235 (4%)
 Frame = -1

Query: 782  LDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTGKL 603
            LD DLNV DE  +ED      SQ S   T+         + + S   +       S+G L
Sbjct: 1242 LDFDLNVADERVLED----LASQKSGNATNCTSGITDSCDRIHSG--VMGLALDHSSGGL 1295

Query: 602  DLDLNKAEDYEDAGQVSSYN----EVSNSAVKTFADGFSNGGQSQGARGFDLNDGPSSED 435
            DLDLNK +D  D    +  +    E S   VK    G        G R FDLNDGP+ +D
Sbjct: 1296 DLDLNKVDDSTDMNNYTMSSSHRLEPSFQQVKLSTAG--------GRRDFDLNDGPAGDD 1347

Query: 434  AAMEHNPWISSAKGKANSYVPSLSGWPMNGE-LLNVSPWLPPVTSHPSLSMHPIPSDRAD 258
            AA+E +  ++          PSLSG  +NGE + ++S W P   ++ ++S+ PI  +R D
Sbjct: 1348 AAVESSMILNQHSRSGLPSQPSLSGIQVNGENMASISTWFPAANAYSAVSIPPIMPERGD 1407

Query: 257  RTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYP----NVTQPTFPYG 108
            + +PM+A+   Q +L  ++   +++ + YRGPVL+S+ A P        P FP+G
Sbjct: 1408 QPFPMIANRGPQRMLGPTTGVSSFTPEGYRGPVLSSSPAMPFQSTTFQYPVFPFG 1462


>ref|XP_006290493.1| hypothetical protein CARUB_v10016567mg [Capsella rubella]
            gi|482559200|gb|EOA23391.1| hypothetical protein
            CARUB_v10016567mg [Capsella rubella]
          Length = 1597

 Score =  107 bits (267), Expect = 5e-21
 Identities = 78/235 (33%), Positives = 117/235 (49%), Gaps = 10/235 (4%)
 Frame = -1

Query: 782  LDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTGKL 603
            LD DLNV DE  +ED      SQ S   T+         + + S   +       S+G L
Sbjct: 1241 LDFDLNVADERVLED----LASQKSGNATNCTSGITDSCDRIHSG--VMGLALDHSSGGL 1294

Query: 602  DLDLNKAEDYEDAGQVSSYN----EVSNSAVKTFADGFSNGGQSQGARGFDLNDGPSSED 435
            DLDLNK +D  D    +  +    E S   VK    G        G R FDLNDGP+ +D
Sbjct: 1295 DLDLNKVDDSTDMNNYTMSSSHRLEPSFQQVKLSTAG--------GRRDFDLNDGPAGDD 1346

Query: 434  AAMEHNPWISSAKGKANSYVPSLSGWPMNGE-LLNVSPWLPPVTSHPSLSMHPIPSDRAD 258
            AA+E +  ++          PSLSG  +NGE + ++S W P   ++ ++S+ PI  +R D
Sbjct: 1347 AAVESSMILNQHSRSGLPSQPSLSGIQVNGENMASISTWFPAANAYSAVSIPPIMPERGD 1406

Query: 257  RTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYP----NVTQPTFPYG 108
            + +PM+A+   Q +L  ++   +++ + YRGPVL+S+ A P        P FP+G
Sbjct: 1407 QPFPMIANRGPQRMLGPTTGVSSFTPEGYRGPVLSSSPAMPFQSTTFQYPVFPFG 1461


>ref|NP_190389.1| BAH and TFIIS domain-containing protein [Arabidopsis thaliana]
            gi|4678322|emb|CAB41133.1| putative protein [Arabidopsis
            thaliana] gi|332644841|gb|AEE78362.1| BAH and TFIIS
            domain-containing protein [Arabidopsis thaliana]
          Length = 1611

 Score =  106 bits (264), Expect = 1e-20
 Identities = 82/235 (34%), Positives = 120/235 (51%), Gaps = 10/235 (4%)
 Frame = -1

Query: 782  LDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTGKL 603
            LD DLNVPDE  +ED      SQ S   T+   TS   +NF      +       S+G  
Sbjct: 1255 LDFDLNVPDERVLED----LASQRSGNPTNC--TSGITNNFDQVRSGVMGSALDHSSG-- 1306

Query: 602  DLDLNKAEDYEDAGQ--VSSYNEVSNS--AVKTFADGFSNGGQSQGARGFDLNDGPSSED 435
             LDLNK +D  D     ++S + + +S   VK  + G        G R FDLNDGP  +D
Sbjct: 1307 GLDLNKVDDLTDMNSYTMNSSHRLDSSFQQVKLPSTG--------GRRDFDLNDGPVGDD 1358

Query: 434  AAMEHNPWISSAKGKANSYVPSLSGWPMNGE-LLNVSPWLPPVTSHPSLSMHPIPSDRAD 258
            AA+E +  ++          PSLSG  +NGE + + S W P   ++ ++SM PI  +R D
Sbjct: 1359 AAVEPSMVLNQHSRSGLPSQPSLSGIRVNGENMASFSTWFPAANAYSAVSMPPIMPERGD 1418

Query: 257  RTYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYP----NVTQPTFPYG 108
            + +PM+A+   Q +L  ++   ++S + YRGPVL+S+ A P        P FP+G
Sbjct: 1419 QPFPMIATRGPQRMLGPTTGVSSFSPEGYRGPVLSSSPAMPFQSTTFQYPVFPFG 1473


>gb|EXC31170.1| hypothetical protein L484_004936 [Morus notabilis]
          Length = 1455

 Score =  105 bits (263), Expect = 1e-20
 Identities = 83/262 (31%), Positives = 124/262 (47%), Gaps = 3/262 (1%)
 Frame = -1

Query: 788  PHLDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTG 609
            P LDIDLNVPDE  +ED     VS+ S + TSS     +  +    S  L+      S G
Sbjct: 1096 PPLDIDLNVPDERVLED----MVSRFSGQGTSSASDPANNRDLAHKSSSLTPVR---SFG 1148

Query: 608  KLDLDLNKAEDYEDAGQVSSYNEVSNSAVKTFADGFSNGGQSQ-GA-RGFDLNDGPSSED 435
             LDLDLN+ +D  D G   +Y+   ++ +  F     N   S+ GA R FDLNDGP  ++
Sbjct: 1149 GLDLDLNQVDDTSDMG---NYSIAKDNPILQFKSSSGNALSSEIGAHRDFDLNDGPDVDE 1205

Query: 434  AAMEHNPWISSAKGKANSYVPSLSGWPMNGELLNVSPWLPPVTSHPSLSMHPIPSDRADR 255
               E   +   AK    S  P +SG  +N        W  P T +P++++  I  DR + 
Sbjct: 1206 VIAESALFTQQAKSILPSQ-PPISGPRINNTEAGNYSWFHPGTPYPAVTIPSIIPDRGEP 1264

Query: 254  TYPMVASSVAQHIL-SSSSAPAYSGDAYRGPVLASTVAYPNVTQPTFPYGAYXXXXXXXX 78
             +P++A+   Q ++   S    ++ D YRGPVL+++ A P     +F Y  +        
Sbjct: 1265 LFPILAAGGPQRMMVPPSGGNPFAPDVYRGPVLSASPAVP-FPSTSFQYPVFSYGTSFSL 1323

Query: 77   XXXXXSGGISSFRDPIGATCFP 12
                 +GG ++F D     CFP
Sbjct: 1324 RPTTFAGGSTTFLDS-SRVCFP 1344


>ref|XP_002321576.2| hypothetical protein POPTR_0015s08410g [Populus trichocarpa]
            gi|550322308|gb|EEF05703.2| hypothetical protein
            POPTR_0015s08410g [Populus trichocarpa]
          Length = 1642

 Score =  105 bits (262), Expect = 2e-20
 Identities = 89/271 (32%), Positives = 137/271 (50%), Gaps = 12/271 (4%)
 Frame = -1

Query: 788  PHLDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLS--APNGST- 618
            P LDIDLNVPDE  +ED        +++R ++ E  SVS    +  + D +  A  GS  
Sbjct: 1274 PLLDIDLNVPDERILED--------LASRSSAQETVSVSD---LAKNNDCARDALMGSIP 1322

Query: 617  --STGKLDLDLNKAEDYEDAGQ-VSSYNEVSNSAVKTF--ADGFSNGGQSQGARGFDLND 453
              S+G LD DLN+A++  D G  ++S     ++ +     + GF NG +  G R FDLND
Sbjct: 1323 VRSSGGLDFDLNRADEASDIGNHLTSIGRRLDAPLHPAKSSGGFLNG-KVGGCRDFDLND 1381

Query: 452  GPSSEDAAMEHNPWISSAKGKANSYVPSLSGWPMNG-ELLNVSPWLPPVTSHPSLSMHPI 276
            GP  ++ + E +P     +    S  P +S   MN  E+ N   W P    +P++++  I
Sbjct: 1382 GPLVDEVSAEPSPLGQHTRNIVPSQ-PLISNLRMNSTEIGNFPSWFPQGNPYPAVTIQSI 1440

Query: 275  PSDRADRTYPMVASSVAQHILSSSS-APAYSGDAYRGPVLASTVAYPNVTQPTFPYGAYX 99
              DR ++ +P+VA+   Q +L+SS+ +  ++ D YRG VL+S+ A P    P F Y  + 
Sbjct: 1441 LHDRGEQPFPVVATGGPQRMLASSTGSNPFNTDVYRGAVLSSSPAVP-FPSPPFQYPVFP 1499

Query: 98   XXXXXXXXXXXXSGGISSFRDPI--GATCFP 12
                        SGG +S+ D    G  CFP
Sbjct: 1500 FGTNFPLTSATFSGGSASYVDSPSGGRLCFP 1530


>ref|XP_002866520.1| hypothetical protein ARALYDRAFT_919569 [Arabidopsis lyrata subsp.
            lyrata] gi|297312355|gb|EFH42779.1| hypothetical protein
            ARALYDRAFT_919569 [Arabidopsis lyrata subsp. lyrata]
          Length = 1597

 Score =  104 bits (259), Expect = 4e-20
 Identities = 83/267 (31%), Positives = 121/267 (45%), Gaps = 9/267 (3%)
 Frame = -1

Query: 782  LDIDLNVPDEGAVEDPPVTCVSQISARMTSSELTSVSGHNFVPSSRDLSAPNGSTSTGKL 603
            LD DLNVPDE  +ED      SQ SA                         N + S+G L
Sbjct: 1261 LDFDLNVPDERVLED----LASQRSA-------------------------NPTNSSGGL 1291

Query: 602  DLDLNKAEDYEDAGQ--VSSYNEVSNSAVKTFADGFSNGGQSQGARGFDLNDGPSSEDAA 429
            DLDLNK +D  D     +SS + V +S        F     S G R FDLNDGP+ +D++
Sbjct: 1292 DLDLNKLDDPTDMNNYTISSGHRVDSS--------FQQANFSGGRRDFDLNDGPAVDDSS 1343

Query: 428  MEHNPWISSAKGKANSYVPSLSGWPMNGELL--NVSPWLPPVTSHPSLSMHPIPSDRADR 255
            +E +   +       +  P +SG  MNGE +    S W P   ++ ++S+  +  DR D 
Sbjct: 1344 VESSMVFTQHSRSGLTSQPMISGIRMNGEHMAAGFSSWFPAANNYSAMSIPQVLPDRGDH 1403

Query: 254  TYPMVASSVAQHILS-SSSAPAYSGDAYRGPVLASTVAYPNVTQP--TFPYGAYXXXXXX 84
             +P++ S+  Q ++  +S   +++ D YRGPVL S+   P V+ P   F Y A+      
Sbjct: 1404 PFPVITSNGPQRMVGPTSGVSSFTPDMYRGPVLLSS---PAVSFPPTAFQYPAFPFGTSF 1460

Query: 83   XXXXXXXSGGISSFRD--PIGATCFPP 9
                    G  + + D    G  CFPP
Sbjct: 1461 PLASANFPGSSTPYMDSSSSGRLCFPP 1487

