BLASTX nr result

ID: Perilla23_contig00019073 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00019073
         (676 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011094382.1| PREDICTED: uncharacterized protein LOC105174...   307   4e-81
ref|XP_011094381.1| PREDICTED: uncharacterized protein LOC105174...   307   4e-81
ref|XP_012840132.1| PREDICTED: uncharacterized protein LOC105960...   291   2e-76
gb|EYU35152.1| hypothetical protein MIMGU_mgv1a001852mg [Erythra...   291   2e-76
ref|XP_012492709.1| PREDICTED: uncharacterized protein LOC105804...   221   4e-55
ref|XP_012492702.1| PREDICTED: uncharacterized protein LOC105804...   221   4e-55
ref|XP_011094383.1| PREDICTED: uncharacterized protein LOC105174...   218   2e-54
emb|CDP15391.1| unnamed protein product [Coffea canephora]            211   4e-52
ref|XP_012492696.1| PREDICTED: uncharacterized protein LOC105804...   210   6e-52
ref|XP_012492689.1| PREDICTED: uncharacterized protein LOC105804...   210   6e-52
ref|XP_012492674.1| PREDICTED: uncharacterized protein LOC105804...   210   6e-52
ref|XP_002311825.2| hypothetical protein POPTR_0008s20540g [Popu...   209   1e-51
gb|KDO61089.1| hypothetical protein CISIN_1g008363mg [Citrus sin...   207   4e-51
ref|XP_006481815.1| PREDICTED: uncharacterized protein LOC102609...   207   4e-51
ref|XP_006481813.1| PREDICTED: uncharacterized protein LOC102609...   207   4e-51
gb|KHG10240.1| General transcription factor 3C polypeptide 2 [Go...   207   5e-51
ref|XP_007049744.1| DNA binding protein, putative isoform 1 [The...   204   4e-50
ref|XP_011010279.1| PREDICTED: uncharacterized protein LOC105115...   196   8e-48
ref|XP_011010277.1| PREDICTED: uncharacterized protein LOC105115...   196   8e-48
ref|XP_013733163.1| PREDICTED: uncharacterized protein LOC106436...   195   2e-47

>ref|XP_011094382.1| PREDICTED: uncharacterized protein LOC105174093 isoform X2 [Sesamum
            indicum]
          Length = 870

 Score =  307 bits (786), Expect = 4e-81
 Identities = 155/289 (53%), Positives = 185/289 (64%), Gaps = 64/289 (22%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPFQGITYGLDWLPDPRCIFGSM 496
            AWAP+Q  LE AN+IIT G KGFK WDIRDPF PLW H   G+TYGL+WLPDPRC+FGS+
Sbjct: 557  AWAPVQGELESANVIITTGPKGFKVWDIRDPFRPLWDHHIPGVTYGLEWLPDPRCVFGSI 616

Query: 495  EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316
            +DGT+WLLSLERA+HDIPVTGK    A KHGFHSF CSSF+IW + ASRLTGMVAYCGEE
Sbjct: 617  DDGTLWLLSLERAAHDIPVTGKSITAAPKHGFHSFDCSSFSIWCIHASRLTGMVAYCGEE 676

Query: 315  GSTICFKPTTRSLKDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSRG- 139
            G+T CF+PTTRS++DPSRNR+ HYLCGS+L+EE A+I+ASP   S  Q +SPGMKRS G 
Sbjct: 677  GTTFCFQPTTRSVRDPSRNRLHHYLCGSLLEEETALIVASPSTSSFLQKRSPGMKRSGGA 736

Query: 138  KEEERRAKEQSNRQVT---------AWKEERDE----------EMEAVPPK--------- 43
            K++E+R KEQ  + VT          W +  +E          + +A  PK         
Sbjct: 737  KDQEKRVKEQMAKSVTCNEPPTPAICWSDHVEEHGSDKSSMVIKKQASKPKESSKTQSQA 796

Query: 42   -----------------------------------IVAMHRVRWNVNRG 1
                                               IVAMHRVRWNVN+G
Sbjct: 797  NQETVLCRSEDAGQLQREGSGKEEKGDTVEVFPPKIVAMHRVRWNVNKG 845


>ref|XP_011094381.1| PREDICTED: uncharacterized protein LOC105174093 isoform X1 [Sesamum
            indicum]
          Length = 874

 Score =  307 bits (786), Expect = 4e-81
 Identities = 155/289 (53%), Positives = 185/289 (64%), Gaps = 64/289 (22%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPFQGITYGLDWLPDPRCIFGSM 496
            AWAP+Q  LE AN+IIT G KGFK WDIRDPF PLW H   G+TYGL+WLPDPRC+FGS+
Sbjct: 561  AWAPVQGELESANVIITTGPKGFKVWDIRDPFRPLWDHHIPGVTYGLEWLPDPRCVFGSI 620

Query: 495  EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316
            +DGT+WLLSLERA+HDIPVTGK    A KHGFHSF CSSF+IW + ASRLTGMVAYCGEE
Sbjct: 621  DDGTLWLLSLERAAHDIPVTGKSITAAPKHGFHSFDCSSFSIWCIHASRLTGMVAYCGEE 680

Query: 315  GSTICFKPTTRSLKDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSRG- 139
            G+T CF+PTTRS++DPSRNR+ HYLCGS+L+EE A+I+ASP   S  Q +SPGMKRS G 
Sbjct: 681  GTTFCFQPTTRSVRDPSRNRLHHYLCGSLLEEETALIVASPSTSSFLQKRSPGMKRSGGA 740

Query: 138  KEEERRAKEQSNRQVT---------AWKEERDE----------EMEAVPPK--------- 43
            K++E+R KEQ  + VT          W +  +E          + +A  PK         
Sbjct: 741  KDQEKRVKEQMAKSVTCNEPPTPAICWSDHVEEHGSDKSSMVIKKQASKPKESSKTQSQA 800

Query: 42   -----------------------------------IVAMHRVRWNVNRG 1
                                               IVAMHRVRWNVN+G
Sbjct: 801  NQETVLCRSEDAGQLQREGSGKEEKGDTVEVFPPKIVAMHRVRWNVNKG 849


>ref|XP_012840132.1| PREDICTED: uncharacterized protein LOC105960491 [Erythranthe
            guttatus]
          Length = 808

 Score =  291 bits (746), Expect = 2e-76
 Identities = 138/249 (55%), Positives = 174/249 (69%), Gaps = 24/249 (9%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPFQGITYGLDWLPDPRCIFGSM 496
            AWAP Q++LE AN+I+TAG KGFKFWDIRDPF PLW H  QG+TYGL WL DPRC+FGS+
Sbjct: 535  AWAPNQTDLESANVIVTAGHKGFKFWDIRDPFRPLWDHAMQGVTYGLSWLRDPRCVFGSV 594

Query: 495  EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316
            +DGT+W   LE  + DIP+TGKC   A+K GFHSF CSSF+IW++QAS LTG+VAYCGE 
Sbjct: 595  DDGTLWFHRLENTASDIPITGKCVAAATKQGFHSFDCSSFSIWNVQASPLTGVVAYCGEA 654

Query: 315  GSTICFKPTTRSLKDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSRG- 139
            G+T+CF+PT RS+KDPSRNR  H LCGS+L+EE A+I+A+P   +S   + PGMKRS G 
Sbjct: 655  GTTLCFQPTARSVKDPSRNRRTHLLCGSLLEEEDALIVATPSTSTSHSRRYPGMKRSGGA 714

Query: 138  KEEERRAKEQSNRQ---VTAWKEE--------------------RDEEMEAVPPKIVAMH 28
            K+ E++ KEQ N +      W+ +                     D + E  P K VA+H
Sbjct: 715  KDLEKKFKEQINNEQPLAICWRGDLEETKKQEPKSKETNKDQLKNDNKREVFPGKNVAIH 774

Query: 27   RVRWNVNRG 1
            RVRWN N+G
Sbjct: 775  RVRWNANKG 783


>gb|EYU35152.1| hypothetical protein MIMGU_mgv1a001852mg [Erythranthe guttata]
          Length = 749

 Score =  291 bits (746), Expect = 2e-76
 Identities = 138/249 (55%), Positives = 174/249 (69%), Gaps = 24/249 (9%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPFQGITYGLDWLPDPRCIFGSM 496
            AWAP Q++LE AN+I+TAG KGFKFWDIRDPF PLW H  QG+TYGL WL DPRC+FGS+
Sbjct: 476  AWAPNQTDLESANVIVTAGHKGFKFWDIRDPFRPLWDHAMQGVTYGLSWLRDPRCVFGSV 535

Query: 495  EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316
            +DGT+W   LE  + DIP+TGKC   A+K GFHSF CSSF+IW++QAS LTG+VAYCGE 
Sbjct: 536  DDGTLWFHRLENTASDIPITGKCVAAATKQGFHSFDCSSFSIWNVQASPLTGVVAYCGEA 595

Query: 315  GSTICFKPTTRSLKDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSRG- 139
            G+T+CF+PT RS+KDPSRNR  H LCGS+L+EE A+I+A+P   +S   + PGMKRS G 
Sbjct: 596  GTTLCFQPTARSVKDPSRNRRTHLLCGSLLEEEDALIVATPSTSTSHSRRYPGMKRSGGA 655

Query: 138  KEEERRAKEQSNRQ---VTAWKEE--------------------RDEEMEAVPPKIVAMH 28
            K+ E++ KEQ N +      W+ +                     D + E  P K VA+H
Sbjct: 656  KDLEKKFKEQINNEQPLAICWRGDLEETKKQEPKSKETNKDQLKNDNKREVFPGKNVAIH 715

Query: 27   RVRWNVNRG 1
            RVRWN N+G
Sbjct: 716  RVRWNANKG 724


>ref|XP_012492709.1| PREDICTED: uncharacterized protein LOC105804550 isoform X5 [Gossypium
            raimondii]
          Length = 852

 Score =  221 bits (562), Expect = 4e-55
 Identities = 124/298 (41%), Positives = 166/298 (55%), Gaps = 73/298 (24%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            AWAP  S++E +N+I+TAG  G KFWDIRDPF PLW VHP     Y LDWLP+PRC+  S
Sbjct: 523  AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 582

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DGTM LLSL +A+ D+PVTGK    + + G H ++CSSFAIW +Q SRL GMVAYCG 
Sbjct: 583  FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLIGMVAYCGA 642

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157
            +G+  CF+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D+    K P      
Sbjct: 643  DGTAACFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDNPLPLKKPSSECGD 702

Query: 156  --------MKRSRGKE-EERRAKEQSNRQVT----------------------------- 91
                    +  S GK  ++R+AK  ++ Q T                             
Sbjct: 703  GQRSMRYFLTESLGKNAKDRKAKVPTSNQRTLALCYGNDPSLESEPEETLAALKSKMNPN 762

Query: 90   ---------------------AWKEERDE-------EMEAVPPKIVAMHRVRWNVNRG 1
                                 A  +ER+E       +ME  PPKIVAMHR+RWN+N+G
Sbjct: 763  SKSDGKKKANDSQALAQGTKEATNKEREETEKEGESQMETFPPKIVAMHRLRWNMNKG 820


>ref|XP_012492702.1| PREDICTED: uncharacterized protein LOC105804550 isoform X4 [Gossypium
            raimondii]
          Length = 852

 Score =  221 bits (562), Expect = 4e-55
 Identities = 124/298 (41%), Positives = 166/298 (55%), Gaps = 73/298 (24%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            AWAP  S++E +N+I+TAG  G KFWDIRDPF PLW VHP     Y LDWLP+PRC+  S
Sbjct: 523  AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 582

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DGTM LLSL +A+ D+PVTGK    + + G H ++CSSFAIW +Q SRL GMVAYCG 
Sbjct: 583  FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLIGMVAYCGA 642

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157
            +G+  CF+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D+    K P      
Sbjct: 643  DGTAACFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDNPLPLKKPSSECGD 702

Query: 156  --------MKRSRGKE-EERRAKEQSNRQVT----------------------------- 91
                    +  S GK  ++R+AK  ++ Q T                             
Sbjct: 703  GQRSMRYFLTESLGKNAKDRKAKVPTSNQRTLALYDGNDPSVESEPEETLAALKSKMKPN 762

Query: 90   ---------------------AWKEERDE-------EMEAVPPKIVAMHRVRWNVNRG 1
                                 A  +ER+E       +ME  PPKIVAMHR+RWN+N+G
Sbjct: 763  SKSDGKKKANDSQALAQGTKEATNKEREETEKEGESQMETFPPKIVAMHRLRWNMNKG 820


>ref|XP_011094383.1| PREDICTED: uncharacterized protein LOC105174093 isoform X3 [Sesamum
           indicum]
          Length = 692

 Score =  218 bits (556), Expect = 2e-54
 Identities = 94/127 (74%), Positives = 107/127 (84%)
 Frame = -2

Query: 675 AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPFQGITYGLDWLPDPRCIFGSM 496
           AWAP+Q  LE AN+IIT G KGFK WDIRDPF PLW H   G+TYGL+WLPDPRC+FGS+
Sbjct: 561 AWAPVQGELESANVIITTGPKGFKVWDIRDPFRPLWDHHIPGVTYGLEWLPDPRCVFGSI 620

Query: 495 EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316
           +DGT+WLLSLERA+HDIPVTGK    A KHGFHSF CSSF+IW + ASRLTGMVAYCGEE
Sbjct: 621 DDGTLWLLSLERAAHDIPVTGKSITAAPKHGFHSFDCSSFSIWCIHASRLTGMVAYCGEE 680

Query: 315 GSTICFK 295
           G+T CF+
Sbjct: 681 GTTFCFQ 687


>emb|CDP15391.1| unnamed protein product [Coffea canephora]
          Length = 942

 Score =  211 bits (536), Expect = 4e-52
 Identities = 116/289 (40%), Positives = 157/289 (54%), Gaps = 65/289 (22%)
 Frame = -2

Query: 672  WAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGSM 496
            W P+ S  E ANII+TAG +G KFWD+RDPF PLW  +PFQ + Y LDWLPDPRCI  S 
Sbjct: 630  WVPVSSYSESANIIVTAGHRGLKFWDLRDPFRPLWDFYPFQRVIYSLDWLPDPRCIIVSF 689

Query: 495  EDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGEE 316
            +DG + +LSL +A++D PVTGK    A + GFHS+ CS F IWS+  SRLTGMVAYCG +
Sbjct: 690  DDGALRILSLLKAANDAPVTGKPFEGAQQKGFHSYLCSPFQIWSVHTSRLTGMVAYCGAD 749

Query: 315  GSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTK------SPG 157
            G+ + F+ TTR++ KDP RNR PH+LCG++ +E   + + + L ++ F  +         
Sbjct: 750  GTALRFQLTTRAVEKDPLRNRAPHFLCGALTEENSTLTMFTSLPNTPFPMRKSLREWGEA 809

Query: 156  MKRSRG----KEEERRAKE-----------------------------------QSNRQV 94
             +  RG      +E+RAK+                                   ++ +  
Sbjct: 810  PRTVRGYISVSNQEKRAKQKVVKVRSEEKHKALCKRGDLDSEFGPDCMAVTETREAGKVK 869

Query: 93   TAWKEERD------------------EEMEAVPPKIVAMHRVRWNVNRG 1
            T+   E D                  EE+E  P K VAMHRVRWN N+G
Sbjct: 870  TSSNSEADQRPIMVGEDNPDIMRGEVEEVEVFPSKTVAMHRVRWNTNKG 918


>ref|XP_012492696.1| PREDICTED: uncharacterized protein LOC105804550 isoform X3 [Gossypium
            raimondii]
          Length = 877

 Score =  210 bits (535), Expect = 6e-52
 Identities = 109/231 (47%), Positives = 150/231 (64%), Gaps = 17/231 (7%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            AWAP  S++E +N+I+TAG  G KFWDIRDPF PLW VHP     Y LDWLP+PRC+  S
Sbjct: 513  AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 572

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DGTM LLSL +A+ D+PVTGK    + + G H ++CSSFAIW +Q SRL GMVAYCG 
Sbjct: 573  FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLIGMVAYCGA 632

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157
            +G+  CF+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D+    K P      
Sbjct: 633  DGTAACFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDNPLPLKKPSSECGD 692

Query: 156  --------MKRSRGKE-EERRAK-EQSNRQVTAWKEERDEEMEAVPPKIVA 34
                    +  S GK  ++R+AK   SN++  A  +  D  +E+ P + +A
Sbjct: 693  GQRSMRYFLTESLGKNAKDRKAKVPTSNQRTLALYDGNDPSVESEPEETLA 743


>ref|XP_012492689.1| PREDICTED: uncharacterized protein LOC105804550 isoform X2 [Gossypium
            raimondii] gi|763743438|gb|KJB10937.1| hypothetical
            protein B456_001G233200 [Gossypium raimondii]
          Length = 886

 Score =  210 bits (535), Expect = 6e-52
 Identities = 109/231 (47%), Positives = 150/231 (64%), Gaps = 17/231 (7%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            AWAP  S++E +N+I+TAG  G KFWDIRDPF PLW VHP     Y LDWLP+PRC+  S
Sbjct: 522  AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 581

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DGTM LLSL +A+ D+PVTGK    + + G H ++CSSFAIW +Q SRL GMVAYCG 
Sbjct: 582  FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLIGMVAYCGA 641

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157
            +G+  CF+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D+    K P      
Sbjct: 642  DGTAACFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDNPLPLKKPSSECGD 701

Query: 156  --------MKRSRGKE-EERRAK-EQSNRQVTAWKEERDEEMEAVPPKIVA 34
                    +  S GK  ++R+AK   SN++  A  +  D  +E+ P + +A
Sbjct: 702  GQRSMRYFLTESLGKNAKDRKAKVPTSNQRTLALYDGNDPSVESEPEETLA 752


>ref|XP_012492674.1| PREDICTED: uncharacterized protein LOC105804550 isoform X1 [Gossypium
            raimondii] gi|823127124|ref|XP_012492682.1| PREDICTED:
            uncharacterized protein LOC105804550 isoform X1
            [Gossypium raimondii] gi|763743433|gb|KJB10932.1|
            hypothetical protein B456_001G233200 [Gossypium
            raimondii] gi|763743437|gb|KJB10936.1| hypothetical
            protein B456_001G233200 [Gossypium raimondii]
          Length = 887

 Score =  210 bits (535), Expect = 6e-52
 Identities = 109/231 (47%), Positives = 150/231 (64%), Gaps = 17/231 (7%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            AWAP  S++E +N+I+TAG  G KFWDIRDPF PLW VHP     Y LDWLP+PRC+  S
Sbjct: 523  AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 582

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DGTM LLSL +A+ D+PVTGK    + + G H ++CSSFAIW +Q SRL GMVAYCG 
Sbjct: 583  FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLIGMVAYCGA 642

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157
            +G+  CF+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D+    K P      
Sbjct: 643  DGTAACFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDNPLPLKKPSSECGD 702

Query: 156  --------MKRSRGKE-EERRAK-EQSNRQVTAWKEERDEEMEAVPPKIVA 34
                    +  S GK  ++R+AK   SN++  A  +  D  +E+ P + +A
Sbjct: 703  GQRSMRYFLTESLGKNAKDRKAKVPTSNQRTLALYDGNDPSVESEPEETLA 753


>ref|XP_002311825.2| hypothetical protein POPTR_0008s20540g [Populus trichocarpa]
            gi|550333546|gb|EEE89192.2| hypothetical protein
            POPTR_0008s20540g [Populus trichocarpa]
          Length = 813

 Score =  209 bits (532), Expect = 1e-51
 Identities = 118/291 (40%), Positives = 157/291 (53%), Gaps = 66/291 (22%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            AW P +S+ E  N+I+TAG  G KFWDIRDPF PLW +HP   + Y LDWLPDPRCI  S
Sbjct: 491  AWVPSESDQESPNLILTAGHLGLKFWDIRDPFRPLWDLHPAPKLIYSLDWLPDPRCIILS 550

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DGTM LLSL RA++D  V GK      + G H  +CSSFAIWS+Q SRLTGMVAYC  
Sbjct: 551  FDDGTMRLLSLARAAYDAAVNGKPSVGPKQLGMHVVNCSSFAIWSVQVSRLTGMVAYCSA 610

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSR 142
            +G+   F+ TT+++ KDPSR+R PH+ CGS+ ++E AII+ +PL D+    K P      
Sbjct: 611  DGTVCRFQLTTKAVEKDPSRHRAPHFGCGSLSEDESAIIVGTPLPDTPLPLKKPVNDVGN 670

Query: 141  GKEEERRAK---------------------------EQSNRQVTAWKEER---------- 73
              + ++R                               S+  +TA K +R          
Sbjct: 671  NPKSKQRLSVSNKAAKIPTSDDPPLALCYGDDPGMDHGSDETLTATKSKRKPKSKSGSKQ 730

Query: 72   -----------DEEME----------------AVPPKIVAMHRVRWNVNRG 1
                       D+E +                ++PPK+VAMHRVRWN+N+G
Sbjct: 731  MEGEDQALVCIDDEQDVKQKGGGKEGAGNVVESIPPKMVAMHRVRWNMNKG 781


>gb|KDO61089.1| hypothetical protein CISIN_1g008363mg [Citrus sinensis]
          Length = 568

 Score =  207 bits (528), Expect = 4e-51
 Identities = 117/299 (39%), Positives = 161/299 (53%), Gaps = 74/299 (24%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            +WAP +S+ + AN+I+TAG  G KFWDIRDPF PLW +HP     YGLDWLPDP C+  S
Sbjct: 236  SWAPAESDSDSANVILTAGHGGLKFWDIRDPFRPLWDIHPAPKFIYGLDWLPDPGCVILS 295

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DG M ++SL +A++D+P TGK      + G H  +CSSFAIWS+Q SRLTGMVAYC  
Sbjct: 296  FDDGAMRIVSLLKAAYDVPATGKPFAGTKQQGLHLVNCSSFAIWSVQVSRLTGMVAYCSA 355

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPL---------------- 190
            +G+   F+ T +++ KD SRNR  H+LCGS+ ++E AI + +PL                
Sbjct: 356  DGTVHRFQLTAKAVEKDHSRNRPMHFLCGSVTEDESAITVNTPLDNTPVPLKKTVHDAGE 415

Query: 189  ------LDSSFQTKSPGMKRSR------------------GKEEE--------RRAKEQS 106
                  L  S  +KSP  K+ +                  G+E E        +  ++  
Sbjct: 416  RSMRSFLIESNSSKSPNDKKGKNVLSSDNQPLALCYGNEPGEESEGDMTLAALKNKQKPK 475

Query: 105  NRQVTAWKEERDEEM------------------------EAVPPKIVAMHRVRWNVNRG 1
            +R  +  KEE D+ M                        E +PPK+VAMHRVRWN+N+G
Sbjct: 476  SRSSSKKKEEDDQAMVCIDEEATDIQGKENEKGEAGNGIEVLPPKVVAMHRVRWNMNKG 534


>ref|XP_006481815.1| PREDICTED: uncharacterized protein LOC102609984 isoform X3 [Citrus
            sinensis]
          Length = 801

 Score =  207 bits (528), Expect = 4e-51
 Identities = 117/299 (39%), Positives = 161/299 (53%), Gaps = 74/299 (24%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            +WAP +S+ + AN+I+TAG  G KFWDIRDPF PLW +HP     YGLDWLPDP C+  S
Sbjct: 469  SWAPAESDSDSANVILTAGHGGLKFWDIRDPFRPLWDIHPAPKFIYGLDWLPDPGCVILS 528

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DG M ++SL +A++D+P TGK      + G H  +CSSFAIWS+Q SRLTGMVAYC  
Sbjct: 529  FDDGAMRIVSLLKAAYDVPATGKPFAGTKQQGLHLVNCSSFAIWSVQVSRLTGMVAYCSA 588

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPL---------------- 190
            +G+   F+ T +++ KD SRNR  H+LCGS+ ++E AI + +PL                
Sbjct: 589  DGTVHRFQLTAKAVEKDHSRNRPMHFLCGSVTEDESAITVNTPLDNTPVPLKKTVHDAGE 648

Query: 189  ------LDSSFQTKSPGMKRSR------------------GKEEE--------RRAKEQS 106
                  L  S  +KSP  K+ +                  G+E E        +  ++  
Sbjct: 649  RSMRSFLIESNSSKSPNDKKGKNVLSSDNQPLALCYGNEPGEESEGDMTLAALKNKQKPK 708

Query: 105  NRQVTAWKEERDEEM------------------------EAVPPKIVAMHRVRWNVNRG 1
            +R  +  KEE D+ M                        E +PPK+VAMHRVRWN+N+G
Sbjct: 709  SRSSSKKKEEDDQAMVCIDEEATDIQGKENAKGEAGNGIEVLPPKVVAMHRVRWNMNKG 767


>ref|XP_006481813.1| PREDICTED: uncharacterized protein LOC102609984 isoform X1 [Citrus
            sinensis] gi|568856485|ref|XP_006481814.1| PREDICTED:
            uncharacterized protein LOC102609984 isoform X2 [Citrus
            sinensis]
          Length = 911

 Score =  207 bits (528), Expect = 4e-51
 Identities = 117/299 (39%), Positives = 161/299 (53%), Gaps = 74/299 (24%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            +WAP +S+ + AN+I+TAG  G KFWDIRDPF PLW +HP     YGLDWLPDP C+  S
Sbjct: 579  SWAPAESDSDSANVILTAGHGGLKFWDIRDPFRPLWDIHPAPKFIYGLDWLPDPGCVILS 638

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DG M ++SL +A++D+P TGK      + G H  +CSSFAIWS+Q SRLTGMVAYC  
Sbjct: 639  FDDGAMRIVSLLKAAYDVPATGKPFAGTKQQGLHLVNCSSFAIWSVQVSRLTGMVAYCSA 698

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPL---------------- 190
            +G+   F+ T +++ KD SRNR  H+LCGS+ ++E AI + +PL                
Sbjct: 699  DGTVHRFQLTAKAVEKDHSRNRPMHFLCGSVTEDESAITVNTPLDNTPVPLKKTVHDAGE 758

Query: 189  ------LDSSFQTKSPGMKRSR------------------GKEEE--------RRAKEQS 106
                  L  S  +KSP  K+ +                  G+E E        +  ++  
Sbjct: 759  RSMRSFLIESNSSKSPNDKKGKNVLSSDNQPLALCYGNEPGEESEGDMTLAALKNKQKPK 818

Query: 105  NRQVTAWKEERDEEM------------------------EAVPPKIVAMHRVRWNVNRG 1
            +R  +  KEE D+ M                        E +PPK+VAMHRVRWN+N+G
Sbjct: 819  SRSSSKKKEEDDQAMVCIDEEATDIQGKENAKGEAGNGIEVLPPKVVAMHRVRWNMNKG 877


>gb|KHG10240.1| General transcription factor 3C polypeptide 2 [Gossypium arboreum]
          Length = 925

 Score =  207 bits (527), Expect = 5e-51
 Identities = 109/231 (47%), Positives = 148/231 (64%), Gaps = 17/231 (7%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            AWAP  S++E +N+I+TAG  G KFWDIRDPF PLW VHP     Y LDWLP+PRC+  S
Sbjct: 561  AWAPSGSDMESSNVILTAGHGGVKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVIIS 620

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DGTM LLSL +A+ D+PVTGK    + + G H ++CSSFAIW +Q SRLTGMVAYCG 
Sbjct: 621  FDDGTMKLLSLVQAACDVPVTGKPFGGSKQQGLHVYNCSSFAIWCVQVSRLTGMVAYCGA 680

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPG----- 157
            +G+   F+ T++++ KD SRNR PH+ CGS+ +EE A+I+ +PL D     K P      
Sbjct: 681  DGTATYFQLTSKAVDKDFSRNRSPHFACGSLTEEEPAVIVNTPLPDKPLPLKKPSSECGD 740

Query: 156  --------MKRSRGKE-EERRAK-EQSNRQVTAWKEERDEEMEAVPPKIVA 34
                    +  S GK  + R+AK   SN++  A  +  D  +E+ P + +A
Sbjct: 741  GQRSMRYFLTESLGKNAKHRKAKVPTSNQRTLALYDGNDPSVESEPDETLA 791


>ref|XP_007049744.1| DNA binding protein, putative isoform 1 [Theobroma cacao]
            gi|508702005|gb|EOX93901.1| DNA binding protein, putative
            isoform 1 [Theobroma cacao]
          Length = 868

 Score =  204 bits (519), Expect = 4e-50
 Identities = 99/198 (50%), Positives = 132/198 (66%), Gaps = 2/198 (1%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            AWAP  S++E AN+++TAG  G KFWDIRDPF PLW VHP     Y LDWLP+PRC+  S
Sbjct: 540  AWAPSGSDMESANVVLTAGHGGLKFWDIRDPFLPLWDVHPAPKFIYSLDWLPEPRCVILS 599

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DGTM +LSL +A+ D+PVTGK      + G H ++CSSFAIW++Q SRLTGMVAYCG 
Sbjct: 600  FDDGTMKMLSLIQAACDVPVTGKPFTGTKQQGLHLYNCSSFAIWNVQVSRLTGMVAYCGA 659

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMKRSR 142
            +G+   F+ T++++ KD SRNR PH++CGS+ +EE AI++ +PL D     K        
Sbjct: 660  DGNVTRFQLTSKAVDKDFSRNRAPHFVCGSLTEEESAIVVNTPLPDIPLTLKKQTNDYGE 719

Query: 141  GKEEERRAKEQSNRQVTA 88
            G    R    +SN+   A
Sbjct: 720  GPRSMRAFLTESNQAKNA 737


>ref|XP_011010279.1| PREDICTED: uncharacterized protein LOC105115164 isoform X2 [Populus
            euphratica]
          Length = 930

 Score =  196 bits (499), Expect = 8e-48
 Identities = 95/174 (54%), Positives = 121/174 (69%), Gaps = 2/174 (1%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            AW P +S+ E  N+I+TAG  G KFWDIRDPF PLW +HP   + Y LDWLPDPRCI  S
Sbjct: 608  AWVPSESDQESPNLILTAGHLGLKFWDIRDPFRPLWDLHPAPKLIYSLDWLPDPRCIILS 667

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DGTM LLSL RA++D  V GK      + G H  +CSSFAIWS+Q SRLTGMVAYC  
Sbjct: 668  FDDGTMRLLSLARAAYDAAVNGKPSVGPKQLGMHVVNCSSFAIWSVQVSRLTGMVAYCSA 727

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSP 160
            +G+   F+ TT+++ KDPSR+R PH+ CG + ++E AII+ +PL D+    K P
Sbjct: 728  DGTVCRFQLTTKAVEKDPSRHRAPHFGCGFLSEDESAIIVGTPLPDNPLPLKKP 781


>ref|XP_011010277.1| PREDICTED: uncharacterized protein LOC105115164 isoform X1 [Populus
            euphratica] gi|743931985|ref|XP_011010278.1| PREDICTED:
            uncharacterized protein LOC105115164 isoform X1 [Populus
            euphratica]
          Length = 931

 Score =  196 bits (499), Expect = 8e-48
 Identities = 95/174 (54%), Positives = 121/174 (69%), Gaps = 2/174 (1%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLW-VHPFQGITYGLDWLPDPRCIFGS 499
            AW P +S+ E  N+I+TAG  G KFWDIRDPF PLW +HP   + Y LDWLPDPRCI  S
Sbjct: 609  AWVPSESDQESPNLILTAGHLGLKFWDIRDPFRPLWDLHPAPKLIYSLDWLPDPRCIILS 668

Query: 498  MEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCGE 319
             +DGTM LLSL RA++D  V GK      + G H  +CSSFAIWS+Q SRLTGMVAYC  
Sbjct: 669  FDDGTMRLLSLARAAYDAAVNGKPSVGPKQLGMHVVNCSSFAIWSVQVSRLTGMVAYCSA 728

Query: 318  EGSTICFKPTTRSL-KDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSP 160
            +G+   F+ TT+++ KDPSR+R PH+ CG + ++E AII+ +PL D+    K P
Sbjct: 729  DGTVCRFQLTTKAVEKDPSRHRAPHFGCGFLSEDESAIIVGTPLPDNPLPLKKP 782


>ref|XP_013733163.1| PREDICTED: uncharacterized protein LOC106436761 [Brassica napus]
          Length = 2668

 Score =  195 bits (496), Expect = 2e-47
 Identities = 111/287 (38%), Positives = 156/287 (54%), Gaps = 62/287 (21%)
 Frame = -2

Query: 675  AWAPIQSNLEGANIIITAGSKGFKFWDIRDPFHPLWVHPF-QGITY-GLDWLPDPRCIFG 502
            +WAP QS+   A +IITAG KG KFWD+RDPFHPL  +   QG+    +DWLP+PRCI  
Sbjct: 566  SWAPFQSDSGSATVIITAGHKGLKFWDLRDPFHPLREYNVGQGVNICSVDWLPEPRCIII 625

Query: 501  SMEDGTMWLLSLERASHDIPVTGKCHNVASKHGFHSFHCSSFAIWSLQASRLTGMVAYCG 322
            S +DGT+ +LSL +A++D+PVTG       + GFHSF  S  AIW++QASR+TG+VAYCG
Sbjct: 626  SCDDGTLKILSLPKAAYDVPVTGNFLVGTKQQGFHSFSRSLLAIWNVQASRVTGLVAYCG 685

Query: 321  EEGSTICFKPTTRSLKDPSRNRVPHYLCGSILDEEGAIIIASPLLDSSFQTKSPGMK--- 151
             +G+ + F+ T+R   D  RNR PH+LCGS  ++E  I + +P+ +S F+T   G +   
Sbjct: 686  ADGTAVRFQLTSRMENDAVRNRTPHFLCGSFSEDESGISVVTPVPNSPFRTFYSGKQWRD 745

Query: 150  --------RSRGKEEERRAKEQSNRQVTAWKEERD------------------------- 70
                     +    +E+RA EQS+ Q  A     D                         
Sbjct: 746  TISRFPHGVNSVPNQEKRAMEQSDEQPLALCYGNDPNVEGGSDDELVAQKSKQASKAKTK 805

Query: 69   ------------------------EEMEAVPPKIVAMHRVRWNVNRG 1
                                    E++E +PPK V+++RVRWN+NRG
Sbjct: 806  TTSKKPKASDCALICNEEEPTRLREKLEELPPKDVSINRVRWNMNRG 852

