BLASTX nr result

ID: Cornus23_contig00005153 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00005153
         (2044 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012087437.1| PREDICTED: protein EMSY-LIKE 3 isoform X3 [J...   610   e-171
ref|XP_012087435.1| PREDICTED: protein EMSY-LIKE 3 isoform X1 [J...   605   e-170
ref|XP_012087436.1| PREDICTED: protein EMSY-LIKE 3 isoform X2 [J...   596   e-167
ref|XP_007014847.1| Emsy N Terminus/ plant Tudor-like domains-co...   591   e-166
ref|XP_007014849.1| Emsy N Terminus/ plant Tudor-like domains-co...   590   e-165
gb|KHG28322.1| Protein EMSY [Gossypium arboreum]                      585   e-164
ref|XP_012472977.1| PREDICTED: protein EMSY-LIKE 3-like isoform ...   583   e-163
ref|XP_010087381.1| hypothetical protein L484_018407 [Morus nota...   578   e-162
ref|XP_010265885.1| PREDICTED: uncharacterized protein LOC104603...   578   e-162
ref|XP_007014850.1| Emsy N Terminus/ plant Tudor-like domains-co...   574   e-160
ref|XP_010265884.1| PREDICTED: uncharacterized protein LOC104603...   573   e-160
ref|XP_010265883.1| PREDICTED: uncharacterized protein LOC104603...   569   e-159
ref|XP_002299156.2| hypothetical protein POPTR_0001s05150g [Popu...   565   e-158
gb|KHG28323.1| Protein EMSY [Gossypium arboreum]                      564   e-158
ref|XP_012472979.1| PREDICTED: protein EMSY-LIKE 3-like isoform ...   562   e-157
gb|KHG14231.1| Protein EMSY [Gossypium arboreum]                      558   e-156
ref|XP_011037909.1| PREDICTED: uncharacterized protein LOC105134...   558   e-156
gb|KJB83471.1| hypothetical protein B456_013G249400 [Gossypium r...   556   e-155
gb|KJB55313.1| hypothetical protein B456_009G070600 [Gossypium r...   555   e-155
ref|XP_012462611.1| PREDICTED: protein EMSY-LIKE 3 isoform X1 [G...   555   e-155

>ref|XP_012087437.1| PREDICTED: protein EMSY-LIKE 3 isoform X3 [Jatropha curcas]
          Length = 460

 Score =  610 bits (1574), Expect = e-171
 Identities = 320/458 (69%), Positives = 351/458 (76%), Gaps = 1/458 (0%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPAGNGRSAVVGTAPFPRMHSDMESQIHHIEQE 1476
            M YELSDSSGTDDDLPPPHRNRF  G  PAGNGRS VVG+ P PRMHSDME+QIH+IEQE
Sbjct: 1    MDYELSDSSGTDDDLPPPHRNRFPSGVRPAGNGRSTVVGSTPLPRMHSDMETQIHNIEQE 60

Query: 1475 AYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT 1296
            AY+SVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT
Sbjct: 61   AYTSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT 120

Query: 1295 RGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVLX 1116
              +QP M S++QP HDP PSPTVSAS KKQKTSQS+ASL++ APSPAL PSVQPSSS + 
Sbjct: 121  NAIQPSMPSTAQPAHDPTPSPTVSASHKKQKTSQSVASLSMGAPSPAL-PSVQPSSSAMR 179

Query: 1115 XXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWTRWPE 936
                         +YPS G TGRAQ     SSGAFA +EPAEA ++DPLIGRKVWTRWPE
Sbjct: 180  RGPPPGPKSKKPKAYPSAGLTGRAQANNRSSSGAFATSEPAEATSYDPLIGRKVWTRWPE 239

Query: 935  DNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXXXXXX 756
            DN+FYEAVITDYNPVEGRHALVYDINTA+ETWEWVNLKEISPED+RWE EDP        
Sbjct: 240  DNHFYEAVITDYNPVEGRHALVYDINTANETWEWVNLKEISPEDLRWEGEDPGIFRRGSR 299

Query: 755  XXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTDTLIK 579
                 G KK M                     KDFP S+NGIGKKA+GDIEILHTD+LIK
Sbjct: 300  PGPGRGNKKPMARGGALAGGGRGRGTMKGHSRKDFPLSQNGIGKKAMGDIEILHTDSLIK 359

Query: 578  EVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQSMDX 399
            EVEKVF +S PDP+EIEKAKK+L+EHE++LVDAIA+LEDASDGESD GE  FS G+SMD 
Sbjct: 360  EVEKVFGSSHPDPMEIEKAKKVLKEHEQALVDAIAKLEDASDGESD-GEHTFSHGRSMD- 417

Query: 398  XXXXXXXXXXQYDEMXXXXXXXXGSDGNKMAREGRVGS 285
                       YDE+         SDGNKMAR+GR GS
Sbjct: 418  --QDRGWRKRPYDEIGGEGRVIESSDGNKMARDGRGGS 453


>ref|XP_012087435.1| PREDICTED: protein EMSY-LIKE 3 isoform X1 [Jatropha curcas]
          Length = 463

 Score =  605 bits (1560), Expect = e-170
 Identities = 320/461 (69%), Positives = 350/461 (75%), Gaps = 4/461 (0%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPAGNGRSAVVGTAPFPRMHSDMESQIHHIEQE 1476
            M YELSDSSGTDDDLPPPHRNRF  G  PAGNGRS VVG+ P PRMHSDME+QIH+IEQE
Sbjct: 1    MDYELSDSSGTDDDLPPPHRNRFPSGVRPAGNGRSTVVGSTPLPRMHSDMETQIHNIEQE 60

Query: 1475 AYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT 1296
            AY+SVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT
Sbjct: 61   AYTSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT 120

Query: 1295 RGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL- 1119
              +QP M S++QP HDP PSPTVSAS KKQKTSQS+ASL++ APSPAL PSVQPSSS + 
Sbjct: 121  NAIQPSMPSTAQPAHDPTPSPTVSASHKKQKTSQSVASLSMGAPSPAL-PSVQPSSSAMR 179

Query: 1118 --XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWTR 945
                             YPS G TGRAQ     SSGAFA +EPAEA ++DPLIGRKVWTR
Sbjct: 180  RGPPPGPKSKKPKASMQYPSAGLTGRAQANNRSSSGAFATSEPAEATSYDPLIGRKVWTR 239

Query: 944  WPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXXX 765
            WPEDN+FYEAVITDYNPVEGRHALVYDINTA+ETWEWVNLKEISPED+RWE EDP     
Sbjct: 240  WPEDNHFYEAVITDYNPVEGRHALVYDINTANETWEWVNLKEISPEDLRWEGEDPGIFRR 299

Query: 764  XXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTDT 588
                    G KK M                     KDFP S+NGIGKKA+GDIEILHTD+
Sbjct: 300  GSRPGPGRGNKKPMARGGALAGGGRGRGTMKGHSRKDFPLSQNGIGKKAMGDIEILHTDS 359

Query: 587  LIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQS 408
            LIKEVEKVF +S PDP+EIEKAKK+L+EHE++LVDAIA+LEDASDGESD GE  FS G+S
Sbjct: 360  LIKEVEKVFGSSHPDPMEIEKAKKVLKEHEQALVDAIAKLEDASDGESD-GEHTFSHGRS 418

Query: 407  MDXXXXXXXXXXXQYDEMXXXXXXXXGSDGNKMAREGRVGS 285
            MD            YDE+         SDGNKMAR+GR GS
Sbjct: 419  MD---QDRGWRKRPYDEIGGEGRVIESSDGNKMARDGRGGS 456


>ref|XP_012087436.1| PREDICTED: protein EMSY-LIKE 3 isoform X2 [Jatropha curcas]
            gi|643711628|gb|KDP25135.1| hypothetical protein
            JCGZ_22670 [Jatropha curcas]
          Length = 461

 Score =  596 bits (1536), Expect = e-167
 Identities = 318/461 (68%), Positives = 348/461 (75%), Gaps = 4/461 (0%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPAGNGRSAVVGTAPFPRMHSDMESQIHHIEQE 1476
            M YELSDSS  DDDLPPPHRNRF  G  PAGNGRS VVG+ P PRMHSDME+QIH+IEQE
Sbjct: 1    MDYELSDSS--DDDLPPPHRNRFPSGVRPAGNGRSTVVGSTPLPRMHSDMETQIHNIEQE 58

Query: 1475 AYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT 1296
            AY+SVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT
Sbjct: 59   AYTSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT 118

Query: 1295 RGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL- 1119
              +QP M S++QP HDP PSPTVSAS KKQKTSQS+ASL++ APSPAL PSVQPSSS + 
Sbjct: 119  NAIQPSMPSTAQPAHDPTPSPTVSASHKKQKTSQSVASLSMGAPSPAL-PSVQPSSSAMR 177

Query: 1118 --XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWTR 945
                             YPS G TGRAQ     SSGAFA +EPAEA ++DPLIGRKVWTR
Sbjct: 178  RGPPPGPKSKKPKASMQYPSAGLTGRAQANNRSSSGAFATSEPAEATSYDPLIGRKVWTR 237

Query: 944  WPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXXX 765
            WPEDN+FYEAVITDYNPVEGRHALVYDINTA+ETWEWVNLKEISPED+RWE EDP     
Sbjct: 238  WPEDNHFYEAVITDYNPVEGRHALVYDINTANETWEWVNLKEISPEDLRWEGEDPGIFRR 297

Query: 764  XXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTDT 588
                    G KK M                     KDFP S+NGIGKKA+GDIEILHTD+
Sbjct: 298  GSRPGPGRGNKKPMARGGALAGGGRGRGTMKGHSRKDFPLSQNGIGKKAMGDIEILHTDS 357

Query: 587  LIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQS 408
            LIKEVEKVF +S PDP+EIEKAKK+L+EHE++LVDAIA+LEDASDGESD GE  FS G+S
Sbjct: 358  LIKEVEKVFGSSHPDPMEIEKAKKVLKEHEQALVDAIAKLEDASDGESD-GEHTFSHGRS 416

Query: 407  MDXXXXXXXXXXXQYDEMXXXXXXXXGSDGNKMAREGRVGS 285
            MD            YDE+         SDGNKMAR+GR GS
Sbjct: 417  MD---QDRGWRKRPYDEIGGEGRVIESSDGNKMARDGRGGS 454


>ref|XP_007014847.1| Emsy N Terminus/ plant Tudor-like domains-containing protein isoform
            1 [Theobroma cacao] gi|590583247|ref|XP_007014848.1| Emsy
            N Terminus/ plant Tudor-like domains-containing protein
            isoform 1 [Theobroma cacao] gi|508785210|gb|EOY32466.1|
            Emsy N Terminus/ plant Tudor-like domains-containing
            protein isoform 1 [Theobroma cacao]
            gi|508785211|gb|EOY32467.1| Emsy N Terminus/ plant
            Tudor-like domains-containing protein isoform 1
            [Theobroma cacao]
          Length = 453

 Score =  591 bits (1523), Expect = e-166
 Identities = 309/423 (73%), Positives = 332/423 (78%), Gaps = 5/423 (1%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPA-GNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NRF  GG  A GNGRSAVVG+AP PRMHSDME+QIH IEQ
Sbjct: 1    MDYGLSDSSGTDDDLPPSHQNRFQRGGRTAAGNGRSAVVGSAPLPRMHSDMETQIHLIEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAYSSVLRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELL RVNADDI+RRIREWR 
Sbjct: 61   EAYSSVLRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLLRVNADDILRRIREWRT 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL 1119
              GLQPGMLS+SQPVHD +PSPTVS S KKQKTSQS+ASL++ APSPALHPS+QPSSS L
Sbjct: 121  ASGLQPGMLSTSQPVHDTVPSPTVSGSRKKQKTSQSVASLSMGAPSPALHPSMQPSSSAL 180

Query: 1118 ---XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWT 948
                              YPSTG   R Q P   SSGAFA NEPAEAA +DPLIGRKVWT
Sbjct: 181  RRGPLPGAKSKKSKSSTQYPSTGLPVRPQAPNRTSSGAFATNEPAEAAPYDPLIGRKVWT 240

Query: 947  RWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXX 768
            RWPEDN+FYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWE +DP    
Sbjct: 241  RWPEDNHFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWEGDDPGMSR 300

Query: 767  XXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTD 591
                     G KKSM                     KDFP ++NGIGKK LGDIEILHTD
Sbjct: 301  RGGRPGPGRGIKKSMARGGGVAGAGRGRGSLKGHAKKDFPLAQNGIGKKVLGDIEILHTD 360

Query: 590  TLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQ 411
            TLIKEVEKVF A QPDP+EIEKAKK+L+EHE++LV+AIARLEDASDGES  GE  FS+GQ
Sbjct: 361  TLIKEVEKVFGAGQPDPMEIEKAKKVLKEHEQALVEAIARLEDASDGESADGEHPFSRGQ 420

Query: 410  SMD 402
            SMD
Sbjct: 421  SMD 423


>ref|XP_007014849.1| Emsy N Terminus/ plant Tudor-like domains-containing protein isoform
            3 [Theobroma cacao] gi|508785212|gb|EOY32468.1| Emsy N
            Terminus/ plant Tudor-like domains-containing protein
            isoform 3 [Theobroma cacao]
          Length = 452

 Score =  590 bits (1520), Expect = e-165
 Identities = 310/423 (73%), Positives = 333/423 (78%), Gaps = 5/423 (1%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPA-GNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NRF  GG  A GNGRSAVVG+AP PRMHSDME+QIH IEQ
Sbjct: 1    MDYGLSDSSGTDDDLPPSHQNRFQRGGRTAAGNGRSAVVGSAPLPRMHSDMETQIHLIEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAYSSVLRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELL RVNADDI+RRIREWR 
Sbjct: 61   EAYSSVLRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLLRVNADDILRRIREWRT 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL 1119
              GLQPGMLS+SQPVHD +PSPTVS S KKQKTSQS+ASL++ APSPALHPS+QPSSS L
Sbjct: 121  ASGLQPGMLSTSQPVHDTVPSPTVSGSRKKQKTSQSVASLSMGAPSPALHPSMQPSSSAL 180

Query: 1118 ---XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWT 948
                              YPSTG   R Q P   SSGAFA NEPAEAA +DPLIGRKVWT
Sbjct: 181  RRGPLPGAKSKKSKSSTQYPSTGLPVRPQAPNRTSSGAFATNEPAEAAPYDPLIGRKVWT 240

Query: 947  RWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXX 768
            RWPEDN+FYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWE +DP    
Sbjct: 241  RWPEDNHFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWEGDDPGMSR 300

Query: 767  XXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTD 591
                     G KKSM                     KDFP ++NGIGKK LGDIEILHTD
Sbjct: 301  RGGRPGPGRGIKKSMARGGGVAGAGRGRGSLKGHAKKDFPLAQNGIGKKVLGDIEILHTD 360

Query: 590  TLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQ 411
            TLIKEVEKVF A QPDP+EIEKAKK+L+EHE++LV+AIARLEDASDGESD GE  FS+GQ
Sbjct: 361  TLIKEVEKVFGAGQPDPMEIEKAKKVLKEHEQALVEAIARLEDASDGESD-GEHPFSRGQ 419

Query: 410  SMD 402
            SMD
Sbjct: 420  SMD 422


>gb|KHG28322.1| Protein EMSY [Gossypium arboreum]
          Length = 448

 Score =  585 bits (1508), Expect = e-164
 Identities = 316/453 (69%), Positives = 341/453 (75%), Gaps = 5/453 (1%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPA-GNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NRF  GG  A GNGRSAV G+ P PRMH DME+QIH IEQ
Sbjct: 1    MDYALSDSSGTDDDLPPSHQNRFQRGGRTAAGNGRSAVAGSTPLPRMHGDMETQIHLIEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAY SVLRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELL RVNADDIIRRIREWR 
Sbjct: 61   EAYCSVLRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLLRVNADDIIRRIREWRT 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL 1119
              GLQPGMLS+SQP+HDP+PSP+VS S KKQKTSQS+ASL++ APSPALHPS+QPSSS L
Sbjct: 121  MSGLQPGMLSTSQPLHDPLPSPSVSGSRKKQKTSQSVASLSMGAPSPALHPSMQPSSSAL 180

Query: 1118 ---XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWT 948
                              YPSTG  GR Q P   SSGAFA+NEPAEAA +DPLIGRKVWT
Sbjct: 181  RRGPPPGAKSKKSKSSTQYPSTGPPGRPQPPNRTSSGAFAINEPAEAAPYDPLIGRKVWT 240

Query: 947  RWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXX 768
            RWPEDN+FYEAVITDYN  EGRHALVYDINTADETWEWVNLKEISPEDI+WE EDP    
Sbjct: 241  RWPEDNHFYEAVITDYNAAEGRHALVYDINTADETWEWVNLKEISPEDIKWEGEDPGISR 300

Query: 767  XXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTD 591
                     G KKSM                     KDFP+ +NG+GKK LGDIEILHTD
Sbjct: 301  RGGRPGQGHGVKKSMSRGGGVAGAGRGRGSLKGQAKKDFPSKQNGVGKKVLGDIEILHTD 360

Query: 590  TLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQ 411
            TLIKEVEKVF AS PDP+EIEKAKK+L+EHE+SLVDAIARLE+ASD ESD GE +FSQGQ
Sbjct: 361  TLIKEVEKVFGASHPDPIEIEKAKKVLKEHEQSLVDAIARLEEASDDESD-GEHRFSQGQ 419

Query: 410  SMDXXXXXXXXXXXQYDEMXXXXXXXXGSDGNK 312
            SMD           QYDEM        GSDGNK
Sbjct: 420  SMD---QERAWRKRQYDEM-GEGRMIEGSDGNK 448


>ref|XP_012472977.1| PREDICTED: protein EMSY-LIKE 3-like isoform X1 [Gossypium raimondii]
            gi|823146179|ref|XP_012472978.1| PREDICTED: protein
            EMSY-LIKE 3-like isoform X1 [Gossypium raimondii]
            gi|763754523|gb|KJB21854.1| hypothetical protein
            B456_004G018500 [Gossypium raimondii]
            gi|763754524|gb|KJB21855.1| hypothetical protein
            B456_004G018500 [Gossypium raimondii]
          Length = 448

 Score =  583 bits (1502), Expect = e-163
 Identities = 314/453 (69%), Positives = 340/453 (75%), Gaps = 5/453 (1%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPA-GNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NRF  GG  A GNGRSAV G+ P PRMH DME+QIH IEQ
Sbjct: 1    MDYALSDSSGTDDDLPPSHQNRFQRGGRTAAGNGRSAVAGSTPLPRMHGDMETQIHLIEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAY SVLRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELL RVNA+DIIRRIREWR 
Sbjct: 61   EAYCSVLRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLLRVNAEDIIRRIREWRT 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL 1119
              GLQPGMLS+SQP+HDP+PSP+VS S KKQKTSQS+ASL++ APSPALHPS+QPSSS L
Sbjct: 121  MSGLQPGMLSTSQPLHDPLPSPSVSGSRKKQKTSQSVASLSMGAPSPALHPSMQPSSSAL 180

Query: 1118 ---XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWT 948
                              YPSTG  GR Q P   SSGAFA NEPAEAA +DPLIGRKVWT
Sbjct: 181  RRGPPPGAKSKKSKSSTQYPSTGLPGRPQPPNRTSSGAFATNEPAEAAPYDPLIGRKVWT 240

Query: 947  RWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXX 768
            RWPEDN+FYEAVITDYN  EGRHALVYDINTADETWEWVNLKEISPEDI+WE EDP    
Sbjct: 241  RWPEDNHFYEAVITDYNAAEGRHALVYDINTADETWEWVNLKEISPEDIKWEGEDPGISR 300

Query: 767  XXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTD 591
                     G KKSM                     KDFP+ +NG+GKK LGDIEILHTD
Sbjct: 301  RGGRPGQGHGVKKSMSRGGGVAGAGRGRGSLKGQAKKDFPSKQNGVGKKVLGDIEILHTD 360

Query: 590  TLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQ 411
            TLIKEVEKVF AS PDP+EIEKAKK+L+EHE++LVDAIARLE+ASD ESD GE +FSQGQ
Sbjct: 361  TLIKEVEKVFGASHPDPIEIEKAKKVLKEHEQALVDAIARLEEASDDESD-GEHRFSQGQ 419

Query: 410  SMDXXXXXXXXXXXQYDEMXXXXXXXXGSDGNK 312
            SMD           QYDEM        GSDGNK
Sbjct: 420  SMD---QERAWRKRQYDEM-GEGRMIEGSDGNK 448


>ref|XP_010087381.1| hypothetical protein L484_018407 [Morus notabilis]
            gi|587838282|gb|EXB28991.1| hypothetical protein
            L484_018407 [Morus notabilis]
          Length = 487

 Score =  578 bits (1491), Expect = e-162
 Identities = 318/489 (65%), Positives = 350/489 (71%), Gaps = 24/489 (4%)
 Frame = -3

Query: 1655 MAYELSDSSG------------------TDDDLPPPHRNRFAGGGHPAGNGRSAVVGTAP 1530
            M Y LSDSSG                  TDDDLPP H+NRF  GG  +GNGRSA VG+A 
Sbjct: 1    MDYGLSDSSGELTEKILFEYAFCVFVFCTDDDLPPSHQNRFQRGGRASGNGRSAPVGSAQ 60

Query: 1529 FPRMHSDMESQIHHIEQEAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELL 1350
             PR+H DME+QIHHIEQEAY SVLRAFKAQSDAITW+KESLITELRKELRVSDEEHRELL
Sbjct: 61   MPRIHGDMETQIHHIEQEAYCSVLRAFKAQSDAITWDKESLITELRKELRVSDEEHRELL 120

Query: 1349 SRVNADDIIRRIREWRKTRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLS 1170
            SRVNADD+IRRIREWRK  GLQPGM S+ QPVHDP+PSPTVSAS KKQKTSQS+ASL+L 
Sbjct: 121  SRVNADDMIRRIREWRKASGLQPGMGSAPQPVHDPIPSPTVSASRKKQKTSQSVASLSLG 180

Query: 1169 APSPALHPSVQPSSSVL---XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNE 999
            AP PAL PS+QPSSS L                  YPSTG TGRAQ    GS+GAF   E
Sbjct: 181  APPPALPPSMQPSSSALRRGPPPGARTKKPKSSMQYPSTGVTGRAQATNRGSTGAFGTTE 240

Query: 998  PAEAANFDPLIGRKVWTRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKE 819
             AE A FDPLIGRKVWTRWPEDN+FYEAVITDYN +EGRHALVYDINTADETWEWVNLKE
Sbjct: 241  AAEGAAFDPLIGRKVWTRWPEDNHFYEAVITDYNALEGRHALVYDINTADETWEWVNLKE 300

Query: 818  ISPEDIRWESEDPXXXXXXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASE 642
            ISPEDIRWE EDP             G KKSM                     KDFP  +
Sbjct: 301  ISPEDIRWEGEDPGISRKGGRPGPGRGNKKSMTRGGAVPGAGRGRGTTKGQSKKDFPLQQ 360

Query: 641  NGIGKKALGDIEILHTDTLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLED 462
            NGIGKK +GDIEILHTDTLIKEVEKVF AS PDP+EIEKAKK+L++HE++LVDAIA+LED
Sbjct: 361  NGIGKKGMGDIEILHTDTLIKEVEKVFGASHPDPMEIEKAKKVLKDHEQALVDAIAKLED 420

Query: 461  ASDGES--DGGERQFSQGQSMDXXXXXXXXXXXQYDEMXXXXXXXXGSDGNKMAREGRVG 288
            ASDGES  D G+  FSQGQSMD           Q+DEM        G +G+KMAR+GR+ 
Sbjct: 421  ASDGESEADEGDHPFSQGQSMD---HERGWRKRQFDEM-GEGRPVEGPNGDKMARDGRLV 476

Query: 287  SDAQQDEGD 261
             D Q+ +GD
Sbjct: 477  PDDQRYDGD 485


>ref|XP_010265885.1| PREDICTED: uncharacterized protein LOC104603526 isoform X3 [Nelumbo
            nucifera]
          Length = 480

 Score =  578 bits (1491), Expect = e-162
 Identities = 324/481 (67%), Positives = 349/481 (72%), Gaps = 16/481 (3%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNR-FAGGGHPAGNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NR    GG  AGNGRSAVVG+A +PRMH+DME+QIH +EQ
Sbjct: 1    MDYGLSDSSGTDDDLPPSHQNRGVPRGGRLAGNGRSAVVGSAAYPRMHTDMETQIHQLEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELL+RVNADDIIRRIREWR+
Sbjct: 61   EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLTRVNADDIIRRIREWRQ 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHP-----SVQP 1134
              GLQ  +L SSQPVHDP+PSPTVSAS KKQKTSQS+ SL+L+APSPA HP     S+QP
Sbjct: 121  AGGLQASLL-SSQPVHDPVPSPTVSASRKKQKTSQSVPSLSLNAPSPAFHPQTVSASMQP 179

Query: 1133 SSSVLXXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKV 954
            SSS                SYP TG  GR QV   GSSGA   NEP EAA FDPLIGRKV
Sbjct: 180  SSSAAKRGAGVGARGKKPKSYPLTGAAGRGQVANRGSSGALVANEPTEAAAFDPLIGRKV 239

Query: 953  WTRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXX 774
             TRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISP+DIRWE EDP  
Sbjct: 240  MTRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPDDIRWEGEDPGI 299

Query: 773  XXXXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILH 597
                       G KKS+                     KDFP S+NGIGKK   DIEILH
Sbjct: 300  SRRGGRGGSGRGIKKSVGRGGAVPGAGRGRGTTKVQGKKDFPPSQNGIGKKNSDDIEILH 359

Query: 596  TDTLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQ 417
            TDTLIKEVE+VF AS PDPLEIEKAKK+L+EHE++L+DAIARL DASDGESD GE QFS 
Sbjct: 360  TDTLIKEVERVFGASHPDPLEIEKAKKVLKEHEQALIDAIARLADASDGESD-GEHQFSH 418

Query: 416  GQSMDXXXXXXXXXXXQY-------DEMXXXXXXXXGSDGNKMAREGRVGSDAQQ--DEG 264
            GQSMD                    DEM        GSDG+ +A EGRVGSD QQ  DEG
Sbjct: 419  GQSMDRERGWRNRQYGGNQHTTDFDDEM--GEGRGEGSDGDHIAGEGRVGSDDQQDGDEG 476

Query: 263  D 261
            D
Sbjct: 477  D 477


>ref|XP_007014850.1| Emsy N Terminus/ plant Tudor-like domains-containing protein isoform
            4 [Theobroma cacao] gi|590583258|ref|XP_007014851.1| Emsy
            N Terminus/ plant Tudor-like domains-containing protein
            isoform 4 [Theobroma cacao] gi|508785213|gb|EOY32469.1|
            Emsy N Terminus/ plant Tudor-like domains-containing
            protein isoform 4 [Theobroma cacao]
            gi|508785214|gb|EOY32470.1| Emsy N Terminus/ plant
            Tudor-like domains-containing protein isoform 4
            [Theobroma cacao]
          Length = 412

 Score =  574 bits (1479), Expect = e-160
 Identities = 300/409 (73%), Positives = 322/409 (78%), Gaps = 5/409 (1%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPA-GNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NRF  GG  A GNGRSAVVG+AP PRMHSDME+QIH IEQ
Sbjct: 1    MDYGLSDSSGTDDDLPPSHQNRFQRGGRTAAGNGRSAVVGSAPLPRMHSDMETQIHLIEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAYSSVLRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELL RVNADDI+RRIREWR 
Sbjct: 61   EAYSSVLRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLLRVNADDILRRIREWRT 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL 1119
              GLQPGMLS+SQPVHD +PSPTVS S KKQKTSQS+ASL++ APSPALHPS+QPSSS L
Sbjct: 121  ASGLQPGMLSTSQPVHDTVPSPTVSGSRKKQKTSQSVASLSMGAPSPALHPSMQPSSSAL 180

Query: 1118 ---XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWT 948
                              YPSTG   R Q P   SSGAFA NEPAEAA +DPLIGRKVWT
Sbjct: 181  RRGPLPGAKSKKSKSSTQYPSTGLPVRPQAPNRTSSGAFATNEPAEAAPYDPLIGRKVWT 240

Query: 947  RWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXX 768
            RWPEDN+FYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWE +DP    
Sbjct: 241  RWPEDNHFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWEGDDPGMSR 300

Query: 767  XXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTD 591
                     G KKSM                     KDFP ++NGIGKK LGDIEILHTD
Sbjct: 301  RGGRPGPGRGIKKSMARGGGVAGAGRGRGSLKGHAKKDFPLAQNGIGKKVLGDIEILHTD 360

Query: 590  TLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGES 444
            TLIKEVEKVF A QPDP+EIEKAKK+L+EHE++LV+AIARLEDASDGES
Sbjct: 361  TLIKEVEKVFGAGQPDPMEIEKAKKVLKEHEQALVEAIARLEDASDGES 409


>ref|XP_010265884.1| PREDICTED: uncharacterized protein LOC104603526 isoform X2 [Nelumbo
            nucifera]
          Length = 483

 Score =  573 bits (1477), Expect = e-160
 Identities = 323/484 (66%), Positives = 348/484 (71%), Gaps = 19/484 (3%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNR-FAGGGHPAGNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NR    GG  AGNGRSAVVG+A +PRMH+DME+QIH +EQ
Sbjct: 1    MDYGLSDSSGTDDDLPPSHQNRGVPRGGRLAGNGRSAVVGSAAYPRMHTDMETQIHQLEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELL+RVNADDIIRRIREWR+
Sbjct: 61   EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLTRVNADDIIRRIREWRQ 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHP-----SVQP 1134
              GLQ  +L SSQPVHDP+PSPTVSAS KKQKTSQS+ SL+L+APSPA HP     S+QP
Sbjct: 121  AGGLQASLL-SSQPVHDPVPSPTVSASRKKQKTSQSVPSLSLNAPSPAFHPQTVSASMQP 179

Query: 1133 SSSVL---XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIG 963
            SSS                    YP TG  GR QV   GSSGA   NEP EAA FDPLIG
Sbjct: 180  SSSAAKRGAGVGARGKKPKSSMQYPLTGAAGRGQVANRGSSGALVANEPTEAAAFDPLIG 239

Query: 962  RKVWTRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESED 783
            RKV TRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISP+DIRWE ED
Sbjct: 240  RKVMTRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPDDIRWEGED 299

Query: 782  PXXXXXXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIE 606
            P             G KKS+                     KDFP S+NGIGKK   DIE
Sbjct: 300  PGISRRGGRGGSGRGIKKSVGRGGAVPGAGRGRGTTKVQGKKDFPPSQNGIGKKNSDDIE 359

Query: 605  ILHTDTLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQ 426
            ILHTDTLIKEVE+VF AS PDPLEIEKAKK+L+EHE++L+DAIARL DASDGESD GE Q
Sbjct: 360  ILHTDTLIKEVERVFGASHPDPLEIEKAKKVLKEHEQALIDAIARLADASDGESD-GEHQ 418

Query: 425  FSQGQSMDXXXXXXXXXXXQY-------DEMXXXXXXXXGSDGNKMAREGRVGSDAQQ-- 273
            FS GQSMD                    DEM        GSDG+ +A EGRVGSD QQ  
Sbjct: 419  FSHGQSMDRERGWRNRQYGGNQHTTDFDDEM--GEGRGEGSDGDHIAGEGRVGSDDQQDG 476

Query: 272  DEGD 261
            DEGD
Sbjct: 477  DEGD 480


>ref|XP_010265883.1| PREDICTED: uncharacterized protein LOC104603526 isoform X1 [Nelumbo
            nucifera]
          Length = 494

 Score =  569 bits (1466), Expect = e-159
 Identities = 323/495 (65%), Positives = 348/495 (70%), Gaps = 30/495 (6%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNR-FAGGGHPAGNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NR    GG  AGNGRSAVVG+A +PRMH+DME+QIH +EQ
Sbjct: 1    MDYGLSDSSGTDDDLPPSHQNRGVPRGGRLAGNGRSAVVGSAAYPRMHTDMETQIHQLEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELL+RVNADDIIRRIREWR+
Sbjct: 61   EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLTRVNADDIIRRIREWRQ 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHP-----SVQP 1134
              GLQ  +L SSQPVHDP+PSPTVSAS KKQKTSQS+ SL+L+APSPA HP     S+QP
Sbjct: 121  AGGLQASLL-SSQPVHDPVPSPTVSASRKKQKTSQSVPSLSLNAPSPAFHPQTVSASMQP 179

Query: 1133 SSSVL--------------XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEP 996
            SSS                               YP TG  GR QV   GSSGA   NEP
Sbjct: 180  SSSAAKRGAGVGARGKKPKSGQSMPDTSSMKSMQYPLTGAAGRGQVANRGSSGALVANEP 239

Query: 995  AEAANFDPLIGRKVWTRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEI 816
             EAA FDPLIGRKV TRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEI
Sbjct: 240  TEAAAFDPLIGRKVMTRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEI 299

Query: 815  SPEDIRWESEDPXXXXXXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASEN 639
            SP+DIRWE EDP             G KKS+                     KDFP S+N
Sbjct: 300  SPDDIRWEGEDPGISRRGGRGGSGRGIKKSVGRGGAVPGAGRGRGTTKVQGKKDFPPSQN 359

Query: 638  GIGKKALGDIEILHTDTLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDA 459
            GIGKK   DIEILHTDTLIKEVE+VF AS PDPLEIEKAKK+L+EHE++L+DAIARL DA
Sbjct: 360  GIGKKNSDDIEILHTDTLIKEVERVFGASHPDPLEIEKAKKVLKEHEQALIDAIARLADA 419

Query: 458  SDGESDGGERQFSQGQSMDXXXXXXXXXXXQY-------DEMXXXXXXXXGSDGNKMARE 300
            SDGESD GE QFS GQSMD                    DEM        GSDG+ +A E
Sbjct: 420  SDGESD-GEHQFSHGQSMDRERGWRNRQYGGNQHTTDFDDEM--GEGRGEGSDGDHIAGE 476

Query: 299  GRVGSDAQQ--DEGD 261
            GRVGSD QQ  DEGD
Sbjct: 477  GRVGSDDQQDGDEGD 491


>ref|XP_002299156.2| hypothetical protein POPTR_0001s05150g [Populus trichocarpa]
            gi|550346552|gb|EEE83961.2| hypothetical protein
            POPTR_0001s05150g [Populus trichocarpa]
          Length = 456

 Score =  565 bits (1455), Expect = e-158
 Identities = 303/464 (65%), Positives = 335/464 (72%), Gaps = 1/464 (0%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPAGNGRSAVVGTAPFPRMHSDMESQIHHIEQE 1476
            M YELSDSSGTDDDLPP HRNRF  G   AGNGRSAV G A  PR+HSDME+QIH+IEQE
Sbjct: 1    MDYELSDSSGTDDDLPPTHRNRFQSGARTAGNGRSAVGGAASQPRLHSDMETQIHNIEQE 60

Query: 1475 AYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT 1296
            AY+SVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELL+RVNADD+IRRIREWRK 
Sbjct: 61   AYTSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLARVNADDMIRRIREWRKA 120

Query: 1295 RGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVLX 1116
             G+QP M S++QP HDP+PSPTVS S KKQKTSQS+ASL++  PSP LHPS+QPS+S L 
Sbjct: 121  NGIQPSMPSTAQPSHDPIPSPTVSGSRKKQKTSQSVASLSMVVPSPVLHPSMQPSTSALR 180

Query: 1115 XXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWTRWPE 936
                         S  STG + RAQ    GSSG FA N+         LIG+KVWTRWPE
Sbjct: 181  HGPPPGSGNKKPKSQRSTGLSSRAQAANRGSSGVFATND---------LIGKKVWTRWPE 231

Query: 935  DNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXXXXXX 756
            DN+FYEAVITDYNPVEGRHALVYDINT DETWEWVNLKEISPEDIRWE E+P        
Sbjct: 232  DNHFYEAVITDYNPVEGRHALVYDINTGDETWEWVNLKEISPEDIRWEGEEPGLFRRGGR 291

Query: 755  XXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTDTLIK 579
                 G KK++                     KDFP  +NGIGKKA+GDIEILHT+TLIK
Sbjct: 292  PGPGRGNKKAIARGGAVVTAGRGRGTTKGQSKKDFPLIQNGIGKKAMGDIEILHTNTLIK 351

Query: 578  EVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQSMDX 399
            EVEKVF AS PDPLEIEKAKK L E E++LV+AIARLE+ASDGESD GE  F + QSMD 
Sbjct: 352  EVEKVFGASHPDPLEIEKAKKALEEQEQALVNAIARLEEASDGESDEGEHPFPRVQSMD- 410

Query: 398  XXXXXXXXXXQYDEMXXXXXXXXGSDGNKMAREGRVGSDAQQDE 267
                       YDE+        GSDGNKMAR GR+ S  Q DE
Sbjct: 411  --QDRGWRKRSYDEIVGEGRGIEGSDGNKMARNGRIVSSDQHDE 452


>gb|KHG28323.1| Protein EMSY [Gossypium arboreum]
          Length = 412

 Score =  564 bits (1454), Expect = e-158
 Identities = 294/409 (71%), Positives = 318/409 (77%), Gaps = 5/409 (1%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPA-GNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NRF  GG  A GNGRSAV G+ P PRMH DME+QIH IEQ
Sbjct: 1    MDYALSDSSGTDDDLPPSHQNRFQRGGRTAAGNGRSAVAGSTPLPRMHGDMETQIHLIEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAY SVLRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELL RVNADDIIRRIREWR 
Sbjct: 61   EAYCSVLRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLLRVNADDIIRRIREWRT 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL 1119
              GLQPGMLS+SQP+HDP+PSP+VS S KKQKTSQS+ASL++ APSPALHPS+QPSSS L
Sbjct: 121  MSGLQPGMLSTSQPLHDPLPSPSVSGSRKKQKTSQSVASLSMGAPSPALHPSMQPSSSAL 180

Query: 1118 ---XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWT 948
                              YPSTG  GR Q P   SSGAFA+NEPAEAA +DPLIGRKVWT
Sbjct: 181  RRGPPPGAKSKKSKSSTQYPSTGPPGRPQPPNRTSSGAFAINEPAEAAPYDPLIGRKVWT 240

Query: 947  RWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXX 768
            RWPEDN+FYEAVITDYN  EGRHALVYDINTADETWEWVNLKEISPEDI+WE EDP    
Sbjct: 241  RWPEDNHFYEAVITDYNAAEGRHALVYDINTADETWEWVNLKEISPEDIKWEGEDPGISR 300

Query: 767  XXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTD 591
                     G KKSM                     KDFP+ +NG+GKK LGDIEILHTD
Sbjct: 301  RGGRPGQGHGVKKSMSRGGGVAGAGRGRGSLKGQAKKDFPSKQNGVGKKVLGDIEILHTD 360

Query: 590  TLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGES 444
            TLIKEVEKVF AS PDP+EIEKAKK+L+EHE+SLVDAIARLE+ASD ES
Sbjct: 361  TLIKEVEKVFGASHPDPIEIEKAKKVLKEHEQSLVDAIARLEEASDDES 409


>ref|XP_012472979.1| PREDICTED: protein EMSY-LIKE 3-like isoform X2 [Gossypium raimondii]
            gi|763754527|gb|KJB21858.1| hypothetical protein
            B456_004G018500 [Gossypium raimondii]
          Length = 412

 Score =  562 bits (1448), Expect = e-157
 Identities = 292/409 (71%), Positives = 317/409 (77%), Gaps = 5/409 (1%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPA-GNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NRF  GG  A GNGRSAV G+ P PRMH DME+QIH IEQ
Sbjct: 1    MDYALSDSSGTDDDLPPSHQNRFQRGGRTAAGNGRSAVAGSTPLPRMHGDMETQIHLIEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAY SVLRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELL RVNA+DIIRRIREWR 
Sbjct: 61   EAYCSVLRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLLRVNAEDIIRRIREWRT 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL 1119
              GLQPGMLS+SQP+HDP+PSP+VS S KKQKTSQS+ASL++ APSPALHPS+QPSSS L
Sbjct: 121  MSGLQPGMLSTSQPLHDPLPSPSVSGSRKKQKTSQSVASLSMGAPSPALHPSMQPSSSAL 180

Query: 1118 ---XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWT 948
                              YPSTG  GR Q P   SSGAFA NEPAEAA +DPLIGRKVWT
Sbjct: 181  RRGPPPGAKSKKSKSSTQYPSTGLPGRPQPPNRTSSGAFATNEPAEAAPYDPLIGRKVWT 240

Query: 947  RWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXX 768
            RWPEDN+FYEAVITDYN  EGRHALVYDINTADETWEWVNLKEISPEDI+WE EDP    
Sbjct: 241  RWPEDNHFYEAVITDYNAAEGRHALVYDINTADETWEWVNLKEISPEDIKWEGEDPGISR 300

Query: 767  XXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTD 591
                     G KKSM                     KDFP+ +NG+GKK LGDIEILHTD
Sbjct: 301  RGGRPGQGHGVKKSMSRGGGVAGAGRGRGSLKGQAKKDFPSKQNGVGKKVLGDIEILHTD 360

Query: 590  TLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGES 444
            TLIKEVEKVF AS PDP+EIEKAKK+L+EHE++LVDAIARLE+ASD ES
Sbjct: 361  TLIKEVEKVFGASHPDPIEIEKAKKVLKEHEQALVDAIARLEEASDDES 409


>gb|KHG14231.1| Protein EMSY [Gossypium arboreum]
          Length = 447

 Score =  558 bits (1439), Expect = e-156
 Identities = 292/420 (69%), Positives = 321/420 (76%), Gaps = 2/420 (0%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPA-GNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NRF  GG  A GNGRSAV+G+AP PRMH DME+QIH IEQ
Sbjct: 1    MDYGLSDSSGTDDDLPPSHQNRFQRGGRTAAGNGRSAVIGSAPLPRMHGDMETQIHLIEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAY S+LRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELL RVNADD++RRIREWR 
Sbjct: 61   EAYCSILRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLLRVNADDMLRRIREWRT 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL 1119
              GLQPGMLS+SQP+ DP+PSP+VS S KK KTS S+ASL++ APSPALHPS+QPSSS  
Sbjct: 121  ASGLQPGMLSTSQPMLDPVPSPSVSGSCKKMKTSHSVASLSMGAPSPALHPSMQPSSSAS 180

Query: 1118 XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWTRWP 939
                          S+PS G  G+ Q P   S+GAFA N  AEAA +DPLIGRKVWTRWP
Sbjct: 181  RRGPMPGAKSKKSKSHPSRGLPGKPQAPNRTSTGAFAANVRAEAAPYDPLIGRKVWTRWP 240

Query: 938  EDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXXXXX 759
            EDN+FYEAVITDYN VEGRHALVYDINTADETWEWVNLKEISPEDIRWE +DP       
Sbjct: 241  EDNHFYEAVITDYNSVEGRHALVYDINTADETWEWVNLKEISPEDIRWEGDDPGISRRGG 300

Query: 758  XXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTDTLI 582
                  G KKSM                     KDFP  +NG+GKK L DIEILHTDTLI
Sbjct: 301  HPGPGRGIKKSMACSGGVAGAGRGRGSLKGQAKKDFPLMQNGVGKKVLADIEILHTDTLI 360

Query: 581  KEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQSMD 402
            KEV KVF AS PD +EIEKAKK+L+EHE++LVDAIARLE ASDGESD GE  FSQGQSMD
Sbjct: 361  KEVGKVFGASHPDSIEIEKAKKVLKEHEQALVDAIARLEGASDGESD-GEHPFSQGQSMD 419


>ref|XP_011037909.1| PREDICTED: uncharacterized protein LOC105134965 isoform X2 [Populus
            euphratica]
          Length = 459

 Score =  558 bits (1437), Expect = e-156
 Identities = 303/467 (64%), Positives = 333/467 (71%), Gaps = 4/467 (0%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPAGNGRSAVVGTAPFPRMHSDMESQIHHIEQE 1476
            M YELSDSSGTDDDLPP HRNRF  G   AGNGRSAV G A  PR+HSDMESQIH+IEQE
Sbjct: 1    MDYELSDSSGTDDDLPPTHRNRFQSGVRTAGNGRSAVGGAASQPRLHSDMESQIHNIEQE 60

Query: 1475 AYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRKT 1296
            AY+SVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELL+RVNADD+IRRIREWRK 
Sbjct: 61   AYTSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLARVNADDMIRRIREWRKA 120

Query: 1295 RGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL- 1119
             G+QP M S++QP HDP+PSPTVS S KKQKTSQS ASL++ APSP LHPS+QPS+S L 
Sbjct: 121  NGIQPSMPSNAQPSHDPIPSPTVSGSRKKQKTSQSAASLSMGAPSPVLHPSMQPSTSALR 180

Query: 1118 --XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWTR 945
                               STG + RAQ    GSSG FA N+         LIG+KVWTR
Sbjct: 181  HGPSPGSGNKKPKSSMQQRSTGLSSRAQAANRGSSGVFATND---------LIGKKVWTR 231

Query: 944  WPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXXX 765
            WPEDN+FYEAVITDYNPVEGRHALVYDINT DETWEWVNLKEISPEDIRWE E+P     
Sbjct: 232  WPEDNHFYEAVITDYNPVEGRHALVYDINTGDETWEWVNLKEISPEDIRWEGEEPGLFRR 291

Query: 764  XXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTDT 588
                    G KK++                     KDFP  +NGIGKKA+GDIEILHT+T
Sbjct: 292  GGRPGPGRGNKKAIARGGAVVTAGRGRGTTKGQSKKDFPLIQNGIGKKAMGDIEILHTNT 351

Query: 587  LIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQS 408
            LIKEVEKVF AS PDP EIEKAKK L E E++LV+AIARLE+ASDGESD GE  F + QS
Sbjct: 352  LIKEVEKVFGASHPDPSEIEKAKKALEEQEQALVNAIARLEEASDGESDEGEHPFPRVQS 411

Query: 407  MDXXXXXXXXXXXQYDEMXXXXXXXXGSDGNKMAREGRVGSDAQQDE 267
            MD            YDE+        GSDGNKMAR GR+ S  Q DE
Sbjct: 412  MD---QDRGWRKRSYDEIVGEGRGIEGSDGNKMARNGRIVSSDQHDE 455


>gb|KJB83471.1| hypothetical protein B456_013G249400 [Gossypium raimondii]
          Length = 417

 Score =  556 bits (1432), Expect = e-155
 Identities = 292/414 (70%), Positives = 319/414 (77%), Gaps = 10/414 (2%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPA-GNGRSAVVGT--------APFPRMHSDME 1503
            M YELSDSSGTDDDLP  H+NRF  GG  A GNGRS VVG+        AP PR+H DME
Sbjct: 1    MDYELSDSSGTDDDLPSSHQNRFQRGGRTAAGNGRSTVVGSMGNGRSAVAPLPRIHGDME 60

Query: 1502 SQIHHIEQEAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDII 1323
            +QIH IEQEAYSSVLRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELLSRVNADD+I
Sbjct: 61   TQIHLIEQEAYSSVLRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLSRVNADDMI 120

Query: 1322 RRIREWRKTRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPS 1143
            RRIREWR   G+QPGMLS+SQP+HDP+PSP+VS S KKQKTSQS+ASL++ APSPALHPS
Sbjct: 121  RRIREWRAAGGIQPGMLSTSQPIHDPVPSPSVSGSRKKQKTSQSVASLSMVAPSPALHPS 180

Query: 1142 VQPSSSVLXXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIG 963
            +QPSSS L              SY STG  GR Q     SSGAFA NE AEAA +DPLIG
Sbjct: 181  MQPSSSALRRGPPSGAKSKKSKSYTSTGLPGRPQASNRMSSGAFATNESAEAAPYDPLIG 240

Query: 962  RKVWTRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESED 783
            RKVWTRWPEDN+FYEAVITDYN +EGRHALVYDINTADETWEWVNLKEISPEDIRWE +D
Sbjct: 241  RKVWTRWPEDNHFYEAVITDYNRLEGRHALVYDINTADETWEWVNLKEISPEDIRWEGDD 300

Query: 782  PXXXXXXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIE 606
            P             G KKSM                     KDFP ++NG+GKK LGDIE
Sbjct: 301  PAISRRGGRPGPGPGIKKSMAYGGGVVGAGRGRGNLKGQGKKDFPLTQNGVGKKVLGDIE 360

Query: 605  ILHTDTLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGES 444
            ILHTDTLIKEVEKVF A+ PDP+EIEKAKK+L EHE++LVDAIARLEDASDGES
Sbjct: 361  ILHTDTLIKEVEKVFGANHPDPVEIEKAKKVLNEHEQALVDAIARLEDASDGES 414


>gb|KJB55313.1| hypothetical protein B456_009G070600 [Gossypium raimondii]
            gi|763788321|gb|KJB55317.1| hypothetical protein
            B456_009G070600 [Gossypium raimondii]
          Length = 417

 Score =  555 bits (1431), Expect = e-155
 Identities = 290/417 (69%), Positives = 317/417 (76%), Gaps = 2/417 (0%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGG-HPAGNGRSAVVGTAPFPRMHSDMESQIHHIEQ 1479
            M Y LSDSSGTDDDLPP H+NRF  GG   AGNGRSAV+G+AP PRMH DME+QIH IEQ
Sbjct: 1    MDYGLSDSSGTDDDLPPSHQNRFQRGGCTAAGNGRSAVIGSAPLPRMHGDMETQIHLIEQ 60

Query: 1478 EAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDIIRRIREWRK 1299
            EAY S+LRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELL RVNADD+IRRIREWR 
Sbjct: 61   EAYCSILRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLLRVNADDMIRRIREWRT 120

Query: 1298 TRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPSVQPSSSVL 1119
              GLQPGMLS+SQP+ DP+PSP+VS S KK KTS S+ASL++ APSPALHPS+QPSSS  
Sbjct: 121  ASGLQPGMLSTSQPMLDPVPSPSVSGSRKKMKTSHSVASLSMGAPSPALHPSMQPSSSAS 180

Query: 1118 XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDPLIGRKVWTRWP 939
                          S+PS G  G+ Q P   S+GAFA N PAEAA +DPLIGRKVWTRWP
Sbjct: 181  RRGPMPGAKSKKSKSHPSRGLPGKPQAPNRTSTGAFAANVPAEAAPYDPLIGRKVWTRWP 240

Query: 938  EDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWESEDPXXXXXXX 759
            EDN+FYEAVITDYN VEGRHALVYDINTADETWEWVNLKEIS EDIRWE +DP       
Sbjct: 241  EDNHFYEAVITDYNSVEGRHALVYDINTADETWEWVNLKEISSEDIRWEGDDPGISRRGG 300

Query: 758  XXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALGDIEILHTDTLI 582
                  G KKSM                     KDFP  +NG+GKK L DIEILHTDTLI
Sbjct: 301  RPGPGRGIKKSMACGGGVAGAGRGRGSLKGQAKKDFPLMQNGVGKKVLADIEILHTDTLI 360

Query: 581  KEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDGGERQFSQGQ 411
            KEVEKVF AS PD +EIEKAKK+L EHE++LVDAIARLE ASDGES  GE  FSQGQ
Sbjct: 361  KEVEKVFGASHPDSIEIEKAKKVLTEHEQALVDAIARLEGASDGESADGEHPFSQGQ 417


>ref|XP_012462611.1| PREDICTED: protein EMSY-LIKE 3 isoform X1 [Gossypium raimondii]
            gi|823259796|ref|XP_012462612.1| PREDICTED: protein
            EMSY-LIKE 3 isoform X1 [Gossypium raimondii]
            gi|763816616|gb|KJB83468.1| hypothetical protein
            B456_013G249400 [Gossypium raimondii]
            gi|763816617|gb|KJB83469.1| hypothetical protein
            B456_013G249400 [Gossypium raimondii]
          Length = 440

 Score =  555 bits (1430), Expect = e-155
 Identities = 293/419 (69%), Positives = 320/419 (76%), Gaps = 13/419 (3%)
 Frame = -3

Query: 1655 MAYELSDSSGTDDDLPPPHRNRFAGGGHPA-GNGRSAVVGT--------APFPRMHSDME 1503
            M YELSDSSGTDDDLP  H+NRF  GG  A GNGRS VVG+        AP PR+H DME
Sbjct: 1    MDYELSDSSGTDDDLPSSHQNRFQRGGRTAAGNGRSTVVGSMGNGRSAVAPLPRIHGDME 60

Query: 1502 SQIHHIEQEAYSSVLRAFKAQSDAITWEKESLITELRKELRVSDEEHRELLSRVNADDII 1323
            +QIH IEQEAYSSVLRAFKAQSDA+TWEKESLITELRKELRVSDEEHRELLSRVNADD+I
Sbjct: 61   TQIHLIEQEAYSSVLRAFKAQSDALTWEKESLITELRKELRVSDEEHRELLSRVNADDMI 120

Query: 1322 RRIREWRKTRGLQPGMLSSSQPVHDPMPSPTVSASLKKQKTSQSIASLTLSAPSPALHPS 1143
            RRIREWR   G+QPGMLS+SQP+HDP+PSP+VS S KKQKTSQS+ASL++ APSPALHPS
Sbjct: 121  RRIREWRAAGGIQPGMLSTSQPIHDPVPSPSVSGSRKKQKTSQSVASLSMVAPSPALHPS 180

Query: 1142 VQPSSSVL---XXXXXXXXXXXXXXSYPSTGFTGRAQVPYHGSSGAFAVNEPAEAANFDP 972
            +QPSSS L                  Y STG  GR Q     SSGAFA NE AEAA +DP
Sbjct: 181  MQPSSSALRRGPPSGAKSKKSKSSTQYTSTGLPGRPQASNRMSSGAFATNESAEAAPYDP 240

Query: 971  LIGRKVWTRWPEDNNFYEAVITDYNPVEGRHALVYDINTADETWEWVNLKEISPEDIRWE 792
            LIGRKVWTRWPEDN+FYEAVITDYN +EGRHALVYDINTADETWEWVNLKEISPEDIRWE
Sbjct: 241  LIGRKVWTRWPEDNHFYEAVITDYNRLEGRHALVYDINTADETWEWVNLKEISPEDIRWE 300

Query: 791  SEDPXXXXXXXXXXXXXGFKKSM-XXXXXXXXXXXXXXXXXXXXKDFPASENGIGKKALG 615
             +DP             G KKSM                     KDFP ++NG+GKK LG
Sbjct: 301  GDDPAISRRGGRPGPGPGIKKSMAYGGGVVGAGRGRGNLKGQGKKDFPLTQNGVGKKVLG 360

Query: 614  DIEILHTDTLIKEVEKVFSASQPDPLEIEKAKKLLREHERSLVDAIARLEDASDGESDG 438
            DIEILHTDTLIKEVEKVF A+ PDP+EIEKAKK+L EHE++LVDAIARLEDASDGESDG
Sbjct: 361  DIEILHTDTLIKEVEKVFGANHPDPVEIEKAKKVLNEHEQALVDAIARLEDASDGESDG 419


Top