BLASTX nr result

ID: Forsythia21_contig00025433 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00025433
         (1822 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011083268.1| PREDICTED: uncharacterized protein LOC105165...   586   e-164
ref|XP_011083373.1| PREDICTED: uncharacterized protein LOC105165...   583   e-163
ref|XP_011083244.1| PREDICTED: uncharacterized protein LOC105165...   581   e-163
ref|XP_012832820.1| PREDICTED: histone-lysine N-methyltransferas...   520   e-144
gb|EYU41227.1| hypothetical protein MIMGU_mgv1a023175mg [Erythra...   507   e-140
ref|XP_010647005.1| PREDICTED: uncharacterized protein LOC104878...   486   e-134
emb|CDP07236.1| unnamed protein product [Coffea canephora]            486   e-134
ref|XP_007018610.1| Set domain protein, putative isoform 5 [Theo...   482   e-133
ref|XP_007018606.1| Set domain protein, putative isoform 1 [Theo...   482   e-133
ref|XP_012478184.1| PREDICTED: uncharacterized protein LOC105793...   474   e-131
ref|XP_012478181.1| PREDICTED: uncharacterized protein LOC105793...   474   e-131
ref|XP_012478188.1| PREDICTED: uncharacterized protein LOC105793...   474   e-131
ref|XP_002307834.2| hypothetical protein POPTR_0005s28130g [Popu...   472   e-130
ref|XP_007018609.1| Set domain protein, putative isoform 4 [Theo...   469   e-129
gb|KJB29707.1| hypothetical protein B456_005G115300 [Gossypium r...   461   e-127
ref|XP_008231636.1| PREDICTED: uncharacterized protein LOC103330...   459   e-126
ref|XP_010111522.1| Histone-lysine N-methyltransferase SETD1B [M...   455   e-125
ref|XP_011657472.1| PREDICTED: uncharacterized protein LOC101220...   455   e-125
ref|XP_011657471.1| PREDICTED: uncharacterized protein LOC101220...   455   e-125
ref|XP_009759057.1| PREDICTED: uncharacterized protein LOC104211...   452   e-124

>ref|XP_011083268.1| PREDICTED: uncharacterized protein LOC105165803 isoform X2 [Sesamum
            indicum]
          Length = 1151

 Score =  586 bits (1511), Expect = e-164
 Identities = 319/527 (60%), Positives = 375/527 (71%)
 Frame = -1

Query: 1819 CVPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCS 1640
            C  SQ  Q++S KLD       FQV+LM+S+ RIY              AIEKA+T  CS
Sbjct: 629  CSLSQIGQVQSFKLDGHAWKTTFQVALMISRLRIYDYVMKKFESLYDD-AIEKAITATCS 687

Query: 1639 SRRHDSCRNNNKGAENRVSNEKPDDRERSSKSSLLNGNYICYRKRKLGEKKSGSFFESLI 1460
             RR++S    NKG    ++ EKPDD ER S+ SLL   Y   R+RKL  KKS SF  SL 
Sbjct: 688  FRRYES---PNKGTVRCMNKEKPDDGERYSEVSLLKEEYTYSRRRKLSGKKSDSFILSLT 744

Query: 1459 AGDIGSQKQSIENSNKGNVLKHVPRSKKVKNMLLNLEKTRTENHSSKXXXXXXXXXXXXX 1280
             G+     ++ + S +   LK +P++ +V+ M+ +LEK   EN S+K             
Sbjct: 745  MGETDHLNRASKRSRRSYTLKTIPQAAQVQYMIPHLEKQGPENDSNKPCANVSILGEKGS 804

Query: 1279 XXXXXXXXSEKVADVVKDKSSCRTHKASFSPVDQCNIERITNEKSRGSDPLEIPAADRTK 1100
                    SEKVA  ++D SS  T   SF   DQ N+ERIT  KS  S+ L+  A   T 
Sbjct: 805  SMQNCSWRSEKVARAIQDDSSSNTRNTSFLTKDQHNLERITCAKSLESNSLDFEATGSTT 864

Query: 1099 KVSKLTKVAKLKRKQPIDDAPPSISKKVQKLANSSTKQAVCKKTVVQKIKRSKSRTMRPC 920
            K+ K +KV+KLKRKQ IDD       KVQKLAN   KQ++CK+    KIKRSKSR  RPC
Sbjct: 865  KMPKASKVSKLKRKQLIDDTQILRPGKVQKLANGVAKQSLCKQVDAHKIKRSKSRIARPC 924

Query: 919  PQSDGCARSSIDGWEWHKWSLNASPAERARITGTRLRSQHISLDGNGLQLPNVKGLSART 740
            PQS+GCARSS++GWEW +W+L ASP ERAR+ G+R  SQ+++ +  G    + KGLSART
Sbjct: 925  PQSNGCARSSMNGWEWREWALTASPGERARVRGSRPHSQYMNSECIGSHSSSFKGLSART 984

Query: 739  HRVKLRNLLAAAEGADLLKATQLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGE 560
            +RVKLRNLLAAAEGADLLKATQLKARKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGE
Sbjct: 985  NRVKLRNLLAAAEGADLLKATQLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 1044

Query: 559  LIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVITV 380
            LIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+V
Sbjct: 1045 LIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISV 1104

Query: 379  DGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRRCRGSLN 239
            +GQKKIFIYAKRHI+AGEE+TYNYKFPLEEKKIPC+CGSRRCRGSLN
Sbjct: 1105 EGQKKIFIYAKRHISAGEELTYNYKFPLEEKKIPCHCGSRRCRGSLN 1151


>ref|XP_011083373.1| PREDICTED: uncharacterized protein LOC105165914 [Sesamum indicum]
            gi|747072877|ref|XP_011083374.1| PREDICTED:
            uncharacterized protein LOC105165914 [Sesamum indicum]
          Length = 1151

 Score =  583 bits (1504), Expect = e-163
 Identities = 321/527 (60%), Positives = 376/527 (71%)
 Frame = -1

Query: 1819 CVPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCS 1640
            C  SQ  Q++S KLD       FQV+LM+S+ RIY              AIEKA+T  CS
Sbjct: 630  CALSQIGQVQSFKLDGHAWKTTFQVALMISRLRIYDYVMKKFESLCDD-AIEKAITATCS 688

Query: 1639 SRRHDSCRNNNKGAENRVSNEKPDDRERSSKSSLLNGNYICYRKRKLGEKKSGSFFESLI 1460
             RR++S    NK     ++ EKPDD ER S+ SLL   Y   R+RKLG KKS SFF SL 
Sbjct: 689  FRRYES---PNKVTVRCMNKEKPDDGERYSEVSLLKEEYTYSRRRKLGGKKSDSFFVSLT 745

Query: 1459 AGDIGSQKQSIENSNKGNVLKHVPRSKKVKNMLLNLEKTRTENHSSKXXXXXXXXXXXXX 1280
             G+     ++ + S +   LK +P++ +V+NM+ +LE+   EN S+K             
Sbjct: 746  MGETDHLNRASKRSRRSYTLKTIPQAAQVQNMIPHLEQG-PENGSNKPCANVSILGEKGS 804

Query: 1279 XXXXXXXXSEKVADVVKDKSSCRTHKASFSPVDQCNIERITNEKSRGSDPLEIPAADRTK 1100
                    SEKVA   +D SS  T   SF   DQ N+ERIT  K+   + L+  A   T 
Sbjct: 805  SMHNCSWRSEKVARAFQDDSSSNTRNTSFFIKDQHNLERITCAKNLELNSLDFEATGSTT 864

Query: 1099 KVSKLTKVAKLKRKQPIDDAPPSISKKVQKLANSSTKQAVCKKTVVQKIKRSKSRTMRPC 920
            K+ K TKV+KLKRKQ IDD       KVQKLAN   KQ++CK+  V KIKR+KSR  RPC
Sbjct: 865  KMPKATKVSKLKRKQLIDDTQNLRPGKVQKLANGVAKQSLCKQVDVHKIKRNKSRIARPC 924

Query: 919  PQSDGCARSSIDGWEWHKWSLNASPAERARITGTRLRSQHISLDGNGLQLPNVKGLSART 740
            PQS+GCARSS++GWEW +W+L ASP ERARI G+R  SQ+I+ +  G    + KGLSART
Sbjct: 925  PQSNGCARSSMNGWEWREWALTASPTERARIRGSRPHSQYINSECIGSHSSSFKGLSART 984

Query: 739  HRVKLRNLLAAAEGADLLKATQLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGE 560
            +RVKLRNLLAAAEGADLLKATQLKARKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGE
Sbjct: 985  NRVKLRNLLAAAEGADLLKATQLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 1044

Query: 559  LIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVITV 380
            LIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+V
Sbjct: 1045 LIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISV 1104

Query: 379  DGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRRCRGSLN 239
            +GQKKIFIYAKRHI+AGEE+TYNYKFPLEEKKIPC+CGSRRCRGSLN
Sbjct: 1105 EGQKKIFIYAKRHISAGEELTYNYKFPLEEKKIPCHCGSRRCRGSLN 1151


>ref|XP_011083244.1| PREDICTED: uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072647|ref|XP_011083245.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072649|ref|XP_011083246.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072651|ref|XP_011083247.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072653|ref|XP_011083248.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072655|ref|XP_011083249.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072657|ref|XP_011083250.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072659|ref|XP_011083252.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072661|ref|XP_011083253.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072663|ref|XP_011083254.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072665|ref|XP_011083255.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072667|ref|XP_011083256.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072669|ref|XP_011083257.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072671|ref|XP_011083258.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072673|ref|XP_011083259.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072675|ref|XP_011083260.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072677|ref|XP_011083261.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072679|ref|XP_011083262.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072681|ref|XP_011083263.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072683|ref|XP_011083264.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072685|ref|XP_011083265.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum] gi|747072687|ref|XP_011083266.1| PREDICTED:
            uncharacterized protein LOC105165803 isoform X1 [Sesamum
            indicum]
          Length = 1156

 Score =  581 bits (1498), Expect = e-163
 Identities = 320/532 (60%), Positives = 375/532 (70%), Gaps = 5/532 (0%)
 Frame = -1

Query: 1819 CVPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCS 1640
            C  SQ  Q++S KLD       FQV+LM+S+ RIY              AIEKA+T  CS
Sbjct: 629  CSLSQIGQVQSFKLDGHAWKTTFQVALMISRLRIYDYVMKKFESLYDD-AIEKAITATCS 687

Query: 1639 SRRHDSCRNNNKGAENRVSNEKPDDRERSSKSSLLNGNYICYRKRKLGEKKSGSFFESLI 1460
             RR++S    NKG    ++ EKPDD ER S+ SLL   Y   R+RKL  KKS SF  SL 
Sbjct: 688  FRRYES---PNKGTVRCMNKEKPDDGERYSEVSLLKEEYTYSRRRKLSGKKSDSFILSLT 744

Query: 1459 AGDIGSQKQSIENSNKGNVLKHVPRSKKVKNMLLNLEKTRTENHSSKXXXXXXXXXXXXX 1280
             G+     ++ + S +   LK +P++ +V+ M+ +LEK   EN S+K             
Sbjct: 745  MGETDHLNRASKRSRRSYTLKTIPQAAQVQYMIPHLEKQGPENDSNKPCANVSILGEKGS 804

Query: 1279 XXXXXXXXSEKVA-----DVVKDKSSCRTHKASFSPVDQCNIERITNEKSRGSDPLEIPA 1115
                    SEKVA     D  +D SS  T   SF   DQ N+ERIT  KS  S+ L+  A
Sbjct: 805  SMQNCSWRSEKVARAIQDDFFEDDSSSNTRNTSFLTKDQHNLERITCAKSLESNSLDFEA 864

Query: 1114 ADRTKKVSKLTKVAKLKRKQPIDDAPPSISKKVQKLANSSTKQAVCKKTVVQKIKRSKSR 935
               T K+ K +KV+KLKRKQ IDD       KVQKLAN   KQ++CK+    KIKRSKSR
Sbjct: 865  TGSTTKMPKASKVSKLKRKQLIDDTQILRPGKVQKLANGVAKQSLCKQVDAHKIKRSKSR 924

Query: 934  TMRPCPQSDGCARSSIDGWEWHKWSLNASPAERARITGTRLRSQHISLDGNGLQLPNVKG 755
              RPCPQS+GCARSS++GWEW +W+L ASP ERAR+ G+R  SQ+++ +  G    + KG
Sbjct: 925  IARPCPQSNGCARSSMNGWEWREWALTASPGERARVRGSRPHSQYMNSECIGSHSSSFKG 984

Query: 754  LSARTHRVKLRNLLAAAEGADLLKATQLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVI 575
            LSART+RVKLRNLLAAAEGADLLKATQLKARKKRL FQ+SKIHDWG+VALEPIEAEDFVI
Sbjct: 985  LSARTNRVKLRNLLAAAEGADLLKATQLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 1044

Query: 574  EYVGELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYT 395
            EYVGELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYT
Sbjct: 1045 EYVGELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYT 1104

Query: 394  KVITVDGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRRCRGSLN 239
            KVI+V+GQKKIFIYAKRHI+AGEE+TYNYKFPLEEKKIPC+CGSRRCRGSLN
Sbjct: 1105 KVISVEGQKKIFIYAKRHISAGEELTYNYKFPLEEKKIPCHCGSRRCRGSLN 1156


>ref|XP_012832820.1| PREDICTED: histone-lysine N-methyltransferase ATXR7 isoform X1
            [Erythranthe guttatus] gi|848864135|ref|XP_012832821.1|
            PREDICTED: histone-lysine N-methyltransferase ATXR7
            isoform X1 [Erythranthe guttatus]
            gi|848864138|ref|XP_012832823.1| PREDICTED:
            histone-lysine N-methyltransferase ATXR7 isoform X1
            [Erythranthe guttatus] gi|848864140|ref|XP_012832824.1|
            PREDICTED: histone-lysine N-methyltransferase ATXR7
            isoform X1 [Erythranthe guttatus]
            gi|848864142|ref|XP_012832825.1| PREDICTED:
            histone-lysine N-methyltransferase ATXR7 isoform X1
            [Erythranthe guttatus] gi|848864145|ref|XP_012832826.1|
            PREDICTED: histone-lysine N-methyltransferase ATXR7
            isoform X1 [Erythranthe guttatus]
          Length = 1081

 Score =  520 bits (1339), Expect = e-144
 Identities = 290/528 (54%), Positives = 353/528 (66%)
 Frame = -1

Query: 1822 HCVPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWC 1643
            H   SQ C+L   KL        FQV+LM+S+ RIY             DAIEKA+T+  
Sbjct: 611  HYASSQICRLPLFKLGGHAWKTTFQVALMISRVRIYDCVMRKIKSICLDDAIEKAVTMMQ 670

Query: 1642 SSRRHDSCRNNNKGAENRVSNEKPDDRERSSKSSLLNGNYICYRKRKLGEKKSGSFFESL 1463
            S RR++S     KG  N ++ +K +  ERSS++S+L G Y+  R+RKLG K S SFF+SL
Sbjct: 671  SMRRNES---GKKGTMNWMNKKKHEGLERSSETSVLIGTYVYSRRRKLGSKSSASFFQSL 727

Query: 1462 IAGDIGSQKQSIENSNKGNVLKHVPRSKKVKNMLLNLEKTRTENHSSKXXXXXXXXXXXX 1283
             A       ++ + ++K    +++P +  V  ++ NL+K   E+                
Sbjct: 728  AA-------ENTKKTSKRGRRRNIPEATAVGKIVSNLDKKILEH---------------- 764

Query: 1282 XXXXXXXXXSEKVADVVKDKSSCRTHKASFSPVDQCNIERITNEKSRGSDPLEIPAADRT 1103
                                 SC+    + +P  + +   I ++KS      E+  A + 
Sbjct: 765  --------------------DSCQPPANAATPGKKRSSMHICDQKSE-----EVAHAVQA 799

Query: 1102 KKVSKLTKVAKLKRKQPIDDAPPSISKKVQKLANSSTKQAVCKKTVVQKIKRSKSRTMRP 923
             KVSKL      KRKQ +DD P S S KV KLAN   + A+CK+    KIKRSKSR +R 
Sbjct: 800  SKVSKL------KRKQLVDDTPHSRSGKVPKLANGIVEHALCKQIDTHKIKRSKSRAVRA 853

Query: 922  CPQSDGCARSSIDGWEWHKWSLNASPAERARITGTRLRSQHISLDGNGLQLPNVKGLSAR 743
            CP+SDGCARSS+DGWEW KW+  ASP ERAR+ GT + S  I+ + NG    N KGLSAR
Sbjct: 854  CPKSDGCARSSMDGWEWRKWASTASPTERARVRGTHIYSGPINSECNGSHSSNFKGLSAR 913

Query: 742  THRVKLRNLLAAAEGADLLKATQLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVG 563
            T+RVKLRNLLAAA+GADLLK+TQLKARKKRL FQ+SKIHDWG++ALEPIEAEDFVIEYVG
Sbjct: 914  TNRVKLRNLLAAADGADLLKSTQLKARKKRLRFQRSKIHDWGLLALEPIEAEDFVIEYVG 973

Query: 562  ELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVIT 383
            ELIRP ISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+
Sbjct: 974  ELIRPSISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVIS 1033

Query: 382  VDGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRRCRGSLN 239
            V+GQKKIFIYAKRHIA+GEE+TYNYKFPLEE KIPCNCGS+RCRGSLN
Sbjct: 1034 VEGQKKIFIYAKRHIASGEELTYNYKFPLEENKIPCNCGSKRCRGSLN 1081


>gb|EYU41227.1| hypothetical protein MIMGU_mgv1a023175mg [Erythranthe guttata]
          Length = 1075

 Score =  507 bits (1305), Expect = e-140
 Identities = 284/522 (54%), Positives = 347/522 (66%)
 Frame = -1

Query: 1822 HCVPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWC 1643
            H   SQ C+L   KL        FQV+LM+S+ RIY             DAIEKA+T+  
Sbjct: 611  HYASSQICRLPLFKLGGHAWKTTFQVALMISRVRIYDCVMRKIKSICLDDAIEKAVTMMQ 670

Query: 1642 SSRRHDSCRNNNKGAENRVSNEKPDDRERSSKSSLLNGNYICYRKRKLGEKKSGSFFESL 1463
            S RR++S     KG  N ++ +K +  ERSS++S+L G Y+  R+RKLG K S SFF+SL
Sbjct: 671  SMRRNES---GKKGTMNWMNKKKHEGLERSSETSVLIGTYVYSRRRKLGSKSSASFFQSL 727

Query: 1462 IAGDIGSQKQSIENSNKGNVLKHVPRSKKVKNMLLNLEKTRTENHSSKXXXXXXXXXXXX 1283
             A       ++ + ++K    +++P +  V  ++ NL+K   E+                
Sbjct: 728  AA-------ENTKKTSKRGRRRNIPEATAVGKIVSNLDKKILEH---------------- 764

Query: 1282 XXXXXXXXXSEKVADVVKDKSSCRTHKASFSPVDQCNIERITNEKSRGSDPLEIPAADRT 1103
                                 SC+    + +P  + +   I ++KS      E+  A + 
Sbjct: 765  --------------------DSCQPPANAATPGKKRSSMHICDQKSE-----EVAHAVQA 799

Query: 1102 KKVSKLTKVAKLKRKQPIDDAPPSISKKVQKLANSSTKQAVCKKTVVQKIKRSKSRTMRP 923
             KVSKL      KRKQ +DD P S S KV KLAN   + A+CK+    KIKRSKSR +R 
Sbjct: 800  SKVSKL------KRKQLVDDTPHSRSGKVPKLANGIVEHALCKQIDTHKIKRSKSRAVRA 853

Query: 922  CPQSDGCARSSIDGWEWHKWSLNASPAERARITGTRLRSQHISLDGNGLQLPNVKGLSAR 743
            CP+SDGCARSS+DGWEW KW+  ASP ERAR+ GT + S  I+ + NG    N KGLSAR
Sbjct: 854  CPKSDGCARSSMDGWEWRKWASTASPTERARVRGTHIYSGPINSECNGSHSSNFKGLSAR 913

Query: 742  THRVKLRNLLAAAEGADLLKATQLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVG 563
            T+RVKLRNLLAAA+GADLLK+TQLKARKKRL FQ+SKIHDWG++ALEPIEAEDFVIEYVG
Sbjct: 914  TNRVKLRNLLAAADGADLLKSTQLKARKKRLRFQRSKIHDWGLLALEPIEAEDFVIEYVG 973

Query: 562  ELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVIT 383
            ELIRP ISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+
Sbjct: 974  ELIRPSISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVIS 1033

Query: 382  VDGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRR 257
            V+GQKKIFIYAKRHIA+GEE+TYNYKFPLEE KIPCNCGS+R
Sbjct: 1034 VEGQKKIFIYAKRHIASGEELTYNYKFPLEENKIPCNCGSKR 1075


>ref|XP_010647005.1| PREDICTED: uncharacterized protein LOC104878403 [Vitis vinifera]
          Length = 1301

 Score =  486 bits (1250), Expect = e-134
 Identities = 290/562 (51%), Positives = 351/562 (62%), Gaps = 36/562 (6%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCSS 1637
            VPSQ C+ R    DECT  +   V+L + ++R++               +++    W +S
Sbjct: 746  VPSQICRFRPSSSDECTPIIGEYVALALCRQRLHEDVLQEWKDLLVEGTLDQFFASWWTS 805

Query: 1636 RRHDSCRNNNKGAENRVSN---EKPDD--------RER--------SSKSSLLNGNYICY 1514
            ++    R ++ G E  VSN   EKP D        RER        S + SL+ G Y  Y
Sbjct: 806  KQ----RCDSTGCEEGVSNSNKEKPCDSSAASDQRRERTKDRHSLGSPELSLVIGKYTYY 861

Query: 1513 RKRKLGEKKSGSFFESLIAGDIGSQKQSIENSNKGNVLKHVPRSKKV-----KNMLLNLE 1349
            RK+KL  KK GS   +  + D GSQ Q +E S K +V   V    +V     K   + L 
Sbjct: 862  RKKKLVRKKIGSLSHAAASVDSGSQDQLMEKSRKQDVPGDVSEITEVEMGILKRRKIGLN 921

Query: 1348 KTRTENHSSKXXXXXXXXXXXXXXXXXXXXXSEKVA------DVVKDKSSCRTHKASFSP 1187
                E++S +                     S K A      +V++D  +C   +AS   
Sbjct: 922  TCHAEDNSLQAIVQSTLPGDSSSVRIKPNRRSTKCAHVVRNGEVIEDDLACGREEASPFA 981

Query: 1186 VDQCNIERITNEKSRGSDP--LEIPAADRTKKVSKLTKVAKLKRKQPIDDAPPSISKKVQ 1013
             D   ++++ N    G D   L+  A D +KK +K TKV+K KRK  + D P S S KV 
Sbjct: 982  EDCDFVDKVVNSNGNGHDVGNLKELAGDCSKK-TKSTKVSKKKRKD-LKDVPSSRSAKVL 1039

Query: 1012 KLANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNASPAERA 833
            K AN + KQ   ++  V K K SK +T+ PC +S GCARSSI+GW+W  WSLNASP ERA
Sbjct: 1040 KPANGAAKQDTGRQVAVHKSKFSKFKTLNPCLRSVGCARSSINGWDWRNWSLNASPTERA 1099

Query: 832  RITGTRLRS----QHISLDGNGLQLPNVKGLSARTHRVKLRNLLAAAEGADLLKATQLKA 665
             + G         Q+   +    QL NVKGLSART+RVK+RNLLAAAEGADLLKATQLKA
Sbjct: 1100 HVRGIHKAQFACDQYFRSEVVSSQLSNVKGLSARTNRVKMRNLLAAAEGADLLKATQLKA 1159

Query: 664  RKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYLF 485
            RKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRER YEKMGIGSSYLF
Sbjct: 1160 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIRERLYEKMGIGSSYLF 1219

Query: 484  RLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEITYNYK 305
            RLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+V+G+KKIFIYAKR I AGEEITYNYK
Sbjct: 1220 RLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGEKKIFIYAKRQITAGEEITYNYK 1279

Query: 304  FPLEEKKIPCNCGSRRCRGSLN 239
            FPLEEKKIPCNCGS+RCRGSLN
Sbjct: 1280 FPLEEKKIPCNCGSKRCRGSLN 1301


>emb|CDP07236.1| unnamed protein product [Coffea canephora]
          Length = 1202

 Score =  486 bits (1250), Expect = e-134
 Identities = 277/539 (51%), Positives = 351/539 (65%), Gaps = 12/539 (2%)
 Frame = -1

Query: 1819 CVPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAI---EKALTI 1649
            C+ SQ  +++  + DE    +     L + + R++             DAI      LT 
Sbjct: 664  CITSQARRVKPSRSDESVPRMTLDAVLTVCRLRVHDVVLRELKLMLVDDAILGTSMTLTP 723

Query: 1648 WCSSRRHDSCRNNNKGAENRVS-NEKPDDRERSSKSSLLNGNYICYRKRKLGEKKSGSFF 1472
                 R D       G  +  S +E      RSS+   L+G +  YRK+KL  + SGS  
Sbjct: 724  LKKLLRSDHSEGLGSGRLDENSFDEFKKYGHRSSRVLELSGKHTYYRKKKLARRNSGSVS 783

Query: 1471 ESLI-AGDIGSQKQSIENSNKGNVLKHVPRSKKVKNMLLNLEKTRTENHSSKXXXXXXXX 1295
            +S   AG I   +QS++ S K  + + +P + +++N ++N E+   ++  +         
Sbjct: 784  QSAATAGSIRLLRQSVQKSRKHEISEGIPENARLENAVVNAERYAVQSCRNDVHNAADAL 843

Query: 1294 XXXXXXXXXXXXXSEKVADVVK---DKSSCRTHKASFSPVDQCNIERITNEKSRGSDPLE 1124
                          EKV+  VK   D +S      SFS  D  ++E+I   +S+    L+
Sbjct: 844  GDSFLLDNVCNKKFEKVSREVKAREDLASRSRKTTSFSTQDTKDLEKIARSRSKKFAKLD 903

Query: 1123 IPAADRTKKV--SKLTKVAKLKRKQPIDDAPPSISKKVQKLANSSTKQAVCKKTVVQKIK 950
            + ++   +K+  +  +KV KLKRKQ  DD   S S+KV +++  + KQA  K   ++K++
Sbjct: 904  LQSSGCLEKMPNNPASKVVKLKRKQVEDDMAQSQSRKVLRVSKGAGKQAASKHVTIEKVR 963

Query: 949  RS-KSRTMRPCPQSDGCARSSIDGWEWHKWSLNASPAERARITGT-RLRSQHISLDGNGL 776
             + KSR   P PQS+GC R S++GWEW KWSLNASPA+RAR  GT R+ +Q+I  +  G 
Sbjct: 964  MTCKSRKGAPFPQSEGCTRCSVNGWEWRKWSLNASPADRARARGTTRVHAQNIISNAPGS 1023

Query: 775  QLPNVKGLSARTHRVKLRNLLAAAEGADLLKATQLKARKKRLCFQKSKIHDWGIVALEPI 596
            Q  ++KGLSART+RVKLRNLLAAAEGADLLKATQLKARKKRL FQ+S IHDWG+VALEPI
Sbjct: 1024 QSSSIKGLSARTNRVKLRNLLAAAEGADLLKATQLKARKKRLRFQRSMIHDWGLVALEPI 1083

Query: 595  EAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 416
            EAEDFVIEYVGELIRPRISDIRER YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS
Sbjct: 1084 EAEDFVIEYVGELIRPRISDIRERHYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 1143

Query: 415  CEPNCYTKVITVDGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRRCRGSLN 239
            CEPNCYTKVI+V+GQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRRCRGSLN
Sbjct: 1144 CEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRRCRGSLN 1202


>ref|XP_007018610.1| Set domain protein, putative isoform 5 [Theobroma cacao]
            gi|508723938|gb|EOY15835.1| Set domain protein, putative
            isoform 5 [Theobroma cacao]
          Length = 1001

 Score =  482 bits (1240), Expect = e-133
 Identities = 284/562 (50%), Positives = 355/562 (63%), Gaps = 36/562 (6%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCSS 1637
            VPS  C+ R  + DE +  +   V++ M +++++               + + LT W S 
Sbjct: 449  VPSHLCKFRPSRSDERSPKIGEYVAVAMCRQKLHEDVLREWKSSFIDATLYQFLTSWRSL 508

Query: 1636 RRHDSCRNNNK-------GAENRVSNEKPDD--RERSSKS--------SLLNGNYICYRK 1508
            ++   C+ ++K       G E    +    D  RERS KS        SL+ G Y  YRK
Sbjct: 509  KKR--CKADSKEERAFSVGREILADSSAIGDKLRERSKKSQSSGSSEVSLVTGKYTYYRK 566

Query: 1507 RKLGEKKSGSFFESLIAGDIGSQKQSIENSNKG----NVLKHV---PRSKKVKNMLLN-- 1355
            +KL  KK GS   +++ G   SQ   +E   K     N+L H    P +   K + +N  
Sbjct: 567  KKLVRKKIGSTQSTIVNG---SQNHPVERPRKKEASRNLLDHADPEPTAATSKKVGINKS 623

Query: 1354 LEKTRTENHSSKXXXXXXXXXXXXXXXXXXXXXSEKVADVVKD----KSSCRTHKASFSP 1187
              ++ T + SSK                       KV   V+     + + +  +   S 
Sbjct: 624  ASQSSTVSRSSKTIAKSSLLNDHSILKSAGGRKKTKVTLAVQKNLVGEGAVQVSRERAST 683

Query: 1186 VDQCNIERITNEKSR--GSDPLEIPAADRTKKVSKLTKVAKLKRKQPIDDAPPSISKKVQ 1013
               C+++++    +   GS+ +E+   D  KK  K  KV+++KRKQ  +D PP +  KVQ
Sbjct: 684  SQNCDVKKVVGRTNHIVGSE-VEL-TNDSHKKTLKAPKVSRVKRKQLDNDEPPLLPTKVQ 741

Query: 1012 KLANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNASPAERA 833
            K+ANS++K    +    +     +SRT   CP+SDGCARSSI+GWEWHKWSLNASPAERA
Sbjct: 742  KVANSASKHPSSRGNADRNTHSIRSRTANSCPRSDGCARSSINGWEWHKWSLNASPAERA 801

Query: 832  RITGTRLRSQHISLDG----NGLQLPNVKGLSARTHRVKLRNLLAAAEGADLLKATQLKA 665
            R+ G  ++  H+   G    N +QL N KGLSART+RVKLRNLLAAAEGADLLKATQLKA
Sbjct: 802  RVRG--IQCTHMKYSGSEVNNMMQLSNGKGLSARTNRVKLRNLLAAAEGADLLKATQLKA 859

Query: 664  RKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYLF 485
            RKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRE  YEKMGIGSSYLF
Sbjct: 860  RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIREHYYEKMGIGSSYLF 919

Query: 484  RLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEITYNYK 305
            RLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+V+GQKKIFIYAKRHIAAGEEITYNYK
Sbjct: 920  RLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 979

Query: 304  FPLEEKKIPCNCGSRRCRGSLN 239
            FPLEEKKIPCNCGS++CRGSLN
Sbjct: 980  FPLEEKKIPCNCGSKKCRGSLN 1001


>ref|XP_007018606.1| Set domain protein, putative isoform 1 [Theobroma cacao]
            gi|590597427|ref|XP_007018607.1| Set domain protein,
            putative isoform 1 [Theobroma cacao]
            gi|590597431|ref|XP_007018608.1| Set domain protein,
            putative isoform 1 [Theobroma cacao]
            gi|508723934|gb|EOY15831.1| Set domain protein, putative
            isoform 1 [Theobroma cacao] gi|508723935|gb|EOY15832.1|
            Set domain protein, putative isoform 1 [Theobroma cacao]
            gi|508723936|gb|EOY15833.1| Set domain protein, putative
            isoform 1 [Theobroma cacao]
          Length = 1241

 Score =  482 bits (1240), Expect = e-133
 Identities = 284/562 (50%), Positives = 355/562 (63%), Gaps = 36/562 (6%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCSS 1637
            VPS  C+ R  + DE +  +   V++ M +++++               + + LT W S 
Sbjct: 689  VPSHLCKFRPSRSDERSPKIGEYVAVAMCRQKLHEDVLREWKSSFIDATLYQFLTSWRSL 748

Query: 1636 RRHDSCRNNNK-------GAENRVSNEKPDD--RERSSKS--------SLLNGNYICYRK 1508
            ++   C+ ++K       G E    +    D  RERS KS        SL+ G Y  YRK
Sbjct: 749  KKR--CKADSKEERAFSVGREILADSSAIGDKLRERSKKSQSSGSSEVSLVTGKYTYYRK 806

Query: 1507 RKLGEKKSGSFFESLIAGDIGSQKQSIENSNKG----NVLKHV---PRSKKVKNMLLN-- 1355
            +KL  KK GS   +++ G   SQ   +E   K     N+L H    P +   K + +N  
Sbjct: 807  KKLVRKKIGSTQSTIVNG---SQNHPVERPRKKEASRNLLDHADPEPTAATSKKVGINKS 863

Query: 1354 LEKTRTENHSSKXXXXXXXXXXXXXXXXXXXXXSEKVADVVKD----KSSCRTHKASFSP 1187
              ++ T + SSK                       KV   V+     + + +  +   S 
Sbjct: 864  ASQSSTVSRSSKTIAKSSLLNDHSILKSAGGRKKTKVTLAVQKNLVGEGAVQVSRERAST 923

Query: 1186 VDQCNIERITNEKSR--GSDPLEIPAADRTKKVSKLTKVAKLKRKQPIDDAPPSISKKVQ 1013
               C+++++    +   GS+ +E+   D  KK  K  KV+++KRKQ  +D PP +  KVQ
Sbjct: 924  SQNCDVKKVVGRTNHIVGSE-VEL-TNDSHKKTLKAPKVSRVKRKQLDNDEPPLLPTKVQ 981

Query: 1012 KLANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNASPAERA 833
            K+ANS++K    +    +     +SRT   CP+SDGCARSSI+GWEWHKWSLNASPAERA
Sbjct: 982  KVANSASKHPSSRGNADRNTHSIRSRTANSCPRSDGCARSSINGWEWHKWSLNASPAERA 1041

Query: 832  RITGTRLRSQHISLDG----NGLQLPNVKGLSARTHRVKLRNLLAAAEGADLLKATQLKA 665
            R+ G  ++  H+   G    N +QL N KGLSART+RVKLRNLLAAAEGADLLKATQLKA
Sbjct: 1042 RVRG--IQCTHMKYSGSEVNNMMQLSNGKGLSARTNRVKLRNLLAAAEGADLLKATQLKA 1099

Query: 664  RKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYLF 485
            RKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRE  YEKMGIGSSYLF
Sbjct: 1100 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIREHYYEKMGIGSSYLF 1159

Query: 484  RLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEITYNYK 305
            RLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+V+GQKKIFIYAKRHIAAGEEITYNYK
Sbjct: 1160 RLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 1219

Query: 304  FPLEEKKIPCNCGSRRCRGSLN 239
            FPLEEKKIPCNCGS++CRGSLN
Sbjct: 1220 FPLEEKKIPCNCGSKKCRGSLN 1241


>ref|XP_012478184.1| PREDICTED: uncharacterized protein LOC105793866 isoform X2 [Gossypium
            raimondii] gi|823156531|ref|XP_012478185.1| PREDICTED:
            uncharacterized protein LOC105793866 isoform X2
            [Gossypium raimondii] gi|823156533|ref|XP_012478186.1|
            PREDICTED: uncharacterized protein LOC105793866 isoform
            X2 [Gossypium raimondii] gi|823156535|ref|XP_012478187.1|
            PREDICTED: uncharacterized protein LOC105793866 isoform
            X2 [Gossypium raimondii]
          Length = 1224

 Score =  474 bits (1221), Expect = e-131
 Identities = 278/566 (49%), Positives = 344/566 (60%), Gaps = 40/566 (7%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDA-IEKALTIWCS 1640
            VPS NC+ R +    C+  +   V++ M +++++             DA + + L +  S
Sbjct: 670  VPSHNCKFRPLTSVGCSPKIGEYVAMAMCRQKLHDDVLREWKSSFAGDASLYQFLILRSS 729

Query: 1639 SRRHDSCRNNNKGAEN-----------RVSNEKPDDRERSSKSS------LLNGNYICYR 1511
            S++H  C+ + K A+              S +KP D  R S SS      L+ G    YR
Sbjct: 730  SKKH--CKADGKEAKTFSEDRKNLAGFSASRDKPRDGSRKSLSSGSSDISLVTGTCTYYR 787

Query: 1510 KRKLGEKKSGSFFESLIAG-------------------DIGSQKQSIENSNKGNVLKHVP 1388
            K+KL  KK GS   ++I G                   D   QK S   S KG   K + 
Sbjct: 788  KKKLVHKKVGSSLSTIINGSRDQPVERPRTKRPSKNLLDHADQKLSAATSKKGGTNKSMS 847

Query: 1387 RSKKVKNMLLNLEKTRTEN-HSSKXXXXXXXXXXXXXXXXXXXXXSEKVADVVKDKSSCR 1211
            +S  +      + K    N HS                        E    V ++++S  
Sbjct: 848  QSSNISRSSKIIAKNSLPNDHSLPKSAIGRKTSKGAAAAVRKNLIGEGAIKVGRERAST- 906

Query: 1210 THKASFSPVDQCNIERITNEKSRGSDPLEIPAADRTKKVSKLTKVAKLKRKQPIDDAPPS 1031
                       C++E+I  + +           D +KK  K  KV+ +KRKQ   D  PS
Sbjct: 907  --------FQNCDVEKIARKSNHTVGSEGEVTNDSSKKTLKAKKVSGVKRKQLNYDECPS 958

Query: 1030 ISKKVQKLANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNA 851
             S KVQK+A+  +K +  +    QK +  +SRT  PCP+SDGCAR+SI+GWEWHKWSLNA
Sbjct: 959  PSIKVQKVASCGSKSSSSRGVADQKSRTVRSRTANPCPRSDGCARTSINGWEWHKWSLNA 1018

Query: 850  SPAERARITGTR-LRSQHISLDGNGL-QLPNVKGLSARTHRVKLRNLLAAAEGADLLKAT 677
            SPAERAR+ G + ++ ++   + N +  L N KGLSART+RVKLRNLLAA EGADLLKAT
Sbjct: 1019 SPAERARVRGVQCIQMKYSGPEVNSMTHLSNSKGLSARTNRVKLRNLLAAVEGADLLKAT 1078

Query: 676  QLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGS 497
            QLKARKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRE  YEKMGIGS
Sbjct: 1079 QLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIREHYYEKMGIGS 1138

Query: 496  SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEIT 317
            SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+V+GQKKIFIYAKRHIAAGEE+T
Sbjct: 1139 SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEVT 1198

Query: 316  YNYKFPLEEKKIPCNCGSRRCRGSLN 239
            YNYKFPLEEKKIPCNCGS++CRGSLN
Sbjct: 1199 YNYKFPLEEKKIPCNCGSKKCRGSLN 1224


>ref|XP_012478181.1| PREDICTED: uncharacterized protein LOC105793866 isoform X1 [Gossypium
            raimondii] gi|823156525|ref|XP_012478182.1| PREDICTED:
            uncharacterized protein LOC105793866 isoform X1
            [Gossypium raimondii] gi|823156527|ref|XP_012478183.1|
            PREDICTED: uncharacterized protein LOC105793866 isoform
            X1 [Gossypium raimondii]
          Length = 1228

 Score =  474 bits (1221), Expect = e-131
 Identities = 278/566 (49%), Positives = 344/566 (60%), Gaps = 40/566 (7%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDA-IEKALTIWCS 1640
            VPS NC+ R +    C+  +   V++ M +++++             DA + + L +  S
Sbjct: 674  VPSHNCKFRPLTSVGCSPKIGEYVAMAMCRQKLHDDVLREWKSSFAGDASLYQFLILRSS 733

Query: 1639 SRRHDSCRNNNKGAEN-----------RVSNEKPDDRERSSKSS------LLNGNYICYR 1511
            S++H  C+ + K A+              S +KP D  R S SS      L+ G    YR
Sbjct: 734  SKKH--CKADGKEAKTFSEDRKNLAGFSASRDKPRDGSRKSLSSGSSDISLVTGTCTYYR 791

Query: 1510 KRKLGEKKSGSFFESLIAG-------------------DIGSQKQSIENSNKGNVLKHVP 1388
            K+KL  KK GS   ++I G                   D   QK S   S KG   K + 
Sbjct: 792  KKKLVHKKVGSSLSTIINGSRDQPVERPRTKRPSKNLLDHADQKLSAATSKKGGTNKSMS 851

Query: 1387 RSKKVKNMLLNLEKTRTEN-HSSKXXXXXXXXXXXXXXXXXXXXXSEKVADVVKDKSSCR 1211
            +S  +      + K    N HS                        E    V ++++S  
Sbjct: 852  QSSNISRSSKIIAKNSLPNDHSLPKSAIGRKTSKGAAAAVRKNLIGEGAIKVGRERAST- 910

Query: 1210 THKASFSPVDQCNIERITNEKSRGSDPLEIPAADRTKKVSKLTKVAKLKRKQPIDDAPPS 1031
                       C++E+I  + +           D +KK  K  KV+ +KRKQ   D  PS
Sbjct: 911  --------FQNCDVEKIARKSNHTVGSEGEVTNDSSKKTLKAKKVSGVKRKQLNYDECPS 962

Query: 1030 ISKKVQKLANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNA 851
             S KVQK+A+  +K +  +    QK +  +SRT  PCP+SDGCAR+SI+GWEWHKWSLNA
Sbjct: 963  PSIKVQKVASCGSKSSSSRGVADQKSRTVRSRTANPCPRSDGCARTSINGWEWHKWSLNA 1022

Query: 850  SPAERARITGTR-LRSQHISLDGNGL-QLPNVKGLSARTHRVKLRNLLAAAEGADLLKAT 677
            SPAERAR+ G + ++ ++   + N +  L N KGLSART+RVKLRNLLAA EGADLLKAT
Sbjct: 1023 SPAERARVRGVQCIQMKYSGPEVNSMTHLSNSKGLSARTNRVKLRNLLAAVEGADLLKAT 1082

Query: 676  QLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGS 497
            QLKARKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRE  YEKMGIGS
Sbjct: 1083 QLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIREHYYEKMGIGS 1142

Query: 496  SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEIT 317
            SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+V+GQKKIFIYAKRHIAAGEE+T
Sbjct: 1143 SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEVT 1202

Query: 316  YNYKFPLEEKKIPCNCGSRRCRGSLN 239
            YNYKFPLEEKKIPCNCGS++CRGSLN
Sbjct: 1203 YNYKFPLEEKKIPCNCGSKKCRGSLN 1228


>ref|XP_012478188.1| PREDICTED: uncharacterized protein LOC105793866 isoform X3 [Gossypium
            raimondii] gi|763762452|gb|KJB29706.1| hypothetical
            protein B456_005G115300 [Gossypium raimondii]
          Length = 1217

 Score =  474 bits (1221), Expect = e-131
 Identities = 278/566 (49%), Positives = 344/566 (60%), Gaps = 40/566 (7%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDA-IEKALTIWCS 1640
            VPS NC+ R +    C+  +   V++ M +++++             DA + + L +  S
Sbjct: 663  VPSHNCKFRPLTSVGCSPKIGEYVAMAMCRQKLHDDVLREWKSSFAGDASLYQFLILRSS 722

Query: 1639 SRRHDSCRNNNKGAEN-----------RVSNEKPDDRERSSKSS------LLNGNYICYR 1511
            S++H  C+ + K A+              S +KP D  R S SS      L+ G    YR
Sbjct: 723  SKKH--CKADGKEAKTFSEDRKNLAGFSASRDKPRDGSRKSLSSGSSDISLVTGTCTYYR 780

Query: 1510 KRKLGEKKSGSFFESLIAG-------------------DIGSQKQSIENSNKGNVLKHVP 1388
            K+KL  KK GS   ++I G                   D   QK S   S KG   K + 
Sbjct: 781  KKKLVHKKVGSSLSTIINGSRDQPVERPRTKRPSKNLLDHADQKLSAATSKKGGTNKSMS 840

Query: 1387 RSKKVKNMLLNLEKTRTEN-HSSKXXXXXXXXXXXXXXXXXXXXXSEKVADVVKDKSSCR 1211
            +S  +      + K    N HS                        E    V ++++S  
Sbjct: 841  QSSNISRSSKIIAKNSLPNDHSLPKSAIGRKTSKGAAAAVRKNLIGEGAIKVGRERAST- 899

Query: 1210 THKASFSPVDQCNIERITNEKSRGSDPLEIPAADRTKKVSKLTKVAKLKRKQPIDDAPPS 1031
                       C++E+I  + +           D +KK  K  KV+ +KRKQ   D  PS
Sbjct: 900  --------FQNCDVEKIARKSNHTVGSEGEVTNDSSKKTLKAKKVSGVKRKQLNYDECPS 951

Query: 1030 ISKKVQKLANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNA 851
             S KVQK+A+  +K +  +    QK +  +SRT  PCP+SDGCAR+SI+GWEWHKWSLNA
Sbjct: 952  PSIKVQKVASCGSKSSSSRGVADQKSRTVRSRTANPCPRSDGCARTSINGWEWHKWSLNA 1011

Query: 850  SPAERARITGTR-LRSQHISLDGNGL-QLPNVKGLSARTHRVKLRNLLAAAEGADLLKAT 677
            SPAERAR+ G + ++ ++   + N +  L N KGLSART+RVKLRNLLAA EGADLLKAT
Sbjct: 1012 SPAERARVRGVQCIQMKYSGPEVNSMTHLSNSKGLSARTNRVKLRNLLAAVEGADLLKAT 1071

Query: 676  QLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGS 497
            QLKARKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRE  YEKMGIGS
Sbjct: 1072 QLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIREHYYEKMGIGS 1131

Query: 496  SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEIT 317
            SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+V+GQKKIFIYAKRHIAAGEE+T
Sbjct: 1132 SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEVT 1191

Query: 316  YNYKFPLEEKKIPCNCGSRRCRGSLN 239
            YNYKFPLEEKKIPCNCGS++CRGSLN
Sbjct: 1192 YNYKFPLEEKKIPCNCGSKKCRGSLN 1217


>ref|XP_002307834.2| hypothetical protein POPTR_0005s28130g [Populus trichocarpa]
            gi|550339919|gb|EEE94830.2| hypothetical protein
            POPTR_0005s28130g [Populus trichocarpa]
          Length = 1149

 Score =  472 bits (1214), Expect = e-130
 Identities = 268/531 (50%), Positives = 330/531 (62%), Gaps = 19/531 (3%)
 Frame = -1

Query: 1774 ECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCSSRRHDSCRNNNKGAE 1595
            E TS     V++ M +++++             D + +   + C+S +H    +N +G  
Sbjct: 620  ESTSKNGAYVAIAMCKQKLHDDVLSVWKSLFVNDVLHRFPGLCCTSEKHTEPDSNEEGVF 679

Query: 1594 NRVSNEKPDDRERSSKSSLLNGNYICYRKRKLGEKKSGSFFESLIAGDIGSQKQSIENSN 1415
                  +      SS  SL++  Y  +RK+KL  KK GS   S    D G QK+ +E S 
Sbjct: 680  KFTEGSRKFHSPDSSVLSLVSSKYTYHRKKKLAGKKLGSSSHSTTT-DAGLQKRPVEKSR 738

Query: 1414 KGNVLKHVPRSKKVKNMLLNLEKTRTENHSSKXXXXXXXXXXXXXXXXXXXXXSEKVADV 1235
            K N L++V  +  V+ +    +K R +  +                          V   
Sbjct: 739  KQNFLRNVSENVVVQPVGTPKKKERIKGQAESSVNGRPSKATFAELPVNARSSKATVRST 798

Query: 1234 VKDKSSCRT---HKASFSPVDQCNIERITNEKSRGSDP------------LEIPAADRT- 1103
            VK   S      H+         N +++  E  + S              +EI  A+ T 
Sbjct: 799  VKRVQSLPKNAGHRKVMKIAQAVNDDKVAEEAIKTSRERAGKVFDCNGCDVEIENAETTE 858

Query: 1102 --KKVSKLTKVAKLKRKQPIDDAPPSISKKVQKLANSSTKQAVCKKTVVQKIKRSKSRTM 929
              KK     KV+KLKRK  +D    S   K  K+ NS+ KQA  ++  V+K K SKSRT+
Sbjct: 859  CSKKTLNTNKVSKLKRKSTVDGGSVSHPMKFLKVENSAIKQAASRQVSVRKTKSSKSRTL 918

Query: 928  RPCPQSDGCARSSIDGWEWHKWSLNASPAERARITGT-RLRSQHISLDGNGLQLPNVKGL 752
             PCP SDGCARSSI+GWEWH WS+NASPAERAR+ G   + +++   +    QL N K L
Sbjct: 919  NPCPISDGCARSSINGWEWHAWSINASPAERARVRGVPHVHAKYSFPEAYTSQLSNGKAL 978

Query: 751  SARTHRVKLRNLLAAAEGADLLKATQLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVIE 572
            SART+RVKLRNL+AAAEGA+LLKATQLKARKK L FQ+SKIHDWG+VALEPIEAEDFVIE
Sbjct: 979  SARTNRVKLRNLVAAAEGAELLKATQLKARKKHLRFQRSKIHDWGLVALEPIEAEDFVIE 1038

Query: 571  YVGELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTK 392
            YVGELIRP+ISDIRER YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTK
Sbjct: 1039 YVGELIRPQISDIRERLYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTK 1098

Query: 391  VITVDGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRRCRGSLN 239
            VI+V+GQKKIFIYAKRHIAAGEEITYNYKFPLE+KKIPCNCGSR+CRGSLN
Sbjct: 1099 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEDKKIPCNCGSRKCRGSLN 1149


>ref|XP_007018609.1| Set domain protein, putative isoform 4 [Theobroma cacao]
            gi|508723937|gb|EOY15834.1| Set domain protein, putative
            isoform 4 [Theobroma cacao]
          Length = 1235

 Score =  469 bits (1206), Expect = e-129
 Identities = 278/556 (50%), Positives = 349/556 (62%), Gaps = 36/556 (6%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCSS 1637
            VPS  C+ R  + DE +  +   V++ M +++++               + + LT W S 
Sbjct: 689  VPSHLCKFRPSRSDERSPKIGEYVAVAMCRQKLHEDVLREWKSSFIDATLYQFLTSWRSL 748

Query: 1636 RRHDSCRNNNK-------GAENRVSNEKPDD--RERSSKS--------SLLNGNYICYRK 1508
            ++   C+ ++K       G E    +    D  RERS KS        SL+ G Y  YRK
Sbjct: 749  KKR--CKADSKEERAFSVGREILADSSAIGDKLRERSKKSQSSGSSEVSLVTGKYTYYRK 806

Query: 1507 RKLGEKKSGSFFESLIAGDIGSQKQSIENSNKG----NVLKHV---PRSKKVKNMLLN-- 1355
            +KL  KK GS   +++ G   SQ   +E   K     N+L H    P +   K + +N  
Sbjct: 807  KKLVRKKIGSTQSTIVNG---SQNHPVERPRKKEASRNLLDHADPEPTAATSKKVGINKS 863

Query: 1354 LEKTRTENHSSKXXXXXXXXXXXXXXXXXXXXXSEKVADVVKD----KSSCRTHKASFSP 1187
              ++ T + SSK                       KV   V+     + + +  +   S 
Sbjct: 864  ASQSSTVSRSSKTIAKSSLLNDHSILKSAGGRKKTKVTLAVQKNLVGEGAVQVSRERAST 923

Query: 1186 VDQCNIERITNEKSR--GSDPLEIPAADRTKKVSKLTKVAKLKRKQPIDDAPPSISKKVQ 1013
               C+++++    +   GS+ +E+   D  KK  K  KV+++KRKQ  +D PP +  KVQ
Sbjct: 924  SQNCDVKKVVGRTNHIVGSE-VEL-TNDSHKKTLKAPKVSRVKRKQLDNDEPPLLPTKVQ 981

Query: 1012 KLANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNASPAERA 833
            K+ANS++K    +    +     +SRT   CP+SDGCARSSI+GWEWHKWSLNASPAERA
Sbjct: 982  KVANSASKHPSSRGNADRNTHSIRSRTANSCPRSDGCARSSINGWEWHKWSLNASPAERA 1041

Query: 832  RITGTRLRSQHISLDG----NGLQLPNVKGLSARTHRVKLRNLLAAAEGADLLKATQLKA 665
            R+ G  ++  H+   G    N +QL N KGLSART+RVKLRNLLAAAEGADLLKATQLKA
Sbjct: 1042 RVRG--IQCTHMKYSGSEVNNMMQLSNGKGLSARTNRVKLRNLLAAAEGADLLKATQLKA 1099

Query: 664  RKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYLF 485
            RKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRE  YEKMGIGSSYLF
Sbjct: 1100 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIREHYYEKMGIGSSYLF 1159

Query: 484  RLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEITYNYK 305
            RLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+V+GQKKIFIYAKRHIAAGEEITYNYK
Sbjct: 1160 RLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 1219

Query: 304  FPLEEKKIPCNCGSRR 257
            FPLEEKKIPCNCGS++
Sbjct: 1220 FPLEEKKIPCNCGSKK 1235


>gb|KJB29707.1| hypothetical protein B456_005G115300 [Gossypium raimondii]
          Length = 1211

 Score =  461 bits (1187), Expect = e-127
 Identities = 272/560 (48%), Positives = 338/560 (60%), Gaps = 40/560 (7%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDA-IEKALTIWCS 1640
            VPS NC+ R +    C+  +   V++ M +++++             DA + + L +  S
Sbjct: 663  VPSHNCKFRPLTSVGCSPKIGEYVAMAMCRQKLHDDVLREWKSSFAGDASLYQFLILRSS 722

Query: 1639 SRRHDSCRNNNKGAEN-----------RVSNEKPDDRERSSKSS------LLNGNYICYR 1511
            S++H  C+ + K A+              S +KP D  R S SS      L+ G    YR
Sbjct: 723  SKKH--CKADGKEAKTFSEDRKNLAGFSASRDKPRDGSRKSLSSGSSDISLVTGTCTYYR 780

Query: 1510 KRKLGEKKSGSFFESLIAG-------------------DIGSQKQSIENSNKGNVLKHVP 1388
            K+KL  KK GS   ++I G                   D   QK S   S KG   K + 
Sbjct: 781  KKKLVHKKVGSSLSTIINGSRDQPVERPRTKRPSKNLLDHADQKLSAATSKKGGTNKSMS 840

Query: 1387 RSKKVKNMLLNLEKTRTEN-HSSKXXXXXXXXXXXXXXXXXXXXXSEKVADVVKDKSSCR 1211
            +S  +      + K    N HS                        E    V ++++S  
Sbjct: 841  QSSNISRSSKIIAKNSLPNDHSLPKSAIGRKTSKGAAAAVRKNLIGEGAIKVGRERAST- 899

Query: 1210 THKASFSPVDQCNIERITNEKSRGSDPLEIPAADRTKKVSKLTKVAKLKRKQPIDDAPPS 1031
                       C++E+I  + +           D +KK  K  KV+ +KRKQ   D  PS
Sbjct: 900  --------FQNCDVEKIARKSNHTVGSEGEVTNDSSKKTLKAKKVSGVKRKQLNYDECPS 951

Query: 1030 ISKKVQKLANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNA 851
             S KVQK+A+  +K +  +    QK +  +SRT  PCP+SDGCAR+SI+GWEWHKWSLNA
Sbjct: 952  PSIKVQKVASCGSKSSSSRGVADQKSRTVRSRTANPCPRSDGCARTSINGWEWHKWSLNA 1011

Query: 850  SPAERARITGTR-LRSQHISLDGNGL-QLPNVKGLSARTHRVKLRNLLAAAEGADLLKAT 677
            SPAERAR+ G + ++ ++   + N +  L N KGLSART+RVKLRNLLAA EGADLLKAT
Sbjct: 1012 SPAERARVRGVQCIQMKYSGPEVNSMTHLSNSKGLSARTNRVKLRNLLAAVEGADLLKAT 1071

Query: 676  QLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGS 497
            QLKARKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRE  YEKMGIGS
Sbjct: 1072 QLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIREHYYEKMGIGS 1131

Query: 496  SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEIT 317
            SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVI+V+GQKKIFIYAKRHIAAGEE+T
Sbjct: 1132 SYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEVT 1191

Query: 316  YNYKFPLEEKKIPCNCGSRR 257
            YNYKFPLEEKKIPCNCGS++
Sbjct: 1192 YNYKFPLEEKKIPCNCGSKK 1211


>ref|XP_008231636.1| PREDICTED: uncharacterized protein LOC103330802 [Prunus mume]
          Length = 1130

 Score =  459 bits (1181), Expect = e-126
 Identities = 264/557 (47%), Positives = 334/557 (59%), Gaps = 31/557 (5%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCSS 1637
            + SQ C+ R  + DEC   +   ++  M +++++               + + L  W +S
Sbjct: 597  ISSQTCKFRPSRSDECIPKIGEYIATAMCRKKLHDSVINEWKSSFIDCVLHQFLASWRTS 656

Query: 1636 RR-----HDSCRNNNKGAENRVSNEKPDDRERSSKSSLLNGNYICYRKRKLGEKKSGSFF 1472
            ++       +C+ N        S  K  D   ++K S + G Y  Y ++KL  KKSGS  
Sbjct: 657  KKTHAHKERACKTNKNHKLEEES--KHCDNSGTAKVSPIIGKYT-YHRKKLFLKKSGSSR 713

Query: 1471 ESLIAGDIGSQKQSIENSNKGNVLKHVPRSKKVKNMLLNLEKTRTENHSSKXXXXXXXXX 1292
               + G    + + +E S   +V   +P + + KN  +  +K R ++ S           
Sbjct: 714  SVTLDGK-ELENEIVEKSKNLHVSGDMPETTEFKNATVIPKKKRGQSKSQTELSVGATSL 772

Query: 1291 XXXXXXXXXXXXSEKVADVVKDKSSCRTHKASFSPVDQCNIERITNEKSRGSDPLEIPAA 1112
                         E      K  SS +  K S +               + S+P+E P  
Sbjct: 773  QAIAKGCASTDKKE-----AKSSSSRKLLKVSHAV--------------KSSEPMECPPK 813

Query: 1111 DRTK-------------------------KVSKLTKVAKLKRKQPIDDAPPSISKKVQKL 1007
               K                         K    TK +KLKR+  +DD   +  KKV K+
Sbjct: 814  PSKKMALAHGANHRDVQKVVNSNGPDFGLKREPSTKASKLKRECVMDDLKLARPKKVLKV 873

Query: 1006 ANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNASPAERARI 827
             + + KQA CK   V+K++ SKSR + PCP+S GCAR SI+GWEWH+WSLNASP ERAR+
Sbjct: 874  TSGTPKQAACKSIPVRKMQSSKSRKLNPCPKSCGCARVSINGWEWHRWSLNASPVERARV 933

Query: 826  TGTR-LRSQHISLDGNGLQLPNVKGLSARTHRVKLRNLLAAAEGADLLKATQLKARKKRL 650
             G + + ++H   D N  QL N KGLSART+RVK+RNL AAAEGADL+KATQLKARKK L
Sbjct: 934  RGVKYVNAEHRGSDINTSQLSNGKGLSARTNRVKMRNLAAAAEGADLMKATQLKARKKLL 993

Query: 649  CFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYLFRLDDG 470
             FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRER YEKMGIGSSYLFRLDDG
Sbjct: 994  RFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIRERHYEKMGIGSSYLFRLDDG 1053

Query: 469  YVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEITYNYKFPLEE 290
            YVVDATKRGG+ARFINHSCEPNCYTKVI+V+GQK+IFIYAKRHIA GEEITYNYKFPLEE
Sbjct: 1054 YVVDATKRGGVARFINHSCEPNCYTKVISVEGQKRIFIYAKRHIAVGEEITYNYKFPLEE 1113

Query: 289  KKIPCNCGSRRCRGSLN 239
            KKIPCNCGS++CRGSLN
Sbjct: 1114 KKIPCNCGSKKCRGSLN 1130


>ref|XP_010111522.1| Histone-lysine N-methyltransferase SETD1B [Morus notabilis]
            gi|587944573|gb|EXC31045.1| Histone-lysine
            N-methyltransferase SETD1B [Morus notabilis]
          Length = 1249

 Score =  455 bits (1171), Expect = e-125
 Identities = 266/561 (47%), Positives = 343/561 (61%), Gaps = 37/561 (6%)
 Frame = -1

Query: 1810 SQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCSSRR 1631
            S   + R+++ ++C   +   V++ M +++++              A++K L  W SS++
Sbjct: 704  SHQDKFRTLRSNKCVPKMGEYVAIAMCRQKLHEDVLRELKMSFIGYALQKFLQTWRSSKK 763

Query: 1630 HDSCRNNNKGAEN--------------RVSNE-KPDDRERSSKSSLLNGNYICYRKRKLG 1496
            H    +  +GA+N              ++  E +   +  S KSS   G Y  +RK+   
Sbjct: 764  HCKLLDYEEGAQNANRKLPGGSSLLLDKIGEELECCPKSTSDKSSTAVGKYTYHRKKS-- 821

Query: 1495 EKKSGSFFESLIAGDIGSQKQSIENSNKGNVLKHVPRSKKVKNM---LLNLEKTRTENHS 1325
            +KKSGS                ++ +  G +L H+    K +++   ++   K +    S
Sbjct: 822  QKKSGSI-------------SKLDTTVGGGLLDHLAEESKKEHVSGDVIVAAKAQVAATS 868

Query: 1324 SKXXXXXXXXXXXXXXXXXXXXXSEKVADVVKDK-----SSCRTHKASFSPVDQCNIERI 1160
            SK                     S+   ++  D+     SS R    S        +   
Sbjct: 869  SKKIGLKKGQNESSAKDKSLQVVSKVKRNLSSDRLKTKNSSSRKAMVSSRAQKSGKLAEG 928

Query: 1159 TNEKSRGSDPLEIPAADRTKKVSK------------LTKVAKLKRKQPIDDAPPSISKKV 1016
             N+ SR          D   KV               TK +KLKR++P+D  PPS SKKV
Sbjct: 929  ANKPSRTQVLAPSSKRDGVHKVENDNDHDVKIQEDLPTKASKLKRERPMDSMPPSHSKKV 988

Query: 1015 QKLANSSTKQAVCKKTVVQKIKRSKSRTMRPC-PQSDGCARSSIDGWEWHKWSLNASPAE 839
             K+AN   KQA+ K+ VV+K K  KS+ ++   P+SDGCAR+SI+GWEWH+WS++ASPAE
Sbjct: 989  LKVANGDAKQALSKQAVVKKTKSRKSKIVKNAYPRSDGCARASINGWEWHRWSVSASPAE 1048

Query: 838  RARITGTR-LRSQHISLDGNGLQLPNVKGLSARTHRVKLRNLLAAAEGADLLKATQLKAR 662
            RA + G + + ++  S D N   L N K LSART+R KLRNL+AAAEGADLLKATQLKAR
Sbjct: 1049 RAHVRGIKYIDTKRSSSDVNKSPLSNGKALSARTNRAKLRNLVAAAEGADLLKATQLKAR 1108

Query: 661  KKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYLFR 482
            KK+L FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRER YEKMGIGSSYLFR
Sbjct: 1109 KKQLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIRERHYEKMGIGSSYLFR 1168

Query: 481  LDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEITYNYKF 302
            LDDGYVVDATKRGGIARF+NHSCEPNCYTKVI+V+G+KKIFIYAKRHIAAGEEITYNYKF
Sbjct: 1169 LDDGYVVDATKRGGIARFVNHSCEPNCYTKVISVEGEKKIFIYAKRHIAAGEEITYNYKF 1228

Query: 301  PLEEKKIPCNCGSRRCRGSLN 239
            PLEEKKIPCNCGS+RCRGSLN
Sbjct: 1229 PLEEKKIPCNCGSKRCRGSLN 1249


>ref|XP_011657472.1| PREDICTED: uncharacterized protein LOC101220062 isoform X2 [Cucumis
            sativus] gi|778715880|ref|XP_011657473.1| PREDICTED:
            uncharacterized protein LOC101220062 isoform X2 [Cucumis
            sativus]
          Length = 1179

 Score =  455 bits (1170), Expect = e-125
 Identities = 276/563 (49%), Positives = 349/563 (61%), Gaps = 37/563 (6%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCSS 1637
            +PS  C+ R    ++C S +   + L + +++++             D + + ++ W +S
Sbjct: 646  IPSPACKFRPSSSNKCYSKIEGYIMLAICRQKLHDAVLKEWTSSYKDDLLRQFVSSWIAS 705

Query: 1636 RRHDSCRNN-------NKGAENRVSNEKPDDRERSSKSSLLNGNYICYRKRKLGEKKSGS 1478
            ++H  C +N       + G  ++V ++  +  ER  +SSL+ GNY  YRK K  ++K GS
Sbjct: 706  KKH--CNSNRIVEGACDGGEASKVPDKLREGSERFLESSLVTGNYTYYRK-KSSKRKLGS 762

Query: 1477 FFESLIAGDIGSQKQSIENSNKGNV----------------LKHVPRSKKVKNMLLN--- 1355
              +    G    + Q  E S K N+                LK + ++K+ K++ +    
Sbjct: 763  -SDCATEGSPVVRNQPSEKSRKENISVGVCETTDSEIASLTLKSIAKNKRKKDLSIKATC 821

Query: 1354 ----LEKTRTENHSSKXXXXXXXXXXXXXXXXXXXXXSEKVADVVKD----KSSCRTHKA 1199
                 E T   +HSS                        K +  VKD    K S +  K 
Sbjct: 822  KRTCAEVTLPSSHSSGKTICGTKKL--------------KFSPPVKDDNAKKDSVKHGKG 867

Query: 1198 SF--SPVDQCNIERITNEKSRGSDPLEIPAADRTKKVSKLTKVAKLKRKQPIDDAPPSIS 1025
                SP+   N++++ N+  RG    E        K+S    V+K+KRKQ +D+A   + 
Sbjct: 868  RMIGSPLMIKNVDQVMNKCDRGVGAQE--------KLS--VNVSKIKRKQKVDEA-SLLG 916

Query: 1024 KKVQKLANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNASP 845
             KV  +A+  +KQA  K+ V QK K  KSR +     SDGCARSSI+GWEW +W+L ASP
Sbjct: 917  NKVLTVADDFSKQAASKRVVAQKKKSDKSRKLNISIISDGCARSSINGWEWRRWTLKASP 976

Query: 844  AERARITGTR-LRSQHISLDGNGLQLPNVKGLSARTHRVKLRNLLAAAEGADLLKATQLK 668
            AERAR  G +   S  I  D +   L N KGLSART+RVKLRNLLAAA+GADLLKA+QLK
Sbjct: 977  AERARNRGFQYFYSDPIGPDVSTSHLLNGKGLSARTNRVKLRNLLAAADGADLLKASQLK 1036

Query: 667  ARKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYL 488
            ARKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYL
Sbjct: 1037 ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYL 1096

Query: 487  FRLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEITYNY 308
            FRLDDGYVVDATKRGG+ARFINHSCEPNCYTKVITV+GQKKIFIYAKRHI+AGEEITYNY
Sbjct: 1097 FRLDDGYVVDATKRGGVARFINHSCEPNCYTKVITVEGQKKIFIYAKRHISAGEEITYNY 1156

Query: 307  KFPLEEKKIPCNCGSRRCRGSLN 239
            KFPLEEKKIPCNC SRRCRGSLN
Sbjct: 1157 KFPLEEKKIPCNCRSRRCRGSLN 1179


>ref|XP_011657471.1| PREDICTED: uncharacterized protein LOC101220062 isoform X1 [Cucumis
            sativus] gi|700192576|gb|KGN47780.1| hypothetical protein
            Csa_6G401500 [Cucumis sativus]
          Length = 1262

 Score =  455 bits (1170), Expect = e-125
 Identities = 276/563 (49%), Positives = 349/563 (61%), Gaps = 37/563 (6%)
 Frame = -1

Query: 1816 VPSQNCQLRSVKLDECTSNVNFQVSLMMSQERIYXXXXXXXXXXXXXDAIEKALTIWCSS 1637
            +PS  C+ R    ++C S +   + L + +++++             D + + ++ W +S
Sbjct: 729  IPSPACKFRPSSSNKCYSKIEGYIMLAICRQKLHDAVLKEWTSSYKDDLLRQFVSSWIAS 788

Query: 1636 RRHDSCRNN-------NKGAENRVSNEKPDDRERSSKSSLLNGNYICYRKRKLGEKKSGS 1478
            ++H  C +N       + G  ++V ++  +  ER  +SSL+ GNY  YRK K  ++K GS
Sbjct: 789  KKH--CNSNRIVEGACDGGEASKVPDKLREGSERFLESSLVTGNYTYYRK-KSSKRKLGS 845

Query: 1477 FFESLIAGDIGSQKQSIENSNKGNV----------------LKHVPRSKKVKNMLLN--- 1355
              +    G    + Q  E S K N+                LK + ++K+ K++ +    
Sbjct: 846  -SDCATEGSPVVRNQPSEKSRKENISVGVCETTDSEIASLTLKSIAKNKRKKDLSIKATC 904

Query: 1354 ----LEKTRTENHSSKXXXXXXXXXXXXXXXXXXXXXSEKVADVVKD----KSSCRTHKA 1199
                 E T   +HSS                        K +  VKD    K S +  K 
Sbjct: 905  KRTCAEVTLPSSHSSGKTICGTKKL--------------KFSPPVKDDNAKKDSVKHGKG 950

Query: 1198 SF--SPVDQCNIERITNEKSRGSDPLEIPAADRTKKVSKLTKVAKLKRKQPIDDAPPSIS 1025
                SP+   N++++ N+  RG    E        K+S    V+K+KRKQ +D+A   + 
Sbjct: 951  RMIGSPLMIKNVDQVMNKCDRGVGAQE--------KLS--VNVSKIKRKQKVDEA-SLLG 999

Query: 1024 KKVQKLANSSTKQAVCKKTVVQKIKRSKSRTMRPCPQSDGCARSSIDGWEWHKWSLNASP 845
             KV  +A+  +KQA  K+ V QK K  KSR +     SDGCARSSI+GWEW +W+L ASP
Sbjct: 1000 NKVLTVADDFSKQAASKRVVAQKKKSDKSRKLNISIISDGCARSSINGWEWRRWTLKASP 1059

Query: 844  AERARITGTR-LRSQHISLDGNGLQLPNVKGLSARTHRVKLRNLLAAAEGADLLKATQLK 668
            AERAR  G +   S  I  D +   L N KGLSART+RVKLRNLLAAA+GADLLKA+QLK
Sbjct: 1060 AERARNRGFQYFYSDPIGPDVSTSHLLNGKGLSARTNRVKLRNLLAAADGADLLKASQLK 1119

Query: 667  ARKKRLCFQKSKIHDWGIVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYL 488
            ARKKRL FQ+SKIHDWG+VALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYL
Sbjct: 1120 ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIRERQYEKMGIGSSYL 1179

Query: 487  FRLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVDGQKKIFIYAKRHIAAGEEITYNY 308
            FRLDDGYVVDATKRGG+ARFINHSCEPNCYTKVITV+GQKKIFIYAKRHI+AGEEITYNY
Sbjct: 1180 FRLDDGYVVDATKRGGVARFINHSCEPNCYTKVITVEGQKKIFIYAKRHISAGEEITYNY 1239

Query: 307  KFPLEEKKIPCNCGSRRCRGSLN 239
            KFPLEEKKIPCNC SRRCRGSLN
Sbjct: 1240 KFPLEEKKIPCNCRSRRCRGSLN 1262


>ref|XP_009759057.1| PREDICTED: uncharacterized protein LOC104211659 isoform X5 [Nicotiana
            sylvestris]
          Length = 1325

 Score =  452 bits (1162), Expect = e-124
 Identities = 236/412 (57%), Positives = 292/412 (70%), Gaps = 4/412 (0%)
 Frame = -1

Query: 1462 IAGDIGSQKQSIENSNKGNVLKHVPRSKKVKNMLLNLEKTRTENHSSKXXXXXXXXXXXX 1283
            + GD+G +K+S   S K ++L     S K  N   +++K   ++   +            
Sbjct: 915  VDGDVGFKKRSSNKSRKQDLLGEATESTKGDNATSSVKKIELKD-CHRELFTNASLVVPP 973

Query: 1282 XXXXXXXXXSEKVAD---VVKDKSSCRTHKASFSPVDQCNIERITNEKSRGSDPLEIPAA 1112
                      EKVA    V +  +SC+  K +F      +  R+    +R    LE+   
Sbjct: 974  SVVINSNTIPEKVASFSKVGRSNASCKKLKVAFDSEGSSDNGRVAEVVNRELGTLEMQPT 1033

Query: 1111 DRTKKVSKLTKVAKLKRKQPIDDAPPSISKKVQKLANSSTKQAVCKKTVVQKIKRSKSRT 932
               KK  +L K+ KL +++   +   S S+K+Q++++ +  Q   K+ +V+K ++ KSRT
Sbjct: 1034 ASLKKTPQLAKLPKLNKRKLEYNMSASRSRKIQRVSSGAGSQPATKEVIVEKKQKGKSRT 1093

Query: 931  MRPCPQSDGCARSSIDGWEWHKWSLNASPAERARITGTRL-RSQHISLDGNGLQLPNVKG 755
             + CPQSDGCARSSI GWEWHKWSL A+PAERAR+ G  +   Q +S D NG Q+ N KG
Sbjct: 1094 AKHCPQSDGCARSSIIGWEWHKWSLKATPAERARVRGITIDHIQSVSSDANGSQVLNAKG 1153

Query: 754  LSARTHRVKLRNLLAAAEGADLLKATQLKARKKRLCFQKSKIHDWGIVALEPIEAEDFVI 575
            +SART+RVKLRNLLAAA+GADLLKATQLKARKKRL FQ+SKIHDWG++ALEPIEAEDFVI
Sbjct: 1154 ISARTNRVKLRNLLAAADGADLLKATQLKARKKRLRFQRSKIHDWGLLALEPIEAEDFVI 1213

Query: 574  EYVGELIRPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYT 395
            EYVGELIR R+SDIRE  YEK+GIGSSYLFRLDD YVVDATKRGGIARFINHSCEPNCYT
Sbjct: 1214 EYVGELIRRRVSDIREHYYEKIGIGSSYLFRLDDDYVVDATKRGGIARFINHSCEPNCYT 1273

Query: 394  KVITVDGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRRCRGSLN 239
            KVI+V+GQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGS+RCRGS+N
Sbjct: 1274 KVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSKRCRGSMN 1325

