BLASTX nr result

ID: Paeonia25_contig00041015 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00041015
         (1130 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI29431.3| unnamed protein product [Vitis vinifera]              288   3e-75
ref|XP_006494427.1| PREDICTED: uncharacterized protein LOC102611...   268   4e-69
ref|XP_006435514.1| hypothetical protein CICLE_v10000043mg [Citr...   268   4e-69
ref|XP_006435511.1| hypothetical protein CICLE_v10000043mg [Citr...   268   4e-69
ref|XP_006435508.1| hypothetical protein CICLE_v10000043mg [Citr...   268   4e-69
ref|XP_006435507.1| hypothetical protein CICLE_v10000043mg [Citr...   268   4e-69
ref|XP_007018610.1| Set domain protein, putative isoform 5 [Theo...   266   2e-68
ref|XP_007018609.1| Set domain protein, putative isoform 4 [Theo...   266   2e-68
ref|XP_007018606.1| Set domain protein, putative isoform 1 [Theo...   266   2e-68
ref|XP_002510762.1| set domain protein, putative [Ricinus commun...   265   2e-68
ref|XP_002307834.2| hypothetical protein POPTR_0005s28130g [Popu...   239   1e-60
ref|XP_004487927.1| PREDICTED: uncharacterized protein LOC101514...   229   2e-57
ref|XP_007220291.1| hypothetical protein PRUPE_ppa000519mg [Prun...   228   4e-57
gb|EXC31045.1| Histone-lysine N-methyltransferase SETD1B [Morus ...   220   1e-54
ref|XP_003594905.1| Histone-lysine N-methyltransferase SETD1B [M...   215   2e-53
ref|XP_004487926.1| PREDICTED: uncharacterized protein LOC101514...   214   4e-53
ref|XP_002300607.2| hypothetical protein POPTR_0002s00320g [Popu...   210   1e-51
ref|XP_006586959.1| PREDICTED: uncharacterized protein LOC100805...   209   2e-51
ref|XP_006586956.1| PREDICTED: uncharacterized protein LOC100805...   209   2e-51
ref|XP_006586954.1| PREDICTED: uncharacterized protein LOC100805...   209   2e-51

>emb|CBI29431.3| unnamed protein product [Vitis vinifera]
          Length = 1127

 Score =  288 bits (737), Expect = 3e-75
 Identities = 180/399 (45%), Positives = 234/399 (58%), Gaps = 24/399 (6%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EPPPPGFE N R      + +FRPS SD+C+P I EY+A+A+CRQ+LH+DVL EWK   V
Sbjct: 625  EPPPPGFEYNSRTFVPSQICRFRPSSSDECTPIIGEYVALALCRQRLHEDVLQEWKDLLV 684

Query: 183  D-SVHQLFLSRCAANKQIKSY---DGAVGRNNERLNDSIAAIDKFRERSYNCHNSGSSQV 350
            + ++ Q F S   + ++  S    +G    N E+  DS AA D+ RER+ + H+ GS ++
Sbjct: 685  EGTLDQFFASWWTSKQRCDSTGCEEGVSNSNKEKPCDSSAASDQRRERTKDRHSLGSPEL 744

Query: 351  SLLHGKYTYYRKKKSERKKFGSLSRFVPG-DVSVPNLDVDMPRKHGI----SKLTEVETS 515
            SL+ GKYTYYRKKK  RKK GSLS      D    +  ++  RK  +    S++TEVE  
Sbjct: 745  SLVIGKYTYYRKKKLVRKKIGSLSHAAASVDSGSQDQLMEKSRKQDVPGDVSEITEVEMG 804

Query: 516  VVRTKS-------SENSRLQ---DPSLPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALK 665
            +++ +        +E++ LQ     +LP    S +    R+S K + V++  ++ ED L 
Sbjct: 805  ILKRRKIGLNTCHAEDNSLQAIVQSTLPGDSSSVRIKPNRRSTKCAHVVRNGEVIEDDLA 864

Query: 666  CVKERFSALTEDCIETEKVVNSND--HDIGMHHGVAGGCSKKKIPNSTKASKLKRKHLMD 839
            C +E  S   EDC   +KVVNSN   HD+G    +AG CSKK    STK SK KRK L D
Sbjct: 865  CGREEASPFAEDCDFVDKVVNSNGNGHDVGNLKELAGDCSKKT--KSTKVSKKKRKDLKD 922

Query: 840  DVTLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGCARSSINGWEWHR 1019
             V  S   K LK AN  +K+  GRQV V K KFSK +T  PC RS GCARSSINGW+W  
Sbjct: 923  -VPSSRSAKVLKPANGAAKQDTGRQVAVHKSKFSKFKTLNPCLRSVGCARSSINGWDWRN 981

Query: 1020 WSLNASPAERARVRGFRSFQ---SWYSGSEANASQWSNV 1127
            WSLNASP ERA VRG    Q     Y  SE  +SQ SNV
Sbjct: 982  WSLNASPTERAHVRGIHKAQFACDQYFRSEVVSSQLSNV 1020


>ref|XP_006494427.1| PREDICTED: uncharacterized protein LOC102611958 isoform X1 [Citrus
            sinensis]
          Length = 1295

 Score =  268 bits (684), Expect = 4e-69
 Identities = 179/429 (41%), Positives = 231/429 (53%), Gaps = 53/429 (12%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EP PPGFED+ R L   C GKF+ S SD+ + ++ EY+A+A+CRQKLH  V+GEWKS FV
Sbjct: 705  EPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFV 764

Query: 183  DSVHQLFLSR-------CAANKQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSGS 341
            D   Q FL+        C A+   K+ +GA   +NE   D+   +DK +E S   H   S
Sbjct: 765  DDALQQFLALWCNMKECCEADGNEKA-EGASNAHNEHHGDTSTVVDKLKEGSKRFH---S 820

Query: 342  SQVSLLHGKYTYYRKKKSERKKFGSLSRFVPGDVSVPNL----DVDMPRKHGIS----KL 497
            S+ S +  KYTY+RKKK  RKKFGS S       SV N      V+  RK G++    + 
Sbjct: 821  SEASTMVEKYTYHRKKKLLRKKFGSPSNC---SNSVENAFQTEHVEKSRKQGVAGDVFEN 877

Query: 498  TEVETSVVRTKSSENSRLQDPS-------------------------------------- 563
             +V+ S V +K    ++L D S                                      
Sbjct: 878  AKVQPSAVSSKKIGKNKLIDASSKKIGANKFTAVPSKMIGKNKVTAESSASAGSSKVKSK 937

Query: 564  LPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALKCVKERFSALTEDCIETEKVVNSNDHD 743
            LPSG  S K    +K MK +  +QR+K+     K   E  S L+ D  +  KVV    H+
Sbjct: 938  LPSGYSSAKSTISQKVMKVTSAVQRDKVPVP--KPSGEMLSTLSADGNDVGKVVRGKAHN 995

Query: 744  IGMHHGVAGGCSKKKIPNSTKASKLKRKHLMDDVTLSGPGKALKLANSVSKKVLGRQVPV 923
            +G+        SK K PN+TK SK KRK  MD + L    KALK+A   +K+   RQV +
Sbjct: 996  VGIEKDSILDSSKSK-PNATKESKQKRKRTMDGLELHAT-KALKVAKGTAKQAASRQVAM 1053

Query: 924  QKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWSLNASPAERARVRGFRSFQSWYSGSEA 1103
            +K K SKSRT   CPRSDGCARSSI+GWEWH+WSLNASPAERARVRG +   + Y G E 
Sbjct: 1054 KKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV 1113

Query: 1104 NASQWSNVK 1130
            NASQW+N K
Sbjct: 1114 NASQWANGK 1122


>ref|XP_006435514.1| hypothetical protein CICLE_v10000043mg [Citrus clementina]
            gi|557537636|gb|ESR48754.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
          Length = 1256

 Score =  268 bits (684), Expect = 4e-69
 Identities = 179/429 (41%), Positives = 231/429 (53%), Gaps = 53/429 (12%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EP PPGFED+ R L   C GKF+ S SD+ + ++ EY+A+A+CRQKLH  V+GEWKS FV
Sbjct: 705  EPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFV 764

Query: 183  DSVHQLFLSR-------CAANKQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSGS 341
            D   Q FL+        C A+   K+ +GA   +NE   D+   +DK +E S   H   S
Sbjct: 765  DDALQQFLALWCNMKECCEADGNEKA-EGASNAHNEHHGDTSTVVDKLKEGSKRFH---S 820

Query: 342  SQVSLLHGKYTYYRKKKSERKKFGSLSRFVPGDVSVPNL----DVDMPRKHGIS----KL 497
            S+ S +  KYTY+RKKK  RKKFGS S       SV N      V+  RK G++    + 
Sbjct: 821  SEASTMVEKYTYHRKKKLLRKKFGSPSNC---SNSVENAFQTEHVEKSRKQGVAGDVFEN 877

Query: 498  TEVETSVVRTKSSENSRLQDPS-------------------------------------- 563
             +V+ S V +K    ++L D S                                      
Sbjct: 878  AKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGKNKVTAESSASAGSSKVKSK 937

Query: 564  LPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALKCVKERFSALTEDCIETEKVVNSNDHD 743
            LPSG  S K    +K MK +  +QR+K+     K   E  S L+ D  +  KVV    H+
Sbjct: 938  LPSGYSSAKSTISQKVMKVTSAVQRDKVPVP--KPSGEMLSTLSADGNDVGKVVRGKAHN 995

Query: 744  IGMHHGVAGGCSKKKIPNSTKASKLKRKHLMDDVTLSGPGKALKLANSVSKKVLGRQVPV 923
            +G+        SK K PN+TK SK KRK  MD + L    KALK+A   +K+   RQV +
Sbjct: 996  VGIEKDSILDSSKSK-PNATKESKQKRKRTMDGLELHAT-KALKVAKGTAKQAASRQVAM 1053

Query: 924  QKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWSLNASPAERARVRGFRSFQSWYSGSEA 1103
            +K K SKSRT   CPRSDGCARSSI+GWEWH+WSLNASPAERARVRG +   + Y G E 
Sbjct: 1054 KKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV 1113

Query: 1104 NASQWSNVK 1130
            NASQW+N K
Sbjct: 1114 NASQWANGK 1122


>ref|XP_006435511.1| hypothetical protein CICLE_v10000043mg [Citrus clementina]
            gi|557537633|gb|ESR48751.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
          Length = 1290

 Score =  268 bits (684), Expect = 4e-69
 Identities = 179/429 (41%), Positives = 231/429 (53%), Gaps = 53/429 (12%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EP PPGFED+ R L   C GKF+ S SD+ + ++ EY+A+A+CRQKLH  V+GEWKS FV
Sbjct: 700  EPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFV 759

Query: 183  DSVHQLFLSR-------CAANKQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSGS 341
            D   Q FL+        C A+   K+ +GA   +NE   D+   +DK +E S   H   S
Sbjct: 760  DDALQQFLALWCNMKECCEADGNEKA-EGASNAHNEHHGDTSTVVDKLKEGSKRFH---S 815

Query: 342  SQVSLLHGKYTYYRKKKSERKKFGSLSRFVPGDVSVPNL----DVDMPRKHGIS----KL 497
            S+ S +  KYTY+RKKK  RKKFGS S       SV N      V+  RK G++    + 
Sbjct: 816  SEASTMVEKYTYHRKKKLLRKKFGSPSNC---SNSVENAFQTEHVEKSRKQGVAGDVFEN 872

Query: 498  TEVETSVVRTKSSENSRLQDPS-------------------------------------- 563
             +V+ S V +K    ++L D S                                      
Sbjct: 873  AKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGKNKVTAESSASAGSSKVKSK 932

Query: 564  LPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALKCVKERFSALTEDCIETEKVVNSNDHD 743
            LPSG  S K    +K MK +  +QR+K+     K   E  S L+ D  +  KVV    H+
Sbjct: 933  LPSGYSSAKSTISQKVMKVTSAVQRDKVPVP--KPSGEMLSTLSADGNDVGKVVRGKAHN 990

Query: 744  IGMHHGVAGGCSKKKIPNSTKASKLKRKHLMDDVTLSGPGKALKLANSVSKKVLGRQVPV 923
            +G+        SK K PN+TK SK KRK  MD + L    KALK+A   +K+   RQV +
Sbjct: 991  VGIEKDSILDSSKSK-PNATKESKQKRKRTMDGLELHAT-KALKVAKGTAKQAASRQVAM 1048

Query: 924  QKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWSLNASPAERARVRGFRSFQSWYSGSEA 1103
            +K K SKSRT   CPRSDGCARSSI+GWEWH+WSLNASPAERARVRG +   + Y G E 
Sbjct: 1049 KKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV 1108

Query: 1104 NASQWSNVK 1130
            NASQW+N K
Sbjct: 1109 NASQWANGK 1117


>ref|XP_006435508.1| hypothetical protein CICLE_v10000043mg [Citrus clementina]
            gi|567885901|ref|XP_006435509.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
            gi|567885903|ref|XP_006435510.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
            gi|567885909|ref|XP_006435513.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
            gi|557537630|gb|ESR48748.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
            gi|557537631|gb|ESR48749.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
            gi|557537632|gb|ESR48750.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
            gi|557537635|gb|ESR48753.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
          Length = 1295

 Score =  268 bits (684), Expect = 4e-69
 Identities = 179/429 (41%), Positives = 231/429 (53%), Gaps = 53/429 (12%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EP PPGFED+ R L   C GKF+ S SD+ + ++ EY+A+A+CRQKLH  V+GEWKS FV
Sbjct: 705  EPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFV 764

Query: 183  DSVHQLFLSR-------CAANKQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSGS 341
            D   Q FL+        C A+   K+ +GA   +NE   D+   +DK +E S   H   S
Sbjct: 765  DDALQQFLALWCNMKECCEADGNEKA-EGASNAHNEHHGDTSTVVDKLKEGSKRFH---S 820

Query: 342  SQVSLLHGKYTYYRKKKSERKKFGSLSRFVPGDVSVPNL----DVDMPRKHGIS----KL 497
            S+ S +  KYTY+RKKK  RKKFGS S       SV N      V+  RK G++    + 
Sbjct: 821  SEASTMVEKYTYHRKKKLLRKKFGSPSNC---SNSVENAFQTEHVEKSRKQGVAGDVFEN 877

Query: 498  TEVETSVVRTKSSENSRLQDPS-------------------------------------- 563
             +V+ S V +K    ++L D S                                      
Sbjct: 878  AKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGKNKVTAESSASAGSSKVKSK 937

Query: 564  LPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALKCVKERFSALTEDCIETEKVVNSNDHD 743
            LPSG  S K    +K MK +  +QR+K+     K   E  S L+ D  +  KVV    H+
Sbjct: 938  LPSGYSSAKSTISQKVMKVTSAVQRDKVPVP--KPSGEMLSTLSADGNDVGKVVRGKAHN 995

Query: 744  IGMHHGVAGGCSKKKIPNSTKASKLKRKHLMDDVTLSGPGKALKLANSVSKKVLGRQVPV 923
            +G+        SK K PN+TK SK KRK  MD + L    KALK+A   +K+   RQV +
Sbjct: 996  VGIEKDSILDSSKSK-PNATKESKQKRKRTMDGLELHAT-KALKVAKGTAKQAASRQVAM 1053

Query: 924  QKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWSLNASPAERARVRGFRSFQSWYSGSEA 1103
            +K K SKSRT   CPRSDGCARSSI+GWEWH+WSLNASPAERARVRG +   + Y G E 
Sbjct: 1054 KKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV 1113

Query: 1104 NASQWSNVK 1130
            NASQW+N K
Sbjct: 1114 NASQWANGK 1122


>ref|XP_006435507.1| hypothetical protein CICLE_v10000043mg [Citrus clementina]
            gi|567885907|ref|XP_006435512.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
            gi|557537629|gb|ESR48747.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
            gi|557537634|gb|ESR48752.1| hypothetical protein
            CICLE_v10000043mg [Citrus clementina]
          Length = 1241

 Score =  268 bits (684), Expect = 4e-69
 Identities = 179/429 (41%), Positives = 231/429 (53%), Gaps = 53/429 (12%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EP PPGFED+ R L   C GKF+ S SD+ + ++ EY+A+A+CRQKLH  V+GEWKS FV
Sbjct: 705  EPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFV 764

Query: 183  DSVHQLFLSR-------CAANKQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSGS 341
            D   Q FL+        C A+   K+ +GA   +NE   D+   +DK +E S   H   S
Sbjct: 765  DDALQQFLALWCNMKECCEADGNEKA-EGASNAHNEHHGDTSTVVDKLKEGSKRFH---S 820

Query: 342  SQVSLLHGKYTYYRKKKSERKKFGSLSRFVPGDVSVPNL----DVDMPRKHGIS----KL 497
            S+ S +  KYTY+RKKK  RKKFGS S       SV N      V+  RK G++    + 
Sbjct: 821  SEASTMVEKYTYHRKKKLLRKKFGSPSNC---SNSVENAFQTEHVEKSRKQGVAGDVFEN 877

Query: 498  TEVETSVVRTKSSENSRLQDPS-------------------------------------- 563
             +V+ S V +K    ++L D S                                      
Sbjct: 878  AKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGKNKVTAESSASAGSSKVKSK 937

Query: 564  LPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALKCVKERFSALTEDCIETEKVVNSNDHD 743
            LPSG  S K    +K MK +  +QR+K+     K   E  S L+ D  +  KVV    H+
Sbjct: 938  LPSGYSSAKSTISQKVMKVTSAVQRDKVPVP--KPSGEMLSTLSADGNDVGKVVRGKAHN 995

Query: 744  IGMHHGVAGGCSKKKIPNSTKASKLKRKHLMDDVTLSGPGKALKLANSVSKKVLGRQVPV 923
            +G+        SK K PN+TK SK KRK  MD + L    KALK+A   +K+   RQV +
Sbjct: 996  VGIEKDSILDSSKSK-PNATKESKQKRKRTMDGLELHAT-KALKVAKGTAKQAASRQVAM 1053

Query: 924  QKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWSLNASPAERARVRGFRSFQSWYSGSEA 1103
            +K K SKSRT   CPRSDGCARSSI+GWEWH+WSLNASPAERARVRG +   + Y G E 
Sbjct: 1054 KKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV 1113

Query: 1104 NASQWSNVK 1130
            NASQW+N K
Sbjct: 1114 NASQWANGK 1122


>ref|XP_007018610.1| Set domain protein, putative isoform 5 [Theobroma cacao]
            gi|508723938|gb|EOY15835.1| Set domain protein, putative
            isoform 5 [Theobroma cacao]
          Length = 1001

 Score =  266 bits (679), Expect = 2e-68
 Identities = 181/410 (44%), Positives = 234/410 (57%), Gaps = 34/410 (8%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EPPPPG E N   L    L KFRPSRSD+ SP+I EY+A+A+CRQKLH+DVL EWKS F+
Sbjct: 435  EPPPPGLEGNAGTLVPSHLCKFRPSRSDERSPKIGEYVAVAMCRQKLHEDVLREWKSSFI 494

Query: 183  DSVHQLFLS-------RCAAN-KQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSG 338
            D+    FL+       RC A+ K+ +++  +VGR  E L DS A  DK RERS    +SG
Sbjct: 495  DATLYQFLTSWRSLKKRCKADSKEERAF--SVGR--EILADSSAIGDKLRERSKKSQSSG 550

Query: 339  SSQVSLLHGKYTYYRKKKSERKKFGSL-SRFVPGDVSVPNLDVDMPRKH----------- 482
            SS+VSL+ GKYTYYRKKK  RKK GS  S  V G  + P   V+ PRK            
Sbjct: 551  SSEVSLVTGKYTYYRKKKLVRKKIGSTQSTIVNGSQNHP---VERPRKKEASRNLLDHAD 607

Query: 483  -----------GISKLTEVETSVVRTKSS--ENSRLQDPSLPSGCPSTKKPTGRKSMKPS 623
                       GI+K     ++V R+  +  ++S L D S+       K   GRK  K +
Sbjct: 608  PEPTAATSKKVGINKSASQSSTVSRSSKTIAKSSLLNDHSI------LKSAGGRKKTKVT 661

Query: 624  RVIQRNKLTEDALKCVKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNST 803
              +Q+N + E A++  +ER S  +++C + +KVV   +H +G    +     KK +  + 
Sbjct: 662  LAVQKNLVGEGAVQVSRERAST-SQNC-DVKKVVGRTNHIVGSEVELTNDSHKKTL-KAP 718

Query: 804  KASKLKRKHLMDDVTLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGC 983
            K S++KRK L +D     P K  K+ANS SK    R    +     +SRT   CPRSDGC
Sbjct: 719  KVSRVKRKQLDNDEPPLLPTKVQKVANSASKHPSSRGNADRNTHSIRSRTANSCPRSDGC 778

Query: 984  ARSSINGWEWHRWSLNASPAERARVRGFRSFQSWYSGSEA-NASQWSNVK 1130
            ARSSINGWEWH+WSLNASPAERARVRG +     YSGSE  N  Q SN K
Sbjct: 779  ARSSINGWEWHKWSLNASPAERARVRGIQCTHMKYSGSEVNNMMQLSNGK 828


>ref|XP_007018609.1| Set domain protein, putative isoform 4 [Theobroma cacao]
            gi|508723937|gb|EOY15834.1| Set domain protein, putative
            isoform 4 [Theobroma cacao]
          Length = 1235

 Score =  266 bits (679), Expect = 2e-68
 Identities = 181/410 (44%), Positives = 234/410 (57%), Gaps = 34/410 (8%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EPPPPG E N   L    L KFRPSRSD+ SP+I EY+A+A+CRQKLH+DVL EWKS F+
Sbjct: 675  EPPPPGLEGNAGTLVPSHLCKFRPSRSDERSPKIGEYVAVAMCRQKLHEDVLREWKSSFI 734

Query: 183  DSVHQLFLS-------RCAAN-KQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSG 338
            D+    FL+       RC A+ K+ +++  +VGR  E L DS A  DK RERS    +SG
Sbjct: 735  DATLYQFLTSWRSLKKRCKADSKEERAF--SVGR--EILADSSAIGDKLRERSKKSQSSG 790

Query: 339  SSQVSLLHGKYTYYRKKKSERKKFGSL-SRFVPGDVSVPNLDVDMPRKH----------- 482
            SS+VSL+ GKYTYYRKKK  RKK GS  S  V G  + P   V+ PRK            
Sbjct: 791  SSEVSLVTGKYTYYRKKKLVRKKIGSTQSTIVNGSQNHP---VERPRKKEASRNLLDHAD 847

Query: 483  -----------GISKLTEVETSVVRTKSS--ENSRLQDPSLPSGCPSTKKPTGRKSMKPS 623
                       GI+K     ++V R+  +  ++S L D S+       K   GRK  K +
Sbjct: 848  PEPTAATSKKVGINKSASQSSTVSRSSKTIAKSSLLNDHSI------LKSAGGRKKTKVT 901

Query: 624  RVIQRNKLTEDALKCVKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNST 803
              +Q+N + E A++  +ER S  +++C + +KVV   +H +G    +     KK +  + 
Sbjct: 902  LAVQKNLVGEGAVQVSRERAST-SQNC-DVKKVVGRTNHIVGSEVELTNDSHKKTL-KAP 958

Query: 804  KASKLKRKHLMDDVTLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGC 983
            K S++KRK L +D     P K  K+ANS SK    R    +     +SRT   CPRSDGC
Sbjct: 959  KVSRVKRKQLDNDEPPLLPTKVQKVANSASKHPSSRGNADRNTHSIRSRTANSCPRSDGC 1018

Query: 984  ARSSINGWEWHRWSLNASPAERARVRGFRSFQSWYSGSEA-NASQWSNVK 1130
            ARSSINGWEWH+WSLNASPAERARVRG +     YSGSE  N  Q SN K
Sbjct: 1019 ARSSINGWEWHKWSLNASPAERARVRGIQCTHMKYSGSEVNNMMQLSNGK 1068


>ref|XP_007018606.1| Set domain protein, putative isoform 1 [Theobroma cacao]
            gi|590597427|ref|XP_007018607.1| Set domain protein,
            putative isoform 1 [Theobroma cacao]
            gi|590597431|ref|XP_007018608.1| Set domain protein,
            putative isoform 1 [Theobroma cacao]
            gi|508723934|gb|EOY15831.1| Set domain protein, putative
            isoform 1 [Theobroma cacao] gi|508723935|gb|EOY15832.1|
            Set domain protein, putative isoform 1 [Theobroma cacao]
            gi|508723936|gb|EOY15833.1| Set domain protein, putative
            isoform 1 [Theobroma cacao]
          Length = 1241

 Score =  266 bits (679), Expect = 2e-68
 Identities = 181/410 (44%), Positives = 234/410 (57%), Gaps = 34/410 (8%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EPPPPG E N   L    L KFRPSRSD+ SP+I EY+A+A+CRQKLH+DVL EWKS F+
Sbjct: 675  EPPPPGLEGNAGTLVPSHLCKFRPSRSDERSPKIGEYVAVAMCRQKLHEDVLREWKSSFI 734

Query: 183  DSVHQLFLS-------RCAAN-KQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSG 338
            D+    FL+       RC A+ K+ +++  +VGR  E L DS A  DK RERS    +SG
Sbjct: 735  DATLYQFLTSWRSLKKRCKADSKEERAF--SVGR--EILADSSAIGDKLRERSKKSQSSG 790

Query: 339  SSQVSLLHGKYTYYRKKKSERKKFGSL-SRFVPGDVSVPNLDVDMPRKH----------- 482
            SS+VSL+ GKYTYYRKKK  RKK GS  S  V G  + P   V+ PRK            
Sbjct: 791  SSEVSLVTGKYTYYRKKKLVRKKIGSTQSTIVNGSQNHP---VERPRKKEASRNLLDHAD 847

Query: 483  -----------GISKLTEVETSVVRTKSS--ENSRLQDPSLPSGCPSTKKPTGRKSMKPS 623
                       GI+K     ++V R+  +  ++S L D S+       K   GRK  K +
Sbjct: 848  PEPTAATSKKVGINKSASQSSTVSRSSKTIAKSSLLNDHSI------LKSAGGRKKTKVT 901

Query: 624  RVIQRNKLTEDALKCVKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNST 803
              +Q+N + E A++  +ER S  +++C + +KVV   +H +G    +     KK +  + 
Sbjct: 902  LAVQKNLVGEGAVQVSRERAST-SQNC-DVKKVVGRTNHIVGSEVELTNDSHKKTL-KAP 958

Query: 804  KASKLKRKHLMDDVTLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGC 983
            K S++KRK L +D     P K  K+ANS SK    R    +     +SRT   CPRSDGC
Sbjct: 959  KVSRVKRKQLDNDEPPLLPTKVQKVANSASKHPSSRGNADRNTHSIRSRTANSCPRSDGC 1018

Query: 984  ARSSINGWEWHRWSLNASPAERARVRGFRSFQSWYSGSEA-NASQWSNVK 1130
            ARSSINGWEWH+WSLNASPAERARVRG +     YSGSE  N  Q SN K
Sbjct: 1019 ARSSINGWEWHKWSLNASPAERARVRGIQCTHMKYSGSEVNNMMQLSNGK 1068


>ref|XP_002510762.1| set domain protein, putative [Ricinus communis]
            gi|223551463|gb|EEF52949.1| set domain protein, putative
            [Ricinus communis]
          Length = 1258

 Score =  265 bits (678), Expect = 2e-68
 Identities = 167/398 (41%), Positives = 221/398 (55%), Gaps = 22/398 (5%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EPPPPGF DN R L    + KFRP++ ++  P+I EY+AMAICRQKLHDDVL EWKSFF+
Sbjct: 692  EPPPPGFGDNARTLVPSPIHKFRPTQPEESIPKIREYVAMAICRQKLHDDVLSEWKSFFI 751

Query: 183  DSVHQLFLSRCAANKQI----KSYDGAVGRNNERLNDSIAAIDKFR-ERSYNCHNSGSSQ 347
            D +   FL      +Q         G    N +    ++ ++ K +  R +N  +S S+ 
Sbjct: 752  DGILNQFLRSIHTLRQHCQPGSKMGGTSNANKDHNGTALTSLYKLKGTREFN--SSDSAG 809

Query: 348  VSLLHGKYTYYRKKKSERKKFGSLSRFV-PGDVSVPNLDVDMPRKHGISKLTEVETSVVR 524
            VS +  KYTYYRKKK  RKK GS S+ + P D  + +  V+  +K  + K  EVE  V  
Sbjct: 810  VSSVCDKYTYYRKKKLVRKKLGSSSQSITPVDTGLQHHPVEKLQKQNVVKDIEVEPVVAT 869

Query: 525  TKSSENSRLQDP--------------SLPSGCPSTKKPTGRKSMKPSRVIQRNKL--TED 656
             K  +  + Q                SLPS     K  T +K +K    + R  +  T D
Sbjct: 870  LKKKKQKKGQTELSDDRRAIKSIVKSSLPSDQSMAKNGTHQKVIKYKHAVPRPSINVTID 929

Query: 657  ALKCVKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNSTKASKLKRKHLM 836
             +K  ++  S +++D  + +KV +SN+HD G+        SKK +  +TK SKLKRKH  
Sbjct: 930  TIKPNRKNSSDVSKDHAKVKKVSDSNNHDGGIEEVPTHDYSKKNL--ATKISKLKRKHSA 987

Query: 837  DDVTLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGCARSSINGWEWH 1016
            D  ++S P K LK+  S SK+   RQV   K K  KSR    CPRSDGCARSSI GWEWH
Sbjct: 988  DGRSVSHPMKFLKVTTSGSKQAASRQVTAGKAKSRKSRASNSCPRSDGCARSSITGWEWH 1047

Query: 1017 RWSLNASPAERARVRGFRSFQSWYSGSEANASQWSNVK 1130
            +WS +ASPA+RARVRG     + YS SEA  SQ SN K
Sbjct: 1048 KWSHSASPADRARVRGIHCLHANYSVSEAYTSQLSNGK 1085


>ref|XP_002307834.2| hypothetical protein POPTR_0005s28130g [Populus trichocarpa]
            gi|550339919|gb|EEE94830.2| hypothetical protein
            POPTR_0005s28130g [Populus trichocarpa]
          Length = 1149

 Score =  239 bits (611), Expect = 1e-60
 Identities = 161/409 (39%), Positives = 211/409 (51%), Gaps = 33/409 (8%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EPPPPGF+D+   +    + KF+PS+S + + +   Y+A+A+C+QKLHDDVL  WKS FV
Sbjct: 594  EPPPPGFKDSA--IFPPTISKFQPSKSLESTSKNGAYVAIAMCKQKLHDDVLSVWKSLFV 651

Query: 183  DSVHQLFLSRCAANKQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSGSSQVSLLH 362
            + V   F   C  +++    D     +NE        + KF E S   H+  SS +SL+ 
Sbjct: 652  NDVLHRFPGLCCTSEKHTEPD-----SNEE------GVFKFTEGSRKFHSPDSSVLSLVS 700

Query: 363  GKYTYYRKKKSERKKFGSLSRFVPGDVSVPNLDVDMPRKHGISKLTEVETSVV-----RT 527
             KYTY+RKKK   KK GS S     D  +    V+  RK     L  V  +VV       
Sbjct: 701  SKYTYHRKKKLAGKKLGSSSHSTTTDAGLQKRPVEKSRKQNF--LRNVSENVVVQPVGTP 758

Query: 528  KSSENSRLQDPSLPSGCPST----------------------------KKPTGRKSMKPS 623
            K  E  + Q  S  +G PS                             K    RK MK +
Sbjct: 759  KKKERIKGQAESSVNGRPSKATFAELPVNARSSKATVRSTVKRVQSLPKNAGHRKVMKIA 818

Query: 624  RVIQRNKLTEDALKCVKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNST 803
            + +  +K+ E+A+K  +ER            KV + N  D+ + +     CSKK + N+ 
Sbjct: 819  QAVNDDKVAEEAIKTSRERAG----------KVFDCNGCDVEIENAETTECSKKTL-NTN 867

Query: 804  KASKLKRKHLMDDVTLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGC 983
            K SKLKRK  +D  ++S P K LK+ NS  K+   RQV V+K K SKSRT  PCP SDGC
Sbjct: 868  KVSKLKRKSTVDGGSVSHPMKFLKVENSAIKQAASRQVSVRKTKSSKSRTLNPCPISDGC 927

Query: 984  ARSSINGWEWHRWSLNASPAERARVRGFRSFQSWYSGSEANASQWSNVK 1130
            ARSSINGWEWH WS+NASPAERARVRG     + YS  EA  SQ SN K
Sbjct: 928  ARSSINGWEWHAWSINASPAERARVRGVPHVHAKYSFPEAYTSQLSNGK 976


>ref|XP_004487927.1| PREDICTED: uncharacterized protein LOC101514300 isoform X5 [Cicer
            arietinum]
          Length = 1146

 Score =  229 bits (584), Expect = 2e-57
 Identities = 148/394 (37%), Positives = 218/394 (55%), Gaps = 21/394 (5%)
 Frame = +3

Query: 12   PPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFVDSV 191
            PPGFE N + +      KFRPSR  +C+P+I EY+A A+CRQKLHD VL EWK  F+DS 
Sbjct: 590  PPGFEKNSQTIVPHYKSKFRPSRIVECNPKITEYVASALCRQKLHDKVLEEWKLSFLDSA 649

Query: 192  -HQLFLSRCAANKQIKSYDGAVGRN----NERLNDSIAAIDKFRERSYNCHNSGSSQVSL 356
             +Q+F+S C   K  +      G++     ++L+D+ + + K +E      +SG+  VS 
Sbjct: 650  FNQVFMSSCTIKKHFQCRGHEKGKSVSVSKKQLDDATSGLGKVKE---GAKSSGAPPVS- 705

Query: 357  LHGKYTYYRKKKSERKKFGSLSRFVPGDVSVPNLDVDMPRK---HGISKLTEVETSVVR- 524
              GKY YYRKK S RK+FGS    V  D       +   RK     + +  EV+ + ++ 
Sbjct: 706  --GKYAYYRKKLS-RKEFGSSQSVVEDDSGPGKQPLAKLRKIVSGDVHETAEVKIAAIKR 762

Query: 525  ------------TKSSENSRLQDPSLPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALKC 668
                        +  S +S + + S PS   S    T +K +K +  +Q + +  D +K 
Sbjct: 763  GKAKMFKGKKDTSSKSRSSVIVNNSSPSYQLSLTNKTSQKVLKLACTVQNDVM--DVVKS 820

Query: 669  VKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNSTKASKLKRKHLMDDVT 848
             K R S  T++ +   KV+ SN+ D  +H    G   ++K+  + KASK K+KH  D VT
Sbjct: 821  NKRRLSTSTDNSVNM-KVIKSNNSDGTIHRKTTGHIPREKLNATNKASKSKKKHQTDGVT 879

Query: 849  LSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWSL 1028
             S P K LK++N  +     ++V V +   ++S++   CP+S+GCAR+SINGWEWH+WS 
Sbjct: 880  SSHPAKVLKISNKGASLGASKKVTVARRDSAESKSLDLCPQSNGCARTSINGWEWHKWSQ 939

Query: 1029 NASPAERARVRGFRSFQSWYSGSEANASQWSNVK 1130
            +ASPA RARVRG    Q+   GSE N+SQ SN K
Sbjct: 940  SASPACRARVRGLLRVQNKSIGSENNSSQLSNGK 973


>ref|XP_007220291.1| hypothetical protein PRUPE_ppa000519mg [Prunus persica]
            gi|462416753|gb|EMJ21490.1| hypothetical protein
            PRUPE_ppa000519mg [Prunus persica]
          Length = 1116

 Score =  228 bits (581), Expect = 4e-57
 Identities = 159/444 (35%), Positives = 221/444 (49%), Gaps = 68/444 (15%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EP PPG  D  + +      KFRPSRSD+C P+I EYIA A+CR+KLHD V+ EWKS F+
Sbjct: 520  EPLPPGLVDKAKAVISSQTCKFRPSRSDECIPKIGEYIATAMCRKKLHDSVINEWKSSFI 579

Query: 183  DSVHQLFLSRCAANKQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSGSSQVSLLH 362
            D V   FL+    +K+  ++     + N+          K  E S +C NSG+++VS + 
Sbjct: 580  DCVLHQFLASWRTSKKTHAHKERACKTNKN--------HKLEEESKHCDNSGTAKVSPII 631

Query: 363  GKYTYYRKKKSERKKFGSLSRFVPGDVSVPNLDVDMPRKHGIS----KLTEVE-TSVVRT 527
            GKYTY+RKK   +K   S S  + G   + N  V+  +   +S    + TE +  +V+  
Sbjct: 632  GKYTYHRKKLFLKKSGSSRSVTLDGK-ELKNEIVEKSKNLHVSGDMPETTEFKNATVIPK 690

Query: 528  KSSENSRLQD---------PSLPSGCPST-----KKPTGRKSMKPSRVIQRNKL-TEDAL 662
            K    S+ Q           ++  GC ST     K  + RK +K S  ++   + T   +
Sbjct: 691  KKRGQSKSQTELSVGATSLQAIAKGCASTDKKEAKSSSSRKLLKVSHAVKSMPIPTLIDI 750

Query: 663  KCVKERFSALTEDCI--------------------------------------------- 707
                  FS L + CI                                             
Sbjct: 751  VSFSFSFSFLLQSCILPVHLVYHSEWNASNDRGYGLFYPGSEPMECTPKPSKKMASAHGA 810

Query: 708  ---ETEKVVNSNDHDIGMHHGVAGGCSKKKIPNSTKASKLKRKHLMDDVTLSGPGKALKL 878
               + +KVVNSN  D G+          K+ P STKASKLKR+ +MDD+ L+ P K LK+
Sbjct: 811  NHRDVQKVVNSNGPDFGL----------KREP-STKASKLKRECVMDDLKLARPKKVLKV 859

Query: 879  ANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWSLNASPAERARV 1058
             +   K+   + +PV+K++ SKSR   PCP+S GCAR SINGWEWHRWSLNASP ERARV
Sbjct: 860  TSGTPKQAPCKPIPVRKMQSSKSRKLNPCPKSCGCARVSINGWEWHRWSLNASPVERARV 919

Query: 1059 RGFRSFQSWYSGSEANASQWSNVK 1130
            RG +   + + GS+ N SQ SN K
Sbjct: 920  RGVKYVNAEHRGSDINTSQLSNGK 943


>gb|EXC31045.1| Histone-lysine N-methyltransferase SETD1B [Morus notabilis]
          Length = 1249

 Score =  220 bits (560), Expect = 1e-54
 Identities = 154/404 (38%), Positives = 204/404 (50%), Gaps = 28/404 (6%)
 Frame = +3

Query: 3    EPPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFV 182
            EPPPPG EDN R        KFR  RS++C P++ EY+A+A+CRQKLH+DVL E K  F+
Sbjct: 688  EPPPPGCEDNIRSFASSHQDKFRTLRSNKCVPKMGEYVAIAMCRQKLHEDVLRELKMSFI 747

Query: 183  DSVHQLFLSRCAANKQ----IKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSGSSQV 350
                Q FL    ++K+    +   +GA   N +    S   +DK  E    C  S S + 
Sbjct: 748  GYALQKFLQTWRSSKKHCKLLDYEEGAQNANRKLPGGSSLLLDKIGEELECCPKSTSDKS 807

Query: 351  SLLHGKYTYYRKKKSERKKFGSLSRF--------------------VPGDVSVP---NLD 461
            S   GKYTY+RKK   +KK GS+S+                     V GDV V     + 
Sbjct: 808  STAVGKYTYHRKKS--QKKSGSISKLDTTVGGGLLDHLAEESKKEHVSGDVIVAAKAQVA 865

Query: 462  VDMPRKHGISKLTEVETSVVRTKSSENSRLQDPSLPSGCPSTKKPTGRKSMKPSRVIQRN 641
                +K G+ K      S  + KS +       +L S    TK  + RK+M  SR  +  
Sbjct: 866  ATSSKKIGLKK--GQNESSAKDKSLQVVSKVKRNLSSDRLKTKNSSSRKAMVSSRAQKSG 923

Query: 642  KLTEDALKCVKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNSTKASKLK 821
            KL E A K  + +  A +       KV N NDHD+ +   +            TKASKLK
Sbjct: 924  KLAEGANKPSRTQVLAPSSKRDGVHKVENDNDHDVKIQEDLP-----------TKASKLK 972

Query: 822  RKHLMDDVTLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSR-TPYPCPRSDGCARSSI 998
            R+  MD +  S   K LK+AN  +K+ L +Q  V+K K  KS+      PRSDGCAR+SI
Sbjct: 973  RERPMDSMPPSHSKKVLKVANGDAKQALSKQAVVKKTKSRKSKIVKNAYPRSDGCARASI 1032

Query: 999  NGWEWHRWSLNASPAERARVRGFRSFQSWYSGSEANASQWSNVK 1130
            NGWEWHRWS++ASPAERA VRG +   +  S S+ N S  SN K
Sbjct: 1033 NGWEWHRWSVSASPAERAHVRGIKYIDTKRSSSDVNKSPLSNGK 1076


>ref|XP_003594905.1| Histone-lysine N-methyltransferase SETD1B [Medicago truncatula]
            gi|355483953|gb|AES65156.1| Histone-lysine
            N-methyltransferase SETD1B [Medicago truncatula]
          Length = 1232

 Score =  215 bits (548), Expect = 2e-53
 Identities = 146/399 (36%), Positives = 202/399 (50%), Gaps = 26/399 (6%)
 Frame = +3

Query: 12   PPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFVDSV 191
            PPGFE N   +   C  KFRPSR+ +C+P+I EY+  A+CRQKLHD+VL +WK   +DS 
Sbjct: 652  PPGFEKNS--IFPHCNSKFRPSRTVECNPKITEYVTAALCRQKLHDEVLKDWKLSILDST 709

Query: 192  HQLFLSRCAANKQIKSYDGAVGR----NNERLNDSIAAIDKFRERSYNC-----HNSGSS 344
             +  +S C   K ++S     G+    N E LND+   + K +E +          + SS
Sbjct: 710  FKKVMSSCTIKKNLQSRGHGKGKSFSANKEHLNDATLGLGKVKEGTKLGLGKVKEGAKSS 769

Query: 345  QVSLLHGKYTYYRKKKSER-----------------KKFGSLSRFVPGDVSVPNLDVDMP 473
             V L   KYTY+RK  S +                 K    L + V GDV          
Sbjct: 770  GVPLAIEKYTYHRKNLSRKELCSSKPVVDDNSGPGKKPLAKLRKDVSGDVKESAEVKVTA 829

Query: 474  RKHGISKLTEVETSVVRTKSSENSRLQDPSLPSGCPSTKKPTGRKSMKPSRVIQRNKLTE 653
             K G +K+ + +      KSS  +   D S PS   S K  T +K  K +  +Q      
Sbjct: 830  IKRGKAKMIKGKKDTSSKKSSPVN--VDNSSPSVQLSLKNKTCQKVSKFAHTVQNG--VT 885

Query: 654  DALKCVKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNSTKASKLKRKHL 833
            D LK  K+R    +++ +   KVV  N+ D+ +     G  SK+K+  +   SK KRKH 
Sbjct: 886  DVLKSNKKRLLVSSDNSVGM-KVVKRNNTDVTIQRKTTGHISKEKLNATNTVSKSKRKHQ 944

Query: 834  MDDVTLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGCARSSINGWEW 1013
             D VT S P K LK++NS +     +QV   +   +KS++   CPRS GCAR+SI+GWEW
Sbjct: 945  PDGVTSSHPAKVLKISNSGASLEASKQVTEARRNSAKSKSLDLCPRSIGCARTSIDGWEW 1004

Query: 1014 HRWSLNASPAERARVRGFRSFQSWYSGSEANASQWSNVK 1130
            H+WS +ASP  RARVRG    Q+ +  SE N SQ SN K
Sbjct: 1005 HKWSQSASPTSRARVRGLPRLQNKFINSEKNPSQLSNSK 1043


>ref|XP_004487926.1| PREDICTED: uncharacterized protein LOC101514300 isoform X4 [Cicer
            arietinum]
          Length = 1196

 Score =  214 bits (546), Expect = 4e-53
 Identities = 156/442 (35%), Positives = 221/442 (50%), Gaps = 69/442 (15%)
 Frame = +3

Query: 12   PPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFVDSV 191
            PPGFE N + +      KFRPSR  +C+P+I EY+A A+CRQKLHD VL EWK  F+DS 
Sbjct: 590  PPGFEKNSQTIVPHYKSKFRPSRIVECNPKITEYVASALCRQKLHDKVLEEWKLSFLDSA 649

Query: 192  -HQLFLSRCAANKQIKSYDGAVGRN----NERLNDSIAAIDKFRERSYNCHNSGSSQVSL 356
             +Q+F+S C   K  +      G++     ++L+D+ + + K +E      +SG+  VS 
Sbjct: 650  FNQVFMSSCTIKKHFQCRGHEKGKSVSVSKKQLDDATSGLGKVKE---GAKSSGAPPVS- 705

Query: 357  LHGKYTYYRKKKSERKKFGS------------------LSRFVPGDV------------- 443
              GKY YYRKK S RK+FGS                  L + V GDV             
Sbjct: 706  --GKYAYYRKKLS-RKEFGSSQSVVEDDSGPGKQPLAKLRKIVSGDVHETAEVKIAAIKR 762

Query: 444  ------------------------SVPNLDVDMPRKHG--ISKLT-EVETSVVRTKSSEN 542
                                    S P+  + +  K    + KL   V+  V+    S  
Sbjct: 763  GKAKMFKGKKDTSSKSRSSVIVNNSSPSYQLSLTNKTSQKVLKLACTVQNDVMDVVKSNK 822

Query: 543  SRLQ---DPSLPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALKCVK---ERFSALTEDC 704
             RL    D S+      +    G    K +  I R KL +D    VK   ++ SA T++ 
Sbjct: 823  RRLSTSTDNSVNMKVIKSNNSDGTIHRKTTGHIPREKLNDDVTDVVKSNEKKLSASTDNR 882

Query: 705  IETEKVVNSNDHDIGMHHGVAGGCSKKKIPNSTKASKLKRKHLMDDVTLSGPGKALKLAN 884
            +   KVV SN+ D  +H    G  +++K+  + KASK K+KH  D VT S P K LK++N
Sbjct: 883  VSM-KVVKSNNSDGTVHRKTTGHIAREKLNATNKASKSKKKHQTDGVTSSHPAKVLKISN 941

Query: 885  SVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWSLNASPAERARVRG 1064
              +     ++V V +   ++S++   CP+S+GCAR+SINGWEWH+WS +ASPA RARVRG
Sbjct: 942  KGASLGASKKVTVARRDSAESKSLDLCPQSNGCARTSINGWEWHKWSQSASPACRARVRG 1001

Query: 1065 FRSFQSWYSGSEANASQWSNVK 1130
                Q+   GSE N+SQ SN K
Sbjct: 1002 LLRVQNKSIGSENNSSQLSNGK 1023


>ref|XP_002300607.2| hypothetical protein POPTR_0002s00320g [Populus trichocarpa]
            gi|550343967|gb|EEE79880.2| hypothetical protein
            POPTR_0002s00320g [Populus trichocarpa]
          Length = 1390

 Score =  210 bits (534), Expect = 1e-51
 Identities = 162/459 (35%), Positives = 223/459 (48%), Gaps = 84/459 (18%)
 Frame = +3

Query: 6    PPPPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFVD 185
            PPPPGF+D    L    + KFRPS+S + +P++  Y+ +A+C QKLHDDVL  WKS FVD
Sbjct: 682  PPPPGFKDTA--LFPSAINKFRPSKSLKLTPKVGAYVTIAMCMQKLHDDVLNVWKSIFVD 739

Query: 186  SV-HQLFLSRCAANKQIKSYDGAVGRNNERLNDSIAAIDKFRERSYNCHNSGSSQVSLLH 362
             + H+     C++ K  +      G N E          KF E S   H+  SS +SL+ 
Sbjct: 740  EILHRSPRLCCSSEKHTEP-----GINEE-------GAFKFTEGSNKFHSPDSSVLSLVS 787

Query: 363  GKYTYYRKKKSERKKFGSLS--------------------RFVPGDVS--VPNLDVDMPR 476
            GKYTY+RK+K   KK GS S                    + V  DVS  V    V  P+
Sbjct: 788  GKYTYHRKRKLVGKKLGSSSHSTTTVDSGLLKQPVEKSRKQDVLSDVSENVVVQPVKTPK 847

Query: 477  KHGIS--------KLTEVETSV----VRTKSSENSRLQDPSLPSGCPSTKKP-------T 599
            K G +        K T  E+SV    ++   +E+S    PS  +   + K+        +
Sbjct: 848  KKGQASSVDAKPLKATIAESSVNARPLKATIAESSVNVGPSKAAVKSTLKRDQSLPKNIS 907

Query: 600  GRKSMKPSRVIQRNKLTEDALKCVKERFSAL-------------TEDC----IETEKVVN 728
             RK MK +R +  +K  +D++K  ++    +             T +C    + + KV N
Sbjct: 908  RRKVMKIARAVNDDKDAKDSIKTSRDVVGLIDCNGRDAGIKKSGTTECSKKTLNSTKVSN 967

Query: 729  SN-------------------DHDIGMHHGVAGGCSKKK------IPNSTKASKLKRKHL 833
            S                    ++D+          ++K       +  +TK SKLKRK  
Sbjct: 968  SKRKSTVDGGSVSHPMKILKVENDVNKQAATGQVMARKTKSDHVFLCTATKVSKLKRKST 1027

Query: 834  MDDVTLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGCARSSINGWEW 1013
            ++  ++S P K LK+ N  +K+    Q   +K K SKSR   PCPRSDGCARSSINGWEW
Sbjct: 1028 VNGGSVSHPMKILKVENGANKQTATGQFTARKTKSSKSRMLIPCPRSDGCARSSINGWEW 1087

Query: 1014 HRWSLNASPAERARVRGFRSFQSWYSGSEANASQWSNVK 1130
            H WS+ ASPAERARVRG R   + YSGSEA ASQ SN K
Sbjct: 1088 HAWSVKASPAERARVRGVRCIHAKYSGSEAYASQLSNGK 1126


>ref|XP_006586959.1| PREDICTED: uncharacterized protein LOC100805708 isoform X6 [Glycine
            max]
          Length = 1153

 Score =  209 bits (531), Expect = 2e-51
 Identities = 150/395 (37%), Positives = 215/395 (54%), Gaps = 22/395 (5%)
 Frame = +3

Query: 12   PPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFVDSV 191
            PPG E + + + L    KFRPSRS +C+ +I EY+A A+CRQKLHD+VL +W+S F+DSV
Sbjct: 599  PPGLEKS-QTVALHYNSKFRPSRSAECNLKITEYVATALCRQKLHDEVLEKWRSLFLDSV 657

Query: 192  -HQLFLSRCAANKQIKSYDG-----AVGRNNERLNDSIAAIDKFRERSYNCHNSGSSQVS 353
              Q+F+S     K  KS DG      V  + E LN + + + + +E +       SS+V 
Sbjct: 658  PKQVFISSSTIKKHFKS-DGHKKRKTVNASKEHLNSATSGLGRVKEGA-----KSSSEVP 711

Query: 354  LLHGKYTYYRKKKSERKKFGSLSRFV----PGDVSVPNL------DVDMPRKHGISKLTE 503
             + GKYTY RKK S ++   S S       PG   V  L      DV    +  I+ +  
Sbjct: 712  PVIGKYTYCRKKLSRKELISSKSVAENDSRPGKQPVAKLRKHFSGDVGEAAEVKIASVIH 771

Query: 504  VETSVVRTKSSENSRLQ-----DPSLPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALKC 668
             +T +++ K    S+ +     + S  +   S K   G+K +K S  +Q +   +D +K 
Sbjct: 772  GKTKMIKGKKDTTSKGKSSVSVNSSSHNDQLSLKNKAGQKVLKFSGEVQND--VKDFVKS 829

Query: 669  VKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNST-KASKLKRKHLMDDV 845
              ++ SA T++ +  +K+V S   D  +   V   CS++ I N+T K SK KRKH MD  
Sbjct: 830  NVKKLSASTDNSVVMKKIVKS---DGTVKEKVTSHCSRE-IQNATMKVSKSKRKHQMDGT 885

Query: 846  TLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWS 1025
              S P K LK++N  +     +QV V   K +KS+    CPRSDGCAR+SI+GWEWH+WS
Sbjct: 886  ASSHPTKVLKISNGGAYLGASKQVTVASRKSAKSKPLNLCPRSDGCARTSIDGWEWHKWS 945

Query: 1026 LNASPAERARVRGFRSFQSWYSGSEANASQWSNVK 1130
             +ASPA +ARVRG    Q+    SE N SQ SN K
Sbjct: 946  RSASPAYKARVRGLPCVQNKCIDSENNLSQLSNGK 980


>ref|XP_006586956.1| PREDICTED: uncharacterized protein LOC100805708 isoform X3 [Glycine
            max]
          Length = 1227

 Score =  209 bits (531), Expect = 2e-51
 Identities = 150/395 (37%), Positives = 215/395 (54%), Gaps = 22/395 (5%)
 Frame = +3

Query: 12   PPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFVDSV 191
            PPG E + + + L    KFRPSRS +C+ +I EY+A A+CRQKLHD+VL +W+S F+DSV
Sbjct: 673  PPGLEKS-QTVALHYNSKFRPSRSAECNLKITEYVATALCRQKLHDEVLEKWRSLFLDSV 731

Query: 192  -HQLFLSRCAANKQIKSYDG-----AVGRNNERLNDSIAAIDKFRERSYNCHNSGSSQVS 353
              Q+F+S     K  KS DG      V  + E LN + + + + +E +       SS+V 
Sbjct: 732  PKQVFISSSTIKKHFKS-DGHKKRKTVNASKEHLNSATSGLGRVKEGA-----KSSSEVP 785

Query: 354  LLHGKYTYYRKKKSERKKFGSLSRFV----PGDVSVPNL------DVDMPRKHGISKLTE 503
             + GKYTY RKK S ++   S S       PG   V  L      DV    +  I+ +  
Sbjct: 786  PVIGKYTYCRKKLSRKELISSKSVAENDSRPGKQPVAKLRKHFSGDVGEAAEVKIASVIH 845

Query: 504  VETSVVRTKSSENSRLQ-----DPSLPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALKC 668
             +T +++ K    S+ +     + S  +   S K   G+K +K S  +Q +   +D +K 
Sbjct: 846  GKTKMIKGKKDTTSKGKSSVSVNSSSHNDQLSLKNKAGQKVLKFSGEVQND--VKDFVKS 903

Query: 669  VKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNST-KASKLKRKHLMDDV 845
              ++ SA T++ +  +K+V S   D  +   V   CS++ I N+T K SK KRKH MD  
Sbjct: 904  NVKKLSASTDNSVVMKKIVKS---DGTVKEKVTSHCSRE-IQNATMKVSKSKRKHQMDGT 959

Query: 846  TLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWS 1025
              S P K LK++N  +     +QV V   K +KS+    CPRSDGCAR+SI+GWEWH+WS
Sbjct: 960  ASSHPTKVLKISNGGAYLGASKQVTVASRKSAKSKPLNLCPRSDGCARTSIDGWEWHKWS 1019

Query: 1026 LNASPAERARVRGFRSFQSWYSGSEANASQWSNVK 1130
             +ASPA +ARVRG    Q+    SE N SQ SN K
Sbjct: 1020 RSASPAYKARVRGLPCVQNKCIDSENNLSQLSNGK 1054


>ref|XP_006586954.1| PREDICTED: uncharacterized protein LOC100805708 isoform X1 [Glycine
            max] gi|571476418|ref|XP_006586955.1| PREDICTED:
            uncharacterized protein LOC100805708 isoform X2 [Glycine
            max]
          Length = 1229

 Score =  209 bits (531), Expect = 2e-51
 Identities = 150/395 (37%), Positives = 215/395 (54%), Gaps = 22/395 (5%)
 Frame = +3

Query: 12   PPGFEDNPRPLNLLCLGKFRPSRSDQCSPRINEYIAMAICRQKLHDDVLGEWKSFFVDSV 191
            PPG E + + + L    KFRPSRS +C+ +I EY+A A+CRQKLHD+VL +W+S F+DSV
Sbjct: 675  PPGLEKS-QTVALHYNSKFRPSRSAECNLKITEYVATALCRQKLHDEVLEKWRSLFLDSV 733

Query: 192  -HQLFLSRCAANKQIKSYDG-----AVGRNNERLNDSIAAIDKFRERSYNCHNSGSSQVS 353
              Q+F+S     K  KS DG      V  + E LN + + + + +E +       SS+V 
Sbjct: 734  PKQVFISSSTIKKHFKS-DGHKKRKTVNASKEHLNSATSGLGRVKEGA-----KSSSEVP 787

Query: 354  LLHGKYTYYRKKKSERKKFGSLSRFV----PGDVSVPNL------DVDMPRKHGISKLTE 503
             + GKYTY RKK S ++   S S       PG   V  L      DV    +  I+ +  
Sbjct: 788  PVIGKYTYCRKKLSRKELISSKSVAENDSRPGKQPVAKLRKHFSGDVGEAAEVKIASVIH 847

Query: 504  VETSVVRTKSSENSRLQ-----DPSLPSGCPSTKKPTGRKSMKPSRVIQRNKLTEDALKC 668
             +T +++ K    S+ +     + S  +   S K   G+K +K S  +Q +   +D +K 
Sbjct: 848  GKTKMIKGKKDTTSKGKSSVSVNSSSHNDQLSLKNKAGQKVLKFSGEVQND--VKDFVKS 905

Query: 669  VKERFSALTEDCIETEKVVNSNDHDIGMHHGVAGGCSKKKIPNST-KASKLKRKHLMDDV 845
              ++ SA T++ +  +K+V S   D  +   V   CS++ I N+T K SK KRKH MD  
Sbjct: 906  NVKKLSASTDNSVVMKKIVKS---DGTVKEKVTSHCSRE-IQNATMKVSKSKRKHQMDGT 961

Query: 846  TLSGPGKALKLANSVSKKVLGRQVPVQKIKFSKSRTPYPCPRSDGCARSSINGWEWHRWS 1025
              S P K LK++N  +     +QV V   K +KS+    CPRSDGCAR+SI+GWEWH+WS
Sbjct: 962  ASSHPTKVLKISNGGAYLGASKQVTVASRKSAKSKPLNLCPRSDGCARTSIDGWEWHKWS 1021

Query: 1026 LNASPAERARVRGFRSFQSWYSGSEANASQWSNVK 1130
             +ASPA +ARVRG    Q+    SE N SQ SN K
Sbjct: 1022 RSASPAYKARVRGLPCVQNKCIDSENNLSQLSNGK 1056

