BLASTX nr result

ID: Ziziphus21_contig00008037 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ziziphus21_contig00008037
         (1227 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010109164.1| SET and MYND domain-containing protein 4 [Mo...   496   e-137
ref|XP_008234145.1| PREDICTED: SET and MYND domain-containing pr...   493   e-136
ref|XP_009363760.1| PREDICTED: histone-lysine N-methyltransferas...   483   e-133
ref|XP_007220254.1| hypothetical protein PRUPE_ppa001654mg [Prun...   470   e-130
ref|XP_008376346.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND...   466   e-128
ref|XP_004309003.1| PREDICTED: SET and MYND domain-containing pr...   461   e-127
ref|XP_010658301.1| PREDICTED: SET and MYND domain-containing pr...   439   e-120
ref|XP_010658299.1| PREDICTED: SET and MYND domain-containing pr...   439   e-120
ref|XP_007011440.1| Tetratricopeptide repeat-like superfamily pr...   436   e-119
ref|XP_007011439.1| Tetratricopeptide repeat-like superfamily pr...   436   e-119
ref|XP_003602446.2| heat shock protein 70 (HSP70)-interacting pr...   431   e-118
ref|XP_006480312.1| PREDICTED: SET and MYND domain-containing pr...   431   e-118
gb|KRH64253.1| hypothetical protein GLYMA_04G225100 [Glycine max...   427   e-116
ref|XP_006578856.1| PREDICTED: uncharacterized protein LOC100794...   427   e-116
ref|XP_012077590.1| PREDICTED: SET and MYND domain-containing pr...   426   e-116
ref|XP_002323703.2| tetratricopeptide repeat-containing family p...   424   e-115
ref|XP_012077602.1| PREDICTED: uncharacterized protein LOC105638...   421   e-115
ref|XP_012077596.1| PREDICTED: SET and MYND domain-containing pr...   421   e-115
gb|KHN02136.1| RNA polymerase II-associated protein 3 [Glycine s...   421   e-115
ref|XP_012077579.1| PREDICTED: SET and MYND domain-containing pr...   421   e-115

>ref|XP_010109164.1| SET and MYND domain-containing protein 4 [Morus notabilis]
            gi|587934155|gb|EXC21093.1| SET and MYND
            domain-containing protein 4 [Morus notabilis]
          Length = 796

 Score =  496 bits (1278), Expect = e-137
 Identities = 254/369 (68%), Positives = 291/369 (78%), Gaps = 2/369 (0%)
 Frame = -2

Query: 1226 FSIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFSSGALTSNVEQVKVAQAIYETG 1047
            FSI QVVI++SQIRVNSM I R+ S + +G+  QF KFSSGALTSNVEQVKV QAIY+ G
Sbjct: 425  FSIAQVVIIISQIRVNSMAITRIKSIDVNGIVDQFGKFSSGALTSNVEQVKVGQAIYKAG 484

Query: 1046 SLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYSF 867
            SL NHSCQP IH YFLSRTLFIR TE VAAGCPLELSYG QVGQW CKDRI+ LEDEYSF
Sbjct: 485  SLLNHSCQPNIHAYFLSRTLFIRTTETVAAGCPLELSYGLQVGQWDCKDRIKLLEDEYSF 544

Query: 866  RCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPHL 687
            RC C  C + N SDLVL+AFHC+K NCSGIV DS VLN EK KI+    I GTS  EPH 
Sbjct: 545  RCQCRACLKANFSDLVLHAFHCIKPNCSGIVVDSGVLNCEKHKIEQLYDIVGTSNWEPHF 604

Query: 686  KVDNFINDAIHYGGQDTF--SNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLRD 513
            +V+NF +D  +    D F  SNS S+VNPG CLKCGSY D+E+S A V+KAWK IRRL+D
Sbjct: 605  QVENFNSDYANEVMLDAFLDSNSSSNVNPGNCLKCGSYCDLETSRATVNKAWKCIRRLQD 664

Query: 512  ALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKAS 333
             + SED+S + LSD L  LDLL+STLHAYN+ IAEVED LAQAFC+ GD+  A  HCKAS
Sbjct: 665  GIVSEDISSSELSDTLTSLDLLKSTLHAYNRRIAEVEDILAQAFCMVGDLQLARDHCKAS 724

Query: 332  IEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPYL 153
            IEILEKLY  NHIVIG+ELVKL+SIQLS  +  AVD+INRV +IFS YYGS AD IFP+L
Sbjct: 725  IEILEKLYTCNHIVIGHELVKLSSIQLSLNDPAAVDSINRVVKIFSCYYGSDADFIFPHL 784

Query: 152  QFLRRQTQK 126
            QFLR++T+K
Sbjct: 785  QFLRKETEK 793


>ref|XP_008234145.1| PREDICTED: SET and MYND domain-containing protein 4 [Prunus mume]
          Length = 797

 Score =  493 bits (1269), Expect = e-136
 Identities = 251/371 (67%), Positives = 290/371 (78%), Gaps = 4/371 (1%)
 Frame = -2

Query: 1226 FSIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFSS--GALTSNVEQVKVAQAIYE 1053
            FSI Q+VIL+SQIRVNSMT+VRM S + HGLE    KFSS  G LTSNVEQV+V QAIY 
Sbjct: 419  FSISQIVILISQIRVNSMTVVRMKSIDQHGLE-DIGKFSSLGGGLTSNVEQVRVGQAIYT 477

Query: 1052 TGSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEY 873
            +GSLFNHSCQP IH YFLSRTLFIR TE+VAAG PLELSYGPQVGQW CKDR++FLEDEY
Sbjct: 478  SGSLFNHSCQPNIHAYFLSRTLFIRTTEYVAAGVPLELSYGPQVGQWDCKDRVKFLEDEY 537

Query: 872  SFRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEP 693
            SFRC CSGC ++N SDLVLNAFHCVK NCSGIV  S V++ EK+K+K  P+I     +EP
Sbjct: 538  SFRCQCSGCLKVNFSDLVLNAFHCVKPNCSGIVLQSSVVDCEKEKLKRLPNIITAGNMEP 597

Query: 692  HLKVDNFIN--DAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRL 519
            HL+ + FIN  D           NSL  +NPG CLKC SY D+ESSSAA +KAW  IRRL
Sbjct: 598  HLQAEEFINIDDIDRVAHHHMQINSLFHINPGLCLKCCSYHDLESSSAAANKAWIIIRRL 657

Query: 518  RDALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCK 339
            +DA+ S+DVS T L DAL  L +LRST HAYN+SIAE ED+LAQ FC  G++  AM HCK
Sbjct: 658  QDAIVSKDVSSTILVDALSSLGVLRSTFHAYNRSIAEAEDNLAQVFCFVGELQPAMEHCK 717

Query: 338  ASIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFP 159
            ASIEILEKLYNPNHIVIGYELVKL+SIQLS G+  AVD+INR+ +IFS YYGS    IFP
Sbjct: 718  ASIEILEKLYNPNHIVIGYELVKLSSIQLSLGDCAAVDSINRLCDIFSCYYGSHTYKIFP 777

Query: 158  YLQFLRRQTQK 126
            YLQFL+R+ ++
Sbjct: 778  YLQFLKRREKQ 788


>ref|XP_009363760.1| PREDICTED: histone-lysine N-methyltransferase ASHR1 [Pyrus x
            bretschneideri]
          Length = 794

 Score =  483 bits (1243), Expect = e-133
 Identities = 241/365 (66%), Positives = 284/365 (77%), Gaps = 2/365 (0%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFS--SGALTSNVEQVKVAQAIYET 1050
            SI Q+VIL+SQIRVNSMTIVRM S + HGLE QF K +     LTSN+EQVKV QAIY +
Sbjct: 419  SISQIVILISQIRVNSMTIVRMKSIDYHGLEDQFGKLTPWGEGLTSNMEQVKVGQAIYLS 478

Query: 1049 GSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYS 870
            GSLFNHSCQP IH YFL RTLFIR TEFVA+G PLE SYGPQ+GQW CKDR++FLEDEYS
Sbjct: 479  GSLFNHSCQPNIHAYFLLRTLFIRTTEFVASGVPLEFSYGPQIGQWDCKDRVKFLEDEYS 538

Query: 869  FRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPH 690
            FRC C GC  +N SDL LN FHCVK NCSGIV D+ ++N E++K+K+ PSI  TS +EPH
Sbjct: 539  FRCQCRGCLNVNFSDLALNGFHCVKPNCSGIVLDTGIVNIEREKLKYLPSIISTSSVEPH 598

Query: 689  LKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLRDA 510
            L+V+ F  D I+        N++  +NPG+CLKCGSYRD+ESSSAA  KA   IRRL+DA
Sbjct: 599  LQVEEFNTDEINKLAPHVQPNNVFDINPGFCLKCGSYRDLESSSAAADKARMCIRRLQDA 658

Query: 509  LGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKASI 330
            + S+DVS T   DAL  L LLRST  AYN+SIAE ED+LAQAFCL G++  AM HCKASI
Sbjct: 659  IVSQDVSSTVPLDALSSLGLLRSTFFAYNRSIAEAEDNLAQAFCLVGELQPAMEHCKASI 718

Query: 329  EILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPYLQ 150
            EILEKLYNPNHIVIGYELVKL+S+QLS G+  A  +I R+ +IFS YYGS   +IFPYL+
Sbjct: 719  EILEKLYNPNHIVIGYELVKLSSLQLSLGDCAAAASIKRLYQIFSCYYGSHTYVIFPYLR 778

Query: 149  FLRRQ 135
            FLRR+
Sbjct: 779  FLRRE 783


>ref|XP_007220254.1| hypothetical protein PRUPE_ppa001654mg [Prunus persica]
            gi|462416716|gb|EMJ21453.1| hypothetical protein
            PRUPE_ppa001654mg [Prunus persica]
          Length = 785

 Score =  470 bits (1210), Expect = e-130
 Identities = 243/371 (65%), Positives = 280/371 (75%), Gaps = 4/371 (1%)
 Frame = -2

Query: 1226 FSIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFSS--GALTSNVEQVKVAQAIYE 1053
            FSI Q+VIL+SQIRVNSMT+VRM S + HGLE    KFSS  G LTSNVEQV+V QAIY 
Sbjct: 419  FSISQIVILLSQIRVNSMTVVRMKSIDQHGLE-DIGKFSSLGGGLTSNVEQVRVGQAIYT 477

Query: 1052 TGSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEY 873
            +GSLFNHSCQP IH YFLSRTLFIR TEFV AG PLELSYGPQVGQW CKDR++FLEDEY
Sbjct: 478  SGSLFNHSCQPNIHAYFLSRTLFIRTTEFVTAGVPLELSYGPQVGQWDCKDRVKFLEDEY 537

Query: 872  SFRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEP 693
            SFRC CSGC ++N SDLVLNAFHCV+ NCSGIV  S V++ EK+K+K  P+I     +EP
Sbjct: 538  SFRCQCSGCLKVNFSDLVLNAFHCVELNCSGIVLQSSVVDCEKEKLKRLPNIITAGNMEP 597

Query: 692  HLKVDNFIN--DAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRL 519
            HL+ + FIN  D           NSL  +NPG CLKC SYRD+ESSSAA +KAW  IR  
Sbjct: 598  HLQAEEFINIDDIDRVAHHHMQINSLFHINPGLCLKCCSYRDLESSSAAANKAWIIIR-- 655

Query: 518  RDALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCK 339
                       T L DAL  L +LRST HAYN+SIAE ED+LAQAFC  G++  AM HCK
Sbjct: 656  ----------STILVDALSSLGVLRSTFHAYNRSIAEAEDNLAQAFCFVGELQHAMEHCK 705

Query: 338  ASIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFP 159
            ASIEILEKLYNPNHIVIGYELVKL+SIQLS G+  AVD+INR+ +IFS YYGS A  +FP
Sbjct: 706  ASIEILEKLYNPNHIVIGYELVKLSSIQLSLGDCAAVDSINRLCDIFSCYYGSHAYKVFP 765

Query: 158  YLQFLRRQTQK 126
            Y QFL+R+ ++
Sbjct: 766  YFQFLKRREKQ 776


>ref|XP_008376346.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
            protein 4 [Malus domestica]
          Length = 828

 Score =  466 bits (1200), Expect = e-128
 Identities = 242/399 (60%), Positives = 283/399 (70%), Gaps = 36/399 (9%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFS--SGALTSNVEQVKVAQAIYET 1050
            SI Q+VIL+SQIRVNSMTIVRM S + HGLE QF K +     LTSN+EQVKV QAIY +
Sbjct: 419  SISQIVILISQIRVNSMTIVRMKSIDHHGLEDQFGKLTPWGEGLTSNMEQVKVGQAIYIS 478

Query: 1049 GSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYS 870
            GSLFNHSC+P IH YFLSRTLFIR TEFV +G PLE SYGPQVGQW C DR++FLEDEYS
Sbjct: 479  GSLFNHSCRPNIHAYFLSRTLFIRTTEFVTSGVPLEFSYGPQVGQWDCXDRVKFLEDEYS 538

Query: 869  FRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPH 690
            FRC C GC  +N S+L LN FHCVK NC GIV DS V+N E++K+K+ P I  TS +EPH
Sbjct: 539  FRCQCRGCLNVNFSNLALNGFHCVKPNCPGIVLDSGVVNIEREKLKYLPRIVSTSSVEPH 598

Query: 689  LKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKA---------- 540
            L+V+ F  D I+        N++  +NPG+CLKCGSYRD ESSSAA  KA          
Sbjct: 599  LQVEEFNTDEINKLAPHVQPNNVFDINPGFCLKCGSYRDXESSSAAADKARTRIXRXTLS 658

Query: 539  ---------------------WKP---IRRLRDALGSEDVSGTTLSDALKCLDLLRSTLH 432
                                 W     + RLRDA+ S+DVS T L DAL  L LLRST  
Sbjct: 659  EKWFLSFQITGVPILYVHFACWSIDCFLDRLRDAIVSQDVSSTVLLDALSSLGLLRSTFF 718

Query: 431  AYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKASIEILEKLYNPNHIVIGYELVKLASIQL 252
            AYN+SIAE ED+LAQAFCL G++  AM HCKASIEILEKLYNPNHIVIGYELVKL+S+QL
Sbjct: 719  AYNRSIAEAEDNLAQAFCLVGELQPAMEHCKASIEILEKLYNPNHIVIGYELVKLSSLQL 778

Query: 251  SSGESTAVDTINRVGEIFSRYYGSQADIIFPYLQFLRRQ 135
            S G+  A D+INR+ +IFS YYGS   +IFPYL+FLRR+
Sbjct: 779  SLGDRAAADSINRLYQIFSCYYGSHTYVIFPYLRFLRRE 817


>ref|XP_004309003.1| PREDICTED: SET and MYND domain-containing protein 4 [Fragaria vesca
            subsp. vesca]
          Length = 780

 Score =  461 bits (1187), Expect = e-127
 Identities = 237/367 (64%), Positives = 283/367 (77%), Gaps = 4/367 (1%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFS--SGALTSNVEQVKVAQAIYET 1050
            SI Q+VIL+SQIRVNSMTIVRM     H LE QF   S   G  TSNVEQV+V QAIY +
Sbjct: 420  SISQIVILISQIRVNSMTIVRMKFPNHHELEDQFGNLSPWKGGPTSNVEQVRVGQAIYTS 479

Query: 1049 GSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYS 870
             SLFNHSCQP IH YFLSRTL IR TEFVAAG PLELSYGPQVGQW CKDRI+FLEDEYS
Sbjct: 480  ASLFNHSCQPNIHAYFLSRTLHIRTTEFVAAGSPLELSYGPQVGQWDCKDRIKFLEDEYS 539

Query: 869  FRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPH 690
            FRC C+GCS++N SDLVLNAFHCVK NCSGIV +S V+N EK+K+KH P+I  T  ++  
Sbjct: 540  FRCQCTGCSKMNFSDLVLNAFHCVKLNCSGIVLESSVINCEKEKLKHLPNILNTDSIDSL 599

Query: 689  LKVD--NFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLR 516
            L+    N ++ A++    D   NS   +NPGYCLKCG+YRD+ESSS A +     IRRL+
Sbjct: 600  LQAKELNIVSKAVN----DMQINSFFQLNPGYCLKCGTYRDLESSSVAANNC---IRRLQ 652

Query: 515  DALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKA 336
            +++ S+ +S TTL  AL  L +LRSTLHAYN++IAE ED+ AQAFCL G++  AM HCKA
Sbjct: 653  NSIDSKTISRTTLLGALSSLGVLRSTLHAYNRNIAEAEDNFAQAFCLVGEMQSAMEHCKA 712

Query: 335  SIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPY 156
            SIEILEKLYN NHIV+GYELVKL+SIQLS  +S AVD+I+R+ +IFS YYGS  D+IFP 
Sbjct: 713  SIEILEKLYNCNHIVVGYELVKLSSIQLSLRDSGAVDSIDRLYQIFSCYYGSHTDVIFPD 772

Query: 155  LQFLRRQ 135
            LQFLR++
Sbjct: 773  LQFLRKE 779


>ref|XP_010658301.1| PREDICTED: SET and MYND domain-containing protein 4 isoform X2 [Vitis
            vinifera]
          Length = 763

 Score =  439 bits (1129), Expect = e-120
 Identities = 222/369 (60%), Positives = 272/369 (73%), Gaps = 2/369 (0%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFS--SGALTSNVEQVKVAQAIYET 1050
            SI Q++IL+SQI+VNS+ IVRM   + +    Q   FS   GA TSN+EQV+V QAIY  
Sbjct: 396  SISQLIILISQIKVNSIAIVRMKFMDGYSPLDQSVNFSPAGGAFTSNMEQVRVGQAIYSV 455

Query: 1049 GSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYS 870
             SLFNHSCQP IH YFLSRTLF+RATE VA GCPLELSYGPQVGQW CKDR +FL+DEYS
Sbjct: 456  ASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQWDCKDRQKFLKDEYS 515

Query: 869  FRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPH 690
            FRC CSGCSELN+SDLVLNAF CV  +C G V DSCV+ +E +K + +  +      EPH
Sbjct: 516  FRCECSGCSELNVSDLVLNAFRCVNPDCFGTVLDSCVIKYENKKFERFQGVPQDCISEPH 575

Query: 689  LKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLRDA 510
            L++    ND I       F+NS     PGYCL CG+YRD+E+S A V +A   I RL++A
Sbjct: 576  LQLK---NDGIREVAHQAFANSSFRAAPGYCLHCGAYRDLEASHATVGEAGIYISRLQEA 632

Query: 509  LGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKASI 330
            + S++V  TT SDAL+ LDLL+STLHAYNK IAE ED +AQAFC+ G++  AM HCKASI
Sbjct: 633  IVSKEVPATTFSDALRSLDLLKSTLHAYNKGIAEAEDWIAQAFCMIGELQPAMHHCKASI 692

Query: 329  EILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPYLQ 150
            EILEKLY  NHIVIGYEL+KL+SIQLS G++ A+ +I+R+  IFS YYG  AD++FPYL 
Sbjct: 693  EILEKLYGSNHIVIGYELMKLSSIQLSLGDTAAMKSISRLAAIFSWYYGPHADMMFPYLG 752

Query: 149  FLRRQTQKL 123
             L+R+  KL
Sbjct: 753  SLKREICKL 761


>ref|XP_010658299.1| PREDICTED: SET and MYND domain-containing protein 4 isoform X1 [Vitis
            vinifera] gi|731412278|ref|XP_010658300.1| PREDICTED: SET
            and MYND domain-containing protein 4 isoform X1 [Vitis
            vinifera]
          Length = 787

 Score =  439 bits (1129), Expect = e-120
 Identities = 222/369 (60%), Positives = 272/369 (73%), Gaps = 2/369 (0%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFS--SGALTSNVEQVKVAQAIYET 1050
            SI Q++IL+SQI+VNS+ IVRM   + +    Q   FS   GA TSN+EQV+V QAIY  
Sbjct: 420  SISQLIILISQIKVNSIAIVRMKFMDGYSPLDQSVNFSPAGGAFTSNMEQVRVGQAIYSV 479

Query: 1049 GSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYS 870
             SLFNHSCQP IH YFLSRTLF+RATE VA GCPLELSYGPQVGQW CKDR +FL+DEYS
Sbjct: 480  ASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQWDCKDRQKFLKDEYS 539

Query: 869  FRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPH 690
            FRC CSGCSELN+SDLVLNAF CV  +C G V DSCV+ +E +K + +  +      EPH
Sbjct: 540  FRCECSGCSELNVSDLVLNAFRCVNPDCFGTVLDSCVIKYENKKFERFQGVPQDCISEPH 599

Query: 689  LKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLRDA 510
            L++    ND I       F+NS     PGYCL CG+YRD+E+S A V +A   I RL++A
Sbjct: 600  LQLK---NDGIREVAHQAFANSSFRAAPGYCLHCGAYRDLEASHATVGEAGIYISRLQEA 656

Query: 509  LGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKASI 330
            + S++V  TT SDAL+ LDLL+STLHAYNK IAE ED +AQAFC+ G++  AM HCKASI
Sbjct: 657  IVSKEVPATTFSDALRSLDLLKSTLHAYNKGIAEAEDWIAQAFCMIGELQPAMHHCKASI 716

Query: 329  EILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPYLQ 150
            EILEKLY  NHIVIGYEL+KL+SIQLS G++ A+ +I+R+  IFS YYG  AD++FPYL 
Sbjct: 717  EILEKLYGSNHIVIGYELMKLSSIQLSLGDTAAMKSISRLAAIFSWYYGPHADMMFPYLG 776

Query: 149  FLRRQTQKL 123
             L+R+  KL
Sbjct: 777  SLKREICKL 785


>ref|XP_007011440.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
            2, partial [Theobroma cacao] gi|508728353|gb|EOY20250.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative isoform 2, partial [Theobroma cacao]
          Length = 626

 Score =  436 bits (1121), Expect = e-119
 Identities = 226/368 (61%), Positives = 272/368 (73%), Gaps = 4/368 (1%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFSSG----ALTSNVEQVKVAQAIY 1056
            S  ++VIL+SQIRVNSM IVRM S++ +  +  FRKFSSG    ALTS+VEQV+V QA+Y
Sbjct: 260  STSRIVILLSQIRVNSMAIVRMKSSDVYDQQDWFRKFSSGEAETALTSSVEQVRVGQALY 319

Query: 1055 ETGSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDE 876
             T SLFNHSC+P IH YF+SR+L IRATEFVA GCPLELSYGPQVGQW CKDR+RFL+++
Sbjct: 320  ITASLFNHSCRPNIHAYFISRSLVIRATEFVAGGCPLELSYGPQVGQWDCKDRLRFLDEQ 379

Query: 875  YSFRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLE 696
            Y FRC C GCSE+N SDLV+N F CV  NCSG+V D  V N EKQK K   +I   S L+
Sbjct: 380  YFFRCWCHGCSEVNASDLVINGFCCVNPNCSGVVLDKLVANCEKQKPKIPETIGVESHLQ 439

Query: 695  PHLKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLR 516
             H   D  I  A H    +T S+    ++  YCLKCGSY ++ S S AV KAW  +RRL+
Sbjct: 440  VHELNDIDIKKAAHISLDETRSSL--RIDSEYCLKCGSYCNLASMSEAVKKAWINLRRLQ 497

Query: 515  DALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKA 336
            D++  +D+ GT LSDAL+ + +LRS LHAYNK I E ED+LAQAFC  GD+  A  HCKA
Sbjct: 498  DSITLKDMHGTELSDALRSVGILRSILHAYNKGIGEAEDNLAQAFCFTGDLQPARDHCKA 557

Query: 335  SIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPY 156
            SIEILEKLY P+HIVIGYELVKL+SIQL  G+  AVD+INR+  IFSRYYG  A IIFPY
Sbjct: 558  SIEILEKLYGPDHIVIGYELVKLSSIQLWLGDCAAVDSINRLSLIFSRYYGPDAGIIFPY 617

Query: 155  LQFLRRQT 132
            L FLRR++
Sbjct: 618  LGFLRRKS 625


>ref|XP_007011439.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1
            [Theobroma cacao] gi|508728352|gb|EOY20249.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative isoform 1 [Theobroma cacao]
          Length = 833

 Score =  436 bits (1121), Expect = e-119
 Identities = 226/368 (61%), Positives = 272/368 (73%), Gaps = 4/368 (1%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFSSG----ALTSNVEQVKVAQAIY 1056
            S  ++VIL+SQIRVNSM IVRM S++ +  +  FRKFSSG    ALTS+VEQV+V QA+Y
Sbjct: 463  STSRIVILLSQIRVNSMAIVRMKSSDVYDQQDWFRKFSSGEAETALTSSVEQVRVGQALY 522

Query: 1055 ETGSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDE 876
             T SLFNHSC+P IH YF+SR+L IRATEFVA GCPLELSYGPQVGQW CKDR+RFL+++
Sbjct: 523  ITASLFNHSCRPNIHAYFISRSLVIRATEFVAGGCPLELSYGPQVGQWDCKDRLRFLDEQ 582

Query: 875  YSFRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLE 696
            Y FRC C GCSE+N SDLV+N F CV  NCSG+V D  V N EKQK K   +I   S L+
Sbjct: 583  YFFRCWCHGCSEVNASDLVINGFCCVNPNCSGVVLDKLVANCEKQKPKIPETIGVESHLQ 642

Query: 695  PHLKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLR 516
             H   D  I  A H    +T S+    ++  YCLKCGSY ++ S S AV KAW  +RRL+
Sbjct: 643  VHELNDIDIKKAAHISLDETRSSL--RIDSEYCLKCGSYCNLASMSEAVKKAWINLRRLQ 700

Query: 515  DALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKA 336
            D++  +D+ GT LSDAL+ + +LRS LHAYNK I E ED+LAQAFC  GD+  A  HCKA
Sbjct: 701  DSITLKDMHGTELSDALRSVGILRSILHAYNKGIGEAEDNLAQAFCFTGDLQPARDHCKA 760

Query: 335  SIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPY 156
            SIEILEKLY P+HIVIGYELVKL+SIQL  G+  AVD+INR+  IFSRYYG  A IIFPY
Sbjct: 761  SIEILEKLYGPDHIVIGYELVKLSSIQLWLGDCAAVDSINRLSLIFSRYYGPDAGIIFPY 820

Query: 155  LQFLRRQT 132
            L FLRR++
Sbjct: 821  LGFLRRKS 828


>ref|XP_003602446.2| heat shock protein 70 (HSP70)-interacting protein, putative [Medicago
            truncatula] gi|657395087|gb|AES72697.2| heat shock
            protein 70 (HSP70)-interacting protein, putative
            [Medicago truncatula]
          Length = 778

 Score =  431 bits (1108), Expect = e-118
 Identities = 221/366 (60%), Positives = 278/366 (75%), Gaps = 3/366 (0%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKF---SSGALTSNVEQVKVAQAIYE 1053
            SI QVVIL+SQI+VN MT+VR+ S + HGL  Q   F   SS  LTSNVEQV+V +AIY+
Sbjct: 421  SILQVVILISQIKVNCMTVVRLKSIDAHGLSDQSGGFPFHSSVHLTSNVEQVRVGKAIYK 480

Query: 1052 TGSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEY 873
             GSLFNHSCQP +H YFLSRTL++R T+ VAAGC LELSYGPQVG W CKDR  FL+DEY
Sbjct: 481  VGSLFNHSCQPNVHAYFLSRTLYLRTTQAVAAGCQLELSYGPQVGLWDCKDRQSFLKDEY 540

Query: 872  SFRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEP 693
            +F C C+GCSE+NLSD+VLNAFHCV  NCSG V +S VL  EKQKIKH   +A   ++  
Sbjct: 541  AFHCQCTGCSEVNLSDIVLNAFHCVNPNCSGAVLESRVLECEKQKIKH---LAVADKV-- 595

Query: 692  HLKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLRD 513
             +K D+     +H   Q+  S     + PG+CLKC SYRD+ESS A V KA   I+RL+D
Sbjct: 596  -IKNDDIYEVCLHAFNQNDAS---IHIQPGFCLKCSSYRDLESSRATVDKALICIKRLQD 651

Query: 512  ALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKAS 333
            A+ S+++S T++SDAL+ L LLRS LHA NK IAE ED+LAQAFCL G++  +  HCKAS
Sbjct: 652  AILSKEISNTSISDALRSLHLLRSNLHACNKVIAEAEDNLAQAFCLVGELQLSADHCKAS 711

Query: 332  IEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPYL 153
            I+ILEK+Y+P+ IVI YELVKL+S+QLS G+++AV++I R+G IFSRYYG  AD++FPYL
Sbjct: 712  IQILEKIYDPDDIVIAYELVKLSSVQLSLGDNSAVNSIGRIGAIFSRYYGLHADLVFPYL 771

Query: 152  QFLRRQ 135
            Q+LRR+
Sbjct: 772  QYLRRE 777


>ref|XP_006480312.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform X2
            [Citrus sinensis]
          Length = 786

 Score =  431 bits (1107), Expect = e-118
 Identities = 229/372 (61%), Positives = 269/372 (72%), Gaps = 5/372 (1%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFSSGALTSNVEQVKVAQAIYETGS 1044
            S+ QVVIL+SQIRVNS+ IVRM+SN     +H     SSG+ T  VEQV+V  AIY  GS
Sbjct: 421  SVSQVVILISQIRVNSLAIVRMDSNNYGQSDH----VSSGS-TCTVEQVRVGLAIYTAGS 475

Query: 1043 LFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYSFR 864
            LFNHSC P IH YFLSRTL IR TEFV +G PLELSYGPQVGQW CKDR++FLEDEYSFR
Sbjct: 476  LFNHSCLPNIHAYFLSRTLMIRTTEFVPSGYPLELSYGPQVGQWDCKDRLKFLEDEYSFR 535

Query: 863  CHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPHLK 684
            C CSGCSELN SDLV+NAF CV  NC G+V D+ +LN EKQK KH P++   S   PHL+
Sbjct: 536  CQCSGCSELNTSDLVINAFCCVDPNCPGVVLDNSILNCEKQKRKHLPAVPQCSSSVPHLQ 595

Query: 683  VDNFINDAIHYGGQDTF-----SNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRL 519
            V    +D   Y G   +     +N  S   PGYCLKCGS RD+ESS A V +AW  IRRL
Sbjct: 596  VGKLSSD---YIGLVAYLLLEENNRTSRYGPGYCLKCGSDRDLESSYATVDEAWIYIRRL 652

Query: 518  RDALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCK 339
            +DA+ S+++S   L DA + L LLRS LHAYNK IAE ED+LAQA CL GD+  A  HCK
Sbjct: 653  QDAIISKEISRAVLLDASRFLGLLRSILHAYNKRIAEAEDNLAQASCLVGDLISARDHCK 712

Query: 338  ASIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFP 159
            ASIEILEKLY  NHIVIGYELVKL+SIQLS  +  AVDTI+R+  IF  Y+GS A+ +FP
Sbjct: 713  ASIEILEKLYGHNHIVIGYELVKLSSIQLSLDDHNAVDTISRLAAIFLHYFGSHAETMFP 772

Query: 158  YLQFLRRQTQKL 123
            +L FL+R+  KL
Sbjct: 773  HLLFLQREALKL 784


>gb|KRH64253.1| hypothetical protein GLYMA_04G225100 [Glycine max]
            gi|947115952|gb|KRH64254.1| hypothetical protein
            GLYMA_04G225100 [Glycine max]
          Length = 576

 Score =  427 bits (1097), Expect = e-116
 Identities = 224/370 (60%), Positives = 280/370 (75%), Gaps = 5/370 (1%)
 Frame = -2

Query: 1220 IEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKF--SSGA-LTSNVEQVKVAQAIYET 1050
            I QVVI++SQI+VN MT+VR+ S + HGL  +F +F   SGA  TSNVEQV+V +AIY+ 
Sbjct: 211  ISQVVIIISQIKVNCMTVVRLKSIDAHGLSGRFGEFPFQSGAHSTSNVEQVRVGKAIYKA 270

Query: 1049 GSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYS 870
            GSLFNHSCQP IH YFLSRTL++R T  VAA   LELSYGPQVG W CKDR+ FL+DEY+
Sbjct: 271  GSLFNHSCQPNIHAYFLSRTLYLRTTNVVAAESQLELSYGPQVGLWDCKDRLNFLKDEYA 330

Query: 869  FRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPH 690
            F C C+GCSE+NLSD+VLNAFHCV  NCSG V +S V + EKQKIKH+P       +  H
Sbjct: 331  FLCQCTGCSEVNLSDIVLNAFHCVNTNCSGTVLESRVHDSEKQKIKHFP-------ISDH 383

Query: 689  LKVDNFINDAIHYGGQDTFSNSLSSVN--PGYCLKCGSYRDIESSSAAVSKAWKPIRRLR 516
              VD   N  I+      F  + +S++  PGYCLKCGSY D+ESS AAVSKA   I+RL+
Sbjct: 384  --VDK--NADIYEVCLRVFKQNGASIDIQPGYCLKCGSYCDLESSRAAVSKALTCIKRLQ 439

Query: 515  DALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKA 336
            DA+ S+ +S  T+SDALK L LLR  LHAYNK IAE EDS+AQAFCL G++  ++ +CKA
Sbjct: 440  DAILSQQISSITISDALKSLRLLRLNLHAYNKLIAEAEDSIAQAFCLVGELQLSLDYCKA 499

Query: 335  SIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPY 156
            SI+ILEKLY+ + IVI YELVKL+SIQLS G+ TAV++I+R+ +IFSRYYG  AD++FPY
Sbjct: 500  SIQILEKLYDTDDIVIAYELVKLSSIQLSLGDGTAVESISRIDDIFSRYYGLHADLVFPY 559

Query: 155  LQFLRRQTQK 126
            LQ+LRR+ +K
Sbjct: 560  LQYLRREIKK 569


>ref|XP_006578856.1| PREDICTED: uncharacterized protein LOC100794609 isoform X1 [Glycine
            max] gi|571451820|ref|XP_006578857.1| PREDICTED:
            uncharacterized protein LOC100794609 isoform X2 [Glycine
            max] gi|947115949|gb|KRH64251.1| hypothetical protein
            GLYMA_04G225100 [Glycine max] gi|947115950|gb|KRH64252.1|
            hypothetical protein GLYMA_04G225100 [Glycine max]
          Length = 789

 Score =  427 bits (1097), Expect = e-116
 Identities = 224/370 (60%), Positives = 280/370 (75%), Gaps = 5/370 (1%)
 Frame = -2

Query: 1220 IEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKF--SSGA-LTSNVEQVKVAQAIYET 1050
            I QVVI++SQI+VN MT+VR+ S + HGL  +F +F   SGA  TSNVEQV+V +AIY+ 
Sbjct: 424  ISQVVIIISQIKVNCMTVVRLKSIDAHGLSGRFGEFPFQSGAHSTSNVEQVRVGKAIYKA 483

Query: 1049 GSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYS 870
            GSLFNHSCQP IH YFLSRTL++R T  VAA   LELSYGPQVG W CKDR+ FL+DEY+
Sbjct: 484  GSLFNHSCQPNIHAYFLSRTLYLRTTNVVAAESQLELSYGPQVGLWDCKDRLNFLKDEYA 543

Query: 869  FRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPH 690
            F C C+GCSE+NLSD+VLNAFHCV  NCSG V +S V + EKQKIKH+P       +  H
Sbjct: 544  FLCQCTGCSEVNLSDIVLNAFHCVNTNCSGTVLESRVHDSEKQKIKHFP-------ISDH 596

Query: 689  LKVDNFINDAIHYGGQDTFSNSLSSVN--PGYCLKCGSYRDIESSSAAVSKAWKPIRRLR 516
              VD   N  I+      F  + +S++  PGYCLKCGSY D+ESS AAVSKA   I+RL+
Sbjct: 597  --VDK--NADIYEVCLRVFKQNGASIDIQPGYCLKCGSYCDLESSRAAVSKALTCIKRLQ 652

Query: 515  DALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKA 336
            DA+ S+ +S  T+SDALK L LLR  LHAYNK IAE EDS+AQAFCL G++  ++ +CKA
Sbjct: 653  DAILSQQISSITISDALKSLRLLRLNLHAYNKLIAEAEDSIAQAFCLVGELQLSLDYCKA 712

Query: 335  SIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPY 156
            SI+ILEKLY+ + IVI YELVKL+SIQLS G+ TAV++I+R+ +IFSRYYG  AD++FPY
Sbjct: 713  SIQILEKLYDTDDIVIAYELVKLSSIQLSLGDGTAVESISRIDDIFSRYYGLHADLVFPY 772

Query: 155  LQFLRRQTQK 126
            LQ+LRR+ +K
Sbjct: 773  LQYLRREIKK 782


>ref|XP_012077590.1| PREDICTED: SET and MYND domain-containing protein 4 isoform X2
            [Jatropha curcas]
          Length = 787

 Score =  426 bits (1094), Expect = e-116
 Identities = 219/369 (59%), Positives = 263/369 (71%), Gaps = 2/369 (0%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFS--SGALTSNVEQVKVAQAIYET 1050
            S+ Q +ILVSQIRVN+M +VRM S +      QFRKFS    ALTS+V+QV V QAIY  
Sbjct: 420  SLSQTIILVSQIRVNAMAVVRMKSIDSCCPLDQFRKFSHIGDALTSSVDQVSVGQAIYRA 479

Query: 1049 GSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYS 870
            GSLFNHSCQP IH YFLSRTLF+R TE VA GCPLELSYGP+VGQW CKDR++FLED YS
Sbjct: 480  GSLFNHSCQPNIHAYFLSRTLFVRTTELVATGCPLELSYGPRVGQWDCKDRLKFLEDRYS 539

Query: 869  FRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPH 690
            FRC CSGCS LN SDLV+NAFHCV  NC G+V DSCV+N E  K+   P ++ T  L+  
Sbjct: 540  FRCQCSGCSRLNPSDLVINAFHCVNSNCDGVVLDSCVINTELCKLTSIPRVSKTQGLDLC 599

Query: 689  LKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLRDA 510
            L+VD  +ND   +  +    NS   + PG CL CGS  ++ES   A  K+W  I RL+DA
Sbjct: 600  LQVDG-LNDVACFAQE--LCNSSLHIQPGSCLNCGSLCNLESLHEATRKSWIYIERLQDA 656

Query: 509  LGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKASI 330
            + S+++S   LSDAL+ L  LRS LH YNK IAE  D+LAQAFC  GD   A  HCK SI
Sbjct: 657  VVSKEISTAILSDALRALGTLRSILHIYNKRIAEANDNLAQAFCQLGDFQSAQDHCKVSI 716

Query: 329  EILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPYLQ 150
            +ILE LY P+ IVIGYELVKL++IQ+S GE +A D+INR+G IF RYYGS AD  FPYLQ
Sbjct: 717  KILEMLYGPDDIVIGYELVKLSTIQISLGEPSASDSINRLGAIFLRYYGSHADSNFPYLQ 776

Query: 149  FLRRQTQKL 123
             L+R++  L
Sbjct: 777  MLKRESCNL 785


>ref|XP_002323703.2| tetratricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550321554|gb|EEF05464.2|
            tetratricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 584

 Score =  424 bits (1089), Expect = e-115
 Identities = 213/368 (57%), Positives = 271/368 (73%), Gaps = 6/368 (1%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFSS--GALTSNVEQVKVAQAIYET 1050
            S+ Q +IL+SQIRVNSM IVRM S +D     QFRK +S   ALTS++EQV V QAIY+ 
Sbjct: 217  SLSQTIILISQIRVNSMAIVRMKSVDDP--PDQFRKLTSVGDALTSSLEQVPVGQAIYKA 274

Query: 1049 GSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEYS 870
             SLFNHSC P IH YFLSRTLFIR TE+V+ GCPLELSYGPQVGQ  C+DR+R+L D+YS
Sbjct: 275  ASLFNHSCLPNIHAYFLSRTLFIRTTEYVSTGCPLELSYGPQVGQSDCEDRLRYLADKYS 334

Query: 869  FRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEPH 690
            FRC C GCS+LNLSDLVLNAF CV  NC+G+V +S ++N E +K+ ++P      + + H
Sbjct: 335  FRCQCRGCSQLNLSDLVLNAFCCVNHNCAGVVLESTIINGETRKLNNFPRAPEKQKFDSH 394

Query: 689  LKVDNF----INDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRR 522
            L+        IND      +  F+NS   + PG+CL CG++RD+++S  A++KAW  I+R
Sbjct: 395  LQGHKLNIVDINDVASLALK--FNNSSLHIQPGFCLHCGTHRDLDASHEAINKAWSYIKR 452

Query: 521  LRDALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHC 342
            L++A+ S+D+SGTTL DA + L +LRSTLHAYNKS+AE ED+LAQAFCL  D   A  HC
Sbjct: 453  LQEAIISKDISGTTLLDASRALGILRSTLHAYNKSVAEAEDNLAQAFCLVRDFQSAREHC 512

Query: 341  KASIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIF 162
            K SI+IL+ LY+P+HIVIGYELVKLASIQLS  +  AVD+ N +G IF+RY+G   D I 
Sbjct: 513  KESIKILQTLYDPDHIVIGYELVKLASIQLSLDDPAAVDSTNHLGLIFARYFGPHVDFIV 572

Query: 161  PYLQFLRR 138
            PY QFL+R
Sbjct: 573  PYQQFLKR 580


>ref|XP_012077602.1| PREDICTED: uncharacterized protein LOC105638398 isoform X4 [Jatropha
            curcas]
          Length = 653

 Score =  421 bits (1081), Expect = e-115
 Identities = 219/371 (59%), Positives = 264/371 (71%), Gaps = 4/371 (1%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFS--SGALTSNVEQVKVA--QAIY 1056
            S+ Q +ILVSQIRVN+M +VRM S +      QFRKFS    ALTS+V+QV V+  QAIY
Sbjct: 284  SLSQTIILVSQIRVNAMAVVRMKSIDSCCPLDQFRKFSHIGDALTSSVDQVHVSVGQAIY 343

Query: 1055 ETGSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDE 876
              GSLFNHSCQP IH YFLSRTLF+R TE VA GCPLELSYGP+VGQW CKDR++FLED 
Sbjct: 344  RAGSLFNHSCQPNIHAYFLSRTLFVRTTELVATGCPLELSYGPRVGQWDCKDRLKFLEDR 403

Query: 875  YSFRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLE 696
            YSFRC CSGCS LN SDLV+NAFHCV  NC G+V DSCV+N E  K+   P ++ T  L+
Sbjct: 404  YSFRCQCSGCSRLNPSDLVINAFHCVNSNCDGVVLDSCVINTELCKLTSIPRVSKTQGLD 463

Query: 695  PHLKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLR 516
              L+VD  +ND   +  +    NS   + PG CL CGS  ++ES   A  K+W  I RL+
Sbjct: 464  LCLQVDG-LNDVACFAQE--LCNSSLHIQPGSCLNCGSLCNLESLHEATRKSWIYIERLQ 520

Query: 515  DALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKA 336
            DA+ S+++S   LSDAL+ L  LRS LH YNK IAE  D+LAQAFC  GD   A  HCK 
Sbjct: 521  DAVVSKEISTAILSDALRALGTLRSILHIYNKRIAEANDNLAQAFCQLGDFQSAQDHCKV 580

Query: 335  SIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPY 156
            SI+ILE LY P+ IVIGYELVKL++IQ+S GE +A D+INR+G IF RYYGS AD  FPY
Sbjct: 581  SIKILEMLYGPDDIVIGYELVKLSTIQISLGEPSASDSINRLGAIFLRYYGSHADSNFPY 640

Query: 155  LQFLRRQTQKL 123
            LQ L+R++  L
Sbjct: 641  LQMLKRESCNL 651


>ref|XP_012077596.1| PREDICTED: SET and MYND domain-containing protein 4 isoform X3
            [Jatropha curcas]
          Length = 746

 Score =  421 bits (1081), Expect = e-115
 Identities = 219/371 (59%), Positives = 264/371 (71%), Gaps = 4/371 (1%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFS--SGALTSNVEQVKVA--QAIY 1056
            S+ Q +ILVSQIRVN+M +VRM S +      QFRKFS    ALTS+V+QV V+  QAIY
Sbjct: 377  SLSQTIILVSQIRVNAMAVVRMKSIDSCCPLDQFRKFSHIGDALTSSVDQVHVSVGQAIY 436

Query: 1055 ETGSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDE 876
              GSLFNHSCQP IH YFLSRTLF+R TE VA GCPLELSYGP+VGQW CKDR++FLED 
Sbjct: 437  RAGSLFNHSCQPNIHAYFLSRTLFVRTTELVATGCPLELSYGPRVGQWDCKDRLKFLEDR 496

Query: 875  YSFRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLE 696
            YSFRC CSGCS LN SDLV+NAFHCV  NC G+V DSCV+N E  K+   P ++ T  L+
Sbjct: 497  YSFRCQCSGCSRLNPSDLVINAFHCVNSNCDGVVLDSCVINTELCKLTSIPRVSKTQGLD 556

Query: 695  PHLKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLR 516
              L+VD  +ND   +  +    NS   + PG CL CGS  ++ES   A  K+W  I RL+
Sbjct: 557  LCLQVDG-LNDVACFAQE--LCNSSLHIQPGSCLNCGSLCNLESLHEATRKSWIYIERLQ 613

Query: 515  DALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKA 336
            DA+ S+++S   LSDAL+ L  LRS LH YNK IAE  D+LAQAFC  GD   A  HCK 
Sbjct: 614  DAVVSKEISTAILSDALRALGTLRSILHIYNKRIAEANDNLAQAFCQLGDFQSAQDHCKV 673

Query: 335  SIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPY 156
            SI+ILE LY P+ IVIGYELVKL++IQ+S GE +A D+INR+G IF RYYGS AD  FPY
Sbjct: 674  SIKILEMLYGPDDIVIGYELVKLSTIQISLGEPSASDSINRLGAIFLRYYGSHADSNFPY 733

Query: 155  LQFLRRQTQKL 123
            LQ L+R++  L
Sbjct: 734  LQMLKRESCNL 744


>gb|KHN02136.1| RNA polymerase II-associated protein 3 [Glycine soja]
          Length = 786

 Score =  421 bits (1081), Expect = e-115
 Identities = 218/371 (58%), Positives = 275/371 (74%), Gaps = 5/371 (1%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEH--QFRKFSSGA-LTSNVEQVKVAQAIYE 1053
            SI QVVI++SQI+VN MT+VR+ S + HG  H   F  F SGA  TSNVEQV+V +AIY+
Sbjct: 421  SISQVVIIISQIKVNCMTVVRLKSIDAHGSGHFGDF-PFQSGAHSTSNVEQVRVGKAIYK 479

Query: 1052 TGSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDEY 873
             GSLFNHSCQP +H YFLSR L++R T  VAAG  LELSYGPQVG W CKDR+ FL++EY
Sbjct: 480  AGSLFNHSCQPNVHAYFLSRALYLRTTNVVAAGSQLELSYGPQVGLWDCKDRLNFLKNEY 539

Query: 872  SFRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLEP 693
            +F C C+GCSE+N SDLVLNAFHCV  NCSG V +S VL+ E QKIKH+P       +  
Sbjct: 540  AFHCLCTGCSEVNRSDLVLNAFHCVNPNCSGAVLESRVLDCEMQKIKHFP-------IPD 592

Query: 692  HLKVDNFINDAIHYGGQDTFSNSLSSVN--PGYCLKCGSYRDIESSSAAVSKAWKPIRRL 519
            H+  ++ I +  H+     F  +  S++  PGYCLKCGSY D+ESS AAV KA   I RL
Sbjct: 593  HVDKNDDIYEVCHH----VFKQNGKSIHIQPGYCLKCGSYCDLESSHAAVGKALACITRL 648

Query: 518  RDALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCK 339
            +DA+ S+ +S  T+SDAL+ L LLR  LHAYNK  AE EDS+AQAFCL G++  ++ HCK
Sbjct: 649  QDAILSQQISSITISDALRSLKLLRLNLHAYNKLTAEAEDSIAQAFCLVGELQLSLDHCK 708

Query: 338  ASIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFP 159
            ASI+ILEKLY+ + IVI YELVKL+SIQLS  + TAV++I+R+ +IFSRYYG  AD++FP
Sbjct: 709  ASIQILEKLYDTDDIVIAYELVKLSSIQLSLDDGTAVESISRIDDIFSRYYGLHADLVFP 768

Query: 158  YLQFLRRQTQK 126
            YLQ+LRR+ +K
Sbjct: 769  YLQYLRREVEK 779


>ref|XP_012077579.1| PREDICTED: SET and MYND domain-containing protein 4 isoform X1
            [Jatropha curcas] gi|802540642|ref|XP_012077585.1|
            PREDICTED: SET and MYND domain-containing protein 4
            isoform X1 [Jatropha curcas] gi|643739969|gb|KDP45655.1|
            hypothetical protein JCGZ_17262 [Jatropha curcas]
          Length = 789

 Score =  421 bits (1081), Expect = e-115
 Identities = 219/371 (59%), Positives = 264/371 (71%), Gaps = 4/371 (1%)
 Frame = -2

Query: 1223 SIEQVVILVSQIRVNSMTIVRMNSNEDHGLEHQFRKFS--SGALTSNVEQVKVA--QAIY 1056
            S+ Q +ILVSQIRVN+M +VRM S +      QFRKFS    ALTS+V+QV V+  QAIY
Sbjct: 420  SLSQTIILVSQIRVNAMAVVRMKSIDSCCPLDQFRKFSHIGDALTSSVDQVHVSVGQAIY 479

Query: 1055 ETGSLFNHSCQPKIHVYFLSRTLFIRATEFVAAGCPLELSYGPQVGQWGCKDRIRFLEDE 876
              GSLFNHSCQP IH YFLSRTLF+R TE VA GCPLELSYGP+VGQW CKDR++FLED 
Sbjct: 480  RAGSLFNHSCQPNIHAYFLSRTLFVRTTELVATGCPLELSYGPRVGQWDCKDRLKFLEDR 539

Query: 875  YSFRCHCSGCSELNLSDLVLNAFHCVKQNCSGIVFDSCVLNHEKQKIKHYPSIAGTSRLE 696
            YSFRC CSGCS LN SDLV+NAFHCV  NC G+V DSCV+N E  K+   P ++ T  L+
Sbjct: 540  YSFRCQCSGCSRLNPSDLVINAFHCVNSNCDGVVLDSCVINTELCKLTSIPRVSKTQGLD 599

Query: 695  PHLKVDNFINDAIHYGGQDTFSNSLSSVNPGYCLKCGSYRDIESSSAAVSKAWKPIRRLR 516
              L+VD  +ND   +  +    NS   + PG CL CGS  ++ES   A  K+W  I RL+
Sbjct: 600  LCLQVDG-LNDVACFAQE--LCNSSLHIQPGSCLNCGSLCNLESLHEATRKSWIYIERLQ 656

Query: 515  DALGSEDVSGTTLSDALKCLDLLRSTLHAYNKSIAEVEDSLAQAFCLAGDIPRAMAHCKA 336
            DA+ S+++S   LSDAL+ L  LRS LH YNK IAE  D+LAQAFC  GD   A  HCK 
Sbjct: 657  DAVVSKEISTAILSDALRALGTLRSILHIYNKRIAEANDNLAQAFCQLGDFQSAQDHCKV 716

Query: 335  SIEILEKLYNPNHIVIGYELVKLASIQLSSGESTAVDTINRVGEIFSRYYGSQADIIFPY 156
            SI+ILE LY P+ IVIGYELVKL++IQ+S GE +A D+INR+G IF RYYGS AD  FPY
Sbjct: 717  SIKILEMLYGPDDIVIGYELVKLSTIQISLGEPSASDSINRLGAIFLRYYGSHADSNFPY 776

Query: 155  LQFLRRQTQKL 123
            LQ L+R++  L
Sbjct: 777  LQMLKRESCNL 787

