BLASTX nr result

ID: Perilla23_contig00015215 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00015215
         (836 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011096928.1| PREDICTED: protein SET DOMAIN GROUP 40 [Sesa...   350   6e-94
ref|XP_012841547.1| PREDICTED: protein SET DOMAIN GROUP 40 [Eryt...   308   4e-81
gb|EYU33951.1| hypothetical protein MIMGU_mgv1a021112mg, partial...   308   4e-81
ref|XP_010648906.1| PREDICTED: protein SET DOMAIN GROUP 40 isofo...   270   1e-69
ref|XP_010648905.1| PREDICTED: protein SET DOMAIN GROUP 40 isofo...   270   1e-69
ref|XP_002269094.3| PREDICTED: protein SET DOMAIN GROUP 40 isofo...   270   1e-69
ref|XP_009765288.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   268   4e-69
ref|XP_009794947.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   266   1e-68
ref|XP_009794946.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   266   1e-68
ref|XP_011043888.1| PREDICTED: protein SET DOMAIN GROUP 40 [Popu...   265   3e-68
ref|XP_002305239.2| hypothetical protein POPTR_0004s07950g [Popu...   264   5e-68
ref|XP_009590593.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   261   3e-67
ref|XP_012074708.1| PREDICTED: protein SET DOMAIN GROUP 40 [Jatr...   261   6e-67
ref|XP_009627152.1| PREDICTED: protein SET DOMAIN GROUP 40 isofo...   256   1e-65
ref|XP_009627151.1| PREDICTED: protein SET DOMAIN GROUP 40 isofo...   256   1e-65
gb|KHN39503.1| Protein SET DOMAIN GROUP 40 [Glycine soja]             252   2e-64
ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus ...   252   3e-64
ref|XP_010252199.1| PREDICTED: protein SET DOMAIN GROUP 40 isofo...   251   4e-64
ref|XP_004288574.1| PREDICTED: protein SET DOMAIN GROUP 40 [Frag...   251   5e-64
gb|KRH17269.1| hypothetical protein GLYMA_14G209800 [Glycine max]     248   3e-63

>ref|XP_011096928.1| PREDICTED: protein SET DOMAIN GROUP 40 [Sesamum indicum]
          Length = 467

 Score =  350 bits (899), Expect = 6e-94
 Identities = 174/251 (69%), Positives = 205/251 (81%), Gaps = 6/251 (2%)
 Frame = -3

Query: 801 DEGQRAENSTKELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELL 622
           +EG+R+ NS  EL+DA TDRL DAG+DE +ASYCFYAKRNY +GDQVLLSYGTYTNLELL
Sbjct: 218 EEGERSGNSIDELLDANTDRLTDAGFDEGVASYCFYAKRNYGKGDQVLLSYGTYTNLELL 277

Query: 621 EHYGFILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKR 442
           EHYGF+LQENPNDKAFISLE EM+SLCSWP +S+YIS DGKPSFALLSTVRLWATP+SKR
Sbjct: 278 EHYGFLLQENPNDKAFISLEPEMHSLCSWPRESIYISEDGKPSFALLSTVRLWATPMSKR 337

Query: 441 RSVKHLAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNGET 262
           RSVKH+A+SG  IS +NE AVMEW A +CRA+LS CL++I+ED  +LH +D  ++  GE 
Sbjct: 338 RSVKHIAFSGNRISAENEAAVMEWIADKCRALLSGCLSTIDEDMLMLHIIDNFQDYTGEP 397

Query: 261 ESSTLALGDEIRALLKSNSVDAGKLTA------KTRRSISRLKLAVQWRHRYKIILSDCI 100
           E S  AL DEIRA L+SN V  G L+       KTRR++ R KLAVQWRHRYK ILSDCI
Sbjct: 398 ELSP-ALCDEIRAFLESNHVTCGGLSTKLHVSLKTRRAVYRWKLAVQWRHRYKRILSDCI 456

Query: 99  THCSKMLDNIS 67
           ++C+ MLDN S
Sbjct: 457 SYCTSMLDNPS 467


>ref|XP_012841547.1| PREDICTED: protein SET DOMAIN GROUP 40 [Erythranthe guttatus]
          Length = 444

 Score =  308 bits (788), Expect = 4e-81
 Identities = 157/236 (66%), Positives = 187/236 (79%), Gaps = 2/236 (0%)
 Frame = -3

Query: 768 ELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGFILQENP 589
           +L+D  TDRL D G+DEA+ASYCFYAK+NY +GDQVLLSYGTYTNLELLE+YGF+LQENP
Sbjct: 221 DLLDGNTDRLTDPGFDEAVASYCFYAKKNYVKGDQVLLSYGTYTNLELLEYYGFLLQENP 280

Query: 588 NDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKHLAYSGQ 409
           NDKAFI LE+EMYSLCSWP DSLYIS +GKPSFAL+STVRLWATP   RRSVKH+A+SGQ
Sbjct: 281 NDKAFIPLESEMYSLCSWPRDSLYISKNGKPSFALISTVRLWATPERNRRSVKHIAFSGQ 340

Query: 408 LISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNGETESSTLALGDEI 229
           LIS +NE+A MEW A +CR +L+   +SIEED+ +L  +D+IEN +G+ E +       I
Sbjct: 341 LISNENEVAAMEWIANKCRDLLTGFPSSIEEDALVLCMIDEIENCSGDRELA-------I 393

Query: 228 RALLKSNSVDAGKLTAKTRRSISRLKLAVQWRHRYKIILSDCITHCS--KMLDNIS 67
           R LL+ NSV A      TR  I RLKLAV+WRHRYK ILSDCI +CS  +MLDN S
Sbjct: 394 RTLLERNSVKA------TRSDICRLKLAVEWRHRYKNILSDCILYCSCTRMLDNPS 443


>gb|EYU33951.1| hypothetical protein MIMGU_mgv1a021112mg, partial [Erythranthe
           guttata]
          Length = 232

 Score =  308 bits (788), Expect = 4e-81
 Identities = 157/236 (66%), Positives = 187/236 (79%), Gaps = 2/236 (0%)
 Frame = -3

Query: 768 ELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGFILQENP 589
           +L+D  TDRL D G+DEA+ASYCFYAK+NY +GDQVLLSYGTYTNLELLE+YGF+LQENP
Sbjct: 9   DLLDGNTDRLTDPGFDEAVASYCFYAKKNYVKGDQVLLSYGTYTNLELLEYYGFLLQENP 68

Query: 588 NDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKHLAYSGQ 409
           NDKAFI LE+EMYSLCSWP DSLYIS +GKPSFAL+STVRLWATP   RRSVKH+A+SGQ
Sbjct: 69  NDKAFIPLESEMYSLCSWPRDSLYISKNGKPSFALISTVRLWATPERNRRSVKHIAFSGQ 128

Query: 408 LISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNGETESSTLALGDEI 229
           LIS +NE+A MEW A +CR +L+   +SIEED+ +L  +D+IEN +G+ E +       I
Sbjct: 129 LISNENEVAAMEWIANKCRDLLTGFPSSIEEDALVLCMIDEIENCSGDRELA-------I 181

Query: 228 RALLKSNSVDAGKLTAKTRRSISRLKLAVQWRHRYKIILSDCITHCS--KMLDNIS 67
           R LL+ NSV A      TR  I RLKLAV+WRHRYK ILSDCI +CS  +MLDN S
Sbjct: 182 RTLLERNSVKA------TRSDICRLKLAVEWRHRYKNILSDCILYCSCTRMLDNPS 231


>ref|XP_010648906.1| PREDICTED: protein SET DOMAIN GROUP 40 isoform X3 [Vitis vinifera]
          Length = 489

 Score =  270 bits (689), Expect = 1e-69
 Identities = 141/262 (53%), Positives = 183/262 (69%), Gaps = 8/262 (3%)
 Frame = -3

Query: 828  DSLLRITSCDEGQRAENSTKELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSY 649
            +S L+ +S        NS  E  D  + RL D GY E LA+YCFYA++NY++G+QVLLSY
Sbjct: 217  ESSLQDSSFWNKDATSNSDAEQDDVLSQRLTDGGYKEDLAAYCFYARKNYKKGEQVLLSY 276

Query: 648  GTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVR 469
            GTYTNLELLEHYGF+L ENPNDKAFI LE E+Y+  SWP DSLYI  +GKPSFALLS +R
Sbjct: 277  GTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKDSLYIHQNGKPSFALLSALR 336

Query: 468  LWATPVSKRRSVKHLAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMD 289
            LWATP S+RRSV HL YSG  +S +NEI VMEW AK C  VL +  TS+EEDS LL  +D
Sbjct: 337  LWATPASQRRSVGHLVYSGTQLSSENEIFVMEWIAKSCHVVLENLPTSVEEDSLLLCALD 396

Query: 288  KIENSN--GETESSTLALGDEIRALLKSNSVDAGK------LTAKTRRSISRLKLAVQWR 133
            K+++ +   E  ++  + G E  A L+++ +  G       L+ K RRS+ R KLAVQWR
Sbjct: 397  KMQDPDLPMEVGNALRSSGVEFSAFLEAHDLKIGDGNVGLLLSEKARRSMERWKLAVQWR 456

Query: 132  HRYKIILSDCITHCSKMLDNIS 67
             R+K IL DCI+ C++++ ++S
Sbjct: 457  LRHKRILVDCISRCTEIISSLS 478


>ref|XP_010648905.1| PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Vitis vinifera]
          Length = 497

 Score =  270 bits (689), Expect = 1e-69
 Identities = 141/262 (53%), Positives = 183/262 (69%), Gaps = 8/262 (3%)
 Frame = -3

Query: 828  DSLLRITSCDEGQRAENSTKELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSY 649
            +S L+ +S        NS  E  D  + RL D GY E LA+YCFYA++NY++G+QVLLSY
Sbjct: 225  ESSLQDSSFWNKDATSNSDAEQDDVLSQRLTDGGYKEDLAAYCFYARKNYKKGEQVLLSY 284

Query: 648  GTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVR 469
            GTYTNLELLEHYGF+L ENPNDKAFI LE E+Y+  SWP DSLYI  +GKPSFALLS +R
Sbjct: 285  GTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKDSLYIHQNGKPSFALLSALR 344

Query: 468  LWATPVSKRRSVKHLAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMD 289
            LWATP S+RRSV HL YSG  +S +NEI VMEW AK C  VL +  TS+EEDS LL  +D
Sbjct: 345  LWATPASQRRSVGHLVYSGTQLSSENEIFVMEWIAKSCHVVLENLPTSVEEDSLLLCALD 404

Query: 288  KIENSN--GETESSTLALGDEIRALLKSNSVDAGK------LTAKTRRSISRLKLAVQWR 133
            K+++ +   E  ++  + G E  A L+++ +  G       L+ K RRS+ R KLAVQWR
Sbjct: 405  KMQDPDLPMEVGNALRSSGVEFSAFLEAHDLKIGDGNVGLLLSEKARRSMERWKLAVQWR 464

Query: 132  HRYKIILSDCITHCSKMLDNIS 67
             R+K IL DCI+ C++++ ++S
Sbjct: 465  LRHKRILVDCISRCTEIISSLS 486


>ref|XP_002269094.3| PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Vitis vinifera]
          Length = 533

 Score =  270 bits (689), Expect = 1e-69
 Identities = 141/262 (53%), Positives = 183/262 (69%), Gaps = 8/262 (3%)
 Frame = -3

Query: 828  DSLLRITSCDEGQRAENSTKELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSY 649
            +S L+ +S        NS  E  D  + RL D GY E LA+YCFYA++NY++G+QVLLSY
Sbjct: 261  ESSLQDSSFWNKDATSNSDAEQDDVLSQRLTDGGYKEDLAAYCFYARKNYKKGEQVLLSY 320

Query: 648  GTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVR 469
            GTYTNLELLEHYGF+L ENPNDKAFI LE E+Y+  SWP DSLYI  +GKPSFALLS +R
Sbjct: 321  GTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKDSLYIHQNGKPSFALLSALR 380

Query: 468  LWATPVSKRRSVKHLAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMD 289
            LWATP S+RRSV HL YSG  +S +NEI VMEW AK C  VL +  TS+EEDS LL  +D
Sbjct: 381  LWATPASQRRSVGHLVYSGTQLSSENEIFVMEWIAKSCHVVLENLPTSVEEDSLLLCALD 440

Query: 288  KIENSN--GETESSTLALGDEIRALLKSNSVDAGK------LTAKTRRSISRLKLAVQWR 133
            K+++ +   E  ++  + G E  A L+++ +  G       L+ K RRS+ R KLAVQWR
Sbjct: 441  KMQDPDLPMEVGNALRSSGVEFSAFLEAHDLKIGDGNVGLLLSEKARRSMERWKLAVQWR 500

Query: 132  HRYKIILSDCITHCSKMLDNIS 67
             R+K IL DCI+ C++++ ++S
Sbjct: 501  LRHKRILVDCISRCTEIISSLS 522


>ref|XP_009765288.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Nicotiana sylvestris]
          Length = 526

 Score =  268 bits (685), Expect = 4e-69
 Identities = 132/242 (54%), Positives = 176/242 (72%), Gaps = 5/242 (2%)
 Frame = -3

Query: 780 NSTKELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGFIL 601
           NS  EL  A   RL DAGY+E ++SYCFYA+RNY++G+QVLLSYGTYTNLELL+HYGFIL
Sbjct: 246 NSVTEL--AAAHRLIDAGYEEDVSSYCFYARRNYQKGEQVLLSYGTYTNLELLQHYGFIL 303

Query: 600 QENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKHLA 421
            +NPNDKAFI LE  MYSLCSW ++SLYI  DGKPSFALLSTVR+WA P + RR V HL 
Sbjct: 304 SDNPNDKAFIPLEPNMYSLCSWESESLYIQPDGKPSFALLSTVRIWAVPQNNRRPVAHLV 363

Query: 420 YSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNG--ETESSTL 247
           YSG  +SI+NE+  M W AK+CR +L    T+ EED +LL  +D+ + ++   + +    
Sbjct: 364 YSGYQLSIENEVVAMRWLAKKCRTILDILPTTAEEDGKLLVILDEFQETHQLVDIKEMPS 423

Query: 246 ALGDEIRALLKSNSVDAG---KLTAKTRRSISRLKLAVQWRHRYKIILSDCITHCSKMLD 76
           AL  E+ A ++S  + +     ++   RRSI RLKLA+ WR++YK ILS+CI HC+++++
Sbjct: 424 ALATELCAFMESKKIVSDGTCVISGVARRSIGRLKLAILWRYQYKKILSNCILHCTEVIN 483

Query: 75  NI 70
           +I
Sbjct: 484 DI 485


>ref|XP_009794947.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X2 [Nicotiana
           sylvestris]
          Length = 483

 Score =  266 bits (680), Expect = 1e-68
 Identities = 133/234 (56%), Positives = 167/234 (71%), Gaps = 5/234 (2%)
 Frame = -3

Query: 756 AKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGFILQENPNDKA 577
           A   RL DAGY+E ++SYCFYA+RNY++G+QVLLSYGTYTNLELL+HYGF+L +N NDKA
Sbjct: 232 AAAHRLIDAGYEEDVSSYCFYARRNYQKGEQVLLSYGTYTNLELLQHYGFLLTDNLNDKA 291

Query: 576 FISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKHLAYSGQLISI 397
           FI LE +MYSLCSW N+ LYI  DGKPSFALLSTVRLWA P  KRRS+ HL YSG  +S 
Sbjct: 292 FIPLEPDMYSLCSWENELLYIQPDGKPSFALLSTVRLWAVPQKKRRSLLHLVYSGNRLST 351

Query: 396 DNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNG--ETESSTLALGDEIRA 223
            NE+A M W AK+CR  L    T+ EED +LL  +DK ++ +   E E     L  E+ A
Sbjct: 352 HNEVAAMRWLAKKCRTTLDILPTTAEEDGKLLIILDKFQDIHQLVEIEEMPPPLASELCA 411

Query: 222 LLKSN---SVDAGKLTAKTRRSISRLKLAVQWRHRYKIILSDCITHCSKMLDNI 70
            ++S    S D   L++  RRSI R KLA+QWRH YK IL +CI HC+++++ I
Sbjct: 412 FMESKDMLSEDIYVLSSVARRSIERWKLAIQWRHHYKKILCNCIIHCTEVINEI 465


>ref|XP_009794946.1| PREDICTED: protein SET DOMAIN GROUP 40-like isoform X1 [Nicotiana
           sylvestris]
          Length = 499

 Score =  266 bits (680), Expect = 1e-68
 Identities = 133/234 (56%), Positives = 167/234 (71%), Gaps = 5/234 (2%)
 Frame = -3

Query: 756 AKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGFILQENPNDKA 577
           A   RL DAGY+E ++SYCFYA+RNY++G+QVLLSYGTYTNLELL+HYGF+L +N NDKA
Sbjct: 248 AAAHRLIDAGYEEDVSSYCFYARRNYQKGEQVLLSYGTYTNLELLQHYGFLLTDNLNDKA 307

Query: 576 FISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKHLAYSGQLISI 397
           FI LE +MYSLCSW N+ LYI  DGKPSFALLSTVRLWA P  KRRS+ HL YSG  +S 
Sbjct: 308 FIPLEPDMYSLCSWENELLYIQPDGKPSFALLSTVRLWAVPQKKRRSLLHLVYSGNRLST 367

Query: 396 DNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNG--ETESSTLALGDEIRA 223
            NE+A M W AK+CR  L    T+ EED +LL  +DK ++ +   E E     L  E+ A
Sbjct: 368 HNEVAAMRWLAKKCRTTLDILPTTAEEDGKLLIILDKFQDIHQLVEIEEMPPPLASELCA 427

Query: 222 LLKSN---SVDAGKLTAKTRRSISRLKLAVQWRHRYKIILSDCITHCSKMLDNI 70
            ++S    S D   L++  RRSI R KLA+QWRH YK IL +CI HC+++++ I
Sbjct: 428 FMESKDMLSEDIYVLSSVARRSIERWKLAIQWRHHYKKILCNCIIHCTEVINEI 481


>ref|XP_011043888.1| PREDICTED: protein SET DOMAIN GROUP 40 [Populus euphratica]
          Length = 507

 Score =  265 bits (677), Expect = 3e-68
 Identities = 136/261 (52%), Positives = 179/261 (68%), Gaps = 8/261 (3%)
 Frame = -3

Query: 825  SLLRITSCDEGQRAENSTKELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYG 646
            S L  TS   G+  ++   +  D   DRL D G+DE +A+YCFYA++NY++G QVLL YG
Sbjct: 241  SSLEDTSLLNGETTDDFIGDQPDISLDRLTDGGFDENMAAYCFYARKNYKKGTQVLLGYG 300

Query: 645  TYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRL 466
            TYTNLELLEHYGF+L  NPNDK FI+LE  MYS+ SWP  S+YI  DGKPSFALLS +RL
Sbjct: 301  TYTNLELLEHYGFLLNGNPNDKVFITLEPSMYSIISWPKVSMYIHQDGKPSFALLSALRL 360

Query: 465  WATPVSKRRSVKHLAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDK 286
            WAT  ++RRS+ HL YSG  +S+ NEI+V++W +K C  VLS+  T IEEDS LL  +DK
Sbjct: 361  WATLPNQRRSISHLVYSGSQLSVYNEISVLKWISKNCAMVLSNLPTVIEEDSLLLSTIDK 420

Query: 285  IENSNGETESSTL--ALGDEIRALLKSNSVDAGK------LTAKTRRSISRLKLAVQWRH 130
            I+N +  TE   L  + G E RA L+++ +  GK       + KT+R I R KLAVQWR 
Sbjct: 421  IKNYDNPTELGKLLCSSGGEARAFLEASDLQKGKNGSELMFSGKTKRVIERWKLAVQWRI 480

Query: 129  RYKIILSDCITHCSKMLDNIS 67
             YK  L DCI++C+  ++++S
Sbjct: 481  SYKKTLIDCISYCTVTINSLS 501


>ref|XP_002305239.2| hypothetical protein POPTR_0004s07950g [Populus trichocarpa]
            gi|550340570|gb|EEE85750.2| hypothetical protein
            POPTR_0004s07950g [Populus trichocarpa]
          Length = 518

 Score =  264 bits (675), Expect = 5e-68
 Identities = 134/259 (51%), Positives = 177/259 (68%), Gaps = 6/259 (2%)
 Frame = -3

Query: 825  SLLRITSCDEGQRAENSTKELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYG 646
            S L  TS   G+  ++   +  D   +RL D G++E +A+YCFYA++NY++G QVLL YG
Sbjct: 251  SSLEDTSLSNGETTDDFIGDQPDIGLERLTDGGFNENMAAYCFYARKNYKKGTQVLLGYG 310

Query: 645  TYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRL 466
            TYTNLELLEHYGF+L ENPNDK FI LE  MYS  SWP  S+YI  DGKPSFALLS +RL
Sbjct: 311  TYTNLELLEHYGFLLNENPNDKVFIPLEPSMYSFISWPKVSMYIHQDGKPSFALLSALRL 370

Query: 465  WATPVSKRRSVKHLAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDK 286
            WATP ++RRS+ HL YSG  +S+ NEI+V++W +K C  +LS+  T IEEDS LL  ++K
Sbjct: 371  WATPPNQRRSISHLVYSGSRLSVYNEISVLKWISKNCALILSNLPTVIEEDSLLLSTINK 430

Query: 285  IENSNGETESSTLALGDEIRALLKSNSVDAGK------LTAKTRRSISRLKLAVQWRHRY 124
            IEN +  TE      G E RA L+++ +  GK       + KT+R I R KLAVQWR  Y
Sbjct: 431  IENFDKPTE-LVCTSGGEARAFLEASDLQKGKNGSELMFSGKTKRVIERWKLAVQWRISY 489

Query: 123  KIILSDCITHCSKMLDNIS 67
            K  L DCI++C+  ++++S
Sbjct: 490  KKTLIDCISYCTVTINSLS 508


>ref|XP_009590593.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Nicotiana
           tomentosiformis]
          Length = 587

 Score =  261 bits (668), Expect = 3e-67
 Identities = 124/233 (53%), Positives = 169/233 (72%), Gaps = 5/233 (2%)
 Frame = -3

Query: 756 AKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGFILQENPNDKA 577
           A   RL DAG++E ++SYCFYA+RNY +G+QVLLSYGTYTNLELL+HYGFIL +NPNDKA
Sbjct: 268 AAAHRLIDAGFEEDVSSYCFYARRNYRKGEQVLLSYGTYTNLELLQHYGFILSDNPNDKA 327

Query: 576 FISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKHLAYSGQLISI 397
           FI LE+ MYSLCSW ++SLYI  DGKPSFALLSTVR+WA P + RR V HL YSG  +S 
Sbjct: 328 FIPLESNMYSLCSWESESLYIQPDGKPSFALLSTVRIWAVPQNNRRPVAHLVYSGYQLST 387

Query: 396 DNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNG--ETESSTLALGDEIRA 223
           +NE+  M W AK+CR +L S  T+ EED +LL  +D+ + ++   E +     L  E+ A
Sbjct: 388 ENEVVAMRWLAKKCRTILDSLPTTAEEDGKLLVILDEFQENHQLMEIKEMPSTLASELCA 447

Query: 222 LLKSNSV---DAGKLTAKTRRSISRLKLAVQWRHRYKIILSDCITHCSKMLDN 73
            ++S  +       +++  RRSI R+KLA+ WR+ YK I+S+CI HC++++++
Sbjct: 448 FMESKKIFSEGTCVISSVARRSIGRMKLAILWRYHYKKIISNCILHCTEVIND 500


>ref|XP_012074708.1| PREDICTED: protein SET DOMAIN GROUP 40 [Jatropha curcas]
           gi|643727177|gb|KDP35711.1| hypothetical protein
           JCGZ_10483 [Jatropha curcas]
          Length = 506

 Score =  261 bits (666), Expect = 6e-67
 Identities = 128/237 (54%), Positives = 171/237 (72%), Gaps = 8/237 (3%)
 Frame = -3

Query: 759 DAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGFILQENPNDK 580
           DA   RL D G+DE L +YCFYA++NY++G+QVLLSYGTYTNLELLEHYGF+L ENPNDK
Sbjct: 258 DAHLQRLTDGGFDEDLDAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGFVLDENPNDK 317

Query: 579 AFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKHLAYSGQLIS 400
            FI LE  MYS  SWP +S+YI  DGKPSFALLS +RLWATP ++RRSV HLAYSG  +S
Sbjct: 318 VFIPLEPSMYSSNSWPKESMYIHQDGKPSFALLSALRLWATPPNQRRSVGHLAYSGSQLS 377

Query: 399 IDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNGETESSTL--ALGDEIR 226
           ++NE  V++W +K C  +L++  T +EED  LL  +DKI+N     E   +      E R
Sbjct: 378 VENETWVLKWISKSCHEILNNLPTKVEEDHLLLSTIDKIQNLYNPMELGQMLCQFKGEFR 437

Query: 225 ALLKSNSVDAGK------LTAKTRRSISRLKLAVQWRHRYKIILSDCITHCSKMLDN 73
             L+++S+  GK      L++KT+++I R KLAVQWR RYK I+ DCI+ C++++++
Sbjct: 438 DFLEASSIGKGKNGDELMLSSKTKQAIERWKLAVQWRFRYKKIVVDCISSCTEIINS 494


>ref|XP_009627152.1| PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Nicotiana
           tomentosiformis]
          Length = 490

 Score =  256 bits (655), Expect = 1e-65
 Identities = 128/245 (52%), Positives = 171/245 (69%), Gaps = 6/245 (2%)
 Frame = -3

Query: 783 ENSTKELVDAKTD-RLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGF 607
           ++  K   +  TD RL DAGY+E ++SYCFYA+RNY +G+QVL+SYGTYTNLELL+HYGF
Sbjct: 219 DSMLKSATELATDHRLIDAGYEEDVSSYCFYARRNYRKGEQVLVSYGTYTNLELLQHYGF 278

Query: 606 ILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKH 427
           +L +NPNDKAFI LE +MYSLCSW N+ LY+  DGKPSFALLSTVR WA P +KRRS  H
Sbjct: 279 LLTDNPNDKAFIPLEPDMYSLCSWENELLYVQPDGKPSFALLSTVRFWAVPQNKRRSFIH 338

Query: 426 LAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNG--ETESS 253
           L YSG  +S  NE+A M W AK+CR  L    T+ EED +LL  +DK ++ +   E +  
Sbjct: 339 LVYSGNRLSTHNEVAAMRWLAKKCRTTLDILPTTAEEDGKLLVILDKFQDIHQFMEIKEM 398

Query: 252 TLALGDEIRALLKSNSVDAGKL---TAKTRRSISRLKLAVQWRHRYKIILSDCITHCSKM 82
              L  E+ A ++S  + +  +   ++  RRSI R KLA+QWR  YK IL +CI HC+++
Sbjct: 399 PPPLASELCAFMESKDMLSEGIYVPSSVARRSIERWKLAIQWRLHYKKILCNCIAHCTEI 458

Query: 81  LDNIS 67
           ++ I+
Sbjct: 459 INYIN 463


>ref|XP_009627151.1| PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Nicotiana
           tomentosiformis]
          Length = 506

 Score =  256 bits (655), Expect = 1e-65
 Identities = 128/245 (52%), Positives = 171/245 (69%), Gaps = 6/245 (2%)
 Frame = -3

Query: 783 ENSTKELVDAKTD-RLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGF 607
           ++  K   +  TD RL DAGY+E ++SYCFYA+RNY +G+QVL+SYGTYTNLELL+HYGF
Sbjct: 235 DSMLKSATELATDHRLIDAGYEEDVSSYCFYARRNYRKGEQVLVSYGTYTNLELLQHYGF 294

Query: 606 ILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKH 427
           +L +NPNDKAFI LE +MYSLCSW N+ LY+  DGKPSFALLSTVR WA P +KRRS  H
Sbjct: 295 LLTDNPNDKAFIPLEPDMYSLCSWENELLYVQPDGKPSFALLSTVRFWAVPQNKRRSFIH 354

Query: 426 LAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNG--ETESS 253
           L YSG  +S  NE+A M W AK+CR  L    T+ EED +LL  +DK ++ +   E +  
Sbjct: 355 LVYSGNRLSTHNEVAAMRWLAKKCRTTLDILPTTAEEDGKLLVILDKFQDIHQFMEIKEM 414

Query: 252 TLALGDEIRALLKSNSVDAGKL---TAKTRRSISRLKLAVQWRHRYKIILSDCITHCSKM 82
              L  E+ A ++S  + +  +   ++  RRSI R KLA+QWR  YK IL +CI HC+++
Sbjct: 415 PPPLASELCAFMESKDMLSEGIYVPSSVARRSIERWKLAIQWRLHYKKILCNCIAHCTEI 474

Query: 81  LDNIS 67
           ++ I+
Sbjct: 475 INYIN 479


>gb|KHN39503.1| Protein SET DOMAIN GROUP 40 [Glycine soja]
          Length = 497

 Score =  252 bits (644), Expect = 2e-64
 Identities = 134/267 (50%), Positives = 180/267 (67%), Gaps = 12/267 (4%)
 Frame = -3

Query: 834  DTDSLLRITSCDE-----GQRAENSTKELVDAKTDRLADAGYDEALASYCFYAKRNYERG 670
            D D LL  TS  +     G +      E +D+ + RL D G++E   +YCFYA+ +Y++G
Sbjct: 229  DLDRLLSNTSIPDTIVLNGDKNIMVDAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKG 288

Query: 669  DQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSF 490
            DQVLL YGTYTNLELLEHYGF+LQENPNDK FI LE  +YS  SW  +SLYI  +GKPSF
Sbjct: 289  DQVLLCYGTYTNLELLEHYGFLLQENPNDKVFIPLEPALYSSTSWSKESLYIHHNGKPSF 348

Query: 489  ALLSTVRLWATPVSKRRSVKHLAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDS 310
            ALL+ +RLWATP S+RRSV HL YSG  +S DNEI +M+W +K C AVL +  TS+EED+
Sbjct: 349  ALLAALRLWATPQSRRRSVGHLVYSGSRVSTDNEIFIMKWLSKTCDAVLRNLPTSLEEDT 408

Query: 309  ELLHFMDKIENSNGETESSTLALG-DEIRALLKSNSVDAGK------LTAKTRRSISRLK 151
             LL+ MD  ++ +   E + L    DE    L+++++          L+ K RRS+ R K
Sbjct: 409  FLLNAMDNSQDFSTFMEITKLVSSRDETCTFLETHNMKDTHSFTDVILSRKARRSMDRWK 468

Query: 150  LAVQWRHRYKIILSDCITHCSKMLDNI 70
            LAVQWR +YK ++ DCIT+C+K+LD++
Sbjct: 469  LAVQWRLKYKKVIFDCITYCNKILDSL 495


>ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
            gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP,
            putative [Ricinus communis]
          Length = 510

 Score =  252 bits (643), Expect = 3e-64
 Identities = 124/263 (47%), Positives = 178/263 (67%), Gaps = 9/263 (3%)
 Frame = -3

Query: 825  SLLRITSCDEGQRAENSTKELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYG 646
            S L   S    +   N   E  D +   L D G+DE  A+YCFYA++NY++G QVLLSYG
Sbjct: 239  SCLEDASLSSERSTSNFCSETFDVQLKSLTDGGFDEDKAAYCFYARQNYKKGAQVLLSYG 298

Query: 645  TYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRL 466
            TYTNLELLEHYGF+L ENPNDK FI LE  M S  +WP +S+YI  DGKPSF+LL  +RL
Sbjct: 299  TYTNLELLEHYGFLLNENPNDKVFIPLELSMQSSNTWPKESMYIHQDGKPSFSLLCALRL 358

Query: 465  WATPVSKRRSVKHLAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDK 286
            WATP ++RRS+ HLAYSG  +S++NE+++++W +++C AVL    T++EEDS LL  +DK
Sbjct: 359  WATPSNRRRSMGHLAYSGSQLSVENEVSILKWISRKCHAVLKKLPTTVEEDSLLLSAIDK 418

Query: 285  IENSNGETESSTLALGDE--IRALLKSNSV-------DAGKLTAKTRRSISRLKLAVQWR 133
            I+N +   E   +  G E    A ++++++       ++  L  K +RS+ R KLAV+WR
Sbjct: 419  IQNCHSPLELGKMLHGFEGQASAFVEAHNLLNIKIGTESTMLCGKAKRSMERWKLAVKWR 478

Query: 132  HRYKIILSDCITHCSKMLDNISV 64
              YK  L DCI++C++++D++S+
Sbjct: 479  LSYKKTLIDCISYCTEVIDSLSM 501


>ref|XP_010252199.1| PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Nelumbo
           nucifera]
          Length = 502

 Score =  251 bits (642), Expect = 4e-64
 Identities = 125/241 (51%), Positives = 166/241 (68%), Gaps = 10/241 (4%)
 Frame = -3

Query: 759 DAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGFILQENPNDK 580
           +A   RL D GY+E +++YCFYA+++Y+ G+QVLLSYGTYTNLELLEHYGFIL  NPNDK
Sbjct: 250 NAYLQRLTDGGYEEDISAYCFYARKSYKIGEQVLLSYGTYTNLELLEHYGFILDMNPNDK 309

Query: 579 AFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKHLAYSGQLIS 400
           AFI L+ E+ S  SW  D+LYI  DGKPSF LLS +RLWATP ++R+SV H AYSG  +S
Sbjct: 310 AFIELDAEICSSSSWSKDTLYIQQDGKPSFTLLSALRLWATPPNQRKSVAHYAYSGSQLS 369

Query: 399 IDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIEN----SNGETESSTLALGDE 232
            +NE++ M W AK C+ +L+   T +E+D  LLH +DK++N       E E   LA G E
Sbjct: 370 AENEMSAMRWMAKNCQILLNKFPTKVEDDDLLLHIIDKMQNFPLPKEVEYEQMMLAFGGE 429

Query: 231 IRALLKSNSVDAG------KLTAKTRRSISRLKLAVQWRHRYKIILSDCITHCSKMLDNI 70
           + A  ++N +  G        + K  RSI R KL VQWR RYK IL DCI++C++++D +
Sbjct: 430 VGAFFEANGLQKGGSGGDITFSRKMIRSIERWKLVVQWRLRYKKILVDCISYCTEVVDFL 489

Query: 69  S 67
           S
Sbjct: 490 S 490


>ref|XP_004288574.1| PREDICTED: protein SET DOMAIN GROUP 40 [Fragaria vesca subsp.
           vesca]
          Length = 511

 Score =  251 bits (641), Expect = 5e-64
 Identities = 125/239 (52%), Positives = 169/239 (70%), Gaps = 8/239 (3%)
 Frame = -3

Query: 768 ELVDAKTDRLADAGYDEALASYCFYAKRNYERGDQVLLSYGTYTNLELLEHYGFILQENP 589
           E +D+ + RL D  ++  + +YCFYAK++Y +G+QVLLSYGTYTNLELLEHYGF+L ENP
Sbjct: 262 EQLDSDSGRLTDGRFENNVGAYCFYAKKSYRKGEQVLLSYGTYTNLELLEHYGFLLNENP 321

Query: 588 NDKAFISLETEMYSLCSWPNDSLYISVDGKPSFALLSTVRLWATPVSKRRSVKHLAYSGQ 409
           NDKA++ LE E+YS CSWP + LYI   GKPSFALLS +RLWATP ++RRSV HLAYSG 
Sbjct: 322 NDKAYVPLEPEIYSSCSWPKEFLYIHQSGKPSFALLSALRLWATPANRRRSVGHLAYSGL 381

Query: 408 LISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDSELLHFMDKIENSNGETESSTLA--LGD 235
            +SI+NEI VM W + +C +++ +  T+ EEDS LL  +DKI+N N   E + ++    D
Sbjct: 382 QLSIENEIFVMRWISNKCNSIVKNLPTTFEEDSLLLSVIDKIQNVNAPLEFANISSVSTD 441

Query: 234 EI----RALLKSNSVDAGKLTAK--TRRSISRLKLAVQWRHRYKIILSDCITHCSKMLD 76
           EI      +LK  + D+  + ++   +RS  R +LAVQWR  YK IL DCI+ C +M+D
Sbjct: 442 EICTYRAEVLKKGATDSETVVSRKTMQRSRERWRLAVQWRLSYKKILVDCISFCDEMID 500


>gb|KRH17269.1| hypothetical protein GLYMA_14G209800 [Glycine max]
          Length = 348

 Score =  248 bits (634), Expect = 3e-63
 Identities = 131/267 (49%), Positives = 180/267 (67%), Gaps = 12/267 (4%)
 Frame = -3

Query: 834 DTDSLLRITSCDE-----GQRAENSTKELVDAKTDRLADAGYDEALASYCFYAKRNYERG 670
           D D LL  TS  +     G +      E +D+ + RL D G++E   +YCFYA+ +Y++G
Sbjct: 80  DLDRLLSNTSIPDTIVLNGDKNIMVDAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKG 139

Query: 669 DQVLLSYGTYTNLELLEHYGFILQENPNDKAFISLETEMYSLCSWPNDSLYISVDGKPSF 490
           DQVLL YGTYTNLELLEHYGF+LQENPNDK FI LE  +YS  SW  +SLYI  +GKPSF
Sbjct: 140 DQVLLCYGTYTNLELLEHYGFLLQENPNDKVFIPLEPALYSSTSWSKESLYIHHNGKPSF 199

Query: 489 ALLSTVRLWATPVSKRRSVKHLAYSGQLISIDNEIAVMEWTAKRCRAVLSSCLTSIEEDS 310
           ALL+ +RLWATP ++RRSV HL YSG  +S DNEI +M+W +K C AVL +  TS+EED+
Sbjct: 200 ALLAALRLWATPQNRRRSVGHLVYSGSRVSTDNEIFIMKWLSKTCDAVLRNLPTSLEEDT 259

Query: 309 ELLHFMDKIENSNGETESSTLALG-DEIRALLKSNSVDAGK------LTAKTRRSISRLK 151
            LL+ MD  ++ +   E + L    +E    L+++++          L+ K RRS+ R K
Sbjct: 260 LLLNAMDNSQDFSTFMEITKLVSSREETYTFLETHNMKDTHSFTDVILSRKARRSMDRWK 319

Query: 150 LAVQWRHRYKIILSDCITHCSKMLDNI 70
           LAVQWR +YK ++ DCI++C+K+LD++
Sbjct: 320 LAVQWRLKYKKVIFDCISYCNKILDSL 346

