BLASTX nr result

ID: Chrysanthemum21_contig00029041 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00029041
         (769 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVI06070.1| Armadillo-type fold [Cynara cardunculus var. scol...   173   2e-46
gb|OTF90440.1| putative armadillo-type fold protein [Helianthus ...   159   1e-40
ref|XP_022015686.1| uncharacterized protein LOC110915317 isoform...   159   1e-40
ref|XP_022015685.1| uncharacterized protein LOC110915317 isoform...   154   6e-39
ref|XP_023744117.1| uncharacterized protein LOC111892274 [Lactuc...   150   2e-38
ref|XP_021984276.1| uncharacterized protein LOC110880033 [Helian...   142   9e-36
ref|XP_023760703.1| uncharacterized protein LOC111909144 [Lactuc...   129   8e-31
ref|XP_010275763.1| PREDICTED: neurofilament heavy polypeptide-l...   111   5e-24
gb|PLY87820.1| hypothetical protein LSAT_5X56700 [Lactuca sativa]     106   6e-24
gb|PIN23145.1| Histone-lysine N-methyltransferase [Handroanthus ...   101   7e-22
ref|XP_021661429.1| uncharacterized protein LOC110650663 isoform...   104   9e-22
ref|XP_021661426.1| uncharacterized protein LOC110650663 isoform...   104   9e-22
ref|XP_021661428.1| uncharacterized protein LOC110650663 isoform...   104   1e-21
ref|XP_024018646.1| transcriptional regulator ATRX isoform X4 [M...   104   1e-21
ref|XP_024018645.1| treacle protein isoform X3 [Morus notabilis]      104   1e-21
ref|XP_024018643.1| treacle protein isoform X1 [Morus notabilis]      104   1e-21
ref|XP_018833674.1| PREDICTED: muscle M-line assembly protein un...   103   2e-21
ref|XP_019243876.1| PREDICTED: uncharacterized protein LOC109223...   102   4e-21
ref|XP_013682915.1| inner centromere protein A-like isoform X2 [...   102   4e-21
ref|XP_013682914.1| inner centromere protein A-like isoform X1 [...   102   4e-21

>gb|KVI06070.1| Armadillo-type fold [Cynara cardunculus var. scolymus]
          Length = 646

 Score =  173 bits (439), Expect = 2e-46
 Identities = 98/191 (51%), Positives = 128/191 (67%), Gaps = 7/191 (3%)
 Frame = -3

Query: 767 NGSSGK-----NNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDE 603
           NGS GK     N  KK+G++LVGH IKVWWPLD+M+Y GAVSS+N  DKKHKVLY+DGDE
Sbjct: 464 NGSHGKDLQNVNTLKKYGKELVGHRIKVWWPLDRMYYEGAVSSYNSLDKKHKVLYADGDE 523

Query: 602 EMLDLSQERWLMVDDMSLD--QDQIVVLPSPVTAPVKHSKQKGKRKLDSAPTQEYNSNPP 429
           E+LDL  E+W M++D+S D  Q+Q+  L SP+T   K  K KGKRK++S+  Q  N N P
Sbjct: 524 ELLDLRHEKWSMLNDLSPDQLQEQVADLTSPMTTSAKRLKPKGKRKIESSLMQVDNYNSP 583

Query: 428 KRPRSSAYSLRTKPTREHEIKDANTSTTDILCNSDYHKDDPVQTPVTSRGLNSSEKIIEV 249
           K P + AYS +T+PT+EH+I      T D+LC +D  K DPV+T        S+EK   V
Sbjct: 584 KSP-APAYSSKTRPTKEHDI------TMDVLC-TDNQKRDPVKTSDI-----STEK---V 627

Query: 248 ERDGEPNPGEN 216
           E+DG+   G +
Sbjct: 628 EQDGKRKMGRS 638


>gb|OTF90440.1| putative armadillo-type fold protein [Helianthus annuus]
          Length = 762

 Score =  159 bits (401), Expect = 1e-40
 Identities = 79/163 (48%), Positives = 101/163 (61%), Gaps = 2/163 (1%)
 Frame = -3

Query: 767 NGSSGKNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDL 588
           NGS GK+  KK+GQDLVGH +KVWWPLDKM+Y G VSS+N  D KHKV Y DGDEE+LDL
Sbjct: 391 NGSHGKDTSKKWGQDLVGHRVKVWWPLDKMYYEGTVSSYNHVDNKHKVSYVDGDEEVLDL 450

Query: 587 SQERWLMVDDMSLDQDQIVVLPSPVTAPVKHSKQKGKRK--LDSAPTQEYNSNPPKRPRS 414
             E+W M+DD+  DQ+Q+  LPSPVT   +  +QK KRK  L S  T        KR R 
Sbjct: 451 CSEKWEMLDDVLPDQEQVADLPSPVTTSSERPEQKRKRKRRLPSPDTTSAEQPEQKRKRK 510

Query: 413 SAYSLRTKPTREHEIKDANTSTTDILCNSDYHKDDPVQTPVTS 285
            +        +   +   ++ T D++CN++  KD PV TP  S
Sbjct: 511 QSRPRVVMGLKAWGLHAGSSKTADVICNAENQKDHPVSTPADS 553


>ref|XP_022015686.1| uncharacterized protein LOC110915317 isoform X2 [Helianthus annuus]
          Length = 785

 Score =  159 bits (401), Expect = 1e-40
 Identities = 79/163 (48%), Positives = 101/163 (61%), Gaps = 2/163 (1%)
 Frame = -3

Query: 767 NGSSGKNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDL 588
           NGS GK+  KK+GQDLVGH +KVWWPLDKM+Y G VSS+N  D KHKV Y DGDEE+LDL
Sbjct: 391 NGSHGKDTSKKWGQDLVGHRVKVWWPLDKMYYEGTVSSYNHVDNKHKVSYVDGDEEVLDL 450

Query: 587 SQERWLMVDDMSLDQDQIVVLPSPVTAPVKHSKQKGKRK--LDSAPTQEYNSNPPKRPRS 414
             E+W M+DD+  DQ+Q+  LPSPVT   +  +QK KRK  L S  T        KR R 
Sbjct: 451 CSEKWEMLDDVLPDQEQVADLPSPVTTSSERPEQKRKRKRRLPSPDTTSAEQPEQKRKRK 510

Query: 413 SAYSLRTKPTREHEIKDANTSTTDILCNSDYHKDDPVQTPVTS 285
            +        +   +   ++ T D++CN++  KD PV TP  S
Sbjct: 511 QSRPRVVMGLKAWGLHAGSSKTADVICNAENQKDHPVSTPADS 553


>ref|XP_022015685.1| uncharacterized protein LOC110915317 isoform X1 [Helianthus annuus]
          Length = 787

 Score =  154 bits (388), Expect = 6e-39
 Identities = 79/165 (47%), Positives = 101/165 (61%), Gaps = 4/165 (2%)
 Frame = -3

Query: 767 NGSSGKNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDL 588
           NGS GK+  KK+GQDLVGH +KVWWPLDKM+Y G VSS+N  D KHKV Y DGDEE+LDL
Sbjct: 391 NGSHGKDTSKKWGQDLVGHRVKVWWPLDKMYYEGTVSSYNHVDNKHKVSYVDGDEEVLDL 450

Query: 587 SQERWLMVDDMSLD--QDQIVVLPSPVTAPVKHSKQKGKRK--LDSAPTQEYNSNPPKRP 420
             E+W M+DD+  D  Q+Q+  LPSPVT   +  +QK KRK  L S  T        KR 
Sbjct: 451 CSEKWEMLDDVLPDQQQEQVADLPSPVTTSSERPEQKRKRKRRLPSPDTTSAEQPEQKRK 510

Query: 419 RSSAYSLRTKPTREHEIKDANTSTTDILCNSDYHKDDPVQTPVTS 285
           R  +        +   +   ++ T D++CN++  KD PV TP  S
Sbjct: 511 RKQSRPRVVMGLKAWGLHAGSSKTADVICNAENQKDHPVSTPADS 555


>ref|XP_023744117.1| uncharacterized protein LOC111892274 [Lactuca sativa]
 gb|PLY65954.1| hypothetical protein LSAT_4X87100 [Lactuca sativa]
          Length = 557

 Score =  150 bits (379), Expect = 2e-38
 Identities = 77/147 (52%), Positives = 105/147 (71%), Gaps = 2/147 (1%)
 Frame = -3

Query: 731 GQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERWLMVDDMS 552
           G+ LVGH IKVWWPLDKM+Y GAVSS+NP DKKHKVLY+DGDEE+LDL  E+W+++D  S
Sbjct: 399 GKQLVGHRIKVWWPLDKMYYEGAVSSYNPVDKKHKVLYADGDEEVLDLRVEKWIILDKSS 458

Query: 551 LDQDQIVVLPSPV-TAPVKHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAYSLRTKPTREH 375
             +++   LP+P+ T+  K  KQKGKRKL+ +P +E N N PK     + SL+TKPT+E 
Sbjct: 459 PHKEKSSDLPTPMTTSSTKRLKQKGKRKLEFSPMEEDNMNSPK-----SSSLKTKPTKED 513

Query: 374 EIKDANTS-TTDILCNSDYHKDDPVQT 297
           +I +++T+   +I   S   K DP+ T
Sbjct: 514 DITNSSTNEVVEINDASSTEKMDPIPT 540


>ref|XP_021984276.1| uncharacterized protein LOC110880033 [Helianthus annuus]
 gb|OTG16718.1| putative armadillo-type fold protein [Helianthus annuus]
          Length = 513

 Score =  142 bits (359), Expect = 9e-36
 Identities = 87/198 (43%), Positives = 112/198 (56%), Gaps = 1/198 (0%)
 Frame = -3

Query: 764 GSSGKNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLS 585
           GS GK+ PKK+G+DLVGH IKVWWPLD+M+Y G V S++P D KH+VLY+DGDEE+LDL 
Sbjct: 352 GSGGKDIPKKWGEDLVGHRIKVWWPLDEMYYKGVVFSYDPRDDKHRVLYADGDEEVLDLH 411

Query: 584 QERWLMVDDMSLDQDQIVVLPSPVTAPVKHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAY 405
            E+W M+D++S DQ     L   VT           R  +S+PTQE NSN PK P  S  
Sbjct: 412 NEKWKMLDEISPDQ-----LHGKVT-----------RNPESSPTQEDNSNSPKSPPPS-- 453

Query: 404 SLRTKPTREHEIKDANTSTTDILCNSDYHKDDPVQTPVTSRGLNSSEKIIE-VERDGEPN 228
                        DAN             KD+ V     S GL S+E+ I+ VE++GE +
Sbjct: 454 -------------DAN------------QKDNLVLALAVSDGLTSTEQTIDNVEKNGESD 488

Query: 227 PGENLEVKRLLVSGQTTE 174
           P   L  K LL   +TT+
Sbjct: 489 PSGELPEKSLLEMSETTQ 506


>ref|XP_023760703.1| uncharacterized protein LOC111909144 [Lactuca sativa]
          Length = 515

 Score =  129 bits (324), Expect = 8e-31
 Identities = 64/124 (51%), Positives = 84/124 (67%)
 Frame = -3

Query: 734 FGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERWLMVDDM 555
           +G+  VGH IK+W P DKM   GA+SS+NP DK+HKV Y DGD+E+L+L  E+W + D  
Sbjct: 244 YGKQSVGHRIKIWRPPDKMHCEGAISSYNPIDKQHKVSYVDGDDEVLNLCFEKWSIQDKS 303

Query: 554 SLDQDQIVVLPSPVTAPVKHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAYSLRTKPTREH 375
           S  ++++  LPSP+T   KH KQK KRKL+ +P QE NSN PKR     +S  TKPT+  
Sbjct: 304 SPQEEKVSELPSPITTSSKHLKQKVKRKLEFSPKQEDNSNSPKR-----FSSETKPTKGD 358

Query: 374 EIKD 363
            IKD
Sbjct: 359 NIKD 362


>ref|XP_010275763.1| PREDICTED: neurofilament heavy polypeptide-like [Nelumbo nucifera]
 ref|XP_019055527.1| PREDICTED: neurofilament heavy polypeptide-like [Nelumbo nucifera]
 ref|XP_019055528.1| PREDICTED: neurofilament heavy polypeptide-like [Nelumbo nucifera]
          Length = 923

 Score =  111 bits (277), Expect = 5e-24
 Identities = 76/223 (34%), Positives = 105/223 (47%)
 Frame = -3

Query: 740  KKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERWLMVD 561
            K  G++LVG  IKVWWP+D  FY G + SFNP  KKHKVLY DGDEE+L+L +ERW  + 
Sbjct: 665  KGHGENLVGAKIKVWWPIDHQFYKGVIDSFNPVKKKHKVLYDDGDEEILNLRKERWEFIG 724

Query: 560  DMSLDQDQIVVLPSPVTAPVKHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAYSLRTKPTR 381
            D   D DQ   L SP  A   H K+K K  LDS+  Q  +    KR   ++ S      R
Sbjct: 725  DKVSDGDQEADLSSP-DASEMHQKKKTKTNLDSSTKQARSDASSKRGGGTSASKSKVEAR 783

Query: 380  EHEIKDANTSTTDILCNSDYHKDDPVQTPVTSRGLNSSEKIIEVERDGEPNPGENLEVKR 201
                K  + S  D   N    +D P     +  G   S       +D  P     L+ + 
Sbjct: 784  ----KSGSKSRDDGKVNGKSKEDTPKAVGKSKDGGGKS-------KDDTPKASSKLKDET 832

Query: 200  LLVSGQTTEVQQDGELKLSGKFEEKKLSQTIEMEQNGEPNPSG 72
               +G++ +  Q  ++    K E  K+S +   +  GE   +G
Sbjct: 833  QKTAGKSKDDNQ--KVSTKSKDETPKISSS---KSKGETPKTG 870


>gb|PLY87820.1| hypothetical protein LSAT_5X56700 [Lactuca sativa]
          Length = 254

 Score =  106 bits (264), Expect = 6e-24
 Identities = 54/106 (50%), Positives = 71/106 (66%)
 Frame = -3

Query: 680 MFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERWLMVDDMSLDQDQIVVLPSPVTAPV 501
           M   GA+SS+NP DK+HKV Y DGD+E+L+L  E+W + D  S  ++++  LPSP+T   
Sbjct: 1   MHCEGAISSYNPIDKQHKVSYVDGDDEVLNLCFEKWSIQDKSSPQEEKVSELPSPITTSS 60

Query: 500 KHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAYSLRTKPTREHEIKD 363
           KH KQK KRKL+ +P QE NSN PKR     +S  TKPT+   IKD
Sbjct: 61  KHLKQKVKRKLEFSPKQEDNSNSPKR-----FSSETKPTKGDNIKD 101


>gb|PIN23145.1| Histone-lysine N-methyltransferase [Handroanthus impetiginosus]
          Length = 271

 Score =  101 bits (251), Expect = 7e-22
 Identities = 72/257 (28%), Positives = 126/257 (49%), Gaps = 30/257 (11%)
 Frame = -3

Query: 737 KFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERWLMV-D 561
           ++G++LVG  +KVWWP D+MFY G ++SF+   KKHKVLY DGD+E+L+L +ERW  + D
Sbjct: 13  EYGENLVGSKVKVWWPKDRMFYEGVIASFDSVKKKHKVLYIDGDKEILNLRRERWEFIGD 72

Query: 560 DMSLDQDQIVVLPSPVTAPVKHSKQKG---------KRKLDSAPTQEYNSNPPK------ 426
           D+  D+DQ V   S   +     K+KG         +RK++S+P  +      K      
Sbjct: 73  DLVSDEDQDVGHSSHDASSDMQRKKKGNKNAETSSKRRKMESSPKSKLKDTATKSGGKSK 132

Query: 425 ---RPRSSAYSLRTKPTR----------EHEIKDANTSTTDILCNSDYHKDDPVQTPVTS 285
              +  S A   ++KP+R          +H  K    S +D    +   KD+  +TP  S
Sbjct: 133 DDGKAESEAKDHKSKPSRKSVDDNIKSKDHSQKLGGKSQSDSGKAAGRSKDNVAKTPSNS 192

Query: 284 RGLNSSEKIIEVERDGEPNPGENLEVK-RLLVSGQTTEVQQDGELKLSGKFEEKKLSQTI 108
           +    S++  +  +   P  G++L      ++   T +V++   +K       +KL++T 
Sbjct: 193 K--QDSQRAAK-SKGKTPPSGKSLSASGTKMMKSSTPKVKETDRMK-------EKLAETA 242

Query: 107 EMEQNGEPNPSGCLKEK 57
           +  ++ +   +   K +
Sbjct: 243 KSSESAKGKSTETAKSR 259


>ref|XP_021661429.1| uncharacterized protein LOC110650663 isoform X4 [Hevea brasiliensis]
          Length = 759

 Score =  104 bits (260), Expect = 9e-22
 Identities = 66/189 (34%), Positives = 97/189 (51%), Gaps = 11/189 (5%)
 Frame = -3

Query: 752  KNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERW 573
            K  P   G+ LVG  IKVWWP DKMFY G + S++P  KKHKVLY+DGDEE+L+L +ERW
Sbjct: 556  KEVPPDLGEQLVGSRIKVWWPRDKMFYEGVLDSYDPIKKKHKVLYADGDEEILNLGRERW 615

Query: 572  LMVDDMSLDQDQI--VVLPSPVTAPVKHSKQKGK--------RKLDSAPTQEYNSNPPKR 423
             +V D  L  +Q+    +P+   +  K  KQKGK         K+D   +    ++  K 
Sbjct: 616  ELVGDDILPGEQVPETDIPNADPSSDKPGKQKGKLISESTKQLKVDFKRSGAATASRKKA 675

Query: 422  PRSSAYSLRTKPTREHEIKDANTSTTDILCNSDYHKD-DPVQTPVTSRGLNSSEKIIEVE 246
             +S   + + +P   +++ D +TS  D     D  +  D ++      G+NS +K  E  
Sbjct: 676  RKSKGAAAQDEPIVANKVMD-DTSRPDNGSEGDSKESTDKLKIKTLRIGINSKQKTPETA 734

Query: 245  RDGEPNPGE 219
                   GE
Sbjct: 735  SPSSDENGE 743


>ref|XP_021661426.1| uncharacterized protein LOC110650663 isoform X1 [Hevea brasiliensis]
          Length = 809

 Score =  104 bits (260), Expect = 9e-22
 Identities = 66/189 (34%), Positives = 97/189 (51%), Gaps = 11/189 (5%)
 Frame = -3

Query: 752  KNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERW 573
            K  P   G+ LVG  IKVWWP DKMFY G + S++P  KKHKVLY+DGDEE+L+L +ERW
Sbjct: 606  KEVPPDLGEQLVGSRIKVWWPRDKMFYEGVLDSYDPIKKKHKVLYADGDEEILNLGRERW 665

Query: 572  LMVDDMSLDQDQI--VVLPSPVTAPVKHSKQKGK--------RKLDSAPTQEYNSNPPKR 423
             +V D  L  +Q+    +P+   +  K  KQKGK         K+D   +    ++  K 
Sbjct: 666  ELVGDDILPGEQVPETDIPNADPSSDKPGKQKGKLISESTKQLKVDFKRSGAATASRKKA 725

Query: 422  PRSSAYSLRTKPTREHEIKDANTSTTDILCNSDYHKD-DPVQTPVTSRGLNSSEKIIEVE 246
             +S   + + +P   +++ D +TS  D     D  +  D ++      G+NS +K  E  
Sbjct: 726  RKSKGAAAQDEPIVANKVMD-DTSRPDNGSEGDSKESTDKLKIKTLRIGINSKQKTPETA 784

Query: 245  RDGEPNPGE 219
                   GE
Sbjct: 785  SPSSDENGE 793


>ref|XP_021661428.1| uncharacterized protein LOC110650663 isoform X3 [Hevea brasiliensis]
          Length = 808

 Score =  104 bits (259), Expect = 1e-21
 Identities = 65/188 (34%), Positives = 93/188 (49%), Gaps = 10/188 (5%)
 Frame = -3

Query: 752  KNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERW 573
            K  P   G+ LVG  IKVWWP DKMFY G + S++P  KKHKVLY+DGDEE+L+L +ERW
Sbjct: 606  KEVPPDLGEQLVGSRIKVWWPRDKMFYEGVLDSYDPIKKKHKVLYADGDEEILNLGRERW 665

Query: 572  LMVDDMSLDQDQIVVLPSPVTAPVKHSKQKGKRKLDSAPTQEY---------NSNPPKRP 420
             +V D  L  +Q+     P   P      K K KL S  T++           ++  K  
Sbjct: 666  ELVGDDILPGEQVPETDIPNADPSSDKPGKQKGKLISESTKQLKVDFKSGAATASRKKAR 725

Query: 419  RSSAYSLRTKPTREHEIKDANTSTTDILCNSDYHKD-DPVQTPVTSRGLNSSEKIIEVER 243
            +S   + + +P   +++ D +TS  D     D  +  D ++      G+NS +K  E   
Sbjct: 726  KSKGAAAQDEPIVANKVMD-DTSRPDNGSEGDSKESTDKLKIKTLRIGINSKQKTPETAS 784

Query: 242  DGEPNPGE 219
                  GE
Sbjct: 785  PSSDENGE 792


>ref|XP_024018646.1| transcriptional regulator ATRX isoform X4 [Morus notabilis]
          Length = 1084

 Score =  104 bits (259), Expect = 1e-21
 Identities = 72/247 (29%), Positives = 121/247 (48%), Gaps = 4/247 (1%)
 Frame = -3

Query: 734  FGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERWLMVDDM 555
            FG+ +VG  IKVWWP+DKMFY G + S++P  K+HKV Y DGD E+L++  +RW  V   
Sbjct: 744  FGELMVGRRIKVWWPMDKMFYEGVIDSYDPIRKRHKVCYIDGDVEILNMKNQRWETVGHD 803

Query: 554  SL-DQDQIVVLPS-PVTAPVKHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAYSLRTKPTR 381
            SL D+DQ   LP    ++P+   K+K K K D    ++ + +P   P+   +  +TK  +
Sbjct: 804  SLADKDQKSDLPGLDTSSPISPQKEKQKAKSDQKKQRKDDCSPESNPQKGKW--KTKSNQ 861

Query: 380  EHEIKDANTSTTDILCNSDYHKDDPVQTPVTSRGLNSSEKIIEVERDGEPNPGENLEVKR 201
            + ++K ++    +    +DY    P   P + RG    ++  + +R G+          +
Sbjct: 862  KTQVKISSPEKEE---KADY---SPESIPRSGRGKTKPDQEKQAKRSGQ---------AK 906

Query: 200  LLVSGQTTEVQQDGELKLSGKFEEKKLSQTIEM--EQNGEPNPSGCLKEKRLLDLCHTIE 27
               S ++       ++KL    ++KK +Q  +   + + E NP    K KR  D     E
Sbjct: 907  ARSSSKSNPRSGKVKMKLDKGTQDKKSNQEKQSKDDSSSESNPQKG-KRKRESDEGEQAE 965

Query: 26   MEQDGEP 6
               +  P
Sbjct: 966  KSDESNP 972


>ref|XP_024018645.1| treacle protein isoform X3 [Morus notabilis]
          Length = 1140

 Score =  104 bits (259), Expect = 1e-21
 Identities = 72/247 (29%), Positives = 121/247 (48%), Gaps = 4/247 (1%)
 Frame = -3

Query: 734  FGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERWLMVDDM 555
            FG+ +VG  IKVWWP+DKMFY G + S++P  K+HKV Y DGD E+L++  +RW  V   
Sbjct: 744  FGELMVGRRIKVWWPMDKMFYEGVIDSYDPIRKRHKVCYIDGDVEILNMKNQRWETVGHD 803

Query: 554  SL-DQDQIVVLPS-PVTAPVKHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAYSLRTKPTR 381
            SL D+DQ   LP    ++P+   K+K K K D    ++ + +P   P+   +  +TK  +
Sbjct: 804  SLADKDQKSDLPGLDTSSPISPQKEKQKAKSDQKKQRKDDCSPESNPQKGKW--KTKSNQ 861

Query: 380  EHEIKDANTSTTDILCNSDYHKDDPVQTPVTSRGLNSSEKIIEVERDGEPNPGENLEVKR 201
            + ++K ++    +    +DY    P   P + RG    ++  + +R G+          +
Sbjct: 862  KTQVKISSPEKEE---KADY---SPESIPRSGRGKTKPDQEKQAKRSGQ---------AK 906

Query: 200  LLVSGQTTEVQQDGELKLSGKFEEKKLSQTIEM--EQNGEPNPSGCLKEKRLLDLCHTIE 27
               S ++       ++KL    ++KK +Q  +   + + E NP    K KR  D     E
Sbjct: 907  ARSSSKSNPRSGKVKMKLDKGTQDKKSNQEKQSKDDSSSESNPQKG-KRKRESDEGEQAE 965

Query: 26   MEQDGEP 6
               +  P
Sbjct: 966  KSDESNP 972


>ref|XP_024018643.1| treacle protein isoform X1 [Morus notabilis]
          Length = 1174

 Score =  104 bits (259), Expect = 1e-21
 Identities = 72/247 (29%), Positives = 121/247 (48%), Gaps = 4/247 (1%)
 Frame = -3

Query: 734  FGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERWLMVDDM 555
            FG+ +VG  IKVWWP+DKMFY G + S++P  K+HKV Y DGD E+L++  +RW  V   
Sbjct: 744  FGELMVGRRIKVWWPMDKMFYEGVIDSYDPIRKRHKVCYIDGDVEILNMKNQRWETVGHD 803

Query: 554  SL-DQDQIVVLPS-PVTAPVKHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAYSLRTKPTR 381
            SL D+DQ   LP    ++P+   K+K K K D    ++ + +P   P+   +  +TK  +
Sbjct: 804  SLADKDQKSDLPGLDTSSPISPQKEKQKAKSDQKKQRKDDCSPESNPQKGKW--KTKSNQ 861

Query: 380  EHEIKDANTSTTDILCNSDYHKDDPVQTPVTSRGLNSSEKIIEVERDGEPNPGENLEVKR 201
            + ++K ++    +    +DY    P   P + RG    ++  + +R G+          +
Sbjct: 862  KTQVKISSPEKEE---KADY---SPESIPRSGRGKTKPDQEKQAKRSGQ---------AK 906

Query: 200  LLVSGQTTEVQQDGELKLSGKFEEKKLSQTIEM--EQNGEPNPSGCLKEKRLLDLCHTIE 27
               S ++       ++KL    ++KK +Q  +   + + E NP    K KR  D     E
Sbjct: 907  ARSSSKSNPRSGKVKMKLDKGTQDKKSNQEKQSKDDSSSESNPQKG-KRKRESDEGEQAE 965

Query: 26   MEQDGEP 6
               +  P
Sbjct: 966  KSDESNP 972


>ref|XP_018833674.1| PREDICTED: muscle M-line assembly protein unc-89-like isoform X2
            [Juglans regia]
          Length = 935

 Score =  103 bits (258), Expect = 2e-21
 Identities = 71/216 (32%), Positives = 102/216 (47%), Gaps = 3/216 (1%)
 Frame = -3

Query: 755  GKNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQER 576
            GK N   +G++LVG  IKVWWP D+MFY G + SF  + KKH+VLY+DGDEE+L+L +E+
Sbjct: 628  GKENDSDYGENLVGSKIKVWWPKDRMFYDGVIDSFISSSKKHRVLYTDGDEEVLNLKKEK 687

Query: 575  WLMV-DDMSLDQDQIVVLPSPVTAPVKHSKQKGKRKLDSAPTQEYNSNPPKR--PRSSAY 405
            W  +  D   D +Q+   PSP  +     K+K K   D    Q      PK+    SS+ 
Sbjct: 688  WEYIGGDSGSDGEQVADQPSPDPSSEMPPKKKAKNNSDEPTKQAKMDALPKKGGGASSSK 747

Query: 404  SLRTKPTREHEIKDANTSTTDILCNSDYHKDDPVQTPVTSRGLNSSEKIIEVERDGEPNP 225
            S  T     H+ ++ +             KDD  +T   S   NS +      +D  P  
Sbjct: 748  SKGTSSKSGHKFREGSKV-------DGKSKDDFPKTENKSENANSVK-----SKDRTPRS 795

Query: 224  GENLEVKRLLVSGQTTEVQQDGELKLSGKFEEKKLS 117
            G +  V     SG  ++       K+ GKF +   S
Sbjct: 796  GGSKSVGPAPKSGGKSKKNDPNTHKI-GKFMDDNTS 830


>ref|XP_019243876.1| PREDICTED: uncharacterized protein LOC109223867 isoform X3 [Nicotiana
            attenuata]
          Length = 861

 Score =  102 bits (255), Expect = 4e-21
 Identities = 65/213 (30%), Positives = 105/213 (49%)
 Frame = -3

Query: 761  SSGKNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQ 582
            S  ++  K +G+++VG  I+VWWPLDK+FY GAV+ F+P  K+HK+LY D   E L+L++
Sbjct: 440  SKRRHITKNYGEEMVGTRIRVWWPLDKVFYEGAVTEFDPVKKRHKILYYDEQVETLNLTK 499

Query: 581  ERWLMVDDMSLDQDQIVVLPSPVTAPVKHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAYS 402
            ERW MV D   D+     L     + V   K+K KR   S+  +E   +  KR + +A  
Sbjct: 500  ERWEMVGDNPSDKKHETDLQCNAVSSVPSMKKKAKR--TSSTKREPGVSSSKRSKRNAQK 557

Query: 401  LRTKPTREHEIKDANTSTTDILCNSDYHKDDPVQTPVTSRGLNSSEKIIEVERDGEPNPG 222
              T+ + +  + D     TD+ C +   K DPV         N   K  +V+ +      
Sbjct: 558  GETE-SMDIPVPDEMNDATDVGCATSSRKKDPVDKEDNMINENQISKNSKVDLESSLMLE 616

Query: 221  ENLEVKRLLVSGQTTEVQQDGELKLSGKFEEKK 123
            ++   K+     Q+++V      K + +   KK
Sbjct: 617  DHPSDKKHETDLQSSDVSSIPSSKKAKRTSSKK 649


>ref|XP_013682915.1| inner centromere protein A-like isoform X2 [Brassica napus]
          Length = 974

 Score =  102 bits (255), Expect = 4e-21
 Identities = 79/254 (31%), Positives = 128/254 (50%), Gaps = 6/254 (2%)
 Frame = -3

Query: 752  KNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERW 573
            ++N  + G++LVG  + +WWPLDK FY G + S+N  +KKH+VLYSDGD E L+L +ERW
Sbjct: 633  ESNKSELGEELVGKRVNIWWPLDKKFYDGVIESYNSLNKKHQVLYSDGDSEELNLKKERW 692

Query: 572  LMVDDMSLDQDQIVVLPSPVTAPVKHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAYSLRT 393
               D +S ++++I +  S   + +    +  KRK +S   Q  +S+  +   S    L T
Sbjct: 693  ---DIISEEKEEIDLPDSTPLSDIMRRNKAKKRKTESMHVQLKSSS--EVGSSKKKDLVT 747

Query: 392  KPTREHEI-KDA-----NTSTTDILCNSDYHKDDPVQTPVTSRGLNSSEKIIEVERDGEP 231
              TR+ ++ KDA     N        N  + KD   +    ++G +S +  I+ E + EP
Sbjct: 748  SSTRQGKVTKDAVKGGSNEPERREEINIQFPKDCDDKEESETKGEDSLK--IKEESNAEP 805

Query: 230  NPGENLEVKRLLVSGQTTEVQQDGELKLSGKFEEKKLSQTIEMEQNGEPNPSGCLKEKRL 51
               E    ++  +     E + DGE   S + E     Q IE E   EP   G  +E++ 
Sbjct: 806  ---ECKRDQQEPLEDSNAEAKSDGEELKSAETETDGEEQEIEKEATAEPQTDG--EERQS 860

Query: 50   LDLCHTIEMEQDGE 9
            + + +  E + DGE
Sbjct: 861  VKVPN--EAKSDGE 872


>ref|XP_013682914.1| inner centromere protein A-like isoform X1 [Brassica napus]
          Length = 975

 Score =  102 bits (255), Expect = 4e-21
 Identities = 82/256 (32%), Positives = 132/256 (51%), Gaps = 8/256 (3%)
 Frame = -3

Query: 752  KNNPKKFGQDLVGHSIKVWWPLDKMFYTGAVSSFNPTDKKHKVLYSDGDEEMLDLSQERW 573
            ++N  + G++LVG  + +WWPLDK FY G + S+N  +KKH+VLYSDGD E L+L +ERW
Sbjct: 633  ESNKSELGEELVGKRVNIWWPLDKKFYDGVIESYNSLNKKHQVLYSDGDSEELNLKKERW 692

Query: 572  LMVDDMSLDQDQIVVLP--SPVTAPVKHSKQKGKRKLDSAPTQEYNSNPPKRPRSSAYSL 399
               D +S  + + + LP  +P++  ++ +K K KRK +S   Q  +S+  +   S    L
Sbjct: 693  ---DIISEQEKEEIDLPDSTPLSDIMRRNKAK-KRKTESMHVQLKSSS--EVGSSKKKDL 746

Query: 398  RTKPTREHEI-KDA-----NTSTTDILCNSDYHKDDPVQTPVTSRGLNSSEKIIEVERDG 237
             T  TR+ ++ KDA     N        N  + KD   +    ++G +S +  I+ E + 
Sbjct: 747  VTSSTRQGKVTKDAVKGGSNEPERREEINIQFPKDCDDKEESETKGEDSLK--IKEESNA 804

Query: 236  EPNPGENLEVKRLLVSGQTTEVQQDGELKLSGKFEEKKLSQTIEMEQNGEPNPSGCLKEK 57
            EP   E    ++  +     E + DGE   S + E     Q IE E   EP   G  +E+
Sbjct: 805  EP---ECKRDQQEPLEDSNAEAKSDGEELKSAETETDGEEQEIEKEATAEPQTDG--EER 859

Query: 56   RLLDLCHTIEMEQDGE 9
            + + + +  E + DGE
Sbjct: 860  QSVKVPN--EAKSDGE 873

