BLASTX nr result

ID: Atropa21_contig00027716 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00027716
         (781 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006342030.1| PREDICTED: filament-like plant protein 6-lik...   365   e-104
ref|XP_004238341.1| PREDICTED: filament-like plant protein 6-lik...   348   8e-99
emb|CBI19835.3| unnamed protein product [Vitis vinifera]              191   2e-46
ref|XP_002280306.2| PREDICTED: filament-like plant protein 4-lik...   189   9e-46
emb|CAN60525.1| hypothetical protein VITISV_000522 [Vitis vinifera]   189   9e-46
gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis]     176   8e-42
ref|XP_002510512.1| Myosin heavy chain, striated muscle, putativ...   176   1e-41
gb|EOY14987.1| Uncharacterized protein isoform 8, partial [Theob...   169   8e-40
gb|EOY14986.1| Uncharacterized protein isoform 7 [Theobroma cacao]    169   8e-40
gb|EOY14985.1| Uncharacterized protein isoform 6 [Theobroma cacao]    169   8e-40
gb|EOY14984.1| Uncharacterized protein isoform 5 [Theobroma cacao]    169   8e-40
gb|EOY14983.1| Uncharacterized protein isoform 4 [Theobroma cacao]    169   8e-40
gb|EOY14982.1| Uncharacterized protein isoform 3 [Theobroma cacao]    169   8e-40
gb|EOY14981.1| Uncharacterized protein isoform 2, partial [Theob...   169   8e-40
gb|EOY14980.1| Uncharacterized protein isoform 1 [Theobroma cacao]    169   8e-40
ref|XP_002306918.2| hypothetical protein POPTR_0005s25830g [Popu...   164   2e-38
ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citr...   163   7e-38
ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-lik...   162   2e-37
ref|XP_004168855.1| PREDICTED: LOW QUALITY PROTEIN: filament-lik...   161   3e-37
ref|XP_004136392.1| PREDICTED: filament-like plant protein 4-lik...   161   3e-37

>ref|XP_006342030.1| PREDICTED: filament-like plant protein 6-like [Solanum tuberosum]
          Length = 1093

 Score =  365 bits (936), Expect(2) = e-104
 Identities = 198/260 (76%), Positives = 209/260 (80%), Gaps = 23/260 (8%)
 Frame = +2

Query: 71   TNPDSQLKEHNETSISGNXXXXXXXXXXXXXXPLSHTSLFMNIQSRISTVLESLSKEADI 250
            T+PDSQLKEHNETS+SG+              PLS TS+ M +QSRISTVLESLSK+ADI
Sbjct: 541  TSPDSQLKEHNETSVSGDQASRNEEVSSQSHQPLSDTSISMKLQSRISTVLESLSKDADI 600

Query: 251  QSIQEDLREIVQEMRNA---------------SEIATESQPSLDDGEANLEKEITVSQDS 385
            Q IQEDLREIVQEMRNA               S  ATESQPSLDDGEANLEKEI VS+DS
Sbjct: 601  QRIQEDLREIVQEMRNALIPQSTKSIVEITLSSNTATESQPSLDDGEANLEKEIPVSEDS 660

Query: 386  --------GISKELADAMSQIHDFVLFLGKEAKAIQGTAPDGRGINEKLGVFSATYVEVI 541
                    GISKELADAMSQIHDFVLFLGKEAKAIQGTAPDG GINEKL  FSATYVEVI
Sbjct: 661  KSCNESIHGISKELADAMSQIHDFVLFLGKEAKAIQGTAPDGSGINEKLDDFSATYVEVI 720

Query: 542  SSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSDCIDKVALPENKGLQHSDEV 721
            S++L+MVNFVLDLS VLSNAS+LH NILGYKNSETEISTSDCIDKVALPENK LQHS EV
Sbjct: 721  SNKLSMVNFVLDLSHVLSNASQLHFNILGYKNSETEISTSDCIDKVALPENKDLQHSGEV 780

Query: 722  YANGCAHFSDSTSDPDIPHE 781
            YANGCAHFSDSTSDPDIPHE
Sbjct: 781  YANGCAHFSDSTSDPDIPHE 800



 Score = 40.8 bits (94), Expect(2) = e-104
 Identities = 21/26 (80%), Positives = 22/26 (84%), Gaps = 3/26 (11%)
 Frame = +3

Query: 3   SSDTNGAVSNP---NNASPETTKVDT 71
           SSDTNGAVS+P   NNA PETTKVDT
Sbjct: 510 SSDTNGAVSSPDIPNNARPETTKVDT 535


>ref|XP_004238341.1| PREDICTED: filament-like plant protein 6-like [Solanum lycopersicum]
          Length = 1091

 Score =  348 bits (894), Expect(2) = 8e-99
 Identities = 192/260 (73%), Positives = 203/260 (78%), Gaps = 23/260 (8%)
 Frame = +2

Query: 71   TNPDSQLKEHNETSISGNXXXXXXXXXXXXXXPLSHTSLFMNIQSRISTVLESLSKEADI 250
            T+PD+QLKE NET +S +              PL   S+ M +QSRISTVLESLSKEADI
Sbjct: 538  TSPDTQLKERNETIVSEDQASQQEEVSSQSHQPLLDASISMKLQSRISTVLESLSKEADI 597

Query: 251  QSIQEDLREIVQEMRNA---------------SEIATESQPSLDDGEANLEKEITVSQDS 385
            Q IQEDLREIVQEMRNA                + ATESQ SLDDGEANLEKEI VS+DS
Sbjct: 598  QRIQEDLREIVQEMRNAVVPQSTKSIVEITLSPKTATESQASLDDGEANLEKEIPVSEDS 657

Query: 386  --------GISKELADAMSQIHDFVLFLGKEAKAIQGTAPDGRGINEKLGVFSATYVEVI 541
                    GISKELADAMSQIHDFVLFLGKEAKAIQGTAPDG GINEKL  FSATYVEVI
Sbjct: 658  KSCNESIHGISKELADAMSQIHDFVLFLGKEAKAIQGTAPDGSGINEKLDDFSATYVEVI 717

Query: 542  SSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSDCIDKVALPENKGLQHSDEV 721
            S+RL+MVNFVLDLS VLSNAS+LH NILGYKNSETEISTSDCIDKVALPENK LQHS EV
Sbjct: 718  SNRLSMVNFVLDLSHVLSNASQLHFNILGYKNSETEISTSDCIDKVALPENKDLQHSGEV 777

Query: 722  YANGCAHFSDSTSDPDIPHE 781
            YANGCAHFSDSTSDPDIPHE
Sbjct: 778  YANGCAHFSDSTSDPDIPHE 797



 Score = 39.3 bits (90), Expect(2) = 8e-99
 Identities = 20/30 (66%), Positives = 23/30 (76%), Gaps = 3/30 (10%)
 Frame = +3

Query: 3   SSDTNGAVSNPN---NASPETTKVDTLTQI 83
           SSDTNGAVS+P+   NA PETTKVDT   +
Sbjct: 507 SSDTNGAVSSPDIPRNARPETTKVDTSVHV 536


>emb|CBI19835.3| unnamed protein product [Vitis vinifera]
          Length = 993

 Score =  191 bits (485), Expect = 2e-46
 Identities = 103/210 (49%), Positives = 145/210 (69%), Gaps = 10/210 (4%)
 Frame = +2

Query: 182  SLFMNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNASEIATESQPSLDDGEANLEK 361
            SL   ++SRIS V ES+S+++D   I E+++ ++Q+  +      +     +D     E+
Sbjct: 507  SLANQLRSRISMVFESVSEDSDTGKILEEIKRVLQDTHDTLH---QHSACPEDAGVTAER 563

Query: 362  EITVSQDSG--------ISKELADAMSQIHDFVLFLGKEAKAIQGTAPDGRGINEKLGVF 517
            EI++SQD          IS+ELA A+SQIH+FVLFLGKEA AIQG +PDG G + K+  F
Sbjct: 564  EISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGASPDGNGWSRKIEDF 623

Query: 518  SATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSDCIDKVALPENK 697
            SAT  +V+  ++++++F+ DLS VL+ ASEL+ NILGYK +  EI++SDCIDKVALPENK
Sbjct: 624  SATVNKVLCRKMSVIDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDCIDKVALPENK 683

Query: 698  GLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
             +Q   S E Y NGCAH SDSTSDP++PH+
Sbjct: 684  VVQKDTSGERYPNGCAHISDSTSDPEVPHD 713


>ref|XP_002280306.2| PREDICTED: filament-like plant protein 4-like [Vitis vinifera]
          Length = 1040

 Score =  189 bits (480), Expect = 9e-46
 Identities = 103/220 (46%), Positives = 146/220 (66%), Gaps = 25/220 (11%)
 Frame = +2

Query: 197  IQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------------SEIATESQPS 331
            ++SRIS V ES+S+++D   I E+++ ++Q+  +                S+   + Q  
Sbjct: 517  LRSRISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQHSVSCVVEEIHCSDATCDRQAC 576

Query: 332  LDDGEANLEKEITVSQDSG--------ISKELADAMSQIHDFVLFLGKEAKAIQGTAPDG 487
             +D     E+EI++SQD          IS+ELA A+SQIH+FVLFLGKEA AIQG +PDG
Sbjct: 577  PEDAGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGASPDG 636

Query: 488  RGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSDC 667
             G + K+  FSAT  +V+  ++++++F+ DLS VL+ ASEL+ NILGYK +  EI++SDC
Sbjct: 637  NGWSRKIEDFSATVNKVLCRKMSVIDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDC 696

Query: 668  IDKVALPENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            IDKVALPENK +Q   S E Y NGCAH SDSTSDP++PH+
Sbjct: 697  IDKVALPENKVVQKDTSGERYPNGCAHISDSTSDPEVPHD 736


>emb|CAN60525.1| hypothetical protein VITISV_000522 [Vitis vinifera]
          Length = 1085

 Score =  189 bits (480), Expect = 9e-46
 Identities = 103/220 (46%), Positives = 146/220 (66%), Gaps = 25/220 (11%)
 Frame = +2

Query: 197  IQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------------SEIATESQPS 331
            ++SRIS V ES+S+++D   I E+++ ++Q+  +                S+   + Q  
Sbjct: 562  LRSRISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQHSVSCVVEEIHCSDATCDRQAC 621

Query: 332  LDDGEANLEKEITVSQDSG--------ISKELADAMSQIHDFVLFLGKEAKAIQGTAPDG 487
             +D     E+EI++SQD          IS+ELA A+SQIH+FVLFLGKEA AIQG +PDG
Sbjct: 622  PEDAGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGASPDG 681

Query: 488  RGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSDC 667
             G + K+  FSAT  +V+  ++++++F+ DLS VL+ ASEL+ NILGYK +  EI++SDC
Sbjct: 682  NGWSRKIEDFSATVNKVLCXKMSVIDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDC 741

Query: 668  IDKVALPENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            IDKVALPENK +Q   S E Y NGCAH SDSTSDP++PH+
Sbjct: 742  IDKVALPENKVVQKDTSGERYPNGCAHISDSTSDPEVPHD 781


>gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis]
          Length = 1087

 Score =  176 bits (446), Expect = 8e-42
 Identities = 99/219 (45%), Positives = 141/219 (64%), Gaps = 22/219 (10%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------SEIATESQPSLDDG 343
            M +QSRIS +LES+SK++D+ +I ED++  +QE  +          SE    S    DD 
Sbjct: 572  MKLQSRISVLLESVSKDSDVGTILEDIKHAIQETHDTLHQHTVSCISEDVHCSDAGCDDR 631

Query: 344  EAN-------LEKEITVSQDSG-----ISKELADAMSQIHDFVLFLGKEAKAIQGTAPDG 487
            +AN        EKEI +SQ +      I  +LA A+SQIHDFVLFLGKEA  +  T+ +G
Sbjct: 632  QANPEDAGLTSEKEIALSQPAREARQIIRDDLAAAISQIHDFVLFLGKEAMGVHDTSTEG 691

Query: 488  RGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSDC 667
               ++++  FS T  +VI S L++++FVLDLS VL+ ASEL  ++LG+K +E E ++ DC
Sbjct: 692  SEFSQRIEEFSVTLNKVIHSDLSLIDFVLDLSSVLAKASELRFSVLGFKGNEAETNSPDC 751

Query: 668  IDKVALPENKGLQ-HSDEVYANGCAHFSDSTSDPDIPHE 781
            IDKV LPENK +Q  S E+Y NGCAH  +STS+P++P +
Sbjct: 752  IDKVVLPENKAIQKDSSEIYQNGCAHMPNSTSNPEVPDD 790


>ref|XP_002510512.1| Myosin heavy chain, striated muscle, putative [Ricinus communis]
            gi|223551213|gb|EEF52699.1| Myosin heavy chain, striated
            muscle, putative [Ricinus communis]
          Length = 1041

 Score =  176 bits (445), Expect = 1e-41
 Identities = 96/212 (45%), Positives = 139/212 (65%), Gaps = 15/212 (7%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNASEIATESQPSLDD-----GEANL 355
            + ++SRIS +LES+S++AD+  I ED++ IVQ+   A    +E   + D           
Sbjct: 527  VKLRSRISMLLESISQDADMGKILEDVQRIVQDTHGAVSSVSEDVRATDATCPEYASITG 586

Query: 356  EKEITVSQDSG--------ISKELADAMSQIHDFVLFLGKEAKAIQGTAPDGRGINEKLG 511
            +KEIT+ QD+         +++ELA A+S IHDFVLFLGKEA A+  T+ DG  +++K+ 
Sbjct: 587  DKEITLFQDTNAATDTVRSVNQELATAVSSIHDFVLFLGKEAMAVHDTSSDGSDLSQKIE 646

Query: 512  VFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSDCIDKVALPE 691
             FS T+ +V++   ++++F+  LS VL+ ASEL  N+LGYK SE EI++SDCIDKVALPE
Sbjct: 647  HFSVTFNKVLNGNTSLIDFIFYLSCVLAKASELRFNVLGYKGSEAEINSSDCIDKVALPE 706

Query: 692  NKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            NK LQ   S E Y N CAH S  TS+P++P +
Sbjct: 707  NKVLQRDSSGESYQNSCAHISSPTSNPEVPDD 738


>gb|EOY14987.1| Uncharacterized protein isoform 8, partial [Theobroma cacao]
          Length = 951

 Score =  169 bits (429), Expect = 8e-40
 Identities = 93/222 (41%), Positives = 137/222 (61%), Gaps = 25/222 (11%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------------SEIATESQ 325
            M +++R+S VL+S+SK+AD+Q I ED++  VQ+ R+                S+     Q
Sbjct: 579  MKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQ 638

Query: 326  PSLDDGEANLEKEITVSQDSGI--------SKELADAMSQIHDFVLFLGKEAKAIQGTAP 481
                 G    EKEI +S    +        S+ELA A+SQIHDFVL LGKEA+A+     
Sbjct: 639  AHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICS 698

Query: 482  DGRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTS 661
            DG  ++ K+  FS TY +V+ S +++ +F+ DLS +L+ AS+L +N+LGYK++E EI++ 
Sbjct: 699  DGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSP 758

Query: 662  DCIDKVALPENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            DCIDKV LPENK +Q   S   Y NGCAH S+ TS+P++P +
Sbjct: 759  DCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDD 800


>gb|EOY14986.1| Uncharacterized protein isoform 7 [Theobroma cacao]
          Length = 1107

 Score =  169 bits (429), Expect = 8e-40
 Identities = 93/222 (41%), Positives = 137/222 (61%), Gaps = 25/222 (11%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------------SEIATESQ 325
            M +++R+S VL+S+SK+AD+Q I ED++  VQ+ R+                S+     Q
Sbjct: 583  MKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQ 642

Query: 326  PSLDDGEANLEKEITVSQDSGI--------SKELADAMSQIHDFVLFLGKEAKAIQGTAP 481
                 G    EKEI +S    +        S+ELA A+SQIHDFVL LGKEA+A+     
Sbjct: 643  AHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICS 702

Query: 482  DGRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTS 661
            DG  ++ K+  FS TY +V+ S +++ +F+ DLS +L+ AS+L +N+LGYK++E EI++ 
Sbjct: 703  DGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSP 762

Query: 662  DCIDKVALPENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            DCIDKV LPENK +Q   S   Y NGCAH S+ TS+P++P +
Sbjct: 763  DCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDD 804


>gb|EOY14985.1| Uncharacterized protein isoform 6 [Theobroma cacao]
          Length = 837

 Score =  169 bits (429), Expect = 8e-40
 Identities = 93/222 (41%), Positives = 137/222 (61%), Gaps = 25/222 (11%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------------SEIATESQ 325
            M +++R+S VL+S+SK+AD+Q I ED++  VQ+ R+                S+     Q
Sbjct: 424  MKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQ 483

Query: 326  PSLDDGEANLEKEITVSQDSGI--------SKELADAMSQIHDFVLFLGKEAKAIQGTAP 481
                 G    EKEI +S    +        S+ELA A+SQIHDFVL LGKEA+A+     
Sbjct: 484  AHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICS 543

Query: 482  DGRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTS 661
            DG  ++ K+  FS TY +V+ S +++ +F+ DLS +L+ AS+L +N+LGYK++E EI++ 
Sbjct: 544  DGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSP 603

Query: 662  DCIDKVALPENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            DCIDKV LPENK +Q   S   Y NGCAH S+ TS+P++P +
Sbjct: 604  DCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDD 645


>gb|EOY14984.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 992

 Score =  169 bits (429), Expect = 8e-40
 Identities = 93/222 (41%), Positives = 137/222 (61%), Gaps = 25/222 (11%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------------SEIATESQ 325
            M +++R+S VL+S+SK+AD+Q I ED++  VQ+ R+                S+     Q
Sbjct: 579  MKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQ 638

Query: 326  PSLDDGEANLEKEITVSQDSGI--------SKELADAMSQIHDFVLFLGKEAKAIQGTAP 481
                 G    EKEI +S    +        S+ELA A+SQIHDFVL LGKEA+A+     
Sbjct: 639  AHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICS 698

Query: 482  DGRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTS 661
            DG  ++ K+  FS TY +V+ S +++ +F+ DLS +L+ AS+L +N+LGYK++E EI++ 
Sbjct: 699  DGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSP 758

Query: 662  DCIDKVALPENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            DCIDKV LPENK +Q   S   Y NGCAH S+ TS+P++P +
Sbjct: 759  DCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDD 800


>gb|EOY14983.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 947

 Score =  169 bits (429), Expect = 8e-40
 Identities = 93/222 (41%), Positives = 137/222 (61%), Gaps = 25/222 (11%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------------SEIATESQ 325
            M +++R+S VL+S+SK+AD+Q I ED++  VQ+ R+                S+     Q
Sbjct: 424  MKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQ 483

Query: 326  PSLDDGEANLEKEITVSQDSGI--------SKELADAMSQIHDFVLFLGKEAKAIQGTAP 481
                 G    EKEI +S    +        S+ELA A+SQIHDFVL LGKEA+A+     
Sbjct: 484  AHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICS 543

Query: 482  DGRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTS 661
            DG  ++ K+  FS TY +V+ S +++ +F+ DLS +L+ AS+L +N+LGYK++E EI++ 
Sbjct: 544  DGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSP 603

Query: 662  DCIDKVALPENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            DCIDKV LPENK +Q   S   Y NGCAH S+ TS+P++P +
Sbjct: 604  DCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDD 645


>gb|EOY14982.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 1106

 Score =  169 bits (429), Expect = 8e-40
 Identities = 93/222 (41%), Positives = 137/222 (61%), Gaps = 25/222 (11%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------------SEIATESQ 325
            M +++R+S VL+S+SK+AD+Q I ED++  VQ+ R+                S+     Q
Sbjct: 583  MKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQ 642

Query: 326  PSLDDGEANLEKEITVSQDSGI--------SKELADAMSQIHDFVLFLGKEAKAIQGTAP 481
                 G    EKEI +S    +        S+ELA A+SQIHDFVL LGKEA+A+     
Sbjct: 643  AHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICS 702

Query: 482  DGRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTS 661
            DG  ++ K+  FS TY +V+ S +++ +F+ DLS +L+ AS+L +N+LGYK++E EI++ 
Sbjct: 703  DGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSP 762

Query: 662  DCIDKVALPENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            DCIDKV LPENK +Q   S   Y NGCAH S+ TS+P++P +
Sbjct: 763  DCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDD 804


>gb|EOY14981.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
          Length = 992

 Score =  169 bits (429), Expect = 8e-40
 Identities = 93/222 (41%), Positives = 137/222 (61%), Gaps = 25/222 (11%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------------SEIATESQ 325
            M +++R+S VL+S+SK+AD+Q I ED++  VQ+ R+                S+     Q
Sbjct: 579  MKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQ 638

Query: 326  PSLDDGEANLEKEITVSQDSGI--------SKELADAMSQIHDFVLFLGKEAKAIQGTAP 481
                 G    EKEI +S    +        S+ELA A+SQIHDFVL LGKEA+A+     
Sbjct: 639  AHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICS 698

Query: 482  DGRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTS 661
            DG  ++ K+  FS TY +V+ S +++ +F+ DLS +L+ AS+L +N+LGYK++E EI++ 
Sbjct: 699  DGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSP 758

Query: 662  DCIDKVALPENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            DCIDKV LPENK +Q   S   Y NGCAH S+ TS+P++P +
Sbjct: 759  DCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDD 800


>gb|EOY14980.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 1102

 Score =  169 bits (429), Expect = 8e-40
 Identities = 93/222 (41%), Positives = 137/222 (61%), Gaps = 25/222 (11%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNA---------------SEIATESQ 325
            M +++R+S VL+S+SK+AD+Q I ED++  VQ+ R+                S+     Q
Sbjct: 579  MKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQ 638

Query: 326  PSLDDGEANLEKEITVSQDSGI--------SKELADAMSQIHDFVLFLGKEAKAIQGTAP 481
                 G    EKEI +S    +        S+ELA A+SQIHDFVL LGKEA+A+     
Sbjct: 639  AHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICS 698

Query: 482  DGRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTS 661
            DG  ++ K+  FS TY +V+ S +++ +F+ DLS +L+ AS+L +N+LGYK++E EI++ 
Sbjct: 699  DGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSP 758

Query: 662  DCIDKVALPENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            DCIDKV LPENK +Q   S   Y NGCAH S+ TS+P++P +
Sbjct: 759  DCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDD 800


>ref|XP_002306918.2| hypothetical protein POPTR_0005s25830g [Populus trichocarpa]
            gi|550339754|gb|EEE93914.2| hypothetical protein
            POPTR_0005s25830g [Populus trichocarpa]
          Length = 1077

 Score =  164 bits (416), Expect = 2e-38
 Identities = 94/221 (42%), Positives = 138/221 (62%), Gaps = 21/221 (9%)
 Frame = +2

Query: 182  SLFMNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNASEIATES-----------QP 328
            S FM +Q RIS +L+S SK+AD+  I ED++++VQ+    +   ++            Q 
Sbjct: 564  SSFMKLQLRISMLLDSGSKKADLGKILEDIKQVVQDAETGASCVSKEAHCSDATTHDRQT 623

Query: 329  SLDDGEANLEKEITVSQDSG--------ISKELADAMSQIHDFVLFLGKEAKAIQGTAPD 484
              +D     EKEI + Q+S         +S+EL  A+SQIHDFVL LGKEA  +  T+ D
Sbjct: 624  CPEDAGIMGEKEIELFQESKTAAQIMHTVSQELLPAISQIHDFVLLLGKEAMTVHDTSCD 683

Query: 485  GRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSD 664
              G+++K+  FS T+ +V+ S  ++V+FV DL+ +L+ AS L  N+LGYK +E EIS+ D
Sbjct: 684  SIGLSQKIKEFSITFNKVLYSDRSLVDFVSDLAHILALASGLRFNVLGYKGNEAEISSPD 743

Query: 665  CIDKVALPENKGLQ--HSDEVYANGCAHFSDSTSDPDIPHE 781
            CIDK+ALPENK +Q   S E Y NGCA+ S  TS+P++P +
Sbjct: 744  CIDKIALPENKVVQKNSSVETYQNGCANISSPTSNPEVPDD 784


>ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citrus clementina]
            gi|567885183|ref|XP_006435150.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
            gi|557537271|gb|ESR48389.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
            gi|557537272|gb|ESR48390.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
          Length = 1091

 Score =  163 bits (412), Expect = 7e-38
 Identities = 90/214 (42%), Positives = 138/214 (64%), Gaps = 17/214 (7%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMR---------------NASEIATESQ 325
            M ++SRIS +LE++SK+AD+  I ED++ +V++                   S+++  ++
Sbjct: 579  MKLRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLHQHSANCISEEVKCSDVSCSAE 638

Query: 326  PSLDDGEANLEKEITVSQDSGISKELADAMSQIHDFVLFLGKEAKAIQGTAPDGRGINEK 505
                D   N E++I ++    IS+EL  A+SQIHDFVLFLGKEA+A+  T  +  G ++K
Sbjct: 639  AYPGDASLNTERKIDLTVQV-ISQELVAAISQIHDFVLFLGKEARAVHDTTNEN-GFSQK 696

Query: 506  LGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSDCIDKVAL 685
            +  F  ++ +VI S   +V+FV  LS VL+ ASEL +N++GYK++E E ++ DCIDKVAL
Sbjct: 697  IEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRINVMGYKDTEIEPNSPDCIDKVAL 756

Query: 686  PENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            PENK ++   S E Y NGCAH S+ TSDP++P +
Sbjct: 757  PENKVIKKDTSGERYPNGCAHISNPTSDPEVPDD 790


>ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-like isoform X1 [Citrus
            sinensis] gi|568839322|ref|XP_006473633.1| PREDICTED:
            filament-like plant protein 4-like isoform X2 [Citrus
            sinensis]
          Length = 1091

 Score =  162 bits (409), Expect = 2e-37
 Identities = 89/214 (41%), Positives = 138/214 (64%), Gaps = 17/214 (7%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMR---------------NASEIATESQ 325
            M ++SRIS +LE++SK+AD+  I ED++ +V++                   S+++  ++
Sbjct: 579  MKLRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLHQHSANCISEEVKCSDVSCSAE 638

Query: 326  PSLDDGEANLEKEITVSQDSGISKELADAMSQIHDFVLFLGKEAKAIQGTAPDGRGINEK 505
                D   N E++I ++    IS+EL  A++QIHDFVLFLGKEA+A+  T  +  G ++K
Sbjct: 639  AYPGDARLNTERKIDLTVQV-ISQELVAAITQIHDFVLFLGKEARAVHDTTNEN-GFSQK 696

Query: 506  LGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSDCIDKVAL 685
            +  F  ++ +VI S   +V+FV  LS VL+ ASEL +N++GYK++E E ++ DCIDKVAL
Sbjct: 697  IEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRINVMGYKDTEIEPNSPDCIDKVAL 756

Query: 686  PENKGLQH--SDEVYANGCAHFSDSTSDPDIPHE 781
            PENK ++   S E Y NGCAH S+ TSDP++P +
Sbjct: 757  PENKVIKKDTSGERYPNGCAHISNPTSDPEVPDD 790


>ref|XP_004168855.1| PREDICTED: LOW QUALITY PROTEIN: filament-like plant protein 4-like
            [Cucumis sativus]
          Length = 1084

 Score =  161 bits (407), Expect = 3e-37
 Identities = 90/221 (40%), Positives = 141/221 (63%), Gaps = 24/221 (10%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNASEIAT-----------------E 319
            + ++SRIS + ES+SK+AD   I ED++ IVQ+  +A +  T                 +
Sbjct: 565  LKLRSRISMIFESISKDADTGKILEDIKCIVQDAHDALQQPTINCVSCVSEVQSPDTTCD 624

Query: 320  SQPSLDDGEANLEKEITVSQ----DSGISKELADAMSQIHDFVLFLGKEAKAIQGT-APD 484
             Q + DD    +E+EI  SQ    +  +S+EL  A+SQIH+FVLFLGKEA  +  T +PD
Sbjct: 625  RQANPDDAGLGVEREIAFSQPVAHNQPMSQELEAAISQIHEFVLFLGKEASRVHDTISPD 684

Query: 485  GRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSD 664
            G G+ +K+  FS+T+ +++ +  ++V+FV+ LS VLS ASEL  + +G K+++ + ++ D
Sbjct: 685  GHGLGQKVEEFSSTFNKIVHANTSLVDFVVILSHVLSEASELRFSFIGCKDTDGDTNSPD 744

Query: 665  CIDKVALPENKGLQHS--DEVYANGCAHFSDSTSDPDIPHE 781
            CIDKVALPE+K +Q+   DE Y NGC+H S  TSD ++P++
Sbjct: 745  CIDKVALPEHKVVQNDSIDERYTNGCSHISSPTSDLEVPYD 785


>ref|XP_004136392.1| PREDICTED: filament-like plant protein 4-like [Cucumis sativus]
          Length = 1078

 Score =  161 bits (407), Expect = 3e-37
 Identities = 90/221 (40%), Positives = 141/221 (63%), Gaps = 24/221 (10%)
 Frame = +2

Query: 191  MNIQSRISTVLESLSKEADIQSIQEDLREIVQEMRNASEIAT-----------------E 319
            + ++SRIS + ES+SK+AD   I ED++ IVQ+  +A +  T                 +
Sbjct: 559  LKLRSRISMIFESISKDADTGKILEDIKCIVQDAHDALQQPTINCVSCVSEVQSPDTTCD 618

Query: 320  SQPSLDDGEANLEKEITVSQ----DSGISKELADAMSQIHDFVLFLGKEAKAIQGT-APD 484
             Q + DD    +E+EI  SQ    +  +S+EL  A+SQIH+FVLFLGKEA  +  T +PD
Sbjct: 619  RQANPDDAGLGVEREIAFSQPVAHNQPMSQELEAAISQIHEFVLFLGKEASRVHDTISPD 678

Query: 485  GRGINEKLGVFSATYVEVISSRLNMVNFVLDLSRVLSNASELHLNILGYKNSETEISTSD 664
            G G+ +K+  FS+T+ +++ +  ++V+FV+ LS VLS ASEL  + +G K+++ + ++ D
Sbjct: 679  GHGLGQKVEEFSSTFNKIVHANTSLVDFVVILSHVLSEASELRFSFIGCKDTDGDTNSPD 738

Query: 665  CIDKVALPENKGLQHS--DEVYANGCAHFSDSTSDPDIPHE 781
            CIDKVALPE+K +Q+   DE Y NGC+H S  TSD ++P++
Sbjct: 739  CIDKVALPEHKVVQNDSIDERYTNGCSHISSPTSDLEVPYD 779

