BLASTX nr result

ID: Rehmannia22_contig00017349 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00017349
         (686 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI19835.3| unnamed protein product [Vitis vinifera]              163   5e-38
gb|EMJ26698.1| hypothetical protein PRUPE_ppa000819mg [Prunus pe...   147   4e-33
ref|XP_004238341.1| PREDICTED: filament-like plant protein 6-lik...   145   1e-32
ref|XP_006342030.1| PREDICTED: filament-like plant protein 6-lik...   144   3e-32
gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis]     142   7e-32
ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-lik...   134   3e-29
ref|XP_004291383.1| PREDICTED: filament-like plant protein 6-lik...   133   5e-29
ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citr...   133   6e-29
ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Popu...   132   8e-29
ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Popu...   132   8e-29
gb|EOY14987.1| Uncharacterized protein isoform 8, partial [Theob...   131   2e-28
gb|EOY14986.1| Uncharacterized protein isoform 7 [Theobroma cacao]    131   2e-28
gb|EOY14985.1| Uncharacterized protein isoform 6 [Theobroma cacao]    131   2e-28
gb|EOY14984.1| Uncharacterized protein isoform 5 [Theobroma cacao]    131   2e-28
gb|EOY14983.1| Uncharacterized protein isoform 4 [Theobroma cacao]    131   2e-28
gb|EOY14982.1| Uncharacterized protein isoform 3 [Theobroma cacao]    131   2e-28
gb|EOY14981.1| Uncharacterized protein isoform 2, partial [Theob...   131   2e-28
gb|EOY14980.1| Uncharacterized protein isoform 1 [Theobroma cacao]    131   2e-28
ref|XP_002893071.1| hypothetical protein ARALYDRAFT_335233 [Arab...   125   9e-27
ref|XP_002510512.1| Myosin heavy chain, striated muscle, putativ...   125   9e-27
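
Note: the summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST output; it assumes each row is laid out as identifier, description, bit score, E-value separated by whitespace, as in the rows shown. The names HIT_ROW and parse_hit_row are hypothetical helpers introduced only for illustration.

```python
import re

# One row of the "Sequences producing significant alignments" table:
# identifier, free-text description, bit score, E-value.
HIT_ROW = re.compile(
    r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\d*\.?\d*e[-+]?\d+|\d+\.?\d*)\s*$"
)

def parse_hit_row(line):
    """Return a dict for one summary row, or None if the line is not a row."""
    m = HIT_ROW.match(line)
    if m is None:
        return None
    ident, desc, bits, evalue = m.groups()
    # Assumption: an E-value written without a leading digit (e.g. "e-105")
    # is treated as 1e-105 so that float() accepts it.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return {
        "id": ident,
        "description": desc,
        "bits": float(bits),
        "evalue": float(evalue),
    }

row = parse_hit_row(
    "emb|CBI19835.3| unnamed protein product [Vitis vinifera]"
    "              163   5e-38"
)
print(row)
```

Truncated descriptions ending in "..." are kept as they appear; the sketch does not try to recover the full title from the alignment section.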

>emb|CBI19835.3| unnamed protein product [Vitis vinifera]
          Length = 993

 Score =  163 bits (412), Expect = 5e-38
 Identities = 117/287 (40%), Positives = 156/287 (54%), Gaps = 59/287 (20%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQL----------RANAQVSTGGFSSQKVSDQ--- 141
            A RNSELQASR+ICA+TASKLQNLEAQL          ++N Q+   G  SQ  S+    
Sbjct: 381  AKRNSELQASRNICAKTASKLQNLEAQLQMNNQQKSPPKSNLQIPNDGSLSQNASNPPSM 440

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              MSEDGNDD VSCA S AT  +S LS  +KE          N NHL+LMDDFLEMEK+A
Sbjct: 441  TSMSEDGNDDAVSCAESWATGLVSGLSQFKKE----------NANHLELMDDFLEMEKLA 490

Query: 319  CLPHGSNGAVSSSDV--------------------SVNTG-----------NTGSELVKH 405
            CL + SNGA S  D+                      +TG           +T   L +H
Sbjct: 491  CLSNNSNGAFSKHDLDSLANQLRSRISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQH 550

Query: 406  E---DDAKINVNSCI----------DTVQTNDQALEMAISGIYDFVMILGKEAKALPGSS 546
                +DA +     I          DT+    Q L  AIS I++FV+ LGKEA A+ G+S
Sbjct: 551  SACPEDAGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGAS 610

Query: 547  -DEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684
             D +G  +K++ FSA   + +  ++++ DF+ D+S+VL+KA+ L+FN
Sbjct: 611  PDGNGWSRKIEDFSATVNKVLCRKMSVIDFIFDLSNVLAKASELNFN 657
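
Note: the statistics and coordinates of the alignment above can be cross-checked by hand. The sketch below is illustrative and not part of the BLAST output. It assumes that the percentages are truncated rather than rounded (which is consistent with the values printed in this report) and that query coordinates count nucleotides, three per translated residue. The function names are hypothetical.

```python
def truncated_percent(count, total):
    # Integer division reproduces the printed percentages,
    # e.g. 117/287 is about 40.77% but is reported as 40%.
    return (100 * count) // total

def query_end(start, aligned_query_row, bases_per_residue=3):
    # Gap characters ("-") consume no query bases; every other
    # character in the translated query row covers three nucleotides.
    residues = sum(1 for ch in aligned_query_row if ch != "-")
    return start + bases_per_residue * residues - 1

# Identities, Positives and Gaps for the Vitis vinifera hit.
print(truncated_percent(117, 287),
      truncated_percent(156, 287),
      truncated_percent(59, 287))

# First query row of the same alignment, which starts at nucleotide 1.
first_row = "ANRNSELQASRSICAQTASKLQNLEAQL----------RANAQVSTGGFSSQKVSDQ---"
print(query_end(1, first_row))
```

The same arithmetic applies to every "Query:" row in frame +1; subject coordinates count amino acids, so the factor there is one per residue.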


>gb|EMJ26698.1| hypothetical protein PRUPE_ppa000819mg [Prunus persica]
          Length = 993

 Score =  147 bits (370), Expect = 4e-33
 Identities = 97/235 (41%), Positives = 131/235 (55%), Gaps = 15/235 (6%)
 Frame = +1

Query: 7   RNSELQASRSICAQTASKLQNLEAQLRAN----------AQVSTGGFSSQKVSDQ----L 144
           RNSELQ SR +CAQT SKLQ LEAQL+ N           Q++T G SSQ  S+      
Sbjct: 305 RNSELQTSRGMCAQTVSKLQTLEAQLQINNQQKGSPKSVVQITTEGSSSQNASNPPSLTS 364

Query: 145 MSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIACL 324
           +SEDGNDD+ SCA S AT   S+LS+I+KEK+    +K+EN NHL+LMDDFLEMEK+ACL
Sbjct: 365 LSEDGNDDDRSCAESWATTLGSDLSHIRKEKSNQKSNKAENQNHLNLMDDFLEMEKLACL 424

Query: 325 PHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGIYDFV 504
           P+ SNGAVS   +S    N  SE   H+    +     I + Q  D      +S +    
Sbjct: 425 PNDSNGAVS---ISSGPNNKTSERENHDASGDVTAEKDIQSEQQQD------LSPLEGDQ 475

Query: 505 MILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                +   L   SDE+ L + KL +  +   E ++ + +    + DI HV+ +A
Sbjct: 476 ASSNVKLSGLSPESDENQLPLVKLRSKISMLLELLSKDTDFGKVIEDIKHVVQEA 530


>ref|XP_004238341.1| PREDICTED: filament-like plant protein 6-like [Solanum
           lycopersicum]
          Length = 1091

 Score =  145 bits (366), Expect = 1e-32
 Identities = 88/166 (53%), Positives = 110/166 (66%), Gaps = 17/166 (10%)
 Frame = +1

Query: 1   ANRNSELQASRSICAQTASKLQNLEAQLRANA------------QVSTGGFSSQKVSDQL 144
           A+RNSELQASRSICA+T+SKLQ+LEAQL+AN             Q S G FS +  ++ L
Sbjct: 384 AHRNSELQASRSICAKTSSKLQSLEAQLQANLEQKSPQKSTIRRQPSEGSFSHE--ANHL 441

Query: 145 -----MSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEME 309
                MSEDGNDDNVSCA S  T  MS+LS ++KEKN DSPHKSE  +HLDLMDDFLEME
Sbjct: 442 PRLASMSEDGNDDNVSCASSWTTALMSDLSNVKKEKNFDSPHKSECASHLDLMDDFLEME 501

Query: 310 KIACLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDT 447
           K+A     +NGAVSS D+  N     +++     D  ++V++  DT
Sbjct: 502 KLAYQSSDTNGAVSSPDIPRNARPETTKV-----DTSVHVSTSPDT 542



 Score = 81.6 bits (200), Expect = 2e-13
 Identities = 68/229 (29%), Positives = 114/229 (49%), Gaps = 5/229 (2%)
 Frame = +1

Query: 13   SELQASRS--ICAQTASKLQNLEAQLRANAQVSTGGFSSQKVSD-QLMSEDGNDDNVSCA 183
            SE QAS+   + +Q+   L +    ++  +++ST   S  K +D Q + ED  +      
Sbjct: 553  SEDQASQQEEVSSQSHQPLLDASISMKLQSRISTVLESLSKEADIQRIQEDLRE------ 606

Query: 184  GSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIACLPHGSNGAVSSSDV 363
                         +Q+ +NA  P  +++   + L                S    + S  
Sbjct: 607  ------------IVQEMRNAVVPQSTKSIVEITL----------------SPKTATESQA 638

Query: 364  SVNTGNTGSEL-VKHEDDAKINVNSCIDTVQTNDQALEMAISGIYDFVMILGKEAKALPG 540
            S++ G    E  +   +D+K    SC +++    + L  A+S I+DFV+ LGKEAKA+ G
Sbjct: 639  SLDDGEANLEKEIPVSEDSK----SCNESIHGISKELADAMSQIHDFVLFLGKEAKAIQG 694

Query: 541  SS-DEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684
            ++ D  G+ +KLD FSA Y E I++ +++ +FVLD+SHVLS A+ L FN
Sbjct: 695  TAPDGSGINEKLDDFSATYVEVISNRLSMVNFVLDLSHVLSNASQLHFN 743


>ref|XP_006342030.1| PREDICTED: filament-like plant protein 6-like [Solanum tuberosum]
          Length = 1093

 Score =  144 bits (363), Expect = 3e-32
 Identities = 82/141 (58%), Positives = 100/141 (70%), Gaps = 17/141 (12%)
 Frame = +1

Query: 1   ANRNSELQASRSICAQTASKLQNLEAQLRANA------------QVSTGGFSSQKVSDQL 144
           A+RNSELQASRSICA+T+SKLQ+LEAQL+AN             Q S G  S +  ++ L
Sbjct: 387 AHRNSELQASRSICAKTSSKLQSLEAQLQANVEQKSPQKSTIRRQPSEGSLSHE--ANHL 444

Query: 145 -----MSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEME 309
                MSEDGNDDNVSCA S  T  MS+L++++KEKN DSPHKSE+ +HLDLMDDFLEME
Sbjct: 445 PRLASMSEDGNDDNVSCASSWTTALMSDLTHVKKEKNFDSPHKSESASHLDLMDDFLEME 504

Query: 310 KIACLPHGSNGAVSSSDVSVN 372
           K+A     +NGAVSS D+  N
Sbjct: 505 KLAYQSSDTNGAVSSPDIPNN 525



 Score = 84.0 bits (206), Expect = 4e-14
 Identities = 66/215 (30%), Positives = 111/215 (51%), Gaps = 11/215 (5%)
 Frame = +1

Query: 73   EAQLRANAQVSTGGFSSQKVSDQLMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSP 252
            ++QL+ + + S  G   Q   ++ +S   +      + S+   S          K+AD  
Sbjct: 544  DSQLKEHNETSVSG--DQASRNEEVSSQSHQPLSDTSISMKLQSRISTVLESLSKDADIQ 601

Query: 253  HKSENTNHLDLMDDFLEMEKIACLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVN 432
               E     DL +   EM   A +P  +   V   ++++++ NT +E     DD + N+ 
Sbjct: 602  RIQE-----DLREIVQEMRN-ALIPQSTKSIV---EITLSS-NTATESQPSLDDGEANLE 651

Query: 433  ----------SCIDTVQTNDQALEMAISGIYDFVMILGKEAKALPGSS-DEDGLIKKLDT 579
                      SC +++    + L  A+S I+DFV+ LGKEAKA+ G++ D  G+ +KLD 
Sbjct: 652  KEIPVSEDSKSCNESIHGISKELADAMSQIHDFVLFLGKEAKAIQGTAPDGSGINEKLDD 711

Query: 580  FSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684
            FSA Y E I++++++ +FVLD+SHVLS A+ L FN
Sbjct: 712  FSATYVEVISNKLSMVNFVLDLSHVLSNASQLHFN 746


>gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis]
          Length = 1087

 Score =  142 bits (359), Expect = 7e-32
 Identities = 93/235 (39%), Positives = 135/235 (57%), Gaps = 14/235 (5%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRAN----------AQVSTGGFSSQKVSDQ--- 141
            A RNSELQ SRS+CA+T+SKLQ+LEAQ+++N           Q+S  G  SQ  S+    
Sbjct: 383  AKRNSELQVSRSMCAKTSSKLQSLEAQIQSNNQHKTTPKSIVQISAEGSFSQNASNPPSL 442

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              MSEDGNDD+ SCA S  T  +SE+S ++KEK+ +  +++E  NHL+LMDDFLEMEK+A
Sbjct: 443  TSMSEDGNDDDRSCAESWTTTLISEVSQVKKEKSNEKTNRAEKPNHLNLMDDFLEMEKLA 502

Query: 319  CLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGIYD 498
            CL + SNGA+S SD   +  +  SE V H  DA   V    +   +N  A +   S    
Sbjct: 503  CLSNESNGAISVSD---SMSSKISETVNH--DASEVVMRKEEQCDSNSLANQQLTSN--- 554

Query: 499  FVMILGKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSK 663
                 GK  +  PGS+ E   + KL +  +   E+++ + ++   + DI H + +
Sbjct: 555  -----GKSPELRPGSNSEQLPLMKLQSRISVLLESVSKDSDVGTILEDIKHAIQE 604



 Score = 57.8 bits (138), Expect = 3e-06
 Identities = 32/73 (43%), Positives = 49/73 (67%), Gaps = 1/73 (1%)
 Frame = +1

Query: 469 LEMAISGIYDFVMILGKEAKALPGSSDEDG-LIKKLDTFSAKYAEAINSEINLFDFVLDI 645
           L  AIS I+DFV+ LGKEA  +  +S E     ++++ FS    + I+S+++L DFVLD+
Sbjct: 663 LAAAISQIHDFVLFLGKEAMGVHDTSTEGSEFSQRIEEFSVTLNKVIHSDLSLIDFVLDL 722

Query: 646 SHVLSKATLLDFN 684
           S VL+KA+ L F+
Sbjct: 723 SSVLAKASELRFS 735


>ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-like isoform X1 [Citrus
            sinensis] gi|568839322|ref|XP_006473633.1| PREDICTED:
            filament-like plant protein 4-like isoform X2 [Citrus
            sinensis]
          Length = 1091

 Score =  134 bits (337), Expect = 3e-29
 Identities = 90/235 (38%), Positives = 133/235 (56%), Gaps = 16/235 (6%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
            A RNSELQASR++CA+TASKLQ+LEAQ++ + Q          ++  G++SQ  S+    
Sbjct: 384  AKRNSELQASRNLCAKTASKLQSLEAQMQTSTQQKSPTKSVVQIAAEGYTSQNASNPPSL 443

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              MSED NDD VSCA S AT  +SELS I+KEKN +  +K+E   HL+LMDDFLEMEK+A
Sbjct: 444  TSMSEDDNDDKVSCADSWATALISELSQIKKEKNVEKSNKAETPKHLELMDDFLEMEKLA 503

Query: 319  CLPH--GSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492
            CL +   SNG +++S+      N  S++V H  DA   V S  D +    + +  ++   
Sbjct: 504  CLSNDTNSNGTITASN---GPNNKTSDIVNH--DASGAVTSGEDLLSEQQRDMNPSV--- 555

Query: 493  YDFVMILGKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVL 657
             D +    + +   P +      + KL +  +   E I+ + ++   V DI  V+
Sbjct: 556  -DKLSSNTESSTVNPEADAGQPQLMKLRSRISMLLETISKDADMGKIVEDIKRVV 609



 Score = 73.2 bits (178), Expect = 7e-11
 Identities = 66/237 (27%), Positives = 109/237 (45%), Gaps = 28/237 (11%)
 Frame = +1

Query: 58   KLQNLEAQLRANAQVSTGGFSSQKVSD--------------QLMSEDGNDDNVSCAGSLA 195
            KL  L     +N  ++     + K SD               L+SE   D N S      
Sbjct: 501  KLACLSNDTNSNGTITASNGPNNKTSDIVNHDASGAVTSGEDLLSEQQRDMNPSVD---K 557

Query: 196  TVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEK-IACLPHGSNGAVSSSDVSVN 372
              S +E S +  E +A  P   +  + + ++ + +  +  +  +       V    V+++
Sbjct: 558  LSSNTESSTVNPEADAGQPQLMKLRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLH 617

Query: 373  --TGNTGSELVKHED----------DAKINVNSCID-TVQTNDQALEMAISGIYDFVMIL 513
              + N  SE VK  D          DA++N    ID TVQ   Q L  AI+ I+DFV+ L
Sbjct: 618  QHSANCISEEVKCSDVSCSAEAYPGDARLNTERKIDLTVQVISQELVAAITQIHDFVLFL 677

Query: 514  GKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684
            GKEA+A+  +++E+G  +K++ F   + + I+S   L DFV  +S+VL+KA+ L  N
Sbjct: 678  GKEARAVHDTTNENGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRIN 734


>ref|XP_004291383.1| PREDICTED: filament-like plant protein 6-like [Fragaria vesca subsp.
            vesca]
          Length = 1091

 Score =  133 bits (335), Expect = 5e-29
 Identities = 89/236 (37%), Positives = 127/236 (53%), Gaps = 14/236 (5%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
            + RNSELQASRSICA+T SKLQ LEAQL+   Q          +ST G  S+  S     
Sbjct: 400  SKRNSELQASRSICAKTVSKLQTLEAQLQITGQQKGSPKSVVHISTEGSLSRNASIPPSF 459

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              MSEDGNDD+ SCA S  T   S+LS+ +KEKN +   K+EN NHL+LMDDFLEMEK+A
Sbjct: 460  ASMSEDGNDDDRSCAESWGTTLNSDLSHSKKEKNNEKSSKAENQNHLNLMDDFLEMEKLA 519

Query: 319  CLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGIYD 498
            CLP+ SNG V +S++ +N           E   ++     I + Q ++ +          
Sbjct: 520  CLPNDSNG-VKTSEIEIN-----------EASGEVTATKDIHSEQQHEASFN-------- 559

Query: 499  FVMILGKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                 G  +   PG+++    + KL +  +   E ++ + +    + DI HV+ +A
Sbjct: 560  -----GDLSVLSPGANENKLPLVKLRSRISVLLELLSKDTDFVKVIEDIKHVVQEA 610



 Score = 63.5 bits (153), Expect = 6e-08
 Identities = 42/149 (28%), Positives = 82/149 (55%), Gaps = 7/149 (4%)
 Frame = +1

Query: 259  SENTNHLDLMDDF---LEMEKIACLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINV 429
            S++T+ + +++D    ++  + A  PH     V+S    +++ +   +   H +D+  + 
Sbjct: 591  SKDTDFVKVIEDIKHVVQEAQDALQPH----TVNSVSEEIHSADAICDTQAHPEDSVFST 646

Query: 430  N---SCIDTVQTNDQALEMAISGIYDFVMILGKEAKALPGS-SDEDGLIKKLDTFSAKYA 597
                +  +T+    + L  AIS I+DFV+ LGKE   +  +  D + L +K++ FS  ++
Sbjct: 647  EKETTAKETMSAISEELASAISLIHDFVVFLGKEVVGVHDTFPDSNELSQKIEEFSGTFS 706

Query: 598  EAINSEINLFDFVLDISHVLSKATLLDFN 684
            + I+  ++L D VLD+SHVL+ A+ L FN
Sbjct: 707  KVIHGNLSLVDLVLDLSHVLANASELKFN 735


>ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citrus clementina]
            gi|567885183|ref|XP_006435150.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
            gi|557537271|gb|ESR48389.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
            gi|557537272|gb|ESR48390.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
          Length = 1091

 Score =  133 bits (334), Expect = 6e-29
 Identities = 89/235 (37%), Positives = 133/235 (56%), Gaps = 16/235 (6%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
            A RNSELQASR++CA+TASKLQ+LEAQ++ + Q          ++  G++SQ  S+    
Sbjct: 384  AKRNSELQASRNLCAKTASKLQSLEAQMQTSTQQKSPTKSVVQIAAEGYTSQNASNPPSL 443

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              MSED NDD VSCA S AT  +SELS I+KEKN +  +K+E   HL+LMDDFLEMEK+A
Sbjct: 444  TSMSEDDNDDKVSCADSWATALISELSQIKKEKNVEKSNKAETPKHLELMDDFLEMEKLA 503

Query: 319  CLPH--GSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492
            CL +   SNG +++S+      N  S+++ H  DA   V S  D +    + +  ++   
Sbjct: 504  CLSNDTNSNGTITASN---GPNNKTSDILNH--DASGAVTSGEDLLSEQQRDMNPSV--- 555

Query: 493  YDFVMILGKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVL 657
             D +    + +   P +      + KL +  +   E I+ + ++   V DI  V+
Sbjct: 556  -DKLSSNTESSTVNPEADAGQPQLMKLRSRISMLLETISKDADMGKIVEDIKRVV 609



 Score = 73.6 bits (179), Expect = 6e-11
 Identities = 67/237 (28%), Positives = 108/237 (45%), Gaps = 28/237 (11%)
 Frame = +1

Query: 58   KLQNLEAQLRANAQVSTGGFSSQKVSD--------------QLMSEDGNDDNVSCAGSLA 195
            KL  L     +N  ++     + K SD               L+SE   D N S      
Sbjct: 501  KLACLSNDTNSNGTITASNGPNNKTSDILNHDASGAVTSGEDLLSEQQRDMNPSVD---K 557

Query: 196  TVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEK-IACLPHGSNGAVSSSDVSVN 372
              S +E S +  E +A  P   +  + + ++ + +  +  +  +       V    V+++
Sbjct: 558  LSSNTESSTVNPEADAGQPQLMKLRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLH 617

Query: 373  --TGNTGSELVKHED----------DAKINVNSCID-TVQTNDQALEMAISGIYDFVMIL 513
              + N  SE VK  D          DA +N    ID TVQ   Q L  AIS I+DFV+ L
Sbjct: 618  QHSANCISEEVKCSDVSCSAEAYPGDASLNTERKIDLTVQVISQELVAAISQIHDFVLFL 677

Query: 514  GKEAKALPGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684
            GKEA+A+  +++E+G  +K++ F   + + I+S   L DFV  +S+VL+KA+ L  N
Sbjct: 678  GKEARAVHDTTNENGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRIN 734


>ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa]
            gi|550344134|gb|EEE81259.2| hypothetical protein
            POPTR_0002s02600g [Populus trichocarpa]
          Length = 1063

 Score =  132 bits (333), Expect = 8e-29
 Identities = 86/237 (36%), Positives = 135/237 (56%), Gaps = 15/237 (6%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRAN----------AQVSTGGFSSQKVSDQ--- 141
            A RNSELQASR++CA+TASKLQ+LEAQ + N           QV   G+SSQ +S+    
Sbjct: 375  AKRNSELQASRNLCAKTASKLQSLEAQFQINNHQKSSPKSITQVPAEGYSSQNISNPPSL 434

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              +SEDGNDD  SCA S AT S+S++S+ +K+ + +  +K+EN  HL+LMDDFLEMEK+A
Sbjct: 435  TSVSEDGNDDTQSCADSWATTSVSDVSHFKKDNHIEKSNKAENAKHLELMDDFLEMEKLA 494

Query: 319  CLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGIYD 498
            CL      A S++ +S +  N  SE    +  A++++    D +    + L+   + +  
Sbjct: 495  CL-----NADSATTISSSPNNKASETANTDALAEVSLQK-EDALSEEKRDLDPLANHV-- 546

Query: 499  FVMILGKEAKALPGSSDED-GLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                  K++ A+   SD D     KL +  +   E+++ E+++   + +I  V+  A
Sbjct: 547  ---SCNKDSSAINSGSDADLSSFGKLQSRISMLLESVSKEVDVDKILEEIKQVVHDA 600



 Score = 58.5 bits (140), Expect = 2e-06
 Identities = 64/225 (28%), Positives = 108/225 (48%), Gaps = 11/225 (4%)
 Frame = +1

Query: 43   AQTASKLQNLEAQLRANAQVSTGGFSSQKVSDQLMSEDGND-----DNVSC---AGSLAT 198
            A T S   N +A   AN   +    S QK  +  +SE+  D     ++VSC   + ++ +
Sbjct: 501  ATTISSSPNNKASETANTD-ALAEVSLQK--EDALSEEKRDLDPLANHVSCNKDSSAINS 557

Query: 199  VSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIACLPHGSNGAVSSSDVSVNTG 378
             S ++LS   K ++  S      +  +D+ D  LE  +I  + H +  A S     V   
Sbjct: 558  GSDADLSSFGKLQSRISMLLESVSKEVDV-DKILE--EIKQVVHDAETAASCGSKEV--- 611

Query: 379  NTGSELVKHEDDAKINVNSCID--TVQTNDQALEMAISGIYDFVMILGKEAKALPGSS-D 549
                    H  DA  +  +C +   +    +   +  S I+DFV++LGKEA A+  +S D
Sbjct: 612  --------HHSDATCDRQTCPEDAVIMGEKEITLLQESIIHDFVLLLGKEAMAVHDTSCD 663

Query: 550  EDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684
              GL +K++ FS  + + + S+ +L DF+ D+S VL+ A+ L FN
Sbjct: 664  SIGLSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALASGLRFN 708


>ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa]
           gi|550344133|gb|ERP63976.1| hypothetical protein
           POPTR_0002s02600g [Populus trichocarpa]
          Length = 991

 Score =  132 bits (333), Expect = 8e-29
 Identities = 86/237 (36%), Positives = 135/237 (56%), Gaps = 15/237 (6%)
 Frame = +1

Query: 1   ANRNSELQASRSICAQTASKLQNLEAQLRAN----------AQVSTGGFSSQKVSDQ--- 141
           A RNSELQASR++CA+TASKLQ+LEAQ + N           QV   G+SSQ +S+    
Sbjct: 303 AKRNSELQASRNLCAKTASKLQSLEAQFQINNHQKSSPKSITQVPAEGYSSQNISNPPSL 362

Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
             +SEDGNDD  SCA S AT S+S++S+ +K+ + +  +K+EN  HL+LMDDFLEMEK+A
Sbjct: 363 TSVSEDGNDDTQSCADSWATTSVSDVSHFKKDNHIEKSNKAENAKHLELMDDFLEMEKLA 422

Query: 319 CLPHGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGIYD 498
           CL      A S++ +S +  N  SE    +  A++++    D +    + L+   + +  
Sbjct: 423 CL-----NADSATTISSSPNNKASETANTDALAEVSLQK-EDALSEEKRDLDPLANHV-- 474

Query: 499 FVMILGKEAKALPGSSDED-GLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                 K++ A+   SD D     KL +  +   E+++ E+++   + +I  V+  A
Sbjct: 475 ---SCNKDSSAINSGSDADLSSFGKLQSRISMLLESVSKEVDVDKILEEIKQVVHDA 528



 Score = 58.5 bits (140), Expect = 2e-06
 Identities = 64/225 (28%), Positives = 108/225 (48%), Gaps = 11/225 (4%)
 Frame = +1

Query: 43   AQTASKLQNLEAQLRANAQVSTGGFSSQKVSDQLMSEDGND-----DNVSC---AGSLAT 198
            A T S   N +A   AN   +    S QK  +  +SE+  D     ++VSC   + ++ +
Sbjct: 429  ATTISSSPNNKASETANTD-ALAEVSLQK--EDALSEEKRDLDPLANHVSCNKDSSAINS 485

Query: 199  VSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIACLPHGSNGAVSSSDVSVNTG 378
             S ++LS   K ++  S      +  +D+ D  LE  +I  + H +  A S     V   
Sbjct: 486  GSDADLSSFGKLQSRISMLLESVSKEVDV-DKILE--EIKQVVHDAETAASCGSKEV--- 539

Query: 379  NTGSELVKHEDDAKINVNSCID--TVQTNDQALEMAISGIYDFVMILGKEAKALPGSS-D 549
                    H  DA  +  +C +   +    +   +  S I+DFV++LGKEA A+  +S D
Sbjct: 540  --------HHSDATCDRQTCPEDAVIMGEKEITLLQESIIHDFVLLLGKEAMAVHDTSCD 591

Query: 550  EDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684
              GL +K++ FS  + + + S+ +L DF+ D+S VL+ A+ L FN
Sbjct: 592  SIGLSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALASGLRFN 636


>gb|EOY14987.1| Uncharacterized protein isoform 8, partial [Theobroma cacao]
          Length = 951

 Score =  131 bits (329), Expect = 2e-28
 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
            A RNSEL ASR++CA+T+SKLQ LEAQL  ++Q          +    +SSQ VS+    
Sbjct: 385  AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 444

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              +SEDGNDD+ SCA S AT  MSELS  +KEKN + P+K+EN  HLDLMDDFLEMEK+A
Sbjct: 445  TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 504

Query: 319  CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492
            C  + S  NG ++ SD   +T N  SE V  +   +I   SC +        L  +++ +
Sbjct: 505  CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 558

Query: 493  YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                     +   +   SD D L + KL T  +   ++++ + ++   + DI   +  A
Sbjct: 559  SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 612



 Score = 63.9 bits (154), Expect = 4e-08
 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%)
 Frame = +1

Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501
           HGS+G        +   + G   +  E +  I+    +  + VQT  Q L  AIS I+DF
Sbjct: 629 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 682

Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678
           V+ LGKEA+A+    SD + L  K++ FS  Y + + S ++L DF+ D+S +L+KA+ L 
Sbjct: 683 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 742

Query: 679 FN 684
            N
Sbjct: 743 VN 744


>gb|EOY14986.1| Uncharacterized protein isoform 7 [Theobroma cacao]
          Length = 1107

 Score =  131 bits (329), Expect = 2e-28
 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
            A RNSEL ASR++CA+T+SKLQ LEAQL  ++Q          +    +SSQ VS+    
Sbjct: 389  AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 448

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              +SEDGNDD+ SCA S AT  MSELS  +KEKN + P+K+EN  HLDLMDDFLEMEK+A
Sbjct: 449  TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 508

Query: 319  CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492
            C  + S  NG ++ SD   +T N  SE V  +   +I   SC +        L  +++ +
Sbjct: 509  CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 562

Query: 493  YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                     +   +   SD D L + KL T  +   ++++ + ++   + DI   +  A
Sbjct: 563  SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 616



 Score = 63.9 bits (154), Expect = 4e-08
 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%)
 Frame = +1

Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501
           HGS+G        +   + G   +  E +  I+    +  + VQT  Q L  AIS I+DF
Sbjct: 633 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 686

Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678
           V+ LGKEA+A+    SD + L  K++ FS  Y + + S ++L DF+ D+S +L+KA+ L 
Sbjct: 687 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 746

Query: 679 FN 684
            N
Sbjct: 747 VN 748


>gb|EOY14985.1| Uncharacterized protein isoform 6 [Theobroma cacao]
          Length = 837

 Score =  131 bits (329), Expect = 2e-28
 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%)
 Frame = +1

Query: 1   ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
           A RNSEL ASR++CA+T+SKLQ LEAQL  ++Q          +    +SSQ VS+    
Sbjct: 230 AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 289

Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
             +SEDGNDD+ SCA S AT  MSELS  +KEKN + P+K+EN  HLDLMDDFLEMEK+A
Sbjct: 290 TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 349

Query: 319 CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492
           C  + S  NG ++ SD   +T N  SE V  +   +I   SC +        L  +++ +
Sbjct: 350 CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 403

Query: 493 YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                    +   +   SD D L + KL T  +   ++++ + ++   + DI   +  A
Sbjct: 404 SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 457



 Score = 63.9 bits (154), Expect = 4e-08
 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%)
 Frame = +1

Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501
           HGS+G        +   + G   +  E +  I+    +  + VQT  Q L  AIS I+DF
Sbjct: 474 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 527

Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678
           V+ LGKEA+A+    SD + L  K++ FS  Y + + S ++L DF+ D+S +L+KA+ L 
Sbjct: 528 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 587

Query: 679 FN 684
            N
Sbjct: 588 VN 589


>gb|EOY14984.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 992

 Score =  131 bits (329), Expect = 2e-28
 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
            A RNSEL ASR++CA+T+SKLQ LEAQL  ++Q          +    +SSQ VS+    
Sbjct: 385  AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 444

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              +SEDGNDD+ SCA S AT  MSELS  +KEKN + P+K+EN  HLDLMDDFLEMEK+A
Sbjct: 445  TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 504

Query: 319  CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492
            C  + S  NG ++ SD   +T N  SE V  +   +I   SC +        L  +++ +
Sbjct: 505  CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 558

Query: 493  YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                     +   +   SD D L + KL T  +   ++++ + ++   + DI   +  A
Sbjct: 559  SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 612



 Score = 63.9 bits (154), Expect = 4e-08
 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%)
 Frame = +1

Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501
           HGS+G        +   + G   +  E +  I+    +  + VQT  Q L  AIS I+DF
Sbjct: 629 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 682

Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678
           V+ LGKEA+A+    SD + L  K++ FS  Y + + S ++L DF+ D+S +L+KA+ L 
Sbjct: 683 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 742

Query: 679 FN 684
            N
Sbjct: 743 VN 744


>gb|EOY14983.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 947

 Score =  131 bits (329), Expect = 2e-28
 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%)
 Frame = +1

Query: 1   ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
           A RNSEL ASR++CA+T+SKLQ LEAQL  ++Q          +    +SSQ VS+    
Sbjct: 230 AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 289

Query: 142 -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
             +SEDGNDD+ SCA S AT  MSELS  +KEKN + P+K+EN  HLDLMDDFLEMEK+A
Sbjct: 290 TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 349

Query: 319 CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492
           C  + S  NG ++ SD   +T N  SE V  +   +I   SC +        L  +++ +
Sbjct: 350 CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 403

Query: 493 YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                    +   +   SD D L + KL T  +   ++++ + ++   + DI   +  A
Sbjct: 404 SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 457



 Score = 63.9 bits (154), Expect = 4e-08
 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%)
 Frame = +1

Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501
           HGS+G        +   + G   +  E +  I+    +  + VQT  Q L  AIS I+DF
Sbjct: 474 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 527

Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678
           V+ LGKEA+A+    SD + L  K++ FS  Y + + S ++L DF+ D+S +L+KA+ L 
Sbjct: 528 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 587

Query: 679 FN 684
            N
Sbjct: 588 VN 589


>gb|EOY14982.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 1106

 Score =  131 bits (329), Expect = 2e-28
 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
            A RNSEL ASR++CA+T+SKLQ LEAQL  ++Q          +    +SSQ VS+    
Sbjct: 389  AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 448

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              +SEDGNDD+ SCA S AT  MSELS  +KEKN + P+K+EN  HLDLMDDFLEMEK+A
Sbjct: 449  TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 508

Query: 319  CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492
            C  + S  NG ++ SD   +T N  SE V  +   +I   SC +        L  +++ +
Sbjct: 509  CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 562

Query: 493  YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                     +   +   SD D L + KL T  +   ++++ + ++   + DI   +  A
Sbjct: 563  SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 616



 Score = 63.9 bits (154), Expect = 4e-08
 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%)
 Frame = +1

Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501
           HGS+G        +   + G   +  E +  I+    +  + VQT  Q L  AIS I+DF
Sbjct: 633 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 686

Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678
           V+ LGKEA+A+    SD + L  K++ FS  Y + + S ++L DF+ D+S +L+KA+ L 
Sbjct: 687 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 746

Query: 679 FN 684
            N
Sbjct: 747 VN 748


>gb|EOY14981.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
          Length = 992

 Score =  131 bits (329), Expect = 2e-28
 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
            A RNSEL ASR++CA+T+SKLQ LEAQL  ++Q          +    +SSQ VS+    
Sbjct: 385  AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 444

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              +SEDGNDD+ SCA S AT  MSELS  +KEKN + P+K+EN  HLDLMDDFLEMEK+A
Sbjct: 445  TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 504

Query: 319  CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492
            C  + S  NG ++ SD   +T N  SE V  +   +I   SC +        L  +++ +
Sbjct: 505  CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 558

Query: 493  YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                     +   +   SD D L + KL T  +   ++++ + ++   + DI   +  A
Sbjct: 559  SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 612



 Score = 63.9 bits (154), Expect = 4e-08
 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%)
 Frame = +1

Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501
           HGS+G        +   + G   +  E +  I+    +  + VQT  Q L  AIS I+DF
Sbjct: 629 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 682

Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678
           V+ LGKEA+A+    SD + L  K++ FS  Y + + S ++L DF+ D+S +L+KA+ L 
Sbjct: 683 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 742

Query: 679 FN 684
            N
Sbjct: 743 VN 744


>gb|EOY14980.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 1102

 Score =  131 bits (329), Expect = 2e-28
 Identities = 92/239 (38%), Positives = 132/239 (55%), Gaps = 17/239 (7%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRANAQ----------VSTGGFSSQKVSDQ--- 141
            A RNSEL ASR++CA+T+SKLQ LEAQL  ++Q          +    +SSQ VS+    
Sbjct: 385  AKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSV 444

Query: 142  -LMSEDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIA 318
              +SEDGNDD+ SCA S AT  MSELS  +KEKN + P+K+EN  HLDLMDDFLEMEK+A
Sbjct: 445  TSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLA 504

Query: 319  CLPHGS--NGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCIDTVQTNDQALEMAISGI 492
            C  + S  NG ++ SD   +T N  SE V  +   +I   SC +        L  +++ +
Sbjct: 505  CSSNDSTANGTITISD---STNNKISESVNGDASGEI---SCKELQSEKQHVLSPSVNQV 558

Query: 493  YDFVMILGKEAKALPGSSDEDGL-IKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKA 666
                     +   +   SD D L + KL T  +   ++++ + ++   + DI   +  A
Sbjct: 559  SS-----NMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDA 612



 Score = 63.9 bits (154), Expect = 4e-08
 Identities = 43/122 (35%), Positives = 67/122 (54%), Gaps = 3/122 (2%)
 Frame = +1

Query: 328 HGSNGAVSSSDVSVNTGNTGSELVKHEDDAKINVNSCI--DTVQTNDQALEMAISGIYDF 501
           HGS+G        +   + G   +  E +  I+    +  + VQT  Q L  AIS I+DF
Sbjct: 629 HGSDGTC------IGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDF 682

Query: 502 VMILGKEAKALPG-SSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLD 678
           V+ LGKEA+A+    SD + L  K++ FS  Y + + S ++L DF+ D+S +L+KA+ L 
Sbjct: 683 VLSLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLR 742

Query: 679 FN 684
            N
Sbjct: 743 VN 744


>ref|XP_002893071.1| hypothetical protein ARALYDRAFT_335233 [Arabidopsis lyrata subsp.
            lyrata] gi|297338913|gb|EFH69330.1| hypothetical protein
            ARALYDRAFT_335233 [Arabidopsis lyrata subsp. lyrata]
          Length = 986

 Score =  125 bits (315), Expect = 9e-27
 Identities = 100/288 (34%), Positives = 138/288 (47%), Gaps = 63/288 (21%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRANAQVSTGG------FSSQKVSDQ----LMS 150
            A RNSELQ SR+ICA+TA++LQ LEAQ+   +    G       FS Q  S+      MS
Sbjct: 376  AKRNSELQVSRNICAKTANRLQTLEAQMVNKSPTKRGFEMPAEIFSRQNASNPPSMASMS 435

Query: 151  EDGNDDNVSCAGSLATVSMSELSYIQKEKNADSPHKSENTNHLDLMDDFLEMEKIACLPH 330
            EDGN+D  S AGSL    MSELS   K+KN     K+E+ N L+LMDDFLEMEK+ACLP+
Sbjct: 436  EDGNEDARSVAGSL----MSELSQSNKDKNNAKIKKTESANQLELMDDFLEMEKLACLPN 491

Query: 331  GSNG-----------------------------------------------AVSSSDVSV 369
            GSN                                                AV  + V +
Sbjct: 492  GSNANGTTDHSSADSDGEILPATQLKKRISTVLQSLPKDAAFEKILAEIQCAVKDAGVKL 551

Query: 370  NTGNTGSELVKHEDDAKINVNS-----CIDTVQTNDQALEMAISGIYDFVMILGKEAKAL 534
             +   G+ L    ++ +I +++      +  V+   Q L  A+S IY FV  L KEA A 
Sbjct: 552  PSKCHGANLNGVTEEKEIAMSNETTEEKVTIVEVITQELSDALSQIYQFVSYLAKEATAC 611

Query: 535  PGSSDEDGLI-KKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLL 675
              +  E+    +K++ FS  +   +  E  L DF+ D+S VL +A+ L
Sbjct: 612  QDTFSENRTFSQKVEEFSVTFERVLAKEKTLVDFLFDLSRVLVEASEL 659


>ref|XP_002510512.1| Myosin heavy chain, striated muscle, putative [Ricinus communis]
            gi|223551213|gb|EEF52699.1| Myosin heavy chain, striated
            muscle, putative [Ricinus communis]
          Length = 1041

 Score =  125 bits (315), Expect = 9e-27
 Identities = 101/300 (33%), Positives = 148/300 (49%), Gaps = 72/300 (24%)
 Frame = +1

Query: 1    ANRNSELQASRSICAQTASKLQNLEAQLRAN--------AQVSTGGFSSQKVSDQ----L 144
            A RNSELQASR++CA+TAS+LQ+LEAQ+            QV   G+SSQ +S+      
Sbjct: 383  AKRNSELQASRNLCAKTASRLQSLEAQVSNQQKSSPTSVVQVPIEGYSSQNMSNPPSLTS 442

Query: 145  MSEDGNDDNVSCAGSLATVSMSELSYIQKEKN---------------------------- 240
            MSEDGNDD+ SCA S AT  +SELS ++KEK+                            
Sbjct: 443  MSEDGNDDDRSCADSWATSLISELSQLKKEKSTEKLNKTKNTQHLELMDDFLEMEKLACL 502

Query: 241  ------------------ADSPHKSENTNHLDLMDDFLE--------MEKIACLPHGSNG 342
                              AD P   +  + + ++ + +         +E +  +   ++G
Sbjct: 503  NANVNLVSSMSAANSGSEADQPCLVKLRSRISMLLESISQDADMGKILEDVQRIVQDTHG 562

Query: 343  AVSSSDVSVN-TGNTGSELVKHEDDAKINV----NSCIDTVQTNDQALEMAISGIYDFVM 507
            AVSS    V  T  T  E      D +I +    N+  DTV++ +Q L  A+S I+DFV+
Sbjct: 563  AVSSVSEDVRATDATCPEYASITGDKEITLFQDTNAATDTVRSVNQELATAVSSIHDFVL 622

Query: 508  ILGKEAKAL-PGSSDEDGLIKKLDTFSAKYAEAINSEINLFDFVLDISHVLSKATLLDFN 684
             LGKEA A+   SSD   L +K++ FS  + + +N   +L DF+  +S VL+KA+ L FN
Sbjct: 623  FLGKEAMAVHDTSSDGSDLSQKIEHFSVTFNKVLNGNTSLIDFIFYLSCVLAKASELRFN 682

