BLASTX nr result

ID: Akebia26_contig00025100 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00025100
         (1686 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI15161.3| unnamed protein product [Vitis vinifera]              341   7e-91
ref|XP_006837077.1| hypothetical protein AMTR_s00110p00093580 [A...   319   2e-84
ref|XP_007037494.1| Myosin heavy chain-related, putative isoform...   319   2e-84
ref|XP_002514690.1| conserved hypothetical protein [Ricinus comm...   311   6e-82
ref|XP_007037497.1| Myosin heavy chain-related, putative isoform...   309   2e-81
ref|XP_007037495.1| Myosin heavy chain-related, putative isoform...   309   2e-81
ref|XP_007209209.1| hypothetical protein PRUPE_ppa006629mg [Prun...   303   2e-79
ref|XP_006440698.1| hypothetical protein CICLE_v10020474mg [Citr...   300   1e-78
ref|XP_006477624.1| PREDICTED: tropomyosin-like isoform X1 [Citr...   299   2e-78
ref|XP_004299323.1| PREDICTED: uncharacterized protein LOC101294...   298   6e-78
ref|XP_002322042.2| hypothetical protein POPTR_0015s03460g [Popu...   297   1e-77
emb|CAN78532.1| hypothetical protein VITISV_035305 [Vitis vinifera]   294   7e-77
gb|EXC11033.1| hypothetical protein L484_015253 [Morus notabilis]     279   2e-72
ref|XP_006358271.1| PREDICTED: intracellular protein transport p...   256   3e-65
ref|NP_001190585.1| uncharacterized protein [Arabidopsis thalian...   251   5e-64
ref|XP_007037496.1| Myosin heavy chain-related, putative isoform...   249   3e-63
ref|XP_006358272.1| PREDICTED: intracellular protein transport p...   248   6e-63
ref|XP_002873329.1| hypothetical protein ARALYDRAFT_487621 [Arab...   244   9e-62
ref|NP_196406.2| myosin heavy chain-like protein [Arabidopsis th...   240   1e-60
ref|XP_004137423.1| PREDICTED: uncharacterized protein LOC101221...   239   4e-60

>emb|CBI15161.3| unnamed protein product [Vitis vinifera]
          Length = 420

 Score =  341 bits (874), Expect = 7e-91
 Identities = 200/405 (49%), Positives = 258/405 (63%), Gaps = 7/405 (1%)
 Frame = -3

Query: 1639 EMSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEA 1460
            +M S  +S++D S+++++LLQ+ T C++L++E NMLRESQS S ELIRRLEL V+ LSEA
Sbjct: 16   KMFSSSKSESDSSYDIEDLLQIETRCKQLKRETNMLRESQSESFELIRRLELHVRTLSEA 75

Query: 1459 RSKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEI 1280
            RS+D K+I ELE +L+N SQEI YLQDQLN R+ E  CLGEHVHSLELKLA+   L   +
Sbjct: 76   RSEDEKHIQELERELRNCSQEIDYLQDQLNARDAEVKCLGEHVHSLELKLADKDNLEDMV 135

Query: 1279 GQLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDI 1100
            G+L +EL +SNS+   LM+ELENKE++LQ S LCI+K                 SMKL++
Sbjct: 136  GRLMQELKRSNSECMLLMQELENKEVELQMSSLCIDKLEESISSVTLEFQCEMESMKLEM 195

Query: 1099 TALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISE 920
              LEQ   EAKK QD A+ EK +M+ L++EF+VQ QD+Q MI C           L  SE
Sbjct: 196  ITLEQSCFEAKKLQDEASEEKTKMNGLIQEFQVQLQDAQKMIECLDKENKELRGKLKTSE 255

Query: 919  RNAKTLCRKVEEYLGEWLGKHAIVDIPS------CRSELLVSKEIGTCEEVLGLLLSKLE 758
             +A  L +K++E+  EWL      ++ +        S+  +S E+ T  EVL  L  KL 
Sbjct: 256  MDAILLRQKIKEHSEEWLENKDESELKTQSSSGELESKFNLSTEMSTSAEVLVPLFPKLA 315

Query: 757  IVA-EDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQIT 581
            + A  D   K++ EKMSH+I                         EDL QEMAELRYQIT
Sbjct: 316  VSATSDVGLKEKMEKMSHQIHGYELLVKQLKEELREEKLKAKEEAEDLAQEMAELRYQIT 375

Query: 580  GMLEEECKRRACIEQASLQRISELEAQVGKEQKKSSIALIRYHEA 446
            GMLEEECKRRACIEQASLQRI+ELEAQ+ KEQ KS  A+ R+ EA
Sbjct: 376  GMLEEECKRRACIEQASLQRIAELEAQIQKEQTKSYAAIRRFREA 420


>ref|XP_006837077.1| hypothetical protein AMTR_s00110p00093580 [Amborella trichopoda]
            gi|548839670|gb|ERM99930.1| hypothetical protein
            AMTR_s00110p00093580 [Amborella trichopoda]
          Length = 509

 Score =  319 bits (818), Expect = 2e-84
 Identities = 207/505 (40%), Positives = 288/505 (57%), Gaps = 19/505 (3%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRES----QSRSTELIRRLELDVKLL 1469
            MSS F+S+N YS +VDELL+LG  C+ELRKEN++LRES    QS++ E+I+RLE ++K L
Sbjct: 2    MSSSFKSENAYSVDVDELLELGILCQELRKENDILRESLLLEQSKNGEVIKRLESELKEL 61

Query: 1468 SEARSKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLH 1289
             +A S+D  +I  LE++L+  S++IGYLQDQLNL+N+EA+ + EH+HSLELKL E +KLH
Sbjct: 62   HDAHSEDMMHIGSLESELRTCSRKIGYLQDQLNLKNVEASYVAEHIHSLELKLVEAAKLH 121

Query: 1288 QEIGQLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMK 1109
            +++  L++EL KS+S+R +LM ELE K+ +L+NS   IE                  S++
Sbjct: 122  EKVTYLREELEKSDSERLALMEELELKKKELENSAFHIENLEVIISSLTLESQCEIESIR 181

Query: 1108 LDITALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLG 929
             ++ A E   +E K   +NAA E   M +L++ ++ Q ++++ MI             L 
Sbjct: 182  HELVACEAKYTEVKVSNENAAKETAGMADLIKLYKEQFKEAKQMITSLEKENITLQEKLA 241

Query: 928  ISERNAKTLCRKVEEYLGEWLGKHAIVDIPS----------CRSELLVSKEIGTCEEVLG 779
              E+     C KVE +L + L     + +P             +EL V KEI T EE L 
Sbjct: 242  NCEKTTVLFCHKVETHLDQLL--KGQIRLPMLGFNQSMANLLENELTVEKEISTGEETLL 299

Query: 778  LLLSKLEIV-AEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMA 602
             +LSKL I+ A D+   DE EKMSH+I                         EDLTQEMA
Sbjct: 300  PILSKLSIIDASDECLDDELEKMSHQIRESQLLIEQLREELRKEKARAKEDAEDLTQEMA 359

Query: 601  ELRYQITGMLEEECKRRACIEQASLQRISELEAQVGKEQKKSSIALIRYHEAQKLAESRS 422
            E+RYQ+ GMLEEEC RRACIEQASL RI ELEAQV KE+ +S  A I   EA+KLAE RS
Sbjct: 360  EMRYQVMGMLEEECSRRACIEQASLHRIEELEAQVRKEEMRSQAAEICCREAEKLAEDRS 419

Query: 421  MEVHQLKKVL----REGPCKDSKRNEQCSCGECITLRTLDRVDDGLVEAEPVELVSSDDD 254
             EV  LK VL    R+G    +++ E CS  +C+ +       + L   E    ++S+ D
Sbjct: 420  KEVENLKNVLAGLQRDG---GTQKAEACSSEDCLRVEKPSSPSEELAGDE--SKITSNKD 474

Query: 253  RSTLATITWS*RGKEKLCNIGRGIF 179
                A + W     E L +    IF
Sbjct: 475  NEDQAIVAWCKEDPEPLYDERETIF 499


>ref|XP_007037494.1| Myosin heavy chain-related, putative isoform 1 [Theobroma cacao]
            gi|508774739|gb|EOY21995.1| Myosin heavy chain-related,
            putative isoform 1 [Theobroma cacao]
          Length = 396

 Score =  319 bits (818), Expect = 2e-84
 Identities = 192/402 (47%), Positives = 252/402 (62%), Gaps = 5/402 (1%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1457
            MSS  +S+ D S NV+ELL++ T CRELRKE  ML+ESQS+  ELIR LE+ VK LSEAR
Sbjct: 1    MSSSSKSEGDNSINVEELLEIETRCRELRKEKEMLKESQSQGFELIRSLEVHVKSLSEAR 60

Query: 1456 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1277
             +D K+I +LE +LKN SQEI YLQDQL+ RN E N L EHVH LE+KLA+   L +++ 
Sbjct: 61   VQDKKHIKKLEGELKNCSQEIDYLQDQLSARNEEVNFLTEHVHDLEIKLADKGNLQEKVD 120

Query: 1276 QLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1097
            +L  EL  SNSDR SLM+E+ENKE +LQ S LCIEK                 SMKLDIT
Sbjct: 121  RLIGELNSSNSDRLSLMQEIENKEEELQQSALCIEKLEESVSSMALESQCEIESMKLDIT 180

Query: 1096 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISER 917
            ALEQ+  EA K ++    EK RM+ L+EE EVQ Q++  +I             L  SE+
Sbjct: 181  ALEQMCLEANKTEE----EKSRMNILIEELEVQLQNALKIIEGLDDENKELRGKLITSEK 236

Query: 916  NAKTLCRKVEEYL----GEWLGKHAIVDIPSCRSELLVSKEIGTCEEVLGLLLSKLEIVA 749
            NAK  C+K++E+L       L  H++  +    S + +SK+I  C+E+   LLS++ ++ 
Sbjct: 237  NAKIFCQKIKEWLKSKDRSQLNMHSV--LGEQESMMTISKDISGCKELFSALLSEVALLL 294

Query: 748  E-DKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 572
            E D ++K++ E MSH+I                         EDL QEMAELRY++ G+L
Sbjct: 295  ESDADSKEQYESMSHQINEYELLVKQLKDELREQKLKAKEEAEDLAQEMAELRYRMMGLL 354

Query: 571  EEECKRRACIEQASLQRISELEAQVGKEQKKSSIALIRYHEA 446
            EEECKRRACIEQASLQRI+ELEAQ+ KE +KS   +   HE+
Sbjct: 355  EEECKRRACIEQASLQRIAELEAQIQKEPQKSDAVVRHLHES 396


>ref|XP_002514690.1| conserved hypothetical protein [Ricinus communis]
            gi|223546294|gb|EEF47796.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 407

 Score =  311 bits (797), Expect = 6e-82
 Identities = 193/406 (47%), Positives = 248/406 (61%), Gaps = 15/406 (3%)
 Frame = -3

Query: 1618 SDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAKY 1439
            S  D + +V+ELLQ+GT C+ELRKE +MLRESQS+S ELIRRLEL VK LSEA S+D K+
Sbjct: 4    SSGDSTLDVEELLQIGTRCKELRKEKDMLRESQSQSFELIRRLELHVKSLSEAHSEDRKH 63

Query: 1438 IWELENDLKNFSQEI-----------GYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKL 1292
            I +LE +L N SQEI            YLQDQLN RN E   LGEHVH LELKL ++  L
Sbjct: 64   IQKLERELLNCSQEIVWISKIITFLTDYLQDQLNARNAEVYSLGEHVHELELKLVDMDDL 123

Query: 1291 HQEIGQLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSM 1112
              +I QL++EL KS+S+ F L++ELE KE++LQ S+  IEK                 SM
Sbjct: 124  LVKISQLQEELRKSDSECFLLIQELERKEVELQKSVSFIEKLEESVASFTLDSQCEIESM 183

Query: 1111 KLDITALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXL 932
            KLD+ ALEQ   E+KK Q+    EK  MD LV+E + Q  D++ +I+C           L
Sbjct: 184  KLDVMALEQACCESKKKQEETTMEKDTMDGLVQELKNQVYDAEEIIQCLEKENKELRVKL 243

Query: 931  GISERNAKTLCRKVEEYLGEWLGKHAIVDIPSCRSEL---LVSKEIGTCEEVLGLLLSKL 761
              SE N +   +K+EE++      + ++      SEL    +SKE+  C EVLGLL SKL
Sbjct: 244  ATSEMNGRIFIQKIEEWMEN--QDNLLLSTQPYSSELEKENMSKEMSACGEVLGLLFSKL 301

Query: 760  EIV-AEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQI 584
             IV A + + K + +++SHKI                         EDL QEMAELR+Q+
Sbjct: 302  AIVLAPESDLKKQMKRLSHKIREYEVLMNQLKEDLREEKLKAKEEAEDLAQEMAELRHQM 361

Query: 583  TGMLEEECKRRACIEQASLQRISELEAQVGKEQKKSSIALIRYHEA 446
            TG+LEEECKRRACIEQASLQRI+ELEAQ+ KEQ+K S A+   HEA
Sbjct: 362  TGLLEEECKRRACIEQASLQRIAELEAQIQKEQRKPSFAIRTLHEA 407


>ref|XP_007037497.1| Myosin heavy chain-related, putative isoform 4 [Theobroma cacao]
            gi|508774742|gb|EOY21998.1| Myosin heavy chain-related,
            putative isoform 4 [Theobroma cacao]
          Length = 383

 Score =  309 bits (792), Expect = 2e-81
 Identities = 187/385 (48%), Positives = 243/385 (63%), Gaps = 5/385 (1%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1457
            MSS  +S+ D S NV+ELL++ T CRELRKE  ML+ESQS+  ELIR LE+ VK LSEAR
Sbjct: 1    MSSSSKSEGDNSINVEELLEIETRCRELRKEKEMLKESQSQGFELIRSLEVHVKSLSEAR 60

Query: 1456 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1277
             +D K+I +LE +LKN SQEI YLQDQL+ RN E N L EHVH LE+KLA+   L +++ 
Sbjct: 61   VQDKKHIKKLEGELKNCSQEIDYLQDQLSARNEEVNFLTEHVHDLEIKLADKGNLQEKVD 120

Query: 1276 QLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1097
            +L  EL  SNSDR SLM+E+ENKE +LQ S LCIEK                 SMKLDIT
Sbjct: 121  RLIGELNSSNSDRLSLMQEIENKEEELQQSALCIEKLEESVSSMALESQCEIESMKLDIT 180

Query: 1096 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISER 917
            ALEQ+  EA K ++    EK RM+ L+EE EVQ Q++  +I             L  SE+
Sbjct: 181  ALEQMCLEANKTEE----EKSRMNILIEELEVQLQNALKIIEGLDDENKELRGKLITSEK 236

Query: 916  NAKTLCRKVEEYL----GEWLGKHAIVDIPSCRSELLVSKEIGTCEEVLGLLLSKLEIVA 749
            NAK  C+K++E+L       L  H++  +    S + +SK+I  C+E+   LLS++ ++ 
Sbjct: 237  NAKIFCQKIKEWLKSKDRSQLNMHSV--LGEQESMMTISKDISGCKELFSALLSEVALLL 294

Query: 748  E-DKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 572
            E D ++K++ E MSH+I                         EDL QEMAELRY++ G+L
Sbjct: 295  ESDADSKEQYESMSHQINEYELLVKQLKDELREQKLKAKEEAEDLAQEMAELRYRMMGLL 354

Query: 571  EEECKRRACIEQASLQRISELEAQV 497
            EEECKRRACIEQASLQRI+ELEAQV
Sbjct: 355  EEECKRRACIEQASLQRIAELEAQV 379


>ref|XP_007037495.1| Myosin heavy chain-related, putative isoform 2 [Theobroma cacao]
            gi|508774740|gb|EOY21996.1| Myosin heavy chain-related,
            putative isoform 2 [Theobroma cacao]
          Length = 406

 Score =  309 bits (792), Expect = 2e-81
 Identities = 188/389 (48%), Positives = 244/389 (62%), Gaps = 5/389 (1%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1457
            MSS  +S+ D S NV+ELL++ T CRELRKE  ML+ESQS+  ELIR LE+ VK LSEAR
Sbjct: 1    MSSSSKSEGDNSINVEELLEIETRCRELRKEKEMLKESQSQGFELIRSLEVHVKSLSEAR 60

Query: 1456 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1277
             +D K+I +LE +LKN SQEI YLQDQL+ RN E N L EHVH LE+KLA+   L +++ 
Sbjct: 61   VQDKKHIKKLEGELKNCSQEIDYLQDQLSARNEEVNFLTEHVHDLEIKLADKGNLQEKVD 120

Query: 1276 QLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1097
            +L  EL  SNSDR SLM+E+ENKE +LQ S LCIEK                 SMKLDIT
Sbjct: 121  RLIGELNSSNSDRLSLMQEIENKEEELQQSALCIEKLEESVSSMALESQCEIESMKLDIT 180

Query: 1096 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISER 917
            ALEQ+  EA K ++    EK RM+ L+EE EVQ Q++  +I             L  SE+
Sbjct: 181  ALEQMCLEANKTEE----EKSRMNILIEELEVQLQNALKIIEGLDDENKELRGKLITSEK 236

Query: 916  NAKTLCRKVEEYL----GEWLGKHAIVDIPSCRSELLVSKEIGTCEEVLGLLLSKLEIVA 749
            NAK  C+K++E+L       L  H++  +    S + +SK+I  C+E+   LLS++ ++ 
Sbjct: 237  NAKIFCQKIKEWLKSKDRSQLNMHSV--LGEQESMMTISKDISGCKELFSALLSEVALLL 294

Query: 748  E-DKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 572
            E D ++K++ E MSH+I                         EDL QEMAELRY++ G+L
Sbjct: 295  ESDADSKEQYESMSHQINEYELLVKQLKDELREQKLKAKEEAEDLAQEMAELRYRMMGLL 354

Query: 571  EEECKRRACIEQASLQRISELEAQVGKEQ 485
            EEECKRRACIEQASLQRI+ELEAQ  K Q
Sbjct: 355  EEECKRRACIEQASLQRIAELEAQSLKNQ 383


>ref|XP_007209209.1| hypothetical protein PRUPE_ppa006629mg [Prunus persica]
            gi|462404944|gb|EMJ10408.1| hypothetical protein
            PRUPE_ppa006629mg [Prunus persica]
          Length = 402

 Score =  303 bits (776), Expect = 2e-79
 Identities = 192/402 (47%), Positives = 244/402 (60%), Gaps = 8/402 (1%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1457
            MSS  + +   SF+V+ELLQ+GT CREL+KE +ML+ES S+S  LIRRLE+ V  LSEA 
Sbjct: 1    MSSSTKGNTVSSFDVEELLQIGTRCRELKKEKDMLKESHSQSFGLIRRLEVHVNSLSEAC 60

Query: 1456 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1277
            ++D K I  LE +LKN SQEI YLQDQLN RN E N L EH H LE KLA++  L + + 
Sbjct: 61   TEDKKQIQVLEKELKNCSQEIDYLQDQLNARNTEVNLLEEHTHGLEFKLADMENLQETVD 120

Query: 1276 QLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1097
            +L+ EL KS S+R  LM ELE+KE++LQNS LCI++                 SMKLDI 
Sbjct: 121  RLRDELKKSYSERMFLMEELESKEIELQNSALCIDELEESISSMSLESQCEIESMKLDIL 180

Query: 1096 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISER 917
            ALE    E KK Q+    EK RM EL++E EVQ Q++   +             L  SE 
Sbjct: 181  ALEHSFLEVKKIQEETVQEKTRMSELIQELEVQCQNAHKTVESLYMENKELRKKLDASET 240

Query: 916  NAKTLCRKVEEYLGEWLGKHAI-VDIPSCRSEL----LVSKEIGTCEEVLGLLLSKLEI- 755
            + +  C++VE    +WL K  I +D  S   +L    + SKE+ +C EVLG L SKL I 
Sbjct: 241  STRIFCQRVE----KWLEKDRIQLDSESPLGQLEGNYIYSKEM-SCGEVLGPLFSKLAIV 295

Query: 754  VAEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGM 575
            VA D ++  + EKMSH I                         EDL QEMAELRY++TG+
Sbjct: 296  VAPDADSIMKMEKMSHHIQDYELLVKQLKEELKEEKLKAKEEAEDLAQEMAELRYRMTGL 355

Query: 574  LEEECKRRACIEQASLQRISELEAQVGKEQKKS--SIALIRY 455
            LEEECKRRACIEQASLQRI+ELEAQV KE+ +S  S A +R+
Sbjct: 356  LEEECKRRACIEQASLQRIAELEAQVTKERTQSVKSFAALRH 397


>ref|XP_006440698.1| hypothetical protein CICLE_v10020474mg [Citrus clementina]
            gi|557542960|gb|ESR53938.1| hypothetical protein
            CICLE_v10020474mg [Citrus clementina]
          Length = 399

 Score =  300 bits (768), Expect = 1e-78
 Identities = 186/391 (47%), Positives = 238/391 (60%), Gaps = 5/391 (1%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1457
            MS   RSD +  F+V+ELLQ+ T CRELRKE + LRESQS+S +LI+RLEL  K LSEA 
Sbjct: 1    MSISSRSDGESVFDVEELLQIETRCRELRKEKDTLRESQSQSFDLIKRLELHAKSLSEAH 60

Query: 1456 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1277
            ++D K+I +LE +L N SQEI YLQDQLN RN E   L EHVHSLELKL ++  L  ++G
Sbjct: 61   NEDKKHIQKLERELMNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVG 120

Query: 1276 QLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1097
            QL++EL +S+S+   LM EL++KE  L+NS L I+K                 S+K+D+ 
Sbjct: 121  QLEEELRRSDSECLLLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIASLKIDMI 180

Query: 1096 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISER 917
            ALEQ   EAKK       EK RM+ L++E EV++QDSQ +I C           L   E 
Sbjct: 181  ALEQTCVEAKKVHKENVQEKVRMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYET 240

Query: 916  NAKTLCRKVEEYLGEWLGKHAIVDIPSCRSEL----LVSKEIGTCEEVLGLLLSKLEIV- 752
            N +  C+K+EE++ +   K   +DI S  SEL     VSKE   C +V G LLSKL +V 
Sbjct: 241  NGRVFCQKIEEWMEKEDRKQ--LDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVL 298

Query: 751  AEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 572
              D N K++ + MS +I                         EDL QEMAELRYQ+T +L
Sbjct: 299  GPDANLKEKIKGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLL 358

Query: 571  EEECKRRACIEQASLQRISELEAQVGKEQKK 479
            EEECKRRACIEQASLQRI+ELE Q+ K Q K
Sbjct: 359  EEECKRRACIEQASLQRIAELETQIEKGQNK 389


>ref|XP_006477624.1| PREDICTED: tropomyosin-like isoform X1 [Citrus sinensis]
          Length = 399

 Score =  299 bits (766), Expect = 2e-78
 Identities = 185/391 (47%), Positives = 239/391 (61%), Gaps = 5/391 (1%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1457
            MS   +SD +  F+V+ELLQ+ T CRELRKE + LRESQS+S +LI+RLE+  K LSEA 
Sbjct: 1    MSISSKSDGESVFDVEELLQIETRCRELRKEKDTLRESQSQSFDLIKRLEIHAKSLSEAH 60

Query: 1456 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1277
            ++D K+I +LE +L N SQEI YLQDQLN RN E   L EHVHSLELKL ++  L  ++G
Sbjct: 61   NEDKKHIQKLERELMNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVG 120

Query: 1276 QLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1097
            QL++EL +S+S+   LM EL++KE  L+NS L I+K                 S+K+D+ 
Sbjct: 121  QLEEELRRSDSECLLLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMI 180

Query: 1096 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISER 917
            ALEQ   EAKK       EK RM+ L++E EV++QDSQ +I C           L   E 
Sbjct: 181  ALEQTCVEAKKVHKENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYET 240

Query: 916  NAKTLCRKVEEYLGEWLGKHAIVDIPSCRSEL----LVSKEIGTCEEVLGLLLSKLEIV- 752
            N +  C+K+EE++ +   K   +DI S  SEL     VSKE   C +V G LLSKL +V 
Sbjct: 241  NGRVFCQKIEEWMEKEDRKQ--LDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVL 298

Query: 751  AEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 572
            A D N K++ + MS +I                         EDL QEMAELRYQ+T +L
Sbjct: 299  APDANLKEKIKGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLL 358

Query: 571  EEECKRRACIEQASLQRISELEAQVGKEQKK 479
            EEECKRRACIEQASLQRI+ELE Q+ K Q K
Sbjct: 359  EEECKRRACIEQASLQRIAELETQIEKGQNK 389


>ref|XP_004299323.1| PREDICTED: uncharacterized protein LOC101294367 [Fragaria vesca
            subsp. vesca]
          Length = 395

 Score =  298 bits (762), Expect = 6e-78
 Identities = 184/392 (46%), Positives = 240/392 (61%), Gaps = 2/392 (0%)
 Frame = -3

Query: 1618 SDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAKY 1439
            S +D SF+++ELLQ+G+ CREL+KE +ML+ESQS+S  LIR L++ +K LSE  ++D K 
Sbjct: 4    SSSDSSFDIEELLQIGSRCRELKKEKDMLKESQSQSFGLIRSLDVHMKSLSEFHTEDKKQ 63

Query: 1438 IWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKEL 1259
            I  LE +LKN SQEI YLQDQLN R+ E N L EHVHSLELKLA++  L   + +L+ EL
Sbjct: 64   IQMLEKELKNCSQEIDYLQDQLNARDTEVNLLQEHVHSLELKLADMETLQVTVDRLRDEL 123

Query: 1258 VKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQLS 1079
             KS S+   LM+ELENKE++LQNS L IEK                 SMKLD+ ALEQ  
Sbjct: 124  KKSYSECLFLMQELENKEVELQNSNLFIEKLEESVSSISLESQCEIESMKLDMLALEQSF 183

Query: 1078 SEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISERNAKTLC 899
             EAKK Q+    EK RM+EL++E EVQ QD+Q                L  SE N +  C
Sbjct: 184  LEAKKIQEETVQEKTRMNELIQELEVQCQDAQKTTDDLYIENKELREKLDTSETNTRIFC 243

Query: 898  RKVEEYLGEWLGKHAIVDIPSCRSE-LLVSKEIGTCEEVLGLLLSKL-EIVAEDKNTKDE 725
            +++E++L     +  +  + + + E    S ++ TC EVL  L SKL +++A D N   +
Sbjct: 244  QRIEKWLENDRYESKLESLLNEQDEKCTFSTDMSTCGEVLEPLFSKLAKVLAPDANFIVK 303

Query: 724  NEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRAC 545
             ++MSH+I                         EDL QEMAELRYQ+TG+LEEECKRRA 
Sbjct: 304  MKEMSHQIHEYELLVKQLKEELREEKLKAKEEAEDLAQEMAELRYQLTGLLEEECKRRAY 363

Query: 544  IEQASLQRISELEAQVGKEQKKSSIALIRYHE 449
            IEQASLQRISELEAQV K + KSS  L+   E
Sbjct: 364  IEQASLQRISELEAQVHKARTKSSTCLLSLDE 395


>ref|XP_002322042.2| hypothetical protein POPTR_0015s03460g [Populus trichocarpa]
            gi|550321847|gb|EEF06169.2| hypothetical protein
            POPTR_0015s03460g [Populus trichocarpa]
          Length = 406

 Score =  297 bits (760), Expect = 1e-77
 Identities = 188/410 (45%), Positives = 244/410 (59%), Gaps = 13/410 (3%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1457
            MSS  +SD D SF+ +ELLQ+GT CRELRKE +MLR+SQ +S ELIRRLEL VK LSEAR
Sbjct: 1    MSSSSKSDGDSSFDAEELLQIGTRCRELRKEKDMLRDSQPQSFELIRRLELHVKQLSEAR 60

Query: 1456 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1277
            ++D K+I +LE +L N SQEI YLQDQLN RN E   LG HVH LELKLA +  L    G
Sbjct: 61   TEDKKHIQKLERELLNCSQEIDYLQDQLNARNSEVYTLGGHVHELELKLANMEHLQANNG 120

Query: 1276 QLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1097
            QL++EL + +S+   L++ELE+KE++LQ S LCI K                 SMKLD+ 
Sbjct: 121  QLREELKRCDSEHLLLLQELESKEIELQESALCIGKLEESISSLTLDSQCEIESMKLDMI 180

Query: 1096 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISER 917
            ALEQ   +AKK Q+    E  RM+ L++E E Q  +++  I C           L  S+ 
Sbjct: 181  ALEQACFKAKKTQEETIQENARMNGLIKELEFQILEAKETIECVEKENIELRDKLVTSDV 240

Query: 916  NAKTLCRKVEEYLGEWLGKHAIVDIPSCRSEL----LVSKEIGTCEEVLGLLLSKL-EIV 752
            N+K   +++EE+L       + ++  SC SE+     +SKE+    E LG   SKL  ++
Sbjct: 241  NSKLFLQQIEEWLEN--KDTSQLNTQSCSSEIEHQSNMSKEM---REALGPCFSKLATLL 295

Query: 751  AEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 572
              + N K+  E MSH+I                         +DL QEMAELRYQ+TG+L
Sbjct: 296  GSESNLKEWMESMSHQIRKYEVLVKQLKDELREEKSKAKEEADDLAQEMAELRYQMTGLL 355

Query: 571  EEECKRRACIEQASLQRISELEAQV--------GKEQKKSSIALIRYHEA 446
            EEECKRRACIEQASLQRISELEAQV         +E++K   A+   HEA
Sbjct: 356  EEECKRRACIEQASLQRISELEAQVFLVFPSKIERERRKFFAAVGHLHEA 405


>emb|CAN78532.1| hypothetical protein VITISV_035305 [Vitis vinifera]
          Length = 1164

 Score =  294 bits (753), Expect = 7e-77
 Identities = 184/408 (45%), Positives = 239/408 (58%), Gaps = 11/408 (2%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1457
            M S  +S++D S+++++LLQ+ T C++                    RLEL V+ LSEAR
Sbjct: 777  MFSSSKSESDSSYDIEDLLQIETRCKQ--------------------RLELHVRTLSEAR 816

Query: 1456 SKDAKYIWELENDLKNFSQEI----GYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLH 1289
            S+D K+I ELE +L+N SQEI     YLQDQLN R+ E  CLGEH HSLELKLA+   L 
Sbjct: 817  SEDEKHIQELERELRNCSQEIVFLVDYLQDQLNARDAEVKCLGEHAHSLELKLADKDNLE 876

Query: 1288 QEIGQLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMK 1109
              +G+L +EL +SNS+   LM+ELENKE++LQ S LCI+K                 SMK
Sbjct: 877  DMVGRLMEELKRSNSECMFLMQELENKEVELQTSSLCIDKLEESISSVTLEFQCEIESMK 936

Query: 1108 LDITALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLG 929
            L++  LEQ   EAKK QD A+ EK +M+ L++EF+VQ QD+Q MI C           L 
Sbjct: 937  LEMITLEQSCFEAKKLQDEASEEKTKMNGLIQEFQVQLQDAQKMIECLDKENKELRGKLK 996

Query: 928  ISERNAKTLCRKVEEYLGEWLGKHAIVDIPS------CRSELLVSKEIGTCEEVLGLLLS 767
             SE +A  L +K++E+  EWL      ++ +        S+  +S E+ T  EVL  L  
Sbjct: 997  TSEMDAILLRQKIKEHSEEWLENKDESELKTQSSSGELESKFNLSTEMSTSAEVLVPLFP 1056

Query: 766  KLEIVA-EDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRY 590
            KL + A  D   K++ EKMSH+I                         EDL QEMAELRY
Sbjct: 1057 KLAVSATSDVXLKEKMEKMSHQIHGYELLVKQLKEELREEKLKAKEEAEDLAQEMAELRY 1116

Query: 589  QITGMLEEECKRRACIEQASLQRISELEAQVGKEQKKSSIALIRYHEA 446
            QITGMLEEECKRRACIEQASLQRI+ELEAQ+ KEQ KS  A+ R+ EA
Sbjct: 1117 QITGMLEEECKRRACIEQASLQRIAELEAQIQKEQTKSYAAIRRFREA 1164


>gb|EXC11033.1| hypothetical protein L484_015253 [Morus notabilis]
          Length = 380

 Score =  279 bits (714), Expect = 2e-72
 Identities = 177/399 (44%), Positives = 239/399 (59%), Gaps = 2/399 (0%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1457
            MSS  RS +D + +V+ELLQ+GT CRELR+E +ML+ESQS+S +LIRRLE  V  LS A 
Sbjct: 1    MSSHSRSQSDNTSDVEELLQIGTRCRELRREKDMLKESQSQSFDLIRRLERHVTSLSAAS 60

Query: 1456 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1277
            ++D K I  LE +L N SQEI YLQDQ N RN E N L +H+  LELKLA++  L + +G
Sbjct: 61   TEDKKCIEMLEKELMNCSQEIDYLQDQGNARNTEVNVLKDHLRDLELKLADMEYLQEAVG 120

Query: 1276 QLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1097
            +L++EL +S+SD   LM+ELE++E++LQNS LCIE+                 S+KL+I 
Sbjct: 121  RLREELKRSDSDCLFLMQELESREVELQNSSLCIERLRMSISSITLDSQCEIESLKLEIV 180

Query: 1096 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISER 917
             LEQ   EA+K Q+ A  EK R+++LV + E Q QD+Q  IR            L  SE 
Sbjct: 181  TLEQSCFEAEKSQEKAIQEKARINQLVRDLEAQFQDAQKNIRRLELENKELREKLDTSET 240

Query: 916  NAKTLCRKVEEYLGEWLGKHAIVD-IPSCRSELLVSKEIGTCEEVLGLLLSKLE-IVAED 743
              +T  + +E+ L     +  I   +    ++L++S +  TC EVL  L+SKLE ++  D
Sbjct: 241  KVRTFWQMLEKLLARDGSQPDIKQLVNEIEAKLMMSNDPSTCGEVLSPLISKLETLLGRD 300

Query: 742  KNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEE 563
             +  ++ E    K+                         EDL QEMAELRYQ+TG+LEEE
Sbjct: 301  GDDMEKEELREEKL-------------------KAKEEAEDLAQEMAELRYQMTGLLEEE 341

Query: 562  CKRRACIEQASLQRISELEAQVGKEQKKSSIALIRYHEA 446
              RRACIEQAS QRI+ELEAQV KEQ+KS  A+   H A
Sbjct: 342  RNRRACIEQASTQRIAELEAQVQKEQRKSLDAVKYLHGA 380


>ref|XP_006358271.1| PREDICTED: intracellular protein transport protein USO1-like isoform
            X1 [Solanum tuberosum]
          Length = 399

 Score =  256 bits (653), Expect = 3e-65
 Identities = 158/382 (41%), Positives = 224/382 (58%), Gaps = 1/382 (0%)
 Frame = -3

Query: 1618 SDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAKY 1439
            S  ++SF+V ELL++   C+ELRKE + LR SQ +S ELIR++E  V+ LSEAR +D  +
Sbjct: 4    SSGEHSFDVKELLEIRARCKELRKEKDTLRGSQGQSVELIRKIEQHVQTLSEAREEDKYH 63

Query: 1438 IWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKEL 1259
              +L+++L+N SQEI YLQDQLNLRN E + L + V SL+LKLA +  + +E+ +L++EL
Sbjct: 64   TQKLKSELENCSQEIDYLQDQLNLRNEEMDSLSKCVCSLQLKLANLENMEEEVTRLREEL 123

Query: 1258 VKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQLS 1079
              SN++R  L+++LE+KE++++ S LCIE+                 SMKLD+ A+EQ  
Sbjct: 124  ETSNAERLYLLQQLESKELEIEGSALCIERLEESVASVGLEHQFEIESMKLDLIAMEQNY 183

Query: 1078 SEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISERNAKTLC 899
             +AKK QD  A +   M+EL+ + ++Q  D++ +I             L  SE NA+T  
Sbjct: 184  FKAKKSQDETAQDSAMMNELIHDLQLQIYDAEKVIESLEKENVNLREQLQTSELNARTFS 243

Query: 898  RKVEEYLGEWLGKHAIVDIPSCRSELLVSKEIGTCEEVLGLLLSKL-EIVAEDKNTKDEN 722
             KVEE     L +  I +     S          C ++LG LL KL  +   D +  D+ 
Sbjct: 244  EKVEE-----LFRGLIPNNDDSSSSKEDDSASSCCGDILGPLLIKLASLGLSDVDLTDKM 298

Query: 721  EKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRACI 542
            +KM+ +I                         EDL QEMAELRYQ+TG+LEEE KRRAC+
Sbjct: 299  KKMAGQIKNYESLVKQLKEELRMEKLKAKEESEDLAQEMAELRYQMTGLLEEERKRRACV 358

Query: 541  EQASLQRISELEAQVGKEQKKS 476
            EQ SLQRI+ELEAQV KE  KS
Sbjct: 359  EQLSLQRIAELEAQVEKESMKS 380


>ref|NP_001190585.1| uncharacterized protein [Arabidopsis thaliana]
            gi|332010052|gb|AED97435.1| uncharacterized protein
            AT5G61200 [Arabidopsis thaliana]
          Length = 389

 Score =  251 bits (642), Expect = 5e-64
 Identities = 157/390 (40%), Positives = 217/390 (55%)
 Frame = -3

Query: 1630 SGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSK 1451
            S  RSD D SF+ DELLQ+G+ C ELR+E  MLRESQS+S EL+RRLEL+   LSE+R +
Sbjct: 15   SSSRSDVDNSFDADELLQIGSRCMELRREKEMLRESQSQSVELVRRLELNANSLSESRLE 74

Query: 1450 DAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQL 1271
            D + I  LE +L N  QEI YL+DQ+N R+ E N L EHV  LE+++ +  KL +E+  L
Sbjct: 75   DKRRIQMLEKELLNCYQEIDYLRDQVNFRSQEMNDLSEHVLDLEVRVTKSGKLEEEVNYL 134

Query: 1270 KKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITAL 1091
            ++EL  S S++  L++ELE+ E +LQ S+  +EK                 S+KLDI AL
Sbjct: 135  REELCSSKSEQLLLLQELESTETELQFSLFSVEKLEESVSSLTLESQCEIESIKLDIVAL 194

Query: 1090 EQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISERNA 911
            EQ   +A+KFQ  +  E  ++ E+V+E  + S++++    C              SERN 
Sbjct: 195  EQALFDAQKFQGESIQENDKLREIVKELRLNSREAEENAECLEKQNKELMERCVASERNI 254

Query: 910  KTLCRKVEEYLGEWLGKHAIVDIPSCRSELLVSKEIGTCEEVLGLLLSKLEIVAEDKNTK 731
            K L +                   S R  L    E     +    ++ KLE V +D   +
Sbjct: 255  KDLRQ-------------------SFRGRLESESEAPVNPDCFHDIIKKLE-VFQDGKLR 294

Query: 730  DENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRR 551
            D+ E M+ +I                         EDLTQEMAELRY++T +LEEECKRR
Sbjct: 295  DKMEDMARQILQYKDLVKQLKDELKEEKLKAKEEAEDLTQEMAELRYEMTCLLEEECKRR 354

Query: 550  ACIEQASLQRISELEAQVGKEQKKSSIALI 461
            ACIEQASLQRI+ LEAQ+ +E+ KSS  L+
Sbjct: 355  ACIEQASLQRIANLEAQIKREKNKSSTCLV 384


>ref|XP_007037496.1| Myosin heavy chain-related, putative isoform 3 [Theobroma cacao]
            gi|508774741|gb|EOY21997.1| Myosin heavy chain-related,
            putative isoform 3 [Theobroma cacao]
          Length = 324

 Score =  249 bits (635), Expect = 3e-63
 Identities = 150/317 (47%), Positives = 202/317 (63%), Gaps = 5/317 (1%)
 Frame = -3

Query: 1636 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1457
            MSS  +S+ D S NV+ELL++ T CRELRKE  ML+ESQS+  ELIR LE+ VK LSEAR
Sbjct: 1    MSSSSKSEGDNSINVEELLEIETRCRELRKEKEMLKESQSQGFELIRSLEVHVKSLSEAR 60

Query: 1456 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1277
             +D K+I +LE +LKN SQEI YLQDQL+ RN E N L EHVH LE+KLA+   L +++ 
Sbjct: 61   VQDKKHIKKLEGELKNCSQEIDYLQDQLSARNEEVNFLTEHVHDLEIKLADKGNLQEKVD 120

Query: 1276 QLKKELVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1097
            +L  EL  SNSDR SLM+E+ENKE +LQ S LCIEK                 SMKLDIT
Sbjct: 121  RLIGELNSSNSDRLSLMQEIENKEEELQQSALCIEKLEESVSSMALESQCEIESMKLDIT 180

Query: 1096 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISER 917
            ALEQ+  EA K ++    EK RM+ L+EE EVQ Q++  +I             L  SE+
Sbjct: 181  ALEQMCLEANKTEE----EKSRMNILIEELEVQLQNALKIIEGLDDENKELRGKLITSEK 236

Query: 916  NAKTLCRKVEEYL----GEWLGKHAIVDIPSCRSELLVSKEIGTCEEVLGLLLSKLEIVA 749
            NAK  C+K++E+L       L  H++  +    S + +SK+I  C+E+   LLS++ ++ 
Sbjct: 237  NAKIFCQKIKEWLKSKDRSQLNMHSV--LGEQESMMTISKDISGCKELFSALLSEVALLL 294

Query: 748  E-DKNTKDENEKMSHKI 701
            E D ++K++ E MSH+I
Sbjct: 295  ESDADSKEQYESMSHQI 311


>ref|XP_006358272.1| PREDICTED: intracellular protein transport protein USO1-like isoform
            X2 [Solanum tuberosum]
          Length = 375

 Score =  248 bits (633), Expect = 6e-63
 Identities = 153/374 (40%), Positives = 219/374 (58%), Gaps = 1/374 (0%)
 Frame = -3

Query: 1618 SDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAKY 1439
            S  ++SF+V ELL++   C+ELRKE + LR SQ +S ELIR++E  V+ LSEAR +D  +
Sbjct: 4    SSGEHSFDVKELLEIRARCKELRKEKDTLRGSQGQSVELIRKIEQHVQTLSEAREEDKYH 63

Query: 1438 IWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKEL 1259
              +L+++L+N SQEI YLQDQLNLRN E + L + V SL+LKLA +  + +E+ +L++EL
Sbjct: 64   TQKLKSELENCSQEIDYLQDQLNLRNEEMDSLSKCVCSLQLKLANLENMEEEVTRLREEL 123

Query: 1258 VKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQLS 1079
              SN++R  L+++LE+KE++++ S LCIE+                 SMKLD+ A+EQ  
Sbjct: 124  ETSNAERLYLLQQLESKELEIEGSALCIERLEESVASVGLEHQFEIESMKLDLIAMEQNY 183

Query: 1078 SEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISERNAKTLC 899
             +AKK QD  A +   M+EL+ + ++Q  D++ +I             L  SE NA+T  
Sbjct: 184  FKAKKSQDETAQDSAMMNELIHDLQLQIYDAEKVIESLEKENVNLREQLQTSELNARTFS 243

Query: 898  RKVEEYLGEWLGKHAIVDIPSCRSELLVSKEIGTCEEVLGLLLSKL-EIVAEDKNTKDEN 722
             KVEE     L +  I +     S          C ++LG LL KL  +   D +  D+ 
Sbjct: 244  EKVEE-----LFRGLIPNNDDSSSSKEDDSASSCCGDILGPLLIKLASLGLSDVDLTDKM 298

Query: 721  EKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRACI 542
            +KM+ +I                         EDL QEMAELRYQ+TG+LEEE KRRAC+
Sbjct: 299  KKMAGQIKNYESLVKQLKEELRMEKLKAKEESEDLAQEMAELRYQMTGLLEEERKRRACV 358

Query: 541  EQASLQRISELEAQ 500
            EQ SLQRI+ELEAQ
Sbjct: 359  EQLSLQRIAELEAQ 372


>ref|XP_002873329.1| hypothetical protein ARALYDRAFT_487621 [Arabidopsis lyrata subsp.
            lyrata] gi|297319166|gb|EFH49588.1| hypothetical protein
            ARALYDRAFT_487621 [Arabidopsis lyrata subsp. lyrata]
          Length = 409

 Score =  244 bits (623), Expect = 9e-62
 Identities = 157/385 (40%), Positives = 218/385 (56%), Gaps = 2/385 (0%)
 Frame = -3

Query: 1621 RSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAK 1442
            RSD + SF+V+ELLQ+GTT RELRK+ +MLRESQ  S EL+RRLEL  K LSE+R +D  
Sbjct: 19   RSDCENSFDVEELLQIGTTRRELRKQKDMLRESQPHSIELVRRLELHTKSLSESRLEDTA 78

Query: 1441 YIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKE 1262
             I  +E +L N  +EI YL+DQL  R+ E N L EH+H LE KLAE   L +E+  L+ E
Sbjct: 79   RIQMMEKELLNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDE 138

Query: 1261 LVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQL 1082
            L  S S+   L++ELE+KE++LQ S L +EK                 SMK+DITALEQ 
Sbjct: 139  LCMSKSEHLLLLQELESKEIELQCSSLSLEKLEETISSLTLESLCEIESMKIDITALEQA 198

Query: 1081 SSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISERNAKTL 902
              +A K Q+ +  EK ++  ++EE + QSQ +Q  ++               SE++ K  
Sbjct: 199  LFDAMKIQEESIQEKHQLKGIIEESQFQSQRAQENVKYIEKQNEELREKFNASEKSIKEF 258

Query: 901  CRKVEEYLGEWLGKHAIVD--IPSCRSELLVSKEIGTCEEVLGLLLSKLEIVAEDKNTKD 728
             +  +E L     +   V          L +S E+  C +    ++ KLE+ +++ N  D
Sbjct: 259  FQSTKERLESEDEEPLTVGCFFAELSHVLPMSNEVRNCFDA---IMKKLEL-SQNVNLTD 314

Query: 727  ENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRA 548
            + E M+ +I                         EDLTQEMAELRY++T +L+EE  RR 
Sbjct: 315  KVEGMAKQIHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRV 374

Query: 547  CIEQASLQRISELEAQVGKEQKKSS 473
            CIEQASLQRI+ELEAQ+ +E KK S
Sbjct: 375  CIEQASLQRIAELEAQIKREIKKPS 399


>ref|NP_196406.2| myosin heavy chain-like protein [Arabidopsis thaliana]
            gi|79327239|ref|NP_001031851.1| myosin heavy chain-like
            protein [Arabidopsis thaliana]
            gi|222423567|dbj|BAH19753.1| AT5G07890 [Arabidopsis
            thaliana] gi|332003833|gb|AED91216.1| myosin heavy
            chain-like protein [Arabidopsis thaliana]
            gi|332003835|gb|AED91218.1| myosin heavy chain-like
            protein [Arabidopsis thaliana]
          Length = 409

 Score =  240 bits (613), Expect = 1e-60
 Identities = 155/383 (40%), Positives = 218/383 (56%), Gaps = 2/383 (0%)
 Frame = -3

Query: 1621 RSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAK 1442
            RSD + SF+V++LLQ+GTT RELRK+ ++LRESQ  S EL+RRLEL  K LSE+R +D  
Sbjct: 19   RSDCENSFDVEDLLQIGTTRRELRKQKDLLRESQPHSIELVRRLELHTKSLSESRLEDTA 78

Query: 1441 YIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKE 1262
             I  +E +L N  +EI YL+DQL  R+ E N L EH+H LE KLAE   L +E+  L+ E
Sbjct: 79   RIQMMEKELLNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDE 138

Query: 1261 LVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQL 1082
            L  S S+   L++ELE+KE++LQ S L +EK                 SMKLDITALEQ 
Sbjct: 139  LCMSKSEHLLLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQA 198

Query: 1081 SSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISERNAKTL 902
              +A K Q+ +  EK ++  ++EE + QSQ ++  ++               SE++ K  
Sbjct: 199  LFDAMKIQEESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDF 258

Query: 901  CRKVEEYL--GEWLGKHAIVDIPSCRSELLVSKEIGTCEEVLGLLLSKLEIVAEDKNTKD 728
             +  +E L   +    +A+         L VS E+  C +    ++ KLE+ +++ N  D
Sbjct: 259  FQSTKERLESEDEQPLNAMCFFAELSHVLPVSNEVRNCFDA---IMKKLEL-SQNVNLID 314

Query: 727  ENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRA 548
            + E M  +I                         EDLTQEMAELRY++T +L+EE  RR 
Sbjct: 315  KVEGMGKQIHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRV 374

Query: 547  CIEQASLQRISELEAQVGKEQKK 479
            CIEQASLQRISELEAQ+ ++ KK
Sbjct: 375  CIEQASLQRISELEAQIKRDVKK 397


>ref|XP_004137423.1| PREDICTED: uncharacterized protein LOC101221046 [Cucumis sativus]
            gi|449486970|ref|XP_004157457.1| PREDICTED:
            uncharacterized protein LOC101230337 [Cucumis sativus]
          Length = 390

 Score =  239 bits (609), Expect = 4e-60
 Identities = 158/383 (41%), Positives = 220/383 (57%), Gaps = 1/383 (0%)
 Frame = -3

Query: 1621 RSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAK 1442
            RS++  S +++ELL++GT CR+L+KE + L +S+ +S ELIR LEL V  LSEAR +D  
Sbjct: 5    RSNSYSSSDLEELLEIGTRCRQLKKEKDTLIDSRPQSFELIR-LELHVNSLSEARKEDKL 63

Query: 1441 YIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKE 1262
             I  LE +L N +QEI YLQDQL  RN E   L +HV SLE KL  +    ++  +L++E
Sbjct: 64   RIENLEKELTNCTQEIDYLQDQLCTRNTELTYLVDHVESLEFKLVHMEHSQEKASKLEEE 123

Query: 1261 LVKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQL 1082
            + +SNS+   LM++L++KE +L+ S   +EK                 SMKLD+ A+EQ 
Sbjct: 124  VKRSNSECLFLMQKLDDKEQELRESNSNVEKLEESISAITLESQCEIESMKLDMLAMEQR 183

Query: 1081 SSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLGISERNAKTL 902
              E KKFQ+ A  +  +MD L+EE     Q++Q  ++            L +S RNA T 
Sbjct: 184  YIETKKFQEEALSQNDKMDRLIEEL----QNAQRNVKFLETENEELQRELDVSTRNASTF 239

Query: 901  CRKVEEYLGEWLGKHAIVDIPSCRSELLVSKEIGTCEEVLGLLLSKLEI-VAEDKNTKDE 725
            CR VEE +     + +   + + R   L S    +C +VLG LL KL + +  D N++ +
Sbjct: 240  CRSVEELIEN--KERSQNTMRNDRDGKLTSILKNSCGDVLGHLLPKLAVALFADANSEAK 297

Query: 724  NEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRAC 545
             + M  +I                         EDL QEMAELRYQITG+LEEECKRRAC
Sbjct: 298  MDVMKKQILDYELLVEQLKEELREEKLKAKEEAEDLAQEMAELRYQITGLLEEECKRRAC 357

Query: 544  IEQASLQRISELEAQVGKEQKKS 476
            IEQASLQRI++LEAQV K Q +S
Sbjct: 358  IEQASLQRIAQLEAQVLKGQNRS 380

