BLASTX nr result

ID: Akebia23_contig00012336 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00012336
         (1772 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI15161.3| unnamed protein product [Vitis vinifera]              345   3e-92
ref|XP_007037494.1| Myosin heavy chain-related, putative isoform...   322   4e-85
ref|XP_006837077.1| hypothetical protein AMTR_s00110p00093580 [A...   314   9e-83
ref|XP_002514690.1| conserved hypothetical protein [Ricinus comm...   313   2e-82
ref|XP_007037497.1| Myosin heavy chain-related, putative isoform...   308   4e-81
ref|XP_007037495.1| Myosin heavy chain-related, putative isoform...   308   4e-81
ref|XP_007209209.1| hypothetical protein PRUPE_ppa006629mg [Prun...   306   3e-80
emb|CAN78532.1| hypothetical protein VITISV_035305 [Vitis vinifera]   299   3e-78
ref|XP_006440698.1| hypothetical protein CICLE_v10020474mg [Citr...   298   4e-78
ref|XP_006477624.1| PREDICTED: tropomyosin-like isoform X1 [Citr...   298   7e-78
ref|XP_002322042.2| hypothetical protein POPTR_0015s03460g [Popu...   297   1e-77
ref|XP_004299323.1| PREDICTED: uncharacterized protein LOC101294...   296   2e-77
gb|EXC11033.1| hypothetical protein L484_015253 [Morus notabilis]     278   4e-72
ref|XP_006358271.1| PREDICTED: intracellular protein transport p...   258   6e-66
ref|XP_006358272.1| PREDICTED: intracellular protein transport p...   250   1e-63
ref|NP_001190585.1| uncharacterized protein [Arabidopsis thalian...   249   2e-63
ref|XP_007037496.1| Myosin heavy chain-related, putative isoform...   248   6e-63
ref|XP_002873329.1| hypothetical protein ARALYDRAFT_487621 [Arab...   246   2e-62
ref|XP_004137423.1| PREDICTED: uncharacterized protein LOC101221...   239   3e-60
ref|NP_196406.2| myosin heavy chain-like protein [Arabidopsis th...   238   5e-60

>emb|CBI15161.3| unnamed protein product [Vitis vinifera]
          Length = 420

 Score =  345 bits (886), Expect = 3e-92
 Identities = 201/405 (49%), Positives = 260/405 (64%), Gaps = 7/405 (1%)
 Frame = -2

Query: 1627 EMSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEA 1448
            +M S  +S++D S+++++LLQ+ T C++L++E NMLRESQS S ELIRRLEL V+ LSEA
Sbjct: 16   KMFSSSKSESDSSYDIEDLLQIETRCKQLKRETNMLRESQSESFELIRRLELHVRTLSEA 75

Query: 1447 RSKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEI 1268
            RS+D K+I ELE +L+N SQEI YLQDQLN R+ E  CLGEHVHSLELKLA+   L   +
Sbjct: 76   RSEDEKHIQELERELRNCSQEIDYLQDQLNARDAEVKCLGEHVHSLELKLADKDNLEDMV 135

Query: 1267 GQLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDI 1088
            G+L +EL +SNS+   LM+ELENKE++LQ S LCI+K                 SMKL++
Sbjct: 136  GRLMQELKRSNSECMLLMQELENKEVELQMSSLCIDKLEESISSVTLEFQCEMESMKLEM 195

Query: 1087 TALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISE 908
              LEQ   EAKK QD A+ EK +M+ L++EF+VQ QD+Q MI C           L+ SE
Sbjct: 196  ITLEQSCFEAKKLQDEASEEKTKMNGLIQEFQVQLQDAQKMIECLDKENKELRGKLKTSE 255

Query: 907  RNAKTLCRKVEEYLGEWLGKHAIVDIPS------CRSEHLVSKEIGTCEEVLGLLLSKLE 746
             +A  L +K++E+  EWL      ++ +        S+  +S E+ T  EVL  L  KL 
Sbjct: 256  MDAILLRQKIKEHSEEWLENKDESELKTQSSSGELESKFNLSTEMSTSAEVLVPLFPKLA 315

Query: 745  IVA-EDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQIT 569
            + A  D   K++ EKMSH+I                         EDL QEMAELRYQIT
Sbjct: 316  VSATSDVGLKEKMEKMSHQIHGYELLVKQLKEELREEKLKAKEEAEDLAQEMAELRYQIT 375

Query: 568  GMLEEECKRRACIEQASLQRISELEAQVGKEQKKSSIALRRYHEA 434
            GMLEEECKRRACIEQASLQRI+ELEAQ+ KEQ KS  A+RR+ EA
Sbjct: 376  GMLEEECKRRACIEQASLQRIAELEAQIQKEQTKSYAAIRRFREA 420


>ref|XP_007037494.1| Myosin heavy chain-related, putative isoform 1 [Theobroma cacao]
            gi|508774739|gb|EOY21995.1| Myosin heavy chain-related,
            putative isoform 1 [Theobroma cacao]
          Length = 396

 Score =  322 bits (824), Expect = 4e-85
 Identities = 193/402 (48%), Positives = 252/402 (62%), Gaps = 5/402 (1%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1445
            MSS  +S+ D S NV+ELL++ T CRELRKE  ML+ESQS+  ELIR LE+ VK LSEAR
Sbjct: 1    MSSSSKSEGDNSINVEELLEIETRCRELRKEKEMLKESQSQGFELIRSLEVHVKSLSEAR 60

Query: 1444 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1265
             +D K+I +LE +LKN SQEI YLQDQL+ RN E N L EHVH LE+KLA+   L +++ 
Sbjct: 61   VQDKKHIKKLEGELKNCSQEIDYLQDQLSARNEEVNFLTEHVHDLEIKLADKGNLQEKVD 120

Query: 1264 QLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1085
            +L  EL  SNSDR SLM+E+ENKE +LQ S LCIEK                 SMKLDIT
Sbjct: 121  RLIGELNSSNSDRLSLMQEIENKEEELQQSALCIEKLEESVSSMALESQCEIESMKLDIT 180

Query: 1084 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISER 905
            ALEQ+  EA K ++    EK RM+ L+EE EVQ Q++  +I             L  SE+
Sbjct: 181  ALEQMCLEANKTEE----EKSRMNILIEELEVQLQNALKIIEGLDDENKELRGKLITSEK 236

Query: 904  NAKTLCRKVEEYL----GEWLGKHAIVDIPSCRSEHLVSKEIGTCEEVLGLLLSKLEIVA 737
            NAK  C+K++E+L       L  H++  +    S   +SK+I  C+E+   LLS++ ++ 
Sbjct: 237  NAKIFCQKIKEWLKSKDRSQLNMHSV--LGEQESMMTISKDISGCKELFSALLSEVALLL 294

Query: 736  E-DKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 560
            E D ++K++ E MSH+I                         EDL QEMAELRY++ G+L
Sbjct: 295  ESDADSKEQYESMSHQINEYELLVKQLKDELREQKLKAKEEAEDLAQEMAELRYRMMGLL 354

Query: 559  EEECKRRACIEQASLQRISELEAQVGKEQKKSSIALRRYHEA 434
            EEECKRRACIEQASLQRI+ELEAQ+ KE +KS   +R  HE+
Sbjct: 355  EEECKRRACIEQASLQRIAELEAQIQKEPQKSDAVVRHLHES 396


>ref|XP_006837077.1| hypothetical protein AMTR_s00110p00093580 [Amborella trichopoda]
            gi|548839670|gb|ERM99930.1| hypothetical protein
            AMTR_s00110p00093580 [Amborella trichopoda]
          Length = 509

 Score =  314 bits (804), Expect = 9e-83
 Identities = 205/505 (40%), Positives = 286/505 (56%), Gaps = 19/505 (3%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRES----QSRSTELIRRLELDVKLL 1457
            MSS F+S+N YS +VDELL+LG  C+ELRKEN++LRES    QS++ E+I+RLE ++K L
Sbjct: 2    MSSSFKSENAYSVDVDELLELGILCQELRKENDILRESLLLEQSKNGEVIKRLESELKEL 61

Query: 1456 SEARSKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLH 1277
             +A S+D  +I  LE++L+  S++IGYLQDQLNL+N+EA+ + EH+HSLELKL E +KLH
Sbjct: 62   HDAHSEDMMHIGSLESELRTCSRKIGYLQDQLNLKNVEASYVAEHIHSLELKLVEAAKLH 121

Query: 1276 QEIGQLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMK 1097
            +++  L++EL KS+S+R +LM ELE K+ +L+NS   IE                  S++
Sbjct: 122  EKVTYLREELEKSDSERLALMEELELKKKELENSAFHIENLEVIISSLTLESQCEIESIR 181

Query: 1096 LDITALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLR 917
             ++ A E   +E K   +NAA E   M +L++ ++ Q ++++ MI             L 
Sbjct: 182  HELVACEAKYTEVKVSNENAAKETAGMADLIKLYKEQFKEAKQMITSLEKENITLQEKLA 241

Query: 916  ISERNAKTLCRKVEEYLGEWLGKHAIVDIPS----------CRSEHLVSKEIGTCEEVLG 767
              E+     C KVE +L + L     + +P             +E  V KEI T EE L 
Sbjct: 242  NCEKTTVLFCHKVETHLDQLL--KGQIRLPMLGFNQSMANLLENELTVEKEISTGEETLL 299

Query: 766  LLLSKLEIV-AEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMA 590
             +LSKL I+ A D+   DE EKMSH+I                         EDLTQEMA
Sbjct: 300  PILSKLSIIDASDECLDDELEKMSHQIRESQLLIEQLREELRKEKARAKEDAEDLTQEMA 359

Query: 589  ELRYQITGMLEEECKRRACIEQASLQRISELEAQVGKEQKKSSIALRRYHEAQKLAESRS 410
            E+RYQ+ GMLEEEC RRACIEQASL RI ELEAQV KE+ +S  A     EA+KLAE RS
Sbjct: 360  EMRYQVMGMLEEECSRRACIEQASLHRIEELEAQVRKEEMRSQAAEICCREAEKLAEDRS 419

Query: 409  MEVHQLKKVL----REGPCKDSKRNEQCSCGECITLRTLDRVDDGLVEAEPVELVSSDDD 242
             EV  LK VL    R+G    +++ E CS  +C+ +       + L   E    ++S+ D
Sbjct: 420  KEVENLKNVLAGLQRDG---GTQKAEACSSEDCLRVEKPSSPSEELAGDE--SKITSNKD 474

Query: 241  RSTLATITWS*RGKEKLCNIGRGIF 167
                A + W     E L +    IF
Sbjct: 475  NEDQAIVAWCKEDPEPLYDERETIF 499


>ref|XP_002514690.1| conserved hypothetical protein [Ricinus communis]
            gi|223546294|gb|EEF47796.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 407

 Score =  313 bits (802), Expect = 2e-82
 Identities = 193/406 (47%), Positives = 248/406 (61%), Gaps = 15/406 (3%)
 Frame = -2

Query: 1606 SDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAKY 1427
            S  D + +V+ELLQ+GT C+ELRKE +MLRESQS+S ELIRRLEL VK LSEA S+D K+
Sbjct: 4    SSGDSTLDVEELLQIGTRCKELRKEKDMLRESQSQSFELIRRLELHVKSLSEAHSEDRKH 63

Query: 1426 IWELENDLKNFSQEI-----------GYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKL 1280
            I +LE +L N SQEI            YLQDQLN RN E   LGEHVH LELKL ++  L
Sbjct: 64   IQKLERELLNCSQEIVWISKIITFLTDYLQDQLNARNAEVYSLGEHVHELELKLVDMDDL 123

Query: 1279 HQEIGQLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSM 1100
              +I QL++EL KS+S+ F L++ELE KE++LQ S+  IEK                 SM
Sbjct: 124  LVKISQLQEELRKSDSECFLLIQELERKEVELQKSVSFIEKLEESVASFTLDSQCEIESM 183

Query: 1099 KLDITALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXL 920
            KLD+ ALEQ   E+KK Q+    EK  MD LV+E + Q  D++ +I+C           L
Sbjct: 184  KLDVMALEQACCESKKKQEETTMEKDTMDGLVQELKNQVYDAEEIIQCLEKENKELRVKL 243

Query: 919  RISERNAKTLCRKVEEYLGEWLGKHAIVDIPSCRSE---HLVSKEIGTCEEVLGLLLSKL 749
              SE N +   +K+EE++      + ++      SE     +SKE+  C EVLGLL SKL
Sbjct: 244  ATSEMNGRIFIQKIEEWMENQ--DNLLLSTQPYSSELEKENMSKEMSACGEVLGLLFSKL 301

Query: 748  EIV-AEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQI 572
             IV A + + K + +++SHKI                         EDL QEMAELR+Q+
Sbjct: 302  AIVLAPESDLKKQMKRLSHKIREYEVLMNQLKEDLREEKLKAKEEAEDLAQEMAELRHQM 361

Query: 571  TGMLEEECKRRACIEQASLQRISELEAQVGKEQKKSSIALRRYHEA 434
            TG+LEEECKRRACIEQASLQRI+ELEAQ+ KEQ+K S A+R  HEA
Sbjct: 362  TGLLEEECKRRACIEQASLQRIAELEAQIQKEQRKPSFAIRTLHEA 407


>ref|XP_007037497.1| Myosin heavy chain-related, putative isoform 4 [Theobroma cacao]
            gi|508774742|gb|EOY21998.1| Myosin heavy chain-related,
            putative isoform 4 [Theobroma cacao]
          Length = 383

 Score =  308 bits (790), Expect = 4e-81
 Identities = 187/385 (48%), Positives = 242/385 (62%), Gaps = 5/385 (1%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1445
            MSS  +S+ D S NV+ELL++ T CRELRKE  ML+ESQS+  ELIR LE+ VK LSEAR
Sbjct: 1    MSSSSKSEGDNSINVEELLEIETRCRELRKEKEMLKESQSQGFELIRSLEVHVKSLSEAR 60

Query: 1444 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1265
             +D K+I +LE +LKN SQEI YLQDQL+ RN E N L EHVH LE+KLA+   L +++ 
Sbjct: 61   VQDKKHIKKLEGELKNCSQEIDYLQDQLSARNEEVNFLTEHVHDLEIKLADKGNLQEKVD 120

Query: 1264 QLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1085
            +L  EL  SNSDR SLM+E+ENKE +LQ S LCIEK                 SMKLDIT
Sbjct: 121  RLIGELNSSNSDRLSLMQEIENKEEELQQSALCIEKLEESVSSMALESQCEIESMKLDIT 180

Query: 1084 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISER 905
            ALEQ+  EA K ++    EK RM+ L+EE EVQ Q++  +I             L  SE+
Sbjct: 181  ALEQMCLEANKTEE----EKSRMNILIEELEVQLQNALKIIEGLDDENKELRGKLITSEK 236

Query: 904  NAKTLCRKVEEYL----GEWLGKHAIVDIPSCRSEHLVSKEIGTCEEVLGLLLSKLEIVA 737
            NAK  C+K++E+L       L  H++  +    S   +SK+I  C+E+   LLS++ ++ 
Sbjct: 237  NAKIFCQKIKEWLKSKDRSQLNMHSV--LGEQESMMTISKDISGCKELFSALLSEVALLL 294

Query: 736  E-DKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 560
            E D ++K++ E MSH+I                         EDL QEMAELRY++ G+L
Sbjct: 295  ESDADSKEQYESMSHQINEYELLVKQLKDELREQKLKAKEEAEDLAQEMAELRYRMMGLL 354

Query: 559  EEECKRRACIEQASLQRISELEAQV 485
            EEECKRRACIEQASLQRI+ELEAQV
Sbjct: 355  EEECKRRACIEQASLQRIAELEAQV 379


>ref|XP_007037495.1| Myosin heavy chain-related, putative isoform 2 [Theobroma cacao]
            gi|508774740|gb|EOY21996.1| Myosin heavy chain-related,
            putative isoform 2 [Theobroma cacao]
          Length = 406

 Score =  308 bits (790), Expect = 4e-81
 Identities = 188/389 (48%), Positives = 243/389 (62%), Gaps = 5/389 (1%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1445
            MSS  +S+ D S NV+ELL++ T CRELRKE  ML+ESQS+  ELIR LE+ VK LSEAR
Sbjct: 1    MSSSSKSEGDNSINVEELLEIETRCRELRKEKEMLKESQSQGFELIRSLEVHVKSLSEAR 60

Query: 1444 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1265
             +D K+I +LE +LKN SQEI YLQDQL+ RN E N L EHVH LE+KLA+   L +++ 
Sbjct: 61   VQDKKHIKKLEGELKNCSQEIDYLQDQLSARNEEVNFLTEHVHDLEIKLADKGNLQEKVD 120

Query: 1264 QLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1085
            +L  EL  SNSDR SLM+E+ENKE +LQ S LCIEK                 SMKLDIT
Sbjct: 121  RLIGELNSSNSDRLSLMQEIENKEEELQQSALCIEKLEESVSSMALESQCEIESMKLDIT 180

Query: 1084 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISER 905
            ALEQ+  EA K ++    EK RM+ L+EE EVQ Q++  +I             L  SE+
Sbjct: 181  ALEQMCLEANKTEE----EKSRMNILIEELEVQLQNALKIIEGLDDENKELRGKLITSEK 236

Query: 904  NAKTLCRKVEEYL----GEWLGKHAIVDIPSCRSEHLVSKEIGTCEEVLGLLLSKLEIVA 737
            NAK  C+K++E+L       L  H++  +    S   +SK+I  C+E+   LLS++ ++ 
Sbjct: 237  NAKIFCQKIKEWLKSKDRSQLNMHSV--LGEQESMMTISKDISGCKELFSALLSEVALLL 294

Query: 736  E-DKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 560
            E D ++K++ E MSH+I                         EDL QEMAELRY++ G+L
Sbjct: 295  ESDADSKEQYESMSHQINEYELLVKQLKDELREQKLKAKEEAEDLAQEMAELRYRMMGLL 354

Query: 559  EEECKRRACIEQASLQRISELEAQVGKEQ 473
            EEECKRRACIEQASLQRI+ELEAQ  K Q
Sbjct: 355  EEECKRRACIEQASLQRIAELEAQSLKNQ 383


>ref|XP_007209209.1| hypothetical protein PRUPE_ppa006629mg [Prunus persica]
            gi|462404944|gb|EMJ10408.1| hypothetical protein
            PRUPE_ppa006629mg [Prunus persica]
          Length = 402

 Score =  306 bits (783), Expect = 3e-80
 Identities = 192/407 (47%), Positives = 243/407 (59%), Gaps = 9/407 (2%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1445
            MSS  + +   SF+V+ELLQ+GT CREL+KE +ML+ES S+S  LIRRLE+ V  LSEA 
Sbjct: 1    MSSSTKGNTVSSFDVEELLQIGTRCRELKKEKDMLKESHSQSFGLIRRLEVHVNSLSEAC 60

Query: 1444 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1265
            ++D K I  LE +LKN SQEI YLQDQLN RN E N L EH H LE KLA++  L + + 
Sbjct: 61   TEDKKQIQVLEKELKNCSQEIDYLQDQLNARNTEVNLLEEHTHGLEFKLADMENLQETVD 120

Query: 1264 QLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1085
            +L+ EL KS S+R  LM ELE+KE++LQNS LCI++                 SMKLDI 
Sbjct: 121  RLRDELKKSYSERMFLMEELESKEIELQNSALCIDELEESISSMSLESQCEIESMKLDIL 180

Query: 1084 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISER 905
            ALE    E KK Q+    EK RM EL++E EVQ Q++   +             L  SE 
Sbjct: 181  ALEHSFLEVKKIQEETVQEKTRMSELIQELEVQCQNAHKTVESLYMENKELRKKLDASET 240

Query: 904  NAKTLCRKVEEYLGEWLGKHAI-----VDIPSCRSEHLVSKEIGTCEEVLGLLLSKLEI- 743
            + +  C++VE    +WL K  I       +      ++ SKE+ +C EVLG L SKL I 
Sbjct: 241  STRIFCQRVE----KWLEKDRIQLDSESPLGQLEGNYIYSKEM-SCGEVLGPLFSKLAIV 295

Query: 742  VAEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGM 563
            VA D ++  + EKMSH I                         EDL QEMAELRY++TG+
Sbjct: 296  VAPDADSIMKMEKMSHHIQDYELLVKQLKEELKEEKLKAKEEAEDLAQEMAELRYRMTGL 355

Query: 562  LEEECKRRACIEQASLQRISELEAQVGKEQK---KSSIALRRYHEAQ 431
            LEEECKRRACIEQASLQRI+ELEAQV KE+    KS  ALR  +EA+
Sbjct: 356  LEEECKRRACIEQASLQRIAELEAQVTKERTQSVKSFAALRHLNEAK 402


>emb|CAN78532.1| hypothetical protein VITISV_035305 [Vitis vinifera]
          Length = 1164

 Score =  299 bits (765), Expect = 3e-78
 Identities = 185/408 (45%), Positives = 241/408 (59%), Gaps = 11/408 (2%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1445
            M S  +S++D S+++++LLQ+ T C++                    RLEL V+ LSEAR
Sbjct: 777  MFSSSKSESDSSYDIEDLLQIETRCKQ--------------------RLELHVRTLSEAR 816

Query: 1444 SKDAKYIWELENDLKNFSQEI----GYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLH 1277
            S+D K+I ELE +L+N SQEI     YLQDQLN R+ E  CLGEH HSLELKLA+   L 
Sbjct: 817  SEDEKHIQELERELRNCSQEIVFLVDYLQDQLNARDAEVKCLGEHAHSLELKLADKDNLE 876

Query: 1276 QEIGQLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMK 1097
              +G+L +EL +SNS+   LM+ELENKE++LQ S LCI+K                 SMK
Sbjct: 877  DMVGRLMEELKRSNSECMFLMQELENKEVELQTSSLCIDKLEESISSVTLEFQCEIESMK 936

Query: 1096 LDITALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLR 917
            L++  LEQ   EAKK QD A+ EK +M+ L++EF+VQ QD+Q MI C           L+
Sbjct: 937  LEMITLEQSCFEAKKLQDEASEEKTKMNGLIQEFQVQLQDAQKMIECLDKENKELRGKLK 996

Query: 916  ISERNAKTLCRKVEEYLGEWLGKHAIVDIPS------CRSEHLVSKEIGTCEEVLGLLLS 755
             SE +A  L +K++E+  EWL      ++ +        S+  +S E+ T  EVL  L  
Sbjct: 997  TSEMDAILLRQKIKEHSEEWLENKDESELKTQSSSGELESKFNLSTEMSTSAEVLVPLFP 1056

Query: 754  KLEIVA-EDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRY 578
            KL + A  D   K++ EKMSH+I                         EDL QEMAELRY
Sbjct: 1057 KLAVSATSDVXLKEKMEKMSHQIHGYELLVKQLKEELREEKLKAKEEAEDLAQEMAELRY 1116

Query: 577  QITGMLEEECKRRACIEQASLQRISELEAQVGKEQKKSSIALRRYHEA 434
            QITGMLEEECKRRACIEQASLQRI+ELEAQ+ KEQ KS  A+RR+ EA
Sbjct: 1117 QITGMLEEECKRRACIEQASLQRIAELEAQIQKEQTKSYAAIRRFREA 1164


>ref|XP_006440698.1| hypothetical protein CICLE_v10020474mg [Citrus clementina]
            gi|557542960|gb|ESR53938.1| hypothetical protein
            CICLE_v10020474mg [Citrus clementina]
          Length = 399

 Score =  298 bits (764), Expect = 4e-78
 Identities = 185/391 (47%), Positives = 237/391 (60%), Gaps = 5/391 (1%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1445
            MS   RSD +  F+V+ELLQ+ T CRELRKE + LRESQS+S +LI+RLEL  K LSEA 
Sbjct: 1    MSISSRSDGESVFDVEELLQIETRCRELRKEKDTLRESQSQSFDLIKRLELHAKSLSEAH 60

Query: 1444 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1265
            ++D K+I +LE +L N SQEI YLQDQLN RN E   L EHVHSLELKL ++  L  ++G
Sbjct: 61   NEDKKHIQKLERELMNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVG 120

Query: 1264 QLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1085
            QL++EL +S+S+   LM EL++KE  L+NS L I+K                 S+K+D+ 
Sbjct: 121  QLEEELRRSDSECLLLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIASLKIDMI 180

Query: 1084 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISER 905
            ALEQ   EAKK       EK RM+ L++E EV++QDSQ +I C           L   E 
Sbjct: 181  ALEQTCVEAKKVHKENVQEKVRMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYET 240

Query: 904  NAKTLCRKVEEYLGEWLGKHAIVDIPSCRSE----HLVSKEIGTCEEVLGLLLSKLEIV- 740
            N +  C+K+EE++ +   K   +DI S  SE      VSKE   C +V G LLSKL +V 
Sbjct: 241  NGRVFCQKIEEWMEKEDRKQ--LDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVL 298

Query: 739  AEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 560
              D N K++ + MS +I                         EDL QEMAELRYQ+T +L
Sbjct: 299  GPDANLKEKIKGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLL 358

Query: 559  EEECKRRACIEQASLQRISELEAQVGKEQKK 467
            EEECKRRACIEQASLQRI+ELE Q+ K Q K
Sbjct: 359  EEECKRRACIEQASLQRIAELETQIEKGQNK 389


>ref|XP_006477624.1| PREDICTED: tropomyosin-like isoform X1 [Citrus sinensis]
          Length = 399

 Score =  298 bits (762), Expect = 7e-78
 Identities = 184/391 (47%), Positives = 238/391 (60%), Gaps = 5/391 (1%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1445
            MS   +SD +  F+V+ELLQ+ T CRELRKE + LRESQS+S +LI+RLE+  K LSEA 
Sbjct: 1    MSISSKSDGESVFDVEELLQIETRCRELRKEKDTLRESQSQSFDLIKRLEIHAKSLSEAH 60

Query: 1444 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1265
            ++D K+I +LE +L N SQEI YLQDQLN RN E   L EHVHSLELKL ++  L  ++G
Sbjct: 61   NEDKKHIQKLERELMNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVG 120

Query: 1264 QLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1085
            QL++EL +S+S+   LM EL++KE  L+NS L I+K                 S+K+D+ 
Sbjct: 121  QLEEELRRSDSECLLLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMI 180

Query: 1084 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISER 905
            ALEQ   EAKK       EK RM+ L++E EV++QDSQ +I C           L   E 
Sbjct: 181  ALEQTCVEAKKVHKENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYET 240

Query: 904  NAKTLCRKVEEYLGEWLGKHAIVDIPSCRSE----HLVSKEIGTCEEVLGLLLSKLEIV- 740
            N +  C+K+EE++ +   K   +DI S  SE      VSKE   C +V G LLSKL +V 
Sbjct: 241  NGRVFCQKIEEWMEKEDRKQ--LDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVL 298

Query: 739  AEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 560
            A D N K++ + MS +I                         EDL QEMAELRYQ+T +L
Sbjct: 299  APDANLKEKIKGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLL 358

Query: 559  EEECKRRACIEQASLQRISELEAQVGKEQKK 467
            EEECKRRACIEQASLQRI+ELE Q+ K Q K
Sbjct: 359  EEECKRRACIEQASLQRIAELETQIEKGQNK 389


>ref|XP_002322042.2| hypothetical protein POPTR_0015s03460g [Populus trichocarpa]
            gi|550321847|gb|EEF06169.2| hypothetical protein
            POPTR_0015s03460g [Populus trichocarpa]
          Length = 406

 Score =  297 bits (760), Expect = 1e-77
 Identities = 188/410 (45%), Positives = 243/410 (59%), Gaps = 13/410 (3%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1445
            MSS  +SD D SF+ +ELLQ+GT CRELRKE +MLR+SQ +S ELIRRLEL VK LSEAR
Sbjct: 1    MSSSSKSDGDSSFDAEELLQIGTRCRELRKEKDMLRDSQPQSFELIRRLELHVKQLSEAR 60

Query: 1444 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1265
            ++D K+I +LE +L N SQEI YLQDQLN RN E   LG HVH LELKLA +  L    G
Sbjct: 61   TEDKKHIQKLERELLNCSQEIDYLQDQLNARNSEVYTLGGHVHELELKLANMEHLQANNG 120

Query: 1264 QLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1085
            QL++EL + +S+   L++ELE+KE++LQ S LCI K                 SMKLD+ 
Sbjct: 121  QLREELKRCDSEHLLLLQELESKEIELQESALCIGKLEESISSLTLDSQCEIESMKLDMI 180

Query: 1084 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISER 905
            ALEQ   +AKK Q+    E  RM+ L++E E Q  +++  I C           L  S+ 
Sbjct: 181  ALEQACFKAKKTQEETIQENARMNGLIKELEFQILEAKETIECVEKENIELRDKLVTSDV 240

Query: 904  NAKTLCRKVEEYLGEWLGKHAIVDIPSCRSE----HLVSKEIGTCEEVLGLLLSKL-EIV 740
            N+K   +++EE+L       + ++  SC SE      +SKE+    E LG   SKL  ++
Sbjct: 241  NSKLFLQQIEEWLEN--KDTSQLNTQSCSSEIEHQSNMSKEM---REALGPCFSKLATLL 295

Query: 739  AEDKNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGML 560
              + N K+  E MSH+I                         +DL QEMAELRYQ+TG+L
Sbjct: 296  GSESNLKEWMESMSHQIRKYEVLVKQLKDELREEKSKAKEEADDLAQEMAELRYQMTGLL 355

Query: 559  EEECKRRACIEQASLQRISELEAQV--------GKEQKKSSIALRRYHEA 434
            EEECKRRACIEQASLQRISELEAQV         +E++K   A+   HEA
Sbjct: 356  EEECKRRACIEQASLQRISELEAQVFLVFPSKIERERRKFFAAVGHLHEA 405


>ref|XP_004299323.1| PREDICTED: uncharacterized protein LOC101294367 [Fragaria vesca
            subsp. vesca]
          Length = 395

 Score =  296 bits (759), Expect = 2e-77
 Identities = 183/387 (47%), Positives = 238/387 (61%), Gaps = 2/387 (0%)
 Frame = -2

Query: 1606 SDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAKY 1427
            S +D SF+++ELLQ+G+ CREL+KE +ML+ESQS+S  LIR L++ +K LSE  ++D K 
Sbjct: 4    SSSDSSFDIEELLQIGSRCRELKKEKDMLKESQSQSFGLIRSLDVHMKSLSEFHTEDKKQ 63

Query: 1426 IWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKEL 1247
            I  LE +LKN SQEI YLQDQLN R+ E N L EHVHSLELKLA++  L   + +L+ EL
Sbjct: 64   IQMLEKELKNCSQEIDYLQDQLNARDTEVNLLQEHVHSLELKLADMETLQVTVDRLRDEL 123

Query: 1246 AKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQLS 1067
             KS S+   LM+ELENKE++LQNS L IEK                 SMKLD+ ALEQ  
Sbjct: 124  KKSYSECLFLMQELENKEVELQNSNLFIEKLEESVSSISLESQCEIESMKLDMLALEQSF 183

Query: 1066 SEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISERNAKTLC 887
             EAKK Q+    EK RM+EL++E EVQ QD+Q                L  SE N +  C
Sbjct: 184  LEAKKIQEETVQEKTRMNELIQELEVQCQDAQKTTDDLYIENKELREKLDTSETNTRIFC 243

Query: 886  RKVEEYLGEWLGKHAIVDIPSCRSEHLV-SKEIGTCEEVLGLLLSKL-EIVAEDKNTKDE 713
            +++E++L     +  +  + + + E    S ++ TC EVL  L SKL +++A D N   +
Sbjct: 244  QRIEKWLENDRYESKLESLLNEQDEKCTFSTDMSTCGEVLEPLFSKLAKVLAPDANFIVK 303

Query: 712  NEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRAC 533
             ++MSH+I                         EDL QEMAELRYQ+TG+LEEECKRRA 
Sbjct: 304  MKEMSHQIHEYELLVKQLKEELREEKLKAKEEAEDLAQEMAELRYQLTGLLEEECKRRAY 363

Query: 532  IEQASLQRISELEAQVGKEQKKSSIAL 452
            IEQASLQRISELEAQV K + KSS  L
Sbjct: 364  IEQASLQRISELEAQVHKARTKSSTCL 390


>gb|EXC11033.1| hypothetical protein L484_015253 [Morus notabilis]
          Length = 380

 Score =  278 bits (712), Expect = 4e-72
 Identities = 176/399 (44%), Positives = 239/399 (59%), Gaps = 2/399 (0%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1445
            MSS  RS +D + +V+ELLQ+GT CRELR+E +ML+ESQS+S +LIRRLE  V  LS A 
Sbjct: 1    MSSHSRSQSDNTSDVEELLQIGTRCRELRREKDMLKESQSQSFDLIRRLERHVTSLSAAS 60

Query: 1444 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1265
            ++D K I  LE +L N SQEI YLQDQ N RN E N L +H+  LELKLA++  L + +G
Sbjct: 61   TEDKKCIEMLEKELMNCSQEIDYLQDQGNARNTEVNVLKDHLRDLELKLADMEYLQEAVG 120

Query: 1264 QLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1085
            +L++EL +S+SD   LM+ELE++E++LQNS LCIE+                 S+KL+I 
Sbjct: 121  RLREELKRSDSDCLFLMQELESREVELQNSSLCIERLRMSISSITLDSQCEIESLKLEIV 180

Query: 1084 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISER 905
             LEQ   EA+K Q+ A  EK R+++LV + E Q QD+Q  IR            L  SE 
Sbjct: 181  TLEQSCFEAEKSQEKAIQEKARINQLVRDLEAQFQDAQKNIRRLELENKELREKLDTSET 240

Query: 904  NAKTLCRKVEEYLGEWLGKHAIVD-IPSCRSEHLVSKEIGTCEEVLGLLLSKLE-IVAED 731
              +T  + +E+ L     +  I   +    ++ ++S +  TC EVL  L+SKLE ++  D
Sbjct: 241  KVRTFWQMLEKLLARDGSQPDIKQLVNEIEAKLMMSNDPSTCGEVLSPLISKLETLLGRD 300

Query: 730  KNTKDENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEE 551
             +  ++ E    K+                         EDL QEMAELRYQ+TG+LEEE
Sbjct: 301  GDDMEKEELREEKL-------------------KAKEEAEDLAQEMAELRYQMTGLLEEE 341

Query: 550  CKRRACIEQASLQRISELEAQVGKEQKKSSIALRRYHEA 434
              RRACIEQAS QRI+ELEAQV KEQ+KS  A++  H A
Sbjct: 342  RNRRACIEQASTQRIAELEAQVQKEQRKSLDAVKYLHGA 380


>ref|XP_006358271.1| PREDICTED: intracellular protein transport protein USO1-like isoform
            X1 [Solanum tuberosum]
          Length = 399

 Score =  258 bits (659), Expect = 6e-66
 Identities = 158/382 (41%), Positives = 227/382 (59%), Gaps = 1/382 (0%)
 Frame = -2

Query: 1606 SDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAKY 1427
            S  ++SF+V ELL++   C+ELRKE + LR SQ +S ELIR++E  V+ LSEAR +D  +
Sbjct: 4    SSGEHSFDVKELLEIRARCKELRKEKDTLRGSQGQSVELIRKIEQHVQTLSEAREEDKYH 63

Query: 1426 IWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKEL 1247
              +L+++L+N SQEI YLQDQLNLRN E + L + V SL+LKLA +  + +E+ +L++EL
Sbjct: 64   TQKLKSELENCSQEIDYLQDQLNLRNEEMDSLSKCVCSLQLKLANLENMEEEVTRLREEL 123

Query: 1246 AKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQLS 1067
              SN++R  L+++LE+KE++++ S LCIE+                 SMKLD+ A+EQ  
Sbjct: 124  ETSNAERLYLLQQLESKELEIEGSALCIERLEESVASVGLEHQFEIESMKLDLIAMEQNY 183

Query: 1066 SEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISERNAKTLC 887
             +AKK QD  A +   M+EL+ + ++Q  D++ +I             L+ SE NA+T  
Sbjct: 184  FKAKKSQDETAQDSAMMNELIHDLQLQIYDAEKVIESLEKENVNLREQLQTSELNARTFS 243

Query: 886  RKVEEYLGEWLGKHAIVDIPSCRSEHLVSKEIGTCEEVLGLLLSKL-EIVAEDKNTKDEN 710
             KVEE     +  +   D  S + +   S     C ++LG LL KL  +   D +  D+ 
Sbjct: 244  EKVEELFRGLIPNND--DSSSSKEDDSAS---SCCGDILGPLLIKLASLGLSDVDLTDKM 298

Query: 709  EKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRACI 530
            +KM+ +I                         EDL QEMAELRYQ+TG+LEEE KRRAC+
Sbjct: 299  KKMAGQIKNYESLVKQLKEELRMEKLKAKEESEDLAQEMAELRYQMTGLLEEERKRRACV 358

Query: 529  EQASLQRISELEAQVGKEQKKS 464
            EQ SLQRI+ELEAQV KE  KS
Sbjct: 359  EQLSLQRIAELEAQVEKESMKS 380


>ref|XP_006358272.1| PREDICTED: intracellular protein transport protein USO1-like isoform
            X2 [Solanum tuberosum]
          Length = 375

 Score =  250 bits (639), Expect = 1e-63
 Identities = 153/374 (40%), Positives = 222/374 (59%), Gaps = 1/374 (0%)
 Frame = -2

Query: 1606 SDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAKY 1427
            S  ++SF+V ELL++   C+ELRKE + LR SQ +S ELIR++E  V+ LSEAR +D  +
Sbjct: 4    SSGEHSFDVKELLEIRARCKELRKEKDTLRGSQGQSVELIRKIEQHVQTLSEAREEDKYH 63

Query: 1426 IWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKEL 1247
              +L+++L+N SQEI YLQDQLNLRN E + L + V SL+LKLA +  + +E+ +L++EL
Sbjct: 64   TQKLKSELENCSQEIDYLQDQLNLRNEEMDSLSKCVCSLQLKLANLENMEEEVTRLREEL 123

Query: 1246 AKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQLS 1067
              SN++R  L+++LE+KE++++ S LCIE+                 SMKLD+ A+EQ  
Sbjct: 124  ETSNAERLYLLQQLESKELEIEGSALCIERLEESVASVGLEHQFEIESMKLDLIAMEQNY 183

Query: 1066 SEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISERNAKTLC 887
             +AKK QD  A +   M+EL+ + ++Q  D++ +I             L+ SE NA+T  
Sbjct: 184  FKAKKSQDETAQDSAMMNELIHDLQLQIYDAEKVIESLEKENVNLREQLQTSELNARTFS 243

Query: 886  RKVEEYLGEWLGKHAIVDIPSCRSEHLVSKEIGTCEEVLGLLLSKL-EIVAEDKNTKDEN 710
             KVEE     +  +   D  S + +   S     C ++LG LL KL  +   D +  D+ 
Sbjct: 244  EKVEELFRGLIPNND--DSSSSKEDDSAS---SCCGDILGPLLIKLASLGLSDVDLTDKM 298

Query: 709  EKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRACI 530
            +KM+ +I                         EDL QEMAELRYQ+TG+LEEE KRRAC+
Sbjct: 299  KKMAGQIKNYESLVKQLKEELRMEKLKAKEESEDLAQEMAELRYQMTGLLEEERKRRACV 358

Query: 529  EQASLQRISELEAQ 488
            EQ SLQRI+ELEAQ
Sbjct: 359  EQLSLQRIAELEAQ 372


>ref|NP_001190585.1| uncharacterized protein [Arabidopsis thaliana]
            gi|332010052|gb|AED97435.1| uncharacterized protein
            AT5G61200 [Arabidopsis thaliana]
          Length = 389

 Score =  249 bits (637), Expect = 2e-63
 Identities = 156/389 (40%), Positives = 217/389 (55%)
 Frame = -2

Query: 1618 SGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSK 1439
            S  RSD D SF+ DELLQ+G+ C ELR+E  MLRESQS+S EL+RRLEL+   LSE+R +
Sbjct: 15   SSSRSDVDNSFDADELLQIGSRCMELRREKEMLRESQSQSVELVRRLELNANSLSESRLE 74

Query: 1438 DAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQL 1259
            D + I  LE +L N  QEI YL+DQ+N R+ E N L EHV  LE+++ +  KL +E+  L
Sbjct: 75   DKRRIQMLEKELLNCYQEIDYLRDQVNFRSQEMNDLSEHVLDLEVRVTKSGKLEEEVNYL 134

Query: 1258 KKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITAL 1079
            ++EL  S S++  L++ELE+ E +LQ S+  +EK                 S+KLDI AL
Sbjct: 135  REELCSSKSEQLLLLQELESTETELQFSLFSVEKLEESVSSLTLESQCEIESIKLDIVAL 194

Query: 1078 EQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISERNA 899
            EQ   +A+KFQ  +  E  ++ E+V+E  + S++++    C              SERN 
Sbjct: 195  EQALFDAQKFQGESIQENDKLREIVKELRLNSREAEENAECLEKQNKELMERCVASERNI 254

Query: 898  KTLCRKVEEYLGEWLGKHAIVDIPSCRSEHLVSKEIGTCEEVLGLLLSKLEIVAEDKNTK 719
            K L    + + G    +      P C  +                ++ KLE V +D   +
Sbjct: 255  KDL---RQSFRGRLESESEAPVNPDCFHD----------------IIKKLE-VFQDGKLR 294

Query: 718  DENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRR 539
            D+ E M+ +I                         EDLTQEMAELRY++T +LEEECKRR
Sbjct: 295  DKMEDMARQILQYKDLVKQLKDELKEEKLKAKEEAEDLTQEMAELRYEMTCLLEEECKRR 354

Query: 538  ACIEQASLQRISELEAQVGKEQKKSSIAL 452
            ACIEQASLQRI+ LEAQ+ +E+ KSS  L
Sbjct: 355  ACIEQASLQRIANLEAQIKREKNKSSTCL 383


>ref|XP_007037496.1| Myosin heavy chain-related, putative isoform 3 [Theobroma cacao]
            gi|508774741|gb|EOY21997.1| Myosin heavy chain-related,
            putative isoform 3 [Theobroma cacao]
          Length = 324

 Score =  248 bits (633), Expect = 6e-63
 Identities = 150/317 (47%), Positives = 201/317 (63%), Gaps = 5/317 (1%)
 Frame = -2

Query: 1624 MSSGFRSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEAR 1445
            MSS  +S+ D S NV+ELL++ T CRELRKE  ML+ESQS+  ELIR LE+ VK LSEAR
Sbjct: 1    MSSSSKSEGDNSINVEELLEIETRCRELRKEKEMLKESQSQGFELIRSLEVHVKSLSEAR 60

Query: 1444 SKDAKYIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIG 1265
             +D K+I +LE +LKN SQEI YLQDQL+ RN E N L EHVH LE+KLA+   L +++ 
Sbjct: 61   VQDKKHIKKLEGELKNCSQEIDYLQDQLSARNEEVNFLTEHVHDLEIKLADKGNLQEKVD 120

Query: 1264 QLKKELAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDIT 1085
            +L  EL  SNSDR SLM+E+ENKE +LQ S LCIEK                 SMKLDIT
Sbjct: 121  RLIGELNSSNSDRLSLMQEIENKEEELQQSALCIEKLEESVSSMALESQCEIESMKLDIT 180

Query: 1084 ALEQLSSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISER 905
            ALEQ+  EA K ++    EK RM+ L+EE EVQ Q++  +I             L  SE+
Sbjct: 181  ALEQMCLEANKTEE----EKSRMNILIEELEVQLQNALKIIEGLDDENKELRGKLITSEK 236

Query: 904  NAKTLCRKVEEYL----GEWLGKHAIVDIPSCRSEHLVSKEIGTCEEVLGLLLSKLEIVA 737
            NAK  C+K++E+L       L  H++  +    S   +SK+I  C+E+   LLS++ ++ 
Sbjct: 237  NAKIFCQKIKEWLKSKDRSQLNMHSV--LGEQESMMTISKDISGCKELFSALLSEVALLL 294

Query: 736  E-DKNTKDENEKMSHKI 689
            E D ++K++ E MSH+I
Sbjct: 295  ESDADSKEQYESMSHQI 311


>ref|XP_002873329.1| hypothetical protein ARALYDRAFT_487621 [Arabidopsis lyrata subsp.
            lyrata] gi|297319166|gb|EFH49588.1| hypothetical protein
            ARALYDRAFT_487621 [Arabidopsis lyrata subsp. lyrata]
          Length = 409

 Score =  246 bits (629), Expect = 2e-62
 Identities = 157/385 (40%), Positives = 219/385 (56%), Gaps = 2/385 (0%)
 Frame = -2

Query: 1609 RSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAK 1430
            RSD + SF+V+ELLQ+GTT RELRK+ +MLRESQ  S EL+RRLEL  K LSE+R +D  
Sbjct: 19   RSDCENSFDVEELLQIGTTRRELRKQKDMLRESQPHSIELVRRLELHTKSLSESRLEDTA 78

Query: 1429 YIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKE 1250
             I  +E +L N  +EI YL+DQL  R+ E N L EH+H LE KLAE   L +E+  L+ E
Sbjct: 79   RIQMMEKELLNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDE 138

Query: 1249 LAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQL 1070
            L  S S+   L++ELE+KE++LQ S L +EK                 SMK+DITALEQ 
Sbjct: 139  LCMSKSEHLLLLQELESKEIELQCSSLSLEKLEETISSLTLESLCEIESMKIDITALEQA 198

Query: 1069 SSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISERNAKTL 890
              +A K Q+ +  EK ++  ++EE + QSQ +Q  ++               SE++ K  
Sbjct: 199  LFDAMKIQEESIQEKHQLKGIIEESQFQSQRAQENVKYIEKQNEELREKFNASEKSIKEF 258

Query: 889  CRKVEEYLGEWLGKHAIVDIPSCRSEHL--VSKEIGTCEEVLGLLLSKLEIVAEDKNTKD 716
             +  +E L     +   V        H+  +S E+  C +    ++ KLE+ +++ N  D
Sbjct: 259  FQSTKERLESEDEEPLTVGCFFAELSHVLPMSNEVRNCFDA---IMKKLEL-SQNVNLTD 314

Query: 715  ENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRA 536
            + E M+ +I                         EDLTQEMAELRY++T +L+EE  RR 
Sbjct: 315  KVEGMAKQIHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRV 374

Query: 535  CIEQASLQRISELEAQVGKEQKKSS 461
            CIEQASLQRI+ELEAQ+ +E KK S
Sbjct: 375  CIEQASLQRIAELEAQIKREIKKPS 399


>ref|XP_004137423.1| PREDICTED: uncharacterized protein LOC101221046 [Cucumis sativus]
            gi|449486970|ref|XP_004157457.1| PREDICTED:
            uncharacterized protein LOC101230337 [Cucumis sativus]
          Length = 390

 Score =  239 bits (610), Expect = 3e-60
 Identities = 158/383 (41%), Positives = 220/383 (57%), Gaps = 1/383 (0%)
 Frame = -2

Query: 1609 RSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAK 1430
            RS++  S +++ELL++GT CR+L+KE + L +S+ +S ELIR LEL V  LSEAR +D  
Sbjct: 5    RSNSYSSSDLEELLEIGTRCRQLKKEKDTLIDSRPQSFELIR-LELHVNSLSEARKEDKL 63

Query: 1429 YIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKE 1250
             I  LE +L N +QEI YLQDQL  RN E   L +HV SLE KL  +    ++  +L++E
Sbjct: 64   RIENLEKELTNCTQEIDYLQDQLCTRNTELTYLVDHVESLEFKLVHMEHSQEKASKLEEE 123

Query: 1249 LAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQL 1070
            + +SNS+   LM++L++KE +L+ S   +EK                 SMKLD+ A+EQ 
Sbjct: 124  VKRSNSECLFLMQKLDDKEQELRESNSNVEKLEESISAITLESQCEIESMKLDMLAMEQR 183

Query: 1069 SSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISERNAKTL 890
              E KKFQ+ A  +  +MD L+EE     Q++Q  ++            L +S RNA T 
Sbjct: 184  YIETKKFQEEALSQNDKMDRLIEEL----QNAQRNVKFLETENEELQRELDVSTRNASTF 239

Query: 889  CRKVEEYLGEWLGKHAIVDIPSCRSEHLVSKEIGTCEEVLGLLLSKLEI-VAEDKNTKDE 713
            CR VEE +     + +   + + R   L S    +C +VLG LL KL + +  D N++ +
Sbjct: 240  CRSVEELIEN--KERSQNTMRNDRDGKLTSILKNSCGDVLGHLLPKLAVALFADANSEAK 297

Query: 712  NEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRAC 533
             + M  +I                         EDL QEMAELRYQITG+LEEECKRRAC
Sbjct: 298  MDVMKKQILDYELLVEQLKEELREEKLKAKEEAEDLAQEMAELRYQITGLLEEECKRRAC 357

Query: 532  IEQASLQRISELEAQVGKEQKKS 464
            IEQASLQRI++LEAQV K Q +S
Sbjct: 358  IEQASLQRIAQLEAQVLKGQNRS 380


>ref|NP_196406.2| myosin heavy chain-like protein [Arabidopsis thaliana]
            gi|79327239|ref|NP_001031851.1| myosin heavy chain-like
            protein [Arabidopsis thaliana]
            gi|222423567|dbj|BAH19753.1| AT5G07890 [Arabidopsis
            thaliana] gi|332003833|gb|AED91216.1| myosin heavy
            chain-like protein [Arabidopsis thaliana]
            gi|332003835|gb|AED91218.1| myosin heavy chain-like
            protein [Arabidopsis thaliana]
          Length = 409

 Score =  238 bits (608), Expect = 5e-60
 Identities = 154/383 (40%), Positives = 217/383 (56%), Gaps = 2/383 (0%)
 Frame = -2

Query: 1609 RSDNDYSFNVDELLQLGTTCRELRKENNMLRESQSRSTELIRRLELDVKLLSEARSKDAK 1430
            RSD + SF+V++LLQ+GTT RELRK+ ++LRESQ  S EL+RRLEL  K LSE+R +D  
Sbjct: 19   RSDCENSFDVEDLLQIGTTRRELRKQKDLLRESQPHSIELVRRLELHTKSLSESRLEDTA 78

Query: 1429 YIWELENDLKNFSQEIGYLQDQLNLRNIEANCLGEHVHSLELKLAEVSKLHQEIGQLKKE 1250
             I  +E +L N  +EI YL+DQL  R+ E N L EH+H LE KLAE   L +E+  L+ E
Sbjct: 79   RIQMMEKELLNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDE 138

Query: 1249 LAKSNSDRFSLMRELENKEMDLQNSILCIEKXXXXXXXXXXXXXXXXXSMKLDITALEQL 1070
            L  S S+   L++ELE+KE++LQ S L +EK                 SMKLDITALEQ 
Sbjct: 139  LCMSKSEHLLLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQA 198

Query: 1069 SSEAKKFQDNAAHEKFRMDELVEEFEVQSQDSQNMIRCXXXXXXXXXXXLRISERNAKTL 890
              +A K Q+ +  EK ++  ++EE + QSQ ++  ++               SE++ K  
Sbjct: 199  LFDAMKIQEESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDF 258

Query: 889  CRKVEEYL--GEWLGKHAIVDIPSCRSEHLVSKEIGTCEEVLGLLLSKLEIVAEDKNTKD 716
             +  +E L   +    +A+           VS E+  C +    ++ KLE+ +++ N  D
Sbjct: 259  FQSTKERLESEDEQPLNAMCFFAELSHVLPVSNEVRNCFDA---IMKKLEL-SQNVNLID 314

Query: 715  ENEKMSHKIXXXXXXXXXXXXXXXXXXXXXXXXXEDLTQEMAELRYQITGMLEEECKRRA 536
            + E M  +I                         EDLTQEMAELRY++T +L+EE  RR 
Sbjct: 315  KVEGMGKQIHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRV 374

Query: 535  CIEQASLQRISELEAQVGKEQKK 467
            CIEQASLQRISELEAQ+ ++ KK
Sbjct: 375  CIEQASLQRISELEAQIKRDVKK 397

