BLASTX nr result

ID: Ephedra27_contig00017575 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00017575
         (1420 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theob...   162   3e-37
gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao]    162   3e-37
ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245...   161   7e-37
ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm...   151   6e-34
ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620...   149   3e-33
ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620...   145   4e-32
ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, part...   141   8e-31
ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps...   140   1e-30
ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ...   139   2e-30
ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab...   139   4e-30
gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao]    137   1e-29
gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao]    136   2e-29
ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcript...   135   5e-29
dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]        134   9e-29
ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcr...   133   2e-28
gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus pe...   128   5e-27
gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theob...   125   3e-26
gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma caca...   125   3e-26
ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu...   125   6e-26
ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211...   124   7e-26

>gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 372

 Score =  162 bits (410), Expect = 3e-37
 Identities = 114/366 (31%), Positives = 179/366 (48%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  +  +K EL  VE E AK  NEIE+L+    +++N L  ++E L   ++   +  
Sbjct: 19   EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQG 78

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
              G               +E+    +S  +D  +  + + S E   FE +ELE Q+ +  
Sbjct: 79   MEG---------------VEEDPCLDSSMNDEDQSNL-MHSNEEQKFEIMELESQIEKNN 122

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                 L+ LD   KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   GL+CQ  I 
Sbjct: 123  IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 182

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
            +I E   + H L V++   T    + E+             +K     S           
Sbjct: 183  DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 242

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            L   V ++Q RI   TLR  +++   K SR+SF+Y  RD+ I  ++ GGI AFIKL Q  
Sbjct: 243  LEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERDETIVAHLVGGIDAFIKLSQGW 301

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1112
                        K  D  +  ISL++LC+A E+ N L++  R  L  F+DAVE++LL+Q+
Sbjct: 302  PLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQM 361

Query: 1113 KKSSKS 1130
            +   +S
Sbjct: 362  RLDLQS 367


>gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 430

 Score =  162 bits (410), Expect = 3e-37
 Identities = 114/366 (31%), Positives = 179/366 (48%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  +  +K EL  VE E AK  NEIE+L+    +++N L  ++E L   ++   +  
Sbjct: 77   EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQG 136

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
              G               +E+    +S  +D  +  + + S E   FE +ELE Q+ +  
Sbjct: 137  MEG---------------VEEDPCLDSSMNDEDQSNL-MHSNEEQKFEIMELESQIEKNN 180

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                 L+ LD   KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   GL+CQ  I 
Sbjct: 181  IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
            +I E   + H L V++   T    + E+             +K     S           
Sbjct: 241  DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            L   V ++Q RI   TLR  +++   K SR+SF+Y  RD+ I  ++ GGI AFIKL Q  
Sbjct: 301  LEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERDETIVAHLVGGIDAFIKLSQGW 359

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1112
                        K  D  +  ISL++LC+A E+ N L++  R  L  F+DAVE++LL+Q+
Sbjct: 360  PLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQM 419

Query: 1113 KKSSKS 1130
            +   +S
Sbjct: 420  RLDLQS 425


>ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera]
            gi|298205214|emb|CBI17273.3| unnamed protein product
            [Vitis vinifera]
          Length = 425

 Score =  161 bits (407), Expect = 7e-37
 Identities = 107/361 (29%), Positives = 176/361 (48%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            DD D  +  +K EL +VE E AK  NEIE L     +D+N L  D+E+L   ++F  +  
Sbjct: 71   DDLDAYLGHLKKELNLVESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVDFVASQ- 129

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
              G    +   + +  + +E Q    + + D +             FE L+L  Q  + +
Sbjct: 130  --GLKRAEAGALVDYSSSVEDQLDSRTAHGDNN-------------FEILDLNYQTQKNK 174

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                 L+ LD   KR +A+  IE++L+ +K I    NCI+L+L TFIP   GL+C+ +I 
Sbjct: 175  ITLKSLQDLDYTFKRFEAIEKIEDALTGLKVIDFEGNCIRLSLSTFIPNLEGLLCEEKIE 234

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
             + E   + H L ++V   +    + E+             +K S         L     
Sbjct: 235  AVNEPSELNHELLIEVMDQSMELKNVEIFPNDVYLGEIIDAAKSSRKLFSHMSILETRSS 294

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            L   VR++Q +I    LR S+++   K SR+S +Y  RD++I  ++ GG+ A+IK+ Q  
Sbjct: 295  LEWFVRKVQDKIILCALRQSIVKGANK-SRHSLEYLDRDEIIVAHMVGGVDAYIKVCQGW 353

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1112
                        K  D+ +  ISL+ LC+  E+ N L++S R  +  F+DA+EEIL+QQ+
Sbjct: 354  PVSNNALKLKSLKSSDQQSKGISLSFLCKVEEMANSLDVSIRKNISSFVDAIEEILVQQM 413

Query: 1113 K 1115
            +
Sbjct: 414  Q 414


>ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis]
            gi|223542639|gb|EEF44176.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 415

 Score =  151 bits (382), Expect = 6e-34
 Identities = 107/368 (29%), Positives = 177/368 (48%), Gaps = 7/368 (1%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  ++ +K EL+    E AK   EIE L     +D   L  DIE+L   ++F  +  
Sbjct: 66   EDLDAFVEHLKEELSTTMSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISS-- 123

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
                        K++E + E    E+    D  R         ++ FE  +L+DQ+ + +
Sbjct: 124  ------------KDVEKEKEVACREDLYSTDAHR---------DYEFEISKLDDQIAKSK 162

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                 L+  D   KRVDA+  IEE+LS +K I    +CI+L+L+T++P    ++CQ +  
Sbjct: 163  MILKSLQDFDSVFKRVDAVEQIEEALSGLKVIEFDGSCIRLSLRTYLPKLDDVMCQHKTE 222

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSK-------YSSDASDKNI 731
            +  E   V H L ++V   T    + E+             +K       YS+    +  
Sbjct: 223  DTAEPSEVNHELLIEVVSGTMELKNVEIFPNDIYISDIVDAAKSFRKEFLYSALTESETR 282

Query: 732  SLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAF 911
            S      L  LVR++Q RI  +TLR  V++   K SR SF+Y  RD+ +  ++ GG+ AF
Sbjct: 283  S-----SLGWLVRKVQDRIIQFTLRRLVVKSSNK-SRYSFEYLDRDETVVAHLVGGVDAF 336

Query: 912  IKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVE 1091
            IKL Q              K  +  + +ISL+ LCR  E++N L++  R  LL F++ +E
Sbjct: 337  IKLSQGWPVSRSPLKLISLKSSNHHSKEISLSFLCRVEEVVNSLDIQMRLNLLSFVEVIE 396

Query: 1092 EILLQQVK 1115
            ++L++Q++
Sbjct: 397  KLLVEQMR 404


>ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus
            sinensis]
          Length = 447

 Score =  149 bits (376), Expect = 3e-33
 Identities = 107/362 (29%), Positives = 173/362 (47%), Gaps = 1/362 (0%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  ++ +K EL  VE E +K  NEIE L     +D++ L  D+E L   I+   ++ 
Sbjct: 85   EDLDAYLEHLKEELKTVEAESSKISNEIETLTRTQVEDSDRLESDLEELNCAIDLIVSEG 144

Query: 213  NVGGGTDKNAQIKEL-ENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRK 389
            +     D+ A      E+++    +E+    D+ +        E+  FE LELE Q+ + 
Sbjct: 145  SQNAKEDRQAVCPARGEDQVCPTHTEDQS--DLIKIH------EDHRFEILELESQIEKN 196

Query: 390  RQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEI 569
            +     L+ LD   KR DA+  IE+SL+ +K I     C +L+++T+IP       Q +I
Sbjct: 197  KIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDGKCFRLSMQTYIPTLEESSFQHKI 256

Query: 570  NNIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNG 749
             ++ E   V H L ++V   T    + E+             +K    +  +  SL  + 
Sbjct: 257  EDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHISDLVDAAKSFRQSGTQLDSLETSS 316

Query: 750  QLNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQX 929
             L   +R +Q RI   TLR  V++   K SR+ F+Y  RD++I  ++ GG+ AFIK  Q 
Sbjct: 317  SLQWFIRNVQDRIILSTLRRFVVKTANK-SRHFFEYFERDEMIVAHLVGGVDAFIKPSQG 375

Query: 930  XXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQ 1109
                         K  D  +  ISL+  CR  E  N L++  R  L  F+D VE+ILL+Q
Sbjct: 376  WPLSNSPLKVISLKNSDHHSKGISLSFFCRVEEAANSLDVHIRQNLSSFVDGVEKILLEQ 435

Query: 1110 VK 1115
            ++
Sbjct: 436  MR 437


>ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus
            sinensis]
          Length = 444

 Score =  145 bits (366), Expect = 4e-32
 Identities = 106/362 (29%), Positives = 170/362 (46%), Gaps = 1/362 (0%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  ++ +K EL  VE E +K  NEIE L     +D++ L  D+E L   I+   +++
Sbjct: 85   EDLDAYLEHLKEELKTVEAESSKISNEIETLTRTQVEDSDRLESDLEELNCAIDLIVSEN 144

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLS-GENFMFEALELEDQLIRK 389
                        KE    +   + E+      +  +  L+   E+  FE LELE Q+ + 
Sbjct: 145  -----------AKEDRQAVCPARGEDQVCPTHTEDQSDLIKIHEDHRFEILELESQIEKN 193

Query: 390  RQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEI 569
            +     L+ LD   KR DA+  IE+SL+ +K I     C +L+++T+IP       Q +I
Sbjct: 194  KIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDGKCFRLSMQTYIPTLEESSFQHKI 253

Query: 570  NNIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNG 749
             ++ E   V H L ++V   T    + E+             +K    +  +  SL  + 
Sbjct: 254  EDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHISDLVDAAKSFRQSGTQLDSLETSS 313

Query: 750  QLNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQX 929
             L   +R +Q RI   TLR  V++   K SR+ F+Y  RD++I  ++ GG+ AFIK  Q 
Sbjct: 314  SLQWFIRNVQDRIILSTLRRFVVKTANK-SRHFFEYFERDEMIVAHLVGGVDAFIKPSQG 372

Query: 930  XXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQ 1109
                         K  D  +  ISL+  CR  E  N L++  R  L  F+D VE+ILL+Q
Sbjct: 373  WPLSNSPLKVISLKNSDHHSKGISLSFFCRVEEAANSLDVHIRQNLSSFVDGVEKILLEQ 432

Query: 1110 VK 1115
            ++
Sbjct: 433  MR 434


>ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum]
            gi|557096755|gb|ESQ37263.1| hypothetical protein
            EUTSA_v10002763mg, partial [Eutrema salsugineum]
          Length = 355

 Score =  141 bits (355), Expect = 8e-31
 Identities = 103/368 (27%), Positives = 173/368 (47%), Gaps = 5/368 (1%)
 Frame = +3

Query: 42   DMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDDNVG 221
            D  ++ ++ EL  VE E AK   EIE L+++ A+D++ L  D+E L   ++F  +     
Sbjct: 5    DAYLEYLRKELHSVEAESAKVSEEIERLSSSHAEDSSRLDRDLEGLLLSLDFLSSQ---- 60

Query: 222  GGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLS-----GENFMFEALELEDQLIR 386
                            E QKS+E+     S  E C  S      ++  F+  ELE+Q+  
Sbjct: 61   ----------------EVQKSKENP-PSTSSMERCDASTWIDVNDDEKFKMFELENQIEE 103

Query: 387  KRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFE 566
            KR+    L  LD   KR DA   +E++L+ +K +    N I+L L+T+IP   GL+ Q +
Sbjct: 104  KRRILKSLENLDSVCKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIPKLDGLLGQHK 163

Query: 567  INNIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVN 746
            + +  E   + H L + ++  TT     E+             +         +  L+  
Sbjct: 164  LLHNTEPSELIHELLIDLKDKTTEITKVEMLPNDVYIGDITDAADSFRQIRLHSALLDTR 223

Query: 747  GQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQ 926
              L  LV ++Q RI    LR  +++   K  R++F+Y  +D+ I  ++ GGI AF+K+  
Sbjct: 224  SSLQWLVAKVQERIITTNLRKHIVK-SSKTIRHTFEYYDKDETIVAHITGGIDAFLKVSV 282

Query: 927  XXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQ 1106
                          K  D  +  ISL+++C+  E+ N L+L  R  L  F+DA+E+IL+Q
Sbjct: 283  GWPLLSTPLKLTSLKNSDNQSNGISLSLICKVEELANSLDLQTRQNLSGFMDAIEKILVQ 342

Query: 1107 QVKKSSKS 1130
            Q ++   S
Sbjct: 343  QTREELHS 350


>ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella]
            gi|482566470|gb|EOA30659.1| hypothetical protein
            CARUB_v10013795mg [Capsella rubella]
          Length = 420

 Score =  140 bits (353), Expect = 1e-30
 Identities = 100/366 (27%), Positives = 171/366 (46%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  ++ ++ EL  VE E AK   EIE L+ + A+D++ L  D+E L   ++   + D
Sbjct: 72   EDSDAYLEYLRKELHSVEAESAKVSEEIERLSRSHAEDSSRLERDLEGLLLSLDSMSSQD 131

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
                                  KS+ES     S  E+C ++ ++  F+  ELE+Q+  KR
Sbjct: 132  --------------------VNKSKESP-PSCSSMEVCEVNDDD-KFKMFELENQMEEKR 169

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                 L  LD   KR DA   +E++L+ +K +    N I+L L+T+IP   GL  Q +  
Sbjct: 170  MILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIPELDGLPAQHKFE 229

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
            +  +   + H L + ++  TT     E+             +         +  L+    
Sbjct: 230  HTTKPSELIHELLIYLKDKTTEITKLEMFPNDVYIGDIIEAADSFRQVRLHSAVLDTRSS 289

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            +  +V ++Q RI   TLR  ++    K  R++F+Y  +D+ I  ++ GGI AF+K+    
Sbjct: 290  VQWVVAKVQDRIITTTLRKYIVT-SSKTMRHTFKYYDKDETIVAHIAGGIDAFLKVSDGW 348

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1112
                        K  D  +  ISL+++C+  E+ N L+L  R  L  FIDA+E+IL+ Q 
Sbjct: 349  PLLNSPLKLASLKNSDNQSKGISLSLICKVEELANSLDLQTRQNLSGFIDAIEKILVHQT 408

Query: 1113 KKSSKS 1130
            ++  +S
Sbjct: 409  REELQS 414


>ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana]
            gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis
            thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein
            [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1|
            putative HAPp48,5 protein [Arabidopsis thaliana]
            gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein
            [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1|
            uncharacterized protein AT3G23910 [Arabidopsis thaliana]
          Length = 421

 Score =  139 bits (351), Expect = 2e-30
 Identities = 99/366 (27%), Positives = 172/366 (46%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  ++ ++NEL  VE E AK   EIE L+ + A+D++ L  D+E L   ++   + D
Sbjct: 73   EDSDAYLEYLRNELQSVEAESAKVSEEIERLSQSHAQDSSRLQRDLEGLLLSLDSMSSQD 132

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
                          +E   E Q S  S        E+C +  ++  F+  ELE+Q+  KR
Sbjct: 133  --------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKR 170

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                 L  LD   KR DA   +E++L+ +K +    N I+L L+T+I    G + Q + +
Sbjct: 171  MILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFD 230

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
            +I E   + H L + ++  TT     E+             +         +  L+    
Sbjct: 231  HITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSS 290

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            +  +V ++Q +I   TLR  ++    K  R +F+Y  +D+ I  ++ GGI AF+K+    
Sbjct: 291  VQWVVAKVQDKIISTTLRKYIVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGW 349

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1112
                        K  D  +  ISL+++C+  E+ N L+L  R  L  F+DA+E+IL++Q 
Sbjct: 350  PLLNTPLKLASLKNSDNQSKGISLSLICKVEELANSLDLETRQNLSGFMDAIEKILVEQT 409

Query: 1113 KKSSKS 1130
            ++  +S
Sbjct: 410  REELQS 415


>ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp.
            lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein
            ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  139 bits (349), Expect = 4e-30
 Identities = 99/362 (27%), Positives = 171/362 (47%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  ++ ++ EL  VE E AK   EIE L+ + A+D++ L  D+E L   ++   + D
Sbjct: 72   EDSDAYLEYLRKELQSVEAESAKVSEEIERLSKSHAQDSSRLERDLEGLLLSLDSMSSQD 131

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
                          +E   E Q S  S        E+C ++ ++  F+  ELE+Q+  KR
Sbjct: 132  --------------VEKSKENQPSSSS-------MEVCEVNDDD-KFKMFELENQMEEKR 169

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                 L  LD   KR DA   +E++L+ +K +    N I+L L+T+IP    L+ Q +  
Sbjct: 170  SILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLQTYIPKLDSLLGQQKFE 229

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
            +  E   + H L + ++  TT     E+             +      S  +  L+    
Sbjct: 230  HTTEPSELIHELLIYLKDKTTEITKFEMFPNDVYIGDIIEAADSFRQVSLHSAVLDTRSS 289

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            +  +V ++Q RI   TLR  ++    K  R++F+Y  +D+ I  ++ GGI AF+K+    
Sbjct: 290  VQWVVAKVQDRIISSTLRKYLVT-SSKTIRHTFEYYEKDETIVGHIAGGIDAFLKVSNGW 348

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1112
                        K  D  +  ISL+++C+  ++ N L+L  R  L  F+DA+E+IL+QQ 
Sbjct: 349  PLLNTPLKLESLKNSDNQSKGISLSLICKVEDLANSLDLQTRQNLSGFMDAIEKILVQQT 408

Query: 1113 KK 1118
            ++
Sbjct: 409  RE 410


>gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao]
          Length = 432

 Score =  137 bits (345), Expect = 1e-29
 Identities = 101/334 (30%), Positives = 157/334 (47%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  +  +K EL  VE E AK  NEIE+L+    +++N L  ++E L   ++   +  
Sbjct: 77   EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQG 136

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
              G               +E+    +S  +D  +  + + S E   FE +ELE Q+ +  
Sbjct: 137  MEG---------------VEEDPCLDSSMNDEDQSNL-MHSNEEQKFEIMELESQIEKNN 180

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                 L+ LD   KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   GL+CQ  I 
Sbjct: 181  IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
            +I E   + H L V++   T    + E+             +K     S           
Sbjct: 241  DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            L   V ++Q RI   TLR  +++   K SR+SF+Y  RD+ I  ++ GGI AFIKL Q  
Sbjct: 301  LEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERDETIVAHLVGGIDAFIKLSQGW 359

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEIL 1034
                        K  D  +  ISL++LC+A E +
Sbjct: 360  PLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEAI 393


>gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 392

 Score =  136 bits (342), Expect = 2e-29
 Identities = 100/333 (30%), Positives = 156/333 (46%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  +  +K EL  VE E AK  NEIE+L+    +++N L  ++E L   ++   +  
Sbjct: 77   EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQG 136

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
              G               +E+    +S  +D  +  + + S E   FE +ELE Q+ +  
Sbjct: 137  MEG---------------VEEDPCLDSSMNDEDQSNL-MHSNEEQKFEIMELESQIEKNN 180

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                 L+ LD   KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   GL+CQ  I 
Sbjct: 181  IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
            +I E   + H L V++   T    + E+             +K     S           
Sbjct: 241  DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            L   V ++Q RI   TLR  +++   K SR+SF+Y  RD+ I  ++ GGI AFIKL Q  
Sbjct: 301  LEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERDETIVAHLVGGIDAFIKLSQGW 359

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEI 1031
                        K  D  +  ISL++LC+A  +
Sbjct: 360  PLSKSPLKLLSIKSSDHHSRGISLSLLCKAERV 392


>ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcriptase)-related protein
            [Arabidopsis thaliana] gi|332643359|gb|AEE76880.1|
            RNA-directed DNA polymerase (reverse
            transcriptase)-related protein [Arabidopsis thaliana]
          Length = 746

 Score =  135 bits (339), Expect = 5e-29
 Identities = 102/372 (27%), Positives = 171/372 (45%)
 Frame = +3

Query: 15   SDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIE 194
            SD N+ D +   ++ ++NEL  VE E AK   EIE L+ + A D++ L  D+E L   ++
Sbjct: 395  SDGNLTDAY---LEYLRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLD 451

Query: 195  FWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELED 374
               + D              +E   E Q S  S        E+C +  ++  F+  ELE+
Sbjct: 452  SMSSQD--------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELEN 489

Query: 375  QLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLI 554
            Q+  KR     L  LD   KR DA   +E++L+ +K +    N I+L L+T+I    G +
Sbjct: 490  QMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFL 549

Query: 555  CQFEINNIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNIS 734
             Q + ++I E   + H L + ++  TT     E+             +         +  
Sbjct: 550  GQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAV 609

Query: 735  LNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFI 914
            L+    +  +V ++Q +I   TLR   +    K  R +F+Y  +D+ I  ++ GGI AF+
Sbjct: 610  LDTRSSVQWVVAKVQDKIISTTLRKDFVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFL 668

Query: 915  KLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEE 1094
            K+                K  D  +   SL+++ +  E+ N L+L  R  L  F+DAVE+
Sbjct: 669  KVSDGWPLLNTPLKLASLKNSDNQSKGFSLSLISKLEELANSLDLETRQNLSGFMDAVEK 728

Query: 1095 ILLQQVKKSSKS 1130
            IL+QQ ++  KS
Sbjct: 729  ILVQQTREELKS 740


>dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]
          Length = 421

 Score =  134 bits (337), Expect = 9e-29
 Identities = 100/366 (27%), Positives = 167/366 (45%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            D  D  ++ ++NEL  VE E AK   EIE L+ + A D++ L  D+E L   ++   + D
Sbjct: 73   DQTDAYLEYLRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQD 132

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
                          +E   E Q S  S        E+C +  ++  F+  ELE+Q+  KR
Sbjct: 133  --------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKR 170

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                 L  LD   KR DA   +E++L+ +K +    N I+L L+T+I    G + Q + +
Sbjct: 171  MILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFD 230

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
            +I E   + H L + ++  TT     E+             +         +  L+    
Sbjct: 231  HITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSS 290

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            +  +V ++Q +I   TLR   +    K  R +F+Y  +D+ I  ++ GGI AF+K+    
Sbjct: 291  VQWVVAKVQDKIISTTLRKDFVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGW 349

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1112
                        K  D  +   SL+++ +  E+ N L+L  R  L  F+DAVE+IL+QQ 
Sbjct: 350  PLLNTPLKLASLKNSDNQSKGFSLSLISKLEELANSLDLETRQNLSGFMDAVEKILVQQT 409

Query: 1113 KKSSKS 1130
            ++  KS
Sbjct: 410  REELKS 415


>ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein
            [Arabidopsis thaliana] gi|332643360|gb|AEE76881.1|
            RNA-directed DNA polymerase (reverse
            transcriptase)-related protein [Arabidopsis thaliana]
          Length = 428

 Score =  133 bits (335), Expect = 2e-28
 Identities = 101/371 (27%), Positives = 170/371 (45%)
 Frame = +3

Query: 18   DDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEF 197
            D N+ D +   ++ ++NEL  VE E AK   EIE L+ + A D++ L  D+E L   ++ 
Sbjct: 78   DGNLTDAY---LEYLRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDS 134

Query: 198  WQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQ 377
              + D              +E   E Q S  S        E+C +  ++  F+  ELE+Q
Sbjct: 135  MSSQD--------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELENQ 172

Query: 378  LIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLIC 557
            +  KR     L  LD   KR DA   +E++L+ +K +    N I+L L+T+I    G + 
Sbjct: 173  MEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLG 232

Query: 558  QFEINNIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISL 737
            Q + ++I E   + H L + ++  TT     E+             +         +  L
Sbjct: 233  QHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVL 292

Query: 738  NVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIK 917
            +    +  +V ++Q +I   TLR   +    K  R +F+Y  +D+ I  ++ GGI AF+K
Sbjct: 293  DTRSSVQWVVAKVQDKIISTTLRKDFVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFLK 351

Query: 918  LPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEI 1097
            +                K  D  +   SL+++ +  E+ N L+L  R  L  F+DAVE+I
Sbjct: 352  VSDGWPLLNTPLKLASLKNSDNQSKGFSLSLISKLEELANSLDLETRQNLSGFMDAVEKI 411

Query: 1098 LLQQVKKSSKS 1130
            L+QQ ++  KS
Sbjct: 412  LVQQTREELKS 422


>gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica]
          Length = 416

 Score =  128 bits (322), Expect = 5e-27
 Identities = 97/361 (26%), Positives = 172/361 (47%)
 Frame = +3

Query: 30   NDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQND 209
            + +F+  +   + EL  VE E  K  N IE+L     +D N L  D+  L   ++F +  
Sbjct: 74   DQEFEAYVGRFEQELNSVEAESTKVSNGIEDLIRTHGEDFNRLGTDLAQLKCSLDFVE-- 131

Query: 210  DNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRK 389
                   +K+ +  +L   ++  K  +   D ++      ++ + F  E LELE+Q+ + 
Sbjct: 132  -------EKDLEKAKLGADVDYHKCGKDLLDPMN------VNADKF--ELLELENQIEKN 176

Query: 390  RQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEI 569
                  L+ L+   K +D    IE++++ +K I    NC++L+L+T+IP    L    ++
Sbjct: 177  NIILKSLQDLECTLKWLDNTEQIEDAVTGLKVIAFEGNCVRLSLRTYIPKLEDLFSPKKV 236

Query: 570  NNIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNG 749
             +  E   V H L +++ + T    + E+               Y +D  D   SL    
Sbjct: 237  GDATEPSEVNHELLIELLEGTMGLRNVEIFPNDV----------YINDILDAAKSLR-KS 285

Query: 750  QLNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQX 929
             L   V ++Q RI   T+R  V++++ K SR+S +Y  +D+ +  +V GG+ AFIK+PQ 
Sbjct: 286  SLQWFVTKVQDRIVLCTMRRLVVKNENK-SRHSLEYLDKDETVVAHVVGGVDAFIKVPQG 344

Query: 930  XXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQ 1109
                         K  D+ +  ISL+ LC   E+ N L +  R  L  F+DA+E+IL++Q
Sbjct: 345  WPLLSSPLKLIYLKSSDQHSKGISLSFLCTVQELANSLAVRIRQTLSSFVDAIEKILVEQ 404

Query: 1110 V 1112
            +
Sbjct: 405  M 405


>gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theobroma cacao]
          Length = 343

 Score =  125 bits (315), Expect = 3e-26
 Identities = 92/298 (30%), Positives = 143/298 (47%)
 Frame = +3

Query: 33  DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
           +D D  +  +K EL  VE E AK  NEIE+L+    +++N L  ++E L   ++   +  
Sbjct: 51  EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQG 110

Query: 213 NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
             G               +E+    +S  +D  +  + + S E   FE +ELE Q+ +  
Sbjct: 111 MEG---------------VEEDPCLDSSMNDEDQSNL-MHSNEEQKFEIMELESQIEKNN 154

Query: 393 QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                L+ LD   KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   GL+CQ  I 
Sbjct: 155 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 214

Query: 573 NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
           +I E   + H L V++   T    + E+             +K     S           
Sbjct: 215 DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 274

Query: 753 LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQ 926
           L   V ++Q RI   TLR  +++   K SR+SF+Y  RD+ I  ++ GGI AFIKL Q
Sbjct: 275 LEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERDETIVAHLVGGIDAFIKLSQ 331


>gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma cacao]
           gi|508713298|gb|EOY05195.1| Uncharacterized protein
           isoform 2 [Theobroma cacao]
          Length = 369

 Score =  125 bits (315), Expect = 3e-26
 Identities = 92/298 (30%), Positives = 143/298 (47%)
 Frame = +3

Query: 33  DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
           +D D  +  +K EL  VE E AK  NEIE+L+    +++N L  ++E L   ++   +  
Sbjct: 77  EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQG 136

Query: 213 NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
             G               +E+    +S  +D  +  + + S E   FE +ELE Q+ +  
Sbjct: 137 MEG---------------VEEDPCLDSSMNDEDQSNL-MHSNEEQKFEIMELESQIEKNN 180

Query: 393 QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
                L+ LD   KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   GL+CQ  I 
Sbjct: 181 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 573 NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
           +I E   + H L V++   T    + E+             +K     S           
Sbjct: 241 DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300

Query: 753 LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQ 926
           L   V ++Q RI   TLR  +++   K SR+SF+Y  RD+ I  ++ GGI AFIKL Q
Sbjct: 301 LEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERDETIVAHLVGGIDAFIKLSQ 357


>ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa]
            gi|222847415|gb|EEE84962.1| hypothetical protein
            POPTR_0001s32530g [Populus trichocarpa]
          Length = 429

 Score =  125 bits (313), Expect = 6e-26
 Identities = 94/360 (26%), Positives = 164/360 (45%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            +D D  +  +K EL   E E AK  NEIE L     +D++ L  D+E +   ++   +  
Sbjct: 75   EDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELENDLEWMKCSLDLISSQ- 133

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
                   ++ + ++ + ++E   S E++ + I+       + E   FE L+L++Q+    
Sbjct: 134  -------RDREKEKGDEQMEHFSSGENQSNLIN-------TNEENKFEILKLDNQIEEST 179

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
            +    ++ LD   K  DA+  IE+ LS +K I     CI+L+L+T+IP +  L  Q +I 
Sbjct: 180  RILKSMQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLRTYIPKQDVLFLQ-KIE 238

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
                 + + H   ++V   +      E+             +K           +  +  
Sbjct: 239  ETNVPYEINHEFLIEVTNGSMEIKKVEMFPNDIYIGDIVDAAKSFRQMFLHLALMETSSS 298

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            L   VR+ Q RI   TLR  V       SR S +Y  RD++I  ++ GG+ AF+++ Q  
Sbjct: 299  LEWFVRKAQDRIIQSTLRRLVAR-SASTSRQSIEYLDRDEIIVAHMVGGVDAFMEVSQGW 357

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1112
                        K  +  A +ISL  LC+  E  N L++  R  L  F+D+VE+IL++Q+
Sbjct: 358  PITNSPLKLVSLKNSNHHAKEISLGFLCKVEEAANSLDVHTRQNLSSFVDSVEKILVEQM 417


>ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211137 [Cucumis sativus]
            gi|449527675|ref|XP_004170835.1| PREDICTED:
            uncharacterized protein LOC101229419 [Cucumis sativus]
          Length = 414

 Score =  124 bits (312), Expect = 7e-26
 Identities = 98/360 (27%), Positives = 165/360 (45%)
 Frame = +3

Query: 33   DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 212
            DD D  ++ +K EL  VE E +K  NEIE L     +D+N L  D+E+L   ++ + + D
Sbjct: 77   DDLDAYVEHMKEELVAVEAESSKISNEIEVLKRTNIEDSNKLKMDLEVLKLSLDRFPSQD 136

Query: 213  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 392
                        +E          E+     ++R        E   FE LELE Q+ + +
Sbjct: 137  P-----------EEATFNCSSMNGEDPMNVIVNR--------ECNAFEVLELESQIEKNK 177

Query: 393  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 572
            +    L+ +D+  K +D +  +E ++  +K I + +N I+L+L T IP          + 
Sbjct: 178  KILKSLQEVDEIFKSLDVIEQVEGTIGGMKVIDVADNSIRLSLHTHIPNVEDFSTLQRLE 237

Query: 573  NIGESFMVEHVLKVKVEKDTTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 752
             + E   ++H L ++V   T    +AE+           + SK  S++S           
Sbjct: 238  GLIEKSELDHELIIEVLDGTMELKNAEIFPADVHLHDIINASKSISNSS----------- 286

Query: 753  LNSLVREIQYRIFCYTLRNSVLEHDGKASRNSFQYSSRDQVITVNVFGGIKAFIKLPQXX 932
            L   VR++Q RI   TLR   ++   K S +SF+Y  +D++I  ++ GGI A IK+ Q  
Sbjct: 287  LEWFVRKVQDRIVLCTLRRFAVKSANK-SCHSFEYLDQDEMIMCSMIGGIDACIKVSQGW 345

Query: 933  XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1112
                        K  D     +SL+++C+  ++ N L+   R  L  F DAVE+IL +Q+
Sbjct: 346  PLADSPLKLISLKSSDHYTKGVSLSLICKVEKMANSLDAHIRRNLSSFADAVEKILKEQM 405


Top