BLASTX nr result

ID: Ephedra28_contig00003295 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00003295
         (1721 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245...   162   5e-37
gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao]    159   4e-36
gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theob...   158   6e-36
ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620...   155   5e-35
ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620...   151   7e-34
ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm...   149   5e-33
ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, part...   139   5e-30
ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps...   139   5e-30
ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ...   137   1e-29
ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab...   137   2e-29
gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao]    134   1e-28
gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao]    133   3e-28
ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcript...   132   4e-28
ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcr...   132   5e-28
dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]        132   6e-28
ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211...   128   7e-27
gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus pe...   127   1e-26
ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu...   125   7e-26
gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma caca...   122   4e-25
gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theob...   122   6e-25

>ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera]
            gi|298205214|emb|CBI17273.3| unnamed protein product
            [Vitis vinifera]
          Length = 425

 Score =  162 bits (409), Expect = 5e-37
 Identities = 119/427 (27%), Positives = 199/427 (46%)
 Frame = +1

Query: 178  SKLDLGNRLRTKLVKLQDDLQGLEEEVTSVSNFDTFNDSHLLRLQETLADMPLEYLKFSG 357
            S++   NR+ T    + D        + S S F  F+     R+ + L+       ++S 
Sbjct: 17   SRMSELNRIHTNYSHISDS-----NPLDSRSLFQEFSHHLQSRVNQILS-------QYSD 64

Query: 358  SDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIE 537
             +    DD D  +  +K EL +VE E AK  NEIE L     +D+N L  D+E+L   ++
Sbjct: 65   VESLEADDLDAYLGHLKKELNLVESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVD 124

Query: 538  FWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELED 717
            F  +    G    +   + +  + +E Q    + + D +             FE L+L  
Sbjct: 125  FVASQ---GLKRAEAGALVDYSSSVEDQLDSRTAHGDNN-------------FEILDLNY 168

Query: 718  QLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLI 897
            Q  + +     L+ LD   KR +A+  IE++L+ +K I    NCI+L+L TFIP   GL+
Sbjct: 169  QTQKNKITLKSLQDLDYTFKRFEAIEKIEDALTGLKVIDFEGNCIRLSLSTFIPNLEGLL 228

Query: 898  CQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNIS 1077
            C+ +I  + E   + H L ++V   S    + E+             +K S         
Sbjct: 229  CEEKIEAVNEPSELNHELLIEVMDQSMELKNVEIFPNDVYLGEIIDAAKSSRKLFSHMSI 288

Query: 1078 LNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFI 1257
            L     L   VR++Q +I    LR S+++   K SR+  +Y  RD++I  ++ GG+ A+I
Sbjct: 289  LETRSSLEWFVRKVQDKIILCALRQSIVKGANK-SRHSLEYLDRDEIIVAHMVGGVDAYI 347

Query: 1258 KLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEE 1437
            K+ Q              K  D+ +  ISL+ LC+  E+ N L++S R  +  F+DA+EE
Sbjct: 348  KVCQGWPVSNNALKLKSLKSSDQQSKGISLSFLCKVEEMANSLDVSIRKNISSFVDAIEE 407

Query: 1438 ILLQQVK 1458
            IL+QQ++
Sbjct: 408  ILVQQMQ 414


>gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 430

 Score =  159 bits (402), Expect = 4e-36
 Identities = 130/447 (29%), Positives = 212/447 (47%), Gaps = 6/447 (1%)
 Frame = +1

Query: 151  EEMAHSNAGSKLDLGNRLRTKLVKLQD----DLQGLEEEVTSVSNFDTFNDSHLLRLQET 318
            E M  S++   LDL + +R+++ +L +    D    E E  S+++     D  L   +  
Sbjct: 3    EPMEISSSSEALDL-HSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSL-HFESK 60

Query: 319  LADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDT 492
            +  +  EY  + F G +D      D  +  +K EL  VE E AK  NEIE+L+    +++
Sbjct: 61   VKQIIEEYSDVGFLGIED-----LDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEES 115

Query: 493  NNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICL 672
            N L  ++E L   ++   +    G               +E+    +S  +D  +  + +
Sbjct: 116  NILEGNLEGLKYALDSIASQGMEG---------------VEEDPCLDSSMNDEDQSNL-M 159

Query: 673  LSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCI 852
             S E   FE +ELE Q+ +       L+ LD   KR+D L  IE++L+ +K IG   NCI
Sbjct: 160  HSNEEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCI 219

Query: 853  KLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXX 1032
            +L+L+T+IP   GL+CQ  I +I E   + H L V++   +    + E+           
Sbjct: 220  RLSLQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDII 279

Query: 1033 HMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRD 1212
              +K     S           L   V ++Q RI   TLR  +++   K SR+ F+Y  RD
Sbjct: 280  DAAKSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERD 338

Query: 1213 QVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELEL 1392
            + I  ++ GGI AFIKL Q              K  D  +  ISL++LC+A E+ N L++
Sbjct: 339  ETIVAHLVGGIDAFIKLSQGWPLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDM 398

Query: 1393 SRRHRLLPFIDAVEEILLQQVKKSSKS 1473
              R  L  F+DAVE++LL+Q++   +S
Sbjct: 399  HIRQNLSAFVDAVEKLLLEQMRLDLQS 425


>gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 372

 Score =  158 bits (400), Expect = 6e-36
 Identities = 112/366 (30%), Positives = 178/366 (48%)
 Frame = +1

Query: 376  DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 555
            +D D  +  +K EL  VE E AK  NEIE+L+    +++N L  ++E L   ++   +  
Sbjct: 19   EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQG 78

Query: 556  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 735
              G               +E+    +S  +D  +  + + S E   FE +ELE Q+ +  
Sbjct: 79   MEG---------------VEEDPCLDSSMNDEDQSNL-MHSNEEQKFEIMELESQIEKNN 122

Query: 736  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 915
                 L+ LD   KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   GL+CQ  I 
Sbjct: 123  IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 182

Query: 916  NIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 1095
            +I E   + H L V++   +    + E+             +K     S           
Sbjct: 183  DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 242

Query: 1096 LNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQXX 1275
            L   V ++Q RI   TLR  +++   K SR+ F+Y  RD+ I  ++ GGI AFIKL Q  
Sbjct: 243  LEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERDETIVAHLVGGIDAFIKLSQGW 301

Query: 1276 XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1455
                        K  D  +  ISL++LC+A E+ N L++  R  L  F+DAVE++LL+Q+
Sbjct: 302  PLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQM 361

Query: 1456 KKSSKS 1473
            +   +S
Sbjct: 362  RLDLQS 367


>ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus
            sinensis]
          Length = 447

 Score =  155 bits (392), Expect = 5e-35
 Identities = 127/446 (28%), Positives = 214/446 (47%), Gaps = 9/446 (2%)
 Frame = +1

Query: 148  VEEMAHSNAGSKLDLGNRLRTKLVKLQD-DLQGLEEEVTSVSNFDTFNDSHLLR-----L 309
            VE  A  ++ S LDL + LR+++ +L +    G+E+E  +VS+    +  +LL+      
Sbjct: 11   VEATATPSSSSPLDL-HSLRSEVKELMEIHRSGIEDEPNTVSS----DSENLLKEYAHDF 65

Query: 310  QETLADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIA 483
            +  + ++  EY  + F G +D      D  ++ +K EL  VE E +K  NEIE L     
Sbjct: 66   ESKVKEIITEYADVSFLGIED-----LDAYLEHLKEELKTVEAESSKISNEIETLTRTQV 120

Query: 484  KDTNNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKEL-ENKIEQQKSEESKYDDISRF 660
            +D++ L  D+E L   I+   ++ +     D+ A      E+++    +E+    D+ + 
Sbjct: 121  EDSDRLESDLEELNCAIDLIVSEGSQNAKEDRQAVCPARGEDQVCPTHTEDQS--DLIKI 178

Query: 661  EICLLSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLL 840
                   E+  FE LELE Q+ + +     L+ LD   KR DA+  IE+SL+ +K I   
Sbjct: 179  H------EDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFD 232

Query: 841  ENCIKLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXX 1020
              C +L+++T+IP       Q +I ++ E   V H L ++V   +    + E+       
Sbjct: 233  GKCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHI 292

Query: 1021 XXXXHMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQY 1200
                  +K    +  +  SL  +  L   +R +Q RI   TLR  V++   K SR+ F+Y
Sbjct: 293  SDLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANK-SRHFFEY 351

Query: 1201 SSRDQVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILN 1380
              RD++I  ++ GG+ AFIK  Q              K  D  +  ISL+  CR  E  N
Sbjct: 352  FERDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISLKNSDHHSKGISLSFFCRVEEAAN 411

Query: 1381 ELELSRRHRLLPFIDAVEEILLQQVK 1458
             L++  R  L  F+D VE+ILL+Q++
Sbjct: 412  SLDVHIRQNLSSFVDGVEKILLEQMR 437


>ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus
            sinensis]
          Length = 444

 Score =  151 bits (382), Expect = 7e-34
 Identities = 126/446 (28%), Positives = 211/446 (47%), Gaps = 9/446 (2%)
 Frame = +1

Query: 148  VEEMAHSNAGSKLDLGNRLRTKLVKLQD-DLQGLEEEVTSVSNFDTFNDSHLLR-----L 309
            VE  A  ++ S LDL + LR+++ +L +    G+E+E  +VS+    +  +LL+      
Sbjct: 11   VEATATPSSSSPLDL-HSLRSEVKELMEIHRSGIEDEPNTVSS----DSENLLKEYAHDF 65

Query: 310  QETLADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIA 483
            +  + ++  EY  + F G +D      D  ++ +K EL  VE E +K  NEIE L     
Sbjct: 66   ESKVKEIITEYADVSFLGIED-----LDAYLEHLKEELKTVEAESSKISNEIETLTRTQV 120

Query: 484  KDTNNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFE 663
            +D++ L  D+E L   I+   +++            KE    +   + E+      +  +
Sbjct: 121  EDSDRLESDLEELNCAIDLIVSEN-----------AKEDRQAVCPARGEDQVCPTHTEDQ 169

Query: 664  ICLLS-GENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLL 840
              L+   E+  FE LELE Q+ + +     L+ LD   KR DA+  IE+SL+ +K I   
Sbjct: 170  SDLIKIHEDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFD 229

Query: 841  ENCIKLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXX 1020
              C +L+++T+IP       Q +I ++ E   V H L ++V   +    + E+       
Sbjct: 230  GKCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHI 289

Query: 1021 XXXXHMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQY 1200
                  +K    +  +  SL  +  L   +R +Q RI   TLR  V++   K SR+ F+Y
Sbjct: 290  SDLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANK-SRHFFEY 348

Query: 1201 SSRDQVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILN 1380
              RD++I  ++ GG+ AFIK  Q              K  D  +  ISL+  CR  E  N
Sbjct: 349  FERDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISLKNSDHHSKGISLSFFCRVEEAAN 408

Query: 1381 ELELSRRHRLLPFIDAVEEILLQQVK 1458
             L++  R  L  F+D VE+ILL+Q++
Sbjct: 409  SLDVHIRQNLSSFVDGVEKILLEQMR 434


>ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis]
            gi|223542639|gb|EEF44176.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 415

 Score =  149 bits (375), Expect = 5e-33
 Identities = 118/416 (28%), Positives = 196/416 (47%), Gaps = 13/416 (3%)
 Frame = +1

Query: 250  EEVTSVSNFDT-FNDSHLLRLQETLA---DMPLEYLKFSGSDDNVN--DDFDMEIQSVKN 411
            EE+ S  N DT    SH  ++ E  A   +  ++ +    SD N    +D D  ++ +K 
Sbjct: 18   EEIYSGCNGDTEMLSSHSDQVLEDCALHLESKVQQIMSECSDFNFLGIEDLDAFVEHLKE 77

Query: 412  ELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQI 591
            EL+    E AK   EIE L     +D   L  DIE+L   ++F  +              
Sbjct: 78   ELSTTMSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISS-------------- 123

Query: 592  KELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKRQKYGELRMLDDK 771
            K++E + E    E+    D  R         ++ FE  +L+DQ+ + +     L+  D  
Sbjct: 124  KDVEKEKEVACREDLYSTDAHR---------DYEFEISKLDDQIAKSKMILKSLQDFDSV 174

Query: 772  SKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEINNIGESVMVEHVL 951
             KRVDA+  IEE+LS +K I    +CI+L+L+T++P    ++CQ +  +  E   V H L
Sbjct: 175  FKRVDAVEQIEEALSGLKVIEFDGSCIRLSLRTYLPKLDDVMCQHKTEDTAEPSEVNHEL 234

Query: 952  KVKVEKDSTTFLDAELSXXXXXXXXXXHMSK-------YSSDASDKNISLNVNGQLNSLV 1110
             ++V   +    + E+             +K       YS+    +  S      L  LV
Sbjct: 235  LIEVVSGTMELKNVEIFPNDIYISDIVDAAKSFRKEFLYSALTESETRS-----SLGWLV 289

Query: 1111 REIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQXXXXXXX 1290
            R++Q RI  +TLR  V++   K SR  F+Y  RD+ +  ++ GG+ AFIKL Q       
Sbjct: 290  RKVQDRIIQFTLRRLVVKSSNK-SRYSFEYLDRDETVVAHLVGGVDAFIKLSQGWPVSRS 348

Query: 1291 XXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQVK 1458
                   K  +  + +ISL+ LCR  E++N L++  R  LL F++ +E++L++Q++
Sbjct: 349  PLKLISLKSSNHHSKEISLSFLCRVEEVVNSLDIQMRLNLLSFVEVIEKLLVEQMR 404


>ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum]
            gi|557096755|gb|ESQ37263.1| hypothetical protein
            EUTSA_v10002763mg, partial [Eutrema salsugineum]
          Length = 355

 Score =  139 bits (349), Expect = 5e-30
 Identities = 102/368 (27%), Positives = 172/368 (46%), Gaps = 5/368 (1%)
 Frame = +1

Query: 385  DMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDDNVG 564
            D  ++ ++ EL  VE E AK   EIE L+++ A+D++ L  D+E L   ++F  +     
Sbjct: 5    DAYLEYLRKELHSVEAESAKVSEEIERLSSSHAEDSSRLDRDLEGLLLSLDFLSSQ---- 60

Query: 565  GGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLS-----GENFMFEALELEDQLIR 729
                            E QKS+E+     S  E C  S      ++  F+  ELE+Q+  
Sbjct: 61   ----------------EVQKSKENP-PSTSSMERCDASTWIDVNDDEKFKMFELENQIEE 103

Query: 730  KRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFE 909
            KR+    L  LD   KR DA   +E++L+ +K +    N I+L L+T+IP   GL+ Q +
Sbjct: 104  KRRILKSLENLDSVCKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIPKLDGLLGQHK 163

Query: 910  INNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVN 1089
            + +  E   + H L + ++  +T     E+             +         +  L+  
Sbjct: 164  LLHNTEPSELIHELLIDLKDKTTEITKVEMLPNDVYIGDITDAADSFRQIRLHSALLDTR 223

Query: 1090 GQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQ 1269
              L  LV ++Q RI    LR  +++   K  R+ F+Y  +D+ I  ++ GGI AF+K+  
Sbjct: 224  SSLQWLVAKVQERIITTNLRKHIVK-SSKTIRHTFEYYDKDETIVAHITGGIDAFLKVSV 282

Query: 1270 XXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQ 1449
                          K  D  +  ISL+++C+  E+ N L+L  R  L  F+DA+E+IL+Q
Sbjct: 283  GWPLLSTPLKLTSLKNSDNQSNGISLSLICKVEELANSLDLQTRQNLSGFMDAIEKILVQ 342

Query: 1450 QVKKSSKS 1473
            Q ++   S
Sbjct: 343  QTREELHS 350


>ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella]
            gi|482566470|gb|EOA30659.1| hypothetical protein
            CARUB_v10013795mg [Capsella rubella]
          Length = 420

 Score =  139 bits (349), Expect = 5e-30
 Identities = 115/446 (25%), Positives = 206/446 (46%), Gaps = 4/446 (0%)
 Frame = +1

Query: 148  VEEMAHSNAGSKLDLGNRLRTKLVKLQDDLQGLEEEVTSVSNFDTFN--DSHLLRLQETL 321
            +EE  H  +   LDL  ++R+++ +L+   +  + E       D+ N     +L+ +  +
Sbjct: 1    MEEDTHDGS---LDL-QQIRSRVKELESIHRNCKYEPGESCTSDSENLVQDFVLQFETKV 56

Query: 322  ADMPLEYLKFSGSDDNVND--DFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTN 495
             ++  +Y     SD ++ D  D D  ++ ++ EL  VE E AK   EIE L+ + A+D++
Sbjct: 57   NEIVEDY-----SDVDILDVEDSDAYLEYLRKELHSVEAESAKVSEEIERLSRSHAEDSS 111

Query: 496  NLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLL 675
             L  D+E L   ++   + D                      KS+ES     S  E+C +
Sbjct: 112  RLERDLEGLLLSLDSMSSQD--------------------VNKSKESP-PSCSSMEVCEV 150

Query: 676  SGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIK 855
            + ++  F+  ELE+Q+  KR     L  LD   KR DA   +E++L+ +K +    N I+
Sbjct: 151  NDDD-KFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIR 209

Query: 856  LTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXH 1035
            L L+T+IP   GL  Q +  +  +   + H L + ++  +T     E+            
Sbjct: 210  LQLRTYIPELDGLPAQHKFEHTTKPSELIHELLIYLKDKTTEITKLEMFPNDVYIGDIIE 269

Query: 1036 MSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQ 1215
             +         +  L+    +  +V ++Q RI   TLR  ++    K  R+ F+Y  +D+
Sbjct: 270  AADSFRQVRLHSAVLDTRSSVQWVVAKVQDRIITTTLRKYIVT-SSKTMRHTFKYYDKDE 328

Query: 1216 VITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELS 1395
             I  ++ GGI AF+K+                K  D  +  ISL+++C+  E+ N L+L 
Sbjct: 329  TIVAHIAGGIDAFLKVSDGWPLLNSPLKLASLKNSDNQSKGISLSLICKVEELANSLDLQ 388

Query: 1396 RRHRLLPFIDAVEEILLQQVKKSSKS 1473
             R  L  FIDA+E+IL+ Q ++  +S
Sbjct: 389  TRQNLSGFIDAIEKILVHQTREELQS 414


>ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana]
            gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis
            thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein
            [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1|
            putative HAPp48,5 protein [Arabidopsis thaliana]
            gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein
            [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1|
            uncharacterized protein AT3G23910 [Arabidopsis thaliana]
          Length = 421

 Score =  137 bits (345), Expect = 1e-29
 Identities = 98/366 (26%), Positives = 171/366 (46%)
 Frame = +1

Query: 376  DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 555
            +D D  ++ ++NEL  VE E AK   EIE L+ + A+D++ L  D+E L   ++   + D
Sbjct: 73   EDSDAYLEYLRNELQSVEAESAKVSEEIERLSQSHAQDSSRLQRDLEGLLLSLDSMSSQD 132

Query: 556  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 735
                          +E   E Q S  S        E+C +  ++  F+  ELE+Q+  KR
Sbjct: 133  --------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKR 170

Query: 736  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 915
                 L  LD   KR DA   +E++L+ +K +    N I+L L+T+I    G + Q + +
Sbjct: 171  MILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFD 230

Query: 916  NIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 1095
            +I E   + H L + ++  +T     E+             +         +  L+    
Sbjct: 231  HITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSS 290

Query: 1096 LNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQXX 1275
            +  +V ++Q +I   TLR  ++    K  R  F+Y  +D+ I  ++ GGI AF+K+    
Sbjct: 291  VQWVVAKVQDKIISTTLRKYIVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGW 349

Query: 1276 XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1455
                        K  D  +  ISL+++C+  E+ N L+L  R  L  F+DA+E+IL++Q 
Sbjct: 350  PLLNTPLKLASLKNSDNQSKGISLSLICKVEELANSLDLETRQNLSGFMDAIEKILVEQT 409

Query: 1456 KKSSKS 1473
            ++  +S
Sbjct: 410  REELQS 415


>ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp.
            lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein
            ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  137 bits (344), Expect = 2e-29
 Identities = 114/442 (25%), Positives = 206/442 (46%), Gaps = 4/442 (0%)
 Frame = +1

Query: 148  VEEMAHSNAGSKLDLGNRLRTKLVKLQDDLQGLEEEV--TSVSNFDTFNDSHLLRLQETL 321
            +EE  H      LDL   +R+++ +L+   +   +E   +  S+ +T     +L+ +  +
Sbjct: 1    MEEETHDGP---LDL-QEIRSRVKELEFIHRNCRDEPGESCSSDSETLVQDFVLQFEPKV 56

Query: 322  ADMPLEYLKFSGSDDNVND--DFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTN 495
             ++  +Y     SD ++ D  D D  ++ ++ EL  VE E AK   EIE L+ + A+D++
Sbjct: 57   KEIVEDY-----SDVDLLDVEDSDAYLEYLRKELQSVEAESAKVSEEIERLSKSHAQDSS 111

Query: 496  NLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLL 675
             L  D+E L   ++   + D              +E   E Q S  S        E+C +
Sbjct: 112  RLERDLEGLLLSLDSMSSQD--------------VEKSKENQPSSSS-------MEVCEV 150

Query: 676  SGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIK 855
            + ++  F+  ELE+Q+  KR     L  LD   KR DA   +E++L+ +K +    N I+
Sbjct: 151  NDDD-KFKMFELENQMEEKRSILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIR 209

Query: 856  LTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXH 1035
            L L+T+IP    L+ Q +  +  E   + H L + ++  +T     E+            
Sbjct: 210  LQLQTYIPKLDSLLGQQKFEHTTEPSELIHELLIYLKDKTTEITKFEMFPNDVYIGDIIE 269

Query: 1036 MSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQ 1215
             +      S  +  L+    +  +V ++Q RI   TLR  ++    K  R+ F+Y  +D+
Sbjct: 270  AADSFRQVSLHSAVLDTRSSVQWVVAKVQDRIISSTLRKYLVT-SSKTIRHTFEYYEKDE 328

Query: 1216 VITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELS 1395
             I  ++ GGI AF+K+                K  D  +  ISL+++C+  ++ N L+L 
Sbjct: 329  TIVGHIAGGIDAFLKVSNGWPLLNTPLKLESLKNSDNQSKGISLSLICKVEDLANSLDLQ 388

Query: 1396 RRHRLLPFIDAVEEILLQQVKK 1461
             R  L  F+DA+E+IL+QQ ++
Sbjct: 389  TRQNLSGFMDAIEKILVQQTRE 410


>gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao]
          Length = 432

 Score =  134 bits (337), Expect = 1e-28
 Identities = 117/415 (28%), Positives = 190/415 (45%), Gaps = 6/415 (1%)
 Frame = +1

Query: 151  EEMAHSNAGSKLDLGNRLRTKLVKLQD----DLQGLEEEVTSVSNFDTFNDSHLLRLQET 318
            E M  S++   LDL + +R+++ +L +    D    E E  S+++     D  L   +  
Sbjct: 3    EPMEISSSSEALDL-HSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSL-HFESK 60

Query: 319  LADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDT 492
            +  +  EY  + F G +D      D  +  +K EL  VE E AK  NEIE+L+    +++
Sbjct: 61   VKQIIEEYSDVGFLGIED-----LDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEES 115

Query: 493  NNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICL 672
            N L  ++E L   ++   +    G               +E+    +S  +D  +  + +
Sbjct: 116  NILEGNLEGLKYALDSIASQGMEG---------------VEEDPCLDSSMNDEDQSNL-M 159

Query: 673  LSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCI 852
             S E   FE +ELE Q+ +       L+ LD   KR+D L  IE++L+ +K IG   NCI
Sbjct: 160  HSNEEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCI 219

Query: 853  KLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXX 1032
            +L+L+T+IP   GL+CQ  I +I E   + H L V++   +    + E+           
Sbjct: 220  RLSLQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDII 279

Query: 1033 HMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRD 1212
              +K     S           L   V ++Q RI   TLR  +++   K SR+ F+Y  RD
Sbjct: 280  DAAKSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERD 338

Query: 1213 QVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEIL 1377
            + I  ++ GGI AFIKL Q              K  D  +  ISL++LC+A E +
Sbjct: 339  ETIVAHLVGGIDAFIKLSQGWPLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEAI 393


>gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 392

 Score =  133 bits (334), Expect = 3e-28
 Identities = 116/414 (28%), Positives = 189/414 (45%), Gaps = 6/414 (1%)
 Frame = +1

Query: 151  EEMAHSNAGSKLDLGNRLRTKLVKLQD----DLQGLEEEVTSVSNFDTFNDSHLLRLQET 318
            E M  S++   LDL + +R+++ +L +    D    E E  S+++     D  L   +  
Sbjct: 3    EPMEISSSSEALDL-HSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSL-HFESK 60

Query: 319  LADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDT 492
            +  +  EY  + F G +D      D  +  +K EL  VE E AK  NEIE+L+    +++
Sbjct: 61   VKQIIEEYSDVGFLGIED-----LDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEES 115

Query: 493  NNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICL 672
            N L  ++E L   ++   +    G               +E+    +S  +D  +  + +
Sbjct: 116  NILEGNLEGLKYALDSIASQGMEG---------------VEEDPCLDSSMNDEDQSNL-M 159

Query: 673  LSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCI 852
             S E   FE +ELE Q+ +       L+ LD   KR+D L  IE++L+ +K IG   NCI
Sbjct: 160  HSNEEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCI 219

Query: 853  KLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXX 1032
            +L+L+T+IP   GL+CQ  I +I E   + H L V++   +    + E+           
Sbjct: 220  RLSLQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDII 279

Query: 1033 HMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRD 1212
              +K     S           L   V ++Q RI   TLR  +++   K SR+ F+Y  RD
Sbjct: 280  DAAKSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERD 338

Query: 1213 QVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEI 1374
            + I  ++ GGI AFIKL Q              K  D  +  ISL++LC+A  +
Sbjct: 339  ETIVAHLVGGIDAFIKLSQGWPLSKSPLKLLSIKSSDHHSRGISLSLLCKAERV 392


>ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcriptase)-related protein
            [Arabidopsis thaliana] gi|332643359|gb|AEE76880.1|
            RNA-directed DNA polymerase (reverse
            transcriptase)-related protein [Arabidopsis thaliana]
          Length = 746

 Score =  132 bits (333), Expect = 4e-28
 Identities = 101/372 (27%), Positives = 170/372 (45%)
 Frame = +1

Query: 358  SDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIE 537
            SD N+ D +   ++ ++NEL  VE E AK   EIE L+ + A D++ L  D+E L   ++
Sbjct: 395  SDGNLTDAY---LEYLRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLD 451

Query: 538  FWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELED 717
               + D              +E   E Q S  S        E+C +  ++  F+  ELE+
Sbjct: 452  SMSSQD--------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELEN 489

Query: 718  QLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLI 897
            Q+  KR     L  LD   KR DA   +E++L+ +K +    N I+L L+T+I    G +
Sbjct: 490  QMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFL 549

Query: 898  CQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNIS 1077
             Q + ++I E   + H L + ++  +T     E+             +         +  
Sbjct: 550  GQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAV 609

Query: 1078 LNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFI 1257
            L+    +  +V ++Q +I   TLR   +    K  R  F+Y  +D+ I  ++ GGI AF+
Sbjct: 610  LDTRSSVQWVVAKVQDKIISTTLRKDFVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFL 668

Query: 1258 KLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEE 1437
            K+                K  D  +   SL+++ +  E+ N L+L  R  L  F+DAVE+
Sbjct: 669  KVSDGWPLLNTPLKLASLKNSDNQSKGFSLSLISKLEELANSLDLETRQNLSGFMDAVEK 728

Query: 1438 ILLQQVKKSSKS 1473
            IL+QQ ++  KS
Sbjct: 729  ILVQQTREELKS 740


>ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein
            [Arabidopsis thaliana] gi|332643360|gb|AEE76881.1|
            RNA-directed DNA polymerase (reverse
            transcriptase)-related protein [Arabidopsis thaliana]
          Length = 428

 Score =  132 bits (332), Expect = 5e-28
 Identities = 106/407 (26%), Positives = 182/407 (44%)
 Frame = +1

Query: 253  EVTSVSNFDTFNDSHLLRLQETLADMPLEYLKFSGSDDNVNDDFDMEIQSVKNELAIVER 432
            E   V +F    +  +  + E   D+ L  +  +  D N+ D +   ++ ++NEL  VE 
Sbjct: 42   ETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDHTLVDGNLTDAY---LEYLRNELQSVEA 98

Query: 433  EMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKI 612
            E AK   EIE L+ + A D++ L  D+E L   ++   + D              +E   
Sbjct: 99   ESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQD--------------VEKSK 144

Query: 613  EQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDAL 792
            E Q S  S        E+C +  ++  F+  ELE+Q+  KR     L  LD   KR DA 
Sbjct: 145  ENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKRMILKSLEDLDSLRKRFDAA 196

Query: 793  SMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKD 972
              +E++L+ +K +    N I+L L+T+I    G + Q + ++I E   + H L + ++  
Sbjct: 197  EQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDK 256

Query: 973  STTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRN 1152
            +T     E+             +         +  L+    +  +V ++Q +I   TLR 
Sbjct: 257  TTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRK 316

Query: 1153 SVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESA 1332
              +    K  R  F+Y  +D+ I  ++ GGI AF+K+                K  D  +
Sbjct: 317  DFVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASLKNSDNQS 375

Query: 1333 GDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQVKKSSKS 1473
               SL+++ +  E+ N L+L  R  L  F+DAVE+IL+QQ ++  KS
Sbjct: 376  KGFSLSLISKLEELANSLDLETRQNLSGFMDAVEKILVQQTREELKS 422


>dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]
          Length = 421

 Score =  132 bits (331), Expect = 6e-28
 Identities = 99/366 (27%), Positives = 166/366 (45%)
 Frame = +1

Query: 376  DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 555
            D  D  ++ ++NEL  VE E AK   EIE L+ + A D++ L  D+E L   ++   + D
Sbjct: 73   DQTDAYLEYLRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQD 132

Query: 556  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 735
                          +E   E Q S  S        E+C +  ++  F+  ELE+Q+  KR
Sbjct: 133  --------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKR 170

Query: 736  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 915
                 L  LD   KR DA   +E++L+ +K +    N I+L L+T+I    G + Q + +
Sbjct: 171  MILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFD 230

Query: 916  NIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 1095
            +I E   + H L + ++  +T     E+             +         +  L+    
Sbjct: 231  HITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSS 290

Query: 1096 LNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQXX 1275
            +  +V ++Q +I   TLR   +    K  R  F+Y  +D+ I  ++ GGI AF+K+    
Sbjct: 291  VQWVVAKVQDKIISTTLRKDFVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGW 349

Query: 1276 XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1455
                        K  D  +   SL+++ +  E+ N L+L  R  L  F+DAVE+IL+QQ 
Sbjct: 350  PLLNTPLKLASLKNSDNQSKGFSLSLISKLEELANSLDLETRQNLSGFMDAVEKILVQQT 409

Query: 1456 KKSSKS 1473
            ++  KS
Sbjct: 410  REELKS 415


>ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211137 [Cucumis sativus]
            gi|449527675|ref|XP_004170835.1| PREDICTED:
            uncharacterized protein LOC101229419 [Cucumis sativus]
          Length = 414

 Score =  128 bits (322), Expect = 7e-27
 Identities = 112/425 (26%), Positives = 191/425 (44%), Gaps = 1/425 (0%)
 Frame = +1

Query: 184  LDLGNRLRTKLVKLQDDLQGLEEEVTSVSNFDTFNDSHLLRLQETLADMPLEYLKFSGSD 363
            LDL   +R++L +LQ  L+  EE  T     +       L L+  +  +  EY   S  D
Sbjct: 16   LDL-QAVRSELEELQRSLEENEESTTDSLGSEKLLRECALHLESRIQQVLSEY---SNVD 71

Query: 364  DNVN-DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEF 540
              +  DD D  ++ +K EL  VE E +K  NEIE L     +D+N L  D+E+L   ++ 
Sbjct: 72   SFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEVLKRTNIEDSNKLKMDLEVLKLSLDR 131

Query: 541  WQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQ 720
            + + D            +E          E+     ++R        E   FE LELE Q
Sbjct: 132  FPSQDP-----------EEATFNCSSMNGEDPMNVIVNR--------ECNAFEVLELESQ 172

Query: 721  LIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLIC 900
            + + ++    L+ +D+  K +D +  +E ++  +K I + +N I+L+L T IP       
Sbjct: 173  IEKNKKILKSLQEVDEIFKSLDVIEQVEGTIGGMKVIDVADNSIRLSLHTHIPNVEDFST 232

Query: 901  QFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISL 1080
               +  + E   ++H L ++V   +    +AE+           + SK  S++S      
Sbjct: 233  LQRLEGLIEKSELDHELIIEVLDGTMELKNAEIFPADVHLHDIINASKSISNSS------ 286

Query: 1081 NVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIK 1260
                 L   VR++Q RI   TLR   ++   K+  + F+Y  +D++I  ++ GGI A IK
Sbjct: 287  -----LEWFVRKVQDRIVLCTLRRFAVKSANKSCHS-FEYLDQDEMIMCSMIGGIDACIK 340

Query: 1261 LPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEI 1440
            + Q              K  D     +SL+++C+  ++ N L+   R  L  F DAVE+I
Sbjct: 341  VSQGWPLADSPLKLISLKSSDHYTKGVSLSLICKVEKMANSLDAHIRRNLSSFADAVEKI 400

Query: 1441 LLQQV 1455
            L +Q+
Sbjct: 401  LKEQM 405


>gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica]
          Length = 416

 Score =  127 bits (320), Expect = 1e-26
 Identities = 108/426 (25%), Positives = 199/426 (46%), Gaps = 2/426 (0%)
 Frame = +1

Query: 184  LDLGNRLRTKLVKLQDDLQGLEEEVTSVSNFDTFNDSHLLRLQETLADMPLEYLKFSGSD 363
            LDL N ++ ++ +L++ ++   ++    S     +   L+R    L    +E +    SD
Sbjct: 12   LDL-NTIQRQVRELEEIIESCRQD--DASELSPSDSDDLIRNCGLLLQSRVEQIVSECSD 68

Query: 364  DNVNDD--FDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIE 537
              + +D  F+  +   + EL  VE E  K  N IE+L     +D N L  D+  L   ++
Sbjct: 69   VGLLEDQEFEAYVGRFEQELNSVEAESTKVSNGIEDLIRTHGEDFNRLGTDLAQLKCSLD 128

Query: 538  FWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELED 717
            F +         +K+ +  +L   ++  K  +   D ++      ++ + F  E LELE+
Sbjct: 129  FVE---------EKDLEKAKLGADVDYHKCGKDLLDPMN------VNADKF--ELLELEN 171

Query: 718  QLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLI 897
            Q+ +       L+ L+   K +D    IE++++ +K I    NC++L+L+T+IP    L 
Sbjct: 172  QIEKNNIILKSLQDLECTLKWLDNTEQIEDAVTGLKVIAFEGNCVRLSLRTYIPKLEDLF 231

Query: 898  CQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNIS 1077
               ++ +  E   V H L +++ + +    + E+               Y +D  D   S
Sbjct: 232  SPKKVGDATEPSEVNHELLIELLEGTMGLRNVEIFPNDV----------YINDILDAAKS 281

Query: 1078 LNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFI 1257
            L     L   V ++Q RI   T+R  V++++ K SR+  +Y  +D+ +  +V GG+ AFI
Sbjct: 282  LR-KSSLQWFVTKVQDRIVLCTMRRLVVKNENK-SRHSLEYLDKDETVVAHVVGGVDAFI 339

Query: 1258 KLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEE 1437
            K+PQ              K  D+ +  ISL+ LC   E+ N L +  R  L  F+DA+E+
Sbjct: 340  KVPQGWPLLSSPLKLIYLKSSDQHSKGISLSFLCTVQELANSLAVRIRQTLSSFVDAIEK 399

Query: 1438 ILLQQV 1455
            IL++Q+
Sbjct: 400  ILVEQM 405


>ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa]
            gi|222847415|gb|EEE84962.1| hypothetical protein
            POPTR_0001s32530g [Populus trichocarpa]
          Length = 429

 Score =  125 bits (313), Expect = 7e-26
 Identities = 106/436 (24%), Positives = 197/436 (45%), Gaps = 2/436 (0%)
 Frame = +1

Query: 154  EMAHSNAGSKLDLGNRLRTKLVKLQDDLQGLEEEVTSVSNFDTFNDSHLLR--LQETLAD 327
            E++ S     L+L N +R+++ +L++  +    +  S S  ++ +   L++   Q+ ++ 
Sbjct: 2    EISPSTTQESLNL-NTIRSRINELEEIYRDCNAD--SFSEINSSDSDELMKDSAQQLVSK 58

Query: 328  MPLEYLKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTE 507
            +     ++S       +D D  +  +K EL   E E AK  NEIE L     +D++ L  
Sbjct: 59   VSQTVTEYSDFSFLGIEDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELEN 118

Query: 508  DIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGEN 687
            D+E +   ++   +         ++ + ++ + ++E   S E++ + I+       + E 
Sbjct: 119  DLEWMKCSLDLISSQ--------RDREKEKGDEQMEHFSSGENQSNLIN-------TNEE 163

Query: 688  FMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLK 867
              FE L+L++Q+    +    ++ LD   K  DA+  IE+ LS +K I     CI+L+L+
Sbjct: 164  NKFEILKLDNQIEESTRILKSMQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLR 223

Query: 868  TFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKY 1047
            T+IP +  L  Q +I        + H   ++V   S      E+             +K 
Sbjct: 224  TYIPKQDVLFLQ-KIEETNVPYEINHEFLIEVTNGSMEIKKVEMFPNDIYIGDIVDAAKS 282

Query: 1048 SSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITV 1227
                      +  +  L   VR+ Q RI   TLR  V       SR   +Y  RD++I  
Sbjct: 283  FRQMFLHLALMETSSSLEWFVRKAQDRIIQSTLRRLVAR-SASTSRQSIEYLDRDEIIVA 341

Query: 1228 NVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHR 1407
            ++ GG+ AF+++ Q              K  +  A +ISL  LC+  E  N L++  R  
Sbjct: 342  HMVGGVDAFMEVSQGWPITNSPLKLVSLKNSNHHAKEISLGFLCKVEEAANSLDVHTRQN 401

Query: 1408 LLPFIDAVEEILLQQV 1455
            L  F+D+VE+IL++Q+
Sbjct: 402  LSSFVDSVEKILVEQM 417


>gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508713298|gb|EOY05195.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 369

 Score =  122 bits (307), Expect = 4e-25
 Identities = 108/379 (28%), Positives = 176/379 (46%), Gaps = 6/379 (1%)
 Frame = +1

Query: 151  EEMAHSNAGSKLDLGNRLRTKLVKLQD----DLQGLEEEVTSVSNFDTFNDSHLLRLQET 318
            E M  S++   LDL + +R+++ +L +    D    E E  S+++     D  L   +  
Sbjct: 3    EPMEISSSSEALDL-HSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSL-HFESK 60

Query: 319  LADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDT 492
            +  +  EY  + F G +D      D  +  +K EL  VE E AK  NEIE+L+    +++
Sbjct: 61   VKQIIEEYSDVGFLGIED-----LDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEES 115

Query: 493  NNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICL 672
            N L  ++E L   ++   +    G               +E+    +S  +D  +  + +
Sbjct: 116  NILEGNLEGLKYALDSIASQGMEG---------------VEEDPCLDSSMNDEDQSNL-M 159

Query: 673  LSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCI 852
             S E   FE +ELE Q+ +       L+ LD   KR+D L  IE++L+ +K IG   NCI
Sbjct: 160  HSNEEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCI 219

Query: 853  KLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXX 1032
            +L+L+T+IP   GL+CQ  I +I E   + H L V++   +    + E+           
Sbjct: 220  RLSLQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDII 279

Query: 1033 HMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRD 1212
              +K     S           L   V ++Q RI   TLR  +++   K SR+ F+Y  RD
Sbjct: 280  DAAKSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERD 338

Query: 1213 QVITVNVFGGIKAFIKLPQ 1269
            + I  ++ GGI AFIKL Q
Sbjct: 339  ETIVAHLVGGIDAFIKLSQ 357


>gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theobroma cacao]
          Length = 343

 Score =  122 bits (305), Expect = 6e-25
 Identities = 90/298 (30%), Positives = 142/298 (47%)
 Frame = +1

Query: 376  DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 555
            +D D  +  +K EL  VE E AK  NEIE+L+    +++N L  ++E L   ++   +  
Sbjct: 51   EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQG 110

Query: 556  NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 735
              G               +E+    +S  +D  +  + + S E   FE +ELE Q+ +  
Sbjct: 111  MEG---------------VEEDPCLDSSMNDEDQSNL-MHSNEEQKFEIMELESQIEKNN 154

Query: 736  QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 915
                 L+ LD   KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   GL+CQ  I 
Sbjct: 155  IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 214

Query: 916  NIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 1095
            +I E   + H L V++   +    + E+             +K     S           
Sbjct: 215  DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 274

Query: 1096 LNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQ 1269
            L   V ++Q RI   TLR  +++   K SR+ F+Y  RD+ I  ++ GGI AFIKL Q
Sbjct: 275  LEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERDETIVAHLVGGIDAFIKLSQ 331


Top