BLASTX nr result

ID: Akebia24_contig00028311 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00028311
         (1483 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245...   419   e-114
ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma...   400   e-109
ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [...   388   e-105
ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm...   377   e-102
ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620...   373   e-100
ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620...   369   3e-99
ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma...   361   4e-97
ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma...   359   2e-96
ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu...   344   5e-92
ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prun...   342   3e-91
ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps...   335   4e-89
ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab...   334   5e-89
ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ...   325   2e-86
ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244...   325   2e-86
ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma...   321   6e-85
ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592...   319   2e-84
ref|XP_007034273.1| Uncharacterized protein isoform 7, partial [...   318   3e-84
dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]        315   4e-83
ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcr...   308   5e-81
ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, part...   305   3e-80

>ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera]
            gi|298205214|emb|CBI17273.3| unnamed protein product
            [Vitis vinifera]
          Length = 425

 Score =  419 bits (1078), Expect = e-114
 Identities = 221/390 (56%), Positives = 290/390 (74%), Gaps = 4/390 (1%)
 Frame = -1

Query: 1378 DFTELSSSS--ESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLV 1205
            +++ +S S+  +S  +F++F   L+S++ QI               DAY+ H K+ELNLV
Sbjct: 28   NYSHISDSNPLDSRSLFQEFSHHLQSRVNQILSQYSDVESLEADDLDAYLGHLKKELNLV 87

Query: 1204 EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 1025
            E+EN + SNEIE  TRTY+ED+ +LES+LE L +S+  ++SQGL + E  A V+   S E
Sbjct: 88   ESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVDFVASQGLKRAEAGALVDYSSSVE 147

Query: 1024 NQ-GSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 848
            +Q  S  AH D NFE+L+L+++ +KNK+ L SL DLDYTFKR E++ KIED L GLKVI+
Sbjct: 148  DQLDSRTAHGDNNFEILDLNYQTQKNKITLKSLQDLDYTFKRFEAIEKIEDALTGLKVID 207

Query: 847  FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 668
            FEGNCI+LSL T++P LEGLLC++K+E   +P  ++HEL IE+MD +MELKN EIFP+D+
Sbjct: 208  FEGNCIRLSLSTFIPNLEGLLCEEKIEAVNEPSELNHELLIEVMDQSMELKNVEIFPNDV 267

Query: 667  FIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 491
            ++GEIIDAAK+ R+ +  +S+ E RSSLEW VRKVQ +IIL  LRQ +VK AN SRH  E
Sbjct: 268  YLGEIIDAAKSSRKLFSHMSILETRSSLEWFVRKVQDKIILCALRQSIVKGANKSRHSLE 327

Query: 490  YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 311
            Y DRD  I AHMVGG+DA+IK+ Q WPV N+ALKL SL  SS+  SKGISLSFLCKVEE+
Sbjct: 328  YLDRDEIIVAHMVGGVDAYIKVCQGWPVSNNALKLKSL-KSSDQQSKGISLSFLCKVEEM 386

Query: 310  VNSLDVKTRQNLSSFADAIEEILVQQMRSE 221
             NSLDV  R+N+SSF DAIEEILVQQM+S+
Sbjct: 387  ANSLDVSIRKNISSFVDAIEEILVQQMQSK 416


>ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508713296|gb|EOY05193.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 430

 Score =  400 bits (1028), Expect = e-109
 Identities = 213/428 (49%), Positives = 298/428 (69%), Gaps = 3/428 (0%)
 Frame = -1

Query: 1474 ESMENSHSSERVDIETXXXXXXXXXXXXXSCNDFTELSSSS-ESVKIFKDFFLDLESKIK 1298
            E ME S SSE +D+ +                +  E  + S  S K+ KD  L  ESK+K
Sbjct: 3    EPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVK 62

Query: 1297 QITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNL 1118
            QI               D Y+ H KEELN VEAE+ + SNEIE  +R +IE++  LE NL
Sbjct: 63   QIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNL 122

Query: 1117 EGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVA 941
            EGL Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++++  FE++EL+ +IEKN + 
Sbjct: 123  EGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNII 182

Query: 940  LSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYA 761
            L SL DLD  FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E  
Sbjct: 183  LKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDI 242

Query: 760  TDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLE 584
            ++P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ    L+V + +SSLE
Sbjct: 243  SEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLE 302

Query: 583  WVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVL 404
            W V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+ 
Sbjct: 303  WFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLS 362

Query: 403  NSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRS 224
             S LKL+S+  SS++HS+GISLS LCK EE+ NSLD+  RQNLS+F DA+E++L++QMR 
Sbjct: 363  KSPLKLLSI-KSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQMRL 421

Query: 223  ERQSGHIS 200
            + QS   S
Sbjct: 422  DLQSDDAS 429


>ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
            gi|508713299|gb|EOY05196.1| Uncharacterized protein
            isoform 4, partial [Theobroma cacao]
          Length = 372

 Score =  388 bits (996), Expect = e-105
 Identities = 197/372 (52%), Positives = 276/372 (74%), Gaps = 2/372 (0%)
 Frame = -1

Query: 1309 SKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRL 1130
            SK+KQI               D Y+ H KEELN VEAE+ + SNEIE  +R +IE++  L
Sbjct: 1    SKVKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNIL 60

Query: 1129 ESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEK 953
            E NLEGL Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++++  FE++EL+ +IEK
Sbjct: 61   EGNLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEK 120

Query: 952  NKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQK 773
            N + L SL DLD  FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ 
Sbjct: 121  NNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKT 180

Query: 772  MEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPR 596
            +E  ++P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ    L+V + +
Sbjct: 181  IEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQ 240

Query: 595  SSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQS 416
            SSLEW V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI AH+VGGIDAFIK+ Q 
Sbjct: 241  SSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQG 300

Query: 415  WPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQ 236
            WP+  S LKL+S+  SS++HS+GISLS LCK EE+ NSLD+  RQNLS+F DA+E++L++
Sbjct: 301  WPLSKSPLKLLSI-KSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLE 359

Query: 235  QMRSERQSGHIS 200
            QMR + QS   S
Sbjct: 360  QMRLDLQSDDAS 371


>ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis]
            gi|223542639|gb|EEF44176.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 415

 Score =  377 bits (968), Expect = e-102
 Identities = 206/394 (52%), Positives = 276/394 (70%), Gaps = 3/394 (0%)
 Frame = -1

Query: 1384 CNDFTELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLV 1205
            CN  TE+ SS  S ++ +D  L LESK++QI               DA+VEH KEEL+  
Sbjct: 24   CNGDTEMLSS-HSDQVLEDCALHLESKVQQIMSECSDFNFLGIEDLDAFVEHLKEELSTT 82

Query: 1204 EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 1025
             +E  + S EIE   R ++ED TRLES++E L  SL  ISS+ ++K +  A  E   ST+
Sbjct: 83   MSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISSKDVEKEKEVACREDLYSTD 142

Query: 1024 NQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEF 845
                  AH+DY FE+ +LD +I K+K+ L SL D D  FKR+++V +IE+ L GLKVIEF
Sbjct: 143  ------AHRDYEFEISKLDDQIAKSKMILKSLQDFDSVFKRVDAVEQIEEALSGLKVIEF 196

Query: 844  EGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIF 665
            +G+CI+LSL+TY+P L+ ++CQ K E   +P  V+HEL IE++ GTMELKN EIFP+DI+
Sbjct: 197  DGSCIRLSLRTYLPKLDDVMCQHKTEDTAEPSEVNHELLIEVVSGTMELKNVEIFPNDIY 256

Query: 664  IGEIIDAAKALRQ---YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWF 494
            I +I+DAAK+ R+   Y  L+ SE RSSL W+VRKVQ +II  TLR+L+VK +N SR+ F
Sbjct: 257  ISDIVDAAKSFRKEFLYSALTESETRSSLGWLVRKVQDRIIQFTLRRLVVKSSNKSRYSF 316

Query: 493  EYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEE 314
            EY DRD T+ AH+VGG+DAFIK+ Q WPV  S LKL+SL SS+ +HSK ISLSFLC+VEE
Sbjct: 317  EYLDRDETVVAHLVGGVDAFIKLSQGWPVSRSPLKLISLKSSN-HHSKEISLSFLCRVEE 375

Query: 313  LVNSLDVKTRQNLSSFADAIEEILVQQMRSERQS 212
            +VNSLD++ R NL SF + IE++LV+QMR E  S
Sbjct: 376  VVNSLDIQMRLNLLSFVEVIEKLLVEQMRIELHS 409


>ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus
            sinensis]
          Length = 447

 Score =  373 bits (957), Expect = e-100
 Identities = 201/390 (51%), Positives = 270/390 (69%), Gaps = 11/390 (2%)
 Frame = -1

Query: 1357 SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSN 1178
            SS+S  + K++  D ESK+K+I               DAY+EH KEEL  VEAE+ + SN
Sbjct: 51   SSDSENLLKEYAHDFESKVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISN 110

Query: 1177 EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMD------ASVEGFIS---TE 1025
            EIE  TRT +ED+ RLES+LE LN ++  I S+G    + D      A  E  +    TE
Sbjct: 111  EIETLTRTQVEDSDRLESDLEELNCAIDLIVSEGSQNAKEDRQAVCPARGEDQVCPTHTE 170

Query: 1024 NQGSSYA-HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 848
            +Q      H+D+ FE+LEL+ +IEKNK+ L+SL DLD+  KR ++V +IED+L GLKVI+
Sbjct: 171  DQSDLIKIHEDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVID 230

Query: 847  FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 668
            F+G C +LS++TY+PTLE    Q K+E   +P  V+HEL IE++DGTME+KN E+FP+D+
Sbjct: 231  FDGKCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDV 290

Query: 667  FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 491
             I +++DAAK+ RQ    L   E  SSL+W +R VQ +IILSTLR+ +VK AN SRH+FE
Sbjct: 291  HISDLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANKSRHFFE 350

Query: 490  YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 311
            Y +RD  I AH+VGG+DAFIK  Q WP+ NS LK++SL  +S++HSKGISLSF C+VEE 
Sbjct: 351  YFERDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISL-KNSDHHSKGISLSFFCRVEEA 409

Query: 310  VNSLDVKTRQNLSSFADAIEEILVQQMRSE 221
             NSLDV  RQNLSSF D +E+IL++QMR E
Sbjct: 410  ANSLDVHIRQNLSSFVDGVEKILLEQMRVE 439


>ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus
            sinensis]
          Length = 444

 Score =  369 bits (946), Expect = 3e-99
 Identities = 199/387 (51%), Positives = 268/387 (69%), Gaps = 8/387 (2%)
 Frame = -1

Query: 1357 SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSN 1178
            SS+S  + K++  D ESK+K+I               DAY+EH KEEL  VEAE+ + SN
Sbjct: 51   SSDSENLLKEYAHDFESKVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISN 110

Query: 1177 EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEM---DASVEGFIS---TENQG 1016
            EIE  TRT +ED+ RLES+LE LN ++  I S+   +       A  E  +    TE+Q 
Sbjct: 111  EIETLTRTQVEDSDRLESDLEELNCAIDLIVSENAKEDRQAVCPARGEDQVCPTHTEDQS 170

Query: 1015 SSYA-HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEG 839
                 H+D+ FE+LEL+ +IEKNK+ L+SL DLD+  KR ++V +IED+L GLKVI+F+G
Sbjct: 171  DLIKIHEDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDG 230

Query: 838  NCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIG 659
             C +LS++TY+PTLE    Q K+E   +P  V+HEL IE++DGTME+KN E+FP+D+ I 
Sbjct: 231  KCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHIS 290

Query: 658  EIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSD 482
            +++DAAK+ RQ    L   E  SSL+W +R VQ +IILSTLR+ +VK AN SRH+FEY +
Sbjct: 291  DLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANKSRHFFEYFE 350

Query: 481  RDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNS 302
            RD  I AH+VGG+DAFIK  Q WP+ NS LK++SL  +S++HSKGISLSF C+VEE  NS
Sbjct: 351  RDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISL-KNSDHHSKGISLSFFCRVEEAANS 409

Query: 301  LDVKTRQNLSSFADAIEEILVQQMRSE 221
            LDV  RQNLSSF D +E+IL++QMR E
Sbjct: 410  LDVHIRQNLSSFVDGVEKILLEQMRVE 436


>ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508713301|gb|EOY05198.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 432

 Score =  361 bits (927), Expect = 4e-97
 Identities = 193/392 (49%), Positives = 270/392 (68%), Gaps = 3/392 (0%)
 Frame = -1

Query: 1474 ESMENSHSSERVDIETXXXXXXXXXXXXXSCNDFTELSSSS-ESVKIFKDFFLDLESKIK 1298
            E ME S SSE +D+ +                +  E  + S  S K+ KD  L  ESK+K
Sbjct: 3    EPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVK 62

Query: 1297 QITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNL 1118
            QI               D Y+ H KEELN VEAE+ + SNEIE  +R +IE++  LE NL
Sbjct: 63   QIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNL 122

Query: 1117 EGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVA 941
            EGL Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++++  FE++EL+ +IEKN + 
Sbjct: 123  EGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNII 182

Query: 940  LSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYA 761
            L SL DLD  FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E  
Sbjct: 183  LKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDI 242

Query: 760  TDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLE 584
            ++P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ    L+V + +SSLE
Sbjct: 243  SEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLE 302

Query: 583  WVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVL 404
            W V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+ 
Sbjct: 303  WFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLS 362

Query: 403  NSALKLVSLNSSSENHSKGISLSFLCKVEELV 308
             S LKL+S+  SS++HS+GISLS LCK EE +
Sbjct: 363  KSPLKLLSI-KSSDHHSRGISLSLLCKAEEAI 393


>ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508713300|gb|EOY05197.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 392

 Score =  359 bits (921), Expect = 2e-96
 Identities = 192/391 (49%), Positives = 269/391 (68%), Gaps = 3/391 (0%)
 Frame = -1

Query: 1474 ESMENSHSSERVDIETXXXXXXXXXXXXXSCNDFTELSSSS-ESVKIFKDFFLDLESKIK 1298
            E ME S SSE +D+ +                +  E  + S  S K+ KD  L  ESK+K
Sbjct: 3    EPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVK 62

Query: 1297 QITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNL 1118
            QI               D Y+ H KEELN VEAE+ + SNEIE  +R +IE++  LE NL
Sbjct: 63   QIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNL 122

Query: 1117 EGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVA 941
            EGL Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++++  FE++EL+ +IEKN + 
Sbjct: 123  EGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNII 182

Query: 940  LSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYA 761
            L SL DLD  FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E  
Sbjct: 183  LKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDI 242

Query: 760  TDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLE 584
            ++P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ    L+V + +SSLE
Sbjct: 243  SEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLE 302

Query: 583  WVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVL 404
            W V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+ 
Sbjct: 303  WFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLS 362

Query: 403  NSALKLVSLNSSSENHSKGISLSFLCKVEEL 311
             S LKL+S+  SS++HS+GISLS LCK E +
Sbjct: 363  KSPLKLLSI-KSSDHHSRGISLSLLCKAERV 392


>ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa]
            gi|222847415|gb|EEE84962.1| hypothetical protein
            POPTR_0001s32530g [Populus trichocarpa]
          Length = 429

 Score =  344 bits (883), Expect = 5e-92
 Identities = 196/421 (46%), Positives = 277/421 (65%), Gaps = 5/421 (1%)
 Frame = -1

Query: 1459 SHSSERVDIETXXXXXXXXXXXXXSCN--DFTELSSSSESVKIFKDFFLDLESKIKQITX 1286
            S + E +++ T              CN   F+E++SS +S ++ KD    L SK+ Q   
Sbjct: 6    STTQESLNLNTIRSRINELEEIYRDCNADSFSEINSS-DSDELMKDSAQQLVSKVSQTVT 64

Query: 1285 XXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLN 1106
                         DAY+ H KEEL+  EAE+ + SNEIE+  RT +ED++ LE++LE + 
Sbjct: 65   EYSDFSFLGIEDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELENDLEWMK 124

Query: 1105 YSLQSISSQ-GLDKLEMDASVEGFISTENQGSSY-AHQDYNFELLELDHEIEKNKVALSS 932
             SL  ISSQ   +K + D  +E F S ENQ +    +++  FE+L+LD++IE++   L S
Sbjct: 125  CSLDLISSQRDREKEKGDEQMEHFSSGENQSNLINTNEENKFEILKLDNQIEESTRILKS 184

Query: 931  LHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDP 752
            + DLD   K  +++ +IED L GLKVIEF+G CI+LSL+TY+P  + +L  QK+E    P
Sbjct: 185  MQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLRTYIPK-QDVLFLQKIEETNVP 243

Query: 751  FAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVV 575
            + ++HE  IE+ +G+ME+K  E+FP+DI+IG+I+DAAK+ RQ +  L++ E  SSLEW V
Sbjct: 244  YEINHEFLIEVTNGSMEIKKVEMFPNDIYIGDIVDAAKSFRQMFLHLALMETSSSLEWFV 303

Query: 574  RKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSA 395
            RK Q +II STLR+L+ + A+ SR   EY DRD  I AHMVGG+DAF+++ Q WP+ NS 
Sbjct: 304  RKAQDRIIQSTLRRLVARSASTSRQSIEYLDRDEIIVAHMVGGVDAFMEVSQGWPITNSP 363

Query: 394  LKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQ 215
            LKLVSL +S+ +H+K ISL FLCKVEE  NSLDV TRQNLSSF D++E+ILV+QM  E  
Sbjct: 364  LKLVSLKNSN-HHAKEISLGFLCKVEEAANSLDVHTRQNLSSFVDSVEKILVEQMHLELH 422

Query: 214  S 212
            S
Sbjct: 423  S 423


>ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica]
            gi|462422632|gb|EMJ26895.1| hypothetical protein
            PRUPE_ppa006350mg [Prunus persica]
          Length = 416

 Score =  342 bits (877), Expect = 3e-91
 Identities = 195/413 (47%), Positives = 271/413 (65%), Gaps = 2/413 (0%)
 Frame = -1

Query: 1453 SSERVDIETXXXXXXXXXXXXXSC--NDFTELSSSSESVKIFKDFFLDLESKIKQITXXX 1280
            SSE +D+ T             SC  +D +ELS S +S  + ++  L L+S+++QI    
Sbjct: 8    SSEPLDLNTIQRQVRELEEIIESCRQDDASELSPS-DSDDLIRNCGLLLQSRVEQIVSEC 66

Query: 1279 XXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYS 1100
                       +AYV   ++ELN VEAE+ + SN IE   RT+ ED  RL ++L  L  S
Sbjct: 67   SDVGLLEDQEFEAYVGRFEQELNSVEAESTKVSNGIEDLIRTHGEDFNRLGTDLAQLKCS 126

Query: 1099 LQSISSQGLDKLEMDASVEGFISTENQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDL 920
            L  +  + L+K ++ A V+     ++           FELLEL+++IEKN + L SL DL
Sbjct: 127  LDFVEEKDLEKAKLGADVDYHKCGKDLLDPMNVNADKFELLELENQIEKNNIILKSLQDL 186

Query: 919  DYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVD 740
            + T K +++  +IED + GLKVI FEGNC++LSL+TY+P LE L   +K+  AT+P  V+
Sbjct: 187  ECTLKWLDNTEQIEDAVTGLKVIAFEGNCVRLSLRTYIPKLEDLFSPKKVGDATEPSEVN 246

Query: 739  HELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCPLSVSEPRSSLEWVVRKVQH 560
            HEL IEL++GTM L+N EIFP+D++I +I+DAAK+LR          +SSL+W V KVQ 
Sbjct: 247  HELLIELLEGTMGLRNVEIFPNDVYINDILDAAKSLR----------KSSLQWFVTKVQD 296

Query: 559  QIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVS 380
            +I+L T+R+L+VK+ N SRH  EY D+D T+ AH+VGG+DAFIK+PQ WP+L+S LKL+ 
Sbjct: 297  RIVLCTMRRLVVKNENKSRHSLEYLDKDETVVAHVVGGVDAFIKVPQGWPLLSSPLKLIY 356

Query: 379  LNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSE 221
            L  SS+ HSKGISLSFLC V+EL NSL V+ RQ LSSF DAIE+ILV+QM SE
Sbjct: 357  L-KSSDQHSKGISLSFLCTVQELANSLAVRIRQTLSSFVDAIEKILVEQMCSE 408


>ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella]
            gi|482566470|gb|EOA30659.1| hypothetical protein
            CARUB_v10013795mg [Capsella rubella]
          Length = 420

 Score =  335 bits (858), Expect = 4e-89
 Identities = 183/389 (47%), Positives = 258/389 (66%), Gaps = 1/389 (0%)
 Frame = -1

Query: 1363 SSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRT 1184
            S +S+S  + +DF L  E+K+ +I               DAY+E+ ++EL+ VEAE+ + 
Sbjct: 36   SCTSDSENLVQDFVLQFETKVNEIVEDYSDVDILDVEDSDAYLEYLRKELHSVEAESAKV 95

Query: 1183 SNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSSYA 1004
            S EIE  +R++ ED++RLE +LEGL  SL S+SSQ ++K     S E   S  +      
Sbjct: 96   SEEIERLSRSHAEDSSRLERDLEGLLLSLDSMSSQDVNK-----SKESPPSCSSMEVCEV 150

Query: 1003 HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKL 824
            + D  F++ EL++++E+ ++ L SL DLD   KR ++  ++ED L GLKV+EF+GN I+L
Sbjct: 151  NDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRL 210

Query: 823  SLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDA 644
             L+TY+P L+GL  Q K E+ T P  + HEL I L D T E+   E+FP+D++IG+II+A
Sbjct: 211  QLRTYIPELDGLPAQHKFEHTTKPSELIHELLIYLKDKTTEITKLEMFPNDVYIGDIIEA 270

Query: 643  AKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATI 467
            A + RQ     +V + RSS++WVV KVQ +II +TLR+ +V  +   RH F+Y D+D TI
Sbjct: 271  ADSFRQVRLHSAVLDTRSSVQWVVAKVQDRIITTTLRKYIVTSSKTMRHTFKYYDKDETI 330

Query: 466  TAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKT 287
             AH+ GGIDAF+K+   WP+LNS LKL SL  +S+N SKGISLS +CKVEEL NSLD++T
Sbjct: 331  VAHIAGGIDAFLKVSDGWPLLNSPLKLASL-KNSDNQSKGISLSLICKVEELANSLDLQT 389

Query: 286  RQNLSSFADAIEEILVQQMRSERQSGHIS 200
            RQNLS F DAIE+ILV Q R E QS   S
Sbjct: 390  RQNLSGFIDAIEKILVHQTREELQSNDSS 418


>ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp.
            lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein
            ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  334 bits (857), Expect = 5e-89
 Identities = 187/395 (47%), Positives = 260/395 (65%), Gaps = 7/395 (1%)
 Frame = -1

Query: 1384 CNDFTELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLV 1205
            C D    S SS+S  + +DF L  E K+K+I               DAY+E+ ++EL  V
Sbjct: 29   CRDEPGESCSSDSETLVQDFVLQFEPKVKEIVEDYSDVDLLDVEDSDAYLEYLRKELQSV 88

Query: 1204 EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 1025
            EAE+ + S EIE  ++++ +D++RLE +LEGL  SL S+SSQ ++K           S E
Sbjct: 89   EAESAKVSEEIERLSKSHAQDSSRLERDLEGLLLSLDSMSSQDVEK-----------SKE 137

Query: 1024 NQGSSYA------HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMG 863
            NQ SS +      + D  F++ EL++++E+ +  L SL DLD   KR ++  ++ED L G
Sbjct: 138  NQPSSSSMEVCEVNDDDKFKMFELENQMEEKRSILKSLEDLDSLRKRFDAAEQVEDALTG 197

Query: 862  LKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEI 683
            LKV+EF+GN I+L L+TY+P L+ LL QQK E+ T+P  + HEL I L D T E+   E+
Sbjct: 198  LKVLEFDGNFIRLQLQTYIPKLDSLLGQQKFEHTTEPSELIHELLIYLKDKTTEITKFEM 257

Query: 682  FPHDIFIGEIIDAAKALRQYCPLS-VSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNS 506
            FP+D++IG+II+AA + RQ    S V + RSS++WVV KVQ +II STLR+ LV  +   
Sbjct: 258  FPNDVYIGDIIEAADSFRQVSLHSAVLDTRSSVQWVVAKVQDRIISSTLRKYLVTSSKTI 317

Query: 505  RHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLC 326
            RH FEY ++D TI  H+ GGIDAF+K+   WP+LN+ LKL SL  +S+N SKGISLS +C
Sbjct: 318  RHTFEYYEKDETIVGHIAGGIDAFLKVSNGWPLLNTPLKLESL-KNSDNQSKGISLSLIC 376

Query: 325  KVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSE 221
            KVE+L NSLD++TRQNLS F DAIE+ILVQQ R E
Sbjct: 377  KVEDLANSLDLQTRQNLSGFMDAIEKILVQQTREE 411


>ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana]
            gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis
            thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein
            [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1|
            putative HAPp48,5 protein [Arabidopsis thaliana]
            gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein
            [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1|
            uncharacterized protein AT3G23910 [Arabidopsis thaliana]
          Length = 421

 Score =  325 bits (834), Expect = 2e-86
 Identities = 182/397 (45%), Positives = 258/397 (64%), Gaps = 7/397 (1%)
 Frame = -1

Query: 1369 ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENL 1190
            E  SS     + +DF L  E K+K+I               DAY+E+ + EL  VEAE+ 
Sbjct: 35   ESCSSDYETLVVQDFVLQFEPKVKEIVEEYGDVDLLDVEDSDAYLEYLRNELQSVEAESA 94

Query: 1189 RTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSS 1010
            + S EIE  ++++ +D++RL+ +LEGL  SL S+SSQ ++K           S ENQ SS
Sbjct: 95   KVSEEIERLSQSHAQDSSRLQRDLEGLLLSLDSMSSQDVEK-----------SKENQPSS 143

Query: 1009 YAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 848
             + +      D  F++ EL++++E+ ++ L SL DLD   KR ++  ++ED L GLKV+E
Sbjct: 144  SSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLE 203

Query: 847  FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 668
            F+GN I+L L+TY+  L+G L Q K ++ T+P  + HEL I L D T E+   E+FP+DI
Sbjct: 204  FDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDI 263

Query: 667  FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 491
            +IG+II+AA + RQ     +V + RSS++WVV KVQ +II +TLR+ +V  +   R+ FE
Sbjct: 264  YIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKYIVMSSKTIRYTFE 323

Query: 490  YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 311
            Y D+D TI AH+ GGIDAF+K+   WP+LN+ LKL SL  +S+N SKGISLS +CKVEEL
Sbjct: 324  YYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGISLSLICKVEEL 382

Query: 310  VNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 200
             NSLD++TRQNLS F DAIE+ILV+Q R E QS   S
Sbjct: 383  ANSLDLETRQNLSGFMDAIEKILVEQTREELQSNKSS 419


>ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244321 [Solanum
            lycopersicum]
          Length = 415

 Score =  325 bits (834), Expect = 2e-86
 Identities = 180/376 (47%), Positives = 254/376 (67%), Gaps = 2/376 (0%)
 Frame = -1

Query: 1351 ESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEI 1172
            E  K  +D  L  ESK++Q+               D +  + K EL+  EA+N + ++EI
Sbjct: 35   ELKKSLEDCTLQFESKVEQLLCDASEVNFSSDQDLDEFWNYLKNELSTEEAKNAKIADEI 94

Query: 1171 EVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQD 995
            E  +R Y+E  ++L + +EGL+  L+ I S G+++     +       E++G+ S A  +
Sbjct: 95   EGLSREYVEGYSKLVNEVEGLSCLLELIESLGIEQGRALTNFPCSTPGEDKGNLSSAPVE 154

Query: 994  YNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLK 815
            +NF++ EL +++EK+K+ L SL +L+ TF R E++ KIED   GLK+++FEGN I+LSL+
Sbjct: 155  HNFKIFELGNQLEKSKLNLESLEELESTFNRFEAIEKIEDAFSGLKIVQFEGNRIRLSLR 214

Query: 814  TYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKA 635
            T++P LE LL  Q +  A  P   +HEL IEL+DGTMELK+ EIFP+D+ I EI D AK+
Sbjct: 215  TFIPNLENLLHNQTIGVAEPP-EQNHELLIELVDGTMELKHVEIFPNDVSISEITDTAKS 273

Query: 634  LRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAH 458
            LRQ Y P+ V E RSSLEW+V++VQ +IILSTLR+ LVK AN+SRH F+Y +R+ TI AH
Sbjct: 274  LRQVYFPVGVLENRSSLEWLVKRVQDRIILSTLRRFLVKSANSSRHSFDYVEREETIVAH 333

Query: 457  MVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQN 278
            MVGGIDAF+K+PQ WP+  S L L+SL SSS+ +S+ ISL+ LCKV E  NSLD   RQ 
Sbjct: 334  MVGGIDAFVKLPQGWPLTCSGLTLMSLKSSSQ-YSQQISLTLLCKVAEAANSLDTNARQT 392

Query: 277  LSSFADAIEEILVQQM 230
            +S F D +EEIL+QQM
Sbjct: 393  ISGFTDRVEEILMQQM 408


>ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590656431|ref|XP_007034269.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508713297|gb|EOY05194.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508713298|gb|EOY05195.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 369

 Score =  321 bits (822), Expect = 6e-85
 Identities = 172/355 (48%), Positives = 242/355 (68%), Gaps = 3/355 (0%)
 Frame = -1

Query: 1474 ESMENSHSSERVDIETXXXXXXXXXXXXXSCNDFTELSSSS-ESVKIFKDFFLDLESKIK 1298
            E ME S SSE +D+ +                +  E  + S  S K+ KD  L  ESK+K
Sbjct: 3    EPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVK 62

Query: 1297 QITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNL 1118
            QI               D Y+ H KEELN VEAE+ + SNEIE  +R +IE++  LE NL
Sbjct: 63   QIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNL 122

Query: 1117 EGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVA 941
            EGL Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++++  FE++EL+ +IEKN + 
Sbjct: 123  EGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNII 182

Query: 940  LSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYA 761
            L SL DLD  FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E  
Sbjct: 183  LKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDI 242

Query: 760  TDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLE 584
            ++P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ    L+V + +SSLE
Sbjct: 243  SEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLE 302

Query: 583  WVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQ 419
            W V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI AH+VGGIDAFIK+ Q
Sbjct: 303  WFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQ 357


>ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592291 [Solanum tuberosum]
          Length = 428

 Score =  319 bits (818), Expect = 2e-84
 Identities = 173/337 (51%), Positives = 239/337 (70%), Gaps = 2/337 (0%)
 Frame = -1

Query: 1234 EHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMD 1055
            ++ K EL+  EA N + ++EIE  +R Y+E  ++L + +EGL+  L+ I S GL++  + 
Sbjct: 87   KYLKNELSTEEANNAKIADEIEGLSREYVEGYSKLVNEIEGLSCPLELIESLGLEQGRVL 146

Query: 1054 ASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIE 878
             +       E++G+ S A  + NF++ EL +++EK+K+ L SL +L+ TF R E++ KIE
Sbjct: 147  TNFPCSTPGEDKGNVSSAPVEQNFKVFELGNQLEKSKLNLKSLEELESTFNRFEAIEKIE 206

Query: 877  DTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMEL 698
            D   GLK++EFEGN I+LSL+T++P LE LL  Q ++ A  P   +HEL IELMDGTMEL
Sbjct: 207  DAFSGLKIVEFEGNRIRLSLRTFIPNLENLLHNQTIDVAEPP-EQNHELLIELMDGTMEL 265

Query: 697  KNAEIFPHDIFIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVK 521
            K+ EIFP+D+ I  I D AK+LRQ Y P+ V E RSSLEW V+ VQ +I+LSTLR+ LVK
Sbjct: 266  KHVEIFPNDVSISYITDTAKSLRQVYFPVGVLENRSSLEWFVKGVQDRIVLSTLRRFLVK 325

Query: 520  DANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGIS 341
             AN+SRH F+Y DR+ TI AHMVGGIDAFIK+PQ WP+ +S L L+SL SSS+ +S+ IS
Sbjct: 326  SANSSRHSFDYVDREETIVAHMVGGIDAFIKLPQGWPLTSSGLTLMSLKSSSQ-YSQQIS 384

Query: 340  LSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQM 230
            L+ LCKV E+ N LD   RQ +S F D +EEIL+QQM
Sbjct: 385  LTLLCKVAEVANLLDTNERQTISGFTDRVEEILMQQM 421


>ref|XP_007034273.1| Uncharacterized protein isoform 7, partial [Theobroma cacao]
            gi|508713302|gb|EOY05199.1| Uncharacterized protein
            isoform 7, partial [Theobroma cacao]
          Length = 343

 Score =  318 bits (816), Expect = 3e-84
 Identities = 163/315 (51%), Positives = 228/315 (72%), Gaps = 2/315 (0%)
 Frame = -1

Query: 1357 SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSN 1178
            S  S K+ KD  L  ESK+KQI               D Y+ H KEELN VEAE+ + SN
Sbjct: 17   SLNSEKLLKDCSLHFESKVKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISN 76

Query: 1177 EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAH 1001
            EIE  +R +IE++  LE NLEGL Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++
Sbjct: 77   EIEDLSRNHIEESNILEGNLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSN 136

Query: 1000 QDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLS 821
            ++  FE++EL+ +IEKN + L SL DLD  FKR++++ +IED L GLKVI F+GNCI+LS
Sbjct: 137  EEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLS 196

Query: 820  LKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAA 641
            L+TY+P LEGLLCQ+ +E  ++P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAA
Sbjct: 197  LQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAA 256

Query: 640  KALRQYCP-LSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATIT 464
            K+ RQ    L+V + +SSLEW V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI 
Sbjct: 257  KSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIV 316

Query: 463  AHMVGGIDAFIKIPQ 419
            AH+VGGIDAFIK+ Q
Sbjct: 317  AHLVGGIDAFIKLSQ 331


>dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]
          Length = 421

 Score =  315 bits (806), Expect = 4e-83
 Identities = 178/397 (44%), Positives = 254/397 (63%), Gaps = 7/397 (1%)
 Frame = -1

Query: 1369 ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENL 1190
            E  SS     + +DF L  E K+K+I               DAY+E+ + EL  VEAE+ 
Sbjct: 35   ESCSSDYETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDQTDAYLEYLRNELQSVEAESA 94

Query: 1189 RTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSS 1010
            + S EIE  ++++  D++RL+ +LEGL  SL S+SSQ ++K           S ENQ SS
Sbjct: 95   KVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQDVEK-----------SKENQPSS 143

Query: 1009 YAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 848
             + +      D  F++ EL++++E+ ++ L SL DLD   KR ++  ++ED L GLKV+E
Sbjct: 144  SSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLE 203

Query: 847  FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 668
            F+GN I+L L+TY+  L+G L Q K ++ T+P  + HEL I L D T E+   E+FP+DI
Sbjct: 204  FDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDI 263

Query: 667  FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 491
            +IG+II+AA + RQ     +V + RSS++WVV KVQ +II +TLR+  V  +   R+ FE
Sbjct: 264  YIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKDFVMSSKTIRYTFE 323

Query: 490  YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 311
            Y D+D TI AH+ GGIDAF+K+   WP+LN+ LKL SL  +S+N SKG SLS + K+EEL
Sbjct: 324  YYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGFSLSLISKLEEL 382

Query: 310  VNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 200
             NSLD++TRQNLS F DA+E+ILVQQ R E +S   S
Sbjct: 383  ANSLDLETRQNLSGFMDAVEKILVQQTREELKSNESS 419


>ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein
            [Arabidopsis thaliana] gi|332643360|gb|AEE76881.1|
            RNA-directed DNA polymerase (reverse
            transcriptase)-related protein [Arabidopsis thaliana]
          Length = 428

 Score =  308 bits (788), Expect = 5e-81
 Identities = 177/404 (43%), Positives = 253/404 (62%), Gaps = 14/404 (3%)
 Frame = -1

Query: 1369 ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXD-------AYVEHAKEELN 1211
            E  SS     + +DF L  E K+K+I                       AY+E+ + EL 
Sbjct: 35   ESCSSDYETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDHTLVDGNLTDAYLEYLRNELQ 94

Query: 1210 LVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFIS 1031
             VEAE+ + S EIE  ++++  D++RL+ +LEGL  SL S+SSQ ++K           S
Sbjct: 95   SVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQDVEK-----------S 143

Query: 1030 TENQGSSYAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTL 869
             ENQ SS + +      D  F++ EL++++E+ ++ L SL DLD   KR ++  ++ED L
Sbjct: 144  KENQPSSSSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDAL 203

Query: 868  MGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNA 689
             GLKV+EF+GN I+L L+TY+  L+G L Q K ++ T+P  + HEL I L D T E+   
Sbjct: 204  TGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKF 263

Query: 688  EIFPHDIFIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDAN 512
            E+FP+DI+IG+II+AA + RQ     +V + RSS++WVV KVQ +II +TLR+  V  + 
Sbjct: 264  EMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKDFVMSSK 323

Query: 511  NSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSF 332
              R+ FEY D+D TI AH+ GGIDAF+K+   WP+LN+ LKL SL  +S+N SKG SLS 
Sbjct: 324  TIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGFSLSL 382

Query: 331  LCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 200
            + K+EEL NSLD++TRQNLS F DA+E+ILVQQ R E +S   S
Sbjct: 383  ISKLEELANSLDLETRQNLSGFMDAVEKILVQQTREELKSNESS 426


>ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum]
            gi|557096755|gb|ESQ37263.1| hypothetical protein
            EUTSA_v10002763mg, partial [Eutrema salsugineum]
          Length = 355

 Score =  305 bits (781), Expect = 3e-80
 Identities = 164/346 (47%), Positives = 235/346 (67%), Gaps = 2/346 (0%)
 Frame = -1

Query: 1243 AYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKL 1064
            AY+E+ ++EL+ VEAE+ + S EIE  + ++ ED++RL+ +LEGL  SL  +SSQ + K 
Sbjct: 6    AYLEYLRKELHSVEAESAKVSEEIERLSSSHAEDSSRLDRDLEGLLLSLDFLSSQEVQKS 65

Query: 1063 -EMDASVEGFISTENQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVG 887
             E   S       +       + D  F++ EL+++IE+ +  L SL +LD   KR ++  
Sbjct: 66   KENPPSTSSMERCDASTWIDVNDDEKFKMFELENQIEEKRRILKSLENLDSVCKRFDAAE 125

Query: 886  KIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGT 707
            ++ED L GLKV+EF+GN I+L L+TY+P L+GLL Q K+ + T+P  + HEL I+L D T
Sbjct: 126  QVEDALTGLKVLEFDGNFIRLQLRTYIPKLDGLLGQHKLLHNTEPSELIHELLIDLKDKT 185

Query: 706  MELKNAEIFPHDIFIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQL 530
             E+   E+ P+D++IG+I DAA + RQ     ++ + RSSL+W+V KVQ +II + LR+ 
Sbjct: 186  TEITKVEMLPNDVYIGDITDAADSFRQIRLHSALLDTRSSLQWLVAKVQERIITTNLRKH 245

Query: 529  LVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSK 350
            +VK +   RH FEY D+D TI AH+ GGIDAF+K+   WP+L++ LKL SL  +S+N S 
Sbjct: 246  IVKSSKTIRHTFEYYDKDETIVAHITGGIDAFLKVSVGWPLLSTPLKLTSL-KNSDNQSN 304

Query: 349  GISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQS 212
            GISLS +CKVEEL NSLD++TRQNLS F DAIE+ILVQQ R E  S
Sbjct: 305  GISLSLICKVEELANSLDLQTRQNLSGFMDAIEKILVQQTREELHS 350

