BLASTX nr result

ID: Akebia25_contig00003444 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00003444
         (2005 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245...   419   e-114
ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma...   399   e-108
ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [...   388   e-105
ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm...   377   e-101
ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620...   373   e-100
ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620...   369   4e-99
ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma...   360   2e-96
ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma...   357   9e-96
ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu...   344   8e-92
ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prun...   342   4e-91
ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps...   335   6e-89
ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab...   334   8e-89
ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ...   325   4e-86
ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244...   325   4e-86
ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592...   319   3e-84
ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma...   319   3e-84
ref|XP_007034273.1| Uncharacterized protein isoform 7, partial [...   318   4e-84
dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]        315   6e-83
ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcr...   308   8e-81
ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, part...   305   5e-80

>ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera]
            gi|298205214|emb|CBI17273.3| unnamed protein product
            [Vitis vinifera]
          Length = 425

 Score =  419 bits (1078), Expect = e-114
 Identities = 220/390 (56%), Positives = 289/390 (74%), Gaps = 4/390 (1%)
 Frame = +3

Query: 627  DFTELSSSS--ESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLV 800
            +++ +S S+  +S  +F++F   L+S++ QI                AY+ H K+ELNLV
Sbjct: 28   NYSHISDSNPLDSRSLFQEFSHHLQSRVNQILSQYSDVESLEADDLDAYLGHLKKELNLV 87

Query: 801  EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 980
            E+EN + SNEIE  TRTY+ED+ +LES+LE L +S+  ++SQGL + E  A V+   S E
Sbjct: 88   ESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVDFVASQGLKRAEAGALVDYSSSVE 147

Query: 981  NQ-GSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 1157
            +Q  S  AH D NFE+L+L+++ +KNK+ L SL DLDYTFKR E++ KIED L GLKVI+
Sbjct: 148  DQLDSRTAHGDNNFEILDLNYQTQKNKITLKSLQDLDYTFKRFEAIEKIEDALTGLKVID 207

Query: 1158 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 1337
            FEGNCI+LSL T++P LEGLLC++K+E   +P  ++HEL IE+MD +MELKN EIFP+D+
Sbjct: 208  FEGNCIRLSLSTFIPNLEGLLCEEKIEAVNEPSELNHELLIEVMDQSMELKNVEIFPNDV 267

Query: 1338 FIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 1514
            ++GEIIDAAK+ R+ +  +S+ E RSSLEW VRKVQ +IIL  LRQ +VK AN SRH  E
Sbjct: 268  YLGEIIDAAKSSRKLFSHMSILETRSSLEWFVRKVQDKIILCALRQSIVKGANKSRHSLE 327

Query: 1515 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 1694
            Y DRD  I AHMVGG+DA+IK+ Q WPV N+ALKL SL  SS+  SKGISLSFLCKVEE+
Sbjct: 328  YLDRDEIIVAHMVGGVDAYIKVCQGWPVSNNALKLKSL-KSSDQQSKGISLSFLCKVEEM 386

Query: 1695 VNSLDVKTRQNLSSFADAIEEILVQQMRSE 1784
             NSLDV  R+N+SSF DAIEEILVQQM+S+
Sbjct: 387  ANSLDVSIRKNISSFVDAIEEILVQQMQSK 416


>ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508713296|gb|EOY05193.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 430

 Score =  399 bits (1024), Expect = e-108
 Identities = 211/426 (49%), Positives = 296/426 (69%), Gaps = 3/426 (0%)
 Frame = +3

Query: 537  MENSHSSERVDIETXXXXXXXXXXXXXXCNDFTELSSSS-ESVKIFKDFFLDLESKIKQI 713
            ME S SSE +D+ +                +  E  + S  S K+ KD  L  ESK+KQI
Sbjct: 5    MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVKQI 64

Query: 714  TXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEG 893
                             Y+ H KEELN VEAE+ + SNEIE  +R +IE++  LE NLEG
Sbjct: 65   IEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEG 124

Query: 894  LNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALS 1070
            L Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++++  FE++EL+ +IEKN + L 
Sbjct: 125  LKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNIILK 184

Query: 1071 SLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATD 1250
            SL DLD  FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E  ++
Sbjct: 185  SLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDISE 244

Query: 1251 PFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLEWV 1427
            P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ    L+V + +SSLEW 
Sbjct: 245  PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLEWF 304

Query: 1428 VRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNS 1607
            V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+  S
Sbjct: 305  VGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLSKS 364

Query: 1608 ALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSER 1787
             LKL+S+  SS++HS+GISLS LCK EE+ NSLD+  RQNLS+F DA+E++L++QMR + 
Sbjct: 365  PLKLLSI-KSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQMRLDL 423

Query: 1788 QSGHIS 1805
            QS   S
Sbjct: 424  QSDDAS 429


>ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
            gi|508713299|gb|EOY05196.1| Uncharacterized protein
            isoform 4, partial [Theobroma cacao]
          Length = 372

 Score =  388 bits (996), Expect = e-105
 Identities = 196/372 (52%), Positives = 275/372 (73%), Gaps = 2/372 (0%)
 Frame = +3

Query: 696  SKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRL 875
            SK+KQI                 Y+ H KEELN VEAE+ + SNEIE  +R +IE++  L
Sbjct: 1    SKVKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNIL 60

Query: 876  ESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEK 1052
            E NLEGL Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++++  FE++EL+ +IEK
Sbjct: 61   EGNLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEK 120

Query: 1053 NKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQK 1232
            N + L SL DLD  FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ 
Sbjct: 121  NNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKT 180

Query: 1233 MEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPR 1409
            +E  ++P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ    L+V + +
Sbjct: 181  IEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQ 240

Query: 1410 SSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQS 1589
            SSLEW V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI AH+VGGIDAFIK+ Q 
Sbjct: 241  SSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQG 300

Query: 1590 WPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQ 1769
            WP+  S LKL+S+  SS++HS+GISLS LCK EE+ NSLD+  RQNLS+F DA+E++L++
Sbjct: 301  WPLSKSPLKLLSI-KSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLE 359

Query: 1770 QMRSERQSGHIS 1805
            QMR + QS   S
Sbjct: 360  QMRLDLQSDDAS 371


>ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis]
            gi|223542639|gb|EEF44176.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 415

 Score =  377 bits (968), Expect = e-101
 Identities = 205/394 (52%), Positives = 275/394 (69%), Gaps = 3/394 (0%)
 Frame = +3

Query: 621  CNDFTELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLV 800
            CN  TE+ SS  S ++ +D  L LESK++QI                A+VEH KEEL+  
Sbjct: 24   CNGDTEMLSS-HSDQVLEDCALHLESKVQQIMSECSDFNFLGIEDLDAFVEHLKEELSTT 82

Query: 801  EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 980
             +E  + S EIE   R ++ED TRLES++E L  SL  ISS+ ++K +  A  E   ST+
Sbjct: 83   MSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISSKDVEKEKEVACREDLYSTD 142

Query: 981  NQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEF 1160
                  AH+DY FE+ +LD +I K+K+ L SL D D  FKR+++V +IE+ L GLKVIEF
Sbjct: 143  ------AHRDYEFEISKLDDQIAKSKMILKSLQDFDSVFKRVDAVEQIEEALSGLKVIEF 196

Query: 1161 EGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIF 1340
            +G+CI+LSL+TY+P L+ ++CQ K E   +P  V+HEL IE++ GTMELKN EIFP+DI+
Sbjct: 197  DGSCIRLSLRTYLPKLDDVMCQHKTEDTAEPSEVNHELLIEVVSGTMELKNVEIFPNDIY 256

Query: 1341 IGEIIDAAKALRQ---YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWF 1511
            I +I+DAAK+ R+   Y  L+ SE RSSL W+VRKVQ +II  TLR+L+VK +N SR+ F
Sbjct: 257  ISDIVDAAKSFRKEFLYSALTESETRSSLGWLVRKVQDRIIQFTLRRLVVKSSNKSRYSF 316

Query: 1512 EYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEE 1691
            EY DRD T+ AH+VGG+DAFIK+ Q WPV  S LKL+SL SS+ +HSK ISLSFLC+VEE
Sbjct: 317  EYLDRDETVVAHLVGGVDAFIKLSQGWPVSRSPLKLISLKSSN-HHSKEISLSFLCRVEE 375

Query: 1692 LVNSLDVKTRQNLSSFADAIEEILVQQMRSERQS 1793
            +VNSLD++ R NL SF + IE++LV+QMR E  S
Sbjct: 376  VVNSLDIQMRLNLLSFVEVIEKLLVEQMRIELHS 409


>ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus
            sinensis]
          Length = 447

 Score =  373 bits (957), Expect = e-100
 Identities = 200/390 (51%), Positives = 269/390 (68%), Gaps = 11/390 (2%)
 Frame = +3

Query: 648  SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSN 827
            SS+S  + K++  D ESK+K+I                AY+EH KEEL  VEAE+ + SN
Sbjct: 51   SSDSENLLKEYAHDFESKVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISN 110

Query: 828  EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMD------ASVEGFIS---TE 980
            EIE  TRT +ED+ RLES+LE LN ++  I S+G    + D      A  E  +    TE
Sbjct: 111  EIETLTRTQVEDSDRLESDLEELNCAIDLIVSEGSQNAKEDRQAVCPARGEDQVCPTHTE 170

Query: 981  NQGSSYA-HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 1157
            +Q      H+D+ FE+LEL+ +IEKNK+ L+SL DLD+  KR ++V +IED+L GLKVI+
Sbjct: 171  DQSDLIKIHEDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVID 230

Query: 1158 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 1337
            F+G C +LS++TY+PTLE    Q K+E   +P  V+HEL IE++DGTME+KN E+FP+D+
Sbjct: 231  FDGKCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDV 290

Query: 1338 FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 1514
             I +++DAAK+ RQ    L   E  SSL+W +R VQ +IILSTLR+ +VK AN SRH+FE
Sbjct: 291  HISDLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANKSRHFFE 350

Query: 1515 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 1694
            Y +RD  I AH+VGG+DAFIK  Q WP+ NS LK++SL  +S++HSKGISLSF C+VEE 
Sbjct: 351  YFERDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISL-KNSDHHSKGISLSFFCRVEEA 409

Query: 1695 VNSLDVKTRQNLSSFADAIEEILVQQMRSE 1784
             NSLDV  RQNLSSF D +E+IL++QMR E
Sbjct: 410  ANSLDVHIRQNLSSFVDGVEKILLEQMRVE 439


>ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus
            sinensis]
          Length = 444

 Score =  369 bits (946), Expect = 4e-99
 Identities = 198/387 (51%), Positives = 267/387 (68%), Gaps = 8/387 (2%)
 Frame = +3

Query: 648  SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSN 827
            SS+S  + K++  D ESK+K+I                AY+EH KEEL  VEAE+ + SN
Sbjct: 51   SSDSENLLKEYAHDFESKVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISN 110

Query: 828  EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEM---DASVEGFIS---TENQG 989
            EIE  TRT +ED+ RLES+LE LN ++  I S+   +       A  E  +    TE+Q 
Sbjct: 111  EIETLTRTQVEDSDRLESDLEELNCAIDLIVSENAKEDRQAVCPARGEDQVCPTHTEDQS 170

Query: 990  SSYA-HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEG 1166
                 H+D+ FE+LEL+ +IEKNK+ L+SL DLD+  KR ++V +IED+L GLKVI+F+G
Sbjct: 171  DLIKIHEDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDG 230

Query: 1167 NCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIG 1346
             C +LS++TY+PTLE    Q K+E   +P  V+HEL IE++DGTME+KN E+FP+D+ I 
Sbjct: 231  KCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHIS 290

Query: 1347 EIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSD 1523
            +++DAAK+ RQ    L   E  SSL+W +R VQ +IILSTLR+ +VK AN SRH+FEY +
Sbjct: 291  DLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANKSRHFFEYFE 350

Query: 1524 RDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNS 1703
            RD  I AH+VGG+DAFIK  Q WP+ NS LK++SL  +S++HSKGISLSF C+VEE  NS
Sbjct: 351  RDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISL-KNSDHHSKGISLSFFCRVEEAANS 409

Query: 1704 LDVKTRQNLSSFADAIEEILVQQMRSE 1784
            LDV  RQNLSSF D +E+IL++QMR E
Sbjct: 410  LDVHIRQNLSSFVDGVEKILLEQMRVE 436


>ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508713301|gb|EOY05198.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 432

 Score =  360 bits (923), Expect = 2e-96
 Identities = 191/390 (48%), Positives = 268/390 (68%), Gaps = 3/390 (0%)
 Frame = +3

Query: 537  MENSHSSERVDIETXXXXXXXXXXXXXXCNDFTELSSSS-ESVKIFKDFFLDLESKIKQI 713
            ME S SSE +D+ +                +  E  + S  S K+ KD  L  ESK+KQI
Sbjct: 5    MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVKQI 64

Query: 714  TXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEG 893
                             Y+ H KEELN VEAE+ + SNEIE  +R +IE++  LE NLEG
Sbjct: 65   IEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEG 124

Query: 894  LNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALS 1070
            L Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++++  FE++EL+ +IEKN + L 
Sbjct: 125  LKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNIILK 184

Query: 1071 SLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATD 1250
            SL DLD  FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E  ++
Sbjct: 185  SLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDISE 244

Query: 1251 PFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLEWV 1427
            P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ    L+V + +SSLEW 
Sbjct: 245  PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLEWF 304

Query: 1428 VRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNS 1607
            V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+  S
Sbjct: 305  VGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLSKS 364

Query: 1608 ALKLVSLNSSSENHSKGISLSFLCKVEELV 1697
             LKL+S+  SS++HS+GISLS LCK EE +
Sbjct: 365  PLKLLSI-KSSDHHSRGISLSLLCKAEEAI 393


>ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508713300|gb|EOY05197.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 392

 Score =  357 bits (917), Expect = 9e-96
 Identities = 190/389 (48%), Positives = 267/389 (68%), Gaps = 3/389 (0%)
 Frame = +3

Query: 537  MENSHSSERVDIETXXXXXXXXXXXXXXCNDFTELSSSS-ESVKIFKDFFLDLESKIKQI 713
            ME S SSE +D+ +                +  E  + S  S K+ KD  L  ESK+KQI
Sbjct: 5    MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVKQI 64

Query: 714  TXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEG 893
                             Y+ H KEELN VEAE+ + SNEIE  +R +IE++  LE NLEG
Sbjct: 65   IEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEG 124

Query: 894  LNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALS 1070
            L Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++++  FE++EL+ +IEKN + L 
Sbjct: 125  LKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNIILK 184

Query: 1071 SLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATD 1250
            SL DLD  FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E  ++
Sbjct: 185  SLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDISE 244

Query: 1251 PFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLEWV 1427
            P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ    L+V + +SSLEW 
Sbjct: 245  PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLEWF 304

Query: 1428 VRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNS 1607
            V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+  S
Sbjct: 305  VGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLSKS 364

Query: 1608 ALKLVSLNSSSENHSKGISLSFLCKVEEL 1694
             LKL+S+  SS++HS+GISLS LCK E +
Sbjct: 365  PLKLLSI-KSSDHHSRGISLSLLCKAERV 392


>ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa]
            gi|222847415|gb|EEE84962.1| hypothetical protein
            POPTR_0001s32530g [Populus trichocarpa]
          Length = 429

 Score =  344 bits (883), Expect = 8e-92
 Identities = 195/421 (46%), Positives = 276/421 (65%), Gaps = 5/421 (1%)
 Frame = +3

Query: 546  SHSSERVDIETXXXXXXXXXXXXXXCN--DFTELSSSSESVKIFKDFFLDLESKIKQITX 719
            S + E +++ T              CN   F+E++SS +S ++ KD    L SK+ Q   
Sbjct: 6    STTQESLNLNTIRSRINELEEIYRDCNADSFSEINSS-DSDELMKDSAQQLVSKVSQTVT 64

Query: 720  XXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLN 899
                          AY+ H KEEL+  EAE+ + SNEIE+  RT +ED++ LE++LE + 
Sbjct: 65   EYSDFSFLGIEDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELENDLEWMK 124

Query: 900  YSLQSISSQ-GLDKLEMDASVEGFISTENQGSSY-AHQDYNFELLELDHEIEKNKVALSS 1073
             SL  ISSQ   +K + D  +E F S ENQ +    +++  FE+L+LD++IE++   L S
Sbjct: 125  CSLDLISSQRDREKEKGDEQMEHFSSGENQSNLINTNEENKFEILKLDNQIEESTRILKS 184

Query: 1074 LHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDP 1253
            + DLD   K  +++ +IED L GLKVIEF+G CI+LSL+TY+P  + +L  QK+E    P
Sbjct: 185  MQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLRTYIPK-QDVLFLQKIEETNVP 243

Query: 1254 FAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVV 1430
            + ++HE  IE+ +G+ME+K  E+FP+DI+IG+I+DAAK+ RQ +  L++ E  SSLEW V
Sbjct: 244  YEINHEFLIEVTNGSMEIKKVEMFPNDIYIGDIVDAAKSFRQMFLHLALMETSSSLEWFV 303

Query: 1431 RKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSA 1610
            RK Q +II STLR+L+ + A+ SR   EY DRD  I AHMVGG+DAF+++ Q WP+ NS 
Sbjct: 304  RKAQDRIIQSTLRRLVARSASTSRQSIEYLDRDEIIVAHMVGGVDAFMEVSQGWPITNSP 363

Query: 1611 LKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQ 1790
            LKLVSL +S+ +H+K ISL FLCKVEE  NSLDV TRQNLSSF D++E+ILV+QM  E  
Sbjct: 364  LKLVSLKNSN-HHAKEISLGFLCKVEEAANSLDVHTRQNLSSFVDSVEKILVEQMHLELH 422

Query: 1791 S 1793
            S
Sbjct: 423  S 423


>ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica]
            gi|462422632|gb|EMJ26895.1| hypothetical protein
            PRUPE_ppa006350mg [Prunus persica]
          Length = 416

 Score =  342 bits (877), Expect = 4e-91
 Identities = 194/413 (46%), Positives = 269/413 (65%), Gaps = 2/413 (0%)
 Frame = +3

Query: 552  SSERVDIETXXXXXXXXXXXXXXC--NDFTELSSSSESVKIFKDFFLDLESKIKQITXXX 725
            SSE +D+ T              C  +D +ELS S +S  + ++  L L+S+++QI    
Sbjct: 8    SSEPLDLNTIQRQVRELEEIIESCRQDDASELSPS-DSDDLIRNCGLLLQSRVEQIVSEC 66

Query: 726  XXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYS 905
                        AYV   ++ELN VEAE+ + SN IE   RT+ ED  RL ++L  L  S
Sbjct: 67   SDVGLLEDQEFEAYVGRFEQELNSVEAESTKVSNGIEDLIRTHGEDFNRLGTDLAQLKCS 126

Query: 906  LQSISSQGLDKLEMDASVEGFISTENQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDL 1085
            L  +  + L+K ++ A V+     ++           FELLEL+++IEKN + L SL DL
Sbjct: 127  LDFVEEKDLEKAKLGADVDYHKCGKDLLDPMNVNADKFELLELENQIEKNNIILKSLQDL 186

Query: 1086 DYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVD 1265
            + T K +++  +IED + GLKVI FEGNC++LSL+TY+P LE L   +K+  AT+P  V+
Sbjct: 187  ECTLKWLDNTEQIEDAVTGLKVIAFEGNCVRLSLRTYIPKLEDLFSPKKVGDATEPSEVN 246

Query: 1266 HELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCPLSVSEPRSSLEWVVRKVQH 1445
            HEL IEL++GTM L+N EIFP+D++I +I+DAAK+LR          +SSL+W V KVQ 
Sbjct: 247  HELLIELLEGTMGLRNVEIFPNDVYINDILDAAKSLR----------KSSLQWFVTKVQD 296

Query: 1446 QIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVS 1625
            +I+L T+R+L+VK+ N SRH  EY D+D T+ AH+VGG+DAFIK+PQ WP+L+S LKL+ 
Sbjct: 297  RIVLCTMRRLVVKNENKSRHSLEYLDKDETVVAHVVGGVDAFIKVPQGWPLLSSPLKLIY 356

Query: 1626 LNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSE 1784
            L  SS+ HSKGISLSFLC V+EL NSL V+ RQ LSSF DAIE+ILV+QM SE
Sbjct: 357  L-KSSDQHSKGISLSFLCTVQELANSLAVRIRQTLSSFVDAIEKILVEQMCSE 408


>ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella]
            gi|482566470|gb|EOA30659.1| hypothetical protein
            CARUB_v10013795mg [Capsella rubella]
          Length = 420

 Score =  335 bits (858), Expect = 6e-89
 Identities = 182/389 (46%), Positives = 257/389 (66%), Gaps = 1/389 (0%)
 Frame = +3

Query: 642  SSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRT 821
            S +S+S  + +DF L  E+K+ +I                AY+E+ ++EL+ VEAE+ + 
Sbjct: 36   SCTSDSENLVQDFVLQFETKVNEIVEDYSDVDILDVEDSDAYLEYLRKELHSVEAESAKV 95

Query: 822  SNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSSYA 1001
            S EIE  +R++ ED++RLE +LEGL  SL S+SSQ ++K     S E   S  +      
Sbjct: 96   SEEIERLSRSHAEDSSRLERDLEGLLLSLDSMSSQDVNK-----SKESPPSCSSMEVCEV 150

Query: 1002 HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKL 1181
            + D  F++ EL++++E+ ++ L SL DLD   KR ++  ++ED L GLKV+EF+GN I+L
Sbjct: 151  NDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRL 210

Query: 1182 SLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDA 1361
             L+TY+P L+GL  Q K E+ T P  + HEL I L D T E+   E+FP+D++IG+II+A
Sbjct: 211  QLRTYIPELDGLPAQHKFEHTTKPSELIHELLIYLKDKTTEITKLEMFPNDVYIGDIIEA 270

Query: 1362 AKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATI 1538
            A + RQ     +V + RSS++WVV KVQ +II +TLR+ +V  +   RH F+Y D+D TI
Sbjct: 271  ADSFRQVRLHSAVLDTRSSVQWVVAKVQDRIITTTLRKYIVTSSKTMRHTFKYYDKDETI 330

Query: 1539 TAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKT 1718
             AH+ GGIDAF+K+   WP+LNS LKL SL  +S+N SKGISLS +CKVEEL NSLD++T
Sbjct: 331  VAHIAGGIDAFLKVSDGWPLLNSPLKLASL-KNSDNQSKGISLSLICKVEELANSLDLQT 389

Query: 1719 RQNLSSFADAIEEILVQQMRSERQSGHIS 1805
            RQNLS F DAIE+ILV Q R E QS   S
Sbjct: 390  RQNLSGFIDAIEKILVHQTREELQSNDSS 418


>ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp.
            lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein
            ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  334 bits (857), Expect = 8e-89
 Identities = 186/395 (47%), Positives = 259/395 (65%), Gaps = 7/395 (1%)
 Frame = +3

Query: 621  CNDFTELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLV 800
            C D    S SS+S  + +DF L  E K+K+I                AY+E+ ++EL  V
Sbjct: 29   CRDEPGESCSSDSETLVQDFVLQFEPKVKEIVEDYSDVDLLDVEDSDAYLEYLRKELQSV 88

Query: 801  EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 980
            EAE+ + S EIE  ++++ +D++RLE +LEGL  SL S+SSQ ++K           S E
Sbjct: 89   EAESAKVSEEIERLSKSHAQDSSRLERDLEGLLLSLDSMSSQDVEK-----------SKE 137

Query: 981  NQGSSYA------HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMG 1142
            NQ SS +      + D  F++ EL++++E+ +  L SL DLD   KR ++  ++ED L G
Sbjct: 138  NQPSSSSMEVCEVNDDDKFKMFELENQMEEKRSILKSLEDLDSLRKRFDAAEQVEDALTG 197

Query: 1143 LKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEI 1322
            LKV+EF+GN I+L L+TY+P L+ LL QQK E+ T+P  + HEL I L D T E+   E+
Sbjct: 198  LKVLEFDGNFIRLQLQTYIPKLDSLLGQQKFEHTTEPSELIHELLIYLKDKTTEITKFEM 257

Query: 1323 FPHDIFIGEIIDAAKALRQYCPLS-VSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNS 1499
            FP+D++IG+II+AA + RQ    S V + RSS++WVV KVQ +II STLR+ LV  +   
Sbjct: 258  FPNDVYIGDIIEAADSFRQVSLHSAVLDTRSSVQWVVAKVQDRIISSTLRKYLVTSSKTI 317

Query: 1500 RHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLC 1679
            RH FEY ++D TI  H+ GGIDAF+K+   WP+LN+ LKL SL  +S+N SKGISLS +C
Sbjct: 318  RHTFEYYEKDETIVGHIAGGIDAFLKVSNGWPLLNTPLKLESL-KNSDNQSKGISLSLIC 376

Query: 1680 KVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSE 1784
            KVE+L NSLD++TRQNLS F DAIE+ILVQQ R E
Sbjct: 377  KVEDLANSLDLQTRQNLSGFMDAIEKILVQQTREE 411


>ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana]
            gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis
            thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein
            [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1|
            putative HAPp48,5 protein [Arabidopsis thaliana]
            gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein
            [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1|
            uncharacterized protein AT3G23910 [Arabidopsis thaliana]
          Length = 421

 Score =  325 bits (834), Expect = 4e-86
 Identities = 181/397 (45%), Positives = 257/397 (64%), Gaps = 7/397 (1%)
 Frame = +3

Query: 636  ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENL 815
            E  SS     + +DF L  E K+K+I                AY+E+ + EL  VEAE+ 
Sbjct: 35   ESCSSDYETLVVQDFVLQFEPKVKEIVEEYGDVDLLDVEDSDAYLEYLRNELQSVEAESA 94

Query: 816  RTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSS 995
            + S EIE  ++++ +D++RL+ +LEGL  SL S+SSQ ++K           S ENQ SS
Sbjct: 95   KVSEEIERLSQSHAQDSSRLQRDLEGLLLSLDSMSSQDVEK-----------SKENQPSS 143

Query: 996  YAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 1157
             + +      D  F++ EL++++E+ ++ L SL DLD   KR ++  ++ED L GLKV+E
Sbjct: 144  SSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLE 203

Query: 1158 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 1337
            F+GN I+L L+TY+  L+G L Q K ++ T+P  + HEL I L D T E+   E+FP+DI
Sbjct: 204  FDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDI 263

Query: 1338 FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 1514
            +IG+II+AA + RQ     +V + RSS++WVV KVQ +II +TLR+ +V  +   R+ FE
Sbjct: 264  YIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKYIVMSSKTIRYTFE 323

Query: 1515 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 1694
            Y D+D TI AH+ GGIDAF+K+   WP+LN+ LKL SL  +S+N SKGISLS +CKVEEL
Sbjct: 324  YYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGISLSLICKVEEL 382

Query: 1695 VNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 1805
             NSLD++TRQNLS F DAIE+ILV+Q R E QS   S
Sbjct: 383  ANSLDLETRQNLSGFMDAIEKILVEQTREELQSNKSS 419


>ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244321 [Solanum
            lycopersicum]
          Length = 415

 Score =  325 bits (834), Expect = 4e-86
 Identities = 179/376 (47%), Positives = 253/376 (67%), Gaps = 2/376 (0%)
 Frame = +3

Query: 654  ESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEI 833
            E  K  +D  L  ESK++Q+                 +  + K EL+  EA+N + ++EI
Sbjct: 35   ELKKSLEDCTLQFESKVEQLLCDASEVNFSSDQDLDEFWNYLKNELSTEEAKNAKIADEI 94

Query: 834  EVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQD 1010
            E  +R Y+E  ++L + +EGL+  L+ I S G+++     +       E++G+ S A  +
Sbjct: 95   EGLSREYVEGYSKLVNEVEGLSCLLELIESLGIEQGRALTNFPCSTPGEDKGNLSSAPVE 154

Query: 1011 YNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLK 1190
            +NF++ EL +++EK+K+ L SL +L+ TF R E++ KIED   GLK+++FEGN I+LSL+
Sbjct: 155  HNFKIFELGNQLEKSKLNLESLEELESTFNRFEAIEKIEDAFSGLKIVQFEGNRIRLSLR 214

Query: 1191 TYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKA 1370
            T++P LE LL  Q +  A  P   +HEL IEL+DGTMELK+ EIFP+D+ I EI D AK+
Sbjct: 215  TFIPNLENLLHNQTIGVAEPP-EQNHELLIELVDGTMELKHVEIFPNDVSISEITDTAKS 273

Query: 1371 LRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAH 1547
            LRQ Y P+ V E RSSLEW+V++VQ +IILSTLR+ LVK AN+SRH F+Y +R+ TI AH
Sbjct: 274  LRQVYFPVGVLENRSSLEWLVKRVQDRIILSTLRRFLVKSANSSRHSFDYVEREETIVAH 333

Query: 1548 MVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQN 1727
            MVGGIDAF+K+PQ WP+  S L L+SL SSS+ +S+ ISL+ LCKV E  NSLD   RQ 
Sbjct: 334  MVGGIDAFVKLPQGWPLTCSGLTLMSLKSSSQ-YSQQISLTLLCKVAEAANSLDTNARQT 392

Query: 1728 LSSFADAIEEILVQQM 1775
            +S F D +EEIL+QQM
Sbjct: 393  ISGFTDRVEEILMQQM 408


>ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592291 [Solanum tuberosum]
          Length = 428

 Score =  319 bits (818), Expect = 3e-84
 Identities = 173/337 (51%), Positives = 239/337 (70%), Gaps = 2/337 (0%)
 Frame = +3

Query: 771  EHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMD 950
            ++ K EL+  EA N + ++EIE  +R Y+E  ++L + +EGL+  L+ I S GL++  + 
Sbjct: 87   KYLKNELSTEEANNAKIADEIEGLSREYVEGYSKLVNEIEGLSCPLELIESLGLEQGRVL 146

Query: 951  ASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIE 1127
             +       E++G+ S A  + NF++ EL +++EK+K+ L SL +L+ TF R E++ KIE
Sbjct: 147  TNFPCSTPGEDKGNVSSAPVEQNFKVFELGNQLEKSKLNLKSLEELESTFNRFEAIEKIE 206

Query: 1128 DTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMEL 1307
            D   GLK++EFEGN I+LSL+T++P LE LL  Q ++ A  P   +HEL IELMDGTMEL
Sbjct: 207  DAFSGLKIVEFEGNRIRLSLRTFIPNLENLLHNQTIDVAEPP-EQNHELLIELMDGTMEL 265

Query: 1308 KNAEIFPHDIFIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVK 1484
            K+ EIFP+D+ I  I D AK+LRQ Y P+ V E RSSLEW V+ VQ +I+LSTLR+ LVK
Sbjct: 266  KHVEIFPNDVSISYITDTAKSLRQVYFPVGVLENRSSLEWFVKGVQDRIVLSTLRRFLVK 325

Query: 1485 DANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGIS 1664
             AN+SRH F+Y DR+ TI AHMVGGIDAFIK+PQ WP+ +S L L+SL SSS+ +S+ IS
Sbjct: 326  SANSSRHSFDYVDREETIVAHMVGGIDAFIKLPQGWPLTSSGLTLMSLKSSSQ-YSQQIS 384

Query: 1665 LSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQM 1775
            L+ LCKV E+ N LD   RQ +S F D +EEIL+QQM
Sbjct: 385  LTLLCKVAEVANLLDTNERQTISGFTDRVEEILMQQM 421


>ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590656431|ref|XP_007034269.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508713297|gb|EOY05194.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508713298|gb|EOY05195.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 369

 Score =  319 bits (818), Expect = 3e-84
 Identities = 170/353 (48%), Positives = 240/353 (67%), Gaps = 3/353 (0%)
 Frame = +3

Query: 537  MENSHSSERVDIETXXXXXXXXXXXXXXCNDFTELSSSS-ESVKIFKDFFLDLESKIKQI 713
            ME S SSE +D+ +                +  E  + S  S K+ KD  L  ESK+KQI
Sbjct: 5    MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVKQI 64

Query: 714  TXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEG 893
                             Y+ H KEELN VEAE+ + SNEIE  +R +IE++  LE NLEG
Sbjct: 65   IEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEG 124

Query: 894  LNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALS 1070
            L Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++++  FE++EL+ +IEKN + L 
Sbjct: 125  LKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNIILK 184

Query: 1071 SLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATD 1250
            SL DLD  FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E  ++
Sbjct: 185  SLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDISE 244

Query: 1251 PFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLEWV 1427
            P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ    L+V + +SSLEW 
Sbjct: 245  PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLEWF 304

Query: 1428 VRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQ 1586
            V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI AH+VGGIDAFIK+ Q
Sbjct: 305  VGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQ 357


>ref|XP_007034273.1| Uncharacterized protein isoform 7, partial [Theobroma cacao]
            gi|508713302|gb|EOY05199.1| Uncharacterized protein
            isoform 7, partial [Theobroma cacao]
          Length = 343

 Score =  318 bits (816), Expect = 4e-84
 Identities = 162/315 (51%), Positives = 227/315 (72%), Gaps = 2/315 (0%)
 Frame = +3

Query: 648  SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSN 827
            S  S K+ KD  L  ESK+KQI                 Y+ H KEELN VEAE+ + SN
Sbjct: 17   SLNSEKLLKDCSLHFESKVKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISN 76

Query: 828  EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAH 1004
            EIE  +R +IE++  LE NLEGL Y+L SI+SQG++ +E D  ++  ++ E+Q +  +++
Sbjct: 77   EIEDLSRNHIEESNILEGNLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSN 136

Query: 1005 QDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLS 1184
            ++  FE++EL+ +IEKN + L SL DLD  FKR++++ +IED L GLKVI F+GNCI+LS
Sbjct: 137  EEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLS 196

Query: 1185 LKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAA 1364
            L+TY+P LEGLLCQ+ +E  ++P  ++HEL +E++DGTME+KN E+FP+D+++G+IIDAA
Sbjct: 197  LQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAA 256

Query: 1365 KALRQYCP-LSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATIT 1541
            K+ RQ    L+V + +SSLEW V KVQ +IILSTLR+ +VK  N SRH FEY +RD TI 
Sbjct: 257  KSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIV 316

Query: 1542 AHMVGGIDAFIKIPQ 1586
            AH+VGGIDAFIK+ Q
Sbjct: 317  AHLVGGIDAFIKLSQ 331


>dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]
          Length = 421

 Score =  315 bits (806), Expect = 6e-83
 Identities = 177/397 (44%), Positives = 253/397 (63%), Gaps = 7/397 (1%)
 Frame = +3

Query: 636  ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENL 815
            E  SS     + +DF L  E K+K+I                AY+E+ + EL  VEAE+ 
Sbjct: 35   ESCSSDYETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDQTDAYLEYLRNELQSVEAESA 94

Query: 816  RTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSS 995
            + S EIE  ++++  D++RL+ +LEGL  SL S+SSQ ++K           S ENQ SS
Sbjct: 95   KVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQDVEK-----------SKENQPSS 143

Query: 996  YAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 1157
             + +      D  F++ EL++++E+ ++ L SL DLD   KR ++  ++ED L GLKV+E
Sbjct: 144  SSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLE 203

Query: 1158 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 1337
            F+GN I+L L+TY+  L+G L Q K ++ T+P  + HEL I L D T E+   E+FP+DI
Sbjct: 204  FDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDI 263

Query: 1338 FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 1514
            +IG+II+AA + RQ     +V + RSS++WVV KVQ +II +TLR+  V  +   R+ FE
Sbjct: 264  YIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKDFVMSSKTIRYTFE 323

Query: 1515 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 1694
            Y D+D TI AH+ GGIDAF+K+   WP+LN+ LKL SL  +S+N SKG SLS + K+EEL
Sbjct: 324  YYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGFSLSLISKLEEL 382

Query: 1695 VNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 1805
             NSLD++TRQNLS F DA+E+ILVQQ R E +S   S
Sbjct: 383  ANSLDLETRQNLSGFMDAVEKILVQQTREELKSNESS 419


>ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein
            [Arabidopsis thaliana] gi|332643360|gb|AEE76881.1|
            RNA-directed DNA polymerase (reverse
            transcriptase)-related protein [Arabidopsis thaliana]
          Length = 428

 Score =  308 bits (788), Expect = 8e-81
 Identities = 177/404 (43%), Positives = 253/404 (62%), Gaps = 14/404 (3%)
 Frame = +3

Query: 636  ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXX-------AYVEHAKEELN 794
            E  SS     + +DF L  E K+K+I                       AY+E+ + EL 
Sbjct: 35   ESCSSDYETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDHTLVDGNLTDAYLEYLRNELQ 94

Query: 795  LVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFIS 974
             VEAE+ + S EIE  ++++  D++RL+ +LEGL  SL S+SSQ ++K           S
Sbjct: 95   SVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQDVEK-----------S 143

Query: 975  TENQGSSYAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTL 1136
             ENQ SS + +      D  F++ EL++++E+ ++ L SL DLD   KR ++  ++ED L
Sbjct: 144  KENQPSSSSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDAL 203

Query: 1137 MGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNA 1316
             GLKV+EF+GN I+L L+TY+  L+G L Q K ++ T+P  + HEL I L D T E+   
Sbjct: 204  TGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKF 263

Query: 1317 EIFPHDIFIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDAN 1493
            E+FP+DI+IG+II+AA + RQ     +V + RSS++WVV KVQ +II +TLR+  V  + 
Sbjct: 264  EMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKDFVMSSK 323

Query: 1494 NSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSF 1673
              R+ FEY D+D TI AH+ GGIDAF+K+   WP+LN+ LKL SL  +S+N SKG SLS 
Sbjct: 324  TIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGFSLSL 382

Query: 1674 LCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 1805
            + K+EEL NSLD++TRQNLS F DA+E+ILVQQ R E +S   S
Sbjct: 383  ISKLEELANSLDLETRQNLSGFMDAVEKILVQQTREELKSNESS 426


>ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum]
            gi|557096755|gb|ESQ37263.1| hypothetical protein
            EUTSA_v10002763mg, partial [Eutrema salsugineum]
          Length = 355

 Score =  305 bits (781), Expect = 5e-80
 Identities = 164/346 (47%), Positives = 235/346 (67%), Gaps = 2/346 (0%)
 Frame = +3

Query: 762  AYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKL 941
            AY+E+ ++EL+ VEAE+ + S EIE  + ++ ED++RL+ +LEGL  SL  +SSQ + K 
Sbjct: 6    AYLEYLRKELHSVEAESAKVSEEIERLSSSHAEDSSRLDRDLEGLLLSLDFLSSQEVQKS 65

Query: 942  -EMDASVEGFISTENQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVG 1118
             E   S       +       + D  F++ EL+++IE+ +  L SL +LD   KR ++  
Sbjct: 66   KENPPSTSSMERCDASTWIDVNDDEKFKMFELENQIEEKRRILKSLENLDSVCKRFDAAE 125

Query: 1119 KIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGT 1298
            ++ED L GLKV+EF+GN I+L L+TY+P L+GLL Q K+ + T+P  + HEL I+L D T
Sbjct: 126  QVEDALTGLKVLEFDGNFIRLQLRTYIPKLDGLLGQHKLLHNTEPSELIHELLIDLKDKT 185

Query: 1299 MELKNAEIFPHDIFIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQL 1475
             E+   E+ P+D++IG+I DAA + RQ     ++ + RSSL+W+V KVQ +II + LR+ 
Sbjct: 186  TEITKVEMLPNDVYIGDITDAADSFRQIRLHSALLDTRSSLQWLVAKVQERIITTNLRKH 245

Query: 1476 LVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSK 1655
            +VK +   RH FEY D+D TI AH+ GGIDAF+K+   WP+L++ LKL SL  +S+N S 
Sbjct: 246  IVKSSKTIRHTFEYYDKDETIVAHITGGIDAFLKVSVGWPLLSTPLKLTSL-KNSDNQSN 304

Query: 1656 GISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQS 1793
            GISLS +CKVEEL NSLD++TRQNLS F DAIE+ILVQQ R E  S
Sbjct: 305  GISLSLICKVEELANSLDLQTRQNLSGFMDAIEKILVQQTREELHS 350

