BLASTX nr result

ID: Mentha24_contig00032003 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00032003
         (1915 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU37261.1| hypothetical protein MIMGU_mgv1a001119mg [Mimulus...   487   e-135
ref|XP_002278233.1| PREDICTED: uncharacterized protein LOC100264...   298   4e-78
ref|XP_006344743.1| PREDICTED: uncharacterized protein LOC102600...   297   1e-77
emb|CAN73069.1| hypothetical protein VITISV_005845 [Vitis vinifera]   295   4e-77
ref|XP_004230289.1| PREDICTED: uncharacterized protein LOC101268...   294   8e-77
emb|CBI39573.3| unnamed protein product [Vitis vinifera]              290   2e-75
ref|XP_007031123.1| Uncharacterized protein isoform 6, partial [...   231   1e-57
ref|XP_007031122.1| Uncharacterized protein isoform 5 [Theobroma...   231   1e-57
ref|XP_007031121.1| Uncharacterized protein isoform 4 [Theobroma...   231   1e-57
ref|XP_007031120.1| Uncharacterized protein isoform 3, partial [...   231   1e-57
ref|XP_007031119.1| Uncharacterized protein isoform 2, partial [...   231   1e-57
ref|XP_007031118.1| Uncharacterized protein isoform 1 [Theobroma...   231   1e-57
ref|XP_007207147.1| hypothetical protein PRUPE_ppa001266mg [Prun...   226   4e-56
ref|XP_002314925.1| hypothetical protein POPTR_0010s15080g [Popu...   216   2e-53
ref|XP_002512492.1| conserved hypothetical protein [Ricinus comm...   215   5e-53
ref|XP_006382417.1| hypothetical protein POPTR_0005s01960g [Popu...   214   9e-53
ref|XP_004304870.1| PREDICTED: uncharacterized protein LOC101302...   213   2e-52
ref|XP_007035732.1| Uncharacterized protein TCM_021314 [Theobrom...   213   3e-52
ref|XP_007226468.1| hypothetical protein PRUPE_ppa1027230mg [Pru...   211   9e-52
ref|XP_006433512.1| hypothetical protein CICLE_v10000207mg [Citr...   206   3e-50

>gb|EYU37261.1| hypothetical protein MIMGU_mgv1a001119mg [Mimulus guttatus]
          Length = 883

 Score =  487 bits (1253), Expect = e-135
 Identities = 322/655 (49%), Positives = 401/655 (61%), Gaps = 17/655 (2%)
 Frame = -1

Query: 1915 VIFESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSP 1736
            VIFESVRNKLDGSSRNPL+LRLQK Q++ LERFQRE LPPKSAK ISV+H+RLLSPIKSP
Sbjct: 139  VIFESVRNKLDGSSRNPLELRLQKMQSQHLERFQREALPPKSAKSISVTHHRLLSPIKSP 198

Query: 1735 GFIPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSG 1556
            GF+PPKN            EQSPR+TSK N+PS GSPSVPFRVRDLK+K+ESA       
Sbjct: 199  GFVPPKNAAYIIEAASKIMEQSPRSTSKANFPSLGSPSVPFRVRDLKQKIESA------- 251

Query: 1555 VASQKGREENSQSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKPV-SLAVQA 1379
               QKGRE+NS++ K+QL++RGKGR+ D YLY+                 +   SLAVQA
Sbjct: 252  ---QKGREQNSKNTKRQLDSRGKGRLADSYLYQSSEESKSIGSSQRIKNREKSGSLAVQA 308

Query: 1378 RSNIQRKDGPXXXXXXXSEKQKE----HNGFVTRDPPNAQKKVEKPSSSRR-PSEVLRMN 1214
            ++NIQRKDG        SEKQ+E     +GF ++D  N QKKVEK SSSRR  SEVLR+N
Sbjct: 309  KTNIQRKDGFASVGNRSSEKQRELSDVKHGFASKDLTNTQKKVEKKSSSRRASSEVLRLN 368

Query: 1213 NQKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFL 1034
            NQKQN  S   D N  PSCS+ KE++E NLS NY+NGR              TSR+TNF+
Sbjct: 369  NQKQNCVSEGYDENPGPSCSKLKERKEPNLSNNYVNGRTNRTVNKIVIGDVATSRRTNFV 428

Query: 1033 AADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAF--EVDSKW 860
            AADPGKEV   R KT +KK+L  +G+I        KAM VKDEKS+K NVAF  + +S+W
Sbjct: 429  AADPGKEVPLSRPKTNAKKKLSIDGSI------THKAMMVKDEKSVKRNVAFAGDAESEW 482

Query: 859  DGIDKKXXXXXXXXXXXSPIKKHVGSNSSA-AMLEATXXXXXXXXXSARESDLRNSAVSS 683
            DG DKK           SPIKK   S+SS   +LEA              S+LR+S   S
Sbjct: 483  DGNDKKSSLDVVSFTFTSPIKKSGASSSSCNTILEANSSSFTNSDPCVHGSELRDSGSYS 542

Query: 682  SGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNMGSAVNLVESMD 503
            S FNVIGGD LS+LLEQKLKEL+S++E SQKD                  +A     ++ 
Sbjct: 543  SRFNVIGGDTLSLLLEQKLKELSSKIELSQKDVSE--------------SAASCSSSAIS 588

Query: 502  NGVCESEREAQDGSDCASTEKLWLRADKSSKVPECF-EGVVDGNN---IQRYXXXXXXXX 335
            N + +++ E ++    AS +K+ L+A+K +K  E   EG  D N+    QRY        
Sbjct: 589  NSILKTKEEIKN----ASIDKILLKAEKENKEVEYIEEGDGDDNSNIEYQRY-LHLLGSL 643

Query: 334  XXXXXXXXXXSCDSFDVDRSASNEGPTGCLSLDSCEGTNWRATRKPHVTEGDQEISDTAS 155
                      S DSFD+DR++ NEG    LS++S EG NW +TR       + E+SDTA 
Sbjct: 644  SASNQPLSRTSSDSFDLDRNSCNEGRLPYLSVESYEGINW-STR-------NVEVSDTA- 694

Query: 154  SLSFGT-MSETVS--STLHMADSKD-SPNWEVKYIEDILRHTELLLEDFSLGQAH 2
              S GT +SETV+  S L++ DS+D S NWE+ YI DIL   EL  E+F+LGQAH
Sbjct: 695  --SVGTVLSETVTSGSLLYLMDSRDSSSNWELLYIRDILSSAELFSEEFALGQAH 747


>ref|XP_002278233.1| PREDICTED: uncharacterized protein LOC100264914 [Vitis vinifera]
          Length = 919

 Score =  298 bits (764), Expect = 4e-78
 Identities = 220/654 (33%), Positives = 312/654 (47%), Gaps = 18/654 (2%)
 Frame = -1

Query: 1909 FESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGF 1730
            + ++ NKL+G   +P++ R ++ Q RP+ERFQ E+LPPKSAK I  +H++LLSPIKSPGF
Sbjct: 137  YNNMPNKLEGDRVSPVESRPRRVQRRPIERFQTEMLPPKSAKSIPFTHHKLLSPIKSPGF 196

Query: 1729 IPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVA 1550
            IP KN            E  P AT K   PS GS SVP R+RDLKEKME+AQ+SS+    
Sbjct: 197  IPTKNATYVMEAAAKIIEPGPHATPKRKVPSVGSSSVPLRIRDLKEKMEAAQKSSR---L 253

Query: 1549 SQKGREENSQSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKPVSLAVQARSN 1370
             +  +  + + +  Q+N +     ED                      K VSLA QA+ N
Sbjct: 254  QRPKQSTDVKHMNGQINGKRFNGSEDTPSLNNSKDLVKRNSDSMKKKGKSVSLAEQAKVN 313

Query: 1369 IQRKDGPXXXXXXXSEKQKEH----NGFVTRDPPNAQKKVEKPSSSRRPSEVLRMNNQKQ 1202
            IQRK+GP           KEH    +G  ++  P+ QK + K +S+ R S  L+ NNQKQ
Sbjct: 314  IQRKEGPSSSNRSSM-NPKEHTEVKSGQSSKSQPSMQKNMLKRTSTNRTSNALKQNNQKQ 372

Query: 1201 NRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFLAADP 1022
            N  S RD    + + S  K K+  +++ ++   +               S+K   +A D 
Sbjct: 373  NGGSTRDVLTSKTAVSNQKSKKAPSVNGSFGPSK---TVNKVVINTEAGSKKMGSVANDI 429

Query: 1021 GKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDGIDKK 842
             KE S  + K  S+K+L  +GNI   G +A   +  KD KSIKCNVA E  + W G + K
Sbjct: 430  RKESSLSKTKNASRKKLSVDGNICFEGSIADGVLTNKDVKSIKCNVAVEGGTDWGGDNIK 489

Query: 841  XXXXXXXXXXXSPIKKHV-GSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGFNVI 665
                       SP+KK + GS SS  ++EA                 +NS++SS G NVI
Sbjct: 490  KGMDVVSFTFTSPMKKPIPGSMSSDQVMEAKYQFNIDSNDENDAHGSKNSSISSLGPNVI 549

Query: 664  GGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNMGSAVNLVESMDN----- 500
            G D+L VLLEQKL+ELT RV  S  D               +    VN+V          
Sbjct: 550  GADSLGVLLEQKLRELTFRVGSSHSDLFAPGTAASSTSRLQDSDLRVNVVAPTSTKHTSR 609

Query: 499  ---GVCESEREAQDGSDCASTEKLWLRADKSSKVPECFEGVVDGNNIQR----YXXXXXX 341
                + E + +     D +S   L         V E  E +   +N              
Sbjct: 610  LLPDLHEDKSDGPHYFDFSSVGGLQANQKWQVHVSEGMEELSGNSNNNEMGNGLSGQHPS 669

Query: 340  XXXXXXXXXXXXSCDSFDVDRSASNEGPTGCLSLDSCEGTNWRATRKPHVTEGDQEISDT 161
                        +C+S D   S S  G   C   ++ E  +W +  K  + EG+ E+SD+
Sbjct: 670  PVLSLESSFSNITCNSPDSRNSYSVNGSEQCSLAETDEVDSWTSRSKSQLAEGEAELSDS 729

Query: 160  ASSLSFGTMS-ETVSSTLHMADSKDSPNWEVKYIEDILRHTELLLEDFSLGQAH 2
            ASS+S   M+   ++ST H+ D K+S NWE++Y+ +IL   EL LEDF+ G  H
Sbjct: 730  ASSVSILHMNPRNMASTSHLTDFKESVNWELEYMREILCKAELTLEDFASGHTH 783


>ref|XP_006344743.1| PREDICTED: uncharacterized protein LOC102600562 isoform X1 [Solanum
            tuberosum] gi|565355747|ref|XP_006344744.1| PREDICTED:
            uncharacterized protein LOC102600562 isoform X2 [Solanum
            tuberosum]
          Length = 907

 Score =  297 bits (761), Expect = 1e-77
 Identities = 222/645 (34%), Positives = 315/645 (48%), Gaps = 9/645 (1%)
 Frame = -1

Query: 1915 VIFESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSP 1736
            ++  ++RNKLDG  RNP+++ LQK Q+RP+ERFQ EVLPPKSAKPI+V+  RLLSPIKSP
Sbjct: 137  IVDGNMRNKLDGFKRNPVEVGLQKVQSRPIERFQSEVLPPKSAKPIAVTQPRLLSPIKSP 196

Query: 1735 GFIPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSG 1556
            GFIPPKN            +QSPR  ++    S GS S P R+RDL++++E+ QR S   
Sbjct: 197  GFIPPKNAAYIIEAAAKIYQQSPRPAAREKVQSSGSSSAPLRIRDLRDQIEAVQRQSSIY 256

Query: 1555 VASQKGREENS-QSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKPVSLAVQA 1379
             A  + +E+NS +++++Q   R + +  D                      K VSLAVQA
Sbjct: 257  EALHRPKEQNSVKNVRRQPCERVQVQ-SDNMRQLRVSEVSRRDISQNKGKEKSVSLAVQA 315

Query: 1378 RSNIQRKDGPXXXXXXXSEKQKEHNGFVTRDPPNAQKKVEKPSSSRRPSEVLRMNNQKQN 1199
            ++NIQ+++G           QKE N   +     + K  E+ +S  RPS+VLR NNQKQN
Sbjct: 316  KTNIQKREGKESTSSKNPSNQKEQNESKSGRRRPSVKGGERKNSLNRPSDVLRQNNQKQN 375

Query: 1198 RASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFLAADPG 1019
             AS +D  + + S    KEK+   LS+     R               +   + +  D G
Sbjct: 376  SASNKDGESSKTSAPYQKEKK---LSSTGNMSRSTKTVSRIVVNTTTATGIASIVETDVG 432

Query: 1018 KEVSSLRAKTTSK---KRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDGID 848
            K++SS R    S    K+   N +I S G  A   M  KDE+SIKCN+A E  S W+  D
Sbjct: 433  KDLSSSRDSRVSSFTGKKQSVNVDIGSDGCGADNMMKSKDERSIKCNLAIEGCSNWETAD 492

Query: 847  KKXXXXXXXXXXXSPIKKHV-GSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGFN 671
            +K           SPIKK + G  SS+ +LE              +SD R S + S    
Sbjct: 493  RKNGSDVVSFTFTSPIKKSMTGPTSSSHVLEKNNALCLFPGSYDDQSDSRTSTMPSF--- 549

Query: 670  VIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNMGSAVNLVESMDNGVC 491
             IGGD L +LLEQK+KELTS+V  S +D                   +V++V        
Sbjct: 550  PIGGDDLGILLEQKIKELTSKVRPSCED---FIKTGTASISASTFEDSVSIVAHGRRPQV 606

Query: 490  ESEREAQDGSDCASTEKLWLRADKSSKVPECFEGVVDGNNIQ---RYXXXXXXXXXXXXX 320
            +   E       +S + L L A +  + P   E     +       +             
Sbjct: 607  DLLNEKAGDHGHSSVDDLRLTATQMWQGPNRVENPKTASRFTCEGEFSLPCTSLASSMEP 666

Query: 319  XXXXXSCDSFDVDRSASNEGPTGCLSLDSCEGTNWRATRKPHVTEGDQEISDTASSLSFG 140
                 SC+S D  RS + +G    LS  S E  NW+   + H  EGD E+ D+ASS+S  
Sbjct: 667  SISGGSCNSLDSYRSLATDGSKYHLSDGSHEMMNWKTYMRTHFVEGDAELLDSASSVSLA 726

Query: 139  TMSETVS-STLHMADSKDSPNWEVKYIEDILRHTELLLEDFSLGQ 8
               E  S +T    +  +SP WE  YI DI+R ++L++E+F LG+
Sbjct: 727  DAGEKDSTATSTSTNFNESPYWEFNYIRDIIRSSDLVMEEFLLGE 771


>emb|CAN73069.1| hypothetical protein VITISV_005845 [Vitis vinifera]
          Length = 1640

 Score =  295 bits (756), Expect = 4e-77
 Identities = 221/651 (33%), Positives = 308/651 (47%), Gaps = 15/651 (2%)
 Frame = -1

Query: 1909 FESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGF 1730
            + ++ NKL+G   +P++ R ++ Q RP+ERFQ E+LPPKSAK I  +H++LLSPIKSPGF
Sbjct: 889  YNNMPNKLEGDRVSPVESRPRRVQRRPIERFQTEMLPPKSAKSIPFTHHKLLSPIKSPGF 948

Query: 1729 IPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVA 1550
            IP KN            E  P AT K   PS GS SVP R+RDLKEKME+AQ+S      
Sbjct: 949  IPTKNATYVMEAAAKIIEPGPHATPKRKVPSVGSSSVPLRIRDLKEKMEAAQKS------ 1002

Query: 1549 SQKGREENSQSIKK---QLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKPVSLAVQA 1379
            S+  R + S  +K+   Q+N +     ED                      K VSLA QA
Sbjct: 1003 SRLQRPKQSTDVKRMNGQINGKRFNGSEDTPSLNNSKDLVKRNSDSMKKKGKSVSLAEQA 1062

Query: 1378 RSNIQRKDGPXXXXXXXSEKQKEH----NGFVTRDPPNAQKKVEKPSSSRRPSEVLRMNN 1211
            + NIQRK+GP           KEH    +G  ++  P+ QK + K +S+ R S  L+ NN
Sbjct: 1063 KVNIQRKEGPSSSNRSSM-NPKEHTEVKSGQSSKSQPSMQKNMLKRTSTNRTSNALKQNN 1121

Query: 1210 QKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFLA 1031
            QKQN  S RD    + + S  K K+  ++S ++   +               S+K   +A
Sbjct: 1122 QKQNGGSTRDVLTSKTAVSNQKSKKAPSVSGSFGPSK---TVNKVVINTEAGSKKMGSVA 1178

Query: 1030 ADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDGI 851
             D  KE S  + K  S+K+L  +GNI   G +A   +  KD KSIKCNVA E  + W G 
Sbjct: 1179 NDIRKESSLSKTKNASQKKLSVDGNICFEGSIADGVLTNKDVKSIKCNVAVEGGTDWGGD 1238

Query: 850  DKKXXXXXXXXXXXSPIKKHV-GSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGF 674
            + K           SP+KK + GS SS  ++EA                 +NS++SS G 
Sbjct: 1239 NIKKGMDVVSFTFTSPMKKPIPGSMSSDQVMEAKYQFNIDSNDENDAHGSKNSSISSLGL 1298

Query: 673  NVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNMGSAVNLVESMDNGV 494
            NVIG D+L VLLEQKL+ELT RV  S  D                 G+A +         
Sbjct: 1299 NVIGADSLGVLLEQKLRELTFRVGLSHSDLFAP-------------GTAAS--------- 1336

Query: 493  CESEREAQDGSDCASTEKLWLRADKSSKVPECFEGVVDGNNIQRYXXXXXXXXXXXXXXX 314
              S    QD     +          S  +P+  E   DG +   +               
Sbjct: 1337 --STSRLQDSDLRVNVVAPTSTKHTSRLLPDLHEDKSDGPHYFDFSSVGGLQANQKWQVH 1394

Query: 313  XXXSCDSFDVDRSASNEGPTGCLSLDSC------EGTNWRATRKPHVTEGDQEISDTASS 152
                 +      S +NE   G    + C      E  +W +  K  + EG+ E+SD+ASS
Sbjct: 1395 VSEGMEELS-GNSNNNEMGNGLSGSEQCSLAETDEVDSWTSRSKSQLAEGEAELSDSASS 1453

Query: 151  LSFGTM-SETVSSTLHMADSKDSPNWEVKYIEDILRHTELLLEDFSLGQAH 2
            +S   M +  ++ST H+ D K+S NWE++Y+ +IL   EL LEDF+ G  H
Sbjct: 1454 VSILRMNTRNMASTSHLTDFKESVNWELEYMREILCKAELTLEDFASGHTH 1504


>ref|XP_004230289.1| PREDICTED: uncharacterized protein LOC101268805 [Solanum
            lycopersicum]
          Length = 902

 Score =  294 bits (753), Expect = 8e-77
 Identities = 218/642 (33%), Positives = 316/642 (49%), Gaps = 6/642 (0%)
 Frame = -1

Query: 1915 VIFESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSP 1736
            ++  ++RNKLDG  RNP+++RLQK Q+RP+ERFQ EVLPPKSAKPI+V+  RLLSPIKSP
Sbjct: 137  IVDGNMRNKLDGFKRNPVEVRLQKVQSRPIERFQSEVLPPKSAKPIAVTQPRLLSPIKSP 196

Query: 1735 GFIPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSG 1556
            GFIPPKN            +QSPR  ++    S GS S P R+RDL++++E+ QR S   
Sbjct: 197  GFIPPKNAAYIIEAAAKIYQQSPRPAAREKVQSSGSSSAPLRIRDLRDQIEAVQRQSSIY 256

Query: 1555 VASQKGREENS-QSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKPVSLAVQA 1379
             A  + +E+NS +++++Q   RG+ +  D                      K VSLAVQA
Sbjct: 257  EAPHRPKEQNSVKNVRRQPCERGQVQ-SDNLRQLRVSEVSRRDVSQNKGKEKSVSLAVQA 315

Query: 1378 RSNIQRKDGPXXXXXXXSEKQKEHNGFVTRDPPNAQKKVEKPSSSRRPSEVLRMNNQKQN 1199
            ++N+Q+++G           QKE N   +     + K  E+ +S  RPS+VLR NNQKQN
Sbjct: 316  KTNVQKREGKESTSSKNPLNQKEQNESKSGRRRTSVKVGERKNSLNRPSDVLRQNNQKQN 375

Query: 1198 RASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFLAADPG 1019
             AS +D  +   S    KEK+ S+        R               +   + +  D G
Sbjct: 376  SASNKDGESSNTSAPYHKEKKSSSTGN---MSRSTKTVSRIVVNTTAATGIASIVETDVG 432

Query: 1018 KEVSS---LRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDGID 848
            K++SS    R ++ + K+   N +I S    A   M  KDE+SIKCN+  E  S W+  D
Sbjct: 433  KDLSSSRDSRVRSFTGKKQPVNVDIGSDECGADNMMKNKDERSIKCNLTIEGCSNWETAD 492

Query: 847  KKXXXXXXXXXXXSPIKKHV-GSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGFN 671
            +K           SPIKK + G  SS+ +LE              +SD R S + S    
Sbjct: 493  RKNGSDVVSFTFTSPIKKSMPGPTSSSHVLEKNSALCLFPGSYDDQSDSRTSTMPSFR-- 550

Query: 670  VIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNMGSAVNLVESMDNGVC 491
             IGGD L +LLEQK+KELTS+V  S +D                   +V++V        
Sbjct: 551  -IGGDDLGILLEQKIKELTSKVGPSCED---FIKTGTASTSTNAFEDSVSIVAHGRRPQV 606

Query: 490  ESEREAQDGSDCASTEKLWLRADKSSKVPECFEGVVDGNNIQRYXXXXXXXXXXXXXXXX 311
            +   E       +S + L L A +  + P   E     ++I                   
Sbjct: 607  DLLNEKAGDPGHSSVDDLQLTATQMWQGPNRVENPKTASSIT--CEGEFSLASSMEPSIS 664

Query: 310  XXSCDSFDVDRSASNEGPTGCLSLDSCEGTNWRATRKPHVTEGDQEISDTASSLSFGTMS 131
              SC S D  RS + +G    LS  S    NW+   + H+ EGD E+ D+ASS S     
Sbjct: 665  GGSCSSLDSFRSLATDGSKYHLSDGSHYMMNWKTYMRTHLVEGDAELLDSASSASLADAG 724

Query: 130  ETVS-STLHMADSKDSPNWEVKYIEDILRHTELLLEDFSLGQ 8
            E  S +TL  ++  +S  WE +YI DI+R +++++E+F LG+
Sbjct: 725  EKESTTTLTSSNFNESAYWEFQYIRDIIRSSDMVMEEFLLGE 766


>emb|CBI39573.3| unnamed protein product [Vitis vinifera]
          Length = 901

 Score =  290 bits (741), Expect = 2e-75
 Identities = 215/650 (33%), Positives = 304/650 (46%), Gaps = 14/650 (2%)
 Frame = -1

Query: 1909 FESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGF 1730
            + ++ NKL+G   +P++ R ++ Q RP+ERFQ E+LPPKSAK I  +H++LLSPIKSPGF
Sbjct: 137  YNNMPNKLEGDRVSPVESRPRRVQRRPIERFQTEMLPPKSAKSIPFTHHKLLSPIKSPGF 196

Query: 1729 IPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVA 1550
            IP KN            E  P AT K   PS GS SVP R+RDLKEKME+AQ+SS+    
Sbjct: 197  IPTKNATYVMEAAAKIIEPGPHATPKRKVPSVGSSSVPLRIRDLKEKMEAAQKSSR---L 253

Query: 1549 SQKGREENSQSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKPVSLAVQARSN 1370
             +  +  + + +  Q+N +     ED                      K VSLA QA+ N
Sbjct: 254  QRPKQSTDVKHMNGQINGKRFNGSEDTPSLNNSKDLVKRNSDSMKKKGKSVSLAEQAKVN 313

Query: 1369 IQRKDGPXXXXXXXSEKQKEHNGFVTRDPPNAQKKVEKPSSSRRPSEVLRMNNQKQNRAS 1190
            IQRK+GP                   R   N ++  E  +S+ R S  L+ NNQKQN  S
Sbjct: 314  IQRKEGPSSS---------------NRSSMNPKEHTEVKTSTNRTSNALKQNNQKQNGGS 358

Query: 1189 ARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFLAADPGKEV 1010
             RD    + + S  K K+  +++ ++   +               S+K   +A D  KE 
Sbjct: 359  TRDVLTSKTAVSNQKSKKAPSVNGSFGPSK---TVNKVVINTEAGSKKMGSVANDIRKES 415

Query: 1009 SSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDGIDKKXXXX 830
            S  + K  S+K+L  +GNI   G +A   +  KD KSIKCNVA E  + W G + K    
Sbjct: 416  SLSKTKNASRKKLSVDGNICFEGSIADGVLTNKDVKSIKCNVAVEGGTDWGGDNIKKGMD 475

Query: 829  XXXXXXXSPIKKHV-GSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGFNVIGGDA 653
                   SP+KK + GS SS  ++EA                 +NS++SS G NVIG D+
Sbjct: 476  VVSFTFTSPMKKPIPGSMSSDQVMEAKYQFNIDSNDENDAHGSKNSSISSLGPNVIGADS 535

Query: 652  LSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNMGSAVNLVESMDN--------G 497
            L VLLEQKL+ELT RV  S  D               +    VN+V              
Sbjct: 536  LGVLLEQKLRELTFRVGSSHSDLFAPGTAASSTSRLQDSDLRVNVVAPTSTKHTSRLLPD 595

Query: 496  VCESEREAQDGSDCASTEKLWLRADKSSKVPECFEGVVDGNNIQR----YXXXXXXXXXX 329
            + E + +     D +S   L         V E  E +   +N                  
Sbjct: 596  LHEDKSDGPHYFDFSSVGGLQANQKWQVHVSEGMEELSGNSNNNEMGNGLSGQHPSPVLS 655

Query: 328  XXXXXXXXSCDSFDVDRSASNEGPTGCLSLDSCEGTNWRATRKPHVTEGDQEISDTASSL 149
                    +C+S D   S S  G   C   ++ E  +W +  K  + EG+ E+SD+ASS+
Sbjct: 656  LESSFSNITCNSPDSRNSYSVNGSEQCSLAETDEVDSWTSRSKSQLAEGEAELSDSASSV 715

Query: 148  SFGTMS-ETVSSTLHMADSKDSPNWEVKYIEDILRHTELLLEDFSLGQAH 2
            S   M+   ++ST H+ D K+S NWE++Y+ +IL   EL LEDF+ G  H
Sbjct: 716  SILHMNPRNMASTSHLTDFKESVNWELEYMREILCKAELTLEDFASGHTH 765


>ref|XP_007031123.1| Uncharacterized protein isoform 6, partial [Theobroma cacao]
            gi|508719728|gb|EOY11625.1| Uncharacterized protein
            isoform 6, partial [Theobroma cacao]
          Length = 697

 Score =  231 bits (588), Expect = 1e-57
 Identities = 167/446 (37%), Positives = 223/446 (50%), Gaps = 8/446 (1%)
 Frame = -1

Query: 1909 FESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGF 1730
            + ++ NKLD  S NP++ R  K QNRP+ERFQ E+LPPKSAKPI ++H++LLSPI+SPGF
Sbjct: 140  YTNISNKLDRLSSNPIEPRFHKVQNRPIERFQTEILPPKSAKPIPITHHKLLSPIRSPGF 199

Query: 1729 IPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVA 1550
            IP KN            E SP+ TSKG  PS GS SVP R+RDLK K+E+A ++S+    
Sbjct: 200  IPTKNAAYIMEAAAKIIEASPQTTSKGKGPSLGSSSVPLRIRDLKGKIEAAHKASR---- 255

Query: 1549 SQKGREENSQSIKKQLNARGKGRMEDGYLY----KGXXXXXXXXXXXXXXXXKPVSLAVQ 1382
              +  +E S S  K L  + K +  +   Y    +                 K VSLA Q
Sbjct: 256  -PQRPDEPSVSAMKPLKGQHKNKSHNKSDYTPTLRISRDSEKVSSNSLRNKGKSVSLAEQ 314

Query: 1381 ARSNIQRKDGPXXXXXXXSEKQKEHNGF----VTRDPPNAQKKVEKPSSSRRPSEVLRMN 1214
            AR N+QR+DG        S  QKE N       +R   + Q+ VEK +S+ R + VLR N
Sbjct: 315  ARVNVQRRDGSFSSSNGSSASQKERNDAKRKQFSRSQADMQRTVEKGTSANRTNNVLRPN 374

Query: 1213 NQKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFL 1034
            NQKQN  S RD    + S      ++  +++      R               SRKT  +
Sbjct: 375  NQKQNCISTRDYSTSKTSTLDQHARKARSMNGTIGRNR---TLNKVTINSEPQSRKTGSV 431

Query: 1033 AADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDG 854
            A D  KE+   R K   KK+   N ++ S    +  +     EKSIKCNVA       D 
Sbjct: 432  ANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYSEKSIKCNVATNGHLNRDA 491

Query: 853  IDKKXXXXXXXXXXXSPIKKHVGSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGF 674
               K           SPI + V   SS+   + +               L++SA SS GF
Sbjct: 492  EKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY--------LKSSAFSSPGF 542

Query: 673  NVIGGDALSVLLEQKLKELTSRVEFS 596
            N+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 543  NIIGGDSLSVLLEKKLQELTCGVESS 568


>ref|XP_007031122.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508719727|gb|EOY11624.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 699

 Score =  231 bits (588), Expect = 1e-57
 Identities = 167/446 (37%), Positives = 223/446 (50%), Gaps = 8/446 (1%)
 Frame = -1

Query: 1909 FESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGF 1730
            + ++ NKLD  S NP++ R  K QNRP+ERFQ E+LPPKSAKPI ++H++LLSPI+SPGF
Sbjct: 139  YTNISNKLDRLSSNPIEPRFHKVQNRPIERFQTEILPPKSAKPIPITHHKLLSPIRSPGF 198

Query: 1729 IPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVA 1550
            IP KN            E SP+ TSKG  PS GS SVP R+RDLK K+E+A ++S+    
Sbjct: 199  IPTKNAAYIMEAAAKIIEASPQTTSKGKGPSLGSSSVPLRIRDLKGKIEAAHKASR---- 254

Query: 1549 SQKGREENSQSIKKQLNARGKGRMEDGYLY----KGXXXXXXXXXXXXXXXXKPVSLAVQ 1382
              +  +E S S  K L  + K +  +   Y    +                 K VSLA Q
Sbjct: 255  -PQRPDEPSVSAMKPLKGQHKNKSHNKSDYTPTLRISRDSEKVSSNSLRNKGKSVSLAEQ 313

Query: 1381 ARSNIQRKDGPXXXXXXXSEKQKEHNGF----VTRDPPNAQKKVEKPSSSRRPSEVLRMN 1214
            AR N+QR+DG        S  QKE N       +R   + Q+ VEK +S+ R + VLR N
Sbjct: 314  ARVNVQRRDGSFSSSNGSSASQKERNDAKRKQFSRSQADMQRTVEKGTSANRTNNVLRPN 373

Query: 1213 NQKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFL 1034
            NQKQN  S RD    + S      ++  +++      R               SRKT  +
Sbjct: 374  NQKQNCISTRDYSTSKTSTLDQHARKARSMNGTIGRNR---TLNKVTINSEPQSRKTGSV 430

Query: 1033 AADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDG 854
            A D  KE+   R K   KK+   N ++ S    +  +     EKSIKCNVA       D 
Sbjct: 431  ANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYSEKSIKCNVATNGHLNRDA 490

Query: 853  IDKKXXXXXXXXXXXSPIKKHVGSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGF 674
               K           SPI + V   SS+   + +               L++SA SS GF
Sbjct: 491  EKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY--------LKSSAFSSPGF 541

Query: 673  NVIGGDALSVLLEQKLKELTSRVEFS 596
            N+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 542  NIIGGDSLSVLLEKKLQELTCGVESS 567


>ref|XP_007031121.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508719726|gb|EOY11623.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 909

 Score =  231 bits (588), Expect = 1e-57
 Identities = 167/446 (37%), Positives = 223/446 (50%), Gaps = 8/446 (1%)
 Frame = -1

Query: 1909 FESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGF 1730
            + ++ NKLD  S NP++ R  K QNRP+ERFQ E+LPPKSAKPI ++H++LLSPI+SPGF
Sbjct: 139  YTNISNKLDRLSSNPIEPRFHKVQNRPIERFQTEILPPKSAKPIPITHHKLLSPIRSPGF 198

Query: 1729 IPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVA 1550
            IP KN            E SP+ TSKG  PS GS SVP R+RDLK K+E+A ++S+    
Sbjct: 199  IPTKNAAYIMEAAAKIIEASPQTTSKGKGPSLGSSSVPLRIRDLKGKIEAAHKASR---- 254

Query: 1549 SQKGREENSQSIKKQLNARGKGRMEDGYLY----KGXXXXXXXXXXXXXXXXKPVSLAVQ 1382
              +  +E S S  K L  + K +  +   Y    +                 K VSLA Q
Sbjct: 255  -PQRPDEPSVSAMKPLKGQHKNKSHNKSDYTPTLRISRDSEKVSSNSLRNKGKSVSLAEQ 313

Query: 1381 ARSNIQRKDGPXXXXXXXSEKQKEHNGF----VTRDPPNAQKKVEKPSSSRRPSEVLRMN 1214
            AR N+QR+DG        S  QKE N       +R   + Q+ VEK +S+ R + VLR N
Sbjct: 314  ARVNVQRRDGSFSSSNGSSASQKERNDAKRKQFSRSQADMQRTVEKGTSANRTNNVLRPN 373

Query: 1213 NQKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFL 1034
            NQKQN  S RD    + S      ++  +++      R               SRKT  +
Sbjct: 374  NQKQNCISTRDYSTSKTSTLDQHARKARSMNGTIGRNR---TLNKVTINSEPQSRKTGSV 430

Query: 1033 AADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDG 854
            A D  KE+   R K   KK+   N ++ S    +  +     EKSIKCNVA       D 
Sbjct: 431  ANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYSEKSIKCNVATNGHLNRDA 490

Query: 853  IDKKXXXXXXXXXXXSPIKKHVGSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGF 674
               K           SPI + V   SS+   + +               L++SA SS GF
Sbjct: 491  EKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY--------LKSSAFSSPGF 541

Query: 673  NVIGGDALSVLLEQKLKELTSRVEFS 596
            N+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 542  NIIGGDSLSVLLEKKLQELTCGVESS 567


>ref|XP_007031120.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
            gi|590644591|ref|XP_007031124.1| Uncharacterized protein
            isoform 3, partial [Theobroma cacao]
            gi|508719725|gb|EOY11622.1| Uncharacterized protein
            isoform 3, partial [Theobroma cacao]
            gi|508719729|gb|EOY11626.1| Uncharacterized protein
            isoform 3, partial [Theobroma cacao]
          Length = 787

 Score =  231 bits (588), Expect = 1e-57
 Identities = 167/446 (37%), Positives = 223/446 (50%), Gaps = 8/446 (1%)
 Frame = -1

Query: 1909 FESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGF 1730
            + ++ NKLD  S NP++ R  K QNRP+ERFQ E+LPPKSAKPI ++H++LLSPI+SPGF
Sbjct: 139  YTNISNKLDRLSSNPIEPRFHKVQNRPIERFQTEILPPKSAKPIPITHHKLLSPIRSPGF 198

Query: 1729 IPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVA 1550
            IP KN            E SP+ TSKG  PS GS SVP R+RDLK K+E+A ++S+    
Sbjct: 199  IPTKNAAYIMEAAAKIIEASPQTTSKGKGPSLGSSSVPLRIRDLKGKIEAAHKASR---- 254

Query: 1549 SQKGREENSQSIKKQLNARGKGRMEDGYLY----KGXXXXXXXXXXXXXXXXKPVSLAVQ 1382
              +  +E S S  K L  + K +  +   Y    +                 K VSLA Q
Sbjct: 255  -PQRPDEPSVSAMKPLKGQHKNKSHNKSDYTPTLRISRDSEKVSSNSLRNKGKSVSLAEQ 313

Query: 1381 ARSNIQRKDGPXXXXXXXSEKQKEHNGF----VTRDPPNAQKKVEKPSSSRRPSEVLRMN 1214
            AR N+QR+DG        S  QKE N       +R   + Q+ VEK +S+ R + VLR N
Sbjct: 314  ARVNVQRRDGSFSSSNGSSASQKERNDAKRKQFSRSQADMQRTVEKGTSANRTNNVLRPN 373

Query: 1213 NQKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFL 1034
            NQKQN  S RD    + S      ++  +++      R               SRKT  +
Sbjct: 374  NQKQNCISTRDYSTSKTSTLDQHARKARSMNGTIGRNR---TLNKVTINSEPQSRKTGSV 430

Query: 1033 AADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDG 854
            A D  KE+   R K   KK+   N ++ S    +  +     EKSIKCNVA       D 
Sbjct: 431  ANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYSEKSIKCNVATNGHLNRDA 490

Query: 853  IDKKXXXXXXXXXXXSPIKKHVGSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGF 674
               K           SPI + V   SS+   + +               L++SA SS GF
Sbjct: 491  EKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY--------LKSSAFSSPGF 541

Query: 673  NVIGGDALSVLLEQKLKELTSRVEFS 596
            N+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 542  NIIGGDSLSVLLEKKLQELTCGVESS 567


>ref|XP_007031119.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508719724|gb|EOY11621.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 840

 Score =  231 bits (588), Expect = 1e-57
 Identities = 167/446 (37%), Positives = 223/446 (50%), Gaps = 8/446 (1%)
 Frame = -1

Query: 1909 FESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGF 1730
            + ++ NKLD  S NP++ R  K QNRP+ERFQ E+LPPKSAKPI ++H++LLSPI+SPGF
Sbjct: 140  YTNISNKLDRLSSNPIEPRFHKVQNRPIERFQTEILPPKSAKPIPITHHKLLSPIRSPGF 199

Query: 1729 IPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVA 1550
            IP KN            E SP+ TSKG  PS GS SVP R+RDLK K+E+A ++S+    
Sbjct: 200  IPTKNAAYIMEAAAKIIEASPQTTSKGKGPSLGSSSVPLRIRDLKGKIEAAHKASR---- 255

Query: 1549 SQKGREENSQSIKKQLNARGKGRMEDGYLY----KGXXXXXXXXXXXXXXXXKPVSLAVQ 1382
              +  +E S S  K L  + K +  +   Y    +                 K VSLA Q
Sbjct: 256  -PQRPDEPSVSAMKPLKGQHKNKSHNKSDYTPTLRISRDSEKVSSNSLRNKGKSVSLAEQ 314

Query: 1381 ARSNIQRKDGPXXXXXXXSEKQKEHNGF----VTRDPPNAQKKVEKPSSSRRPSEVLRMN 1214
            AR N+QR+DG        S  QKE N       +R   + Q+ VEK +S+ R + VLR N
Sbjct: 315  ARVNVQRRDGSFSSSNGSSASQKERNDAKRKQFSRSQADMQRTVEKGTSANRTNNVLRPN 374

Query: 1213 NQKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFL 1034
            NQKQN  S RD    + S      ++  +++      R               SRKT  +
Sbjct: 375  NQKQNCISTRDYSTSKTSTLDQHARKARSMNGTIGRNR---TLNKVTINSEPQSRKTGSV 431

Query: 1033 AADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDG 854
            A D  KE+   R K   KK+   N ++ S    +  +     EKSIKCNVA       D 
Sbjct: 432  ANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYSEKSIKCNVATNGHLNRDA 491

Query: 853  IDKKXXXXXXXXXXXSPIKKHVGSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGF 674
               K           SPI + V   SS+   + +               L++SA SS GF
Sbjct: 492  EKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY--------LKSSAFSSPGF 542

Query: 673  NVIGGDALSVLLEQKLKELTSRVEFS 596
            N+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 543  NIIGGDSLSVLLEKKLQELTCGVESS 568


>ref|XP_007031118.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508719723|gb|EOY11620.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 910

 Score =  231 bits (588), Expect = 1e-57
 Identities = 167/446 (37%), Positives = 223/446 (50%), Gaps = 8/446 (1%)
 Frame = -1

Query: 1909 FESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGF 1730
            + ++ NKLD  S NP++ R  K QNRP+ERFQ E+LPPKSAKPI ++H++LLSPI+SPGF
Sbjct: 140  YTNISNKLDRLSSNPIEPRFHKVQNRPIERFQTEILPPKSAKPIPITHHKLLSPIRSPGF 199

Query: 1729 IPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVA 1550
            IP KN            E SP+ TSKG  PS GS SVP R+RDLK K+E+A ++S+    
Sbjct: 200  IPTKNAAYIMEAAAKIIEASPQTTSKGKGPSLGSSSVPLRIRDLKGKIEAAHKASR---- 255

Query: 1549 SQKGREENSQSIKKQLNARGKGRMEDGYLY----KGXXXXXXXXXXXXXXXXKPVSLAVQ 1382
              +  +E S S  K L  + K +  +   Y    +                 K VSLA Q
Sbjct: 256  -PQRPDEPSVSAMKPLKGQHKNKSHNKSDYTPTLRISRDSEKVSSNSLRNKGKSVSLAEQ 314

Query: 1381 ARSNIQRKDGPXXXXXXXSEKQKEHNGF----VTRDPPNAQKKVEKPSSSRRPSEVLRMN 1214
            AR N+QR+DG        S  QKE N       +R   + Q+ VEK +S+ R + VLR N
Sbjct: 315  ARVNVQRRDGSFSSSNGSSASQKERNDAKRKQFSRSQADMQRTVEKGTSANRTNNVLRPN 374

Query: 1213 NQKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFL 1034
            NQKQN  S RD    + S      ++  +++      R               SRKT  +
Sbjct: 375  NQKQNCISTRDYSTSKTSTLDQHARKARSMNGTIGRNR---TLNKVTINSEPQSRKTGSV 431

Query: 1033 AADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDG 854
            A D  KE+   R K   KK+   N ++ S    +  +     EKSIKCNVA       D 
Sbjct: 432  ANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYSEKSIKCNVATNGHLNRDA 491

Query: 853  IDKKXXXXXXXXXXXSPIKKHVGSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGF 674
               K           SPI + V   SS+   + +               L++SA SS GF
Sbjct: 492  EKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY--------LKSSAFSSPGF 542

Query: 673  NVIGGDALSVLLEQKLKELTSRVEFS 596
            N+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 543  NIIGGDSLSVLLEKKLQELTCGVESS 568


>ref|XP_007207147.1| hypothetical protein PRUPE_ppa001266mg [Prunus persica]
            gi|462402789|gb|EMJ08346.1| hypothetical protein
            PRUPE_ppa001266mg [Prunus persica]
          Length = 867

 Score =  226 bits (575), Expect = 4e-56
 Identities = 200/647 (30%), Positives = 283/647 (43%), Gaps = 14/647 (2%)
 Frame = -1

Query: 1903 SVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGFIP 1724
            +V  KLD  S NP++ R Q  Q++P+ERFQ EVLPPKSAK I V+H++LLSPIKSPGFIP
Sbjct: 142  NVPKKLDRFSWNPVESRAQGVQSQPIERFQTEVLPPKSAKSIPVTHHKLLSPIKSPGFIP 201

Query: 1723 PKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVASQ 1544
             KN            E SPRA+SK    S G  S+P R+RDLKEKME+ Q+      AS+
Sbjct: 202  TKNAAYIMEATSKIIEASPRASSKSKGSSVGPSSIPLRIRDLKEKMEAVQK------ASR 255

Query: 1543 KGREENSQSIKKQLNARGKGRMEDG----YLYKGXXXXXXXXXXXXXXXXKPVSLAVQAR 1376
              R + +  +K      G  R+++G    +L K                 K VSLAVQA+
Sbjct: 256  PERPKEAGDVKYMKGLPG-DRIQNGSVNVHLPKASVNSERQSYRDGRNKGKSVSLAVQAK 314

Query: 1375 SNIQRKDGPXXXXXXXSEKQKE-----HNGFVTRDPPNAQKKVEKPSSSRRPSEVLRMNN 1211
             N+QRKDG           QKE      N F    PP+ Q+ V K +S      VL+ NN
Sbjct: 315  VNVQRKDGSSSCSNRSFMNQKEQNEMKQNQFSKSRPPSPQRAVHKKTSPDSTKSVLKQNN 374

Query: 1210 QKQNRASARD---DGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTN 1040
            QKQN  S +D     N  P+    + +     STN  + R               S K  
Sbjct: 375  QKQNCVSNKDKTTSKNIVPNPPTRRMR-----STNG-SSRPGKTVSKVLVNSETGSGKMG 428

Query: 1039 FLAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKW 860
             +    GKE S    K  S K      ++     V+  A   +DE+S+KCNV+ +  +  
Sbjct: 429  SMGNFTGKEFSLSTMKKVSGKLRSVGQDVHLEEAVSDNAFISEDERSVKCNVSMDGCTSL 488

Query: 859  DGIDKKXXXXXXXXXXXSPIKKHVG--SNSSAAMLEATXXXXXXXXXSARESDLRNSAVS 686
               ++K           SP+K+ +     S   M             + ++    N  +S
Sbjct: 489  GADNRKQAMDVVSFTFTSPLKRSISELQCSGQVMSRNNSFYIDSFGNNDQQRYPENFTLS 548

Query: 685  SSGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNMGSAVNLVESM 506
            S GFNVIGGDALSVLLEQKL+EL+ +VE SQ +                  S+ + ++ M
Sbjct: 549  SPGFNVIGGDALSVLLEQKLQELSCKVELSQHN--------PANEETTAAASSSSGLQDM 600

Query: 505  DNGVCESEREAQDGSDCASTEKLWLRADKSSKVPECFEGVVDGNNIQRYXXXXXXXXXXX 326
             +GV  +    +         +L LR D+   +     G +   ++ +            
Sbjct: 601  ASGVASTASRGK-------KFELGLRRDEFDSINH--YGCLLSVDVNQQWKGSEGMEECS 651

Query: 325  XXXXXXXSCDSFDVDRSASNEGPTGCLSLDSCEGTNWRATRKPHVTEGDQEISDTASSLS 146
                   +   FD      N  P    S +S   T             D   S  ASS S
Sbjct: 652  SSSITSANGKEFDY----QNHSPLSAPSFESRSCT-------------DNRNSANASSAS 694

Query: 145  FGTMSETVSSTLHMADSKDSPNWEVKYIEDILRHTELLLEDFSLGQA 5
             G +S  ++          + NWE++Y+  IL + +L +EDF+LG A
Sbjct: 695  TGDVSGNMTRISGSHYLNRTNNWELEYVRYILSNVDLEMEDFALGDA 741


>ref|XP_002314925.1| hypothetical protein POPTR_0010s15080g [Populus trichocarpa]
            gi|222863965|gb|EEF01096.1| hypothetical protein
            POPTR_0010s15080g [Populus trichocarpa]
          Length = 933

 Score =  216 bits (551), Expect = 2e-53
 Identities = 189/666 (28%), Positives = 292/666 (43%), Gaps = 36/666 (5%)
 Frame = -1

Query: 1894 NKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGFIPPKN 1715
            NK DG  RN +  + QK  +RP+E+FQ E+LPPKSAK I  +H++LLSPIKSPGFIP K 
Sbjct: 152  NKEDGPPRNLVKSKPQKVLSRPIEKFQTEILPPKSAKSIPTTHHKLLSPIKSPGFIPSKT 211

Query: 1714 XXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQR-----SSQSGVA 1550
                        E SP A +K   P+ GS S+P +VRDLKEK+E AQ+     SS + + 
Sbjct: 212  AAHIMEAAAKIIEPSPLAVAKAKMPALGSSSLPLKVRDLKEKLEVAQKMPLVGSSSAAIR 271

Query: 1549 SQKGREENSQSIK----------------------KQLNARGKGRMEDGYLYKGXXXXXX 1436
            +++ +E+   S K                      + LN    G  +  Y          
Sbjct: 272  TREAKEKVEASHKTSRLAETSRRPVESSAAKHLKGQSLNKSWNGSDDTSYR---AFSETD 328

Query: 1435 XXXXXXXXXXKPVSLAVQARSNIQRKDGPXXXXXXXSEKQKEHNGFVTRDP----PNAQK 1268
                      K +SLA+QA+ N+QR++G           QKE     +  P    PN QK
Sbjct: 329  EDSSSSKTKVKSISLAIQAKFNVQRREGLNASSSQGFVGQKEQAEVSSSQPFKSHPNFQK 388

Query: 1267 KVEKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXX 1088
              +K S   + S  LR NNQKQN    +D    +P  S  + K+  +      N      
Sbjct: 389  SSQKRSPILKASGALRQNNQKQNCMMDKDKLPSKPLVSNLQGKKVLSG-----NPPARHK 443

Query: 1087 XXXXXXXXXXTSRKTNFLAADPGKEVSSLRAKTTSKKRLLANGNIQ-SSGGVAQKAMAVK 911
                       SRK    + +  K  S+   ++  +K+   +GN+      VA K +  +
Sbjct: 444  TFCKTFGSKNGSRKLASDSREVEKGTSNYSTRSNPRKKRSIDGNLHLEKNQVADKLLIDR 503

Query: 910  DEKSIKCNVAFEVDSKWDGIDKKXXXXXXXXXXXSPIKKHV-GSNSSAAMLEATXXXXXX 734
            + K+++ N   +    W    K+           +P+ + + GS +   +++        
Sbjct: 504  NRKAVETNPVIDRHFSWVEESKRKGMDVVSFTFTAPLTRSMPGSETPTRVVQEKSGSCTD 563

Query: 733  XXXSARESDLRNSAVSSSGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXX 554
                    D  +  +SS G+NVIGGDALS LLEQK++ELT  VE S              
Sbjct: 564  NRSKRLLLDTDSMNLSSGGYNVIGGDALSTLLEQKMRELTKTVESSSS-----------L 612

Query: 553  XXXXNMGSAVNLVESMDNGVCESEREAQDGSDC--ASTEKLWLRADKSSKVPECFEGVVD 380
                + G+A  L ++ D  V   +R +    DC   ST+   LR  +  +  +  +    
Sbjct: 613  STFSSGGTAPRLHDNKDESVSCIDR-SDSCYDCHFLSTDPAALRLKRILQGVDEMDCSSK 671

Query: 379  GNNIQRY-XXXXXXXXXXXXXXXXXXSCDSFDVDRSASNEGPTGCLSLDSCEGTNWRATR 203
             N+ +++                   S  S D   S   EG   C S+   E     +++
Sbjct: 672  SNDSRKFLDCRRPSPVSVLEHSFSTESSSSLDSADSCITEGSRHCSSIQVQEVHGLSSSK 731

Query: 202  KPHVTEGDQEISDTASSLSFGTMSETVSSTLHMADSKDSPNWEVKYIEDILRHTELLLED 23
            K H  + D E+SD+ASS S GT+    ++ L +     S  WE++Y++ IL + EL+ +D
Sbjct: 732  KFHFVDVDTELSDSASSSSTGTVDRKHANMLAVTGLARSTKWEIEYVKKILCNIELMFQD 791

Query: 22   FSLGQA 5
            F+LG+A
Sbjct: 792  FALGRA 797


>ref|XP_002512492.1| conserved hypothetical protein [Ricinus communis]
            gi|223548453|gb|EEF49944.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 851

 Score =  215 bits (548), Expect = 5e-53
 Identities = 157/447 (35%), Positives = 223/447 (49%), Gaps = 8/447 (1%)
 Frame = -1

Query: 1903 SVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGFIP 1724
            ++ + L+G SRN L+ R QK QNRP+ERFQ E+LPPKSAK I+++H++LLSPIK+PGFIP
Sbjct: 143  NISSNLEGYSRNSLESRSQKVQNRPIERFQTEMLPPKSAKSIALTHHKLLSPIKNPGFIP 202

Query: 1723 PKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVASQ 1544
             KN            E SP+AT  G  PS GS SVP R+RDLK KME+A  +S+   ++ 
Sbjct: 203  TKNATYIMEAAAKIIEASPKATVNGKMPSIGSTSVPLRIRDLKRKMEAAHTASRPQRSND 262

Query: 1543 KGREENSQSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKPVSLAVQARSNIQ 1364
                +N++      +ARG   +      K                 K VS +VQ RSN+Q
Sbjct: 263  FFAAKNTKGQLCDRSARGSEGISS---CKISTFSEKDTSESVRNKGKLVSPSVQVRSNVQ 319

Query: 1363 RKDGPXXXXXXXSEKQKEHNGFVTRDPPNAQKKVE--KPSSSRRPSEVLRMNNQKQNRAS 1190
            R++G         +KQKE     +   P +Q   +  K +S  R + VLR NNQKQN +S
Sbjct: 320  RREG-VTSRNSNIKKQKEQKEIRSNQSPKSQSSSQKTKKTSENRTTNVLRQNNQKQNSSS 378

Query: 1189 ARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFLAADPGKEV 1010
             ++  N + S S    K    +S++    R              TSRK + +  D  KE 
Sbjct: 379  GKESTNLKNSFSNQAGKRVQTMSSSVGQSR----TTNKVVLKPETSRKMHLVVTDTEKE- 433

Query: 1009 SSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFE------VDSKWDGID 848
               +    S K+   NG  Q   GV+      + E+SIKCN+A +      VD++ +G+D
Sbjct: 434  ---KPNNISLKKRPVNGEPQIGRGVSDNESLNRVERSIKCNLAVDGCMNTAVDNRKNGMD 490

Query: 847  KKXXXXXXXXXXXSPIKKHVGSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGFNV 668
                         SP+KK       + M ++           +      N + S  G N+
Sbjct: 491  ------VVSFTFTSPVKKATPDPQPSVMEKS--KSSVIDLFGSNGHPYFNKSTSFPGLNI 542

Query: 667  IGGDALSVLLEQKLKELTSRVEFSQKD 587
            IGGDAL VLLEQKL+EL ++VE SQ +
Sbjct: 543  IGGDALGVLLEQKLRELANKVESSQSN 569


>ref|XP_006382417.1| hypothetical protein POPTR_0005s01960g [Populus trichocarpa]
            gi|550337777|gb|ERP60214.1| hypothetical protein
            POPTR_0005s01960g [Populus trichocarpa]
          Length = 915

 Score =  214 bits (546), Expect = 9e-53
 Identities = 200/654 (30%), Positives = 277/654 (42%), Gaps = 33/654 (5%)
 Frame = -1

Query: 1870 NPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGFIPPKNXXXXXXXX 1691
            N ++ R  K +NRP +RFQ E LPPKSAK I  +H++LLSPIK+PGF P KN        
Sbjct: 159  NSVESRPHKVENRPSKRFQTETLPPKSAKSIPSTHHKLLSPIKNPGFTPTKNAAYIMEAA 218

Query: 1690 XXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESA-------QRSSQSGVASQKGRE 1532
                E +P+ATS G  PS G+ SVP R+RDLK+KME+A       QRSS+S VA      
Sbjct: 219  AKIIEANPKATSSGKVPSIGTSSVPLRIRDLKQKMEAAAHTTSKPQRSSESSVAKNT--- 275

Query: 1531 ENSQSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKPVSLAVQARS-NIQRKD 1355
            +  QS K      G          K                 K V LA QA+S N QR+D
Sbjct: 276  KGQQSDKSWSGPEGLSSS------KASTSSEKGTPSSLKNKGKSVPLAAQAKSTNGQRRD 329

Query: 1354 GPXXXXXXXSEKQKEHNGFVT----RDPPNAQKKVEKPSSSRRPSEVLRMNNQKQNRASA 1187
            G          KQKE N   T    +  P  Q  V+K  S  R S VL+ NN KQN A  
Sbjct: 330  GSTLKSKSIV-KQKEKNEVKTNQMLKTQPRTQNTVQKRISESRTSNVLQQNNLKQNSAPN 388

Query: 1186 RDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFLAADPGKEVS 1007
            +D    + S S  + ++  + S +    R                RK   +  D  KE  
Sbjct: 389  KDSSGLKNSLSNQQGRKTKSTSGSVGQSRTVKKVVVKPETVP---RKMGLVMTDSEKE-- 443

Query: 1006 SLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDGIDKKXXXXX 827
              + K  ++K+   +G++Q            KDE S K NV  + +      ++K     
Sbjct: 444  --KTKNIARKKRSVSGDLQIDRNATPNVSFNKDEMSTKSNVVMDGNMNMAMDNRKSGMDV 501

Query: 826  XXXXXXSPIKKHV-GSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGFNVIGGDAL 650
                  SPIK+    S SS  MLE           S     L++S     G NV+GGD L
Sbjct: 502  VSFTFSSPIKRATPSSQSSGQMLEKCSSSAIDSFGSKDHPSLKSSMSYFPGLNVMGGDVL 561

Query: 649  SVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNMGSAVNLVESMDNGVCESEREAQ 470
             VLLEQKL+ELT +VE                       S  N++    +    S  +  
Sbjct: 562  GVLLEQKLRELTYKVE----------------------SSHCNVIREETSSTSLSIFQNS 599

Query: 469  DGSDCASTEKLWLRADKSSKVPECFEGVVDGNNIQRYXXXXXXXXXXXXXXXXXXSCDSF 290
               + AST    L  DK  +V        D ++   Y                    +  
Sbjct: 600  STPNVASTSSAAL--DKMLQVVH----DKDKSDSLGYFDCILVENSQLAMNQKWQQSEDM 653

Query: 289  DVDRSASNEGPTG----------------CLSLDSC---EGTNWRATRKPHVTEGDQEIS 167
            +V  S+SN   TG                  +  SC    G++  +T +    EG+ E+S
Sbjct: 654  EVQSSSSNYSETGKELKCQRTSPVSILEPSFASGSCSYLNGSSHCSTNESVGMEGETELS 713

Query: 166  DTASSLS-FGTMSETVSSTLHMADSKDSPNWEVKYIEDILRHTELLLEDFSLGQ 8
            D+ASS+S    + +  + T  + +SK+S +WE+ ++ DIL   EL L+DFSLGQ
Sbjct: 714  DSASSISTVDVVRKYTTRTCSITESKESSDWELDFMRDILVSAELNLKDFSLGQ 767


>ref|XP_004304870.1| PREDICTED: uncharacterized protein LOC101302284 [Fragaria vesca
            subsp. vesca]
          Length = 899

 Score =  213 bits (543), Expect = 2e-52
 Identities = 188/640 (29%), Positives = 288/640 (45%), Gaps = 7/640 (1%)
 Frame = -1

Query: 1909 FESVRNKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGF 1730
            + ++ NKLD  S NP++ R Q+ QNRP+ERFQ EVLPPKSAK I V+H++LLSPIK+PGF
Sbjct: 140  YMNMPNKLDRVSWNPVESRAQRVQNRPIERFQTEVLPPKSAKSIPVTHHKLLSPIKTPGF 199

Query: 1729 IPPKNXXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVA 1550
            IP KN            E SPRA+SK    S  S S+P ++RDLKEKME+  + S+    
Sbjct: 200  IPTKNAAYIMEAAAKMIEASPRASSKSKMSSMRS-SIPLKIRDLKEKMEAVPKVSR---P 255

Query: 1549 SQKGREENSQSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKPVSLAVQARSN 1370
             Q     +++ +K +   +     ++  + K                 K  SLAVQA++N
Sbjct: 256  EQPKEPSDAKYVKGRPGYKSYNGSDNVPVPKASVDSEKQDYHDIRNRGKAASLAVQAKAN 315

Query: 1369 IQRKDGPXXXXXXXSEKQKEHN----GFVTRDPPNAQKKVEKPSSSRRPSEVLRMNNQKQ 1202
            +QRK+G        S  QKE N      +++   + Q+ V K +S+     VL+ NNQKQ
Sbjct: 316  VQRKEGSPPFSNRSSTNQKEQNEVKQNELSKSRQSTQRPVHKRTSTVSNKSVLKQNNQKQ 375

Query: 1201 NRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNFLAADP 1022
            N  S +D    +   S    ++   L     + R               SRK   + +  
Sbjct: 376  NCLSNKDRMTSKNVVSNQPTRK---LRPTNGSSRPNRTVNKVLVNSDTGSRKMGSMESAT 432

Query: 1021 GKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWDGIDKK 842
            GKE S    K  S K   A+ +      VA   +  K E+S+KCNVA E  +     ++K
Sbjct: 433  GKEFSFSTVKDVSGKIRSASQDFHLEEIVADNGLIGKHERSVKCNVATEGYTNLCTDNRK 492

Query: 841  XXXXXXXXXXXSPIKKHVGS-NSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSGFNVI 665
                       SP+KK +    S   ++  +         +  +   ++   +S G NVI
Sbjct: 493  QDMDVVSFTFTSPLKKSISELQSDGQVVSMSDRFCIDSFSNNDQLYPKHFTFASPGLNVI 552

Query: 664  GGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNM-GSAVNLVESMDNGVCE 488
            GGDALSVLLEQKL+ELT +VE SQ++               ++  S V+           
Sbjct: 553  GGDALSVLLEQKLQELTCKVESSQRNLFGEGTSASSSSSLQDLVSSEVSTASRGKKFELG 612

Query: 487  SEREAQDGSDCASTEKLWLRADKSSKVPECFEGVVDGNNIQRYXXXXXXXXXXXXXXXXX 308
              R+  DG+  A    L   A++  + PE  +     +                      
Sbjct: 613  LLRDNVDGA--ADFGSLLANANQKLQGPEGTDERSSSSKNSVPGKDFDYQLDPISVFEPS 670

Query: 307  XSCDSFDVDRSASNEGPT-GCLSLDSCEGTNWRATRKPHVTEGDQEISDTASSLSFGTMS 131
                SF  +RS++N   +  C    + +  N  ++ +   +    E+SD AS  ++    
Sbjct: 671  FESGSFTDNRSSANGSESERCSFAQAQDQFNLFSSFEIQPSYSVSELSDLAS--TWEVSG 728

Query: 130  ETVSSTLHMADSKDSPNWEVKYIEDILRHTELLLEDFSLG 11
            +  S          S +WE++Y++ IL   +L+LEDF+LG
Sbjct: 729  KNTSRVYGFHSPNQSYDWELEYVQYILSKVDLVLEDFALG 768


>ref|XP_007035732.1| Uncharacterized protein TCM_021314 [Theobroma cacao]
            gi|508714761|gb|EOY06658.1| Uncharacterized protein
            TCM_021314 [Theobroma cacao]
          Length = 930

 Score =  213 bits (541), Expect = 3e-52
 Identities = 189/675 (28%), Positives = 300/675 (44%), Gaps = 45/675 (6%)
 Frame = -1

Query: 1894 NKLDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGFIPPKN 1715
            NK++G +RN  + + QK  ++P+ERFQ E LPPK+AK I ++H++LLSPIKSPGF+P KN
Sbjct: 150  NKMEGPARNFGESKPQKIISKPIERFQTESLPPKAAKTIPITHHKLLSPIKSPGFVPSKN 209

Query: 1714 XXXXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVASQ--- 1544
                        E  P A S+   P   S SVP +VRD KEKME+AQ+    G +S    
Sbjct: 210  AAHIMEAAARIIEPGPHAISRAKMPMVRSSSVPVKVRDFKEKMEAAQKMPMVGSSSVPLK 269

Query: 1543 ----KGREEN-------SQSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXK-- 1403
                K + E        +++ ++ + +     ++   L K                 +  
Sbjct: 270  VRDLKEKVETVHKTSRLTETTRRPVESNAAKFLKGQSLNKSWNGSTDTTSPRTSDTEEIS 329

Query: 1402 --------PVSLAVQARSNIQRKDGPXXXXXXXSEKQKEHNGFVTRDP----PNAQKKVE 1259
                     +SLA+QA+ N+Q+++G           QK+ +   +  P    P+AQK + 
Sbjct: 330  SVLKSKGKSISLAIQAKVNVQKREGLASSSSRSLLGQKDQSEVKSSQPFKSQPSAQKSLH 389

Query: 1258 KPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQSKEKE----ESNLSTNYINGRXXX 1091
            K SS+   S VLR NNQKQN    +D    + + S    ++    +S+   + ++G+   
Sbjct: 390  KKSSTHNASGVLRQNNQKQNCIVDKDKLPSKSTASNLHSRKVLSGDSSFGRHKMSGKTVG 449

Query: 1090 XXXXXXXXXXXTSRKTNFLAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVK 911
                        SRK  F   D  K       K   KKR +          V    +  K
Sbjct: 450  NSKTG-------SRKLGFGTTDSEKGGPYSGTKNPRKKRSIDRDIQFEKNQVVDNVLIEK 502

Query: 910  DEKS----IKCNVAFEVDSKWDGIDKKXXXXXXXXXXXSPIKKHVGSNSSAAMLEATXXX 743
            ++K      + N ++  DSK  G+D             +P+ + +   +SA + +     
Sbjct: 503  NQKEDHPVTERNFSWVEDSKKKGMD------VVSFTFTAPLTRSM--ETSAQLAQKKNGI 554

Query: 742  XXXXXXSARESDLRNSAVSSSGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXX 563
                       D  +  +SS G+NVIGGDALS+LLEQKL+EL++ VE S           
Sbjct: 555  CMDNRGKRLLLDTESLKLSSMGYNVIGGDALSMLLEQKLRELSNAVESSCHKSLNSGSAS 614

Query: 562  XXXXXXXNMGSAVNLVESMDN-----GVCESEREAQDGSDCASTEKLWLRA----DKSSK 410
                   ++    N V +M +     G C S       S+ +ST+   LR       + +
Sbjct: 615  TSTSFSQDLVHTPNAVTTMPSLYNKLGSCHS-------SNLSSTDLQLLRLKHKFQGADE 667

Query: 409  VPECFEGVVDGNNIQRYXXXXXXXXXXXXXXXXXXSCDSFDVDRSASNEGPTGCLSLDSC 230
              EC    +D                         SC+S D   S S EG   C S+ + 
Sbjct: 668  TDECSSSCLDARQ--------PSPVSILEPSFSTESCNSSDSTDSCSIEGSKHCSSVQAQ 719

Query: 229  EGTNWRATRKPHVTEGDQEISDTASSLSFGTMSETVSSTLHMADSKDSPNWEVKYIEDIL 50
            E     +++K    + D E+SD+ASS+  GT+++   +T+ M+D   S NWE++Y++ IL
Sbjct: 720  EVLGLSSSKKLRSLDADTELSDSASSICPGTVAKRNQNTVVMSDPMKSVNWELEYVKLIL 779

Query: 49   RHTELLLEDFSLGQA 5
             + EL+ +DF+LG+A
Sbjct: 780  CNVELMFKDFALGRA 794


>ref|XP_007226468.1| hypothetical protein PRUPE_ppa1027230mg [Prunus persica]
            gi|462423404|gb|EMJ27667.1| hypothetical protein
            PRUPE_ppa1027230mg [Prunus persica]
          Length = 942

 Score =  211 bits (537), Expect = 9e-52
 Identities = 194/671 (28%), Positives = 302/671 (45%), Gaps = 43/671 (6%)
 Frame = -1

Query: 1888 LDGSSRNPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGFIPPKNXX 1709
            ++G +RNPL+ + QK   RP+ERFQ E LPP+SAK I ++H++LLSPIK+PGF+P KN  
Sbjct: 154  MEGPTRNPLEAKPQKL--RPIERFQTETLPPRSAKSIPITHHKLLSPIKNPGFVPTKNAA 211

Query: 1708 XXXXXXXXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVASQ--KGR 1535
                      EQ P+ T+K   P  G  SVP +V+ LKEK+E++++    G AS+  KGR
Sbjct: 212  HIMEAAAKIMEQGPQTTAKAKMPLVGCSSVPLKVQALKEKVEASRKVPLVGSASETLKGR 271

Query: 1534 E------------ENSQSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKP--- 1400
            +              S+  +K + +     +    L K                 +    
Sbjct: 272  DLKDKVEAGYKIPRPSEVSRKPVESNAAKYLRGQSLNKSWNGSVDLSFGASSDTEETRGK 331

Query: 1399 -VSLAVQARSNIQRKDGPXXXXXXXSEKQKEHNGFVT----RDPPNAQKKVEKPSSSRRP 1235
             +SLA+QA+ N+Q++ G           QKE +   +    R  PN QK + K  S+   
Sbjct: 332  SISLAIQAKVNVQKR-GQNLSRNRSLVGQKEQSEVSSNQSFRSQPNVQKNLHKKPSTHNA 390

Query: 1234 SEVLRMNNQKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXT 1055
            S  LR NNQKQN    ++    +P  S S+ ++   LS +  +GR               
Sbjct: 391  SGALRQNNQKQNCLVDKEKLPSKPLVSNSQGRKV--LSGDSSSGRHKSSIRSSGNSKIG- 447

Query: 1054 SRKTNFLAADPGKEVSSLRAKTTSKKRLLANGNIQ-SSGGVAQKAMAVKDEKSIKCNVAF 878
            SRK    A D  KEVS   A+   +K+   +GN Q +        ++ K++K ++ N   
Sbjct: 448  SRKLGSEAMDSDKEVSYSNARNYPRKKRSIDGNFQYNKDRTVGDMLSEKNQKPVQSNPIT 507

Query: 877  EVDSKWDGIDKKXXXXXXXXXXXSPIKKHV-GSNSSAAMLEATXXXXXXXXXSARESDLR 701
            + +  W    +K           +P+ + + G+  SA + +                D  
Sbjct: 508  DRNYSWAEDSRKKGMDVVSFTFTAPLTRSLPGTEISAQVAQKNTSLCMDHGGKRLLLDKD 567

Query: 700  NSAVSSSGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNMGSAVN 521
            +  +SS G+NVIGGDALS+LLEQKL+EL+   + S  D               ++    N
Sbjct: 568  SMKLSSLGYNVIGGDALSMLLEQKLRELSYGTKSSSHD--SMKEGSASTASTFDLKPKFN 625

Query: 520  LVESMDNGVCESEREAQDGSDCASTEKLWLR-------ADKSS-KVPECFEGVVDGNNIQ 365
             V SM       +R+ Q       TEKL  R       AD  + ++ + F+GV   N   
Sbjct: 626  AVSSMQR--LNDQRDQQ-----LVTEKLGGRYEADFSFADSPAFRLKQNFQGV---NKTD 675

Query: 364  RYXXXXXXXXXXXXXXXXXXSC--------DSFDVDRSA---SNEGPTGCLSLDSCEGTN 218
             Y                            +S+D   S    S E    C S+ + E   
Sbjct: 676  EYSSSHGEAGLLLSGRHPSPVSVLEPSFSNESYDSSISTDSNSTEASRLCSSVQAQEVHV 735

Query: 217  WRATRKPHVTEGDQEISDTASSLSFGTMSETVSSTLHMADSKDSPNWEVKYIEDILRHTE 38
            + +++K H  E D E+ D+ASS S GT++   ++T++M +   S  WE++YI+  L + E
Sbjct: 736  FSSSKKFHSVEADTELLDSASSTSTGTVARNHAATVYMPEPLRSNEWELEYIKGTLCNVE 795

Query: 37   LLLEDFSLGQA 5
            L+  DFSLG+A
Sbjct: 796  LMFRDFSLGRA 806


>ref|XP_006433512.1| hypothetical protein CICLE_v10000207mg [Citrus clementina]
            gi|557535634|gb|ESR46752.1| hypothetical protein
            CICLE_v10000207mg [Citrus clementina]
          Length = 916

 Score =  206 bits (524), Expect = 3e-50
 Identities = 180/656 (27%), Positives = 291/656 (44%), Gaps = 35/656 (5%)
 Frame = -1

Query: 1870 NPLDLRLQKAQNRPLERFQREVLPPKSAKPISVSHNRLLSPIKSPGFIPPKNXXXXXXXX 1691
            N ++ R  K  NRP+ERFQ E+LPPKSAK IS++H++LLSPIK+PG  P +N        
Sbjct: 153  NTVESRPHKVHNRPIERFQTEMLPPKSAKSISITHHKLLSPIKNPGIAPSRNTAYIVEAA 212

Query: 1690 XXXXEQSPRATSKGNYPSFGSPSVPFRVRDLKEKMESAQRSSQSGVASQ--------KGR 1535
                E SP+AT+KG  PS  S S P R+ D K+KME+  R+S+  + S         KG+
Sbjct: 213  AKIIEASPQATTKGKRPSVVS-SAPLRIWDFKDKMEAKHRASRPQIKSNESVAVKYTKGQ 271

Query: 1534 EEN--------SQSIKKQLNARGKGRMEDGYLYKGXXXXXXXXXXXXXXXXKPVSLAVQA 1379
              N        + ++K  +N   + R  +    KG                   ++AVQA
Sbjct: 272  HHNQSHRETDCTSAVKASVNV--EKRNPENMRKKGKSD----------------TMAVQA 313

Query: 1378 RSNIQRKD--GPXXXXXXXSEKQKEHNGFVT----RDPPNAQKKVEKPSSSRRPSEVLRM 1217
            R N+ R+D           S  QKE +        + P ++Q+  +K + + R + VLR 
Sbjct: 314  RVNVLRRDVSASSSISGRSSMNQKEKSAVKANQFHKSPKDSQRTAQKGTPTNRTNNVLRQ 373

Query: 1216 NNQKQNRASARDDGNFEPSCSQSKEKEESNLSTNYINGRXXXXXXXXXXXXXXTSRKTNF 1037
            NNQKQN    +D  N +      + ++  + S +    R               SR+T  
Sbjct: 374  NNQKQNHILNKDGSNLKACVINQQVRKLKSTSGSIGPNRTVSKAVANSETG---SRRTGL 430

Query: 1036 LAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKSIKCNVAFEVDSKWD 857
               D  KE+SS +AK +S+K+  AN +  S      +    KDE+SIKCN+A E      
Sbjct: 431  TTNDTRKELSSSKAKNSSQKKQSANADSMSVESTDDEMK--KDERSIKCNIAIEGGMTRA 488

Query: 856  GIDKKXXXXXXXXXXXSPIKKHVGSNSSAAMLEATXXXXXXXXXSARESDLRNSAVSSSG 677
              ++K           SPI+    + SS  ++               +  LRN++ SS  
Sbjct: 489  TDNRKTGMDVVSFTFSSPIRSRPDTESSGRVMRTNNCFNIDHFGDNNQLYLRNTSSSSPW 548

Query: 676  FNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXXNMGSAVNLV------ 515
             N+IGG+ALSVLLEQKL ELT +V+ S  +               +     ++V      
Sbjct: 549  LNIIGGNALSVLLEQKLMELTCKVDSSHCNVIREGTSGLAASTLPDSMPTSSMVTAEEGQ 608

Query: 514  ------ESMDNGVCESEREAQDGSDCASTEKLWLRADKSSKVPECFEGVVDGNNIQRYXX 353
                  ++ ++ + ++     + +   S    W ++ +S ++           N + +  
Sbjct: 609  RLQVHLDNSNSDITDNSCSTSNDNSVLSINPKW-QSQQSEEMERQSSSSYYKENGREFDC 667

Query: 352  XXXXXXXXXXXXXXXXSCDSFDVDRSASNEGPTGCLSLDSCEGTNWRATRKPHVTEGDQE 173
                            +C      R+++N+     LS    E T W  T      + + E
Sbjct: 668  EHSSSVASLEHSNTTLNCSDI---RNSTNDCKQVSLS-QEIEPT-WLPTDVSLSMDCETE 722

Query: 172  ISDTASSLSFG-TMSETVSSTLHMADSKDSPNWEVKYIEDILRHTELLLEDFSLGQ 8
            +SD+A+S+S G T  + ++ T  + D  +S NWE +Y+ ++L + EL +  F+LGQ
Sbjct: 723  LSDSATSISVGNTGKKHMTRTFSLIDEIESSNWEFEYLRELLDNAELKINKFALGQ 778

