BLASTX nr result

ID: Paeonia23_contig00004770 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00004770
         (3632 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853...   746   0.0  
ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus c...   559   e-156
ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prun...   550   e-153
ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Popu...   538   e-150
ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Popu...   533   e-148
ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628...   519   e-144
ref|XP_007039222.1| Uncharacterized protein isoform 3 [Theobroma...   511   e-141
ref|XP_007039220.1| Uncharacterized protein isoform 1 [Theobroma...   511   e-141
ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citr...   508   e-140
ref|XP_007039224.1| Uncharacterized protein isoform 5 [Theobroma...   506   e-140
ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301...   493   e-136
ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citr...   491   e-135
ref|XP_007039221.1| Uncharacterized protein isoform 2 [Theobroma...   441   e-120
gb|EXB94712.1| hypothetical protein L484_002599 [Morus notabilis]     439   e-120
ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citr...   406   e-110
ref|XP_007039225.1| Uncharacterized protein isoform 6 [Theobroma...   390   e-105
ref|XP_006441269.1| hypothetical protein CICLE_v10018632mg [Citr...   333   3e-88
ref|XP_007039227.1| Uncharacterized protein isoform 8, partial [...   287   3e-74
ref|XP_006347526.1| PREDICTED: uncharacterized protein LOC102592...   286   5e-74
ref|XP_007039226.1| Uncharacterized protein isoform 7 [Theobroma...   283   5e-73

>ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853355 [Vitis vinifera]
            gi|302143995|emb|CBI23100.3| unnamed protein product
            [Vitis vinifera]
          Length = 1167

 Score =  746 bits (1926), Expect = 0.0
 Identities = 481/1189 (40%), Positives = 666/1189 (56%), Gaps = 99/1189 (8%)
 Frame = +1

Query: 181  TVDRPFFKPNSNPLAHFTEPPYAAPFXXXXXXXXXXXXXXXXXXXXTSRPDYFASPNSEV 360
            TVDRP  KP SNPL +FTE  YAAPF                     SRPDYF++PNS V
Sbjct: 26   TVDRPVSKPLSNPLVNFTESTYAAPFNSSLHNWVHPQSP-------VSRPDYFSNPNSAV 78

Query: 361  D------IMPN---RYLVSQSINSPTTHMPPPNSVATD-------------ATDAYSFPE 474
            D      + P+   RY VSQ +NSP  H+PP + + +               TD +SF +
Sbjct: 79   DSVQATGVPPSNAYRYSVSQPVNSPVVHLPPLSHIVSGIAHLPPLSPIVSAGTDVFSFGQ 138

Query: 475  YLDSVTTSLVEAKPYYPSYVSSAICEDSPLAGLNEPGYDLLSASRVGPLNGSS-VDDYTR 651
              D + TSLVEAKPYYP YV+ AI ++SPL  LNEP YDLLS S    LNGSS +DDYT+
Sbjct: 139  CSDRMKTSLVEAKPYYPPYVAPAIEDNSPLVVLNEPNYDLLSTSHAAHLNGSSSLDDYTQ 198

Query: 652  SLSGLDYSTQWGGFWNGLTDLESGRQTGFDGSFCSKESNVTSSSIYKDYVKQGAATSKDL 831
            S+SGL+Y ++W GFWNGL D+E G++   D S CSKESN   SSIY+ Y+ QG  T++ +
Sbjct: 199  SMSGLEYPSRWCGFWNGLADIEQGKKVELDESLCSKESNFVGSSIYRSYINQGDPTAEGV 258

Query: 832  SACVEASAVSHS--IIVPGRESCIGFSSTKPLDDNFLPQ---NFKSISDESLRTSVGGSS 996
            S   E S +S    + + GR++C+G  S    ++    +   N   +S +  RTS  GS+
Sbjct: 259  SNSEEGSVLSDRKYVDILGRDNCVGSLSPDHFNNKSFYEPKANPMVVSLDFPRTSFLGST 318

Query: 997  SVLPET-HPNSPFLESLTHSWNFQKANSASYEKCFRQHDTCLNDCKSAPKSPPAIVIRPP 1173
            SVLPET HP +P LE +T+SWN++K  SA YEKCFR+ D+C++D  S  KS PAIVIRPP
Sbjct: 319  SVLPETPHPRAPSLEPVTNSWNYRKPQSALYEKCFRKIDSCVDDPVSKAKSSPAIVIRPP 378

Query: 1174 PVGANSSASSTV-PFINLNSIGSDSARRSEDFADNIHCNLKEPRPPLSSERKEVCLDMSP 1350
               ANS +S  V  F + N I +D+   SE+ + +   N++EP  P+ SE +E+  D S 
Sbjct: 379  ---ANSPSSLGVNSFSSRNMICTDN---SENVSGHHLSNMEEPHIPVISEGRELYSDTSQ 432

Query: 1351 --NILGRRDYNNVDTFATKKEEFSNNISVTKDSFDHFFEKRCELQVPHMSNENGYSLAHD 1524
                  R D+ ++++ +TKK E  NN    K++ D+    R ELQ+PH++ E+G+S + +
Sbjct: 433  LNGHWQRNDHLSMESSSTKKHELLNNEMGVKET-DNLLRARSELQIPHLNVEDGFSFSPN 491

Query: 1525 A-EPVNFIENSSKSFDQYNPAVDSPCWKGATANRSSP--LSEAVTPQLFMRKLETCNSLN 1695
            + E VN I+N+S++ D YNPAVDSPCWKG+  +  SP  +SEA++P   M +LE  +  N
Sbjct: 492  SIEAVNSIDNTSETLDHYNPAVDSPCWKGSITSHFSPFEVSEALSPHNLMEQLEALDGFN 551

Query: 1696 LQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGNDSSLSAKSPLVAGFSSTGHGPDDA 1875
            LQG  IFPLN++DAV V S KP+EN  YH+NVCG N    S K P V    S      DA
Sbjct: 552  LQGHHIFPLNSDDAVNVSSLKPNENTEYHKNVCGENGLLPSWKRPSVVNHPSREQRSLDA 611

Query: 1876 VKPGSCSTKSSCAYGVQFSDYILKSRKEDALPDNLKDD---------------------- 1989
             K G    K S   G Q S+ I++ +++ +L ++ K D                      
Sbjct: 612  FKTGPYCQKLSSGDGNQSSNDIIQPKRDHSLLNSSKSDNLELSHTMRQSFEEVKFTSERK 671

Query: 1990 --ADFRMHASGTYVDDASEDGLSHVPFHVTKHVLSSPPSGEDAPSAIAKSCEENSSPKID 2163
              +   +  +G  ++D S DG SH  +H+T+++  SP SG+DA + + K     S+PKID
Sbjct: 672  LSSGVGVEVTGNNINDVSRDGSSHETYHLTENISCSPLSGDDASTKLTKQPASESTPKID 731

Query: 2164 VQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTLQDVIKNLNVCISKTAEQMISTREPLF 2343
            V ML+ T+ +LS L L HCS++A +LKEQD+ TL+ VI N + C++K  +++        
Sbjct: 732  VHMLINTVQDLSVLLLSHCSDNAFSLKEQDHETLKRVIDNFDACLTKKGQKIAE------ 785

Query: 2344 PYQDTSQFIEKSSYLHKGASVDRSQLTKGASVDRSQVTKVASAII-GQLDQQHIHEEKGD 2520
              Q +S F+            +   L K AS       KVA A +  Q   Q  H+ K  
Sbjct: 786  --QGSSHFLG-----------ELPDLNKSASASWPLGKKVADANVEDQFHCQSDHKGKRH 832

Query: 2521 DIVWGVKDEKFQNFASAQDDAEIVKNDKMTQAIKKILVENFHGEEETEPQVLLYKNLWLE 2700
              V G KDEK  +F S  +D + V +D   QAI+KIL +NFH EEET+PQ LLY+NLWLE
Sbjct: 833  CSVSGNKDEKLSDFVSLVNDEDTVNDDSTIQAIRKILDKNFHDEEETDPQALLYRNLWLE 892

Query: 2701 AEAVICSTNYRARFDRMKIEMEKCNSHEAK-------DVSESSTNRISPDVNAIN----- 2844
            AEA +CS +YRARFDRMKIEMEK    + +       DV + S++++S D++ ++     
Sbjct: 893  AEAALCSISYRARFDRMKIEMEKFKLRKTEDLLKNTIDVEKQSSSKVSSDISMVDKFERE 952

Query: 2845 -QLAPEPNGGIQDYPIINTSSHEVDVLARFHILKSRDNDSNSVNNLDAEKPSSSKFSPEL 3021
             Q  P P+  I+D P + T SH  DV+ RFHILK R  +S+S+N+ D  K SS K S ++
Sbjct: 953  AQENPVPDITIEDSPNVTTMSHAADVVDRFHILKRRYENSDSLNSKDVGKQSSCKVSHDM 1012

Query: 3022 DKVDKLVPQPQDIPTPEISVHDSRILSKTSHVDNVEDSVMDRLRILKCRVDNLTPTVNVE 3201
            +  D L P  +D  +P IS                 D VM R RILKCR D   P +N E
Sbjct: 1013 NSDDNLAPAAKDDHSPNIS------------TSTQSDDVMARFRILKCRADKSNP-MNAE 1059

Query: 3202 ---DPEVPDVN-PGCDESW----DENFEAIMG-----HVIEHEKLK------------VK 3306
                PE  D+   G    W    D   +  +G     H+  H K +            VK
Sbjct: 1060 RQQPPEEVDLEFAGKGSHWMFIKDRVEDVTLGPDLQVHIANHTKDRFDSYLDDFDCEIVK 1119

Query: 3307 E-SSRLCDDRMIHSCWTDKLGNQLPAGWCDSSSSSDWEHVLRDELPAQN 3450
            E      DD +I    +++L NQLPAG+ D  SS+DWEHVL++ELP  N
Sbjct: 1120 EFHEHAMDDPVIQLPRSNRLQNQLPAGFSD-GSSADWEHVLKEELPGGN 1167


>ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus communis]
            gi|223539484|gb|EEF41073.1| hypothetical protein
            RCOM_0756330 [Ricinus communis]
          Length = 1125

 Score =  559 bits (1441), Expect = e-156
 Identities = 420/1097 (38%), Positives = 581/1097 (52%), Gaps = 61/1097 (5%)
 Frame = +1

Query: 331  DYFASPNSEVDIMPNRYLVSQSINSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVEA 510
            DYF     E+D        S S N  +    P  SV+TD+        Y  S  T L EA
Sbjct: 86   DYFVIQKPELD--------SNSYNRYSASSNPHVSVSTDSV------LYGQSGVTGL-EA 130

Query: 511  KPYYPS-YVSSAICEDSPLAGL-NEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQW 684
            KPYYPS Y+S AI  D  L G+ +   Y LLSASRV    GSS +DYT+SLSG     QW
Sbjct: 131  KPYYPSTYISPAIGNDCSLGGVPHHSDYGLLSASRVSTSIGSS-EDYTQSLSG-----QW 184

Query: 685  GGFWNGLTDLESGRQTGFDGSFCSKESNVTSSSIYKDYVKQGAAT-SKDLSACVEASAVS 861
             G W+GLTD     Q   DGSFCSKE+          Y+ Q A   + + ++  EAS  +
Sbjct: 185  SGMWDGLTDWLQSEQVQLDGSFCSKET----------YMNQVAGLYASESTSKYEASQSA 234

Query: 862  HSIIVPGRESCIGFSSTKPLD-DNFLPQNFKSISDESLRTSVGGSSSVLPETHPNSPFLE 1038
             ++   GRE+ I  +    LD  +FL +N K    +    S   S+ ++PET    P  +
Sbjct: 235  DTV---GRETQIESAGVGKLDYKSFLGENRKFTPSDYPTPSSLASTLLVPETCSQVPSKK 291

Query: 1039 SLTHSWNFQKANSASYEKCFRQHDTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFI 1218
            ++ +SWN     SAS EKC R+HD   +D  +   S PA+VI+PP     S         
Sbjct: 292  AV-NSWNHHMPYSASNEKCLRRHDATSSDIATILYSSPAVVIKPPEHNKGS--------- 341

Query: 1219 NLNSIGSDSARRSEDFADNIHCNLKEPRPPLSSERKEVCLDMSPNI--LGRRDYNNVDTF 1392
             L ++ + S   ++DF+ N    + EPRP ++S +  VC D S     LG+ D    +  
Sbjct: 342  -LKNVNTSSDGDNKDFSCNSPSVVVEPRPFITS-KGSVCYDASQVSFHLGKTDQVIANFS 399

Query: 1393 ATKKEEFSNNISVTKDSFDHFFEKRCELQVPHMSNENGYSLAHDAEPVNFIENSSKSFDQ 1572
            + K EE S+N + + D   HF  ++  +QVP  ++  G SL    E ++  +N ++S D 
Sbjct: 400  SAKNEELSSNQNASMDVSGHFAGEKPVIQVP-CTSLGGISLVDKNEAIDPAKNHTESLDH 458

Query: 1573 YNPAVDSPCWKGATANRSSPL--SEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRV 1746
            YNPAVDSPCWKGA  +  S L  SEAVTPQ  M+ LE C+  N QG Q F ++++DAV+V
Sbjct: 459  YNPAVDSPCWKGAPVSNFSQLEVSEAVTPQN-MKNLEACSGSNHQGYQTFSVSSDDAVKV 517

Query: 1747 FSPKPSENLWYHENVCGGNDSSLSAKSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQ 1926
               K SE     +     N S+ S K PL         G D  V  G+  TK S  + VQ
Sbjct: 518  SPEKTSEKSIQQKGWSLENYSASSMKRPLADNMLHR-EGIDHFVNFGANCTKPSLFHQVQ 576

Query: 1927 FSDYILKSRKED----ALPDNLKDDADFRMHAS-------------GTYVDDASEDGLSH 2055
             SD  L ++  D     LP N K   +     +             G  ++D  ++  SH
Sbjct: 577  ISDDALPNKSFDDSNGKLPQNEKQSCESGKWTTESNSAPVISVADVGMNMNDDPDECSSH 636

Query: 2056 VPFHVTKHVLSSPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDAC 2235
            VPFH  +HVLSSPPS + A   + K+C   S+ K  ++ ++ TM NLSEL + H SND C
Sbjct: 637  VPFHAVEHVLSSPPSADSASIKLTKACGGVSTQKTYIRTVIDTMQNLSELLIFHLSNDLC 696

Query: 2236 TLKEQDNHTLQDVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRS 2415
             LKE D++ L+ +I NL +C+ K  E+M ST+E + P +D +Q   KSS L KG +    
Sbjct: 697  DLKEDDSNALKGMISNLELCMLKNVERMTSTQESIIPERDGAQLSGKSSKLQKGTN---- 752

Query: 2416 QLTKGASVDRSQVTKVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEIVK 2595
                G  + RS   +   ++      QH+ +E   +I  G  DE   ++ S +  A+++K
Sbjct: 753  --GNGFLISRSDPLEFQYSV----KYQHVQDE--HNISSGKNDETLSSYVSVRAAADMLK 804

Query: 2596 NDKMTQAIKKILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKCN 2775
             DKMTQAIK  L ENFHGEEETEPQVLLYKNLWLEAEA +C  +  ARF+R+K EMEKC+
Sbjct: 805  RDKMTQAIKNALTENFHGEEETEPQVLLYKNLWLEAEASLCYASCMARFNRIKSEMEKCD 864

Query: 2776 SHEAKDVSES-------STNRISPDVNAINQLA------PEPNGGIQDYPIINTSSHEVD 2916
            S +A    E+       S + I  D    N LA      P P+  I +  I+ TSSH  D
Sbjct: 865  SEKANGSPENCMVEEKLSKSNIRSDPCTGNVLASNTKGSPLPDTSIPESSILCTSSHADD 924

Query: 2917 VLARFHILKSRDNDSNSVN--NLD-----AEKPSSSKFSPELDKVDKLVPQPQDIPTPEI 3075
            V AR+HILK R + +N+VN  +LD     A+K SSS+FSP  + V+K V + +D   P+I
Sbjct: 925  VTARYHILKYRVDSTNAVNTSSLDKMLGSADKLSSSQFSPCPNNVEKGVCEEKDGQKPDI 984

Query: 3076 SVHDSRILSKTSHVDNVEDSVMDRLRILKCRVDNL-------TPTVNVEDPEVPDVNP-G 3231
            S+ DS + + TSH+++VE SVM R  ILKCR DN        T +V++    +P   P G
Sbjct: 985  SIQDSLVSNTTSHLNDVEASVMARFHILKCRDDNFSMHKEESTESVDLGYVGLPRHWPTG 1044

Query: 3232 CDESWDENFEAIMGHVIEH-------EKLKVKESSRLC-DDRMIHSCWTDKLGNQLPAGW 3387
             DE+ D   +  M   ++H       +KL VKE      DD +I S   ++LG+Q  A +
Sbjct: 1045 TDETEDRVLDVNMRTHLQHHDCNFTEDKLPVKEFHLFVKDDPVIGSRDINRLGDQSHASF 1104

Query: 3388 CDSSSSSDWEHVLRDEL 3438
            CD   SSDWEHVL +EL
Sbjct: 1105 CD--GSSDWEHVLLEEL 1119


>ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prunus persica]
            gi|462417047|gb|EMJ21784.1| hypothetical protein
            PRUPE_ppa000352mg [Prunus persica]
          Length = 1254

 Score =  550 bits (1417), Expect = e-153
 Identities = 400/1063 (37%), Positives = 557/1063 (52%), Gaps = 48/1063 (4%)
 Frame = +1

Query: 181  TVDRPFFKPNSNPLAHFTEPPYAAPFXXXXXXXXXXXXXXXXXXXXTSRPDYFASPNSEV 360
            TVDR   KP S+PL   TE PY AP                      +  ++FA+P  E 
Sbjct: 30   TVDRSVPKPISSPLVDVTETPYVAPLNSSSHNWLPSHPPI-------TGSNFFANPTPEF 82

Query: 361  DIMPN----RYLVSQSINSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVEAKPYYPS 528
            + +P+    RY  SQ ++ P T +PP N++   +++A+++ + LD+V TS VEAKPYYPS
Sbjct: 83   NSLPSSNAYRYAGSQIVDPPNTTLPPLNTITPASSNAFTYDQSLDAVATSFVEAKPYYPS 142

Query: 529  YVSSAICEDSPLAGLNEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLT 708
            Y+S  I  DSPL   ++P YD LS +   PL+G S  DYT+    L Y+ QWGG WNGL+
Sbjct: 143  YLSPTIHGDSPLVVPDQPSYDWLSTTHFAPLDGCSRKDYTQRPPDLKYTAQWGGLWNGLS 202

Query: 709  DLESGRQTGFDGSFCSKESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRE 888
            + E G+Q  FDGSFCSK+++V+ S +YK+++ Q   +S  L++  EA   SH I   G E
Sbjct: 203  EWEQGKQGDFDGSFCSKKTDVSGSFLYKNFMNQEPHSSNSLNSFEEA---SHGINTLGWE 259

Query: 889  SCIGFSSTKPLDDNFLPQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQK 1068
               G S    L D  L       +      SV GS SV+PE H  +P  + +T + N + 
Sbjct: 260  K-PGGSGNAHLGDKSLVGKNSKFTPSDFSKSVMGSLSVVPEPHLKAPSSQCVTKTSNCKT 318

Query: 1069 ANSASYEKCFRQHDTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINLNSIGSDSA 1248
              S S E   +Q D  L+   S  +S PA   R P +G   S   T  F  LN I SD+A
Sbjct: 319  PYSVSSET--QQLDASLDYITSISESSPAFATRTPALGTKLSEPGTGLFRRLNFI-SDAA 375

Query: 1249 RRSEDFADNIHCNLKEPRPPLSSERKEVCLDMSP--NILGRRDYNNVDTFATKKEEFSNN 1422
                D  D     ++E   P  SE K V  D S     LG +D  + ++ + + EE SNN
Sbjct: 376  --DTDHGDYYSSGVQESHLPQISEGK-VLFDSSQLGFHLGAKDCFSAESSSARNEELSNN 432

Query: 1423 IS-VTKDSFDHFFEKRCELQVPHMSNENGYSLAHDA-EPVNFIENSSKSFDQYNPAVDSP 1596
             + + KD++D  F+ +  LQ  H+  + G+ +A    E +N   +SS + D  NP VDSP
Sbjct: 433  RNIINKDAWDKVFKAKPGLQNSHVGLD-GFKMAFKTNETINSFLSSSDNVDPNNPGVDSP 491

Query: 1597 CWKGATANRSSPL--SEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSEN 1770
            CWKG   +  SP   SE   P+  ++KLE C+ LN+  P +FPL+  +   V S KP +N
Sbjct: 492  CWKGVPGSCFSPFGASEDGVPEQ-IKKLEDCSGLNIHMP-MFPLSAGE--NVSSQKPIKN 547

Query: 1771 -LWYHENVCGGNDSSLSAKSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQF------ 1929
             + Y+E     N      K   VA  +   H  D++VK  +   ++S   G Q       
Sbjct: 548  AVEYNEFGWLENGLRPPLKRYSVANSAFGEHKWDNSVKT-TYDAETSHDRGPQSYRDGLH 606

Query: 1930 --------------SDYILKSRKEDALPDNLKDD----ADFRMHASGTYVDDASEDGLSH 2055
                          S  + +   ED L   +K      AD +++A     +D  E G SH
Sbjct: 607  QSGNGDKSLGLLDDSHAMQQGHGEDGLATEVKQTWSCVADVKLNA-----NDTMEYGSSH 661

Query: 2056 VPFHVTKHVLSSPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDAC 2235
            VP HV ++VL S  S EDA + ++KS  E S  K+DVQMLV T+ NLSEL L +CSN  C
Sbjct: 662  VPSHVVENVLCS--SAEDAATKLSKSNGEESMLKVDVQMLVDTLKNLSELLLTNCSNGLC 719

Query: 2236 TLKEQDNHTLQDVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRS 2415
             LK+ D  TL+ VI NL++CISK  E+    +E     Q+TSQ   + S  HK  S DR 
Sbjct: 720  QLKKTDIATLKAVINNLHICISKNVEKWSPMQESPTFQQNTSQCYAELSEHHKVLSADRP 779

Query: 2416 QLTKGASVDRSQVTKVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEIVK 2595
                      +    +   +IG +   H+                       + D ++VK
Sbjct: 780  L--------SASAPDIQDQVIGSI---HV-----------------------KSDIDVVK 805

Query: 2596 NDKMTQAIKKILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKCN 2775
             DKMTQAIK+IL ENFH  EET+PQVLLYKNLWLEAEAV+CS NY+ARF+R+KIEM+KC 
Sbjct: 806  EDKMTQAIKEILSENFH-SEETDPQVLLYKNLWLEAEAVLCSINYKARFNRVKIEMDKCK 864

Query: 2776 SHEAKDVSE-------SSTNRISPDVNAINQLAPE----PNGGIQDYPIINTSSHEVDVL 2922
            +  +KDV E        S + +SPD N +N L PE    P   + D PI+   S E +VL
Sbjct: 865  AENSKDVFEYTADMMKQSKSEVSPDSNPVNPLTPEAQGCPTSNVPDLPIL---SQEDEVL 921

Query: 2923 ARFHILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHDSRILS 3102
            ARF IL+ R  ++NS+N  +A +  SSK SPE  KV+++ P+    P+P IS+ DS I S
Sbjct: 922  ARFDILRGRVENTNSINASNAAE-LSSKASPEPSKVERIAPEANGTPSPGISIQDSSISS 980

Query: 3103 KTSHVDNVEDSVMDRLRILKCRVD--NLTPTVNVEDPEVPDVN 3225
                 D+ E SVM R  IL+ RV+       VN+E+P  P V+
Sbjct: 981  TIGVTDDYEASVMARFHILRDRVEKSKFISAVNMEEPSSPKVS 1023



 Score = 93.2 bits (230), Expect = 8e-16
 Identities = 80/271 (29%), Positives = 127/271 (46%), Gaps = 32/271 (11%)
 Frame = +1

Query: 2734 ARFDRMKIEMEKCNSHEAKDVSESSTNRISPDVNAINQLAPEPNGG-------IQDYPII 2892
            ARF  ++  +EK     A ++ E S+ ++S +    + + P+ N G        QD P  
Sbjct: 994  ARFHILRDRVEKSKFISAVNMEEPSSPKVSLEPKT-DVIVPDRNDGSASEFNLFQDSPPS 1052

Query: 2893 NTSSH----EVDVLARFHILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDI 3060
             T+SH    E  V++R HILKSR ++ + ++    + P          K++ + P   D 
Sbjct: 1053 ITTSHANDCEASVMSRLHILKSRVDNCSDMHTEGQQLPEP--------KIEVIAPDTSDS 1104

Query: 3061 PTPEISVHDSRILSKTSHVDNVEDSVMDRLRILKCRVDNLTPTVNVEDPEVPDV----NP 3228
              PE S+ DS +   TS  ++ E SVM RL ILK RVDN +  ++ E  ++P++    N 
Sbjct: 1105 LMPEFSIQDSPVSRATSQANDCEASVMSRLHILKSRVDN-SSYMHREGKQLPEIGGLGNA 1163

Query: 3229 GCDESW---DENFEAIMGHVIEHEKLKVKESSR--------------LCDDRMIHSCWTD 3357
            G    W    +  E     + E   L+  ++                + DD +       
Sbjct: 1164 GKRHPWPIISKRSEGGSSDIKEQPILRSFKADNSEGKLDTAKEFHLFVEDDPLTQYFRIH 1223

Query: 3358 KLGNQLPAGWCDSSSSSDWEHVLRDELPAQN 3450
            K  NQLPAG  D +SSSDWEHV+++E+  QN
Sbjct: 1224 KPANQLPAGGHD-NSSSDWEHVMKEEIWGQN 1253


>ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Populus trichocarpa]
            gi|550326088|gb|EEE96055.2| hypothetical protein
            POPTR_0012s00720g [Populus trichocarpa]
          Length = 1227

 Score =  538 bits (1386), Expect = e-150
 Identities = 415/1102 (37%), Positives = 552/1102 (50%), Gaps = 46/1102 (4%)
 Frame = +1

Query: 181  TVDRPFFKPNSNPLAHFTEPPYAAPFXXXXXXXXXXXXXXXXXXXXTSRPDYFASPNSEV 360
            TVDR   KP    L   TEP Y                         SRPD F  PN E 
Sbjct: 31   TVDRSVAKP----LLDLTEPTYPVSLNPSLHNWATSNSHIP-----NSRPDLFPLPNLEF 81

Query: 361  DIMPNRYLVSQSINSPTTHMPPPNS-VATDATDAYSFPEYLDSVTTSLVEAKPYYPS-YV 534
            + +P+  +   S  SPT  +   N  +   +TDA  + +       SLVEA PYYPS YV
Sbjct: 82   NSIPSPNVFGYS--SPTPQVTSKNHPLVLASTDAVLYGQS----NPSLVEAVPYYPSSYV 135

Query: 535  SSAICEDSPLAGLNEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDL 714
            S AI  D  L   ++ GY+LLS S VG  NGSS DDYT+S  GL+++TQW G W G+TD 
Sbjct: 136  SPAIGSDGHLKIPHQSGYELLSNSYVGTSNGSSHDDYTQSSLGLEHATQWSGLWEGVTDW 195

Query: 715  ESGRQTGFDGSFCSKESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESC 894
               ++   DG FC KE+          ++ QG +  KD+S C E S     I + GR+  
Sbjct: 196  NQSKKLQLDGGFCEKEN----------FINQGFSAFKDVSKCEETSL---GIDMVGRQMH 242

Query: 895  IGFSSTKPLDDN-FLPQNFKSISDESLRTSVGGSSSVLPETHPNS-PFLES--LTHSWNF 1062
             G +ST  LD   FL +  KS+            S + P T P + P + S  + +S N 
Sbjct: 243  TGSASTGQLDYKAFLVEKPKSMPTTP-------PSLIFPPTAPQAYPQVSSSNVVNSPNN 295

Query: 1063 QKANSASYEKCFRQHDTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINLNSIGSD 1242
            Q  +  SY K  R+ D   ND     K  PA+VIRPP             F N+N+ G+D
Sbjct: 296  QMRHVTSYGKSSRKRDASSNDRMPMMKPSPAVVIRPP-------GQDRYSFKNINA-GTD 347

Query: 1243 SARRSEDFADNIHCNLKEPRPPLSSERKEVCLDMSP-NI-LGRRDYNNVDTFATKKEEFS 1416
               +  DFA N     +EP P +SS+ K VC D S  N  L + D +  +  +   EE  
Sbjct: 348  GDEK--DFAGNNTSFAQEPNPFISSKGK-VCYDSSQVNFHLKQNDDSFAEVPSKNHEELL 404

Query: 1417 NNISVTKDSFDHFFEKRCELQVPHMSNENGYSLAHDA-EPVNFIENSSKSFDQYNPAVDS 1593
            +N +++ D  D  F ++ E +VP   N + ++LA D  E    +E +S+S D Y PAVDS
Sbjct: 405  SNKNISIDFLDKLFREKMENRVP-CKNLDFFNLAMDGHEAAGSVEITSESLDHYFPAVDS 463

Query: 1594 PCWKGATANRSSPL--SEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSE 1767
            PCWKGA  +  S    SE V PQ    K+E CN LNLQGPQI P  TNDAV+    K S 
Sbjct: 464  PCWKGAPVSLPSAFEGSEVVNPQ---NKVEACNGLNLQGPQISPSTTNDAVKDCPEKQSN 520

Query: 1768 NLWYHENVCGGNDSSLSAKSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILK 1947
                  N    +  + S K PLVA       G DDAVK G C  KSS     Q SD I +
Sbjct: 521  ISMTFNNESLEHRPASSFKRPLVANVLFR-EGIDDAVKYGPCQRKSSYCNEAQISDVIDE 579

Query: 1948 SRKEDALPDNLKDDADFRMHASGTY--------------VDDASEDGLSHVPFHVTKHVL 2085
             RKE  LPD        +    G +              ++D  +D  SHVP+H  +HVL
Sbjct: 580  PRKESILPDFKPVHTKQKSLEEGEWPSKKNSDVAGVRRKINDNPDDCSSHVPYHAIEHVL 639

Query: 2086 SSPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTL 2265
             SPPS E AP+   +S    SS K+  + LV TMHNLSEL L + SND C LK++D   L
Sbjct: 640  CSPPSSEHAPAQHTQSQVGESSSKMHARTLVDTMHNLSELLLFYSSNDTCELKDEDFDVL 699

Query: 2266 QDVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASVDR 2445
             DVI NL++ ISK +E+  ST+E L P + TSQ   K S L+K                 
Sbjct: 700  NDVINNLDIFISKNSERKNSTQESLIPRRATSQSPGKLSELYK----------------- 742

Query: 2446 SQVTKVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEIVKNDKMTQAIKK 2625
                       GQL+ QH  +EK   IV   + EK  NF S +   + VK+D +TQAIKK
Sbjct: 743  -----------GQLEFQHFEDEKECKIVSDERKEKLSNFVSMRGATDTVKDDNVTQAIKK 791

Query: 2626 ILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKCNSHEAKDVSES 2805
            +L +NF  +EE+E Q+LLYKNLWLEAEA +C  N   RF+R+KIE+EK +S +  + S +
Sbjct: 792  VLAQNFPIKEESESQILLYKNLWLEAEASLCVVNCMDRFNRLKIEIEKGSSQKVNEFSSA 851

Query: 2806 S----------TNRISPDVNAINQLAPEPNG----GIQDYPIINTSSHEVDVLARFHILK 2943
            +           N + P V++ + L  E  G     + D  I++ +SH  DV+ARFHI+K
Sbjct: 852  APVVPENSMIMENLLGPKVSS-DILPAEDEGSPVHNVPDSSILSRNSHSDDVMARFHIIK 910

Query: 2944 SRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHDSRILSKTSHVDN 3123
            SR +DSNS+N   A   SS K SP+L+KVDK     +D     IS  DS I   +SH DN
Sbjct: 911  SRVDDSNSLNT-SAMDLSSPKVSPDLNKVDKFAHDTKDSSKSHISFQDS-IRGASSHADN 968

Query: 3124 VEDSVMDRLRILKCRVDNLTPTVNVEDPEV-------PDVNPGCDESWDENFEAIMGHVI 3282
                VMDR  ILKCRV+N + +VN     +       PD N   D+   +  ++IM + I
Sbjct: 969  ----VMDRFHILKCRVEN-SSSVNTATGGILASSMVSPDQNQ-VDKLAHDTKDSIMSYTI 1022

Query: 3283 EHEKLKVKESSRLCDDRMIHSC 3348
            +   +  + S    DD M   C
Sbjct: 1023 QDSPMSGRSSH--ADDVMTRFC 1042



 Score =  114 bits (286), Expect = 3e-22
 Identities = 90/265 (33%), Positives = 134/265 (50%), Gaps = 24/265 (9%)
 Frame = +1

Query: 2725 NYRARFDRMKIEMEKCNSHEAKDVSESSTNRISPDVNAINQLAPEPNGGI-----QDYPI 2889
            N   RF  +K  +E  +S         +++ +SPD N +++LA +    I     QD P+
Sbjct: 968  NVMDRFHILKCRVENSSSVNTATGGILASSMVSPDQNQVDKLAHDTKDSIMSYTIQDSPM 1027

Query: 2890 INTSSHEVDVLARFHILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTP 3069
               SSH  DV+ RF IL  RD++SNSV     EK SSSK S +L+KV KL    +D    
Sbjct: 1028 SGRSSHADDVMTRFCILNGRDDNSNSVTISAVEKLSSSKVSSDLNKVSKLTDDTKDSIKA 1087

Query: 3070 EISVHDSRILSKTSHVDNVEDSVMDRLRILKCRVDNLTPTVNVEDPEVPDVNPGC----- 3234
            +++  DS + S +S  ++VE SV     ILK R D  + ++++E+ +   ++ G      
Sbjct: 1088 DVTTQDSSMSSASSQAEDVEASV-----ILKHR-DGNSSSLDMEEHQRVSIDNGYMDLIR 1141

Query: 3235 ----------DESWDENFEAIMGHV---IEHEKLKVKESSR-LCDDRMIHSCWTDKLGNQ 3372
                      D + D N E ++ +       +K  VKE    + DD    S  TD+ G+Q
Sbjct: 1142 LARMNKDGTKDRTLDVNMEPLIPNFRADSTEDKPTVKEFRLFINDDVETQSRLTDRFGDQ 1201

Query: 3373 LPAGWCDSSSSSDWEHVLRDELPAQ 3447
              AGW D S SSDWEHVL++EL  Q
Sbjct: 1202 SHAGWYD-SCSSDWEHVLKEELAGQ 1225


>ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Populus trichocarpa]
            gi|550321678|gb|EEF06077.2| hypothetical protein
            POPTR_0015s00600g [Populus trichocarpa]
          Length = 1236

 Score =  533 bits (1373), Expect = e-148
 Identities = 376/981 (38%), Positives = 513/981 (52%), Gaps = 29/981 (2%)
 Frame = +1

Query: 322  SRPDYFASPNSEVDIMPNRYLVSQSINSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSL 501
            SRPD F  PN E D +P+      S  SPT      + + + +TDA  + +       S+
Sbjct: 69   SRPDLFPIPNLEFDSVPSPPAFGYS--SPTQMPSMSHPLVSASTDAVLYVQG----NPSI 122

Query: 502  VEAKPYYPS-YVSSAICEDSPLAGLNEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYST 678
            VEA+PYYPS YVS AI  D  L   N+ GY+LLS S VG  NGSS DDY++SL  L++  
Sbjct: 123  VEAEPYYPSSYVSPAIASDGSLKIPNQSGYELLSTSHVGTSNGSSRDDYSQSLVVLEHPA 182

Query: 679  QWGGFWNGLTDLESGRQTGFDGSFCSKESNVTSSSIYKDYVKQGAATSKDLSACVEASAV 858
            QW G W G+TD    ++   DG F +KE+          ++ QG +  KD+S C E S  
Sbjct: 183  QWSGLWEGVTDWHQSKKMQLDGGFSAKEN----------FINQGFSAFKDISKCEETSL- 231

Query: 859  SHSIIVPGRESCIGFSSTKPLDDN-FLPQNFKSISDESLRTSVGGSSSVLPETHPNSPFL 1035
               I V GR++    +ST  +D   FL +  K +       S     SV P+ +P  P  
Sbjct: 232  --GINVVGRQTHTESASTGQMDYKAFLGEKPKFMPAGYSTPSPLVFPSVAPQAYPQVPSS 289

Query: 1036 ESLTHSWNFQKANSASYEKCFRQHDTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPF 1215
              +    N Q  +   Y K  R+ D   ND     K  P +V+R P          T  F
Sbjct: 290  NVVNSPIN-QMPDVILYGKSSRKRDASPNDSMPVTKPSPVVVVRSP-------GQDTYSF 341

Query: 1216 INLNSIGSDSARRSEDFADNIHCNLKEPRPPLSSERKEVCLDMSPNILGRRDYNNVDTFA 1395
             N+N+ G D      D   N   +++EP P +SSE K        N   +++ + +   +
Sbjct: 342  KNMNT-GCDG-----DEKGNNSSSVQEPNPFISSEGKVFYDSSQINFHLKQNDDYLAEIS 395

Query: 1396 TKKEEFSNNISVTKDSFDHFFEKRCELQVPHMSNENGYSLAHDA-EPVNFIENSSKSFDQ 1572
            +K  E  +N +++ D FD  F+ + + +V    N + ++LA D  E +  +EN+S+S D 
Sbjct: 396  SKNNELPSNKNISVDFFDQLFKAKMDNKVLRR-NLDFFNLAMDGHEAIGSVENTSESLDH 454

Query: 1573 YNPAVDSPCWKGATANRSSP--LSEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRV 1746
            YNPAVDSPCWKGA  +  S   +SE V P L  +K+E CN L+ QGPQIFP  TNDAV+ 
Sbjct: 455  YNPAVDSPCWKGAPVSHLSAFEISEVVDP-LIPKKVEACNGLSPQGPQIFPSATNDAVKA 513

Query: 1747 FSPKPSENLWYHENVCGGNDSSLSA-KSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGV 1923
              P+   N+    N        +S  K PL A         DDA K G      S  +  
Sbjct: 514  -CPEKQSNISVPLNHESLEHQQVSLFKRPLDAKVLFREE-IDDAGKYGPYQRIPSYCHEA 571

Query: 1924 QFSDYIL-KSRKEDALPDNLKDDADFRMHASGTY--------------VDDASEDGLSHV 2058
            Q SD I  ++RKE  L D      + R    G +              ++D  +D  SHV
Sbjct: 572  QISDVIDDETRKESILSDFNSLHTEQRSLEDGEWPSKKNSYVADVRRKINDDPDDCSSHV 631

Query: 2059 PFHVTKHVLSSPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACT 2238
            PFH  + VL SPPS E AP+   +S  E S  K+  + LV TMHNL+EL L + SND C 
Sbjct: 632  PFHAIEQVLCSPPSSEHAPAQHTQSQGEESLSKMHARTLVDTMHNLAELLLFYSSNDTCE 691

Query: 2239 LKEQDNHTLQDVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQ 2418
            LK++D   L+DVI NL++CISK  E+ IST+E L P Q TSQF  K S L+K        
Sbjct: 692  LKDEDFDVLKDVINNLDICISKNLERKISTQESLIPQQATSQFHGKLSDLYK-------- 743

Query: 2419 LTKGASVDRSQVTKVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEIVKN 2598
                                GQL+ QH  +E+   I    + EK  N+AS +  A+ VK+
Sbjct: 744  --------------------GQLEFQHFEDEEEHKIASDKRKEKLSNWASTRCAADTVKD 783

Query: 2599 DKMTQAIKKILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKCNS 2778
            D MTQAIKK+L +NF  EEE+E Q+LLY+NLWLEAEA +CS NY ARF+RMKIEMEK +S
Sbjct: 784  DNMTQAIKKVLAKNFPIEEESESQILLYRNLWLEAEASLCSVNYMARFNRMKIEMEKGHS 843

Query: 2779 HEAKDVSESSTNRISPDVNAINQLAPEPNGGIQDYPIINTS-----SHEVDVLARFHILK 2943
             +A + S    N   P V++    A +    +QD   +++S     SH  DV+ARFHILK
Sbjct: 844  QKANEKSMVLENLSRPKVSSDILPADDKGSPVQDVSFLDSSILSRNSHSDDVMARFHILK 903

Query: 2944 SRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHDSRILSKTSHVDN 3123
            SR +DSNS++    EK SSSK SP+L+ VDKL    +D   P +S+ DS +   +S+ D+
Sbjct: 904  SRVDDSNSMSTSAVEKLSSSKVSPDLNLVDKLACDTKDSTKPNVSIQDSHMSGTSSNADD 963

Query: 3124 VE---DSVMDRLRILKCRVDN 3177
            V    D V+ R  ILKCRVDN
Sbjct: 964  VSSHADDVIARFHILKCRVDN 984



 Score =  127 bits (318), Expect = 5e-26
 Identities = 87/263 (33%), Positives = 139/263 (52%), Gaps = 25/263 (9%)
 Frame = +1

Query: 2734 ARFDRMKIEMEKCNSHEAKDVSESSTNRISPDVNAINQL------APEPNGGIQDYPIIN 2895
            ARF  +K  ++  +S     + + S++++SPD+N ++++      + +P+  IQD P+  
Sbjct: 973  ARFHILKCRVDNSSSGNTSAMEKLSSSKVSPDLNKVDKMVYDTKDSTKPHITIQDSPMAG 1032

Query: 2896 TSSHEVDVLARFHILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEI 3075
             SSH  DV+ARF  L+ R ++ NSVN    EK  SSK S  L  V KL  + +D   P+I
Sbjct: 1033 RSSHADDVMARFRTLEGRVDNCNSVNISAMEKLPSSKVSSNLSNVGKLTVEAKDSTKPDI 1092

Query: 3076 SVHDSRILSKTSHVDNVEDSVMDRLRILKCRVDNLTPTVNVEDPEVPDVNPGC------- 3234
            +  DS + S +SH +++E ++M RL ILK R D  + ++ +E+ +   ++ G        
Sbjct: 1093 TKQDSPLPSTSSHAEDIEAAIMARLLILKHR-DGCSSSLEMEEHQPESIDNGYTSLRRDV 1151

Query: 3235 --------DESWDENFEAIMGHV---IEHEKLKVKESSRLC-DDRMIHSCWTDKLGNQLP 3378
                    D   D N E ++ +       +K  VKE      DD    S  T++ G+Q  
Sbjct: 1152 PMGKGGLKDSILDVNMEPVIRNYPADSAEDKSTVKEFRLFVNDDAKTQSSLTNRFGDQPH 1211

Query: 3379 AGWCDSSSSSDWEHVLRDELPAQ 3447
            AGW D S SSDWEHVL++E+  Q
Sbjct: 1212 AGWYD-SCSSDWEHVLKEEIVGQ 1233


>ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628429 [Citrus sinensis]
          Length = 1065

 Score =  519 bits (1336), Expect = e-144
 Identities = 393/1092 (35%), Positives = 552/1092 (50%), Gaps = 73/1092 (6%)
 Frame = +1

Query: 394  SINSPTT-HMPPP-NSVATDATDAYSFPEYLDSV-TTSLVEAKPYYPSYVSSAICEDSPL 564
            S N P+T H+PPP N +   +  ++ + +  D++ + +LVEA PYYPSYVS         
Sbjct: 67   SFNPPSTAHIPPPENPIPITSASSFLYGQSSDAIPSANLVEANPYYPSYVSPT------- 119

Query: 565  AGLNEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDG 744
                                  + DDY +SLS L         W+   + E GR+     
Sbjct: 120  --------------------KYTYDDYAQSLSSL---------WDASREWEFGRKLELGE 150

Query: 745  SFCSKESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLD 924
            SFC+KE NV   SIY+DY  QGA +SK L+   + +   +++ + G E   G  + + LD
Sbjct: 151  SFCAKEMNVPDLSIYQDYADQGAHSSKGLNTFEQKN---NNLDMLGSEQHQGSINREQLD 207

Query: 925  -DNFLPQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFR 1101
              +F  Q  + +  E  R SV GS+S  PET+  + F +    SW+ Q    AS EK  +
Sbjct: 208  YKSFTGQISEFMPVEYSRKSVHGSTSFFPETYSLTSFEQG--RSWSHQTPYGASCEKGAK 265

Query: 1102 QHDTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINL-NSIGSDSARRSEDFADNI 1278
            QH    ND  S  KS P  VI+   V ++ S  ST  F NL NS G+ ++       DN+
Sbjct: 266  QHGISPNDISSVKKSSPVHVIKSQAVCSSLSPPSTGSFNNLENSSGAIASN------DNL 319

Query: 1279 HCNLKEPRPPLSSERKEVCLDMSPNI--LGRRDYNNVDTFATKKEEFSNNISVTKDSFDH 1452
              N+KE  P  SSE K V  D       L R  +        KKE+ S+N+SV KD    
Sbjct: 320  S-NMKEFYPLHSSEGK-VHFDAGQVSFHLERGSHIFPKLPLEKKEKLSSNVSVIKDPL-- 375

Query: 1453 FFEKRCELQVPHMSNENGYSLAHDAEPVNFIENSSKSFDQYNPAVDSPCWKGATANRSSP 1632
              +++  LQ+P +   +   +  +   +N  E SS+S D YNPAVDSPCWKGA    S  
Sbjct: 376  --KEKPGLQIPDIGPGSVSLMLANNGAINCSEGSSESLDHYNPAVDSPCWKGAPDYHSPV 433

Query: 1633 LSEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGNDSS 1812
             S        + K+E C+  N  GP      T+++ +V   KPS+  +Y E+    ND  
Sbjct: 434  ESSGPVTLQHINKIEACSGSNSFGP------TDNSGKVSPQKPSDYSFYQEHGYLENDPE 487

Query: 1813 LSAKSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPDNLKDDA 1992
             S K    A      HG D  +K GS   KSSC  GVQFSDYI K R++    +N  D+ 
Sbjct: 488  SSPKRSSRANLLFEEHGYDHDLKTGSYQMKSSCGLGVQFSDYIDKPRQDYVHANNSADEF 547

Query: 1993 DFRMHASGTY-------------------------VDDASEDGLSHVPFHVTKHVLSSPP 2097
             FR      Y                         ++  SE   SHVP H T+HVLSSP 
Sbjct: 548  KFRPFHQVQYDTVENKLTFERKCELGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPS 607

Query: 2098 SGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTLQDVI 2277
            S E  P+ + K   E  +P++ V+ L+ +MHNLSEL L HCSND C LKE D   L+ V+
Sbjct: 608  SVEAVPARLNKLHGEQLAPQMCVRTLISSMHNLSELLLFHCSNDMCGLKEHDFEALKLVV 667

Query: 2278 KNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASVDRSQVT 2457
             NL+ CISK        +E L   Q +S+FI +   LH+G +V   Q            T
Sbjct: 668  NNLDKCISKRMGPEAPIQESLLT-QKSSEFIREFPELHEGVTVSSPQ-----------ET 715

Query: 2458 KVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASA----------------QDDAEI 2589
            K A +++ Q + QH+ E++  DI  G K EK  +F S                 +DDAE 
Sbjct: 716  KAAFSVLNQPNYQHVQEQRSPDIAAGKKIEKCSDFTSQGGHAERVKDDDMTQVHKDDAER 775

Query: 2590 VKNDKMTQAIKKILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEK 2769
            VK+D MTQAIKK+L +NF  EE+ + QVLLY+NLWLEAEA +C+ NY+ARF+RMKIE+E 
Sbjct: 776  VKDDNMTQAIKKVLSDNFVKEEDEKLQVLLYRNLWLEAEAALCAINYKARFNRMKIELEN 835

Query: 2770 CNSHEAKDVSES-------STNRISPDVNAINQLAPE------PNGGIQDYPIINTSSHE 2910
            C   +AKD+SE+       S    SPD++A+N+L P+       +  ++D+PI N+SSH 
Sbjct: 836  CKLLKAKDLSENTSELEKLSQTTFSPDLHAVNKLPPQVKDDTTQDVSVRDFPIANSSSHP 895

Query: 2911 VDVLARFHILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHDS 3090
             DV+ARF ILK +++ S++      +KP++       D+VD  + + ++  TP  S    
Sbjct: 896  DDVVARFQILKCQESKSHA-----NQKPTA-------DEVDNFLFEARNDQTPPTSTCSL 943

Query: 3091 RILSKTSHVDNVEDSVMDRLRILKCRVDNLTPTVNVEDPEVP------------DVNPGC 3234
               + TS  D+VE SV+ R  ILK R++N +   N+ D  +P            DVN G 
Sbjct: 944  SNATSTSKADDVEASVIARFHILKNRIEN-SSCSNMGDQILPQVAFKLFENGTSDVNTGP 1002

Query: 3235 DESWDENFEAIMGHVIEHEKLKVKESSRLCDDRMIHSCWTDKLGNQLPAGWCDSSSSSDW 3414
            +   + +           +KL VKE     +D +I S   +KLGNQLPA  C  SSS DW
Sbjct: 1003 ELHRNSSTHM-------QDKLTVKEFH--LNDAVIQSPRLNKLGNQLPAS-CYDSSSLDW 1052

Query: 3415 EHVLRDELPAQN 3450
            EHV ++ELPAQN
Sbjct: 1053 EHVSKEELPAQN 1064


>ref|XP_007039222.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508776467|gb|EOY23723.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 1068

 Score =  511 bits (1315), Expect = e-141
 Identities = 401/1085 (36%), Positives = 555/1085 (51%), Gaps = 68/1085 (6%)
 Frame = +1

Query: 400  NSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVEAKPYYPSYVSSAICEDSPLAGLNE 579
            N  T + P P  +     +    P Y  +  + L E K YYPSYVS              
Sbjct: 53   NPYTFNSPQPAQLPQLDLEPTPTPSY--NQNSDLFEPKTYYPSYVSP------------- 97

Query: 580  PGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDGSFCSK 759
                        PL+  + ++  +SL GLD++ QWGG   GL D E G+     GSF  K
Sbjct: 98   ------------PLHVPTFNE--QSLPGLDHTAQWGG---GLWDWEKGKPAQLGGSFYLK 140

Query: 760  ESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLDDN-FL 936
            E++V  SSIY D++  GA  SK L  C E S   ++I  P RE   G ++ + LD N  L
Sbjct: 141  ETSVAPSSIYMDHINLGAHPSKSLKTCEETS---YNIYSP-REDQAGPANIEKLDYNPVL 196

Query: 937  PQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFRQHDTC 1116
             QN   +  + L+TSV GSSS + E +  +P L +L +  N     S  YEK  RQH T 
Sbjct: 197  GQNPSFMPVDYLKTSVIGSSSAISEANLQAPPL-NLVNCKNNHVQISTPYEKPLRQHGTT 255

Query: 1117 LNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINLNSIGSDSARRSEDFADNIHCNLKE 1296
            L+D   + KS P +VIRPP VG +SSAS++V F N+N+ G ++     + A N    ++E
Sbjct: 256  LSDSIPSVKSSPGVVIRPPAVGTSSSASNSVSFKNVNT-GINAT--DTNLAGNNRFIVEE 312

Query: 1297 PRPPLSSERKEVCLDMSPNIL--GRRDYNNVDTFATKKEEFSNNISVTKDSFDHFFEKRC 1470
            PR   +   K     +  + L  G    +   + +T+K    N  S      D+FF  + 
Sbjct: 313  PRFLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMAS------DNFFGAKS 366

Query: 1471 ELQVPHMSNENGYSLAHDA-EPVNFIENSSKSFDQYNPAVDSPCWKGATANRSSPL--SE 1641
             + +  +S +N +SLA +  E V  +ENS +S D YNP VDSPCWKGA A+ +SP   SE
Sbjct: 367  GVNLSRISPDN-FSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE 425

Query: 1642 AVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGNDSSLSA 1821
             V  QL  +KLE C+  N    +    NT + V+  S K  E L   EN    + S  S 
Sbjct: 426  PVAVQL-AKKLEACDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSL 484

Query: 1822 KSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPDNLKDDADFR 2001
            K P V+  S   H PD+A K GS   K+S A  V+FSD   + +K+  L D   D+ +  
Sbjct: 485  KLPPVSIPSFKEHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKA 544

Query: 2002 MHASGTY------------------------VDDASEDGLSHVPFHVTKHVLSSPPSGED 2109
             H S                           ++D S  G SHV  H  KH+  +P S ED
Sbjct: 545  SHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVED 604

Query: 2110 APSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTLQDVIKNLN 2289
              +   K   +       + +LV TM NLSEL L HCSN+AC L+EQD  +L+ VI NL+
Sbjct: 605  VSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLD 664

Query: 2290 VCISKTAEQ---MISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASVD-RSQVT 2457
             C+SK   Q   +    +  FP    +      S LHKG S    Q+   A++D  SQ T
Sbjct: 665  TCMSKNIGQETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQV---AAIDVLSQHT 721

Query: 2458 KVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEI-VKNDKMTQAIKKILV 2634
                    Q+ ++H          +G KDEK   F S +   +I VKNDKMTQAIKK+L+
Sbjct: 722  --------QVKRKH----------FGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLI 763

Query: 2635 ENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKCNSHEAKDVSES--- 2805
            ENFH +EET PQVLLYKNLWLEAEA +CS NY AR++ MKIE+EKC     KD+SE    
Sbjct: 764  ENFHEKEETHPQVLLYKNLWLEAEAALCSINYMARYNNMKIEIEKCKLDTEKDLSEDTPD 823

Query: 2806 ----STNRISPDVN------AINQLAPEPNGGIQDYPIINTSSHEVDVLARFHILKSRDN 2955
                S +++S D++      AI + AP  +   Q++PI ++S+H  DV ARFH+LK R N
Sbjct: 824  EDKISRSKLSADLDTNKKLTAIAESAPTLDVSNQNFPIASSSNHADDVTARFHVLKHRLN 883

Query: 2956 DSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHDSRILSKTSHVDNVEDS 3135
            +S SV+  DA++ SSSK S + D VDKL  + +D  T  +   DS +     H D+VE S
Sbjct: 884  NSYSVHTRDADELSSSKLSLDSDAVDKLATEVKDSSTSSLQTQDSPVPGTACHTDDVEAS 943

Query: 3136 VMDRLRILKCRVDNLTPTVNVED---PEVPDVN--------PGCDESWDE-----NFEAI 3267
            +M RL ILK R +    +  +E    PEV D+         P  +++ D+     N E++
Sbjct: 944  IMTRLHILKSRGNVDLDSNEMEQKPLPEVVDLGFAGKKKQIPIDEDTADDGVLGFNLESV 1003

Query: 3268 -MGHVIEHE-KLKVKESSRLC--DDRMIHSCWTDKLGNQLPAGWCDSSSSSDWEHVLRDE 3435
                V+++  +  V +   LC   D  I S  + +LGNQL AGW D S SSDWEHVL++E
Sbjct: 1004 SQNQVVDYAGEQSVVKDFHLCVKHDCTIQSPKSTRLGNQLSAGWYD-SCSSDWEHVLKEE 1062

Query: 3436 LPAQN 3450
            L  QN
Sbjct: 1063 LSGQN 1067


>ref|XP_007039220.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590674635|ref|XP_007039223.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508776465|gb|EOY23721.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508776468|gb|EOY23724.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1079

 Score =  511 bits (1315), Expect = e-141
 Identities = 401/1085 (36%), Positives = 555/1085 (51%), Gaps = 68/1085 (6%)
 Frame = +1

Query: 400  NSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVEAKPYYPSYVSSAICEDSPLAGLNE 579
            N  T + P P  +     +    P Y  +  + L E K YYPSYVS              
Sbjct: 64   NPYTFNSPQPAQLPQLDLEPTPTPSY--NQNSDLFEPKTYYPSYVSP------------- 108

Query: 580  PGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDGSFCSK 759
                        PL+  + ++  +SL GLD++ QWGG   GL D E G+     GSF  K
Sbjct: 109  ------------PLHVPTFNE--QSLPGLDHTAQWGG---GLWDWEKGKPAQLGGSFYLK 151

Query: 760  ESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLDDN-FL 936
            E++V  SSIY D++  GA  SK L  C E S   ++I  P RE   G ++ + LD N  L
Sbjct: 152  ETSVAPSSIYMDHINLGAHPSKSLKTCEETS---YNIYSP-REDQAGPANIEKLDYNPVL 207

Query: 937  PQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFRQHDTC 1116
             QN   +  + L+TSV GSSS + E +  +P L +L +  N     S  YEK  RQH T 
Sbjct: 208  GQNPSFMPVDYLKTSVIGSSSAISEANLQAPPL-NLVNCKNNHVQISTPYEKPLRQHGTT 266

Query: 1117 LNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINLNSIGSDSARRSEDFADNIHCNLKE 1296
            L+D   + KS P +VIRPP VG +SSAS++V F N+N+ G ++     + A N    ++E
Sbjct: 267  LSDSIPSVKSSPGVVIRPPAVGTSSSASNSVSFKNVNT-GINAT--DTNLAGNNRFIVEE 323

Query: 1297 PRPPLSSERKEVCLDMSPNIL--GRRDYNNVDTFATKKEEFSNNISVTKDSFDHFFEKRC 1470
            PR   +   K     +  + L  G    +   + +T+K    N  S      D+FF  + 
Sbjct: 324  PRFLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMAS------DNFFGAKS 377

Query: 1471 ELQVPHMSNENGYSLAHDA-EPVNFIENSSKSFDQYNPAVDSPCWKGATANRSSPL--SE 1641
             + +  +S +N +SLA +  E V  +ENS +S D YNP VDSPCWKGA A+ +SP   SE
Sbjct: 378  GVNLSRISPDN-FSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE 436

Query: 1642 AVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGNDSSLSA 1821
             V  QL  +KLE C+  N    +    NT + V+  S K  E L   EN    + S  S 
Sbjct: 437  PVAVQL-AKKLEACDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSL 495

Query: 1822 KSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPDNLKDDADFR 2001
            K P V+  S   H PD+A K GS   K+S A  V+FSD   + +K+  L D   D+ +  
Sbjct: 496  KLPPVSIPSFKEHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKA 555

Query: 2002 MHASGTY------------------------VDDASEDGLSHVPFHVTKHVLSSPPSGED 2109
             H S                           ++D S  G SHV  H  KH+  +P S ED
Sbjct: 556  SHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVED 615

Query: 2110 APSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTLQDVIKNLN 2289
              +   K   +       + +LV TM NLSEL L HCSN+AC L+EQD  +L+ VI NL+
Sbjct: 616  VSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLD 675

Query: 2290 VCISKTAEQ---MISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASVD-RSQVT 2457
             C+SK   Q   +    +  FP    +      S LHKG S    Q+   A++D  SQ T
Sbjct: 676  TCMSKNIGQETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQV---AAIDVLSQHT 732

Query: 2458 KVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEI-VKNDKMTQAIKKILV 2634
                    Q+ ++H          +G KDEK   F S +   +I VKNDKMTQAIKK+L+
Sbjct: 733  --------QVKRKH----------FGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLI 774

Query: 2635 ENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKCNSHEAKDVSES--- 2805
            ENFH +EET PQVLLYKNLWLEAEA +CS NY AR++ MKIE+EKC     KD+SE    
Sbjct: 775  ENFHEKEETHPQVLLYKNLWLEAEAALCSINYMARYNNMKIEIEKCKLDTEKDLSEDTPD 834

Query: 2806 ----STNRISPDVN------AINQLAPEPNGGIQDYPIINTSSHEVDVLARFHILKSRDN 2955
                S +++S D++      AI + AP  +   Q++PI ++S+H  DV ARFH+LK R N
Sbjct: 835  EDKISRSKLSADLDTNKKLTAIAESAPTLDVSNQNFPIASSSNHADDVTARFHVLKHRLN 894

Query: 2956 DSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHDSRILSKTSHVDNVEDS 3135
            +S SV+  DA++ SSSK S + D VDKL  + +D  T  +   DS +     H D+VE S
Sbjct: 895  NSYSVHTRDADELSSSKLSLDSDAVDKLATEVKDSSTSSLQTQDSPVPGTACHTDDVEAS 954

Query: 3136 VMDRLRILKCRVDNLTPTVNVED---PEVPDVN--------PGCDESWDE-----NFEAI 3267
            +M RL ILK R +    +  +E    PEV D+         P  +++ D+     N E++
Sbjct: 955  IMTRLHILKSRGNVDLDSNEMEQKPLPEVVDLGFAGKKKQIPIDEDTADDGVLGFNLESV 1014

Query: 3268 -MGHVIEHE-KLKVKESSRLC--DDRMIHSCWTDKLGNQLPAGWCDSSSSSDWEHVLRDE 3435
                V+++  +  V +   LC   D  I S  + +LGNQL AGW D S SSDWEHVL++E
Sbjct: 1015 SQNQVVDYAGEQSVVKDFHLCVKHDCTIQSPKSTRLGNQLSAGWYD-SCSSDWEHVLKEE 1073

Query: 3436 LPAQN 3450
            L  QN
Sbjct: 1074 LSGQN 1078


>ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citrus clementina]
            gi|557543533|gb|ESR54511.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
          Length = 1064

 Score =  508 bits (1307), Expect = e-140
 Identities = 389/1098 (35%), Positives = 542/1098 (49%), Gaps = 79/1098 (7%)
 Frame = +1

Query: 394  SINSPTT-HMPPP-NSVATDATDAYSFPEYLDSV-TTSLVEAKPYYPSYVSSAICEDSPL 564
            S N P+T H+PPP N +   +  ++ + +  D++ + +LVEA PYYPSYVS         
Sbjct: 67   SFNPPSTAHIPPPENPIPITSASSFLYGQSSDAIPSANLVEANPYYPSYVSPT------- 119

Query: 565  AGLNEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDG 744
                                  + DDY +SLS L  S +W          E  R+     
Sbjct: 120  --------------------KYTYDDYAQSLSSLWDSREW----------EFSRKLELGE 149

Query: 745  SFCSKESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLD 924
            SFCSKE NV   SIY+DY  QGA +SK L+   + +   +++ + G E   G  + + LD
Sbjct: 150  SFCSKEMNVPDLSIYQDYADQGAHSSKGLNTFEQKN---NNLDMLGSEQHQGSINREQLD 206

Query: 925  -DNFLPQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFR 1101
              +F  Q  + +  E  R SV GS+S+ PET+  + + +    SW+ Q    AS EK  +
Sbjct: 207  YKSFTGQISEFMPVEYSRKSVHGSTSLFPETYSLTSYEQG--RSWSHQTPYGASCEKGAK 264

Query: 1102 QHDTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINL-NSIGSDSARRSEDFADNI 1278
            QH    ND  S  KS P  V++   V  + S  STV F NL NS G  ++       DN+
Sbjct: 265  QHGISPNDISSVKKSSPVHVVKSQAVFTSLSPPSTVSFNNLENSSGVIASN------DNL 318

Query: 1279 HCNLKEPRPPLSSERK--------EVCLDMSPNILGRRDYNNVDTFATKKEEFSNNISVT 1434
              N+KE  P  SSE K           L+   +I  +  +        KKE+ S+N+SV 
Sbjct: 319  S-NMKEFYPLHSSEGKVHFDAGQVSFHLERGSHIFPKLPFE-------KKEKLSSNVSVI 370

Query: 1435 KDSFDHFFEKRCELQVPHMSNENGYSLAHDAEPVNFIENSSKSFDQYNPAVDSPCWKGAT 1614
            KD      +++  LQ+P +   +   +  +   +N  E SS+S D YNPAVDSPCWKGA 
Sbjct: 371  KDPL----KEKPGLQIPDIGPGSVSLMLANNRAINCSEGSSESLDHYNPAVDSPCWKGAP 426

Query: 1615 ANRSSPLSEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVC 1794
               S   S        + K+E C+  N  GP      T+++ +V   KPS+  +Y E+  
Sbjct: 427  DYHSPVESSGPVTLQHINKIEACSGSNSIGP------TDNSGKVSPQKPSDYSFYQEHGY 480

Query: 1795 GGNDSSLSAKSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPD 1974
              ND   S K    A      HG D  +K G    KSS   GVQFSD I K R++    +
Sbjct: 481  LENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMKSSYGLGVQFSDCIDKPRQDYVHAN 540

Query: 1975 NLKDDADFRMHASGTY-------------------------VDDASEDGLSHVPFHVTKH 2079
            N  D+  FR      Y                         ++  SE   SHVP H T+H
Sbjct: 541  NSADEFKFRPFHQVQYDSVENKLTFERKCELGSGVADVGLSINGTSEGCSSHVPLHATEH 600

Query: 2080 VLSSPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNH 2259
            VLSSP S E  P+ + K   E  +P++ V+ L+ TMHNLSEL L HCSND C LKE D  
Sbjct: 601  VLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFE 660

Query: 2260 TLQDVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASV 2439
             L+ V+ NL+ CISK        +E L   Q +S+FI +   LH+G +V   +       
Sbjct: 661  ALKLVVNNLDKCISKRMGPEAPIQESLLT-QKSSEFIREFPELHEGVTVSSPK------- 712

Query: 2440 DRSQVTKVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASA---------------- 2571
                 TK A +++ Q + QH+ E++  DI  G K EK  +F S                 
Sbjct: 713  ----ETKAAFSVLNQPNYQHVQEQRSPDIAAGKKSEKCSDFTSQGGHAERVKDDDMTQVH 768

Query: 2572 QDDAEIVKNDKMTQAIKKILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRM 2751
            +DDAE VK+D MTQAIKK+L +NF  EE+ + QVLLY+NLWLEAEA +CS NY+ARF+RM
Sbjct: 769  KDDAERVKDDNMTQAIKKVLSDNFVEEEDEKLQVLLYRNLWLEAEAALCSINYKARFNRM 828

Query: 2752 KIEMEKCNSHEAKDVSES-------STNRISPDVNAINQLAPE------PNGGIQDYPII 2892
            KIE+E C   +AKD SE+       S    SPD++A+N+L P+       +  + D+PI 
Sbjct: 829  KIELENCKLLKAKDFSENTSELEKLSQTTFSPDLHAVNKLPPQVKDDSTQDVSVHDFPIA 888

Query: 2893 NTSSHEVDVLARFHILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPE 3072
            N SSH  DV+AR  ILK ++++S++               P  D+VD  + + ++  TP 
Sbjct: 889  NISSHPDDVVARSQILKCQESESHANQR------------PTADEVDNFLFEARNDQTPP 936

Query: 3073 ISVHDSRILSKTSHVDNVEDSVMDRLRILKCRVDNLTPTVNVEDPEVP------------ 3216
             S       + TS  D+VE SV+ R  ILK R++N +   N+ D  +P            
Sbjct: 937  TSTCSLSNATSTSKADDVEASVIARFHILKNRIEN-SSCSNMGDQILPQVAFKLFENGTS 995

Query: 3217 DVNPGCDESWDENFEAIMGHVIEHEKLKVKESSRLCDDRMIHSCWTDKLGNQLPAGWCDS 3396
            DVN G      E       H+   +KL VKE     +D +I S   +KLGNQLPA  C  
Sbjct: 996  DVNTG-----PELHRNSSNHM--QDKLTVKEFH--LNDAVIQSPRLNKLGNQLPAS-CYD 1045

Query: 3397 SSSSDWEHVLRDELPAQN 3450
            SSS DWEHV ++ELPAQN
Sbjct: 1046 SSSLDWEHVSKEELPAQN 1063


>ref|XP_007039224.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508776469|gb|EOY23725.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 1059

 Score =  506 bits (1303), Expect = e-140
 Identities = 401/1082 (37%), Positives = 552/1082 (51%), Gaps = 65/1082 (6%)
 Frame = +1

Query: 400  NSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVEAKPYYPSYVSSAICEDSPLAGLNE 579
            N  T + P P  +     +    P Y  +  + L E K YYPSYVS              
Sbjct: 64   NPYTFNSPQPAQLPQLDLEPTPTPSY--NQNSDLFEPKTYYPSYVSP------------- 108

Query: 580  PGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDGSFCSK 759
                        PL+  + ++  +SL GLD++ QWGG   GL D E G+     GSF  K
Sbjct: 109  ------------PLHVPTFNE--QSLPGLDHTAQWGG---GLWDWEKGKPAQLGGSFYLK 151

Query: 760  ESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLDDN-FL 936
            E++V  SSIY D++  GA  SK L  C E S   ++I  P RE   G ++ + LD N  L
Sbjct: 152  ETSVAPSSIYMDHINLGAHPSKSLKTCEETS---YNIYSP-REDQAGPANIEKLDYNPVL 207

Query: 937  PQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFRQHDTC 1116
             QN   +  + L+TSV GSSS + E +  +P L +L +  N     S  YEK  RQH T 
Sbjct: 208  GQNPSFMPVDYLKTSVIGSSSAISEANLQAPPL-NLVNCKNNHVQISTPYEKPLRQHGTT 266

Query: 1117 LNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINLNSIGSDSARRSEDFADNIHCNLKE 1296
            L+D   + KS P +VIRPP VG +SSAS++V F N+N+ G ++     + A N    ++E
Sbjct: 267  LSDSIPSVKSSPGVVIRPPAVGTSSSASNSVSFKNVNT-GINAT--DTNLAGNNRFIVEE 323

Query: 1297 PRPPLSSERKEVCLDMSPNIL--GRRDYNNVDTFATKKEEFSNNISVTKDSFDHFFEKRC 1470
            PR   +   K     +  + L  G    +   + +T+K    N  S      D+FF  + 
Sbjct: 324  PRFLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMAS------DNFFGAKS 377

Query: 1471 ELQVPHMSNENGYSLAHDA-EPVNFIENSSKSFDQYNPAVDSPCWKGATANRSSPL--SE 1641
             + +  +S +N +SLA +  E V  +ENS +S D YNP VDSPCWKGA A+ +SP   SE
Sbjct: 378  GVNLSRISPDN-FSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE 436

Query: 1642 AVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGNDSSLSA 1821
             V  QL  +KLE C+  N    +    NT + V+  S K  E L   EN    + S  S 
Sbjct: 437  PVAVQL-AKKLEACDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSL 495

Query: 1822 KSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPDNLKDDADFR 2001
            K P V+  S   H PD+A K GS   K+S A  V+FSD   + +K+  L D   D+ +  
Sbjct: 496  KLPPVSIPSFKEHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKA 555

Query: 2002 MHASGTY------------------------VDDASEDGLSHVPFHVTKHVLSSPPSGED 2109
             H S                           ++D S  G SHV  H  KH+  +P S ED
Sbjct: 556  SHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVED 615

Query: 2110 APSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTLQDVIKNLN 2289
              +   K   +       + +LV TM NLSEL L HCSN+AC L+EQD  +L+ VI NL+
Sbjct: 616  VSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLD 675

Query: 2290 VCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASVD-RSQVTKVA 2466
             C+SK   Q     E L             S LHKG S    Q+   A++D  SQ T   
Sbjct: 676  TCMSKNIGQ-----ETLL------------SELHKGTSTGSPQV---AAIDVLSQHT--- 712

Query: 2467 SAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEI-VKNDKMTQAIKKILVENF 2643
                 Q+ ++H          +G KDEK   F S +   +I VKNDKMTQAIKK+L+ENF
Sbjct: 713  -----QVKRKH----------FGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLIENF 757

Query: 2644 HGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKCNSHEAKDVSES------ 2805
            H +EET PQVLLYKNLWLEAEA +CS NY AR++ MKIE+EKC     KD+SE       
Sbjct: 758  HEKEETHPQVLLYKNLWLEAEAALCSINYMARYNNMKIEIEKCKLDTEKDLSEDTPDEDK 817

Query: 2806 -STNRISPDVN------AINQLAPEPNGGIQDYPIINTSSHEVDVLARFHILKSRDNDSN 2964
             S +++S D++      AI + AP  +   Q++PI ++S+H  DV ARFH+LK R N+S 
Sbjct: 818  ISRSKLSADLDTNKKLTAIAESAPTLDVSNQNFPIASSSNHADDVTARFHVLKHRLNNSY 877

Query: 2965 SVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHDSRILSKTSHVDNVEDSVMD 3144
            SV+  DA++ SSSK S + D VDKL  + +D  T  +   DS +     H D+VE S+M 
Sbjct: 878  SVHTRDADELSSSKLSLDSDAVDKLATEVKDSSTSSLQTQDSPVPGTACHTDDVEASIMT 937

Query: 3145 RLRILKCRVDNLTPTVNVED---PEVPDVN--------PGCDESWDE-----NFEAI-MG 3273
            RL ILK R +    +  +E    PEV D+         P  +++ D+     N E++   
Sbjct: 938  RLHILKSRGNVDLDSNEMEQKPLPEVVDLGFAGKKKQIPIDEDTADDGVLGFNLESVSQN 997

Query: 3274 HVIEHE-KLKVKESSRLC--DDRMIHSCWTDKLGNQLPAGWCDSSSSSDWEHVLRDELPA 3444
             V+++  +  V +   LC   D  I S  + +LGNQL AGW D S SSDWEHVL++EL  
Sbjct: 998  QVVDYAGEQSVVKDFHLCVKHDCTIQSPKSTRLGNQLSAGWYD-SCSSDWEHVLKEELSG 1056

Query: 3445 QN 3450
            QN
Sbjct: 1057 QN 1058


>ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301835 [Fragaria vesca
            subsp. vesca]
          Length = 1218

 Score =  493 bits (1270), Expect = e-136
 Identities = 378/1066 (35%), Positives = 537/1066 (50%), Gaps = 58/1066 (5%)
 Frame = +1

Query: 181  TVDRPFFKPNSNPLAH-------FTEPPYAAPFXXXXXXXXXXXXXXXXXXXXTSRPDYF 339
            TV+RP  KP S+PL          TE PYAAP                     +S P++F
Sbjct: 29   TVERPVPKPISSPLVESFTPLVEVTEQPYAAP-------PNSTLHNWLPPHSPSSVPNFF 81

Query: 340  ASPNSEVDIMPN----RYLVSQSINSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVE 507
             +P    D +P+    RY    +++S +T++PP NSV+  +++A+S+ + LD   TS VE
Sbjct: 82   TNPPPAFDSVPSSNAYRYAGLPTVDSFSTNLPPMNSVSMPSSNAFSYDQRLDVAATSFVE 141

Query: 508  AKPYYPSYVSSAICEDSPLAGLNEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWG 687
            AKPYYPSY+S  I  D+P+   ++P YD LS S+  PL+GSS  +YT+  S   Y+ QWG
Sbjct: 142  AKPYYPSYLSPTIHGDNPVVPPDQPSYDWLSTSQFAPLDGSSHKEYTQRPSSSKYTAQWG 201

Query: 688  GFWNGLTDLESGRQTGFDGSFCSKESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHS 867
              WNG  + E G+Q  FDGSF  KE++V S+  Y +Y+ Q   +S  L +       SH+
Sbjct: 202  SSWNGPAEWEQGKQGQFDGSFRPKENDV-SNLPYNNYLNQEPHSSNSLKSYGVNEVASHN 260

Query: 868  IIVPGRESCIGFSSTKPLDDNFLPQNFKSISDESLRTSVGGSSSV--LPETHPNSPFLES 1041
            I  P     +        D +F+ +N K    +  + ++G  S V  +P   P+SPF+  
Sbjct: 261  I--PDWNGSVNAEHLG--DKSFVGRNSKFSPIDFTKPTMGSLSVVPEIPSKAPSSPFIGK 316

Query: 1042 LTHSWNFQKANSASYEKCFRQHDTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFIN 1221
             T+  + +K          RQHD   ND  S  KS PA +IRPP +G  SS      F  
Sbjct: 317  STYGVSCEK----------RQHDASWNDVTSISKSSPASIIRPPAIGTKSSEPKMGLFKR 366

Query: 1222 LNSIGSDSARRSEDFADNIHCNLKEPRPPLSSERKEVCLDMSP--NILGRRDYNNVDTFA 1395
            LNS G D+A          +   +E   P S   K V  D S     LGR D  +V++ +
Sbjct: 367  LNS-GRDAANADH----GGYYPSQESHLPQSFVDK-VPFDSSQLGIHLGRIDPFSVESSS 420

Query: 1396 TKKEEFSNNISVTKDSFDHFFEKRCELQVPHMSNENGYSLAHDAEPVNFIENSSKSFDQY 1575
            TK     NN S++ D  DH F+ +  L   H+  +   +  +  + +N   NSS++ D  
Sbjct: 421  TKDTALPNNGSISNDPLDHLFKVKPGLPNSHVKPDGFDAAVNINDSINSFLNSSENVDPN 480

Query: 1576 NPAVDSPCWKGATANRSSPL--SEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVF 1749
            NPAVDSPCWKG   +R SP   SE   P+  M+KLE CN LNL  P IF LNT + +   
Sbjct: 481  NPAVDSPCWKGVRGSRFSPFKASEEGGPEK-MKKLEGCNGLNLNMPMIFSLNTCENISTQ 539

Query: 1750 SPKPSENLWYHENVCGGNDSSLSAKSPLVAGFSSTGHGPDDAVK---------------- 1881
             P       +  N   GN   L  K   V   +   H  DD  K                
Sbjct: 540  KPVEYNEFGWLGNGLLGNGLPLPLKKSSVENSAFGEHKLDDTTKTTYYRESGHDRGLHGY 599

Query: 1882 ---PGSCS-TKSSCAYGVQFSDYILKSRKEDALPDNLKD-----DADFRMHASGTYVDDA 2034
               P S S  KSS  +  + S  + +   E  L    K+      AD +++     ++D 
Sbjct: 600  INTPHSGSGDKSSSPF--EHSYIVQEGCGEGGLTTESKNTTWSVGADVKLN-----INDT 652

Query: 2035 SEDGLSHV-PFHVTKHVLSSPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFL 2211
             E G SH  P   T       PS EDA + +  S  E S+  +D+QMLV  M++LSE+ L
Sbjct: 653  LECGSSHTSPIENT----FCSPSVEDADTKLTTSYGEESNMNMDIQMLVNKMNSLSEVLL 708

Query: 2212 LHCSNDACTLKEQDNHTLQDVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLH 2391
            ++CSN +C LK++D   L+ VI NLN CI K  E  +S  E     Q T ++IE+    +
Sbjct: 709  VNCSNSSCQLKKKDIDALKAVINNLNSCILKHDEDFLSMPESPPIQQSTIKYIEELCKPN 768

Query: 2392 KGASVDRSQLTKGASVDRSQVTKVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASA 2571
            K  S D  QLTK            A +I   L  Q + + K  D +    DE   +  SA
Sbjct: 769  KALSPDMPQLTK----------IFAPSIQDPLHLQGVQKVKNHDNLVKNDDEVISS-VSA 817

Query: 2572 QDDAEIVKNDKMTQAIKKILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRM 2751
            + D + VK ++MTQ IKKIL ENFH  ++T PQ LLYKNLWLEAEAVICSTNY+ARF+R+
Sbjct: 818  KSDIDFVKQEEMTQDIKKILSENFH-TDDTHPQTLLYKNLWLEAEAVICSTNYKARFNRL 876

Query: 2752 KIEMEKCNSHEAKDVSE-------SSTNRISPDVNAINQL------APEPNGGIQDYPII 2892
            K EMEKC + ++KDV E        S + +  + N + +L      +P P   +Q+ P +
Sbjct: 877  KTEMEKCKADQSKDVFEHTADMMTQSRSEVCVNSNPVEKLTSEVQGSPLPKLNLQESPTL 936

Query: 2893 NTSSHEVDVLARFHILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPE 3072
              +  + +V+ARFH+L++R  + +SVN    ++ SSS  S   DKVD++ P+    P+P 
Sbjct: 937  --TQGDDNVMARFHVLRNRIENLSSVNATFGDE-SSSTLSLVPDKVDEVAPEADARPSPR 993

Query: 3073 ISVHDSRILSKTSHVDNVEDSVMDRLRILKCRVDN--LTPTVNVED 3204
            IS+ DS   S T   ++ E SVM R  I++ RV+N       NVED
Sbjct: 994  ISLQDSPTSSITGLSNDYEASVMARFHIIRDRVENSKFISDANVED 1039



 Score = 94.0 bits (232), Expect = 5e-16
 Identities = 93/321 (28%), Positives = 148/321 (46%), Gaps = 41/321 (12%)
 Frame = +1

Query: 2599 DKMTQAIKKILVENFHGEE---ETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEK 2769
            D MTQ+  ++ V +   E+   E +   L   NL           N  ARF  ++  +E 
Sbjct: 897  DMMTQSRSEVCVNSNPVEKLTSEVQGSPLPKLNLQESPTLTQGDDNVMARFHVLRNRIEN 956

Query: 2770 CNSHEAKDVSESS-TNRISPDVNAINQLAPE------PNGGIQDYPIIN----TSSHEVD 2916
             +S  A    ESS T  + PD   ++++APE      P   +QD P  +    ++ +E  
Sbjct: 957  LSSVNATFGDESSSTLSLVPD--KVDEVAPEADARPSPRISLQDSPTSSITGLSNDYEAS 1014

Query: 2917 VLARFHILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHD--- 3087
            V+ARFHI++ R  +S  +++ + E  +SSK S E +  +       D P  E+++ D   
Sbjct: 1015 VMARFHIIRDRVENSKFISDANVEDTASSKVSREHEAEEGACETSDDGPIQELNIQDYPG 1074

Query: 3088 ------SRILSKTSHVDNVEDSVMDRLRILKCRVDNLT--PTV-----NVEDPEVPDVNP 3228
                      + T H    EDSV+ R  ILK RVDN +  PTV     +V+       N 
Sbjct: 1075 SVQDYPVSTSTTTGHAYQYEDSVLARFNILKSRVDNCSDIPTVGELLESVDLGYAGKRNL 1134

Query: 3229 G---CDESWDENFE-----AIMGHVIEHEK---LKVKESSRLCDDRMIHSCWTDKLGNQL 3375
            G   C+ S D + +      +  H+ ++ K   +  KE     +D   H    ++  NQL
Sbjct: 1135 GPIICNRSEDGSSDVKEQPVLQSHIADNSKGKCMDAKEFHLFVEDDPGH--MINRPANQL 1192

Query: 3376 PAGWCDSSSSSDWEHVLRDEL 3438
             AG  D S+SSDWEHV+++E+
Sbjct: 1193 SAGSPDQSTSSDWEHVMKEEV 1213


>ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citrus clementina]
            gi|557543530|gb|ESR54508.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
          Length = 1041

 Score =  491 bits (1263), Expect = e-135
 Identities = 381/1085 (35%), Positives = 531/1085 (48%), Gaps = 66/1085 (6%)
 Frame = +1

Query: 394  SINSPTT-HMPPP-NSVATDATDAYSFPEYLDSV-TTSLVEAKPYYPSYVSSAICEDSPL 564
            S N P+T H+PPP N +   +  ++ + +  D++ + +LVEA PYYPSYVS         
Sbjct: 67   SFNPPSTAHIPPPENPIPITSASSFLYGQSSDAIPSANLVEANPYYPSYVSPT------- 119

Query: 565  AGLNEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDG 744
                                  + DDY +SLS L  S +W          E  R+     
Sbjct: 120  --------------------KYTYDDYAQSLSSLWDSREW----------EFSRKLELGE 149

Query: 745  SFCSKESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLD 924
            SFCSKE NV   SIY+DY  QGA +SK L+   + +   +++ + G E   G  + + LD
Sbjct: 150  SFCSKEMNVPDLSIYQDYADQGAHSSKGLNTFEQKN---NNLDMLGSEQHQGSINREQLD 206

Query: 925  -DNFLPQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFR 1101
              +F  Q  + +  E  R SV GS+S+ PET+  + + +    SW+ Q    AS EK  +
Sbjct: 207  YKSFTGQISEFMPVEYSRKSVHGSTSLFPETYSLTSYEQG--RSWSHQTPYGASCEKGAK 264

Query: 1102 QHDTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINL-NSIGSDSARRSEDFADNI 1278
            QH    ND  S  KS P  V++   V  + S  STV F NL NS G  ++       DN+
Sbjct: 265  QHGISPNDISSVKKSSPVHVVKSQAVFTSLSPPSTVSFNNLENSSGVIASN------DNL 318

Query: 1279 HCNLKEPRPPLSSERK--------EVCLDMSPNILGRRDYNNVDTFATKKEEFSNNISVT 1434
              N+KE  P  SSE K           L+   +I  +  +        KKE+ S+N+SV 
Sbjct: 319  S-NMKEFYPLHSSEGKVHFDAGQVSFHLERGSHIFPKLPFE-------KKEKLSSNVSVI 370

Query: 1435 KDSFDHFFEKRCELQVPHMSNENGYSLAHDAEPVNFIENSSKSFDQYNPAVDSPCWKGAT 1614
            KD      +++  LQ+P +   +   +  +   +N  E SS+S D YNPAVDSPCWKGA 
Sbjct: 371  KDPL----KEKPGLQIPDIGPGSVSLMLANNRAINCSEGSSESLDHYNPAVDSPCWKGAP 426

Query: 1615 ANRSSPLSEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVC 1794
               S   S        + K+E C+  N  GP      T+++ +V   KPS+  +Y E+  
Sbjct: 427  DYHSPVESSGPVTLQHINKIEACSGSNSIGP------TDNSGKVSPQKPSDYSFYQEHGY 480

Query: 1795 GGNDSSLSAKSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPD 1974
              ND   S K    A      HG D  +K G    KSS   GVQFSD I K R++    +
Sbjct: 481  LENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMKSSYGLGVQFSDCIDKPRQDYVHAN 540

Query: 1975 NLKDDADFRMHASGTY-------------------------VDDASEDGLSHVPFHVTKH 2079
            N  D+  FR      Y                         ++  SE   SHVP H T+H
Sbjct: 541  NSADEFKFRPFHQVQYDSVENKLTFERKCELGSGVADVGLSINGTSEGCSSHVPLHATEH 600

Query: 2080 VLSSPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNH 2259
            VLSSP S E  P+ + K   E  +P++ V+ L+ TMHNLSEL L HCSND C LKE D  
Sbjct: 601  VLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFE 660

Query: 2260 TLQDVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASV 2439
             L+ V+ NL+ CISK        +E L   Q +S+FI +   LH+G +V   +       
Sbjct: 661  ALKLVVNNLDKCISKRMGPEAPIQESLLT-QKSSEFIREFPELHEGVTVSSPK------- 712

Query: 2440 DRSQVTKVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASA---------------- 2571
                 TK A +++ Q + QH+ E++  DI  G K EK  +F S                 
Sbjct: 713  ----ETKAAFSVLNQPNYQHVQEQRSPDIAAGKKSEKCSDFTSQGGHAERVKDDDMTQVH 768

Query: 2572 QDDAEIVKNDKMTQAIKKILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRM 2751
            +DDAE VK+D MTQAIKK+L +NF  EE+ + QVLLY+NLWLEAEA +CS NY+ARF+RM
Sbjct: 769  KDDAERVKDDNMTQAIKKVLSDNFVEEEDEKLQVLLYRNLWLEAEAALCSINYKARFNRM 828

Query: 2752 KIEMEKCNSHEAKDVSESSTNRISPDVNAINQLAPEPNGGIQDYPIINTSSHEVDVLARF 2931
            KIE+E C   +AK       N++ P V   +      +  + D+PI N SSH  DV+AR 
Sbjct: 829  KIELENCKLLKAK------VNKLPPQVKDDS----TQDVSVHDFPIANISSHPDDVVARS 878

Query: 2932 HILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHDSRILSKTS 3111
             ILK ++++S++               P  D+VD  + + ++  TP  S       + TS
Sbjct: 879  QILKCQESESHANQR------------PTADEVDNFLFEARNDQTPPTSTCSLSNATSTS 926

Query: 3112 HVDNVEDSVMDRLRILKCRVDNLTPTVNVEDPEVP------------DVNPGCDESWDEN 3255
              D+VE SV+ R  ILK R++N +   N+ D  +P            DVN G      E 
Sbjct: 927  KADDVEASVIARFHILKNRIEN-SSCSNMGDQILPQVAFKLFENGTSDVNTG-----PEL 980

Query: 3256 FEAIMGHVIEHEKLKVKESSRLCDDRMIHSCWTDKLGNQLPAGWCDSSSSSDWEHVLRDE 3435
                  H+   +KL VKE     +D +I S   +KLGNQLPA  C  SSS DWEHV ++E
Sbjct: 981  HRNSSNHM--QDKLTVKEFH--LNDAVIQSPRLNKLGNQLPAS-CYDSSSLDWEHVSKEE 1035

Query: 3436 LPAQN 3450
            LPAQN
Sbjct: 1036 LPAQN 1040


>ref|XP_007039221.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508776466|gb|EOY23722.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1017

 Score =  441 bits (1133), Expect = e-120
 Identities = 373/1076 (34%), Positives = 515/1076 (47%), Gaps = 59/1076 (5%)
 Frame = +1

Query: 400  NSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVEAKPYYPSYVSSAICEDSPLAGLNE 579
            N  T + P P  +     +    P Y  +  + L E K YYPSYVS              
Sbjct: 64   NPYTFNSPQPAQLPQLDLEPTPTPSY--NQNSDLFEPKTYYPSYVSP------------- 108

Query: 580  PGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDGSFCSK 759
                        PL+  + ++  +SL GLD++ QWGG   GL D E G+     GSF  K
Sbjct: 109  ------------PLHVPTFNE--QSLPGLDHTAQWGG---GLWDWEKGKPAQLGGSFYLK 151

Query: 760  ESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLDDN-FL 936
            E++V  SSIY D++  GA  SK L  C E S   ++I  P RE   G ++ + LD N  L
Sbjct: 152  ETSVAPSSIYMDHINLGAHPSKSLKTCEETS---YNIYSP-REDQAGPANIEKLDYNPVL 207

Query: 937  PQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFRQHDTC 1116
             QN   +  + L+TSV GSSS + E +  +P L +L +  N     S  YEK  RQH T 
Sbjct: 208  GQNPSFMPVDYLKTSVIGSSSAISEANLQAPPL-NLVNCKNNHVQISTPYEKPLRQHGTT 266

Query: 1117 LNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINLNSIGSDSARRSEDFADNIHCNLKE 1296
            L+D   + KS P +VIRPP VG +SSAS++V F N+N+ G ++     + A N    ++E
Sbjct: 267  LSDSIPSVKSSPGVVIRPPAVGTSSSASNSVSFKNVNT-GINAT--DTNLAGNNRFIVEE 323

Query: 1297 PRPPLSSERKEVCLDMSPNIL--GRRDYNNVDTFATKKEEFSNNISVTKDSFDHFFEKRC 1470
            PR   +   K     +  + L  G    +   + +T+K    N  S      D+FF  + 
Sbjct: 324  PRFLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMAS------DNFFGAKS 377

Query: 1471 ELQVPHMSNENGYSLAHDA-EPVNFIENSSKSFDQYNPAVDSPCWKGATANRSSPL--SE 1641
             + +  +S +N +SLA +  E V  +ENS +S D YNP VDSPCWKGA A+ +SP   SE
Sbjct: 378  GVNLSRISPDN-FSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE 436

Query: 1642 AVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGNDSSLSA 1821
             V  QL  +KLE C+  N    +    NT + V+  S K  E L   EN    + S  S 
Sbjct: 437  PVAVQL-AKKLEACDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSL 495

Query: 1822 KSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPDNLKDDADFR 2001
            K P V+  S   H PD+A K GS   K+S A  V+FSD   + +K+  L D   D+ +  
Sbjct: 496  KLPPVSIPSFKEHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKA 555

Query: 2002 MHASGTY------------------------VDDASEDGLSHVPFHVTKHVLSSPPSGED 2109
             H S                           ++D S  G SHV  H  KH+  +P S ED
Sbjct: 556  SHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVED 615

Query: 2110 APSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTLQDVIKNLN 2289
              +   K   +       + +LV TM NLSEL L HCSN+AC L+EQD  +L+ VI NL+
Sbjct: 616  VSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLD 675

Query: 2290 VCISKTAEQ---MISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASVD-RSQVT 2457
             C+SK   Q   +    +  FP    +      S LHKG S    Q+   A++D  SQ T
Sbjct: 676  TCMSKNIGQETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQV---AAIDVLSQHT 732

Query: 2458 KVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEI-VKNDKMTQAIKKILV 2634
                    Q+ ++H          +G KDEK   F S +   +I VKNDKMTQAIKK+L+
Sbjct: 733  --------QVKRKH----------FGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLI 774

Query: 2635 ENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKCNSHEAKDVSES--- 2805
            ENFH +EET PQVLLYKNLWLEAEA +CS NY AR++ MKIE+EKC     KD+SE    
Sbjct: 775  ENFHEKEETHPQVLLYKNLWLEAEAALCSINYMARYNNMKIEIEKCKLDTEKDLSEDTPD 834

Query: 2806 -----------STNRISPDVNAINQLAPEPNGG------IQDYPIINTSSH----EVDVL 2922
                       S++++S D +A+++LA E           QD P+  T+ H    E  ++
Sbjct: 835  EDKISRDADELSSSKLSLDSDAVDKLATEVKDSSTSSLQTQDSPVPGTACHTDDVEASIM 894

Query: 2923 ARFHILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHDSRILS 3102
             R HILKSR N     N ++ +                        P PE  V D     
Sbjct: 895  TRLHILKSRGNVDLDSNEMEQK------------------------PLPE--VVDLGFAG 928

Query: 3103 KTSHVDNVEDSVMDRLRILKCRVDNLTPTVNVEDPEVPDVNPGCDESWDENFEAIMGHVI 3282
            K   +   ED+  D   +L   ++      +V   +V D     ++S  ++F   + H  
Sbjct: 929  KKKQIPIDEDTADD--GVLGFNLE------SVSQNQVVDY--AGEQSVVKDFHLCVKH-- 976

Query: 3283 EHEKLKVKESSRLCDDRMIHSCWTDKLGNQLPAGWCDSSSSSDWEHVLRDELPAQN 3450
                           D  I S  + +LGNQL AGW D S SSDWEHVL++EL  QN
Sbjct: 977  ---------------DCTIQSPKSTRLGNQLSAGWYD-SCSSDWEHVLKEELSGQN 1016


>gb|EXB94712.1| hypothetical protein L484_002599 [Morus notabilis]
          Length = 1159

 Score =  439 bits (1128), Expect = e-120
 Identities = 357/1099 (32%), Positives = 550/1099 (50%), Gaps = 61/1099 (5%)
 Frame = +1

Query: 319  TSRPDYFASPNSEVDIM--PNRYLVSQSINSPTTHMPPPNSVATDATDAYSFPEYLDSVT 492
            TS  ++F+ PN E++ +  PN      S+N   TH+P  ++  + + DA+S+ +  D V 
Sbjct: 71   TSETNFFSDPNLELNSVASPNNPYNYASLN---THLPHLSTSVSASADAFSYAQCGDGV- 126

Query: 493  TSLVEAKPYYPSYVSSAICEDSPLAGLNEPGYDLLSASR---VGPLNGSSVDDYTRSLSG 663
                 AKPYY S++S    +D  L   ++  YD LS+S    V  L+GSS  DY++    
Sbjct: 127  -----AKPYYFSFLSPPTQKDGSLVVPDQTSYDWLSSSSHVAVTALDGSSNKDYSQRSGD 181

Query: 664  LDYSTQWGGFWNGLTDLESGRQTGFDGSFC-SKESNVTSSSIYKDYVKQGAATSKDLSAC 840
                 QWGG WNG ++ E G Q  FDGSFC SKES++  SS+Y++++ Q   + K L+  
Sbjct: 182  SKKPAQWGGLWNGFSEWEQGNQGLFDGSFCCSKESDIPVSSMYENFMNQETHSPKGLNRG 241

Query: 841  VEASAVSHSIIVPGRESCIGFSSTKPLDDNFLPQNFKSISDESLRTSVGGSSSVLPETHP 1020
             EA  ++      G+E+  G ++    D +    N ++ +D+S     G +S+ LP    
Sbjct: 242  EEAMRLNR-----GKEAFHGINNLDS-DKHGGSVNAENFNDKSFS---GKTSNFLPADCS 292

Query: 1021 NSPFLESLTH---------------SWNFQKANSASYEKCFRQHDTCLNDCKSAPKSPPA 1155
             S FLESL+                S   Q    AS EK  +QH T       + KS P 
Sbjct: 293  RS-FLESLSGFPDSGLESPCFMIGTSSGHQIPYGASNEKHLKQHAT------DSAKSSPT 345

Query: 1156 IVIRPPPVGANSSASSTVPF--INLNSIGSDS---ARRSEDFADNIHCNLKEPRPPLSSE 1320
             VI PP  G+  S S+  PF  +NL S  +D+   ++++  F D         +P   S 
Sbjct: 346  PVIGPPVAGSGFSPSNNAPFKIVNLGSCKTDADMCSKKAPSFID-----ADGVKPAFDSS 400

Query: 1321 RKEVCLDMSPNILGRRDYNNVDTFATKKEEFSNNISVTKDSFDHFFEKRCELQVPHMSNE 1500
            +  + LD+        D  ++ ++ TK EE  N   ++ D+  H    +   Q  ++ +E
Sbjct: 401  KLSIHLDID-------DPASLGSYVTKNEEMLNKECISSDTLHHVLIPKSGPQTSNVPHE 453

Query: 1501 NGYSLAHDAEPVNFIENSSKSFDQYNPAVDSPCWKGATANRSSPLSEAVTPQLFMRKLET 1680
                  +  E +N +E+SS++ D YN AVDSPCWKG  A RSSP   +V      ++ E 
Sbjct: 454  GFKLDLNTNENINSVEDSSENVDHYNHAVDSPCWKGVPATRSSPFDASVPET---KRQEV 510

Query: 1681 CNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGNDSSLSAKSPLVAGFSSTGH 1860
             ++ N+Q  QIF LNT D  +V S K ++N+  HE     N       +   A  + +  
Sbjct: 511  FSNSNVQTKQIFQLNTGD--KVSSQKRNDNMMCHEFGSPENGLEFPLNTSPAAKSTFSDR 568

Query: 1861 GPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPDNLKDDADFRMHAS-GTYVDDAS 2037
              DD VK GS        +     ++  +S     L  +L  + + + +      +++A 
Sbjct: 569  KSDDIVKIGSDLETKGIQHSNDIHEHGSRSTGCSDLKSSLNGEQNIQRNGLISENINEAL 628

Query: 2038 EDGLSHVPFHVTKHVLSSPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLH 2217
            +     +PF + ++++SS  S EDA + + KS E  SSP IDV +LV T+ NLSEL L H
Sbjct: 629  QCVSPRLPFPM-ENIISS--SVEDASTKLNKSNEGPSSPTIDVPVLVSTIRNLSELLLFH 685

Query: 2218 CSNDACTLKEQDNHTLQDVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKG 2397
            C++ +  LK++D  T+Q +I NL+VC SK +E+ +ST       QD++     S YL   
Sbjct: 686  CTSGSYQLKQKDLETIQSMIDNLSVCASKNSEKTVST-------QDSTSEKYTSDYLG-- 736

Query: 2398 ASVDRSQLTKGASVDRSQVTKVASAIIGQLDQQHIHEEKGDDI-VWGVKDEKFQNFASAQ 2574
               D++   KG ++++ QVTK A  I+  L  Q++H  KG+   V G ++++  +  S +
Sbjct: 737  ---DKNH--KGFTLNKLQVTKTAGPILDLLADQNVH--KGNKYYVAGKENDELLDSVSVR 789

Query: 2575 DDAEIVKNDKMTQAIKKILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMK 2754
             D +IV  DK  QA+KK+L +NF  EEE  PQ LLYKNLWLEAEA +CS + +ARF+R+K
Sbjct: 790  ADVDIVDEDKAIQALKKVLTDNFDYEEEASPQALLYKNLWLEAEAALCSMSCKARFNRVK 849

Query: 2755 IEMEKCNSHEAKD---------VSESSTNRISPDVNAINQLAPEPNG----GIQDYPIIN 2895
            +EME     ++KD         + + S + +SPD+N  N L+P+  G      Q+  +++
Sbjct: 850  LEMENPKLPKSKDAHGNTITTEMDKVSRSEVSPDLNGANTLSPKAKGCATTKSQESSVLS 909

Query: 2896 TSSHEVDVLARFHILKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQD---IPT 3066
            T++ + DV+ RF IL+ R   SN     D +KPSS K SP  +KV K++P+  +      
Sbjct: 910  TNAEDDDVMDRFQILRCRAKKSNYGIVADKDKPSSPKVSPHSNKVGKILPEANEETGSSK 969

Query: 3067 PEISVHDSRILSKTSHVDNVEDSVMDRLRILKCRVDNLTP-------TVNVEDPEV---P 3216
            P+I    S   S     ++ E SVM R  ILK R DN +P         NV+   +    
Sbjct: 970  PDIRRQASSNSSTDKPSNDYEASVMARFHILKSRGDNCSPLSTQGQLAENVDGSTIGSKS 1029

Query: 3217 DVNPGCDESWDENFEAIMGHVIEHE------KLKVKESSRLCD-DRMIHSCWTDKLGNQL 3375
            +V   C E      E  + H   H+      +L   E     D D M  S   ++  N L
Sbjct: 1030 EVGSSCVEP-----EPTLQH---HDADSTEGQLTGGEFPMFIDYDSMSQSHRPNRRENSL 1081

Query: 3376 PAGWCDSSSSSDWEHVLRD 3432
             AGW D   SS+WEHV  D
Sbjct: 1082 LAGWFD-RVSSEWEHVGND 1099


>ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citrus clementina]
            gi|557543534|gb|ESR54512.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
          Length = 842

 Score =  406 bits (1044), Expect = e-110
 Identities = 303/853 (35%), Positives = 420/853 (49%), Gaps = 54/853 (6%)
 Frame = +1

Query: 394  SINSPTT-HMPPP-NSVATDATDAYSFPEYLDSV-TTSLVEAKPYYPSYVSSAICEDSPL 564
            S N P+T H+PPP N +   +  ++ + +  D++ + +LVEA PYYPSYVS         
Sbjct: 67   SFNPPSTAHIPPPENPIPITSASSFLYGQSSDAIPSANLVEANPYYPSYVSPT------- 119

Query: 565  AGLNEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDG 744
                                  + DDY +SLS L  S +W          E  R+     
Sbjct: 120  --------------------KYTYDDYAQSLSSLWDSREW----------EFSRKLELGE 149

Query: 745  SFCSKESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLD 924
            SFCSKE NV   SIY+DY  QGA +SK L+   + +   +++ + G E   G  + + LD
Sbjct: 150  SFCSKEMNVPDLSIYQDYADQGAHSSKGLNTFEQKN---NNLDMLGSEQHQGSINREQLD 206

Query: 925  -DNFLPQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFR 1101
              +F  Q  + +  E  R SV GS+S+ PET+  + + +    SW+ Q    AS EK  +
Sbjct: 207  YKSFTGQISEFMPVEYSRKSVHGSTSLFPETYSLTSYEQG--RSWSHQTPYGASCEKGAK 264

Query: 1102 QHDTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINL-NSIGSDSARRSEDFADNI 1278
            QH    ND  S  KS P  V++   V  + S  STV F NL NS G  ++       DN+
Sbjct: 265  QHGISPNDISSVKKSSPVHVVKSQAVFTSLSPPSTVSFNNLENSSGVIASN------DNL 318

Query: 1279 HCNLKEPRPPLSSERK--------EVCLDMSPNILGRRDYNNVDTFATKKEEFSNNISVT 1434
              N+KE  P  SSE K           L+   +I  +  +        KKE+ S+N+SV 
Sbjct: 319  S-NMKEFYPLHSSEGKVHFDAGQVSFHLERGSHIFPKLPFE-------KKEKLSSNVSVI 370

Query: 1435 KDSFDHFFEKRCELQVPHMSNENGYSLAHDAEPVNFIENSSKSFDQYNPAVDSPCWKGAT 1614
            KD      +++  LQ+P +   +   +  +   +N  E SS+S D YNPAVDSPCWKGA 
Sbjct: 371  KDPL----KEKPGLQIPDIGPGSVSLMLANNRAINCSEGSSESLDHYNPAVDSPCWKGAP 426

Query: 1615 ANRSSPLSEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVC 1794
               S   S        + K+E C+  N  GP      T+++ +V   KPS+  +Y E+  
Sbjct: 427  DYHSPVESSGPVTLQHINKIEACSGSNSIGP------TDNSGKVSPQKPSDYSFYQEHGY 480

Query: 1795 GGNDSSLSAKSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPD 1974
              ND   S K    A      HG D  +K G    KSS   GVQFSD I K R++    +
Sbjct: 481  LENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMKSSYGLGVQFSDCIDKPRQDYVHAN 540

Query: 1975 NLKDDADFRMHASGTY-------------------------VDDASEDGLSHVPFHVTKH 2079
            N  D+  FR      Y                         ++  SE   SHVP H T+H
Sbjct: 541  NSADEFKFRPFHQVQYDSVENKLTFERKCELGSGVADVGLSINGTSEGCSSHVPLHATEH 600

Query: 2080 VLSSPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNH 2259
            VLSSP S E  P+ + K   E  +P++ V+ L+ TMHNLSEL L HCSND C LKE D  
Sbjct: 601  VLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFE 660

Query: 2260 TLQDVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASV 2439
             L+ V+ NL+ CISK        +E L   Q +S+FI +   LH+G +V   +       
Sbjct: 661  ALKLVVNNLDKCISKRMGPEAPIQESLLT-QKSSEFIREFPELHEGVTVSSPK------- 712

Query: 2440 DRSQVTKVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASA---------------- 2571
                 TK A +++ Q + QH+ E++  DI  G K EK  +F S                 
Sbjct: 713  ----ETKAAFSVLNQPNYQHVQEQRSPDIAAGKKSEKCSDFTSQGGHAERVKDDDMTQVH 768

Query: 2572 QDDAEIVKNDKMTQAIKKILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRM 2751
            +DDAE VK+D MTQAIKK+L +NF  EE+ + QVLLY+NLWLEAEA +CS NY+ARF+RM
Sbjct: 769  KDDAERVKDDNMTQAIKKVLSDNFVEEEDEKLQVLLYRNLWLEAEAALCSINYKARFNRM 828

Query: 2752 KIEMEKCNSHEAK 2790
            KIE+E C   +AK
Sbjct: 829  KIELENCKLLKAK 841


>ref|XP_007039225.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508776470|gb|EOY23726.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 827

 Score =  390 bits (1001), Expect = e-105
 Identities = 306/826 (37%), Positives = 416/826 (50%), Gaps = 35/826 (4%)
 Frame = +1

Query: 400  NSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVEAKPYYPSYVSSAICEDSPLAGLNE 579
            N  T + P P  +     +    P Y  +  + L E K YYPSYVS              
Sbjct: 64   NPYTFNSPQPAQLPQLDLEPTPTPSY--NQNSDLFEPKTYYPSYVSP------------- 108

Query: 580  PGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDGSFCSK 759
                        PL+  + ++  +SL GLD++ QWGG   GL D E G+     GSF  K
Sbjct: 109  ------------PLHVPTFNE--QSLPGLDHTAQWGG---GLWDWEKGKPAQLGGSFYLK 151

Query: 760  ESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLDDN-FL 936
            E++V  SSIY D++  GA  SK L  C E S   ++I  P RE   G ++ + LD N  L
Sbjct: 152  ETSVAPSSIYMDHINLGAHPSKSLKTCEETS---YNIYSP-REDQAGPANIEKLDYNPVL 207

Query: 937  PQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFRQHDTC 1116
             QN   +  + L+TSV GSSS + E +  +P L +L +  N     S  YEK  RQH T 
Sbjct: 208  GQNPSFMPVDYLKTSVIGSSSAISEANLQAPPL-NLVNCKNNHVQISTPYEKPLRQHGTT 266

Query: 1117 LNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINLNSIGSDSARRSEDFADNIHCNLKE 1296
            L+D   + KS P +VIRPP VG +SSAS++V F N+N+ G ++     + A N    ++E
Sbjct: 267  LSDSIPSVKSSPGVVIRPPAVGTSSSASNSVSFKNVNT-GINAT--DTNLAGNNRFIVEE 323

Query: 1297 PRPPLSSERKEVCLDMSPNIL--GRRDYNNVDTFATKKEEFSNNISVTKDSFDHFFEKRC 1470
            PR   +   K     +  + L  G    +   + +T+K    N  S      D+FF  + 
Sbjct: 324  PRFLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMAS------DNFFGAKS 377

Query: 1471 ELQVPHMSNENGYSLAHDA-EPVNFIENSSKSFDQYNPAVDSPCWKGATANRSSPL--SE 1641
             + +  +S +N +SLA +  E V  +ENS +S D YNP VDSPCWKGA A+ +SP   SE
Sbjct: 378  GVNLSRISPDN-FSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE 436

Query: 1642 AVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGNDSSLSA 1821
             V  QL  +KLE C+  N    +    NT + V+  S K  E L   EN    + S  S 
Sbjct: 437  PVAVQL-AKKLEACDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSL 495

Query: 1822 KSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPDNLKDDADFR 2001
            K P V+  S   H PD+A K GS   K+S A  V+FSD   + +K+  L D   D+ +  
Sbjct: 496  KLPPVSIPSFKEHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKA 555

Query: 2002 MHASGTY------------------------VDDASEDGLSHVPFHVTKHVLSSPPSGED 2109
             H S                           ++D S  G SHV  H  KH+  +P S ED
Sbjct: 556  SHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVED 615

Query: 2110 APSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTLQDVIKNLN 2289
              +   K   +       + +LV TM NLSEL L HCSN+AC L+EQD  +L+ VI NL+
Sbjct: 616  VSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLD 675

Query: 2290 VCISKTAEQ---MISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASVD-RSQVT 2457
             C+SK   Q   +    +  FP    +      S LHKG S    Q+   A++D  SQ T
Sbjct: 676  TCMSKNIGQETLLSELHKVWFPMSKKNGQESLLSELHKGTSTGSPQV---AAIDVLSQHT 732

Query: 2458 KVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEI-VKNDKMTQAIKKILV 2634
                    Q+ ++H          +G KDEK   F S +   +I VKNDKMTQAIKK+L+
Sbjct: 733  --------QVKRKH----------FGKKDEKCSEFVSVRSGTDIKVKNDKMTQAIKKVLI 774

Query: 2635 ENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKC 2772
            ENFH +EET PQVLLYKNLWLEAEA +CS NY AR++ MKIE+EKC
Sbjct: 775  ENFHEKEETHPQVLLYKNLWLEAEAALCSINYMARYNNMKIEIEKC 820


>ref|XP_006441269.1| hypothetical protein CICLE_v10018632mg [Citrus clementina]
            gi|567897564|ref|XP_006441270.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
            gi|557543531|gb|ESR54509.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
            gi|557543532|gb|ESR54510.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
          Length = 807

 Score =  333 bits (855), Expect = 3e-88
 Identities = 264/781 (33%), Positives = 370/781 (47%), Gaps = 38/781 (4%)
 Frame = +1

Query: 394  SINSPTT-HMPPP-NSVATDATDAYSFPEYLDSV-TTSLVEAKPYYPSYVSSAICEDSPL 564
            S N P+T H+PPP N +   +  ++ + +  D++ + +LVEA PYYPSYVS         
Sbjct: 67   SFNPPSTAHIPPPENPIPITSASSFLYGQSSDAIPSANLVEANPYYPSYVSPT------- 119

Query: 565  AGLNEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDG 744
                                  + DDY +SLS L  S +W          E  R+     
Sbjct: 120  --------------------KYTYDDYAQSLSSLWDSREW----------EFSRKLELGE 149

Query: 745  SFCSKESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLD 924
            SFCSKE NV   SIY+DY  QGA +SK L+   + +   +++ + G E   G  + + LD
Sbjct: 150  SFCSKEMNVPDLSIYQDYADQGAHSSKGLNTFEQKN---NNLDMLGSEQHQGSINREQLD 206

Query: 925  -DNFLPQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFR 1101
              +F  Q  + +  E  R SV GS+S+ PET+  + + +    SW+ Q    AS EK  +
Sbjct: 207  YKSFTGQISEFMPVEYSRKSVHGSTSLFPETYSLTSYEQG--RSWSHQTPYGASCEKGAK 264

Query: 1102 QHDTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINL-NSIGSDSARRSEDFADNI 1278
            QH    ND  S  KS P  V++   V  + S  STV F NL NS G  ++       DN+
Sbjct: 265  QHGISPNDISSVKKSSPVHVVKSQAVFTSLSPPSTVSFNNLENSSGVIASN------DNL 318

Query: 1279 HCNLKEPRPPLSSERK--------EVCLDMSPNILGRRDYNNVDTFATKKEEFSNNISVT 1434
              N+KE  P  SSE K           L+   +I  +  +        KKE+ S+N+SV 
Sbjct: 319  S-NMKEFYPLHSSEGKVHFDAGQVSFHLERGSHIFPKLPFE-------KKEKLSSNVSVI 370

Query: 1435 KDSFDHFFEKRCELQVPHMSNENGYSLAHDAEPVNFIENSSKSFDQYNPAVDSPCWKGAT 1614
            KD      +++  LQ+P +   +   +  +   +N  E SS+S D YNPAVDSPCWKGA 
Sbjct: 371  KDPL----KEKPGLQIPDIGPGSVSLMLANNRAINCSEGSSESLDHYNPAVDSPCWKGAP 426

Query: 1615 ANRSSPLSEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVC 1794
               S   S        + K+E C+  N  GP      T+++ +V   KPS+  +Y E+  
Sbjct: 427  DYHSPVESSGPVTLQHINKIEACSGSNSIGP------TDNSGKVSPQKPSDYSFYQEHGY 480

Query: 1795 GGNDSSLSAKSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPD 1974
              ND   S K    A      HG D  +K G    KSS   GVQFSD I K R++    +
Sbjct: 481  LENDPESSPKRSSRANLLFEEHGYDRDLKTGFYQMKSSYGLGVQFSDCIDKPRQDYVHAN 540

Query: 1975 NLKDDADFRMHASGTY-------------------------VDDASEDGLSHVPFHVTKH 2079
            N  D+  FR      Y                         ++  SE   SHVP H T+H
Sbjct: 541  NSADEFKFRPFHQVQYDSVENKLTFERKCELGSGVADVGLSINGTSEGCSSHVPLHATEH 600

Query: 2080 VLSSPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNH 2259
            VLSSP S E  P+ + K   E  +P++ V+ L+ TMHNLSEL L HCSND C LKE D  
Sbjct: 601  VLSSPSSVEAVPARLNKLHGEQLAPQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFE 660

Query: 2260 TLQDVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASV 2439
             L+ V+ NL+ CISK        +E L   Q +S+FI +   LH           +G +V
Sbjct: 661  ALKLVVNNLDKCISKRMGPEAPIQESLLT-QKSSEFIREFPELH-----------EGVTV 708

Query: 2440 DRSQVTKVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEIVKNDKMTQAI 2619
               + TK A +++ Q + QH+ E++  DI  G K EK  +F S    AE VK+D MTQ  
Sbjct: 709  SSPKETKAAFSVLNQPNYQHVQEQRSPDIAAGKKSEKCSDFTSQGGHAERVKDDDMTQVH 768

Query: 2620 K 2622
            K
Sbjct: 769  K 769


>ref|XP_007039227.1| Uncharacterized protein isoform 8, partial [Theobroma cacao]
            gi|508776472|gb|EOY23728.1| Uncharacterized protein
            isoform 8, partial [Theobroma cacao]
          Length = 828

 Score =  287 bits (734), Expect = 3e-74
 Identities = 281/892 (31%), Positives = 402/892 (45%), Gaps = 37/892 (4%)
 Frame = +1

Query: 400  NSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVEAKPYYPSYVSSAICEDSPLAGLNE 579
            N  T + P P  +     +    P Y  +  + L E K YYPSYVS              
Sbjct: 53   NPYTFNSPQPAQLPQLDLEPTPTPSY--NQNSDLFEPKTYYPSYVSP------------- 97

Query: 580  PGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDGSFCSK 759
                        PL+  + ++  +SL GLD++ QWGG   GL D E G+     GSF  K
Sbjct: 98   ------------PLHVPTFNE--QSLPGLDHTAQWGG---GLWDWEKGKPAQLGGSFYLK 140

Query: 760  ESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLDDN-FL 936
            E++V  SSIY D++  GA  SK L  C E S   ++I  P RE   G ++ + LD N  L
Sbjct: 141  ETSVAPSSIYMDHINLGAHPSKSLKTCEETS---YNIYSP-REDQAGPANIEKLDYNPVL 196

Query: 937  PQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFRQHDTC 1116
             QN   +  + L+TSV GSSS + E +  +P L +L +  N     S  YEK  RQH T 
Sbjct: 197  GQNPSFMPVDYLKTSVIGSSSAISEANLQAPPL-NLVNCKNNHVQISTPYEKPLRQHGTT 255

Query: 1117 LNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINLNSIGSDSARRSEDFADNIHCNLKE 1296
            L+D   + KS P +VIRPP VG +SSAS++V F N+N+ G ++     + A N    ++E
Sbjct: 256  LSDSIPSVKSSPGVVIRPPAVGTSSSASNSVSFKNVNT-GINAT--DTNLAGNNRFIVEE 312

Query: 1297 PRPPLSSERKEVCLDMSPNIL--GRRDYNNVDTFATKKEEFSNNISVTKDSFDHFFEKRC 1470
            PR   +   K     +  + L  G    +   + +T+K    N  S      D+FF  + 
Sbjct: 313  PRFLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMAS------DNFFGAKS 366

Query: 1471 ELQVPHMSNENGYSLAHDA-EPVNFIENSSKSFDQYNPAVDSPCWKGATANRSSPL--SE 1641
             + +  +S +N +SLA +  E V  +ENS +S D YNP VDSPCWKGA A+ +SP   SE
Sbjct: 367  GVNLSRISPDN-FSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE 425

Query: 1642 AVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGNDSSLSA 1821
             V  QL  +KLE C+  N    +    NT + V+  S K  E L   EN    + S  S 
Sbjct: 426  PVAVQL-AKKLEACDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSL 484

Query: 1822 KSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPDNLKDDADFR 2001
            K P V+  S   H PD+A K GS   K+S A  V+FSD   + +K+  L D   D+ +  
Sbjct: 485  KLPPVSIPSFKEHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKA 544

Query: 2002 MHASGTY------------------------VDDASEDGLSHVPFHVTKHVLSSPPSGED 2109
             H S                           ++D S  G SHV  H  KH+  +P S ED
Sbjct: 545  SHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVED 604

Query: 2110 APSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTLQDVIKNLN 2289
              +   K   +       + +LV TM NLSEL L HCSN+AC L+EQD  +L+ VI NL+
Sbjct: 605  VSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLD 664

Query: 2290 VCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASVDRSQVTKVAS 2469
             C+SK   Q     E L    D S+             + RS+L+  A +D ++     +
Sbjct: 665  TCMSKNIGQ-----ETLLSELDLSEDTPDED------KISRSKLS--ADLDTNKKLTAIA 711

Query: 2470 AIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFASAQDDAEIVKNDKMTQAIKKILVENFHG 2649
                 LD               V ++ F   AS+ + A+ V        + K  + N + 
Sbjct: 712  ESAPTLD---------------VSNQNFP-IASSSNHADDV---TARFHVLKHRLNNSYS 752

Query: 2650 EEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKCNSHEAKDVSESSTNRISPD 2829
                +   L    L L+++AV          D++           A +V +SST+ +   
Sbjct: 753  VHTRDADELSSSKLSLDSDAV----------DKL-----------ATEVKDSSTSSLQ-- 789

Query: 2830 VNAINQLAPEPNGGIQDYPIINTSSH----EVDVLARFHILKSRDN---DSN 2964
                           QD P+  T+ H    E  ++ R HILKSR N   DSN
Sbjct: 790  --------------TQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSN 827



 Score = 94.7 bits (234), Expect = 3e-16
 Identities = 55/137 (40%), Positives = 81/137 (59%), Gaps = 6/137 (4%)
 Frame = +1

Query: 2776 SHEAKDVSESSTNRISPDVN------AINQLAPEPNGGIQDYPIINTSSHEVDVLARFHI 2937
            S +  D  + S +++S D++      AI + AP  +   Q++PI ++S+H  DV ARFH+
Sbjct: 683  SEDTPDEDKISRSKLSADLDTNKKLTAIAESAPTLDVSNQNFPIASSSNHADDVTARFHV 742

Query: 2938 LKSRDNDSNSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISVHDSRILSKTSHV 3117
            LK R N+S SV+  DA++ SSSK S + D VDKL  + +D  T  +   DS +     H 
Sbjct: 743  LKHRLNNSYSVHTRDADELSSSKLSLDSDAVDKLATEVKDSSTSSLQTQDSPVPGTACHT 802

Query: 3118 DNVEDSVMDRLRILKCR 3168
            D+VE S+M RL ILK R
Sbjct: 803  DDVEASIMTRLHILKSR 819


>ref|XP_006347526.1| PREDICTED: uncharacterized protein LOC102592566 isoform X1 [Solanum
            tuberosum]
          Length = 1173

 Score =  286 bits (732), Expect = 5e-74
 Identities = 292/984 (29%), Positives = 433/984 (44%), Gaps = 55/984 (5%)
 Frame = +1

Query: 394  SINSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVEAKPYYPSYVSSAICEDSPLAGL 573
            S ++PT H          +TD YSF              + YY  YV S +  + P A  
Sbjct: 106  STSNPTVHA---------STDTYSFGR------------EGYYAPYVPSIVSNEHPSAAF 144

Query: 574  NEPGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDGSFC 753
            NEP  D+L  S    ++ SS  DYT+SLSGL+Y   W  F++ + D +   + G DGSF 
Sbjct: 145  NEPSLDVLPNSGSIHVDASSQVDYTQSLSGLEYP-HWS-FFSKVADGKQDERNGVDGSFS 202

Query: 754  SKESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLDDNF 933
                N  +S  Y++ + QG +      A  ++ A +    V    S +G    K    ++
Sbjct: 203  LGNVNAGASYGYRNCMSQGNSLEGVNIAREDSGAGNFIDGVYTGPSSMGHMDAK----SY 258

Query: 934  LPQN--FKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFRQH 1107
            L Q   ++S++ E   T++G   S+LP +      L S  +  N++   +  +EK F+  
Sbjct: 259  LTQEPIYQSLNSE---TAMG---SILPVSCQVGLSLGSSNNYLNYENPFTP-HEKFFQPL 311

Query: 1108 DTCLNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINLN--SIGSDSARRSEDFADNIH 1281
            D+C  D  S  KS P +VIRP P G+   A       N++    G+ ++ +S D  D + 
Sbjct: 312  DSCPRDTTSTSKSSPVVVIRPAPSGSRFFAPKIDLHKNVDICKTGATNSEKS-DVCDLLK 370

Query: 1282 CNLKEPRPPLSSERKEVCLDMSPNILGRRDYNNVDTFATKKEEFSNNISV---TKDSFDH 1452
               +E R P+ S  KE  L  S  +    D++ +          +N  S    + +S + 
Sbjct: 371  S--QETRLPIDSPIKEFSLGSSTPL----DFDKIKNIFFASSSVNNLCSTRPCSSNSIEI 424

Query: 1453 FFEKRCELQVPHMSNENGYSLAHDAEPVNFIENSSKSFDQYNPAVDSPCWKGATANRSS- 1629
              ++R   Q P  S          A PV F E  S + D +NP VDSPCWKGA A R S 
Sbjct: 425  AVKERSGSQAPCAS----------APPVTFAEKCSDALDLHNPNVDSPCWKGAPAFRISL 474

Query: 1630 -PLSEAVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGND 1806
                +A +P LF  K+E  +    Q   +FP               ENL  H NV  GN 
Sbjct: 475  GDSVDASSPCLFTSKVEFADFS--QSNPLFPPAEYSGKTSLKKLGEENLHNH-NVYAGNG 531

Query: 1807 SSLSAKSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGV-QFSDYILKSRKEDALPDNLK 1983
             S+ +       +++      D  K        S   G+ +FS+ + K  K  +LP   +
Sbjct: 532  LSVPSVGTGTNNYTTEELRTIDVTKETFVPMDLSSNGGIPKFSEDLNKPSKGYSLPQYSE 591

Query: 1984 DDADFR-------------------------MHASGTYVDDASEDGLSHVPFHVTKHVLS 2088
            +D   +                         MH +G  ++D  E G+  V     ++VL 
Sbjct: 592  NDCQLQYSWGKHLSVDGHQYGPKKHNLPEGYMH-TGLSLNDTLEGGV--VALDAAENVLR 648

Query: 2089 SPPSGEDAPSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTLQ 2268
            SP S EDA  A  +  +  SSPK+DVQ LV  +HNLSEL    C  +AC L+ QD  TL+
Sbjct: 649  SPASQEDAKQA--QQYQMGSSPKLDVQTLVHAIHNLSELLKSQCLANACLLEGQDIDTLK 706

Query: 2269 DVIKNLNVCISKTAEQMISTREPLFPYQDTSQFIEKSSYLHKGASVDRSQLTKGASVDRS 2448
              I NL  C +K  E    T++ +    DT +  E+S     G      Q  +  + D  
Sbjct: 707  SAITNLGACTAKKIE----TKDTMVSQHDTFEKFEESRRSFMGTETGHPQFMEEVAWDSC 762

Query: 2449 QVTKVASAIIGQLDQQHIHEEKGDDIVWGVKDEKFQNFA--SAQDDAEIVKNDKMTQAIK 2622
                        LD Q   E+K  +       +K +N A  +  DD      +++ QAIK
Sbjct: 763  G-----------LDNQPTPEDKSKN-----NGKKTENSALLTPADDLGDSNEEQVVQAIK 806

Query: 2623 KILVENFHGEEETEPQVLLYKNLWLEAEAVICSTNYRARFDRMKIEMEKCNSHEAKDVSE 2802
            K+L ENF  +E  +PQ LL+KNLWLEAEA +CS +Y++RFDRMKIEMEK           
Sbjct: 807  KVLNENFLSDEGMQPQALLFKNLWLEAEAKLCSLSYKSRFDRMKIEMEK----------- 855

Query: 2803 SSTNRISPDVNAINQLAPEPNGG-----IQDYPIINTSSHEVD--VLARFHILKSRDNDS 2961
               +R S ++N  + +APE             P  ++ S  +D  V+ RF+IL  R+   
Sbjct: 856  ---HRFSQELNLNSSVAPEAENDSASKITTQSPSTSSKSVHIDDSVMERFNILNRRE--- 909

Query: 2962 NSVNNLDAEKPSSSKFSPELDKVDKLVPQPQDIPTPEISV-------HDSRILSKTSHVD 3120
                    EK SSS    E D V K+    +D  T  +++         S  + +    D
Sbjct: 910  --------EKLSSSFMKEENDSV-KVGSDSEDSVTMRLNILRKQGNNSSSSFMQEKKASD 960

Query: 3121 ----NVEDSVMDRLRILKCRVDNL 3180
                + EDSVM+R  IL+ R DNL
Sbjct: 961  IVSSDTEDSVMERFNILRRREDNL 984


>ref|XP_007039226.1| Uncharacterized protein isoform 7 [Theobroma cacao]
            gi|508776471|gb|EOY23727.1| Uncharacterized protein
            isoform 7 [Theobroma cacao]
          Length = 761

 Score =  283 bits (723), Expect = 5e-73
 Identities = 234/669 (34%), Positives = 325/669 (48%), Gaps = 30/669 (4%)
 Frame = +1

Query: 400  NSPTTHMPPPNSVATDATDAYSFPEYLDSVTTSLVEAKPYYPSYVSSAICEDSPLAGLNE 579
            N  T + P P  +     +    P Y  +  + L E K YYPSYVS              
Sbjct: 64   NPYTFNSPQPAQLPQLDLEPTPTPSY--NQNSDLFEPKTYYPSYVSP------------- 108

Query: 580  PGYDLLSASRVGPLNGSSVDDYTRSLSGLDYSTQWGGFWNGLTDLESGRQTGFDGSFCSK 759
                        PL+  + ++  +SL GLD++ QWGG   GL D E G+     GSF  K
Sbjct: 109  ------------PLHVPTFNE--QSLPGLDHTAQWGG---GLWDWEKGKPAQLGGSFYLK 151

Query: 760  ESNVTSSSIYKDYVKQGAATSKDLSACVEASAVSHSIIVPGRESCIGFSSTKPLDDN-FL 936
            E++V  SSIY D++  GA  SK L  C E S   ++I  P RE   G ++ + LD N  L
Sbjct: 152  ETSVAPSSIYMDHINLGAHPSKSLKTCEETS---YNIYSP-REDQAGPANIEKLDYNPVL 207

Query: 937  PQNFKSISDESLRTSVGGSSSVLPETHPNSPFLESLTHSWNFQKANSASYEKCFRQHDTC 1116
             QN   +  + L+TSV GSSS + E +  +P L +L +  N     S  YEK  RQH T 
Sbjct: 208  GQNPSFMPVDYLKTSVIGSSSAISEANLQAPPL-NLVNCKNNHVQISTPYEKPLRQHGTT 266

Query: 1117 LNDCKSAPKSPPAIVIRPPPVGANSSASSTVPFINLNSIGSDSARRSEDFADNIHCNLKE 1296
            L+D   + KS P +VIRPP VG +SSAS++V F N+N+ G ++     + A N    ++E
Sbjct: 267  LSDSIPSVKSSPGVVIRPPAVGTSSSASNSVSFKNVNT-GINAT--DTNLAGNNRFIVEE 323

Query: 1297 PRPPLSSERKEVCLDMSPNIL--GRRDYNNVDTFATKKEEFSNNISVTKDSFDHFFEKRC 1470
            PR   +   K     +  + L  G    +   + +T+K    N  S      D+FF  + 
Sbjct: 324  PRFLFNFGSKNEFDPIQHSFLLDGNCYMSGESSTSTEKLSTRNMAS------DNFFGAKS 377

Query: 1471 ELQVPHMSNENGYSLAHDA-EPVNFIENSSKSFDQYNPAVDSPCWKGATANRSSPL--SE 1641
             + +  +S +N +SLA +  E V  +ENS +S D YNP VDSPCWKGA A+ +SP   SE
Sbjct: 378  GVNLSRISPDN-FSLAFENNEAVIAVENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSE 436

Query: 1642 AVTPQLFMRKLETCNSLNLQGPQIFPLNTNDAVRVFSPKPSENLWYHENVCGGNDSSLSA 1821
             V  QL  +KLE C+  N    +    NT + V+  S K  E L   EN    + S  S 
Sbjct: 437  PVAVQL-AKKLEACDGSNGLVLKFISSNTANMVKHPSGKAGEILMSDENGNVEDGSMSSL 495

Query: 1822 KSPLVAGFSSTGHGPDDAVKPGSCSTKSSCAYGVQFSDYILKSRKEDALPDNLKDDADFR 2001
            K P V+  S   H PD+A K GS   K+S A  V+FSD   + +K+  L D   D+ +  
Sbjct: 496  KLPPVSIPSFKEHEPDEAGKAGSHKNKASSACEVKFSDNASEWKKDYVLFDKSVDEVEKA 555

Query: 2002 MHASGTY------------------------VDDASEDGLSHVPFHVTKHVLSSPPSGED 2109
             H S                           ++D S  G SHV  H  KH+  +P S ED
Sbjct: 556  SHTSQQCLAEGRLASKNLCRSETGVADLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVED 615

Query: 2110 APSAIAKSCEENSSPKIDVQMLVKTMHNLSELFLLHCSNDACTLKEQDNHTLQDVIKNLN 2289
              +   K   +       + +LV TM NLSEL L HCSN+AC L+EQD  +L+ VI NL+
Sbjct: 616  VSTKHTKFLGKEPVSNSSISVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLD 675

Query: 2290 VCISKTAEQ 2316
             C+SK   Q
Sbjct: 676  TCMSKNIGQ 684


Top