BLASTX nr result

ID: Sinomenium21_contig00007170 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00007170
         (2514 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853...   257   2e-65
ref|XP_007039221.1| Uncharacterized protein isoform 2 [Theobroma...   179   6e-42
ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Popu...   177   2e-41
ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citr...   164   2e-37
ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628...   162   9e-37
ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citr...   159   6e-36
ref|XP_007039224.1| Uncharacterized protein isoform 5 [Theobroma...   159   6e-36
ref|XP_007039222.1| Uncharacterized protein isoform 3 [Theobroma...   159   8e-36
ref|XP_007039220.1| Uncharacterized protein isoform 1 [Theobroma...   159   8e-36
ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus c...   156   5e-35
ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Popu...   155   1e-34
ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301...   150   3e-33
ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prun...   150   3e-33
gb|EXB94712.1| hypothetical protein L484_002599 [Morus notabilis]     147   2e-32
ref|XP_007039225.1| Uncharacterized protein isoform 6 [Theobroma...   145   1e-31
ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citr...   143   3e-31
gb|EYU45327.1| hypothetical protein MIMGU_mgv1a001518mg [Mimulus...   133   5e-28
ref|XP_004235030.1| PREDICTED: uncharacterized protein LOC101252...   125   7e-26
ref|XP_002300521.2| hypothetical protein POPTR_0001s45660g [Popu...   124   2e-25
ref|XP_003523306.2| PREDICTED: uncharacterized protein LOC100778...   122   8e-25

>ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853355 [Vitis vinifera]
            gi|302143995|emb|CBI23100.3| unnamed protein product
            [Vitis vinifera]
          Length = 1167

 Score =  257 bits (657), Expect = 2e-65
 Identities = 224/770 (29%), Positives = 347/770 (45%), Gaps = 77/770 (10%)
 Frame = +2

Query: 152  DNDDNADSHSPAASNIKNPSIKVSAEDKGC---SHIIGNKMEQNDHFIMDLSPVEKKEPS 322
            DN +N   H    SN++ P I V +E +     +  +    ++NDH  M+ S  +K E  
Sbjct: 399  DNSENVSGHH--LSNMEEPHIPVISEGRELYSDTSQLNGHWQRNDHLSMESSSTKKHELL 456

Query: 323  NCNLIIHDYSNHLCKNSELQDSQFDLTDTFISAPSVASGVCSFVQSSSVTCDQFNPVVDS 502
            N  + + +  N L   SELQ    ++ D F  +P+    V S + ++S T D +NP VDS
Sbjct: 457  NNEMGVKETDNLLRARSELQIPHLNVEDGFSFSPNSIEAVNS-IDNTSETLDHYNPAVDS 515

Query: 503  PCWKGSLASRQSPFSVTDLVTP-KLVNGVAGGNVLNHQDLQSLPVNAEEAFSVSSQYLHK 679
            PCWKGS+ S  SPF V++ ++P  L+  +   +  N Q     P+N+++A +VSS   ++
Sbjct: 516  PCWKGSITSHFSPFEVSEALSPHNLMEQLEALDGFNLQGHHIFPLNSDDAVNVSSLKPNE 575

Query: 680  GLDYNSNRSVEN-------ESSFLKIPS----------------KMSSRNEVHISYGVEE 790
              +Y+ N   EN         S +  PS                K+SS +    S  + +
Sbjct: 576  NTEYHKNVCGENGLLPSWKRPSVVNHPSREQRSLDAFKTGPYCQKLSSGDGNQSSNDIIQ 635

Query: 791  PIKKGSLPGKTK---LTPFQTLASSHEEGNIAPTGQIGLFGGVVDPFMDIKDSNYPSTF- 958
            P +  SL   +K   L    T+  S EE       ++    GV     +I D +   +  
Sbjct: 636  PKRDHSLLNSSKSDNLELSHTMRQSFEEVKFTSERKLSSGVGVEVTGNNINDVSRDGSSH 695

Query: 959  -LFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDSQLLVSTMKNMSEVLCS 1135
              ++  E+            T+L           A+   P ID  +L++T++++S +L S
Sbjct: 696  ETYHLTENISCSPLSGDDASTKLTKQ-------PASESTPKIDVHMLINTVQDLSVLLLS 748

Query: 1136 ICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIRSVPELQCPQSGVSHHLGKQACTHKV 1315
             CS+N ++L EQDH  L+ VI+N DAC+  K         +  + G SH LG+    +K 
Sbjct: 749  HCSDNAFSLKEQDHETLKRVIDNFDACLTKKGQ-------KIAEQGSSHFLGELPDLNKS 801

Query: 1316 P--------KIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQDF-SYLSSDTFEEDDSM 1468
                     K+    ++ Q   QS  +   H  +   K +   DF S ++ +    DDS 
Sbjct: 802  ASASWPLGKKVADANVEDQFHCQSDHKGKRHCSVSGNKDEKLSDFVSLVNDEDTVNDDST 861

Query: 1469 IQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDIEMEKYKLCKT 1648
            IQAI+K+L K FHDE+E  PQ LLY+NLWLEAEAALCSI Y+ARF R+ IEMEK+KL KT
Sbjct: 862  IQAIRKILDKNFHDEEETDPQALLYRNLWLEAEAALCSISYRARFDRMKIEMEKFKLRKT 921

Query: 1649 K-ARSGTVELPLNMEKLWNSTVSDSNLYTDATNEMSTPKIYDPSYSRITGHTEDVEASVM 1825
            +     T+++        +S +S  + +     E   P I       +T  T    A V+
Sbjct: 922  EDLLKNTIDVEKQSSSKVSSDISMVDKFEREAQENPVPDITIEDSPNVT--TMSHAADVV 979

Query: 1826 DRFHVLK-------------------CRIDKPVPSDER--------------KFQEAVDV 1906
            DRFH+LK                   C++   + SD+                  ++ DV
Sbjct: 980  DRFHILKRRYENSDSLNSKDVGKQSSCKVSHDMNSDDNLAPAAKDDHSPNISTSTQSDDV 1039

Query: 1907 VVHERMEETTNPCSRNKQNGRMYSQPTNFDVDFVRRKNPCMFIGRELEDGILEARGNLQG 2086
            +   R+ +     S N  N      P   D++F  + +  MFI   +ED  L    +LQ 
Sbjct: 1040 MARFRILKCRADKS-NPMNAERQQPPEEVDLEFAGKGSHWMFIKDRVEDVTLGP--DLQV 1096

Query: 2087 HITNNRGKK--SALDLEEGDNVKGFQACFSDGSMIQSPVPNKCGGWPAAG 2230
            HI N+   +  S LD  + + VK F     D  +IQ P  N+      AG
Sbjct: 1097 HIANHTKDRFDSYLDDFDCEIVKEFHEHAMDDPVIQLPRSNRLQNQLPAG 1146


>ref|XP_007039221.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508776466|gb|EOY23722.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1017

 Score =  179 bits (454), Expect = 6e-42
 Identities = 185/656 (28%), Positives = 283/656 (43%), Gaps = 55/656 (8%)
 Frame = +2

Query: 452  VQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVLNHQDLQSLP 631
            V++S  + D +NP VDSPCWKG+ AS  SPF  ++ V  +L   +   +  N   L+ + 
Sbjct: 402  VENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPVAVQLAKKLEACDGSNGLVLKFIS 461

Query: 632  VNAEEAFSVSSQYLHKGLDYNSNRSVENES-SFLKIP----------------------S 742
             N        S    + L  + N +VE+ S S LK+P                      +
Sbjct: 462  SNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKLPPVSIPSFKEHEPDEAGKAGSHKN 521

Query: 743  KMSSRNEVHISYGVEEPIKKGSLPGKTKLTPFQTLASSHE---EGNIAPTGQIGLFGGVV 913
            K SS  EV  S    E  K   L  K+     +   +S +   EG +A         GV 
Sbjct: 522  KASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEGRLASKNLCRSETGVA 581

Query: 914  DPFMDIKDSNY--PSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDS 1087
            D  M I D +    S    +A +H            T+        +  L   P      
Sbjct: 582  DLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTK-------HTKFLGKEPVSNSSI 634

Query: 1088 QLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIRSV-PELQCP 1264
             +LV TM+N+SE+L   CSN    L EQD   L+ VINNLD C+   +   ++  EL   
Sbjct: 635  SVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQETLLSELHKV 694

Query: 1265 QSGVSHHLGKQACTHKV--------PKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQ 1420
               +S   G+++   ++        P++ A  + SQ          +      +K +   
Sbjct: 695  WFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQ-------HTQVKRKHFGKKDEKCS 747

Query: 1421 DFSYLSS--DTFEEDDSMIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYK 1594
            +F  + S  D   ++D M QAIKKVL + FH+++E  PQ+LLYKNLWLEAEAALCSI Y 
Sbjct: 748  EFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYM 807

Query: 1595 ARFARLDIEMEKYKLCKTKARSGTV----ELPLNMEKLWNSTVSDSNLYTDATNEMSTPK 1762
            AR+  + IE+EK KL   K  S       ++  + ++L +S +S   L +DA ++++T +
Sbjct: 808  ARYNNMKIEIEKCKLDTEKDLSEDTPDEDKISRDADELSSSKLS---LDSDAVDKLAT-E 863

Query: 1763 IYDPSYSRI----------TGHTEDVEASVMDRFHVLKCRIDKPVPSDERKFQEAVDVVV 1912
            + D S S +            HT+DVEAS+M R H+LK R +  + S+E + +   +VV 
Sbjct: 864  VKDSSTSSLQTQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSNEMEQKPLPEVV- 922

Query: 1913 HERMEETTNPCSRNKQNGRMYSQPTNFDVDFVRRKNPCMFIGRELEDGIL--EARGNLQG 2086
                                       D+ F  +K          +DG+L        Q 
Sbjct: 923  ---------------------------DLGFAGKKKQIPIDEDTADDGVLGFNLESVSQN 955

Query: 2087 HITNNRGKKSALDLEEGDNVKGFQACFSDGSMIQSPVPNKCGGWPAAGGYENAVND 2254
             + +  G++S         VK F  C      IQSP   + G   +AG Y++  +D
Sbjct: 956  QVVDYAGEQSV--------VKDFHLCVKHDCTIQSPKSTRLGNQLSAGWYDSCSSD 1003


>ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Populus trichocarpa]
            gi|550321678|gb|EEF06077.2| hypothetical protein
            POPTR_0015s00600g [Populus trichocarpa]
          Length = 1236

 Score =  177 bits (449), Expect = 2e-41
 Identities = 166/610 (27%), Positives = 267/610 (43%), Gaps = 34/610 (5%)
 Frame = +2

Query: 131  GYTGDEHDNDDNADSHSPAASNIKNPSIKVSAEDKGC--SHIIGNKMEQNDHFIMDLSPV 304
            G  GDE  N+         +S+++ P+  +S+E K    S  I   ++QND ++ ++S  
Sbjct: 347  GCDGDEKGNN---------SSSVQEPNPFISSEGKVFYDSSQINFHLKQNDDYLAEISSK 397

Query: 305  EKKEPSNCNLIIHDYSNHLCKNSELQDSQFDLTDTFISAPSVASGVCSFVQSSSVTCDQF 484
              + PSN N+ + D+ + L K +++ +        F +           V+++S + D +
Sbjct: 398  NNELPSNKNISV-DFFDQLFK-AKMDNKVLRRNLDFFNLAMDGHEAIGSVENTSESLDHY 455

Query: 485  NPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVLNHQDLQSLPVNAEEAFSVSS 664
            NP VDSPCWKG+  S  S F ++++V P +   V   N L+ Q  Q  P    +A     
Sbjct: 456  NPAVDSPCWKGAPVSHLSAFEISEVVDPLIPKKVEACNGLSPQGPQIFPSATNDAVKACP 515

Query: 665  QYLHKGLDYNSNRSVENES-SFLKIP--SKMSSRNEVHISYGVEEPIKKGSLPGKTKLTP 835
            +         ++ S+E++  S  K P  +K+  R E+  +                K  P
Sbjct: 516  EKQSNISVPLNHESLEHQQVSLFKRPLDAKVLFREEIDDA---------------GKYGP 560

Query: 836  FQTLASSHEEGNIAPT--------GQIGLFGGVVDPFMDIKDSNYPSTFLFYAKEHXXXX 991
            +Q + S   E  I+            +  F  +      ++D  +PS    Y  +     
Sbjct: 561  YQRIPSYCHEAQISDVIDDETRKESILSDFNSLHTEQRSLEDGEWPSKKNSYVADVRRKI 620

Query: 992  XXXXXXXXTRLANPFSGASDTLANNPRPT-----------------IDSQLLVSTMKNMS 1120
                    + +  PF      L + P                    + ++ LV TM N++
Sbjct: 621  NDDPDDCSSHV--PFHAIEQVLCSPPSSEHAPAQHTQSQGEESLSKMHARTLVDTMHNLA 678

Query: 1121 EVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIR-SVPELQCPQSGVSHHLGKQ 1297
            E+L    SN+   L ++D  VL+ VINNLD C+   +  + S  E   PQ   S   GK 
Sbjct: 679  ELLLFYSSNDTCELKDEDFDVLKDVINNLDICISKNLERKISTQESLIPQQATSQFHGKL 738

Query: 1298 ACTHKVPKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQDFSYL--SSDTFEEDDSMI 1471
            +  +K           Q + Q   +   H     ++++   +++    ++DT + DD+M 
Sbjct: 739  SDLYK----------GQLEFQHFEDEEEHKIASDKRKEKLSNWASTRCAADTVK-DDNMT 787

Query: 1472 QAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDIEMEKYKLCKTK 1651
            QAIKKVL K F  E+E   QILLY+NLWLEAEA+LCS+ Y ARF R+ IEMEK    K  
Sbjct: 788  QAIKKVLAKNFPIEEESESQILLYRNLWLEAEASLCSVNYMARFNRMKIEMEKGHSQKAN 847

Query: 1652 ARSGTVELPLNMEKLWNSTVSDSNL-YTDATNEMSTPKIYDPSYSRITGHTEDVEASVMD 1828
             +S      + +E L    VS   L   D  + +      D S      H++D    VM 
Sbjct: 848  EKS------MVLENLSRPKVSSDILPADDKGSPVQDVSFLDSSILSRNSHSDD----VMA 897

Query: 1829 RFHVLKCRID 1858
            RFH+LK R+D
Sbjct: 898  RFHILKSRVD 907


>ref|XP_006441271.1| hypothetical protein CICLE_v10018632mg [Citrus clementina]
            gi|557543533|gb|ESR54511.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
          Length = 1064

 Score =  164 bits (414), Expect = 2e-37
 Identities = 197/754 (26%), Positives = 310/754 (41%), Gaps = 90/754 (11%)
 Frame = +2

Query: 263  MEQNDHFIMDLSPVEKKEPSNCNLIIHDYSNHLCKNSELQDSQFDLTDTFISAPSVASGV 442
            +E+  H    L P EKKE  + N+ +    + L +   LQ    D+    +S     +  
Sbjct: 345  LERGSHIFPKL-PFEKKEKLSSNVSV--IKDPLKEKPGLQIP--DIGPGSVSLMLANNRA 399

Query: 443  CSFVQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGV---AGGNVLNHQ 613
             +  + SS + D +NP VDSPCWKG+     SP   +  VT + +N +   +G N +   
Sbjct: 400  INCSEGSSESLDHYNPAVDSPCWKGA-PDYHSPVESSGPVTLQHINKIEACSGSNSIGPT 458

Query: 614  DLQSLPVNAEEAFSVSSQYLHKGLDYNSNRSVENESSFLKIPSKMSSRNEVHISYGVEEP 793
            D          +  VS Q       Y  +  +EN+      P + S  N +   +G +  
Sbjct: 459  D---------NSGKVSPQKPSDYSFYQEHGYLENDPE--SSPKRSSRANLLFEEHGYDRD 507

Query: 794  IKKGSLPGKT-----------------------------KLTPFQTLASSHEEGNIAPTG 886
            +K G    K+                             K  PF  +     E  +    
Sbjct: 508  LKTGFYQMKSSYGLGVQFSDCIDKPRQDYVHANNSADEFKFRPFHQVQYDSVENKLTFER 567

Query: 887  QIGLFGGVVDPFMDIKDSNY--PSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLA 1060
            +  L  GV D  + I  ++    S    +A EH             RL N   G  + LA
Sbjct: 568  KCELGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARL-NKLHG--EQLA 624

Query: 1061 NNPRPTIDSQLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVS-- 1234
                P +  + L+STM N+SE+L   CSN++  L E D   L+ V+NNLD C+  ++   
Sbjct: 625  ----PQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPE 680

Query: 1235 ---------------IRSVPELQCPQSGVSHHLGKQACTHKVPKIEANGIQSQCDHQSCI 1369
                           IR  PEL         H G    + K  K  A  + +Q ++Q   
Sbjct: 681  APIQESLLTQKSSEFIREFPEL---------HEGVTVSSPKETKA-AFSVLNQPNYQHVQ 730

Query: 1370 ERSIHSPICSEKQDMFQDFS---------------YLSSDTFE--EDDSMIQAIKKVLKK 1498
            E+        +K +   DF+                +  D  E  +DD+M QAIKKVL  
Sbjct: 731  EQRSPDIAAGKKSEKCSDFTSQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSD 790

Query: 1499 KFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDIEMEKYKLCKTKARSGTVELP 1678
             F +E+++  Q+LLY+NLWLEAEAALCSI YKARF R+ IE+E  KL K K  S   E  
Sbjct: 791  NFVEEEDEKLQVLLYRNLWLEAEAALCSINYKARFNRMKIELENCKLLKAKDFS---ENT 847

Query: 1679 LNMEKLWNSTVSDS---------NLYTDATNEMSTPKIYDPSYSRITGHTEDVEASVMDR 1831
              +EKL  +T S            +  D+T ++S   ++D   + I+ H +DV A    R
Sbjct: 848  SELEKLSQTTFSPDLHAVNKLPPQVKDDSTQDVS---VHDFPIANISSHPDDVVA----R 900

Query: 1832 FHVLKCRIDKPVPSDERKFQEAVDVVVHERMEETTNPCSR-NKQNGRMYSQPTNFDVDFV 2008
              +LKC+ +    +++R   + VD  + E   + T P S  +  N    S+  + +   +
Sbjct: 901  SQILKCQ-ESESHANQRPTADEVDNFLFEARNDQTPPTSTCSLSNATSTSKADDVEASVI 959

Query: 2009 RR---------KNPCMFIGRELEDGI---LEARGNLQGHITNNRGKKSALDLEEGDNVKG 2152
             R          + C  +G ++   +   L   G    +      + S+  +++   VK 
Sbjct: 960  ARFHILKNRIENSSCSNMGDQILPQVAFKLFENGTSDVNTGPELHRNSSNHMQDKLTVKE 1019

Query: 2153 FQACFSDGSMIQSPVPNKCGGWPAAGGYENAVND 2254
            F     + ++IQSP  NK G    A  Y+++  D
Sbjct: 1020 FHL---NDAVIQSPRLNKLGNQLPASCYDSSSLD 1050


>ref|XP_006478087.1| PREDICTED: uncharacterized protein LOC102628429 [Citrus sinensis]
          Length = 1065

 Score =  162 bits (409), Expect = 9e-37
 Identities = 191/745 (25%), Positives = 305/745 (40%), Gaps = 81/745 (10%)
 Frame = +2

Query: 263  MEQNDHFIMDLSPVEKKEPSNCNLIIHDYSNHLCKNSELQDSQFDLTDTFISAPSVASGV 442
            +E+  H    L P+EKKE  + N+ +    + L +   LQ    D+    +S     +G 
Sbjct: 346  LERGSHIFPKL-PLEKKEKLSSNVSV--IKDPLKEKPGLQIP--DIGPGSVSLMLANNGA 400

Query: 443  CSFVQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGV---AGGNVLNHQ 613
             +  + SS + D +NP VDSPCWKG+     SP   +  VT + +N +   +G N     
Sbjct: 401  INCSEGSSESLDHYNPAVDSPCWKGA-PDYHSPVESSGPVTLQHINKIEACSGSNSFGPT 459

Query: 614  DLQSLPVNAEEAFSVSSQYLHKGLDYNSNRSVENESSFLKIPSKMSSRNEVHISYGVEEP 793
            D          +  VS Q       Y  +  +EN+      P + S  N +   +G +  
Sbjct: 460  D---------NSGKVSPQKPSDYSFYQEHGYLENDPE--SSPKRSSRANLLFEEHGYDHD 508

Query: 794  IKKGSLPGKT-----------------------------KLTPFQTLASSHEEGNIAPTG 886
            +K GS   K+                             K  PF  +     E  +    
Sbjct: 509  LKTGSYQMKSSCGLGVQFSDYIDKPRQDYVHANNSADEFKFRPFHQVQYDTVENKLTFER 568

Query: 887  QIGLFGGVVDPFMDIKDSNY--PSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLA 1060
            +  L  GV D  + I  ++    S    +A EH             RL N   G  + LA
Sbjct: 569  KCELGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARL-NKLHG--EQLA 625

Query: 1061 NNPRPTIDSQLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVS-- 1234
                P +  + L+S+M N+SE+L   CSN++  L E D   L+ V+NNLD C+  ++   
Sbjct: 626  ----PQMCVRTLISSMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPE 681

Query: 1235 ---------------IRSVPEL------QCPQ-SGVSHHLGKQACTHKVPKIEANGIQSQ 1348
                           IR  PEL        PQ +  +  +  Q     V +  +  I + 
Sbjct: 682  APIQESLLTQKSSEFIREFPELHEGVTVSSPQETKAAFSVLNQPNYQHVQEQRSPDIAAG 741

Query: 1349 CDHQSCIERSIHSPICSEKQDMFQDFSYLSSDTFE--EDDSMIQAIKKVLKKKFHDEDEQ 1522
               + C + +         +D   D + +  D  E  +DD+M QAIKKVL   F  E+++
Sbjct: 742  KKIEKCSDFTSQGGHAERVKD--DDMTQVHKDDAERVKDDNMTQAIKKVLSDNFVKEEDE 799

Query: 1523 LPQILLYKNLWLEAEAALCSIKYKARFARLDIEMEKYKLCKTKARSGTVELPLNMEKLWN 1702
              Q+LLY+NLWLEAEAALC+I YKARF R+ IE+E  KL K K  S   E    +EKL  
Sbjct: 800  KLQVLLYRNLWLEAEAALCAINYKARFNRMKIELENCKLLKAKDLS---ENTSELEKLSQ 856

Query: 1703 STVSDS---------NLYTDATNEMSTPKIYDPSYSRITGHTEDVEASVMDRFHVLKCRI 1855
            +T S            +  D T ++S   + D   +  + H +DV A    RF +LKC+ 
Sbjct: 857  TTFSPDLHAVNKLPPQVKDDTTQDVS---VRDFPIANSSSHPDDVVA----RFQILKCQE 909

Query: 1856 DKPVPSDERKFQEAVDVVVHERMEETTNPCSRNKQNGRMYSQPTNFDVDFVRR------- 2014
             K   + +    E  + +   R ++T    + +  N    S+  + +   + R       
Sbjct: 910  SKSHANQKPTADEVDNFLFEARNDQTPPTSTCSLSNATSTSKADDVEASVIARFHILKNR 969

Query: 2015 --KNPCMFIGRELEDGI---LEARGNLQGHITNNRGKKSALDLEEGDNVKGFQACFSDGS 2179
               + C  +G ++   +   L   G    +      + S+  +++   VK F     + +
Sbjct: 970  IENSSCSNMGDQILPQVAFKLFENGTSDVNTGPELHRNSSTHMQDKLTVKEFHL---NDA 1026

Query: 2180 MIQSPVPNKCGGWPAAGGYENAVND 2254
            +IQSP  NK G    A  Y+++  D
Sbjct: 1027 VIQSPRLNKLGNQLPASCYDSSSLD 1051


>ref|XP_006441268.1| hypothetical protein CICLE_v10018632mg [Citrus clementina]
            gi|557543530|gb|ESR54508.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
          Length = 1041

 Score =  159 bits (402), Expect = 6e-36
 Identities = 192/745 (25%), Positives = 305/745 (40%), Gaps = 81/745 (10%)
 Frame = +2

Query: 263  MEQNDHFIMDLSPVEKKEPSNCNLIIHDYSNHLCKNSELQDSQFDLTDTFISAPSVASGV 442
            +E+  H    L P EKKE  + N+ +    + L +   LQ    D+    +S     +  
Sbjct: 345  LERGSHIFPKL-PFEKKEKLSSNVSV--IKDPLKEKPGLQIP--DIGPGSVSLMLANNRA 399

Query: 443  CSFVQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGV---AGGNVLNHQ 613
             +  + SS + D +NP VDSPCWKG+     SP   +  VT + +N +   +G N +   
Sbjct: 400  INCSEGSSESLDHYNPAVDSPCWKGA-PDYHSPVESSGPVTLQHINKIEACSGSNSIGPT 458

Query: 614  DLQSLPVNAEEAFSVSSQYLHKGLDYNSNRSVENESSFLKIPSKMSSRNEVHISYGVEEP 793
            D          +  VS Q       Y  +  +EN+      P + S  N +   +G +  
Sbjct: 459  D---------NSGKVSPQKPSDYSFYQEHGYLENDPE--SSPKRSSRANLLFEEHGYDRD 507

Query: 794  IKKGSLPGKT-----------------------------KLTPFQTLASSHEEGNIAPTG 886
            +K G    K+                             K  PF  +     E  +    
Sbjct: 508  LKTGFYQMKSSYGLGVQFSDCIDKPRQDYVHANNSADEFKFRPFHQVQYDSVENKLTFER 567

Query: 887  QIGLFGGVVDPFMDIKDSNY--PSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLA 1060
            +  L  GV D  + I  ++    S    +A EH             RL N   G  + LA
Sbjct: 568  KCELGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARL-NKLHG--EQLA 624

Query: 1061 NNPRPTIDSQLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVS-- 1234
                P +  + L+STM N+SE+L   CSN++  L E D   L+ V+NNLD C+  ++   
Sbjct: 625  ----PQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPE 680

Query: 1235 ---------------IRSVPELQCPQSGVSHHLGKQACTHKVPKIEANGIQSQCDHQSCI 1369
                           IR  PEL         H G    + K  K  A  + +Q ++Q   
Sbjct: 681  APIQESLLTQKSSEFIREFPEL---------HEGVTVSSPKETKA-AFSVLNQPNYQHVQ 730

Query: 1370 ERSIHSPICSEKQDMFQDFS---------------YLSSDTFE--EDDSMIQAIKKVLKK 1498
            E+        +K +   DF+                +  D  E  +DD+M QAIKKVL  
Sbjct: 731  EQRSPDIAAGKKSEKCSDFTSQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSD 790

Query: 1499 KFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDIEMEKYKLCKTKARSGTVELP 1678
             F +E+++  Q+LLY+NLWLEAEAALCSI YKARF R+ IE+E  KL K K      +LP
Sbjct: 791  NFVEEEDEKLQVLLYRNLWLEAEAALCSINYKARFNRMKIELENCKLLKAKVN----KLP 846

Query: 1679 LNMEKLWNSTVSDSNLYTDATNEMSTPKIYDPSYSRITGHTEDVEASVMDRFHVLKCRID 1858
              ++              D+T ++S   ++D   + I+ H +DV A    R  +LKC+ +
Sbjct: 847  PQVK-------------DDSTQDVS---VHDFPIANISSHPDDVVA----RSQILKCQ-E 885

Query: 1859 KPVPSDERKFQEAVDVVVHERMEETTNPCSR-NKQNGRMYSQPTNFDVDFVRR------- 2014
                +++R   + VD  + E   + T P S  +  N    S+  + +   + R       
Sbjct: 886  SESHANQRPTADEVDNFLFEARNDQTPPTSTCSLSNATSTSKADDVEASVIARFHILKNR 945

Query: 2015 --KNPCMFIGRELEDGI---LEARGNLQGHITNNRGKKSALDLEEGDNVKGFQACFSDGS 2179
               + C  +G ++   +   L   G    +      + S+  +++   VK F     + +
Sbjct: 946  IENSSCSNMGDQILPQVAFKLFENGTSDVNTGPELHRNSSNHMQDKLTVKEFHL---NDA 1002

Query: 2180 MIQSPVPNKCGGWPAAGGYENAVND 2254
            +IQSP  NK G    A  Y+++  D
Sbjct: 1003 VIQSPRLNKLGNQLPASCYDSSSLD 1027


>ref|XP_007039224.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508776469|gb|EOY23725.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 1059

 Score =  159 bits (402), Expect = 6e-36
 Identities = 185/684 (27%), Positives = 275/684 (40%), Gaps = 83/684 (12%)
 Frame = +2

Query: 452  VQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVLNHQDLQSLP 631
            V++S  + D +NP VDSPCWKG+ AS  SPF  ++ V  +L   +   +  N   L+ + 
Sbjct: 402  VENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPVAVQLAKKLEACDGSNGLVLKFIS 461

Query: 632  VNAEEAFSVSSQYLHKGLDYNSNRSVENES-SFLKIP----------------------S 742
             N        S    + L  + N +VE+ S S LK+P                      +
Sbjct: 462  SNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKLPPVSIPSFKEHEPDEAGKAGSHKN 521

Query: 743  KMSSRNEVHISYGVEEPIKKGSLPGKTKLTPFQTLASSHE---EGNIAPTGQIGLFGGVV 913
            K SS  EV  S    E  K   L  K+     +   +S +   EG +A         GV 
Sbjct: 522  KASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEGRLASKNLCRSETGVA 581

Query: 914  DPFMDIKDSNY--PSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDS 1087
            D  M I D +    S    +A +H            T+        +  L   P      
Sbjct: 582  DLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTK-------HTKFLGKEPVSNSSI 634

Query: 1088 QLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIRSVPELQCPQ 1267
             +LV TM+N+SE+L   CSN    L EQD   L+ VINNLD C+   +           Q
Sbjct: 635  SVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIG----------Q 684

Query: 1268 SGVSHHLGKQACTHKVPKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQDFSYLSS-- 1441
              +   L K   T   P++ A  + SQ          +      +K +   +F  + S  
Sbjct: 685  ETLLSELHKGTSTGS-PQVAAIDVLSQ-------HTQVKRKHFGKKDEKCSEFVSVRSGT 736

Query: 1442 DTFEEDDSMIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDIE 1621
            D   ++D M QAIKKVL + FH+++E  PQ+LLYKNLWLEAEAALCSI Y AR+  + IE
Sbjct: 737  DIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYMARYNNMKIE 796

Query: 1622 MEKYKLCKTKARSGTVELPLNMEKLWNSTVS---DSNLYTDATNEMS-TPKIYDPSY--S 1783
            +EK   CK        E   + +K+  S +S   D+N    A  E + T  + + ++  +
Sbjct: 797  IEK---CKLDTEKDLSEDTPDEDKISRSKLSADLDTNKKLTAIAESAPTLDVSNQNFPIA 853

Query: 1784 RITGHTEDVEASVMDRFHVLKCRIDKPVPSDERKFQE-----------AVDVVVHERMEE 1930
              + H +DV A    RFHVLK R++       R   E           AVD +  E  + 
Sbjct: 854  SSSNHADDVTA----RFHVLKHRLNNSYSVHTRDADELSSSKLSLDSDAVDKLATEVKDS 909

Query: 1931 TTN------------PCSRNKQNGRMYSQ----------------------PTNFDVDFV 2008
            +T+             C  +     + ++                      P   D+ F 
Sbjct: 910  STSSLQTQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSNEMEQKPLPEVVDLGFA 969

Query: 2009 RRKNPCMFIGRELEDGIL--EARGNLQGHITNNRGKKSALDLEEGDNVKGFQACFSDGSM 2182
             +K          +DG+L        Q  + +  G++S         VK F  C      
Sbjct: 970  GKKKQIPIDEDTADDGVLGFNLESVSQNQVVDYAGEQSV--------VKDFHLCVKHDCT 1021

Query: 2183 IQSPVPNKCGGWPAAGGYENAVND 2254
            IQSP   + G   +AG Y++  +D
Sbjct: 1022 IQSPKSTRLGNQLSAGWYDSCSSD 1045


>ref|XP_007039222.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508776467|gb|EOY23723.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 1068

 Score =  159 bits (401), Expect = 8e-36
 Identities = 185/693 (26%), Positives = 282/693 (40%), Gaps = 92/693 (13%)
 Frame = +2

Query: 452  VQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVLNHQDLQSLP 631
            V++S  + D +NP VDSPCWKG+ AS  SPF  ++ V  +L   +   +  N   L+ + 
Sbjct: 391  VENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPVAVQLAKKLEACDGSNGLVLKFIS 450

Query: 632  VNAEEAFSVSSQYLHKGLDYNSNRSVENES-SFLKIP----------------------S 742
             N        S    + L  + N +VE+ S S LK+P                      +
Sbjct: 451  SNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKLPPVSIPSFKEHEPDEAGKAGSHKN 510

Query: 743  KMSSRNEVHISYGVEEPIKKGSLPGKTKLTPFQTLASSHE---EGNIAPTGQIGLFGGVV 913
            K SS  EV  S    E  K   L  K+     +   +S +   EG +A         GV 
Sbjct: 511  KASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEGRLASKNLCRSETGVA 570

Query: 914  DPFMDIKDSNY--PSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDS 1087
            D  M I D +    S    +A +H            T+        +  L   P      
Sbjct: 571  DLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTK-------HTKFLGKEPVSNSSI 623

Query: 1088 QLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIRSV-PELQCP 1264
             +LV TM+N+SE+L   CSN    L EQD   L+ VINNLD C+   +   ++  EL   
Sbjct: 624  SVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQETLLSELHKV 683

Query: 1265 QSGVSHHLGKQACTHKV--------PKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQ 1420
               +S   G+++   ++        P++ A  + SQ          +      +K +   
Sbjct: 684  WFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQ-------HTQVKRKHFGKKDEKCS 736

Query: 1421 DFSYLSS--DTFEEDDSMIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYK 1594
            +F  + S  D   ++D M QAIKKVL + FH+++E  PQ+LLYKNLWLEAEAALCSI Y 
Sbjct: 737  EFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYM 796

Query: 1595 ARFARLDIEMEKYKLCKTKARSGTVELPLNMEKLWNSTVS---DSNLYTDATNEMS-TPK 1762
            AR+  + IE+EK   CK        E   + +K+  S +S   D+N    A  E + T  
Sbjct: 797  ARYNNMKIEIEK---CKLDTEKDLSEDTPDEDKISRSKLSADLDTNKKLTAIAESAPTLD 853

Query: 1763 IYDPSY--SRITGHTEDVEASVMDRFHVLKCRIDKPVPSDERKFQE-----------AVD 1903
            + + ++  +  + H +DV A    RFHVLK R++       R   E           AVD
Sbjct: 854  VSNQNFPIASSSNHADDVTA----RFHVLKHRLNNSYSVHTRDADELSSSKLSLDSDAVD 909

Query: 1904 VVVHERMEETTN------------PCSRNKQNGRMYSQ---------------------- 1981
             +  E  + +T+             C  +     + ++                      
Sbjct: 910  KLATEVKDSSTSSLQTQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSNEMEQKPL 969

Query: 1982 PTNFDVDFVRRKNPCMFIGRELEDGIL--EARGNLQGHITNNRGKKSALDLEEGDNVKGF 2155
            P   D+ F  +K          +DG+L        Q  + +  G++S         VK F
Sbjct: 970  PEVVDLGFAGKKKQIPIDEDTADDGVLGFNLESVSQNQVVDYAGEQSV--------VKDF 1021

Query: 2156 QACFSDGSMIQSPVPNKCGGWPAAGGYENAVND 2254
              C      IQSP   + G   +AG Y++  +D
Sbjct: 1022 HLCVKHDCTIQSPKSTRLGNQLSAGWYDSCSSD 1054


>ref|XP_007039220.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590674635|ref|XP_007039223.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508776465|gb|EOY23721.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508776468|gb|EOY23724.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1079

 Score =  159 bits (401), Expect = 8e-36
 Identities = 185/693 (26%), Positives = 282/693 (40%), Gaps = 92/693 (13%)
 Frame = +2

Query: 452  VQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVLNHQDLQSLP 631
            V++S  + D +NP VDSPCWKG+ AS  SPF  ++ V  +L   +   +  N   L+ + 
Sbjct: 402  VENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPVAVQLAKKLEACDGSNGLVLKFIS 461

Query: 632  VNAEEAFSVSSQYLHKGLDYNSNRSVENES-SFLKIP----------------------S 742
             N        S    + L  + N +VE+ S S LK+P                      +
Sbjct: 462  SNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKLPPVSIPSFKEHEPDEAGKAGSHKN 521

Query: 743  KMSSRNEVHISYGVEEPIKKGSLPGKTKLTPFQTLASSHE---EGNIAPTGQIGLFGGVV 913
            K SS  EV  S    E  K   L  K+     +   +S +   EG +A         GV 
Sbjct: 522  KASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEGRLASKNLCRSETGVA 581

Query: 914  DPFMDIKDSNY--PSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDS 1087
            D  M I D +    S    +A +H            T+        +  L   P      
Sbjct: 582  DLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTK-------HTKFLGKEPVSNSSI 634

Query: 1088 QLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIRSV-PELQCP 1264
             +LV TM+N+SE+L   CSN    L EQD   L+ VINNLD C+   +   ++  EL   
Sbjct: 635  SVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQETLLSELHKV 694

Query: 1265 QSGVSHHLGKQACTHKV--------PKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQ 1420
               +S   G+++   ++        P++ A  + SQ          +      +K +   
Sbjct: 695  WFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQ-------HTQVKRKHFGKKDEKCS 747

Query: 1421 DFSYLSS--DTFEEDDSMIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYK 1594
            +F  + S  D   ++D M QAIKKVL + FH+++E  PQ+LLYKNLWLEAEAALCSI Y 
Sbjct: 748  EFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYM 807

Query: 1595 ARFARLDIEMEKYKLCKTKARSGTVELPLNMEKLWNSTVS---DSNLYTDATNEMS-TPK 1762
            AR+  + IE+EK   CK        E   + +K+  S +S   D+N    A  E + T  
Sbjct: 808  ARYNNMKIEIEK---CKLDTEKDLSEDTPDEDKISRSKLSADLDTNKKLTAIAESAPTLD 864

Query: 1763 IYDPSY--SRITGHTEDVEASVMDRFHVLKCRIDKPVPSDERKFQE-----------AVD 1903
            + + ++  +  + H +DV A    RFHVLK R++       R   E           AVD
Sbjct: 865  VSNQNFPIASSSNHADDVTA----RFHVLKHRLNNSYSVHTRDADELSSSKLSLDSDAVD 920

Query: 1904 VVVHERMEETTN------------PCSRNKQNGRMYSQ---------------------- 1981
             +  E  + +T+             C  +     + ++                      
Sbjct: 921  KLATEVKDSSTSSLQTQDSPVPGTACHTDDVEASIMTRLHILKSRGNVDLDSNEMEQKPL 980

Query: 1982 PTNFDVDFVRRKNPCMFIGRELEDGIL--EARGNLQGHITNNRGKKSALDLEEGDNVKGF 2155
            P   D+ F  +K          +DG+L        Q  + +  G++S         VK F
Sbjct: 981  PEVVDLGFAGKKKQIPIDEDTADDGVLGFNLESVSQNQVVDYAGEQSV--------VKDF 1032

Query: 2156 QACFSDGSMIQSPVPNKCGGWPAAGGYENAVND 2254
              C      IQSP   + G   +AG Y++  +D
Sbjct: 1033 HLCVKHDCTIQSPKSTRLGNQLSAGWYDSCSSD 1065


>ref|XP_002521299.1| hypothetical protein RCOM_0756330 [Ricinus communis]
            gi|223539484|gb|EEF41073.1| hypothetical protein
            RCOM_0756330 [Ricinus communis]
          Length = 1125

 Score =  156 bits (394), Expect = 5e-35
 Identities = 167/622 (26%), Positives = 262/622 (42%), Gaps = 56/622 (9%)
 Frame = +2

Query: 161  DNADSHSPAASNIKNPSIKVSAEDKGC--SHIIGNKMEQNDHFIMDLSPVEKKEPSNCNL 334
            DN D    + S +  P   ++++   C  +  +   + + D  I + S  + +E S+   
Sbjct: 352  DNKDFSCNSPSVVVEPRPFITSKGSVCYDASQVSFHLGKTDQVIANFSSAKNEELSSNQN 411

Query: 335  IIHDYSNHLCKNSELQDSQFDLTDTFISAPSVASGVCSFVQSSSV---------TCDQFN 487
               D S H      +           I  P  + G  S V  +           + D +N
Sbjct: 412  ASMDVSGHFAGEKPV-----------IQVPCTSLGGISLVDKNEAIDPAKNHTESLDHYN 460

Query: 488  PVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVLNHQDLQSLPVNAEEAFSVSSQ 667
            P VDSPCWKG+  S  S   V++ VTP+ +  +   +  NHQ  Q+  V++++A  VS +
Sbjct: 461  PAVDSPCWKGAPVSNFSQLEVSEAVTPQNMKNLEACSGSNHQGYQTFSVSSDDAVKVSPE 520

Query: 668  YLHKGLDYNSNRSVENES-SFLKIP--SKMSSRNEVH--ISYGV---------EEPIKKG 805
               +        S+EN S S +K P    M  R  +   +++G          +  I   
Sbjct: 521  KTSEKSIQQKGWSLENYSASSMKRPLADNMLHREGIDHFVNFGANCTKPSLFHQVQISDD 580

Query: 806  SLP--------GKTKLTPFQTLASSH--EEGNIAPTGQIGLFGGVVDPFMDIKDSNYPST 955
            +LP        GK      Q+  S     E N AP   +   G  ++   D   S+ P  
Sbjct: 581  ALPNKSFDDSNGKLPQNEKQSCESGKWTTESNSAPVISVADVGMNMNDDPDECSSHVP-- 638

Query: 956  FLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDSQLLVSTMKNMSEVLCS 1135
              F+A EH             +L     G S T     R  ID      TM+N+SE+L  
Sbjct: 639  --FHAVEHVLSSPPSADSASIKLTKACGGVS-TQKTYIRTVID------TMQNLSELLIF 689

Query: 1136 ICSNNIYALTEQDHAVLQHVINNLDACVVNKVS-IRSVPELQCPQSGVSHHLGKQACTHK 1312
              SN++  L E D   L+ +I+NL+ C++  V  + S  E   P+   +   GK +   K
Sbjct: 690  HLSNDLCDLKEDDSNALKGMISNLELCMLKNVERMTSTQESIIPERDGAQLSGKSSKLQK 749

Query: 1313 --------VPKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQDF-SYLSSDTFEEDDS 1465
                    + + +    Q    +Q   +   H+    +  +    + S  ++    + D 
Sbjct: 750  GTNGNGFLISRSDPLEFQYSVKYQHVQDE--HNISSGKNDETLSSYVSVRAAADMLKRDK 807

Query: 1466 MIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDIEMEKYKLCK 1645
            M QAIK  L + FH E+E  PQ+LLYKNLWLEAEA+LC     ARF R+  EMEK   C 
Sbjct: 808  MTQAIKNALTENFHGEEETEPQVLLYKNLWLEAEASLCYASCMARFNRIKSEMEK---CD 864

Query: 1646 TKARSGTVELPLNMEKLWNSTVSDSNLYTD-------ATNEMSTP----KIYDPSYSRIT 1792
            ++  +G+ E  +  EKL     S SN+ +D       A+N   +P     I + S    +
Sbjct: 865  SEKANGSPENCMVEEKL-----SKSNIRSDPCTGNVLASNTKGSPLPDTSIPESSILCTS 919

Query: 1793 GHTEDVEASVMDRFHVLKCRID 1858
             H +DV A    R+H+LK R+D
Sbjct: 920  SHADDVTA----RYHILKYRVD 937


>ref|XP_002317835.2| hypothetical protein POPTR_0012s00720g [Populus trichocarpa]
            gi|550326088|gb|EEE96055.2| hypothetical protein
            POPTR_0012s00720g [Populus trichocarpa]
          Length = 1227

 Score =  155 bits (391), Expect = 1e-34
 Identities = 173/608 (28%), Positives = 260/608 (42%), Gaps = 32/608 (5%)
 Frame = +2

Query: 131  GYTGDEHDNDDNADSHSPAASNIKNPSIKVSAEDKGC--SHIIGNKMEQNDHFIMDLSPV 304
            G  GDE D   N  S +      + P+  +S++ K C  S  +   ++QND    ++   
Sbjct: 345  GTDGDEKDFAGNNTSFA------QEPNPFISSKGKVCYDSSQVNFHLKQNDDSFAEVPSK 398

Query: 305  EKKEP-SNCNLIIHDYSNHLCKNSELQDSQFDLTDTFISAPSVASGVCSFVQSSSVTCDQ 481
              +E  SN N+ I D+ + L +            D F  A        S V+ +S + D 
Sbjct: 399  NHEELLSNKNISI-DFLDKLFREKMENRVPCKNLDFFNLAMDGHEAAGS-VEITSESLDH 456

Query: 482  FNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVLNHQDLQSLPVNAEEAF--- 652
            + P VDSPCWKG+  S  S F  +++V P+  N V   N LN Q  Q  P    +A    
Sbjct: 457  YFPAVDSPCWKGAPVSLPSAFEGSEVVNPQ--NKVEACNGLNLQGPQISPSTTNDAVKDC 514

Query: 653  -----SVSSQYLHKGLDYNSNRS--------------VENESSFLKIPSKMSSRNEVHIS 775
                 ++S  + ++ L++    S              +++   +     K S  NE  IS
Sbjct: 515  PEKQSNISMTFNNESLEHRPASSFKRPLVANVLFREGIDDAVKYGPCQRKSSYCNEAQIS 574

Query: 776  YGVEEPIKKGSLPGKTKLTPFQTLASSHEEGNIAPTGQIGLFGGVVDPFMDIKDSNYPST 955
              ++EP K+  LP      P  T   S EEG   P+ +     GV     D  D +  S 
Sbjct: 575  DVIDEPRKESILPD---FKPVHTKQKSLEEGEW-PSKKNSDVAGVRRKINDNPD-DCSSH 629

Query: 956  FLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDSQLLVSTMKNMSEVLCS 1135
              ++A EH             +      G S +        + ++ LV TM N+SE+L  
Sbjct: 630  VPYHAIEHVLCSPPSSEHAPAQHTQSQVGESSS-------KMHARTLVDTMHNLSELLLF 682

Query: 1136 ICSNNIYALTEQDHAVLQHVINNLDACVV-NKVSIRSVPELQCPQSGVSHHLGKQACTHK 1312
              SN+   L ++D  VL  VINNLD  +  N     S  E   P+   S   GK +  +K
Sbjct: 683  YSSNDTCELKDEDFDVLNDVINNLDIFISKNSERKNSTQESLIPRRATSQSPGKLSELYK 742

Query: 1313 VPKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQDFSYL--SSDTFEEDDSMIQAIKK 1486
                       Q + Q   +      +  E+++   +F  +  ++DT + DD++ QAIKK
Sbjct: 743  ----------GQLEFQHFEDEKECKIVSDERKEKLSNFVSMRGATDTVK-DDNVTQAIKK 791

Query: 1487 VLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDIEMEKYKLCKTKARSGT 1666
            VL + F  ++E   QILLYKNLWLEAEA+LC +    RF RL IE+EK    K    S  
Sbjct: 792  VLAQNFPIKEESESQILLYKNLWLEAEASLCVVNCMDRFNRLKIEIEKGSSQKVNEFSSA 851

Query: 1667 V----ELPLNMEKLWNSTVSDSNLYTDATNEMSTPKIYDPSYSRITGHTEDVEASVMDRF 1834
                 E  + ME L    VS S++            + D S      H++D    VM RF
Sbjct: 852  APVVPENSMIMENLLGPKVS-SDILPAEDEGSPVHNVPDSSILSRNSHSDD----VMARF 906

Query: 1835 HVLKCRID 1858
            H++K R+D
Sbjct: 907  HIIKSRVD 914


>ref|XP_004309093.1| PREDICTED: uncharacterized protein LOC101301835 [Fragaria vesca
            subsp. vesca]
          Length = 1218

 Score =  150 bits (379), Expect = 3e-33
 Identities = 157/572 (27%), Positives = 237/572 (41%), Gaps = 36/572 (6%)
 Frame = +2

Query: 251  IGNKMEQNDHFIMDLSPVEKKEPSNCNLIIHDYSNHLCKNSELQDSQFDLTDTFISAPSV 430
            +G  + + D F ++ S  +     N   I +D  +HL K      +     D F +A ++
Sbjct: 404  LGIHLGRIDPFSVESSSTKDTALPNNGSISNDPLDHLFKVKPGLPNSHVKPDGFDAAVNI 463

Query: 431  ASGVCSFVQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVLNH 610
               + SF+ SS    D  NP VDSPCWKG   SR SPF  ++   P+ +  + G N LN 
Sbjct: 464  NDSINSFLNSSE-NVDPNNPAVDSPCWKGVRGSRFSPFKASEEGGPEKMKKLEGCNGLNL 522

Query: 611  QDLQSLPVNAEEAFSVSSQYLHKGLDYNSNRSVENESSFLKIPSKMSS-RNEVHISYGVE 787
                   +N  E  S      +    +  N  + N    L +P K SS  N     + ++
Sbjct: 523  NMPMIFSLNTCENISTQKPVEYNEFGWLGNGLLGNG---LPLPLKKSSVENSAFGEHKLD 579

Query: 788  EPIKKG-------------------SLPGKTKLTPFQTLASSHE---EGNIAPTGQIGLF 901
            +  K                     S  G    +PF+      E   EG +    +   +
Sbjct: 580  DTTKTTYYRESGHDRGLHGYINTPHSGSGDKSSSPFEHSYIVQEGCGEGGLTTESKNTTW 639

Query: 902  GGVVDPFMDIKDSNYPSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTI 1081
                D  ++I D+      L     H               A+  +  + +        +
Sbjct: 640  SVGADVKLNINDT------LECGSSHTSPIENTFCSPSVEDAD--TKLTTSYGEESNMNM 691

Query: 1082 DSQLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVV-NKVSIRSVPELQ 1258
            D Q+LV+ M ++SEVL   CSN+   L ++D   L+ VINNL++C++ +     S+PE  
Sbjct: 692  DIQMLVNKMNSLSEVLLVNCSNSSCQLKKKDIDALKAVINNLNSCILKHDEDFLSMPESP 751

Query: 1259 CPQSGVSHHLGKQACTHK--------VPKIEANGIQSQCDHQSCIERSIHSPICSEKQDM 1414
              Q     ++ +    +K        + KI A  IQ     Q   +   H  +     ++
Sbjct: 752  PIQQSTIKYIEELCKPNKALSPDMPQLTKIFAPSIQDPLHLQGVQKVKNHDNLVKNDDEV 811

Query: 1415 FQDFSYLSSDTFEEDDSMIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYK 1594
                S  S   F + + M Q IKK+L + FH +D   PQ LLYKNLWLEAEA +CS  YK
Sbjct: 812  ISSVSAKSDIDFVKQEEMTQDIKKILSENFHTDDTH-PQTLLYKNLWLEAEAVICSTNYK 870

Query: 1595 ARFARLDIEMEKYKLCKTKARSGTVELPLNMEKLWNSTVS-DSNLYTDATNEMS---TPK 1762
            ARF RL  EMEK   CK        E   +M     S V  +SN     T+E+     PK
Sbjct: 871  ARFNRLKTEMEK---CKADQSKDVFEHTADMMTQSRSEVCVNSNPVEKLTSEVQGSPLPK 927

Query: 1763 IYDPSYSRITGHTEDVEASVMDRFHVLKCRID 1858
            +       +T      + +VM RFHVL+ RI+
Sbjct: 928  LNLQESPTLT----QGDDNVMARFHVLRNRIE 955


>ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prunus persica]
            gi|462417047|gb|EMJ21784.1| hypothetical protein
            PRUPE_ppa000352mg [Prunus persica]
          Length = 1254

 Score =  150 bits (379), Expect = 3e-33
 Identities = 171/562 (30%), Positives = 243/562 (43%), Gaps = 26/562 (4%)
 Frame = +2

Query: 251  IGNKMEQNDHFIMDLSPVEKKEPSNC-NLIIHDYSNHLCKNSE-LQDSQFDLTDTFISAP 424
            +G  +   D F  + S    +E SN  N+I  D  + + K    LQ+S   L D F  A 
Sbjct: 407  LGFHLGAKDCFSAESSSARNEELSNNRNIINKDAWDKVFKAKPGLQNSHVGL-DGFKMAF 465

Query: 425  SVASGVCSFVQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVL 604
                 + SF+ SSS   D  NP VDSPCWKG   S  SPF  ++   P+ +  +   + L
Sbjct: 466  KTNETINSFL-SSSDNVDPNNPGVDSPCWKGVPGSCFSPFGASEDGVPEQIKKLEDCSGL 524

Query: 605  NHQDLQSLPVNAEEAFSVSSQYLHKGLDYNSNRSVENESSFLKIPSKMSS---------- 754
            N   +   P++A E  S S + +   ++YN    +EN    L+ P K  S          
Sbjct: 525  NIH-MPMFPLSAGENVS-SQKPIKNAVEYNEFGWLENG---LRPPLKRYSVANSAFGEHK 579

Query: 755  -RNEVHISYGVEEPIKKGSLPGKTKL-------TPFQTLASSH--EEGNIAPTGQIGLFG 904
              N V  +Y  E    +G    +  L            L  SH  ++G+    G+ GL  
Sbjct: 580  WDNSVKTTYDAETSHDRGPQSYRDGLHQSGNGDKSLGLLDDSHAMQQGH----GEDGLAT 635

Query: 905  GVVDPFMDIKDSNYPSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTID 1084
             V   +  + D    +                     +   +  +  S +        +D
Sbjct: 636  EVKQTWSCVADVKLNANDTMEYGSSHVPSHVVENVLCSSAEDAATKLSKSNGEESMLKVD 695

Query: 1085 SQLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIRSVPELQCP 1264
             Q+LV T+KN+SE+L + CSN +  L + D A L+ VINNL  C+   V   S P  + P
Sbjct: 696  VQMLVDTLKNLSELLLTNCSNGLCQLKKTDIATLKAVINNLHICISKNVEKWS-PMQESP 754

Query: 1265 --QSGVSHHLGKQACTHKVPKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQDFSYLS 1438
              Q   S    + +  HKV   +     S  D Q  +  SIH      K D+        
Sbjct: 755  TFQQNTSQCYAELSEHHKVLSADRPLSASAPDIQDQVIGSIHV-----KSDI-------- 801

Query: 1439 SDTFEEDDSMIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDI 1618
             D  +ED  M QAIK++L + FH E+   PQ+LLYKNLWLEAEA LCSI YKARF R+ I
Sbjct: 802  -DVVKED-KMTQAIKEILSENFHSEETD-PQVLLYKNLWLEAEAVLCSINYKARFNRVKI 858

Query: 1619 EMEKYKLCKTKARSGTVELPLNMEKLWNSTVS-DSNLYTDATNE-MSTPKIYDPSYSRIT 1792
            EM+K   CK +      E   +M K   S VS DSN     T E    P    P    ++
Sbjct: 859  EMDK---CKAENSKDVFEYTADMMKQSKSEVSPDSNPVNPLTPEAQGCPTSNVPDLPILS 915

Query: 1793 GHTEDVEASVMDRFHVLKCRID 1858
               E     V+ RF +L+ R++
Sbjct: 916  QEDE-----VLARFDILRGRVE 932


>gb|EXB94712.1| hypothetical protein L484_002599 [Morus notabilis]
          Length = 1159

 Score =  147 bits (371), Expect = 2e-32
 Identities = 157/540 (29%), Positives = 227/540 (42%), Gaps = 46/540 (8%)
 Frame = +2

Query: 452  VQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVLNHQDLQSLP 631
            V+ SS   D +N  VDSPCWKG  A+R SPF   D   P+        N  N Q  Q   
Sbjct: 468  VEDSSENVDHYNHAVDSPCWKGVPATRSSPF---DASVPETKRQEVFSNS-NVQTKQIFQ 523

Query: 632  VNAEEAFSVSSQYLH------------KGLDYNSNRSVENESSF--------LKIPSKMS 751
            +N  +   VSSQ  +             GL++  N S   +S+F        +KI S + 
Sbjct: 524  LNTGD--KVSSQKRNDNMMCHEFGSPENGLEFPLNTSPAAKSTFSDRKSDDIVKIGSDLE 581

Query: 752  SRNEVHISYGVEEPIKKGSLPGKTKLTPFQTLASSHEEGNIAPTG----QIGLFGGVVDP 919
            ++   H S  + E   + +     K       +S + E NI   G     I      V P
Sbjct: 582  TKGIQH-SNDIHEHGSRSTGCSDLK-------SSLNGEQNIQRNGLISENINEALQCVSP 633

Query: 920  FMDIKDSNYPSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDSQLLV 1099
             +     N  S+ +  A               T+L     G S        PTID  +LV
Sbjct: 634  RLPFPMENIISSSVEDAS--------------TKLNKSNEGPSS-------PTIDVPVLV 672

Query: 1100 STMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIRSVPELQCPQSGVS 1279
            ST++N+SE+L   C++  Y L ++D   +Q +I+NL  C           +    +   S
Sbjct: 673  STIRNLSELLLFHCTSGSYQLKQKDLETIQSMIDNLSVCASKNSEKTVSTQDSTSEKYTS 732

Query: 1280 HHLGKQACTHK---VPKIE----ANGIQSQCDHQSCIERSIHSPICSEKQDMFQDFSYLS 1438
             +LG +   HK   + K++    A  I      Q+  + + +     E  ++    S  +
Sbjct: 733  DYLGDK--NHKGFTLNKLQVTKTAGPILDLLADQNVHKGNKYYVAGKENDELLDSVSVRA 790

Query: 1439 SDTFEEDDSMIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDI 1618
                 ++D  IQA+KKVL   F  E+E  PQ LLYKNLWLEAEAALCS+  KARF R+ +
Sbjct: 791  DVDIVDEDKAIQALKKVLTDNFDYEEEASPQALLYKNLWLEAEAALCSMSCKARFNRVKL 850

Query: 1619 EMEKYKLCKTKARSGTVELPLNMEKLWNSTVS----DSNLYTDATNEMSTPKIYDPSYSR 1786
            EME  KL K+K   G   +   M+K+  S VS     +N  +      +T K  + S   
Sbjct: 851  EMENPKLPKSKDAHGNT-ITTEMDKVSRSEVSPDLNGANTLSPKAKGCATTKSQESSVLS 909

Query: 1787 ITGHTEDVEASVMDRFHVLKCRI-----------DKPVPSDERKFQEAVDVVVHERMEET 1933
                 +D    VMDRF +L+CR            DKP           V  ++ E  EET
Sbjct: 910  TNAEDDD----VMDRFQILRCRAKKSNYGIVADKDKPSSPKVSPHSNKVGKILPEANEET 965


>ref|XP_007039225.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508776470|gb|EOY23726.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 827

 Score =  145 bits (365), Expect = 1e-31
 Identities = 134/435 (30%), Positives = 195/435 (44%), Gaps = 39/435 (8%)
 Frame = +2

Query: 452  VQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGVAGGNVLNHQDLQSLP 631
            V++S  + D +NP VDSPCWKG+ AS  SPF  ++ V  +L   +   +  N   L+ + 
Sbjct: 402  VENSLESLDHYNPPVDSPCWKGAPASNNSPFGSSEPVAVQLAKKLEACDGSNGLVLKFIS 461

Query: 632  VNAEEAFSVSSQYLHKGLDYNSNRSVENES-SFLKIP----------------------S 742
             N        S    + L  + N +VE+ S S LK+P                      +
Sbjct: 462  SNTANMVKHPSGKAGEILMSDENGNVEDGSMSSLKLPPVSIPSFKEHEPDEAGKAGSHKN 521

Query: 743  KMSSRNEVHISYGVEEPIKKGSLPGKTKLTPFQTLASSHE---EGNIAPTGQIGLFGGVV 913
            K SS  EV  S    E  K   L  K+     +   +S +   EG +A         GV 
Sbjct: 522  KASSACEVKFSDNASEWKKDYVLFDKSVDEVEKASHTSQQCLAEGRLASKNLCRSETGVA 581

Query: 914  DPFMDIKDSNY--PSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDS 1087
            D  M I D +    S    +A +H            T+        +  L   P      
Sbjct: 582  DLEMKINDVSGCGSSHVSCHAVKHLSCAPSSVEDVSTK-------HTKFLGKEPVSNSSI 634

Query: 1088 QLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIRSV-PELQCP 1264
             +LV TM+N+SE+L   CSN    L EQD   L+ VINNLD C+   +   ++  EL   
Sbjct: 635  SVLVDTMQNLSELLLYHCSNEACELREQDVKSLEKVINNLDTCMSKNIGQETLLSELHKV 694

Query: 1265 QSGVSHHLGKQACTHKV--------PKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQ 1420
               +S   G+++   ++        P++ A  + SQ          +      +K +   
Sbjct: 695  WFPMSKKNGQESLLSELHKGTSTGSPQVAAIDVLSQ-------HTQVKRKHFGKKDEKCS 747

Query: 1421 DFSYLSS--DTFEEDDSMIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYK 1594
            +F  + S  D   ++D M QAIKKVL + FH+++E  PQ+LLYKNLWLEAEAALCSI Y 
Sbjct: 748  EFVSVRSGTDIKVKNDKMTQAIKKVLIENFHEKEETHPQVLLYKNLWLEAEAALCSINYM 807

Query: 1595 ARFARLDIEMEKYKL 1639
            AR+  + IE+EK KL
Sbjct: 808  ARYNNMKIEIEKCKL 822


>ref|XP_006441272.1| hypothetical protein CICLE_v10018632mg [Citrus clementina]
            gi|557543534|gb|ESR54512.1| hypothetical protein
            CICLE_v10018632mg [Citrus clementina]
          Length = 842

 Score =  143 bits (361), Expect = 3e-31
 Identities = 149/531 (28%), Positives = 220/531 (41%), Gaps = 68/531 (12%)
 Frame = +2

Query: 263  MEQNDHFIMDLSPVEKKEPSNCNLIIHDYSNHLCKNSELQDSQFDLTDTFISAPSVASGV 442
            +E+  H    L P EKKE  + N+ +    + L +   LQ    D+    +S     +  
Sbjct: 345  LERGSHIFPKL-PFEKKEKLSSNVSV--IKDPLKEKPGLQIP--DIGPGSVSLMLANNRA 399

Query: 443  CSFVQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVTDLVTPKLVNGV---AGGNVLNHQ 613
             +  + SS + D +NP VDSPCWKG+     SP   +  VT + +N +   +G N +   
Sbjct: 400  INCSEGSSESLDHYNPAVDSPCWKGA-PDYHSPVESSGPVTLQHINKIEACSGSNSIGPT 458

Query: 614  DLQSLPVNAEEAFSVSSQYLHKGLDYNSNRSVENESSFLKIPSKMSSRNEVHISYGVEEP 793
            D          +  VS Q       Y  +  +EN+      P + S  N +   +G +  
Sbjct: 459  D---------NSGKVSPQKPSDYSFYQEHGYLENDPE--SSPKRSSRANLLFEEHGYDRD 507

Query: 794  IKKGSLPGKT-----------------------------KLTPFQTLASSHEEGNIAPTG 886
            +K G    K+                             K  PF  +     E  +    
Sbjct: 508  LKTGFYQMKSSYGLGVQFSDCIDKPRQDYVHANNSADEFKFRPFHQVQYDSVENKLTFER 567

Query: 887  QIGLFGGVVDPFMDIKDSNY--PSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLA 1060
            +  L  GV D  + I  ++    S    +A EH             RL N   G  + LA
Sbjct: 568  KCELGSGVADVGLSINGTSEGCSSHVPLHATEHVLSSPSSVEAVPARL-NKLHG--EQLA 624

Query: 1061 NNPRPTIDSQLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVS-- 1234
                P +  + L+STM N+SE+L   CSN++  L E D   L+ V+NNLD C+  ++   
Sbjct: 625  ----PQMCVRTLISTMHNLSELLLFHCSNDMCGLKEHDFEALKLVVNNLDKCISKRMGPE 680

Query: 1235 ---------------IRSVPELQCPQSGVSHHLGKQACTHKVPKIEANGIQSQCDHQSCI 1369
                           IR  PEL         H G    + K  K  A  + +Q ++Q   
Sbjct: 681  APIQESLLTQKSSEFIREFPEL---------HEGVTVSSPKETKA-AFSVLNQPNYQHVQ 730

Query: 1370 ERSIHSPICSEKQDMFQDFS---------------YLSSDTFE--EDDSMIQAIKKVLKK 1498
            E+        +K +   DF+                +  D  E  +DD+M QAIKKVL  
Sbjct: 731  EQRSPDIAAGKKSEKCSDFTSQGGHAERVKDDDMTQVHKDDAERVKDDNMTQAIKKVLSD 790

Query: 1499 KFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDIEMEKYKLCKTK 1651
             F +E+++  Q+LLY+NLWLEAEAALCSI YKARF R+ IE+E  KL K K
Sbjct: 791  NFVEEEDEKLQVLLYRNLWLEAEAALCSINYKARFNRMKIELENCKLLKAK 841


>gb|EYU45327.1| hypothetical protein MIMGU_mgv1a001518mg [Mimulus guttatus]
          Length = 804

 Score =  133 bits (334), Expect = 5e-28
 Identities = 148/511 (28%), Positives = 224/511 (43%), Gaps = 26/511 (5%)
 Frame = +2

Query: 440  VCSFVQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSV----TDLVTPKLVNGVAGGNVLN 607
            V    + SS   D  NP  DSPCW+G+ +S+ S F +    ++ V  KL +   G +   
Sbjct: 214  VIDSTEDSSDFVDHHNPAEDSPCWRGAPSSQFSQFDIETGNSNHVRKKL-DEFYGFDHEE 272

Query: 608  HQDLQSLPVNAEEAFSVSSQYLHKGLDYNSNRSVENESS-FLKIPSKMSS-----RNEVH 769
            HQ++ S+ V++   FS        G  YN+N   EN+S  F    SK +S     +  V 
Sbjct: 273  HQNIHSI-VDSSGVFSEKD-----GEGYNNN---ENQSGGFHPCSSKKASLHNDAKGGVW 323

Query: 770  IS-YGVEEP----IKKGSLPGKTKLTPFQTLASSH---EEGNIAPTGQIGLFGGVVDPFM 925
            +S    ++P    I  G+L   T +     L +S    EEG+      +   G V     
Sbjct: 324  VSAISGDDPNMPRIGSGTLNNLTSVFHMNVLDTSQLIGEEGSGTSQNDVSEAGAVA---- 379

Query: 926  DIKDSNYPSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDSQLLVST 1105
                         +A E               LA+P   AS   A  P P ++   ++ T
Sbjct: 380  ------------VHAAEEV-------------LASP---ASQEDATEPDPKLNVPKIIKT 411

Query: 1106 MKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVS-IRSVPELQCPQSGVSH 1282
            M N+S +L    S++  +L E+    L+H ++NL + +  K++   + PE +      S 
Sbjct: 412  MHNLSALLLFHLSSDTCSLDEESSETLKHTMSNLGSSLCEKLNRATNHPEPKNHVGDTSD 471

Query: 1283 HLGKQ------ACTHKVPKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQDFSYLSSD 1444
             LG+       +  H +    AN    + D+    E      +  +K D    FS L  D
Sbjct: 472  KLGESREVFTISGNHNMANEAANP-HIKLDYHQVHEGERTYSLPGKKDDKSPVFSPLRDD 530

Query: 1445 T-FEEDDSMIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDIE 1621
                 DD M +AIKKVL + FH  ++   Q LL+K+LWL+AEA LCSI YKARF R+ I 
Sbjct: 531  LDITSDDDMAKAIKKVLDENFHLNEDMDSQALLFKSLWLDAEAKLCSITYKARFDRMKIL 590

Query: 1622 MEKYKLCKTKARSGTVELPLNMEKLWNSTVSDSNLYTDATNEMSTPKIYDPSYSRITGHT 1801
            M++ KL   KA+     +   + K+                 +S P +   + S +  H 
Sbjct: 591  MDETKL---KAQQENENIAQMLSKV----------------SISKPTL--QNISSLPEHA 629

Query: 1802 EDVEASVMDRFHVLKCRIDKPVPSDERKFQE 1894
            EDVE SVM RF++LK R D P P    K Q+
Sbjct: 630  EDVETSVMARFNILKSREDNPKPLIIEKEQQ 660


>ref|XP_004235030.1| PREDICTED: uncharacterized protein LOC101252062 [Solanum
            lycopersicum]
          Length = 1175

 Score =  125 bits (315), Expect = 7e-26
 Identities = 161/613 (26%), Positives = 250/613 (40%), Gaps = 42/613 (6%)
 Frame = +2

Query: 353  NHLCKNSELQDSQFDLTDTFIS---APSVASGVCSFVQSSSVTCDQFNPVVDSPCWKGSL 523
            N+LC       +  ++     S   AP  ++   +  +  S   D  NP VDSPCWKG+ 
Sbjct: 410  NNLCSTRPCSSNSIEIAVKERSGSQAPCASAPPVTSAEKCSDALDLHNPNVDSPCWKGAP 469

Query: 524  ASRQSPFSVTDLVTPKLVNG------VAGGNVL----NHQDLQSLPVNAEEAFSVSSQYL 673
            A R S     +  +P ++            N L     +    SL    EE     + Y 
Sbjct: 470  AFRVSLSDSVEAPSPCILTSKVEFSDFGQSNHLFPPAEYSGKTSLKKLGEENLHNHNVYA 529

Query: 674  HKGLDYNSNRSVENESSFLK-----------IPSKMSSRNEV-HISYGVEEPIKKGSLPG 817
              GL   S  +V N  +  +           +P  +SS   +   S  + +P K  SLP 
Sbjct: 530  GNGLSVPSVGTVTNNYTTEELRTIDVTKGTFVPVDLSSNGVILKFSEDLNKPSKGYSLPQ 589

Query: 818  KTKLTPFQTLASSHEEGNIAPTGQIG-----LFGGVVDPFMDIKDSNYPSTFLFYAKEHX 982
             ++    Q   S  E  ++    Q G     L  G +   +++ D+         A E+ 
Sbjct: 590  YSE-NDCQKQYSWGEHLSV-DCHQYGPKKHNLPEGYMHTGLNLNDTLEGGVVALDAAENV 647

Query: 983  XXXXXXXXXXXTRLANPFSGASDTLANNPRPTIDSQLLVSTMKNMSEVLCSICSNNIYAL 1162
                        + A P+   S        P +D Q LV  + N+SE+L S C  N   L
Sbjct: 648  LRSPASQED--AKQAQPYQMGSS-------PKLDVQTLVHAIHNLSELLKSQCLPNACLL 698

Query: 1163 TEQDHAVLQHVINNLDACVVNKVSIRS--VPELQCPQSGVSHHLGKQACTHKVPKIEANG 1336
              QD+  L+  I NL AC V K+  +   V E    +     H          P+     
Sbjct: 699  EGQDYDTLKSAITNLGACTVKKIETKDTMVTEHDTFERLKESHRSYMGTETGNPQFMEEV 758

Query: 1337 IQSQC--DHQSCIE--------RSIHSPICSEKQDMFQDFSYLSSDTFEEDDSMIQAIKK 1486
             +  C  D+Q   E        ++ +SP+ +   D+         D+ EE   ++QAIKK
Sbjct: 759  ARDSCGLDNQPMPEDKSKNNGKKTENSPLLTSADDL--------GDSNEEQ--VVQAIKK 808

Query: 1487 VLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYKARFARLDIEMEKYKLCKTKARSGT 1666
            VL + F  ++   PQ LL+KNLWLEAEA LCS+ YK+RF R+ IEMEK++  +       
Sbjct: 809  VLNENFLSDEGMQPQALLFKNLWLEAEAKLCSLSYKSRFDRMKIEMEKHRFSQ------- 861

Query: 1667 VELPLNMEKLWNSTVSDSNLYTDATNEMSTPKIYDPSYSRITGHTEDVEASVMDRFHVLK 1846
             +L LN           S++  +A N+ S  KI   S S  + +   V+ S+M+RF++L 
Sbjct: 862  -DLNLN-----------SSVAPEAKND-SASKISSQSPSTSSKNVH-VDYSLMERFNILN 907

Query: 1847 CRIDKPVPSDERKFQEAVDVVVHERMEETTNPCSRNKQNGRMYSQPTNFDVDFVRRKNPC 2026
             R +K   S   K +E   V V    E+     S   +   +  Q  NF   F++ K   
Sbjct: 908  RREEKLNSSFFMK-EENDSVKVGSDSED-----SVTMKLNILRKQGNNFSSSFMQEKKAS 961

Query: 2027 MFIGRELEDGILE 2065
              +  + ED ++E
Sbjct: 962  DIVSSDTEDSVME 974


>ref|XP_002300521.2| hypothetical protein POPTR_0001s45660g [Populus trichocarpa]
            gi|550349961|gb|EEE85326.2| hypothetical protein
            POPTR_0001s45660g [Populus trichocarpa]
          Length = 911

 Score =  124 bits (311), Expect = 2e-25
 Identities = 113/421 (26%), Positives = 194/421 (46%), Gaps = 39/421 (9%)
 Frame = +2

Query: 452  VQSSSVTCDQFNPVVDSPCWKGSLASRQSPFSVT----------DLVTPKLVNGVAG--- 592
            +++SS   ++ +  +DSPCWKG LA+ QS   V+          +      +N +A    
Sbjct: 471  IENSSKIINENDSDLDSPCWKGKLAAEQSSCEVSVPDNFQHLKSEQEACSYLNPLAPHFF 530

Query: 593  -----------GNVLNHQDLQSLPVNAEEAFSVSSQYLHKGLDYNSNRSVENESSFLKIP 739
                       GN  +  D  S    A    ++ S+           + +++ ++     
Sbjct: 531  PSSDKQKVNYCGNEGDGNDCFSFQKTASSVVNLVSR----------EQRLQHSATAGSSS 580

Query: 740  SKMSSRNEVHISYGVEEPIKKGSLPGKTKLTPFQ----TLASSHEEGNIAPTGQIGLFGG 907
            S+ SS  E H    +  P K+  L   +  +        +  S  E     +GQ+ L G 
Sbjct: 581  SEQSSITEAHCYSDMHVPNKEYELLTDSSSSSMHGSSCVVLPSVLEDYFTSSGQL-LTGQ 639

Query: 908  VVDPF-MDIKDS--NYPSTFLFYAKEHXXXXXXXXXXXXTRLANPFSGASDTLANNPRPT 1078
             V  F   IKD+  N  ++   +A +H            T L+  + GA+  L + PR  
Sbjct: 640  CVGGFGKAIKDTAPNGSTSVSLFASKHVFDSSSCREGVSTDLSETYGGATKPLCSPPR-- 697

Query: 1079 IDSQLLVSTMKNMSEVLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIRSV-PEL 1255
            +D Q++V TM  +SE+L   C+N++ +L E +H +++ +I+NL  C+ N+V   ++  E 
Sbjct: 698  LDFQIVVKTMNELSELLMQNCTNDLDSLNEHEHDIIKRIIHNLTLCIRNRVGEHTLMSES 757

Query: 1256 QCPQSGV----SHHLGKQACTH---KVPKIEANGIQSQCDHQSCIERSIHSPICSEKQDM 1414
              P +      S HL K  C++   +  + +A  +  +  HQ+  ER + S    E+   
Sbjct: 758  SHPHTSYCVRKSTHLNK--CSNMELQTTRTKAVMVSHELGHQNKHERQMSSTSFRER--- 812

Query: 1415 FQDFSYLSSDTFEEDDSMIQAIKKVLKKKFHDEDEQLPQILLYKNLWLEAEAALCSIKYK 1594
            F D     +  F +++ + Q  +K L+  +  E+E+ PQ+L YKNLWLEAEAALCS+KYK
Sbjct: 813  FLDSLNARNGGFNKNEDITQVNEKALEGHYELEEEENPQVLFYKNLWLEAEAALCSMKYK 872

Query: 1595 A 1597
            A
Sbjct: 873  A 873


>ref|XP_003523306.2| PREDICTED: uncharacterized protein LOC100778126 [Glycine max]
          Length = 1048

 Score =  122 bits (306), Expect = 8e-25
 Identities = 173/746 (23%), Positives = 296/746 (39%), Gaps = 93/746 (12%)
 Frame = +2

Query: 23   PSAACASSVLVSQSTSFGNDSTAPIKI--SMSTNVVVNGYTGDEH----DNDDNADSHS- 181
            P     +SV+ S S S      AP+K+      N  +N  + D+H    D     D+ S 
Sbjct: 231  PVEFSGTSVMRSPSMSLETHQEAPLKVVSDSGNNHSLNIGSYDKHSRHGDKPSRVDTVSS 290

Query: 182  -PAASNIKNPSIKVSAEDKGCSHIIGNKMEQNDHF-------IMDLSPVE----KKEPSN 325
             P    + + +I+    D+   H      ++  H          D  P+     + EPS+
Sbjct: 291  MPRTGLVTDLNIEDIIADEHVGHNDFYNTKEASHMPSPGTAGFFDSGPIHMHLGRNEPSS 350

Query: 326  CN-LIIHDYSNHLCKNSELQDSQFDLTDTFISAPSVASGVCSFVQSSSVTCDQFNPVVDS 502
             N  +I D +  +     +        D     P+   G  +FVQ S    DQ NP  DS
Sbjct: 351  SNKAMISDKNVSMNVVDYIFRGSHANVDNLRLRPNATEGA-NFVQKSFEGVDQCNPAEDS 409

Query: 503  PCWKGSLASRQSPFSVTDLVTPKLVNG--VAGGNVLNHQDLQSLPVNAEEAFSVSSQYLH 676
            PCWKG+ A+R S F  +  +  + V+   ++ G+++  Q+ Q++ ++ E     S +  +
Sbjct: 410  PCWKGASAARFSHFEPSAALPQEYVHKKEISFGSII--QEPQNILLDTENNMKKSGENSN 467

Query: 677  KGLDYNSNRSVENESSFLKIPSKMSSRNEVHISYGVEEPIKKGSLPGKTKLTPFQTLAS- 853
                Y ++  + N+       S   S  +  ++    E  K GS        PFQ+  S 
Sbjct: 468  ---GYQTHTKIVNQER-----SSAGSPRKFSVTKFAPEYFKSGSAVNDG---PFQSKPSC 516

Query: 854  ----------SHEEGNIAPTGQIGLFGGVVDPFMDIKDSNYPSTFLFYAKEHXXXXXXXX 1003
                        +E  + P        G     M ++  +     +F  ++         
Sbjct: 517  GFGLHYLDITKMKENTVPPAKPTDCASGSSQ--MGLQHVDLKEFIIFQKQQALVCTGDVD 574

Query: 1004 XXXXTRLANPFSGASDTLANNPRPT--------------------IDSQLLVSTMKNMSE 1123
                    + +S +       P P+                    ++ Q+L+ T++N+SE
Sbjct: 575  SGCNVNNCSEYSSSCSAEHVPPSPSSVVDTTTTPENSARKVSTEKLNVQMLLDTLQNLSE 634

Query: 1124 VLCSICSNNIYALTEQDHAVLQHVINNLDACVVNKVSIRSVPELQCPQSGVSHHLGKQAC 1303
            +L   C N+   L E+D  +L++VI+NL+ C +         E   P          Q C
Sbjct: 635  LLLYHCLNDACELKERDCNILKNVISNLNTCALKNA------EQIAPA---------QEC 679

Query: 1304 THKVPKIEANGIQSQCDHQSCIERSIHSPICSEKQDMFQDFSYLSSDTFEEDDSMIQAIK 1483
                P+   +  +S+  HQ+    S   P              L+     +  +M + +K
Sbjct: 680  FFNQPETSKSAGESREFHQNA---SFKRP-------------QLTKTEMTKACNMTKDLK 723

Query: 1484 KVLKKKFHDEDEQL-PQILLYKNLWLEAEAALCSIKYKARFARLDIEMEK--YKLCKTKA 1654
            ++L + FHD+DE   PQ +LYKNLWLEAEAALCS+ YKAR+ ++ IEM+K  Y+  + + 
Sbjct: 724  RILSENFHDDDEGAEPQTVLYKNLWLEAEAALCSVYYKARYNQIKIEMDKHSYQEKEMEK 783

Query: 1655 RSGTVELP-LNMEKLWNSTVSDSN---------LYTDATN-----------EMSTPKIYD 1771
            +S +  +P L+  + + + V   N            DATN           +M+ P    
Sbjct: 784  QSKSEVVPSLSQSQSFATKVHHPNPDSSAALKFRVLDATNLEELSCLNISTDMNKPNAMT 843

Query: 1772 PS-------YSRITGH---------TEDVEASVMDRFHVLKCRIDKPVPSDERKFQEAVD 1903
            P         S I  +           + E+SVM R+ VLK R+D+   S     +E +D
Sbjct: 844  PEGKGGQNLDSFINNYFVPCSDDEAERNDESSVMARYQVLKARVDQ---SSIDNLEEPLD 900

Query: 1904 VVVHERMEETTNPCSRNKQNGRMYSQ 1981
            +       + ++P  R+ QN    SQ
Sbjct: 901  IA------DKSSPRGRDNQNQVNLSQ 920

