BLASTX nr result

ID: Achyranthes23_contig00006309 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00006309
         (2783 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI16022.3| unnamed protein product [Vitis vinifera]              316   3e-83
ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II tra...   295   6e-77
ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citr...   291   8e-76
ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus c...   274   1e-70
ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Popu...   271   1e-69
gb|EOY33856.1| Uncharacterized protein isoform 7 [Theobroma cacao]    261   1e-66
gb|EOY33851.1| Uncharacterized protein isoform 2 [Theobroma caca...   261   1e-66
gb|EOY33857.1| Uncharacterized protein isoform 8 [Theobroma cacao]    260   3e-66
ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Popu...   252   7e-64
gb|EMJ06149.1| hypothetical protein PRUPE_ppa000292mg [Prunus pe...   236   3e-59
ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214...   223   3e-55
ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205...   223   3e-55
emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]   221   2e-54
ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227...   220   2e-54
gb|EXB30469.1| hypothetical protein L484_006018 [Morus notabilis]     214   2e-52
ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314...   209   7e-51
gb|EOY33855.1| Uncharacterized protein isoform 6 [Theobroma cacao]    197   2e-47
gb|EOY33854.1| Uncharacterized protein isoform 5 [Theobroma cacao]    197   2e-47
gb|EOY33850.1| Uncharacterized protein isoform 1 [Theobroma cacao]    197   2e-47
ref|XP_006848046.1| hypothetical protein AMTR_s00029p00190880 [A...   184   2e-43

>emb|CBI16022.3| unnamed protein product [Vitis vinifera]
          Length = 1669

 Score =  316 bits (810), Expect = 3e-83
 Identities = 226/625 (36%), Positives = 299/625 (47%), Gaps = 71/625 (11%)
 Frame = +1

Query: 802  VDQGRNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADN 981
            +D GR+Q  P+QYGP+VQ RP A S  ++ P P       +                A  
Sbjct: 1020 LDGGRHQPPPMQYGPTVQQRPAAPSSGQAMPPPGLVHNAPVPGQPSTQLQP-----QALG 1074

Query: 982  SQPSSIKHPPGRVPHENASGAMVTPGLSGPF----------------PRPDNMGYY-QAS 1110
              P   +   G   HE   G ++ PG +  F                P   + G+Y Q  
Sbjct: 1075 LLPHPAQQSRGSFHHEIPPGGILGPGSAASFGRGLSHFAPPQRSFEPPSVVSQGHYNQGH 1134

Query: 1111 MPPYQAGQPQNPAGEPFGGSSFAAQRPGALDSHVGVREREPA---DFEQRPPYPMENEKF 1281
              P  AG  +   GE  G         G+ DSH G+  R P    D +QRP  P+E+E F
Sbjct: 1135 GLPSHAGPSRISQGELIGRPPLGPLPAGSFDSHGGMMVRAPPHGPDGQQRPVNPVESEIF 1194

Query: 1282 PVQRPGSFDGRKPESLPHGSLDRAAYGPVQP-GVQLGAMKIGGPPAHDSMSAPGMRDERG 1458
               RP  FDGR+ +S   GS +R  +G  QP GVQ   M++ G    +S    G++DER 
Sbjct: 1195 SNPRPNYFDGRQSDSHIPGSSERGPFG--QPSGVQSNMMRMNGGLGIESSLPVGLQDERF 1252

Query: 1459 MPFPEERLKRIPHREFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPS 1638
               PE   +   H +F +D ++F RSSH ++    K G  F SS PLD G   +  D   
Sbjct: 1253 KSLPEPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFGNYFSSSRPLDRGSQGFVMDAAQ 1312

Query: 1639 RPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHR 1818
               +KAP GF  D G K  S+ G+G  RF PP HP   GER R   F +D++GR D   R
Sbjct: 1313 GLLDKAPLGFNYDSGFK--SSAGTGTSRFFPPPHPGGDGERSRAVGFHEDNVGRSDMA-R 1369

Query: 1819 ADFSGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGA--------------------- 1935
               +  G+ P YGR  MDG  PRSP R+F G+P   FG                      
Sbjct: 1370 THPNFLGSVPEYGRHHMDGLNPRSPTREFSGIPHRGFGGLSGVPGRQSDLDDIDGRESRR 1429

Query: 1936 FGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEIDVPG---------------NLRVGGP 2070
            FG+G  +F   S      +SRFPV P+HL+RGE++ PG               +LR G  
Sbjct: 1430 FGEGSKTFNLPS-----DESRFPVLPSHLRRGELEGPGELVMADPIASRPAPHHLRGGDL 1484

Query: 2071 RNQDMLPNHLRR-DLVGPRN----MHMGDPT------KPRMGEPPLARNFPQHLPFGESF 2217
              QD+LP+HL+R +  G RN    +  G+P        PRMGE     NFP  L  GESF
Sbjct: 1485 IGQDILPSHLQRGEHFGSRNIPGQLRFGEPVFDAFLGHPRMGELSGPGNFPSRLSAGESF 1544

Query: 2218 GG-EKPGHPLAGEPXXXXXXXXXXXXHEGGFY-PDEMEPFDDPRKWKPVGI-MCRICKVE 2388
            GG  K GHP  GEP            ++ GF  P +ME FD+ RK KP+ +  CRIC ++
Sbjct: 1545 GGSNKSGHPRIGEPGFRSTYSLHGYPNDHGFRPPGDMESFDNSRKRKPLSMAWCRICNID 1604

Query: 2389 CGTVEGLDLHSQSREHQRKARDMVL 2463
            C TV+GLD+HSQ+REHQ+ A D+VL
Sbjct: 1605 CETVDGLDMHSQTREHQQMAMDIVL 1629


>ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            15-like isoform X1 [Citrus sinensis]
            gi|568870502|ref|XP_006488441.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 15-like isoform
            X2 [Citrus sinensis] gi|568870504|ref|XP_006488442.1|
            PREDICTED: mediator of RNA polymerase II transcription
            subunit 15-like isoform X3 [Citrus sinensis]
            gi|568870506|ref|XP_006488443.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 15-like isoform
            X4 [Citrus sinensis]
          Length = 1392

 Score =  295 bits (756), Expect = 6e-77
 Identities = 281/845 (33%), Positives = 360/845 (42%), Gaps = 88/845 (10%)
 Frame = +1

Query: 304  VRPPQL-------NQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHD- 459
            VRP QL       NQ+  S  +NQ+  +SEQQAG   +P       + +++V  K  H+ 
Sbjct: 616  VRPAQLGANQSSSNQSNLSWTSNQVQLSSEQQAGATSKPEM-----SEKNEVAVKIAHER 670

Query: 460  --QNNPNKVAK----NLVGGSGAGVLMN-------------------EAKMNAESTLDSG 564
              +++  K AK    +  G   A V M                    E K N   T    
Sbjct: 671  EAESSSEKTAKTDNFDTPGPEAAAVGMKVPKSETDVKAAVDEIKTEVEDKTNVVDTSSKE 730

Query: 565  FDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTK---IGAEDHKDVRKKSEVQESKQTA 735
            F  +         +P+     E V+E   G K +    I  E+H       EVQE     
Sbjct: 731  FVTDRESHIAENVQPINKMVKEEVIENVEGQKDSANVDIKQEEHS---VSKEVQEEPLLK 787

Query: 736  KSGAPNMPQSNLSTQVHGTNASVDQGRNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQ 915
             S      Q    ++       V Q +    P    P+ Q + G    S  + +  ++ Q
Sbjct: 788  TSTMQQGTQFGEQSEKVQKEQKVPQAQGAQGPGAVPPAGQAQAGGFVQSAPSLYGSSTLQ 847

Query: 916  PTIXXXXXXXXXXXXXFNSADNSQPSSIKHPP-GRVPHENASG--------AMVTPG--- 1059
                                  + PS  + PP G VP   A          A V PG   
Sbjct: 848  QR-------------------PAAPSIFQAPPPGAVPQTQAPTQFRPPMFKAEVPPGGIP 888

Query: 1060 LSGP---FPR-PDNMGYYQASM-PPYQAGQ-PQN---PAGEPFGGSSFAAQRPGALDSHV 1212
            +SGP   F R P + G +Q S  PP  A Q P N   P   P GG    +      DSHV
Sbjct: 889  VSGPAASFGRGPGHNGPHQHSFEPPLVAPQGPYNLGHPHPSPVGGPPQRSVPLSGFDSHV 948

Query: 1213 GVRERE------PADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSLDRAAYGPVQP 1374
            G           P D +Q P  PME E F  QRPG  DGR+ +S   GS  R+  GP   
Sbjct: 949  GTMVGPAYGPGGPMDLKQ-PSNPMEAEMFTGQRPGYMDGRESDSHFPGSQQRSPLGPPS- 1006

Query: 1375 GVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP---------HREFEDDPRKF 1527
            G +   M++ G P  +      +RDER   FP+ RL   P           EFE+D ++F
Sbjct: 1007 GTRSNMMRMNGGPGSE------LRDERFKSFPDGRLNPFPVDPARSVIDRGEFEEDLKQF 1060

Query: 1528 PRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPGLKMDSAVG 1707
             R SH +A P  KLG+ F  S P D GPH Y  D   RPFE+   G   DPGLK+D    
Sbjct: 1061 SRPSHLDAEPVPKLGSHFLPSRPFDRGPHGYGMDMGPRPFER---GLSYDPGLKLDPMGA 1117

Query: 1708 SGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGH-RADFSGPGAGPGYGRSRMDGFPP 1884
            S P RFLP +H              DD+ GR D  H   DF  PG    YGR  M G  P
Sbjct: 1118 SAPSRFLPAYH--------------DDAAGRSDSSHAHPDFPRPGRA--YGRRHMGGLSP 1161

Query: 1885 RSPGRDFPGLPSGTFGAFGD--------GGNSFP--AESFGKSIHDSRFPVPPNHLQRGE 2034
            RS  R+F G   G  G+ G         GG  F    +  G S HDSRFPV P+HL+RGE
Sbjct: 1162 RSSFREFCGF-GGLPGSLGGSRSVREDIGGREFRRFGDPIGNSFHDSRFPVLPSHLRRGE 1220

Query: 2035 IDVPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPFGE 2211
             + PG  R G    Q+ LP+HLRR + +GP N+ +G+ T    G P  AR         E
Sbjct: 1221 FEGPG--RTGDLIGQEFLPSHLRRGEPLGPHNLRLGE-TVGLGGFPGPARM--------E 1269

Query: 2212 SFGGEKPGH---PLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCRIC 2379
              GG  PG+   P  GEP            ++GGFY  +ME  D+ RK KP  +  CRIC
Sbjct: 1270 ELGG--PGNFPPPRLGEPGFRSSFSRQGFPNDGGFYTGDMESIDNSRKRKPPSMGWCRIC 1327

Query: 2380 KVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSSFQ 2559
            KV+C TV+GLDLHSQ+REHQ+ A DMVL                     D  + RN +F 
Sbjct: 1328 KVDCETVDGLDLHSQTREHQKMAMDMVLSIKQNAKKQKLTSGDRC-STDDANKSRNVNFD 1386

Query: 2560 GRGNK 2574
            GRG K
Sbjct: 1387 GRGKK 1391


>ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citrus clementina]
            gi|557526921|gb|ESR38227.1| hypothetical protein
            CICLE_v10027683mg [Citrus clementina]
          Length = 1392

 Score =  291 bits (746), Expect = 8e-76
 Identities = 280/847 (33%), Positives = 360/847 (42%), Gaps = 90/847 (10%)
 Frame = +1

Query: 304  VRPPQL--NQTYPSRVN-----NQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHD- 459
            VRP QL  NQ+  ++ N     NQ+  +SEQQAG   +P       + +++V  K  H+ 
Sbjct: 616  VRPAQLGANQSSSNQSNLFWTSNQVQLSSEQQAGATSKPEM-----SEKNEVAVKIAHER 670

Query: 460  --QNNPNKVAK----NLVGGSGAGVLMN-------------------EAKMNAESTLDSG 564
              +++  K AK    +  G   A V M                    E K N   T    
Sbjct: 671  EAESSSEKTAKTDNFDTPGPEAAAVGMKVPKSETDVKAAVDEIKTEVEDKTNVVDTSSKE 730

Query: 565  FDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTK---IGAEDHKDVRKKSEVQESKQTA 735
            F  +         +P+     E V+E   G K +    I  E+H       EVQE     
Sbjct: 731  FVTDRESHIAENVQPINKMVKEEVIENVEGQKDSANVDIKQEEHS---VSKEVQEEPLLK 787

Query: 736  KSGAPNMPQSNLSTQVHGTNASVDQGRNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQ 915
             S      Q    ++       V Q +    P    P+ Q + G    S  + +  ++ Q
Sbjct: 788  TSTMQQGTQFGEQSEKVQKEQKVPQAQGAQGPGAVPPAGQAQAGGFVQSAPSLYGSSTLQ 847

Query: 916  PTIXXXXXXXXXXXXXFNSADNSQPSSIKHPP-GRVPHENASG--------AMVTPG--- 1059
                                  + PS  + PP G VP   A          A V PG   
Sbjct: 848  QR-------------------PAAPSIFQAPPPGAVPQTQAPTQFRPPMFKAEVPPGGIP 888

Query: 1060 LSGP---FPR-PDNMGYYQASM-PPYQAGQPQNPAG------EPFGGSSFAAQRPGALDS 1206
            +SGP   F R P + G +Q S  PP  A  PQ P         P GG    +      DS
Sbjct: 889  VSGPAASFGRGPGHNGPHQHSFEPPLVA--PQGPYNLGHLHPSPVGGPPQRSVPLSGFDS 946

Query: 1207 HVGVRERE------PADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSLDRAAYGPV 1368
            HVG           P D +Q P  PME E F  QRPG  DGR+ +S   GS  R+  GP 
Sbjct: 947  HVGTMVGPAYGPGGPMDLKQ-PSNPMEAEMFTGQRPGYMDGRESDSHFPGSQQRSPLGPP 1005

Query: 1369 QPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP---------HREFEDDPR 1521
              G +   M++ G P  +      +RDER   FP+ RL   P           EFE+D +
Sbjct: 1006 S-GTRSNMMRMNGGPGSE------LRDERFKSFPDGRLNPFPVDPARSVIDRGEFEEDLK 1058

Query: 1522 KFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPGLKMDSA 1701
            +F R SH +A P  KLG+ F  S P D GPH Y  D   RPFE+   G   DPGLK+D  
Sbjct: 1059 QFSRPSHLDAEPVPKLGSHFLPSRPFDRGPHGYGMDMGPRPFER---GLSYDPGLKLDPM 1115

Query: 1702 VGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGH-RADFSGPGAGPGYGRSRMDGF 1878
              S P RFLP +H              DD+ GR D  H   DF  PG    YGR  M G 
Sbjct: 1116 GASAPSRFLPAYH--------------DDAAGRSDSSHAHPDFPRPGRA--YGRRHMGGL 1159

Query: 1879 PPRSPGRDFPGLPSGTFGAFGD--------GGNSFP--AESFGKSIHDSRFPVPPNHLQR 2028
             PRS  R+F G   G  G+ G         GG  F    +  G S HDSRFPV P+HL+R
Sbjct: 1160 SPRSSFREFCGF-GGLPGSLGGSRSVREDIGGREFRRFGDPIGNSFHDSRFPVLPSHLRR 1218

Query: 2029 GEIDVPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205
            GE + PG  R G    Q+ LP+HLRR + +GP N+ +G+ T    G P  AR        
Sbjct: 1219 GEFEGPG--RTGDLIGQEFLPSHLRRGEPLGPHNLRLGE-TVGLGGFPGPARM------- 1268

Query: 2206 GESFGGEKPGH---PLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCR 2373
             E  GG  PG+   P  GEP            ++GGFY  +ME  D+ RK KP  +  CR
Sbjct: 1269 -EELGG--PGNFPPPRLGEPGFRSSFSHQGFPNDGGFYTGDMESIDNSRKRKPPSMGWCR 1325

Query: 2374 ICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSS 2553
            ICKV+C TV+GLDLHSQ+REHQ+ A DMVL                     D  + RN +
Sbjct: 1326 ICKVDCETVDGLDLHSQTREHQKMAMDMVLSIKQNAKKQKLTSGDRC-STDDANKSRNVN 1384

Query: 2554 FQGRGNK 2574
            F GRG K
Sbjct: 1385 FDGRGKK 1391


>ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus communis]
            gi|223540292|gb|EEF41863.1| hypothetical protein
            RCOM_0731250 [Ricinus communis]
          Length = 1329

 Score =  274 bits (701), Expect = 1e-70
 Identities = 270/919 (29%), Positives = 369/919 (40%), Gaps = 61/919 (6%)
 Frame = +1

Query: 1    HSQQPGYPFQHRPGVXXXXXXXXXXXXS----FAGH------------GPYMQPQQPTAA 132
            H+QQPG P    P +                 F G             G YMQ       
Sbjct: 508  HAQQPGLPVHQLPVMQSVQQPIHQQYVQQQPPFPGQALGPVQNQVHQQGAYMQQH----L 563

Query: 133  HGHAXXXXXXXXXXXX----NYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXXXXXXXX 300
            HGH+                N    HG Q + AQ+   RP                    
Sbjct: 564  HGHSQLRPQGPSHAYTQPLQNVPLPHGTQAHQAQNLGGRP----PYGVPTYPHPHSSVGM 619

Query: 301  XVRPPQLNQTYPS----RVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQNN 468
             VRP Q+     S    R NNQM  +SEQ +G +   SRPT      D +++KS    ++
Sbjct: 620  QVRPMQVGADQQSGNAFRANNQMQLSSEQPSGAI---SRPTS-NRQGDDIIEKSSEADSS 675

Query: 469  PNKVAKNLVGGSGAGVLMNEAKMNAESTLDSGFDANDNKVSGMGSKPLESDASEGVLEPS 648
              K            V  +   ++  S L S        +S    KP++ D ++ + E  
Sbjct: 676  SQK-----------NVRRDPNDLDVASGLGSDVSDLKTVISESNLKPVDDD-NKSINEVK 723

Query: 649  PGSKSTKIGAEDHKDVRKKSEVQESKQTAKSGAPNMPQSNLSTQVHGTNASVDQGRNQLH 828
               +  K G +D KD+       E K   K G P M    L    H  + S+   R +  
Sbjct: 724  ---EEPKKGNDDQKDISNTDNDAEDKGV-KDG-PVMKNRPLPEAEHLEDQSMKSQRGRNV 778

Query: 829  PIQYGPSVQLR-----PGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADNSQPS 993
              Q+     L       G    S S P     +Q                       QP 
Sbjct: 779  TPQHSGGFILHGQVQGEGLAQPSHSIPIAEQGKQ-----------------------QPP 815

Query: 994  SIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQ-PQNP----AGEP 1158
             I H P  +       +++T    G        G+  A + P   G  P  P    AG  
Sbjct: 816  VIPHGPSALQQRPIGSSLLTAPPPGSLHHGQIPGHPSARVRPLGPGHIPHGPEVSSAGMT 875

Query: 1159 FGGSSFAAQRPGALDSHVGVR------EREPADFEQRPPYPMENEKFPVQRPGSFDGRKP 1320
              GS+    R G   SH G++         P+  + R PY  + + F  QRP   DG++ 
Sbjct: 876  GLGSTPITGRGG---SHYGLQGTYTQGHALPSQAD-RTPYGHDTDMFANQRPNYTDGKRL 931

Query: 1321 ESLPHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP-- 1494
            + L             Q G+   AM++ G P  DS SA G+RD+R  PF +E +   P  
Sbjct: 932  DPLGQ-----------QSGMHSNAMRMNGAPGMDSSSALGLRDDRFRPFSDEYMNPFPKD 980

Query: 1495 -------HREFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEK 1653
                    REFE+D + F R S  +   ++K G  F SS PLD GP            +K
Sbjct: 981  PSQRIVDRREFEEDLKHFSRPSDLDTQSTTKFGANFSSSRPLDRGP-----------LDK 1029

Query: 1654 APHGFERDPGLKMDSAVGSGPLRFLPPFH------PNDVGERGRPPTFPDDSMGRGDFGH 1815
              HG   D G+K++S  G  P RF PP+H      PND+ ER     F D+++GR     
Sbjct: 1030 GLHGPNYDSGMKLESLGGPPPSRFFPPYHHDGLMHPNDIAERSIG--FHDNTLGRQPDSV 1087

Query: 1816 RA--DFSGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGAFG--DGGNSFPAESFGKS 1983
            RA  +F GPG    Y R   DG  PRSPGRD+PG+ S  FGA    D  +   +  FG S
Sbjct: 1088 RAHPEFFGPGRR--YDRRHRDGMAPRSPGRDYPGVSSRGFGAIPGLDDIDGRESRRFGDS 1145

Query: 1984 IHDSRFPVPPNHLQRGEIDVPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRM 2160
             H SRFPV P+H++ GE + P         +QD   NH RR + +G  NM      + R+
Sbjct: 1146 FHGSRFPVLPSHMRMGEFEGP---------SQDGFSNHFRRGEHLGHHNM------RNRL 1190

Query: 2161 GEPPLARNFPQHLPFGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDP 2340
            GEP     FP     G+  G     +P  GEP             +GG Y  E+E FD+ 
Sbjct: 1191 GEPIGFGAFPGPAGMGDLSGTGNFFNPRLGEPGFRSSFSFKGFPGDGGIYAGELESFDNS 1250

Query: 2341 RKWKPVGI-MCRICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATF 2517
            R+ K   +  CRICKV+C TVEGLDLHSQ+REHQ++A DMV+                + 
Sbjct: 1251 RRRKSSSMGWCRICKVDCETVEGLDLHSQTREHQKRAMDMVVTIKQNAKKQKLANNDHS- 1309

Query: 2518 EGRDGGRPRNSSFQGRGNK 2574
               D  + +N+S +GRGNK
Sbjct: 1310 SVDDASKSKNTSIEGRGNK 1328


>ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa]
            gi|550331020|gb|ERP56830.1| hypothetical protein
            POPTR_0009s04520g [Populus trichocarpa]
          Length = 1315

 Score =  271 bits (692), Expect = 1e-69
 Identities = 263/914 (28%), Positives = 358/914 (39%), Gaps = 56/914 (6%)
 Frame = +1

Query: 1    HSQQPGYPFQHRPGVXXXXXXXXXXXXS----FAGH------------GPYMQPQQ---- 120
            H+ QPG P Q RPG+                 F+G             GPY+Q QQ    
Sbjct: 500  HAHQPGLPVQQRPGMQPTPQPMHQQYAQHQQPFSGQPWGAVHNQAHQQGPYVQQQQLHPL 559

Query: 121  ----PTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXXXX 288
                P                   N    HGA  + A+S A  P                
Sbjct: 560  TQLRPQGLPQSFQQPSHAYPHPQQNVLLPHGAHPHQAKSLAVGP------GLPAQSYPQS 613

Query: 289  XXXXXVRPPQLNQTYPS----RVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMH 456
                 VR  Q+     S    + NNQ+  +S+QQ+G +    R  D+E        K   
Sbjct: 614  ASGMQVRSIQIGANQQSGNILKTNNQVELSSDQQSG-VSSRQRQGDIE--------KGAE 664

Query: 457  DQNNPNKVAKNLVGGSGAGVLMNEAKMNA-ESTLDSGFDANDNKVSGMGSKPLESDASEG 633
             + +  K  K  +    AG+  + ++M   +S  D     + NK +G      ES     
Sbjct: 665  GELSAQKTIKKELNDLDAGLAADASEMKTIKSESDLKQVDDKNKPTGEAKDVPES----- 719

Query: 634  VLEPSPGSKSTKIGAEDHKDVRKKSEVQESKQTAKSGAPNMPQSNLSTQVHGTNASVDQG 813
             L  + G  S K   E+H+D          +Q   S A +  +  LS   H     ++  
Sbjct: 720  -LAAANGESSIKQVKEEHRD-------GADEQNDVSNADH-EKVELSVSEHKDGPLLETA 770

Query: 814  RNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADNS--Q 987
             + L            P + S     P+     Q                 ++ D    +
Sbjct: 771  PSHLEEQIMKLQKDKTPTSQSFGGFPPNGHVQSQSV---------------SAVDQGKLE 815

Query: 988  PSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFGG 1167
            P  I H P          ++V     GP       G+     PP Q G+           
Sbjct: 816  PLPIHHGPSAAQQRPVGPSLVQASPLGPPHHMQLPGH-----PPTQHGR----------- 859

Query: 1168 SSFAAQRPGALDSHVGVRE-------REPADFEQRPPYPMENEKFPVQRPGSFDGRKPES 1326
                   PG + SH G  +         P+  E+ P +  E   F  QRP   DGR+   
Sbjct: 860  -----LGPGHVPSHYGPPQGAYPHAPAPPSQGERTPSHVHEATMFANQRPKYPDGRQ--- 911

Query: 1327 LPHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIPHR-E 1503
                            G     + + G    +S     + DE   PFP        H+ E
Sbjct: 912  ----------------GTYSNVVGMNGAQGPNSDRFSSLPDEHLNPFPRGPAHHNVHQGE 955

Query: 1504 FEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPG 1683
            FE+D + FPR SH +  P  K  + FPSS PLD GP  +  DG  RP +K  HGF  D G
Sbjct: 956  FEEDLKHFPRPSHLDTEPVPKSSSHFPSSRPLDRGPRGFGVDGAPRPLDKGSHGFNYDSG 1015

Query: 1684 LKMDSAVGSGPLRFLPPFHPNDV---GERGRPPTFPDDSMGRGDFGH-RADFSGPGAGPG 1851
            L M+   GS P RF PP+H +      +      + D   GR DF   R  F GP   PG
Sbjct: 1016 LNMEPLGGSAPPRFFPPYHHDKALHPSDAEVSLGYHDSLAGRSDFARTRPGFLGPPI-PG 1074

Query: 1852 YGRSRMDGFPPRSPGRDFPGLPSGTFGAF-------GDGGNSFPAESFGKSIHDSRFPVP 2010
            Y    MD   PRSP RD+PG+P+  FGA        G   + F  + F  S+ DSRFPV 
Sbjct: 1075 YDHRHMDNLAPRSPVRDYPGMPTRRFGALPGLDDIDGRDPHRF-GDKFSSSLRDSRFPVF 1133

Query: 2011 PNHLQRGEIDVPGNLRVGGPRNQDML-----PNHLRR-DLVGPRNMHMGDPTKPRMGEPP 2172
            P+HL+RGE++ PGNL +G   + D++     P HLRR + +GPRN+    P+   +GEP 
Sbjct: 1134 PSHLRRGELEGPGNLHMGEHLSGDLMGHDGRPAHLRRGEHLGPRNL----PSHLWVGEPG 1189

Query: 2173 LARNFPQHLPFGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWK 2352
                FP H   GE  G     H   GEP              GG Y  +++ FD+ RK K
Sbjct: 1190 NFGAFPGHARMGELAGPGNFYHHQLGEPGFRSSF--------GGNYAGDLQFFDNSRKRK 1241

Query: 2353 PVGIMCRICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDG 2532
            P    CRICKV+C TVE LDLHSQ+REHQ+ A DMV+                +    D 
Sbjct: 1242 PSMGWCRICKVDCETVEALDLHSQTREHQKMALDMVVTIKQNAKKHKSTPCHHS-SLEDK 1300

Query: 2533 GRPRNSSFQGRGNK 2574
             + RN+SF+GRGNK
Sbjct: 1301 SKSRNASFEGRGNK 1314


>gb|EOY33856.1| Uncharacterized protein isoform 7 [Theobroma cacao]
          Length = 975

 Score =  261 bits (667), Expect = 1e-66
 Identities = 270/907 (29%), Positives = 356/907 (39%), Gaps = 83/907 (9%)
 Frame = +1

Query: 103  YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282
            Y QPQQ  A   HA               P HG Q       AA                
Sbjct: 178  YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 220

Query: 283  XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462
                   V+P  L    PS   N +   + Q +G   QP      ++  D+ V +   D 
Sbjct: 221  ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 275

Query: 463  NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603
            ++P    K     ++    GA V   N AK+ A+        T D G D+N   +S   +
Sbjct: 276  SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 335

Query: 604  KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717
               ES  + G       +P   +  T    ED KDV                  K   +Q
Sbjct: 336  P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 393

Query: 718  ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813
            E+K    Q  K           G P  P  N       S+QV             +VDQG
Sbjct: 394  EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 453

Query: 814  RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984
            R+Q   + YG +  Q RP   ++ ++ P   P ++Q P +                 +N 
Sbjct: 454  RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 513

Query: 985  QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164
             P S    P               G  GP+    N G      PP  +G P+   GEP  
Sbjct: 514  PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 551

Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344
            G S+      A DSH                        P+  P S   +   ++     
Sbjct: 552  GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 589

Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500
            D     P   G+             DS S   +R ER  P  +E   + P    HR    
Sbjct: 590  DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 636

Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680
            +FE+D + FPR SH +  P  K G+   SS PLD GPH +  D   R  EK PHGF  DP
Sbjct: 637  QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 696

Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860
                   +GSGP RFLPP+HP+D GE  RP   P D++GR DF         G  P YGR
Sbjct: 697  ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 740

Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040
             RMDGF  RSPGR++PG+    FG  G  G+        +     RFP  P HL RG  +
Sbjct: 741  HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 795

Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205
                +  +LR     NQD  P + RR + VG  NM    P   R+GEP    +F  H   
Sbjct: 796  SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 851

Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCR 2373
            GE FGG  PG   HP  GEP            ++GG Y   M+ F++ RK KP+ +  CR
Sbjct: 852  GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCR 908

Query: 2374 ICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSS 2553
            ICK++C TVEGLDLHSQ+REHQ+ A DMV+                +    D  + +N  
Sbjct: 909  ICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQNAKKQKLTSSDHSIR-NDTSKSKNVK 967

Query: 2554 FQGRGNK 2574
            F+GR NK
Sbjct: 968  FEGRVNK 974


>gb|EOY33851.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508786596|gb|EOY33852.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508786597|gb|EOY33853.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 1408

 Score =  261 bits (667), Expect = 1e-66
 Identities = 270/907 (29%), Positives = 356/907 (39%), Gaps = 83/907 (9%)
 Frame = +1

Query: 103  YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282
            Y QPQQ  A   HA               P HG Q       AA                
Sbjct: 611  YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 653

Query: 283  XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462
                   V+P  L    PS   N +   + Q +G   QP      ++  D+ V +   D 
Sbjct: 654  ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 708

Query: 463  NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603
            ++P    K     ++    GA V   N AK+ A+        T D G D+N   +S   +
Sbjct: 709  SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 768

Query: 604  KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717
               ES  + G       +P   +  T    ED KDV                  K   +Q
Sbjct: 769  P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 826

Query: 718  ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813
            E+K    Q  K           G P  P  N       S+QV             +VDQG
Sbjct: 827  EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 886

Query: 814  RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984
            R+Q   + YG +  Q RP   ++ ++ P   P ++Q P +                 +N 
Sbjct: 887  RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 946

Query: 985  QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164
             P S    P               G  GP+    N G      PP  +G P+   GEP  
Sbjct: 947  PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 984

Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344
            G S+      A DSH                        P+  P S   +   ++     
Sbjct: 985  GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 1022

Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500
            D     P   G+             DS S   +R ER  P  +E   + P    HR    
Sbjct: 1023 DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 1069

Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680
            +FE+D + FPR SH +  P  K G+   SS PLD GPH +  D   R  EK PHGF  DP
Sbjct: 1070 QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 1129

Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860
                   +GSGP RFLPP+HP+D GE  RP   P D++GR DF         G  P YGR
Sbjct: 1130 ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 1173

Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040
             RMDGF  RSPGR++PG+    FG  G  G+        +     RFP  P HL RG  +
Sbjct: 1174 HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 1228

Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205
                +  +LR     NQD  P + RR + VG  NM    P   R+GEP    +F  H   
Sbjct: 1229 SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 1284

Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCR 2373
            GE FGG  PG   HP  GEP            ++GG Y   M+ F++ RK KP+ +  CR
Sbjct: 1285 GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCR 1341

Query: 2374 ICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSS 2553
            ICK++C TVEGLDLHSQ+REHQ+ A DMV+                +    D  + +N  
Sbjct: 1342 ICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQNAKKQKLTSSDHSIR-NDTSKSKNVK 1400

Query: 2554 FQGRGNK 2574
            F+GR NK
Sbjct: 1401 FEGRVNK 1407


>gb|EOY33857.1| Uncharacterized protein isoform 8 [Theobroma cacao]
          Length = 972

 Score =  260 bits (664), Expect = 3e-66
 Identities = 270/907 (29%), Positives = 355/907 (39%), Gaps = 83/907 (9%)
 Frame = +1

Query: 103  YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282
            Y QPQQ  A   HA               P HG Q       AA                
Sbjct: 178  YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 220

Query: 283  XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462
                   V+P  L    PS   N +   + Q +G   QP      ++  D+ V +   D 
Sbjct: 221  ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 275

Query: 463  NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603
            ++P    K     ++    GA V   N AK+ A+        T D G D+N   +S   +
Sbjct: 276  SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 335

Query: 604  KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717
               ES  + G       +P   +  T    ED KDV                  K   +Q
Sbjct: 336  P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 393

Query: 718  ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813
            E+K    Q  K           G P  P  N       S+QV             +VDQG
Sbjct: 394  EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 453

Query: 814  RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984
            R+Q   + YG +  Q RP   ++ ++ P   P ++Q P +                 +N 
Sbjct: 454  RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 513

Query: 985  QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164
             P S    P               G  GP+    N G      PP  +G P+   GEP  
Sbjct: 514  PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 551

Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344
            G S+      A DSH                        P+  P S   +   ++     
Sbjct: 552  GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 589

Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500
            D     P   G+             DS S   +R ER  P  +E   + P    HR    
Sbjct: 590  DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 636

Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680
            +FE+D + FPR SH +  P  K G+   SS PLD GPH +  D   R  EK PHGF  DP
Sbjct: 637  QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 696

Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860
                   +GSGP RFLPP+HP+D GE  RP   P D++GR DF         G  P YGR
Sbjct: 697  ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 740

Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040
             RMDGF  RSPGR++PG+    FG  G  G+        +     RFP  P HL RG  +
Sbjct: 741  HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 795

Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205
                +  +LR     NQD  P + RR + VG  NM    P   R+GEP    +F  H   
Sbjct: 796  SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 851

Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCR 2373
            GE FGG  PG   HP  GEP            ++GG Y   M+ F++ RK KP+ +  CR
Sbjct: 852  GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCR 908

Query: 2374 ICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSS 2553
            ICK++C TVEGLDLHSQ+REHQ+ A DMV+                     D  + +N  
Sbjct: 909  ICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQNAKKQKLDHSIR----NDTSKSKNVK 964

Query: 2554 FQGRGNK 2574
            F+GR NK
Sbjct: 965  FEGRVNK 971


>ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa]
            gi|222845587|gb|EEE83134.1| hypothetical protein
            POPTR_0001s25430g [Populus trichocarpa]
          Length = 1327

 Score =  252 bits (643), Expect = 7e-64
 Identities = 255/854 (29%), Positives = 340/854 (39%), Gaps = 27/854 (3%)
 Frame = +1

Query: 94   HGPYMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXX 273
            HGP    QQP+ A+ H             N     GA  + AQS A              
Sbjct: 568  HGPVQSFQQPSHAYPHPQQ----------NVPLPRGAHPHQAQSLAVGTGVSPHGVLSVQ 617

Query: 274  XXXXXXXXXXVRPPQLNQTYPS----RVNNQMSAASEQQAGHLQQP--SRPTDVENPRDQ 435
                       RP Q+     S    + NNQ+  +SEQQA    +P   R  D+E   + 
Sbjct: 618  SYPQSTAVMQARPVQIGANQQSGNILKTNNQVEFSSEQQAWVASRPISERQGDIEKGAEG 677

Query: 436  VVDKSMHDQNNPNKVAKNLVGGSGAGVLMNEAK-MNAESTLDSGFDANDNKVSGMGSKPL 612
              + S H  N   K    L  G GA    +E K + +ES L    D  +NK +G      
Sbjct: 678  --ESSAH--NTIKKELNELDAGLGASA--SEMKTIKSESDLKQVDD--ENKPTG------ 723

Query: 613  ESDASEGVLEPSPGSKSTKIGAEDHKDVRKKSEVQESKQTAKSGAPNMPQSNLSTQVHGT 792
            E+    G    + G  S K   EDH+DV  K +   +    K       + +LS  + G 
Sbjct: 724  EAKDIPGAPAAANGEPSIKQVKEDHRDVTDKQKDISNADQKKV------ELSLSEYMDGK 777

Query: 793  NASVDQGRNQLHPIQYGPSVQLRPGAVSMSKSTP--HPFNSQQPTIXXXXXXXXXXXXXF 966
            +            ++  PS        S    TP    F    P                
Sbjct: 778  DGL---------SLETAPSHLEEQSKKSQKDKTPTSQGFGGFPP---------------- 812

Query: 967  NSADNSQPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASM-PPYQAGQPQN 1143
            N    SQP S+      V         +  G +    RP    + QA   PP+    P +
Sbjct: 813  NGHMQSQPVSV------VDQGKLHPLPIHQGPAALQQRPVGPSWLQAPHGPPHHMQLPGH 866

Query: 1144 PAGEPFGGSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPE 1323
            P       S      PG + SH G  +     +   P    E     V     F  ++P 
Sbjct: 867  PP------SHHGRLPPGHMPSHYGPPQ---GPYTHAPTSQGERTSSYVHETSMFGNQRP- 916

Query: 1324 SLPHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIPHR- 1500
            S P G          + G+   A+   G    +S       DE   PFP +  +R  H+ 
Sbjct: 917  SYPGG----------RQGILSNAVGTNGAQDPNSDRFRSFPDEHLNPFPHDPARRNAHQG 966

Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680
            EFE+D + F   S  +  P  K G  F SS PLD GPH +  DG  +  +K  HG   D 
Sbjct: 967  EFEEDLKHFTAPSCLDTKPVPKSGGHFSSSRPLDRGPHGFGVDGAPKHLDKGSHGLNYDS 1026

Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPP---TFPDDSMGRGDFGH-RADFSGPGAGP 1848
            GL ++   GS P RF PP H +    R        F D+  GR DF   R    GP   P
Sbjct: 1027 GLNVEPLGGSAPPRFFPPIHHDRTLHRSEAEGSLGFHDNLAGRTDFARTRPGLLGPPM-P 1085

Query: 1849 GYGRSRMDGFPPRSPGRDFPGLPSGTFGA---FGDGGNSFPAES---FGKSIHDSRFPVP 2010
            GY    MD   PRSPGRD+PG+    FGA     D     P  S      S+HDSRFP+ 
Sbjct: 1086 GYDHRDMDNLAPRSPGRDYPGMSMQRFGALPGLDDIDGRAPQRSSDPITSSLHDSRFPLF 1145

Query: 2011 PNHLQRGEIDVPGNLRVGGPRNQDML-----PNHLRR-DLVGPRNMHMGDPTKPRMGEPP 2172
            P+HL+RGE++ PGN  +G   + D++     P HLRR + +GPRN     P+  R+GE  
Sbjct: 1146 PSHLRRGELNGPGNFHMGEHLSGDLMGHDGWPAHLRRGERLGPRN----PPSHLRLGERG 1201

Query: 2173 LARNFPQHLPFGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWK 2352
               +FP H   GE  G     H   GEP              GG Y  +++  ++ RK K
Sbjct: 1202 GFGSFPGHARMGELAGPGNLYHQQLGEPGFRSSF--------GGSYAGDLQYSENSRKRK 1253

Query: 2353 PVGIMCRICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDG 2532
                 CRICKV+C T EGLDLHSQ+REHQ+ A DMV+                +    D 
Sbjct: 1254 SSMGWCRICKVDCETFEGLDLHSQTREHQKMAMDMVVTIKQNVKKHKSAPSDHS-SLEDT 1312

Query: 2533 GRPRNSSFQGRGNK 2574
             + RN+SF+GRGNK
Sbjct: 1313 SKLRNASFEGRGNK 1326


>gb|EMJ06149.1| hypothetical protein PRUPE_ppa000292mg [Prunus persica]
          Length = 1334

 Score =  236 bits (603), Expect = 3e-59
 Identities = 220/747 (29%), Positives = 300/747 (40%), Gaps = 43/747 (5%)
 Frame = +1

Query: 349  NQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQNNPNKVAKNLVGGSGAGVLMNE 528
            NQ +       G     S PT  E   +Q  +     Q N  KV  ++  G+ + V+ + 
Sbjct: 643  NQNNMVRTNNLGQSGANSGPTTSERQAEQ--ESEFSAQQNAKKVVHDV--GTASAVVADA 698

Query: 529  AKMNAESTLDSGFDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTKIGAEDHKDVRKKS 708
                A+S  D     N+NK +G   K ++ D S   +   P   + + G    K + K+ 
Sbjct: 699  EVKTAKSETDMKSIDNENKPTGE-DKTIQGDTSSKEI---PDIHALENGESVSKSILKEE 754

Query: 709  EVQESKQTAKSGAPNMPQSNLSTQVHGTNASV--DQGRNQLHPIQYGPSVQLRPGAVSMS 882
             V  +   +     +M Q  L  ++    A +  +QG          P   +     S +
Sbjct: 755  GVDGTLDHSNVSISDMKQRELK-EIPSEEAQLREEQGWMLQKDASGDPQPFIGTDEGSQA 813

Query: 883  KSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADNSQPSSIKHPPGRVPHENASGAMVTPGL 1062
             ST  P + Q   +                     P  ++ PPG   H    G  + P  
Sbjct: 814  VSTSAPISDQGKHLPHHGPTTLPQRP-------GAPLLLQVPPGPPCHTQGPGHHLRP-- 864

Query: 1063 SGPFPRPDNMGYYQASMPPYQAGQPQNPAGEP--FGGSSFAAQRPGALDSHVGVREREPA 1236
             GP   P           P+ + +   P G    FG SS  A + G   S          
Sbjct: 865  PGPAHVPGQ---------PFHSSEHFQPHGGNLGFGASSGRASQYGPQGS---------I 906

Query: 1237 DFEQRPPYPMENE-KFPVQRPGSFDGRKPESLPHGSLDRAAYGPVQPGVQLGAMKIGGPP 1413
            + +   P+   NE   P+    +FD         G + RAA      G+    +++ G P
Sbjct: 907  ELQSVTPHGPYNEGHLPLPPTSAFDSHG------GMMSRAAPIGQPSGIHPNMLRMNGTP 960

Query: 1414 AHDSMSAPGMRDERGMPFPEERLKRIP---------HREFEDDPRKFPRSSHFEAGPSSK 1566
              DS S  G RDER   FP ERL   P           EFEDD ++FPR S+ ++ P +K
Sbjct: 961  GLDSSSTHGPRDERFKAFPGERLNPFPVDPTRHVIDRVEFEDDLKQFPRPSYLDSEPVAK 1020

Query: 1567 LGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPF--- 1737
             G                     SRPF++APHGF+ D G   D   G+ P RFL P+   
Sbjct: 1021 FGNY------------------SSRPFDRAPHGFKYDSGPHTDPLAGTAPSRFLSPYRLG 1062

Query: 1738 ---HPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGRSRMDGFPPRSPGRDFP 1908
               H ND G+ GR     + + G  DF               GR  +DG  PRSP RD+P
Sbjct: 1063 GSVHGNDAGDFGRM----EPTHGHPDF--------------VGRRLVDGLAPRSPVRDYP 1104

Query: 1909 GLPSGTFGAFGDG---GNSFP--AESFGKSIHDSRFPVPPNHLQRGEIDVPGNLRVGGPR 2073
            GLP   F  FG     G  F    +  G   H+ RF   P H +RGE + PGNLR+   R
Sbjct: 1105 GLPPHGFRGFGPDDFDGREFHRFGDPLGNQFHEGRFSNLPGHFRRGEFEGPGNLRMVDHR 1164

Query: 2074 NQDML-----PNHLRR-DLVGPRNM-----------HMGDPTKPRMGEPPLARNFPQHLP 2202
              D +     P HLRR D +GP N+           HMGD   P   EP           
Sbjct: 1165 RNDFIGQDGHPGHLRRGDHLGPHNLREPLGFGSRHSHMGDMAGPGNFEP----------- 1213

Query: 2203 FGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCRIC 2379
                F G +P HP  GEP            ++G  Y  ++E FD  RK KP  +  CRIC
Sbjct: 1214 ----FRGNRPNHPRLGEPGFRSSFSLQRFPNDGT-YTGDLESFDHSRKRKPASMGWCRIC 1268

Query: 2380 KVECGTVEGLDLHSQSREHQRKARDMV 2460
            KV+C TVEGLDLHSQ+REHQ+ A DMV
Sbjct: 1269 KVDCETVEGLDLHSQTREHQKMAMDMV 1295


>ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214768 [Cucumis sativus]
          Length = 1177

 Score =  223 bits (569), Expect = 3e-55
 Identities = 225/737 (30%), Positives = 310/737 (42%), Gaps = 51/737 (6%)
 Frame = +1

Query: 517  LMNEAKMNAESTLDSGFDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTKIGAEDHKDV 696
            L+ E K N E   +    + D ++    SK +++D S G   PS G+  ++ GA     +
Sbjct: 502  LVIENKGNQE---EFKISSQDTELREEQSKRMQNDTS-GTPHPSSGTNESQQGATTTSSL 557

Query: 697  RKKSEVQESKQTAKSGAPNMPQSNLSTQVHGTNAS-----VDQGRNQLHPIQYGPSVQLR 861
               S    ++   +   P  PQ+   TQ+     S     V   R+Q  P  Y  S  L+
Sbjct: 558  ILGSPGMLNQHGYQDKNP--PQTG-GTQIGAAVTSHPASLVAHTRHQTPPSSYVSSA-LQ 613

Query: 862  PGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADNSQP--SSIKHPPGRVPHENA 1035
             G  + S   P P    Q                   A   QP   S     G +P E+ 
Sbjct: 614  HGVAAPSLPGPPPGPYHQAQFSNNPSMQVRPRAPGLVAHPGQPFNPSESFHLGGIP-ESG 672

Query: 1036 SGAMVTPGLSGPFPRP------DNMGYYQASMPPYQAGQPQNPAGEPFGGSSFAAQRPGA 1197
            S +    GL    P+        +   Y  S P    G  +   G+P G + F ++ PGA
Sbjct: 673  SASSFGRGLGQYGPQQALERSIGSQATYSLSQPSASQGGSKMSLGDPVG-AHFRSKLPGA 731

Query: 1198 LDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSLDRAAYGPVQPG 1377
             DS   +   E     QRP +P+E E F  QRP   D   P ++ H       + P   G
Sbjct: 732  FDSRGLLHAPEAQIGVQRPIHPLEAEIFSNQRP-RLDSHLPGTMEH-------HPPHLTG 783

Query: 1378 VQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP---------HREFEDDPRKFP 1530
            +    + + G P  DS S  G+RDER     EE+L   P           + ED  R+FP
Sbjct: 784  IPPNVLPLNGAPGPDSSSKLGLRDERFKLLHEEQLNSFPLDPARRPINQTDAEDILRQFP 843

Query: 1531 RSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPGLKMDSAVGS 1710
            R SH E+  + ++G                      RPF++  HG   D GL +D A  S
Sbjct: 844  RPSHLESELAQRIGNY------------------SLRPFDRGVHGQNFDTGLTIDGAAAS 885

Query: 1711 GPLRFLPPFH------PNDVGERGRPPTFPDDSMGRGDF--GHRADFSGPGAGPGYGRSR 1866
               R LPP H      P D     RP  F +DS G+ D   GH +DF  PG+   YGR  
Sbjct: 886  ---RVLPPRHIGGALYPTDAE---RPIAFYEDSTGQADRSRGH-SDFPAPGS---YGRRF 935

Query: 1867 MDGFPPRSPGRDFPGLPSGTFGAFGD---GGNSFPAESFGK--SIHDSRFPVPPNHLQRG 2031
            +DGF PRSP  ++ G   G  G  G     G  FP   FG   S  +SRFP+  +HLQRG
Sbjct: 936  VDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQDFP-HHFGDPLSFRESRFPIFRSHLQRG 994

Query: 2032 EIDVPGNLRVG---------------GPRNQDMLPNHLRRDLVGPRNMHMGDPTKPRMGE 2166
            + +  GN R+                GPR+   LP HLR   +G        P   R+G+
Sbjct: 995  DFESSGNFRMSEHLRTGDLIGQDRHFGPRS---LPGHLR---LGELTAFGSHPGHSRIGD 1048

Query: 2167 PPLARNFPQHLPFGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRK 2346
              +  NF    PFG   GG +P +P  GEP             +G F+  ++E FD+ RK
Sbjct: 1049 LSVLGNFE---PFG---GGHRPNNPRLGEPGFRSSFSRQGLVDDGRFFAGDVESFDNSRK 1102

Query: 2347 WKPVGI-MCRICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEG 2523
             KP+ +  CRICKV+C TVEGL+LHSQ+REHQ+ A DMV                 + E 
Sbjct: 1103 RKPISMGWCRICKVDCETVEGLELHSQTREHQKMAMDMVQSIKQNAKKHKVTPNDHSSE- 1161

Query: 2524 RDGGRPRNSSFQGRGNK 2574
               G+ +N   + RG K
Sbjct: 1162 --DGKSKNVGLESRGKK 1176


>ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205914 [Cucumis sativus]
          Length = 1434

 Score =  223 bits (569), Expect = 3e-55
 Identities = 225/737 (30%), Positives = 310/737 (42%), Gaps = 51/737 (6%)
 Frame = +1

Query: 517  LMNEAKMNAESTLDSGFDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTKIGAEDHKDV 696
            L+ E K N E   +    + D ++    SK +++D S G   PS G+  ++ GA     +
Sbjct: 759  LVIENKGNQE---EFKISSQDTELREEQSKRMQNDTS-GTPHPSSGTNESQQGATTTSSL 814

Query: 697  RKKSEVQESKQTAKSGAPNMPQSNLSTQVHGTNAS-----VDQGRNQLHPIQYGPSVQLR 861
               S    ++   +   P  PQ+   TQ+     S     V   R+Q  P  Y  S  L+
Sbjct: 815  ILGSPGMLNQHGYQDKNP--PQTG-GTQIGAAVTSHPASLVAHTRHQTPPSSYVSSA-LQ 870

Query: 862  PGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADNSQP--SSIKHPPGRVPHENA 1035
             G  + S   P P    Q                   A   QP   S     G +P E+ 
Sbjct: 871  HGVAAPSLPGPPPGPYHQAQFSNNPSMQVRPRAPGLVAHPGQPFNPSESFHLGGIP-ESG 929

Query: 1036 SGAMVTPGLSGPFPRP------DNMGYYQASMPPYQAGQPQNPAGEPFGGSSFAAQRPGA 1197
            S +    GL    P+        +   Y  S P    G  +   G+P G + F ++ PGA
Sbjct: 930  SASSFGRGLGQYGPQQALERSIGSQATYSLSQPSASQGGSKMSLGDPVG-AHFRSKLPGA 988

Query: 1198 LDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSLDRAAYGPVQPG 1377
             DS   +   E     QRP +P+E E F  QRP   D   P ++ H       + P   G
Sbjct: 989  FDSRGLLHAPEAQIGVQRPIHPLEAEIFSNQRP-RLDSHLPGTMEH-------HPPHLTG 1040

Query: 1378 VQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP---------HREFEDDPRKFP 1530
            +    + + G P  DS S  G+RDER     EE+L   P           + ED  R+FP
Sbjct: 1041 IPPNVLPLNGAPGPDSSSKLGLRDERFKLLHEEQLNSFPLDPARRPINQTDAEDILRQFP 1100

Query: 1531 RSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDPGLKMDSAVGS 1710
            R SH E+  + ++G                      RPF++  HG   D GL +D A  S
Sbjct: 1101 RPSHLESELAQRIGNY------------------SLRPFDRGVHGQNFDTGLTIDGAAAS 1142

Query: 1711 GPLRFLPPFH------PNDVGERGRPPTFPDDSMGRGDF--GHRADFSGPGAGPGYGRSR 1866
               R LPP H      P D     RP  F +DS G+ D   GH +DF  PG+   YGR  
Sbjct: 1143 ---RVLPPRHIGGALYPTDAE---RPIAFYEDSTGQADRSRGH-SDFPAPGS---YGRRF 1192

Query: 1867 MDGFPPRSPGRDFPGLPSGTFGAFGD---GGNSFPAESFGK--SIHDSRFPVPPNHLQRG 2031
            +DGF PRSP  ++ G   G  G  G     G  FP   FG   S  +SRFP+  +HLQRG
Sbjct: 1193 VDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQDFP-HHFGDPLSFRESRFPIFRSHLQRG 1251

Query: 2032 EIDVPGNLRVG---------------GPRNQDMLPNHLRRDLVGPRNMHMGDPTKPRMGE 2166
            + +  GN R+                GPR+   LP HLR   +G        P   R+G+
Sbjct: 1252 DFESSGNFRMSEHLRTGDLIGQDRHFGPRS---LPGHLR---LGELTAFGSHPGHSRIGD 1305

Query: 2167 PPLARNFPQHLPFGESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRK 2346
              +  NF    PFG   GG +P +P  GEP             +G F+  ++E FD+ RK
Sbjct: 1306 LSVLGNFE---PFG---GGHRPNNPRLGEPGFRSSFSRQGLVDDGRFFAGDVESFDNSRK 1359

Query: 2347 WKPVGI-MCRICKVECGTVEGLDLHSQSREHQRKARDMVLXXXXXXXXXXXXXXXATFEG 2523
             KP+ +  CRICKV+C TVEGL+LHSQ+REHQ+ A DMV                 + E 
Sbjct: 1360 RKPISMGWCRICKVDCETVEGLELHSQTREHQKMAMDMVQSIKQNAKKHKVTPNDHSSE- 1418

Query: 2524 RDGGRPRNSSFQGRGNK 2574
               G+ +N   + RG K
Sbjct: 1419 --DGKSKNVGLESRGKK 1433


>emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]
          Length = 1131

 Score =  221 bits (562), Expect = 2e-54
 Identities = 188/582 (32%), Positives = 248/582 (42%), Gaps = 28/582 (4%)
 Frame = +1

Query: 802  VDQGRNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSADN 981
            +D GR+Q  P+QYGP+VQ RP A S  ++ P P       +                A  
Sbjct: 591  LDGGRHQPPPMQYGPTVQQRPAAPSSGQAMPPPGLVHNAPVPGQPSTQLQP-----QALG 645

Query: 982  SQPSSIKHPPGRVPHENASGAMVTPGLSGPF----------------PRPDNMGYY-QAS 1110
              P   +   G   HE   G ++ PG +  F                P   + G+Y Q  
Sbjct: 646  LLPHPAQQSRGSFHHEIPPGGILGPGSAASFGRGLSHFAPPQRSFEPPSVVSQGHYNQGH 705

Query: 1111 MPPYQAGQPQNPAGEPFGGSSFAAQRPGALDSHVGVREREPA---DFEQRPPYPMENEKF 1281
              P  AG  +   GE  G         G+ DSH G+  R P    D +QRP  P+E+E F
Sbjct: 706  GLPSHAGPSRISQGELIGRPPLGPLPAGSFDSHGGMMVRAPPHGPDGQQRPVNPVESEIF 765

Query: 1282 PVQRPGSFDGRKPESLPHGSLDRAAYGPVQP-GVQLGAMKIGGPPAHDSMSAPGMRDERG 1458
               RP  FDGR+ +S   GS +R  +G  QP G Q   M++ G    +S    G++DER 
Sbjct: 766  SNPRPNYFDGRQSDSHIPGSSERGPFG--QPSGXQSNMMRMNGGLGIESSLPVGLQDERF 823

Query: 1459 MPFPEERLKRIPHREFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPS 1638
               PE   +   H +F +D ++F RSSH ++    K G  F SS PLD G   +  D   
Sbjct: 824  KSLPEPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFGNYFSSSRPLDRGSQGFVMDAAQ 883

Query: 1639 RPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHR 1818
               +KAP GF  D G K  S+ G+G  R       +D+          DD  GR      
Sbjct: 884  GLLDKAPLGFNYDSGFK--SSAGTGTSR------QSDL----------DDIDGR------ 919

Query: 1819 ADFSGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSR 1998
                    G GY    +     R     FP LPS                         R
Sbjct: 920  ---ESRRFGEGYQTFNLPSDESR-----FPVLPS-----------------------HLR 948

Query: 1999 FPVPPNHLQRGE----IDVPGNLRVGGPRNQDMLPNHLRRDLVGPRNMHMGDPTKPRMGE 2166
              + P+HLQRGE     ++PG LR G P     L +                   PRMGE
Sbjct: 949  RDILPSHLQRGEHFGSRNIPGQLRFGEPVFDAFLGH-------------------PRMGE 989

Query: 2167 PPLARNFPQHLPFGESFGG-EKPGHPLAGEPXXXXXXXXXXXXHEGGFY-PDEMEPFDDP 2340
                 NFP  L  GESFGG  K GHP  GEP            ++ GF  P +ME FD+ 
Sbjct: 990  LSGPGNFPSRLSAGESFGGSNKSGHPRIGEPGFRSTYSLHGYPNDHGFRPPGDMESFDNS 1049

Query: 2341 RKWKPVGI-MCRICKVECGTVEGLDLHSQSREHQRKARDMVL 2463
            RK KP+ +  CRIC ++C TV+GLD+HSQ+REHQ+ A D+VL
Sbjct: 1050 RKRKPLSMAWCRICNIDCETVDGLDMHSQTREHQQMAMDIVL 1091


>ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227701 [Cucumis sativus]
          Length = 538

 Score =  220 bits (561), Expect = 2e-54
 Identities = 179/530 (33%), Positives = 239/530 (45%), Gaps = 38/530 (7%)
 Frame = +1

Query: 1099 YQASMPPYQAGQPQNPAGEPFGGSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEK 1278
            Y  S P    G  +   G+P G + F ++ PGA DS   +   E     QRP +P+E E 
Sbjct: 61   YSLSQPSASQGGSKMSLGDPVG-AHFRSKLPGAFDSRGLLHAPEAQIGVQRPIHPLEAEI 119

Query: 1279 FPVQRPGSFDGRKPESLPHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERG 1458
            F  QRP   D   P ++ H       + P   G+    + + G P  DS S  G+RDER 
Sbjct: 120  FSNQRP-RLDSHLPGTMEH-------HPPHLTGIPPNVLPLNGAPGPDSSSKLGLRDERF 171

Query: 1459 MPFPEERLKRIP---------HREFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGP 1611
                EE+L   P           + ED  R+FPR SH E+  + ++G             
Sbjct: 172  KLLHEEQLNSFPLDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY----------- 220

Query: 1612 HVYAGDGPSRPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPFH------PNDVGERGRPP 1773
                     RPF++  HG   D GL +D A  S   R LPP H      P D     RP 
Sbjct: 221  -------SLRPFDRGVHGQNFDTGLTIDGAAAS---RVLPPRHIGGALYPTDAE---RPI 267

Query: 1774 TFPDDSMGRGDF--GHRADFSGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGAFGD- 1944
             F +DS G+ D   GH +DF  PG+   YGR  +DGF PRSP  ++ G   G  G  G  
Sbjct: 268  AFYEDSTGQADRSRGH-SDFPAPGS---YGRRFVDGFGPRSPLHEYHGRGFGGRGFTGVE 323

Query: 1945 --GGNSFPAESFGK--SIHDSRFPVPPNHLQRGEIDVPGNLRVG---------------G 2067
               G  FP   FG   S  +SRFP+  +HLQRG+ +  GN R+                G
Sbjct: 324  EIDGQDFP-HHFGDPLSFRESRFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQDRHFG 382

Query: 2068 PRNQDMLPNHLRRDLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPFGESFGGEKPGHPLA 2247
            PR+   LP HLR   +G        P   R+G+  +  NF    PFG   GG +P +P  
Sbjct: 383  PRS---LPGHLR---LGELTAFGSHPGHSRIGDLSVLGNFE---PFG---GGHRPNNPRL 430

Query: 2248 GEPXXXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCRICKVECGTVEGLDLHSQ 2424
            GEP             +G F+  ++E FD+ RK KP+ +  CRICKV+C TVEGL+LHSQ
Sbjct: 431  GEPGFRSSFSRQGLVDDGRFFAGDVESFDNSRKRKPISMGWCRICKVDCETVEGLELHSQ 490

Query: 2425 SREHQRKARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSSFQGRGNK 2574
            +REHQ+ A DMV                 + E    G+ +N   + RG K
Sbjct: 491  TREHQKMAMDMVQSIKQNAKKHKVTPNDHSSE---DGKSKNVGLESRGKK 537


>gb|EXB30469.1| hypothetical protein L484_006018 [Morus notabilis]
          Length = 1320

 Score =  214 bits (545), Expect = 2e-52
 Identities = 240/821 (29%), Positives = 321/821 (39%), Gaps = 72/821 (8%)
 Frame = +1

Query: 322  NQTYPSRVNNQMSAASEQQAG-------HLQQPSRPTDVENPRDQVVDKSMHDQNNPNKV 480
            NQ    + NNQM   SE+ +G        ++Q ++     + + +VV  S    +   KV
Sbjct: 584  NQNNILKTNNQMKLPSEEHSGANSTATMSIRQGNQDFVKGSAQQEVVASS----HKTVKV 639

Query: 481  AKNLVGGSGAGVLMNEAKMNAESTLDSGFDANDNKVSGMGSKPLESDAS-EGVLEPSPGS 657
              N    S   +L N  ++  E +        D K +    KP+  +   E  L+ S   
Sbjct: 640  GTNN-SDSVLDLLANVGEVKTEKS------KTDLKSTDPVVKPMMKEEDVESTLKNSSNG 692

Query: 658  KSTKIGAEDHKDVRK------KSEVQESKQTAKSGAPNMPQSNLSTQVHGTNASVDQ--- 810
            KS K+ AED KDV K      K+   E K    S     P   +         SV     
Sbjct: 693  KSGKVVAEDKKDVLKVEPEKMKNSTVEDKDVGGSLQKKSPLQAVERHEGQGGDSVKDAAS 752

Query: 811  GRNQLHPIQYGPSVQ-LRPGAVSMSKSTP---------HPFNSQQPTIXXXXXXXXXXXX 960
            G ++   +   PS Q LR  A      +P         H      P              
Sbjct: 753  GSDRASKVVPTPSAQILRSPASGGEVKSPYSRSVQVQGHQLPGPPPLSQVPPPGPPHKTQ 812

Query: 961  XFNSADN----SQPSSIKHPPGRVPHENASGAMVTPGLSGPFPR-PDNMGYYQAS--MPP 1119
             F ++        P    HPPG +P           G + PF R P+  G  Q S  +  
Sbjct: 813  EFGASQTHCRPQVPGDPLHPPGSIP-----------GSAIPFGRGPNQYGPNQQSSELQS 861

Query: 1120 YQAGQPQNPA---------GEPFGGSSFAAQRPGALDSHVGVREREPADFEQRPPYPMEN 1272
                +P NP          GEP G  S    +P A +SH G+  R         P P   
Sbjct: 862  LAPQRPYNPGPFGAFRLSQGEPTGAESSGVLQPRAFNSHGGMMAR---------PTPHGP 912

Query: 1273 EKFPVQRPGSFDGRKPESLPHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDE 1452
            E F  QRP   D R P+    GSL+  A+     G+     ++      DS+S  G RDE
Sbjct: 913  EMFSNQRPDFMDSRGPDPHFAGSLEHGAHSQ-SFGIHPNMTRMNDSHGFDSLSTLGPRDE 971

Query: 1453 RGMPFPEERLKRIPHREFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDG 1632
            R  PFP       P  EFEDD ++FPR                                 
Sbjct: 972  RFNPFPAGPN---PRAEFEDDLKQFPR--------------------------------- 995

Query: 1633 PSRPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPFHPNDVGERG-RPPTFPDDSMGRGD- 1806
               PF++  HG +   GLKMDS VGS P R L P++     + G R      D+ GR D 
Sbjct: 996  ---PFDRGLHGLKYHTGLKMDSGVGSVPSRSLSPYNGGGANDGGDRLGWHRGDAFGRMDP 1052

Query: 1807 -FGHRADFSGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPA------ 1965
              GH  DF GPG G  Y R RMD    RSP R+ PG+     G  G G +          
Sbjct: 1053 TRGH-LDFLGPGLG--YDRRRMDSLASRSPIREHPGI--SLRGFVGPGPDDIHGRELRRF 1107

Query: 1966 -ESFGKSIHDSRFPVPPNHLQRGEIDVPGNLRVGGPRNQDMLPNHLRRDLVGPRNM---- 2130
             E F  S H+SRF + P HL+RGE + P N+ +G         +HLR DL+G   +    
Sbjct: 1108 GEPFDSSFHESRFSMLPGHLRRGEFEGPRNMGMG---------DHLRNDLIGRDGLSGPL 1158

Query: 2131 ----HMGD-PTKPRMGEPPLARNFPQHLPFGE--------SFG-GEKPGHPLAGEPXXXX 2268
                HMGD      +GEP       +H    E        SFG G+ P  P  GEP    
Sbjct: 1159 RWGEHMGDFHGHFHLGEPVGFGAHSRHARIREIGGPGSFDSFGRGDGPSFPHLGEPGFRS 1218

Query: 2269 XXXXXXXXHEGGFYPDEMEPFDDPRKWK-PVGIMCRICKVECGTVEGLDLHSQSREHQRK 2445
                       G + +++  FD  RK K P    CRICKV+C TVEGL+LHSQ+REHQ+ 
Sbjct: 1219 RFSSHGFPTGDGIFTEDLA-FDKSRKRKLPTMGWCRICKVDCETVEGLELHSQTREHQKM 1277

Query: 2446 ARDMVLXXXXXXXXXXXXXXXATFEGRDGGRPRNSSFQGRG 2568
            A DMV+                +  G D  +PR++  +G G
Sbjct: 1278 AMDMVVAIKQNAKKQKLTFGDQSSLG-DASQPRSAGTEGHG 1317


>ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314450 [Fragaria vesca
            subsp. vesca]
          Length = 1316

 Score =  209 bits (531), Expect = 7e-51
 Identities = 226/765 (29%), Positives = 318/765 (41%), Gaps = 52/765 (6%)
 Frame = +1

Query: 322  NQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQNNPNKVAKNLVGG 501
            NQ    R NNQ+   +          SRPT    P ++  + S  +      V+  +V  
Sbjct: 610  NQNNIGRTNNQVQPGAN---------SRPTMTTRPAEKEAELSAKNGAQDVGVSSAVVAD 660

Query: 502  SGAGVLMNEAKMNAESTLDSGFDANDNKVSGMGSKPLESDASEGVLEPSPGSKSTKIGAE 681
            S A  + +E  ++ +ST D    +++++ S  G+K  E   S+G+L  +  S+S     E
Sbjct: 661  SEAKTVKSE--VDIKSTDDGNKPSSEDR-SYQGTK--EIPESKGMLGANGESESKPTLKE 715

Query: 682  DHKDVRKKSEVQESK--QTAKSGAPNMPQSNLS-----------TQVHGTN--------A 798
            +  D   + ++   K  +    GA + P S +             Q+HG          +
Sbjct: 716  EGVDSTLE-DLSNGKLGELVAEGAKDAPSSGMKLGEHKEMPPEEAQLHGVKDKKLQKVVS 774

Query: 799  SVDQGRNQLHPIQYGPSVQLRPGAVSMSKSTPHPFNSQQPTIXXXXXXXXXXXXXFNSAD 978
            S ++G +Q   I   P  Q++ G + M  S P     QQ                 +   
Sbjct: 775  STEEG-SQTVSISSAPIGQVQAGGL-MQPSHPGSAILQQKPGAPPLLQVPSSGPPHHILG 832

Query: 979  NSQPSSIKHP--PGRVP------HENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQ 1134
            + QP +   P  PG VP       E+        G +         G Y  S  P  +G 
Sbjct: 833  SGQPLAHVRPQGPGHVPGHPSHLSEHFQSPRGNLGFAASSANASQHGPYNQSHAPPHSGA 892

Query: 1135 PQNPAGEPFGGSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGR 1314
            P+ P   PF      A  P A DSH G+  R         PY  E +   +QRP      
Sbjct: 893  PRGP---PF------APPPSAFDSHGGIMARAA-------PYGHEGQ-MGLQRPAF---- 931

Query: 1315 KPESLPHGSLDRAAYGPVQP-GVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRI 1491
                     +++ A G  QP G+    +++ G P  +S S  G+RDER    P+ RL   
Sbjct: 932  --------QMEQGATG--QPSGIISNMLRMNGNPGFESSSTLGLRDERFKALPDGRLNPF 981

Query: 1492 PHRE--------FEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPF 1647
            P           FEDD ++FPR S  ++ P  KLG                     SR F
Sbjct: 982  PGDPTRVISRVGFEDDLKQFPRPSFLDSEPLPKLGNY------------------SSRAF 1023

Query: 1648 EKAPHGFERDPGLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADF 1827
            ++ P G   D  L +D A GS P RFL P+     G  G      +D++G  DFG     
Sbjct: 1024 DRRPFGVNYDTRLNIDPAAGSAP-RFLSPY-----GHAGL--IHANDTIGHPDFG----- 1070

Query: 1828 SGPGAGPGYGRSRMDGFPPRSPGRDFPGLPSGTFGAFGDG---GNSFP--AESFGKSIHD 1992
                     GR  MDG   RSP RD+PG+PS  F  FG     G  F    +  G+  HD
Sbjct: 1071 ---------GRRLMDGLARRSPIRDYPGIPS-RFRGFGPDDFDGREFHRFGDPLGREFHD 1120

Query: 1993 SRFPVPPNHLQRGEIDVPGNLRVGGPRNQDMLPN-----HLRR-DLVGPRNMHMGDPTKP 2154
            +RFP    H +RGE + PGN+RV      D++       HL+R + +GP N+    P   
Sbjct: 1121 NRFP--NQHFRRGEFEGPGNMRVDDRMRNDLIGQDGHLGHLQRGEHLGPHNL----PGHL 1174

Query: 2155 RMGEPPLARNFPQHLPFG--ESFGGEKPGHPLAGEPXXXXXXXXXXXXHEGGFYPDEMEP 2328
             M E       P+H   G  ESF G +  HP  GEP            ++G  Y  E+E 
Sbjct: 1175 HMREHVGFGVHPRHAGPGSFESFIGNRANHPRLGEPGFRSSFSLKRFPNDGT-YAGELES 1233

Query: 2329 FDDPRKWKPVGI-MCRICKVECGTVEGLDLHSQSREHQRKARDMV 2460
            FD  RK KP  +  CRICKV C TVEGLD+HSQ+REHQR A +MV
Sbjct: 1234 FDHSRKRKPASMGWCRICKVNCETVEGLDVHSQTREHQRMAMEMV 1278


>gb|EOY33855.1| Uncharacterized protein isoform 6 [Theobroma cacao]
          Length = 1345

 Score =  197 bits (502), Expect = 2e-47
 Identities = 233/818 (28%), Positives = 305/818 (37%), Gaps = 82/818 (10%)
 Frame = +1

Query: 103  YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282
            Y QPQQ  A   HA               P HG Q       AA                
Sbjct: 611  YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 653

Query: 283  XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462
                   V+P  L    PS   N +   + Q +G   QP      ++  D+ V +   D 
Sbjct: 654  ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 708

Query: 463  NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603
            ++P    K     ++    GA V   N AK+ A+        T D G D+N   +S   +
Sbjct: 709  SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 768

Query: 604  KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717
               ES  + G       +P   +  T    ED KDV                  K   +Q
Sbjct: 769  P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 826

Query: 718  ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813
            E+K    Q  K           G P  P  N       S+QV             +VDQG
Sbjct: 827  EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 886

Query: 814  RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984
            R+Q   + YG +  Q RP   ++ ++ P   P ++Q P +                 +N 
Sbjct: 887  RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 946

Query: 985  QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164
             P S    P               G  GP+    N G      PP  +G P+   GEP  
Sbjct: 947  PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 984

Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344
            G S+      A DSH                        P+  P S   +   ++     
Sbjct: 985  GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 1022

Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500
            D     P   G+             DS S   +R ER  P  +E   + P    HR    
Sbjct: 1023 DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 1069

Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680
            +FE+D + FPR SH +  P  K G+   SS PLD GPH +  D   R  EK PHGF  DP
Sbjct: 1070 QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 1129

Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860
                   +GSGP RFLPP+HP+D GE  RP   P D++GR DF         G  P YGR
Sbjct: 1130 ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 1173

Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040
             RMDGF  RSPGR++PG+    FG  G  G+        +     RFP  P HL RG  +
Sbjct: 1174 HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 1228

Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205
                +  +LR     NQD  P + RR + VG  NM    P   R+GEP    +F  H   
Sbjct: 1229 SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 1284

Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFY 2310
            GE FGG  PG   HP  GEP            ++GG Y
Sbjct: 1285 GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIY 1319


>gb|EOY33854.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 1358

 Score =  197 bits (502), Expect = 2e-47
 Identities = 233/818 (28%), Positives = 305/818 (37%), Gaps = 82/818 (10%)
 Frame = +1

Query: 103  YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282
            Y QPQQ  A   HA               P HG Q       AA                
Sbjct: 611  YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 653

Query: 283  XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462
                   V+P  L    PS   N +   + Q +G   QP      ++  D+ V +   D 
Sbjct: 654  ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 708

Query: 463  NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603
            ++P    K     ++    GA V   N AK+ A+        T D G D+N   +S   +
Sbjct: 709  SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 768

Query: 604  KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717
               ES  + G       +P   +  T    ED KDV                  K   +Q
Sbjct: 769  P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 826

Query: 718  ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813
            E+K    Q  K           G P  P  N       S+QV             +VDQG
Sbjct: 827  EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 886

Query: 814  RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984
            R+Q   + YG +  Q RP   ++ ++ P   P ++Q P +                 +N 
Sbjct: 887  RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 946

Query: 985  QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164
             P S    P               G  GP+    N G      PP  +G P+   GEP  
Sbjct: 947  PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 984

Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344
            G S+      A DSH                        P+  P S   +   ++     
Sbjct: 985  GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 1022

Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500
            D     P   G+             DS S   +R ER  P  +E   + P    HR    
Sbjct: 1023 DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 1069

Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680
            +FE+D + FPR SH +  P  K G+   SS PLD GPH +  D   R  EK PHGF  DP
Sbjct: 1070 QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 1129

Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860
                   +GSGP RFLPP+HP+D GE  RP   P D++GR DF         G  P YGR
Sbjct: 1130 ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 1173

Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040
             RMDGF  RSPGR++PG+    FG  G  G+        +     RFP  P HL RG  +
Sbjct: 1174 HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 1228

Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205
                +  +LR     NQD  P + RR + VG  NM    P   R+GEP    +F  H   
Sbjct: 1229 SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 1284

Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFY 2310
            GE FGG  PG   HP  GEP            ++GG Y
Sbjct: 1285 GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIY 1319


>gb|EOY33850.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 1326

 Score =  197 bits (502), Expect = 2e-47
 Identities = 233/818 (28%), Positives = 305/818 (37%), Gaps = 82/818 (10%)
 Frame = +1

Query: 103  YMQPQQPTAAHGHAXXXXXXXXXXXXNYAPGHGAQHNMAQSYAARPFXXXXXXXXXXXXX 282
            Y QPQQ  A   HA               P HG Q       AA                
Sbjct: 611  YAQPQQNVAG-SHAVHFHPSHNLVGRPMTPNHGVQSQPYPHSAA---------------- 653

Query: 283  XXXXXXXVRPPQLNQTYPSRVNNQMSAASEQQAGHLQQPSRPTDVENPRDQVVDKSMHDQ 462
                   V+P  L    PS   N +   + Q +G   QP      ++  D+ V +   D 
Sbjct: 654  ----GTPVKPVHLGANQPSSYQNNVFRTNNQ-SGVTSQPMSEVPGDHGTDKNVAEQEADS 708

Query: 463  NNPNKVAK-----NLVGGSGAGVL-MNEAKMNAES-------TLDSGFDANDNKVSGMGS 603
            ++P    K     ++    GA V   N AK+ A+        T D G D+N   +S   +
Sbjct: 709  SSPGTARKEANELDMASSLGADVAEKNTAKLEADLKSVDEKLTGDVGDDSNGVDISTKET 768

Query: 604  KPLESDASEGV-----LEPSPGSKSTKIGAEDHKDVR-----------------KKSEVQ 717
               ES  + G       +P   +  T    ED KDV                  K   +Q
Sbjct: 769  P--ESRRTVGTDLEQHRDPVSKNMVTCEAIEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQ 826

Query: 718  ESK----QTAK----------SGAPNMPQSN------LSTQVHGTN--------ASVDQG 813
            E+K    Q  K           G P  P  N       S+QV             +VDQG
Sbjct: 827  EAKLGEEQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSVPNVDQG 886

Query: 814  RNQLHPIQYGPSV-QLRPGAVSMSKSTPH--PFNSQQPTIXXXXXXXXXXXXXFNSADNS 984
            R+Q   + YG +  Q RP   ++ ++ P   P ++Q P +                 +N 
Sbjct: 887  RHQPLQMPYGSNNNQQRPAVSAILQAPPPGLPSHAQTPGLPPNQFRPQGPGQALVPPENL 946

Query: 985  QPSSIKHPPGRVPHENASGAMVTPGLSGPFPRPDNMGYYQASMPPYQAGQPQNPAGEPFG 1164
             P S    P               G  GP+    N G      PP  +G P+   GEP  
Sbjct: 947  PPGSFGRDPSNY------------GPQGPY----NQG------PPSLSGAPRISQGEPLV 984

Query: 1165 GSSFAAQRPGALDSHVGVREREPADFEQRPPYPMENEKFPVQRPGSFDGRKPESLPHGSL 1344
            G S+      A DSH                        P+  P S   +   ++     
Sbjct: 985  GLSYGTPPLTAFDSHGA----------------------PLYGPESHSVQHSANMVDYHA 1022

Query: 1345 DRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIP----HR---- 1500
            D     P   G+             DS S   +R ER  P  +E   + P    HR    
Sbjct: 1023 DNRQLDPRASGL-------------DSTSTFSLRGERLKPVQDECSNQFPLDRGHRGDRG 1069

Query: 1501 EFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGPHVYAGDGPSRPFEKAPHGFERDP 1680
            +FE+D + FPR SH +  P  K G+   SS PLD GPH +  D   R  EK PHGF  DP
Sbjct: 1070 QFEEDLKHFPRPSHLDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDP 1129

Query: 1681 GLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDSMGRGDFGHRADFSGPGAGPGYGR 1860
                   +GSGP RFLPP+HP+D GE  RP   P D++GR DF         G  P YGR
Sbjct: 1130 ------MIGSGPSRFLPPYHPDDTGE--RPVGLPKDTLGRPDF--------LGTVPSYGR 1173

Query: 1861 SRMDGFPPRSPGRDFPGLPSGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEID 2040
             RMDGF  RSPGR++PG+    FG  G  G+        +     RFP  P HL RG  +
Sbjct: 1174 HRMDGFVSRSPGREYPGISPHGFG--GHPGDEIDGR---ERRFSDRFPGLPGHLHRGGFE 1228

Query: 2041 ----VPGNLRVGGPRNQDMLPNHLRR-DLVGPRNMHMGDPTKPRMGEPPLARNFPQHLPF 2205
                +  +LR     NQD  P + RR + VG  NM    P   R+GEP    +F  H   
Sbjct: 1229 SSDRMEEHLRSRDMINQDNRPAYFRRGEHVGHHNM----PGHLRLGEPIGFGDFSSHERI 1284

Query: 2206 GESFGGEKPG---HPLAGEPXXXXXXXXXXXXHEGGFY 2310
            GE FGG  PG   HP  GEP            ++GG Y
Sbjct: 1285 GE-FGG--PGNFRHPRLGEPGFRSSFSLQEFPNDGGIY 1319


>ref|XP_006848046.1| hypothetical protein AMTR_s00029p00190880 [Amborella trichopoda]
            gi|548851351|gb|ERN09627.1| hypothetical protein
            AMTR_s00029p00190880 [Amborella trichopoda]
          Length = 1626

 Score =  184 bits (466), Expect = 2e-43
 Identities = 175/589 (29%), Positives = 235/589 (39%), Gaps = 59/589 (10%)
 Frame = +1

Query: 988  PSSIKHPPGRVPHENASGAMVTPGLSGPFPR-PDNMGYYQASMPPY-QAGQPQNPAGEPF 1161
            P  I+ PPG   H       V  G  G   R P+ +G    S+PP   +  P  P  +  
Sbjct: 1071 PDMIEKPPGPPLHHGPLHPGVQTGGPGDIGRGPNQLGMPPPSLPPQGHSSVPMYPPSKHA 1130

Query: 1162 GGSSFAAQRPGALDSHVGVREREPA---DFEQRPPYPMEN-EKFPVQRPGSFDGRKPESL 1329
             G        G  D    +  R P    D +   P PM++ + F   RPG FDGR+P+  
Sbjct: 1131 PGERLPGPPSGPFDGPGSMMPRAPVHGIDNQMGRP-PMDHVDTFLKNRPGYFDGRQPDVH 1189

Query: 1330 PHGSLDRAAYGPVQPGVQLGAMKIGGPPAHDSMSAPGMRDERGMPFPEERLKRIPH---- 1497
                 DRA YG V      G+         +S    G+ +ER  P PE+R K +P     
Sbjct: 1190 QSLPSDRAPYGLVNGAAGKGSN------VPESAFPHGLPEERFGPLPEDRFKHLPEDGLK 1243

Query: 1498 ----------------------REFEDDPRKFPRSSHFEAGPSSKLGTQFPSSGPLDHGP 1611
                                  REFE+D +KFPRS H +  P+S+    F S  P  H P
Sbjct: 1244 KPLPDDHFRPYALDPSRRAIDRREFEEDLKKFPRSGHLDGEPASRYDGYFSSRNPSGHSP 1303

Query: 1612 HVYAGDGPSRPFEKAPHGFERDPGLKMDSAVGSGPLRFLPPFHPNDVGERGRPPTFPDDS 1791
                  G +    + P G    P        G   L         D+G+R +P  F  D 
Sbjct: 1304 RSLERPGLNLDAPRYPEGMSVPPY----RGAGGSSL---------DLGDRSKPGGFHGDL 1350

Query: 1792 MGR--GDFGHRADFSGPGAGPGYGRSRMDGF-PPRSPGRDFPGLP--------------- 1917
            +GR     G R+D+ GP   P   RS  DG  PPRSP RD+ G+                
Sbjct: 1351 IGRKLDTTGARSDYGGPF--PEVSRSHRDGLGPPRSPVRDYAGVRVSGVRPDYAGIPHPL 1408

Query: 1918 SGTFGAFGDGGNSFPAESFGKSIHDSRFPVPPNHLQRGEIDVPGNLRVGGPRNQDMLPNH 2097
             G  G    G     A +F   IH  + P  P      E  +P   R+         P H
Sbjct: 1409 DGLGGREPLGFGEQRARAFLDPIHGGKIPSGPF-----ESRLPIPSRIAESAGFGDFPGH 1463

Query: 2098 LRR-DLVGPRNMHMGD-PTKPRMGEPPLARNFPQHLPFGESFGG----EKPGHPLAGEPX 2259
            LR  D  GP +   G+ P+  R  E   + N P HL  GE+ G      +PG  + G P 
Sbjct: 1464 LRGGDPFGPSHFRSGELPSHLRGRELAGSGNLPPHLRIGEAMGPGGHLREPGFGMQGYPK 1523

Query: 2260 XXXXXXXXXXXHEGGFYPDEMEPFDDPRKWKPVGI-MCRICKVECGTVEGLDLHSQSREH 2436
                       + G F P +++  +  RK KP     CRICKV+C TVEGLDLHSQ+REH
Sbjct: 1524 DGGFY------NPGSFPPSDVDALEYSRKRKPGSTGWCRICKVDCETVEGLDLHSQTREH 1577

Query: 2437 QRKARDMVLXXXXXXXXXXXXXXXAT--FEGRDGGRPRNSSFQGRGNKR 2577
            Q+ A DMVL               +       +  + R +SF+ RG++R
Sbjct: 1578 QKMAMDMVLSIKQDSAKKQKLYGSSEDHVPQEEPTKGRRASFESRGSRR 1626

