BLASTX nr result

ID: Catharanthus22_contig00009450 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00009450
         (3408 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006349926.1| PREDICTED: uncharacterized protein LOC102586...   533   e-148
ref|XP_004252993.1| PREDICTED: uncharacterized protein LOC101261...   529   e-147
emb|CBI17315.3| unnamed protein product [Vitis vinifera]              482   e-133
emb|CAN62042.1| hypothetical protein VITISV_006702 [Vitis vinifera]   444   e-121
ref|XP_006420909.1| hypothetical protein CICLE_v10004416mg [Citr...   407   e-110
ref|XP_006493818.1| PREDICTED: intracellular protein transport p...   401   e-109
gb|EXC16951.1| Lysine-specific demethylase 3B [Morus notabilis]       387   e-104
ref|XP_006493819.1| PREDICTED: intracellular protein transport p...   386   e-104
ref|XP_002323777.2| hypothetical protein POPTR_0017s08220g [Popu...   380   e-102
gb|EOY05294.1| Uncharacterized protein TCM_020328 [Theobroma cacao]   371   1e-99
ref|XP_006840152.1| hypothetical protein AMTR_s00089p00065300 [A...   326   4e-86
ref|XP_004134377.1| PREDICTED: uncharacterized protein LOC101204...   323   2e-85
ref|XP_004300993.1| PREDICTED: uncharacterized protein LOC101300...   289   5e-75
gb|EOX93928.1| Uncharacterized protein TCM_002927 [Theobroma cacao]   287   3e-74
ref|XP_002266466.2| PREDICTED: uncharacterized protein LOC100250...   281   1e-72
gb|ESW06083.1| hypothetical protein PHAVU_010G018600g [Phaseolus...   267   2e-68
ref|XP_002518099.1| hypothetical protein RCOM_1019660 [Ricinus c...   267   2e-68
gb|EMJ28544.1| hypothetical protein PRUPE_ppa002653mg [Prunus pe...   262   7e-67
ref|XP_002518101.1| conserved hypothetical protein [Ricinus comm...   248   2e-62
ref|XP_002868347.1| hypothetical protein ARALYDRAFT_355453 [Arab...   240   4e-60

>ref|XP_006349926.1| PREDICTED: uncharacterized protein LOC102586054 [Solanum tuberosum]
          Length = 745

 Score =  533 bits (1372), Expect = e-148
 Identities = 350/789 (44%), Positives = 465/789 (58%), Gaps = 28/789 (3%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFF-IQ 2823
            MAC+  Y WSLSG+VGAFVDL+I+Y LLCA+ +AF ASKFL  FGL LPCPCDGL F   
Sbjct: 1    MACEGRYMWSLSGIVGAFVDLAIAYFLLCAATVAFLASKFLDFFGLRLPCPCDGLLFGTV 60

Query: 2822 PSRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEF---QDQNCNMNFRLVG-EKESD--- 2664
            P+RN CFH+L +D+PAEKVSNVQLSI++ FPFN+    +DQNC++N+RL+G EKE+    
Sbjct: 61   PNRNLCFHRLLVDFPAEKVSNVQLSIKANFPFNDTILGKDQNCDVNWRLIGHEKENSPHG 120

Query: 2663 VIELEGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKS 2484
             +E+  EASCSSVSD+RK  N   I  EL+ RNE G             +KGK +M+Q+ 
Sbjct: 121  YLEMGDEASCSSVSDARKSHNIAMI--ELSPRNEFG-------------IKGKGVMNQRQ 165

Query: 2483 RSGLRHRR-KGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGN 2307
            R G+R RR K + ++G+ SS +SYDP++ +    P SPPS +K   G             
Sbjct: 166  RGGVRRRRRKAAVDYGRSSSVSSYDPQYEEFPLGPPSPPSTNKEDGG------------- 212

Query: 2306 GSHSGNYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSVETGQSSSDDKY 2127
                          MR G+R S  +N S DE E   KN++ + EL+ + E   SS  ++ 
Sbjct: 213  ---------HPPLVMRLGQRDSFELNGSSDEVEHTEKNIASIEELRHNGEP-VSSFHEEN 262

Query: 2126 TIXXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKR 1947
             I                    L+KER +         AMI RLQEEKA+IEM++RQY+R
Sbjct: 263  RIRFLEQALEHEREARDALCIELEKERNAAASAADEAMAMILRLQEEKASIEMDARQYQR 322

Query: 1946 LIEEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATE--NIAGMLD 1773
            LIEEKSA++AEEMNIL EIV+R EREKHFLEKE+EVYRQM  LGNE+   +  N+   L 
Sbjct: 323  LIEEKSAFEAEEMNILMEIVMRTEREKHFLEKELEVYRQMTYLGNEESTVDSGNVVDALR 382

Query: 1772 SQAFDSLLNSNEDPVMMLRHLSASIDKKNGAIGASKDSSVQGD-ENAARLTGDL------ 1614
                 S LN  EDP++ML  +SA  +K+      + +  +  D +N   L G+       
Sbjct: 383  RPVASSDLN--EDPLLMLHQISAFFNKRTVVENRNSEEVISLDKQNYIALGGEALIQRQN 440

Query: 1613 ------PQFSVGSNDENQELKDK-MVPIDNNSCAPLGNMTSVETLTWSSKVSSSVGKLPE 1455
                   Q  +  +  +QE ++K MV + N+S    GN   ++T     +   S  KLP+
Sbjct: 441  KDVNSQKQVDLAEHSCSQEFQEKEMVFMVNHSDVGPGNGKILDTSLKPCETGLSEQKLPD 500

Query: 1454 ENISLIGKG-KEQVDRDTELLEKNFETSHIVXXXXXXXXXXXDTKQRTKTSCN-SSNKEP 1281
            + ISL G+  KE  D +T     +     +               Q +K  CN +S+KEP
Sbjct: 501  QAISLEGEVLKENPDMETS----DRACIDVSRKDKCLKYHETVGYQGSKCPCNMTSDKEP 556

Query: 1280 CVHDVHVI-DQSNSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQRVTAIRDTPTPSNM 1104
             VHDVHVI D SN  N +S G+  +     + +  +  EA  +       IRD P+ S +
Sbjct: 557  RVHDVHVIVDGSNFCNDVSSGESRKSALEFSGKTSLPIEASPTQ----DVIRDRPSTSTL 612

Query: 1103 DVGVGINRSTSDTTSRLPPIVPKGKSVVSDLRRNSLSAVDNERLKLDIEIGRLRERLKIV 924
               V +  S +DTT  LPP+   GK ++ D R NS+S+VDNERLK+++E+ RLR RLKIV
Sbjct: 613  YTQVDLKIS-ADTTGGLPPVGSCGKPLLCDSRGNSISSVDNERLKIEMEVERLRGRLKIV 671

Query: 923  QEGREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQASLPLPSSKVITKK 744
            QEGREKL+L+ EH+ER  +QLKLLEDIAHQLQE+RQL EP KAVRQASLPLP SK ++KK
Sbjct: 672  QEGREKLDLTAEHRERGKMQLKLLEDIAHQLQEIRQLAEPEKAVRQASLPLPFSKGMSKK 731

Query: 743  RRCRSVSSG 717
            RR RSVS G
Sbjct: 732  RRSRSVSVG 740


>ref|XP_004252993.1| PREDICTED: uncharacterized protein LOC101261797 [Solanum
            lycopersicum]
          Length = 745

 Score =  529 bits (1363), Expect = e-147
 Identities = 356/794 (44%), Positives = 465/794 (58%), Gaps = 33/794 (4%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFF-IQ 2823
            M+C+  Y WSLSG+VGAFVDL+I+Y LLCA+ +AF ASKFL  FGL LPCP DGL F   
Sbjct: 1    MSCEGRYMWSLSGIVGAFVDLAIAYFLLCAATVAFIASKFLDFFGLRLPCPGDGLLFGTV 60

Query: 2822 PSRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEF---QDQNCNMNFRLVGEKESD---- 2664
            P+RN CFH+L +D+PAEKVSNVQLSIR+ FPF +    +DQNC++N RL+G+++ +    
Sbjct: 61   PNRNLCFHRLLVDFPAEKVSNVQLSIRANFPFTDTILGKDQNCDLNLRLIGQEKGNSPHG 120

Query: 2663 VIELEGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKS 2484
             +E+  EASCSSVSD+RK  N   I  EL+ RNE G+             KGK +M+Q+ 
Sbjct: 121  YLEMGDEASCSSVSDARKSHNIAMI--ELSPRNEFGQ-------------KGKGVMNQRQ 165

Query: 2483 RSGLRHRR-KGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGN 2307
            R G+R RR K + ++G+ SS +SYDP++      P SPPS +K   G  A          
Sbjct: 166  RGGVRRRRRKTAVDYGRSSSVSSYDPQYEDFPLGPPSPPSTNKEDGGHPA---------- 215

Query: 2306 GSHSGNYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSVETGQSSSDDKY 2127
                          MR G+R S  +  S DE E I KNV+ + EL+ + E   SS  +  
Sbjct: 216  ------------LVMRLGQRDSFELTGSSDEIEHIEKNVASIEELRHNGEP-VSSFHEGN 262

Query: 2126 TIXXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKR 1947
             I                    L+KER +         AMI RLQEEKAAIEM++RQY+R
Sbjct: 263  RIRLLERALEHEREARDALCIELEKERNAAASAADEAMAMILRLQEEKAAIEMDARQYQR 322

Query: 1946 LIEEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATE--NIAGMLD 1773
            LIEEKSA++AEEMNIL EI++R EREKHFLEKE+EVYRQM  LGNE+   +  N+   L 
Sbjct: 323  LIEEKSAFEAEEMNILMEILMRTEREKHFLEKELEVYRQMTYLGNEEPTGDSGNVVDALR 382

Query: 1772 SQAFDSLLNSNEDPVMMLRHLSASIDKKNGAIGA-SKDSSVQGDENAARLTGDLP----- 1611
                   LN  EDP++ML  +SAS +K+  A    S++ S    +N   L G+ P     
Sbjct: 383  RHVASPDLN--EDPLLMLHQISASFNKRTVAENKNSEEVSSLDKQNYIALAGEAPIQRQN 440

Query: 1610 -------QFSVGSNDENQELKDK-MVPIDNNSCAPLGNMTSVETLTWSSKVSSSVGKLPE 1455
                   Q  +  +  +QE ++K MV + N+S    GN   ++T     +   S  KLP+
Sbjct: 441  KDVNSQKQVDLTEHSCSQEYQEKEMVFMVNHSDVAPGNGKILDTSLKPCETGLSEQKLPD 500

Query: 1454 ENISLIGKG-KEQVDRDTELLEKNFETSHIVXXXXXXXXXXXDTKQRTKTSCN-SSNKEP 1281
            + I L G+  KE  D +T     +     +               Q +K  CN +S KEP
Sbjct: 501  QAIPLEGEVLKENPDMETS----DRACIDVSRKDKCLMYHETVGYQGSKCPCNLTSVKEP 556

Query: 1280 CVHDVHVI-DQSNSYNQISGGKEARMP-----KSSTPEIHMIKEAETSVLQRVTAIRDTP 1119
             VHDVHVI D SN  N +S G+  +       K+S P        E S  Q V  IRD P
Sbjct: 557  RVHDVHVIVDGSNFCNDVSNGESRKSALEFCGKTSHP-------VEASPTQDV--IRDRP 607

Query: 1118 TPSNMDVGVGINRSTSDTTSRLPPIVPKGKSVVSDLRRNSLSAVDNERLKLDIEIGRLRE 939
            + S +   V +  S SDTT  LPP+  +GK ++ D R NS+ +VDNERLK+++E+ RLRE
Sbjct: 608  STSTLYTQVDLKIS-SDTTGGLPPVGSRGKPLLRDSRGNSVCSVDNERLKIEMEVERLRE 666

Query: 938  RLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQASLPLPSSK 759
            RLKIVQEGREKL+L+ EH+ERE +QLKLLEDIAHQLQE+RQL EP KAVRQASLPLP SK
Sbjct: 667  RLKIVQEGREKLDLTAEHREREKMQLKLLEDIAHQLQEIRQLGEPEKAVRQASLPLPFSK 726

Query: 758  VITKKRRCRSVSSG 717
             ++KKRR RSVS G
Sbjct: 727  GMSKKRRSRSVSVG 740


>emb|CBI17315.3| unnamed protein product [Vitis vinifera]
          Length = 797

 Score =  482 bits (1240), Expect = e-133
 Identities = 314/804 (39%), Positives = 456/804 (56%), Gaps = 54/804 (6%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            MACQ +++W+  GLVGA++DL+I+YLLLC S +AF ASKFL  FGL LPCPC+G FF  P
Sbjct: 1    MACQEIHSWTFGGLVGAYLDLAIAYLLLCGSTLAFFASKFLSFFGLCLPCPCNG-FFGNP 59

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEF--QDQNCNMNFRLVGEKESD--VIEL 2652
            + + C  K  +DYP E++S+VQL ++SKFPF+     + + + N++L+  + SD   + L
Sbjct: 60   NGDNCLQKFLVDYPTERISSVQLCVKSKFPFDSVWANEGSPHPNWKLLKGRNSDDGAVGL 119

Query: 2651 EGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSGL 2472
            EGEASCSS  D  +  +    K  ++     G M     KEG+ D KGK + +Q+ ++G+
Sbjct: 120  EGEASCSSFWDVMRSPDIAG-KDSISRNGSCGVMNTPALKEGKSDTKGKRVSNQRPKTGV 178

Query: 2471 RHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHSG 2292
            R RR+ + +HGK+SS +S+DP    A     SP S+S+ G       ++   + +G   G
Sbjct: 179  RRRRRSAVDHGKFSSVSSFDPPRLDAPSGLRSPSSVSETGEAFVGKTLV--PDASGGEDG 236

Query: 2291 NYHEDATTAMRFGRR--HSNSINESPDEDETIAKNVSFLRELKGSVETGQS-SSDDKYTI 2121
               E     +  G R  H   +NE  DED+   K+ S   E+K +     S + + + T+
Sbjct: 237  FQDELVPILIDLGERALHGIKLNEHIDEDKPSEKDASSAEEVKCNARGKLSFNGNTENTV 296

Query: 2120 XXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRLI 1941
                                L+KER +         AMI R+QEEKA+IEME+RQ++R+I
Sbjct: 297  RVLEQALEEEHAARAALYHELEKERSAAASAADEAMAMILRIQEEKASIEMEARQFQRII 356

Query: 1940 EEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNE--QQATENIAGMLDSQ 1767
            EEKSAYDAEEMN+LKEI++RREREKHFLEKEVE YRQM+   N+  +  T +I    + +
Sbjct: 357  EEKSAYDAEEMNLLKEILLRREREKHFLEKEVEAYRQMMFSENDLLEGNTHDIVDTPEQR 416

Query: 1766 AFDSLLNSNEDPVMMLRHLSASIDK--------------------KNGAIGASKDSSVQG 1647
               SL  S EDPV+MLR +S SIDK                    K   +  SK+  +  
Sbjct: 417  PISSLYLS-EDPVLMLRRISESIDKEEKVKDADRCSVYESTSIEMKYPTLSFSKELPIPD 475

Query: 1646 DENAARLT-----------GDLPQFSVGSNDE-NQELKDK-MVPIDNNSCAPLGNMTSVE 1506
             E  A L+           G       G N + N+E ++K M+P+D N CA    +  + 
Sbjct: 476  WEEDADLSKGGEIHVNPNVGKHHSHKSGLNGKCNEEFQEKGMLPVDENQCAQKRGVQKLG 535

Query: 1505 TLTWSSKVSSSV-GKLPEENISLIGKGKEQVDRDTELLEKNFETS---------HIVXXX 1356
              +   + SSS      E+  + IG+ ++Q   + +L +    T+         H+    
Sbjct: 536  ACSQLYRSSSSQENSFLEKASAPIGEDQKQ-SGEIKLFQGIISTTTKTHAEAEMHVPHDG 594

Query: 1355 XXXXXXXXDTKQRTKTSCNSS-NKEPCVHDVHVIDQSNSYNQISGGKEARMPKSSTPEIH 1179
                         +K  C S+ + EP VHDVHVID  +  N  +   E+++ ++      
Sbjct: 595  EDLDKLGKTADHESKDHCCSAFDIEPRVHDVHVIDHES--NLCNEANESKIEQTPDIPAK 652

Query: 1178 MIKEAETSVLQRVTAIRDTPTPSNMDVGVGINRSTSDTTSRLPPIV-PKGKSVVSDLRRN 1002
            +    E S++QR+  + D P  S ++     ++S SD T+ LPP+   +GK+++SD+RR+
Sbjct: 653  LDPPVEASLIQRIGVVCDFPMMSTLESETNNDQSFSDITNGLPPLGGSRGKALLSDMRRH 712

Query: 1001 SLSAVDNERLKLDIEIGRLRERLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQEL 822
            S+S+VDNERLK++ E+ RLRERL+IVQEGREKLN SVEH+EREN+QL+LLEDIA QL+E+
Sbjct: 713  SISSVDNERLKIETEVERLRERLRIVQEGREKLNFSVEHRERENIQLQLLEDIASQLREI 772

Query: 821  RQLKEPGKAVRQASLPLPSSKVIT 750
            RQL EPGKAVRQASLP PSSK +T
Sbjct: 773  RQLTEPGKAVRQASLPPPSSKEMT 796


>emb|CAN62042.1| hypothetical protein VITISV_006702 [Vitis vinifera]
          Length = 829

 Score =  444 bits (1141), Expect = e-121
 Identities = 297/799 (37%), Positives = 438/799 (54%), Gaps = 52/799 (6%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            MACQ +++W+  GLVGA++DL+I+YLLLC S +AF ASKFL  FGL LPCPC+G FF  P
Sbjct: 1    MACQEIHSWTFGGLVGAYLDLAIAYLLLCGSTLAFFASKFLSFFGLCLPCPCNG-FFGNP 59

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEF--QDQNCNMNFRLVGEKESD--VIEL 2652
            + + C  K  +DYP E++S+VQL ++SKFPF+     + + + N++L+  + SD   + L
Sbjct: 60   NGDNCLQKFLVDYPTERISSVQLCVKSKFPFDSVWANEGSPHPNWKLLKGRNSDDGAVGL 119

Query: 2651 EGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSGL 2472
            EGEASCSS  D  +  +    K  ++     G M     KEG+ D KGK + +Q+ ++G+
Sbjct: 120  EGEASCSSFWDVMRSPDIAG-KDSISRNGSCGVMNTPALKEGKSDTKGKRVSNQRPKTGV 178

Query: 2471 RHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHSG 2292
            R RR+ + +HGK+SS +S+DP    A     SP S+S+ G       ++   + +G   G
Sbjct: 179  RRRRRSAVDHGKFSSVSSFDPPRLDAPSGLRSPSSVSETGEAFVGKTLV--PDASGGEDG 236

Query: 2291 NYHEDATTAMRFGRR--HSNSINESPDEDETIAKNVSFLRELKGSVETGQS-SSDDKYTI 2121
               E     +  G R  H   +NE  DED+   K+ S   E+K +     S + + + T+
Sbjct: 237  FQDELVPILIDLGERALHGIKLNEHIDEDKPSEKDASSAEEVKCNARGKLSFNGNTENTV 296

Query: 2120 XXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRLI 1941
                                L+KER +         AMI R+QEEKA+IEME+RQ++R+I
Sbjct: 297  RVLEQALEEEHAARAALYHELEKERSAAASAADEAMAMILRIQEEKASIEMEARQFQRII 356

Query: 1940 EEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGMLDSQAF 1761
            EEKSAYDAEEMN+LKEI+++  + + FL               E   T +I    + +  
Sbjct: 357  EEKSAYDAEEMNLLKEILLKERKGETFL--------------GEGSNTHDIVDTPEQRPI 402

Query: 1760 DSLLNSNEDPVMMLRHLSASIDK--------------------KNGAIGASKDSSVQGDE 1641
             SL  S EDPV+MLR +S SIDK                    K   +  SK+  +   E
Sbjct: 403  SSLYLS-EDPVLMLRRISESIDKEEKVKDADRCSVYESTSIEMKYPTLSFSKELPIPDWE 461

Query: 1640 NAARLT-----------GDLPQFSVGSNDE-NQELKDK-MVPIDNNSCAPLGNMTSVETL 1500
              A L+           G       G N + N+E ++K M+P+D N CA    +  +   
Sbjct: 462  EDADLSKGGEIHVNPNVGKHHSHKSGLNGKCNEEFQEKGMLPVDENQCAQKRGVQKLGAC 521

Query: 1499 TWSSKVSSSV-GKLPEENISLIGKGKEQVDRDTELLEKNFETS---------HIVXXXXX 1350
            +   + SSS      E+  + IG+ ++Q   + +L +    T+         H+      
Sbjct: 522  SQLYRSSSSQENSFLEKASAPIGEDQKQ-SGEIKLFQGIISTTTKTHAEAEMHVPHDGED 580

Query: 1349 XXXXXXDTKQRTKTSCNSS-NKEPCVHDVHVIDQSNSYNQISGGKEARMPKSSTPEIHMI 1173
                       +K  C S+ + EP VHDVHVID  +  N  +   E+++ ++      + 
Sbjct: 581  LDKLGKTADHESKDHCCSAFDIEPRVHDVHVIDHES--NLCNEANESKIEQTPDIPAKLD 638

Query: 1172 KEAETSVLQRVTAIRDTPTPSNMDVGVGINRSTSDTTSRLPPIV-PKGKSVVSDLRRNSL 996
               E S++QR+  + D P  S ++     ++S SD T+ LPP+   +GK+++SD+RR+S+
Sbjct: 639  PPVEASLIQRIGVVCDFPMMSTLESETNNDQSFSDITNGLPPLGGSRGKALLSDMRRHSI 698

Query: 995  SAVDNERLKLDIEIGRLRERLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQELRQ 816
            S+VDNERLK++ E+ RLRERL+IVQEGREKLN SVEH+EREN+QL+LLEDIA QL+E+RQ
Sbjct: 699  SSVDNERLKIETEVERLRERLRIVQEGREKLNFSVEHRERENIQLQLLEDIASQLREIRQ 758

Query: 815  LKEPGKAVRQASLPLPSSK 759
            L EPGKAVRQASLP PSSK
Sbjct: 759  LTEPGKAVRQASLPPPSSK 777


>ref|XP_006420909.1| hypothetical protein CICLE_v10004416mg [Citrus clementina]
            gi|557522782|gb|ESR34149.1| hypothetical protein
            CICLE_v10004416mg [Citrus clementina]
          Length = 738

 Score =  407 bits (1045), Expect = e-110
 Identities = 299/784 (38%), Positives = 412/784 (52%), Gaps = 25/784 (3%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            M CQA+  W+ S LVGAF++L+I+Y LLC SA+A+ ASKFLG+FGL LPCPC+G  F +P
Sbjct: 5    MMCQAIDVWTFSELVGAFLNLAIAYFLLCGSALAYFASKFLGLFGLSLPCPCNG-HFGKP 63

Query: 2819 SR---NFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEFQDQNCNMNFRLVGEKESDV--IE 2655
            ++     C+    +D P EK+SN+Q   ++KFPF+     N N   +   E+E D   + 
Sbjct: 64   NKISYGNCWQGFLVDCPTEKISNIQFLAKTKFPFDSILASNMNPQSK---EREFDKGHVA 120

Query: 2654 LEGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSG 2475
             EGE SC+S S                 R  +GR   +  KEGRFD KGK +   + R G
Sbjct: 121  SEGETSCASSS-----------------RERIGRDSSM--KEGRFDFKGKVVKSHRPRYG 161

Query: 2474 LRHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHS 2295
            +R  RK  F + K  S +S+D   S  Q    SP S SK G  ++    + +  G+ +  
Sbjct: 162  IRRHRKSVFRNEKSLSFSSFDQLVSDWQSVLSSPSSFSKIGTEISEGSSVPVHRGSETI- 220

Query: 2294 GNYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSV--ETGQSSSDDKYTI 2121
                ED+  A +     ++  NE  D++ T+ K+ S +  L   +  E G   +D   TI
Sbjct: 221  ----EDSRGASKEDVTMTSESNEPVDKNNTVEKDASSVEVLNCDLPGELGLDGNDIS-TI 275

Query: 2120 XXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRLI 1941
                                L+KER +         AMI RLQEEKA+IEME+RQY+R+I
Sbjct: 276  RSLEEALGEEHAARSALYLELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQRMI 335

Query: 1940 EEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGMLDSQAF 1761
            EEKSAYDAEEMNILKEI+IRREREKHFLE+EVE  RQM        A  +       Q  
Sbjct: 336  EEKSAYDAEEMNILKEIIIRREREKHFLEREVETLRQMFF-----DAETDDTATTQEQTA 390

Query: 1760 DSLLNSNEDPVMMLRHLSASIDKKNG-------------AIGASKDSSVQGDENAARLTG 1620
             S   S EDP+MML+ +S S+ +K               +IG+   S   G         
Sbjct: 391  TSSSYSCEDPLMMLQKISKSVVEKQKVKSENDFPDYEVTSIGSQNYSVTSGKGLLNPELD 450

Query: 1619 DLPQFSVGSNDENQELKDK-MVPIDNNSCAPLGNMTSVETLTWSSKVSSS---VGKLPEE 1452
            ++     G    +  L++K M  ++ N   P+      +    SS+++ S   V  + E+
Sbjct: 451  EVDSSKPGYIHRHPSLQEKGMESMEKN---PIQQTREEQIPAESSQLNCSTTQVRNVHEK 507

Query: 1451 NISLIGKGKEQVDRDTELLEKNFETSHIVXXXXXXXXXXXDTKQRTKTSCNSSNKEPCVH 1272
             I+L  + +E  D   + +E   ET  IV             K    +  +   ++ CV+
Sbjct: 508  IITLEVEKQEPADELKKAMEACDETKIIVKYSNDKVE-----KHGKGSQSSVPVRDLCVY 562

Query: 1271 DVHVI-DQSNSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQRVTAIRDTPTPSNMDVG 1095
            DVHVI D+  + N+ SG     + K+ T +I                  D+P     D  
Sbjct: 563  DVHVIGDELTTCNEDSGNNHGDLSKNLTLDI--------------PTRCDSPVIDRSDTE 608

Query: 1094 VGINRSTSDTTSRLPPIVPKGKSVVSDLRRNSLSAVDNERLKLDIEIGRLRERLKIVQEG 915
            V + RS SD  SR      +GK++++DLRRNS+SA D ERLK+D E+G LRERLKIVQEG
Sbjct: 609  VDMKRSISDLVSRQRSF-SQGKTLLTDLRRNSMSAFDYERLKIDNEVGCLRERLKIVQEG 667

Query: 914  REKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQASLPLPSSKVITKKRRC 735
            REKLN S  HK RE +QL+LLEDIA QL+E+RQL EPGKA RQASLP  S+K ++KKRR 
Sbjct: 668  REKLNFSKGHKGREKIQLQLLEDIASQLREIRQLTEPGKAARQASLPPLSTKAVSKKRRS 727

Query: 734  RSVS 723
            RS+S
Sbjct: 728  RSIS 731


>ref|XP_006493818.1| PREDICTED: intracellular protein transport protein USO1-like isoform
            X1 [Citrus sinensis]
          Length = 738

 Score =  401 bits (1031), Expect = e-109
 Identities = 300/785 (38%), Positives = 410/785 (52%), Gaps = 26/785 (3%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            M CQA+  W+ S LVGAF++L+I+Y LLC SA+A+ ASKFLG+FGL LPCPC+G  F +P
Sbjct: 5    MMCQAIDVWTFSELVGAFLNLAIAYFLLCGSALAYFASKFLGLFGLSLPCPCNG-HFGKP 63

Query: 2819 SR---NFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEFQDQNCNMNFRLVGEKESDV--IE 2655
            ++     C+    +D P EK+SN+Q   ++KFPF+     N N       E+E D   + 
Sbjct: 64   NKISYGNCWQGFLVDCPTEKISNIQFLAKTKFPFDSILASNMNPQSN---EREFDKGHVA 120

Query: 2654 LEGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSG 2475
             EGE SC+S S                 R  +GR   +  KEGRFD KGK +  Q+ R G
Sbjct: 121  SEGETSCASSS-----------------REIIGRDSSM--KEGRFDFKGKVVKSQRPRYG 161

Query: 2474 LRHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHS 2295
            +R  RK +F++ K  S +S+D   S  Q    SP S S  G  ++    + +  G+ +  
Sbjct: 162  IRRHRKSAFHNEKSVSFSSFDQLVSDWQSVLPSPSSFSNIGTEISEGSSVPVHRGSETI- 220

Query: 2294 GNYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSV--ETGQSSSDDKYTI 2121
                ED+  A +     ++  NE  D++ T+ K+ S +  L   +  E G   +D   TI
Sbjct: 221  ----EDSRGASKEDVTMTSESNEPVDKNNTVEKDASSVEVLNCDLPGELGLDGNDIS-TI 275

Query: 2120 XXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRLI 1941
                                L+KER +         AMI RLQEEKA+IEME+RQY+R+I
Sbjct: 276  RSLEEALEEEHAARSALYLELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQRMI 335

Query: 1940 EEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGMLDSQAF 1761
            EEKSAYDAEEMNILKEI+IRREREK+FLE+EVE  RQ+        A  +       Q  
Sbjct: 336  EEKSAYDAEEMNILKEIIIRREREKYFLEREVETLRQLFF-----DAETDDTATTQEQTA 390

Query: 1760 DSLLNSNEDPVMMLRHLSASIDKKNGAIGASKDSSVQGDENAARLTGDLPQFSVGSNDEN 1581
             S   S EDP+MML+ +S S+ +K       K  +   D     +       + G    N
Sbjct: 391  TSSSYSCEDPLMMLQKISKSVVEKQKV----KSENDFPDYEVTSIGSQNYSVTSGKGLLN 446

Query: 1580 QELKD----KMVPIDNNSCAPLGNMTSVETL-----------TWSSKVSSS---VGKLPE 1455
             EL +    K V I  +       M S+E               SS+ +SS   V  + E
Sbjct: 447  PELDEVDSSKPVYIHRHPSLQEKGMESMEKNPIQQTREEQIPAESSQFNSSTTQVRNVHE 506

Query: 1454 ENISLIGKGKEQVDRDTELLEKNFETSHIVXXXXXXXXXXXDTKQRTKTSCNSSNKEPCV 1275
            + I+L  + +E  D   + +E   ET  IV             K    +  +   ++ CV
Sbjct: 507  KIITLEAEKQEPADELKKAMEACDETKIIVKYSNDKVE-----KHGKGSQSSVPVRDLCV 561

Query: 1274 HDVHVI-DQSNSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQRVTAIRDTPTPSNMDV 1098
            +DVHVI D+  + N+ SG     + K+ T +I                  D+P     D 
Sbjct: 562  YDVHVIGDELTTCNEDSGNNHGDLSKNLTLDI--------------PTRCDSPVIDRSDT 607

Query: 1097 GVGINRSTSDTTSRLPPIVPKGKSVVSDLRRNSLSAVDNERLKLDIEIGRLRERLKIVQE 918
             V + RS SD  SR      +GK++++DLRRNS+SA D ERLK+D E+G LRERLKIVQE
Sbjct: 608  EVDMKRSISDLVSRQRSF-SQGKTLLTDLRRNSMSAFDYERLKIDNEVGCLRERLKIVQE 666

Query: 917  GREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQASLPLPSSKVITKKRR 738
            GREKLN S  HK RE +QL+LLEDIA QL+E+RQL EPGKA RQASLP  S+K ++KKRR
Sbjct: 667  GREKLNFSKGHKGREKIQLQLLEDIASQLREIRQLTEPGKAARQASLPPLSTKAVSKKRR 726

Query: 737  CRSVS 723
             RS+S
Sbjct: 727  SRSIS 731


>gb|EXC16951.1| Lysine-specific demethylase 3B [Morus notabilis]
          Length = 2152

 Score =  387 bits (994), Expect = e-104
 Identities = 289/770 (37%), Positives = 409/770 (53%), Gaps = 21/770 (2%)
 Frame = -2

Query: 3002 EMACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQ 2823
            ++  QAM +W+ S LV AF+DLSI+Y LLCASA AF AS+FL +FGL LPCPCDGLF+  
Sbjct: 2    KLTSQAMNSWTFSELVAAFLDLSIAYCLLCASAFAFFASRFLALFGLCLPCPCDGLFW-N 60

Query: 2822 PSRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEF---QDQNCNMNFRLVGEKESDVIEL 2652
            P  N   ++  +D P EK+S+V  S++SKFPF+      +QN N + +L  E E +  E 
Sbjct: 61   PRNNS--NRQLVDCPYEKISSVYFSVKSKFPFDSVLGGDEQNGNSHLKL--ENEGNHGEN 116

Query: 2651 EGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSGL 2472
                + SS S   +  + V    +     E G +   G KE  +  +G+ ++ Q+ R GL
Sbjct: 117  GECGTSSSSSPGTRFHDLVETDVKRKTGAEFGAVNFEGSKEEEYGAEGQRVVEQRPRHGL 176

Query: 2471 RHRRKG--SFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSH 2298
            R RRKG  S N+GK S  +SYD   S A+  P+SPPSISK GN ++ D            
Sbjct: 177  RRRRKGGGSVNYGKASFVSSYDTLQSDARNIPQSPPSISKMGNEVSEDP----------- 225

Query: 2297 SGNYHEDATTAMRFGRRHSNSI----NESPDEDETIAKNVSFLREL-KGSVETGQSSSDD 2133
              +Y +D      FG     S+    N S D+ +   +    + EL +G+    +   ++
Sbjct: 226  -NDYGDDREAITAFGSLEGVSLGLASNNSNDDCKHAEREAVSIEELGRGTQGDFELDKNE 284

Query: 2132 KYTIXXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQY 1953
            K  I                    L+KER +         AMI RLQ+EKA+IEME++QY
Sbjct: 285  KNMIRLLEQALEEEHAACSALYLELEKERSAAASAADEAMAMILRLQKEKASIEMEAKQY 344

Query: 1952 KRLIEEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQAT--ENIAGM 1779
            +R+IE K+AYDAEEMNILKEI++RREREKHFLEKEVE  RQM+ +GN Q  +  +++A +
Sbjct: 345  QRMIEAKAAYDAEEMNILKEILLRREREKHFLEKEVEACRQMI-VGNAQSDSDAQDVADL 403

Query: 1778 LDSQAFDSLLNSNEDPVMMLRHLSASIDKKNGAIG-------ASKDSSVQGDENAARLTG 1620
               +  +S + S EDP+++L+ L+ SI+K    +         ++D     D      + 
Sbjct: 404  --QELTNSAIYSGEDPILLLQQLNESIEKPKLKVANKLAIPELAEDDDALVDMQEHPSSD 461

Query: 1619 DLPQFSVGSNDENQELKDKMVPIDNNSCAPLGNMTSVETLTWSSKVSSSVGKLPEENISL 1440
              P F  G N+ N E K+K V  +            + T      + +SV    E+    
Sbjct: 462  GHPHFLSG-NEFNHECKEKGVVKEVPRWEEDFKYYELSTPKGLDMLETSVTPFVED---- 516

Query: 1439 IGKGKEQVDRDTELLEKNFETSHIVXXXXXXXXXXXDTKQRTKTSCN-SSNKEPCVHDVH 1263
                 +Q+   +  ++   + S I               + +K   N   + E  V+DVH
Sbjct: 517  ----PDQIGSTSLCVDFASKASDI-------------PYRESKDPPNLVFDGESHVYDVH 559

Query: 1262 VIDQSNSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQRVTAIRDTPTPSNMDVGVGIN 1083
            VID                 KS T     +K    SV       ++  +PS +     +N
Sbjct: 560  VIDH----------------KSKTLNAVSVKNERQSVSAPSNVPKNCGSPS-LSTHYEMN 602

Query: 1082 RSTSDTTSRLPPI-VPKGKSVVSDLRRNSLSAVDNERLKLDIEIGRLRERLKIVQEGREK 906
            RS+SD TS LPP    + K+V++D RRNS+SAVD E+LK++ E+  LRERL+IVQ+GREK
Sbjct: 603  RSSSDITSGLPPTGFSQEKAVLTDCRRNSMSAVDYEKLKIENEVEWLRERLRIVQKGREK 662

Query: 905  LNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQASLPLPSSKV 756
            LN SV H+ERE LQL+LLEDIA QL+E+RQL EPGKA RQASLP   SKV
Sbjct: 663  LNFSVGHREREKLQLQLLEDIASQLREIRQLNEPGKAERQASLPPTYSKV 712


>ref|XP_006493819.1| PREDICTED: intracellular protein transport protein USO1-like isoform
            X2 [Citrus sinensis]
          Length = 723

 Score =  386 bits (992), Expect = e-104
 Identities = 293/773 (37%), Positives = 400/773 (51%), Gaps = 26/773 (3%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            M CQA+  W+ S LVGAF++L+I+Y LLC SA+A+ ASKFLG+FGL LPCPC+G  F +P
Sbjct: 5    MMCQAIDVWTFSELVGAFLNLAIAYFLLCGSALAYFASKFLGLFGLSLPCPCNG-HFGKP 63

Query: 2819 SR---NFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEFQDQNCNMNFRLVGEKESDV--IE 2655
            ++     C+    +D P EK+SN+Q   ++KFPF+     N N       E+E D   + 
Sbjct: 64   NKISYGNCWQGFLVDCPTEKISNIQFLAKTKFPFDSILASNMNPQSN---EREFDKGHVA 120

Query: 2654 LEGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSG 2475
             EGE SC+S S                 R  +GR   +  KEGRFD KGK +  Q+ R G
Sbjct: 121  SEGETSCASSS-----------------REIIGRDSSM--KEGRFDFKGKVVKSQRPRYG 161

Query: 2474 LRHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHS 2295
            +R  RK +F++ K  S +S+D   S  Q    SP S S  G  ++    + +  G+ +  
Sbjct: 162  IRRHRKSAFHNEKSVSFSSFDQLVSDWQSVLPSPSSFSNIGTEISEGSSVPVHRGSETI- 220

Query: 2294 GNYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSV--ETGQSSSDDKYTI 2121
                ED+  A +     ++  NE  D++ T+ K+ S +  L   +  E G   +D   TI
Sbjct: 221  ----EDSRGASKEDVTMTSESNEPVDKNNTVEKDASSVEVLNCDLPGELGLDGNDIS-TI 275

Query: 2120 XXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRLI 1941
                                L+KER +         AMI RLQEEKA+IEME+RQY+R+I
Sbjct: 276  RSLEEALEEEHAARSALYLELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQRMI 335

Query: 1940 EEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGMLDSQAF 1761
            EEKSAYDAEEMNILKEI+IRREREK+FLE+EVE  RQ+        A  +       Q  
Sbjct: 336  EEKSAYDAEEMNILKEIIIRREREKYFLEREVETLRQLFF-----DAETDDTATTQEQTA 390

Query: 1760 DSLLNSNEDPVMMLRHLSASIDKKNGAIGASKDSSVQGDENAARLTGDLPQFSVGSNDEN 1581
             S   S EDP+MML+ +S S+ +K       K  +   D     +       + G    N
Sbjct: 391  TSSSYSCEDPLMMLQKISKSVVEKQKV----KSENDFPDYEVTSIGSQNYSVTSGKGLLN 446

Query: 1580 QELKD----KMVPIDNNSCAPLGNMTSVETL-----------TWSSKVSSS---VGKLPE 1455
             EL +    K V I  +       M S+E               SS+ +SS   V  + E
Sbjct: 447  PELDEVDSSKPVYIHRHPSLQEKGMESMEKNPIQQTREEQIPAESSQFNSSTTQVRNVHE 506

Query: 1454 ENISLIGKGKEQVDRDTELLEKNFETSHIVXXXXXXXXXXXDTKQRTKTSCNSSNKEPCV 1275
            + I+L  + +E  D   + +E   ET  IV             K    +  +   ++ CV
Sbjct: 507  KIITLEAEKQEPADELKKAMEACDETKIIVKYSNDKVE-----KHGKGSQSSVPVRDLCV 561

Query: 1274 HDVHVI-DQSNSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQRVTAIRDTPTPSNMDV 1098
            +DVHVI D+  + N+ SG     + K+ T +I                  D+P     D 
Sbjct: 562  YDVHVIGDELTTCNEDSGNNHGDLSKNLTLDI--------------PTRCDSPVIDRSDT 607

Query: 1097 GVGINRSTSDTTSRLPPIVPKGKSVVSDLRRNSLSAVDNERLKLDIEIGRLRERLKIVQE 918
             V + RS SD  SR      +GK++++DLRRNS+SA D ERLK+D E+G LRERLKIVQE
Sbjct: 608  EVDMKRSISDLVSRQRSF-SQGKTLLTDLRRNSMSAFDYERLKIDNEVGCLRERLKIVQE 666

Query: 917  GREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQASLPLPSSK 759
            GREKLN S  HK RE +QL+LLEDIA QL+E+RQL EPGKA RQASLP  S+K
Sbjct: 667  GREKLNFSKGHKGREKIQLQLLEDIASQLREIRQLTEPGKAARQASLPPLSTK 719


>ref|XP_002323777.2| hypothetical protein POPTR_0017s08220g [Populus trichocarpa]
            gi|550319756|gb|EEF03910.2| hypothetical protein
            POPTR_0017s08220g [Populus trichocarpa]
          Length = 781

 Score =  380 bits (975), Expect = e-102
 Identities = 298/815 (36%), Positives = 419/815 (51%), Gaps = 54/815 (6%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            M CQ + +W+   LVGA++DL+I+Y LLCAS  AF A KFLG+FGL LPCPC+GLF    
Sbjct: 1    MPCQEIKSWAFDELVGAYLDLAIAYFLLCASTFAFFAEKFLGLFGLCLPCPCNGLFG-DH 59

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEFQDQNCNMNFRLVGEKE----SDVIEL 2652
            +RN C+  +  D P+E +S+VQ S++S+FPF+   D++ N    +    E    SD   L
Sbjct: 60   NRNKCWRSVLADRPSENISSVQFSVKSRFPFDSMWDKHLNFESSVGTINEVNCGSDNAGL 119

Query: 2651 EGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSGL 2472
            EGEA C S+ +        R  G+  ER+ V    +   KEG+FD+K +    QK R  L
Sbjct: 120  EGEAWCGSLRE--------RKSGKGVERSVVNVRDV---KEGKFDVKERGFSIQKGRY-L 167

Query: 2471 RHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHSG 2292
            R RRK + + G +SS +SYD   S ++  P+SP S++K  N     D++   +G  +   
Sbjct: 168  RRRRKVAADKGLFSSVSSYDHSQSNSRTHPQSPASVNKLMNKHHEGDMVPASSGADALHF 227

Query: 2291 NYHEDATTAMRFGRRHSNSI--NESPDEDETIAKNVSFLRELKGSVETGQSSSD--DKYT 2124
               ++++    F    SN    NE   E++ + K      +LK   + G+   D  +K+ 
Sbjct: 228  EDSKESSVDTGFVGTVSNDFESNEPLGENKPMEKAAPLGDDLKCKAQ-GEPCFDGEEKHG 286

Query: 2123 IXXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRL 1944
            I                    L+KER +         AMI RLQE+KA IEME+RQY R+
Sbjct: 287  IRVLEQASEEEHAAFSALYLELEKERSAAASAADEAMAMILRLQEDKALIEMEARQYHRM 346

Query: 1943 IEEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGMLDSQA 1764
            IEEKSAYD EEMNILKEI++RREREKHFLEKEVE YRQ++  GNE+  ++        + 
Sbjct: 347  IEEKSAYDLEEMNILKEILLRREREKHFLEKEVETYRQVI-FGNEEWESDVQDIGTTHEQ 405

Query: 1763 FDSLLNSNEDPVMMLRHLSASIDKKNGAIGASK--DSSVQGDENAARLTGDLPQFSVGSN 1590
              S   S EDP ++L+ +S SID+K     ++K   S VQ  E+ +       +  +   
Sbjct: 406  MASSQYSREDPFLVLQRISESIDEKEKGEESNKFLRSKVQSIESQSCALAFGKELPIPEL 465

Query: 1589 DENQELK----------DKM---VPIDNNSCAPLGNMTSVETLTWSSKVSSSVGKLPEEN 1449
            DE + LK          DK+   + +DN+     G     E     S  +S   +L E  
Sbjct: 466  DEVESLKGRCIHRHPGIDKLRRHLSMDND-----GTQEEFEEKELLSPDNSLFDQLREPQ 520

Query: 1448 ISLIGKGKEQVDRDTELLEKNFET----------SHIVXXXXXXXXXXXDTKQRTKT--- 1308
            I       +   R   L EK   T          S  +           +T  +TK    
Sbjct: 521  IMESCSQFDLSTRGCNLNEKTISTSVEAQQQSDQSDCINAGHGLASKTTETCDQTKIIFP 580

Query: 1307 -SCNSSNKE------------PCVHDVHVIDQSNSYNQISGGKEARMPKSSTPEIHMIKE 1167
             +C+ S K               VHDVHVID     N   G  E    K S         
Sbjct: 581  YNCDDSEKHARDSSDAEFDLGSLVHDVHVIDDKT--NLSCGINENGSEKLS--------- 629

Query: 1166 AETSVLQRVTAIRDTP----TPSNMDVGVGINRSTSDTTSRLPPI-VPKGKSVVSDLRRN 1002
                    V+A  D P    +P        + +S SD T+ LPP+   KGK + SDLRRN
Sbjct: 630  --------VSAASDIPRTCGSPRISWAEQDVRKSCSDMTNGLPPLGSSKGKFLTSDLRRN 681

Query: 1001 SLSAVDNERLKLDIEIGRLRERLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQEL 822
            S+SAVD ER K+D E+G LRERL+I+Q GRE+LN+S+E++E++ +QL+LLE+   QL+E+
Sbjct: 682  SMSAVDYERFKIDSEVGWLRERLRIIQVGREQLNISMENREKKKVQLQLLENTVSQLREI 741

Query: 821  RQLKEPGKAVRQASLPLPSSKVITKKRRCRSVSSG 717
            +Q  E G+AVRQASLP  S KV++KKR+ RS S G
Sbjct: 742  QQSTEHGQAVRQASLPPLSYKVMSKKRQWRSASLG 776


>gb|EOY05294.1| Uncharacterized protein TCM_020328 [Theobroma cacao]
          Length = 758

 Score =  371 bits (952), Expect = 1e-99
 Identities = 290/809 (35%), Positives = 404/809 (49%), Gaps = 46/809 (5%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            MAC  + +W+ +GLVGAF+DLSI+YLLLC S +++ ASKFLG+FGL LPCPC GLF    
Sbjct: 1    MACNVINSWTFNGLVGAFLDLSIAYLLLCGSTLSYLASKFLGLFGLSLPCPCSGLFGSTD 60

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFN---------EFQDQNCNMNFRLVGEKES 2667
              N C   + ++ P+ K+S+VQ S++ K PF+         E +D+  + +   V + ++
Sbjct: 61   KSN-CLQAILVNKPSLKISSVQSSVKKKLPFDSIWNNFYDDEDEDEEQHDSQSNVDKWQN 119

Query: 2666 DVIELEGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDL--KGKEIMH 2493
              +E+EGEAS  S ++ +   NFV                  G K+G F    K K    
Sbjct: 120  RNVEMEGEASSCSWNEKK---NFV------------------GVKKGSFTPFPKWKGFGS 158

Query: 2492 QKSRSGLRHRRKG-SFNHGKYSSRASYDPRFSKAQGDP-ESPPSISKGGNGLAADDIMVL 2319
            Q+ R GLR R++  S   GK  S  SYD   S        S  SI K GN +        
Sbjct: 159  QRPRVGLRRRKRAASGRRGKVLS-FSYDSLVSMTTPTGLNSSASIGKFGNDITE------ 211

Query: 2318 DNGNGSHSGNYHEDATTAMRFGRRHSNS----INESPDEDETIAKNVSFLRELKGSVETG 2151
                G+ S N  +   T+         S    +++ P  + T+ +    L E K      
Sbjct: 212  ---GGTTSANSEDGWETSKEIEMPEQGSQGFEMDDDPFAENTLIEKEVALAEFKCLPPDQ 268

Query: 2150 QSSSDDKYTIXXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIE 1971
                 D+  I                    L+KER +         AMI RLQEEKA IE
Sbjct: 269  DFDGSDRNAIRVLEQALEEEHAARTALYLELEKERSAAATAADEAMAMILRLQEEKATIE 328

Query: 1970 MESRQYKRLIEEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATEN 1791
            ME+RQY+R+IEEKSAYDAEEMNILKEI++RREREKHFLEKEVE Y+QM    NEQ   E 
Sbjct: 329  MEARQYQRMIEEKSAYDAEEMNILKEILLRREREKHFLEKEVESYKQMF-FENEQLDAEM 387

Query: 1790 IAGMLDSQAFDSLLNSNEDPVMMLRHLSASIDKK-----NGAIGASKDSSVQG------- 1647
                   +   S + SNE+PV+ L+  + S+ +K     NG     + +S++        
Sbjct: 388  YDTAATQEQKSSSIYSNEEPVLKLQQNTESVGEKEKTKINGDFSEYEITSIRSLNHTLAF 447

Query: 1646 ---------DENAARLTGDL----PQFSVGSNDENQELKDKMVPIDNNSCAPLGNMTSVE 1506
                     +E+A  L   +       S   ++ NQE ++K + + N S      +   E
Sbjct: 448  GKEIPIPELNEDAGSLNSSVEINRAHLSRIHDEVNQEFQNKGMALKNKS------LNHQE 501

Query: 1505 TLTWSSKVSSSVGKLPEENISLIGKGKEQVDRDT---ELLEKNFETSHIVXXXXXXXXXX 1335
                SS+ S+    L E+ I+ + + +EQ    +    L+ K  E               
Sbjct: 502  RHVQSSQ-STEGPDLHEKAINPMVEEEEQCGETSPHQRLMPKTTEALE-EAKIIFPYNNE 559

Query: 1334 XDTKQRTKTSCNSSNKEPCVHDVHVIDQSNSYNQISGGKEARMPKSSTPEIHMIKEAETS 1155
               K       + S  +  VHDVHVI    + N +  G E+             ++   S
Sbjct: 560  KVEKHGEDLHGSYSGIDHHVHDVHVIYDECNVNNVENGNES-------------EKKSIS 606

Query: 1154 VLQRVTAIRDTPTPSNMDVGVGINRSTSDTTSRLPPIVP-KGKSVVSDLRRNSLSAVDNE 978
            V   +    D PT   + +     R++ D + RLPPI P +GK +   LRRNS+SA D E
Sbjct: 607  VTSNLPGTCDNPTIGGLVIEPDRKRNSLDRSGRLPPIGPSRGKHLPPILRRNSMSAFDYE 666

Query: 977  RLKLDIEIGRLRERLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGK 798
            RLK+D E+G LRERLKIVQ+GR+KLN  V H+ERE  QL++LE+IA QL+E+RQL EPGK
Sbjct: 667  RLKIDNEVGWLRERLKIVQQGRDKLNFPVGHREREQAQLQILENIASQLREIRQLTEPGK 726

Query: 797  AVRQASLPLPSSKVITKKRRCRSVSSGQL 711
            A+RQASLP PSSKV++KKRR R    G L
Sbjct: 727  ALRQASLPPPSSKVMSKKRRWRGAPLGVL 755


>ref|XP_006840152.1| hypothetical protein AMTR_s00089p00065300 [Amborella trichopoda]
            gi|548841851|gb|ERN01827.1| hypothetical protein
            AMTR_s00089p00065300 [Amborella trichopoda]
          Length = 903

 Score =  326 bits (836), Expect = 4e-86
 Identities = 255/794 (32%), Positives = 390/794 (49%), Gaps = 32/794 (4%)
 Frame = -2

Query: 3002 EMACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLF--- 2832
            +MAC  +++W+   L+GAF+DLSI++ +LC SA+AF  +KF+G+FGLY PC C+GLF   
Sbjct: 116  KMACTVIHSWTFCSLIGAFLDLSIAFFMLCGSAMAFFTAKFMGIFGLYFPCTCNGLFGDP 175

Query: 2831 FIQPSRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEF--QDQNCNMNFRLVGEKESDVI 2658
                + +FC  ++ +++P  K S++Q+S+++KFP++    +D     +   +G      +
Sbjct: 176  MNSGNGHFCIQRVLVEFPPRKSSSLQMSLQNKFPYDTIWLRDIGREHDNLKIGNTSDGAL 235

Query: 2657 ELEGEASCSSVSDSRKPSNF-VRIKGELNERNEVGRMKLLGGK-EGRFDL--KGKEIMHQ 2490
             L  +    S S + +       I  E   ++E   ++      +G   L  KGK + + 
Sbjct: 236  GLNHKTDDESSSSASEAQTLRSSIVDETRAKSEWDMVEFRASPCQGSVSLSSKGKGVWNP 295

Query: 2489 KSRSGLRHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNG 2310
            +SRS L HRR+ +    + S + S    F   +G   SP + S+       +      + 
Sbjct: 296  RSRSSL-HRRRRASGDARSSIKLSLSRSFG--EGGDHSPLNSSELSRENPVESFHPSSHF 352

Query: 2309 NGSHSGNYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLREL-KGSVETGQSSSDD 2133
              + S    E +      G  H  S     +++ TI ++ SF+  L +GS+       ++
Sbjct: 353  KMNQSQYSGEGSDGNAMVGDMHGVSKELFSEDNNTIEEDTSFVEALVEGSLGEQGLMGNE 412

Query: 2132 KYTIXXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQY 1953
              TI                    L+KER +         AMI RLQEEKA IEME+RQY
Sbjct: 413  ADTIRILGKALEEERTSRTALYNELEKERSAAATAADEAMAMILRLQEEKAVIEMEARQY 472

Query: 1952 KRLIEEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGMLD 1773
            +R+IEEK+ YD EE ++LKEI++RREREK  LEKEVEVYRQML  G  + A E +    +
Sbjct: 473  QRMIEEKATYDEEERSVLKEILVRREREKLVLEKEVEVYRQMLSSGTNELAVEFLGDEAN 532

Query: 1772 SQAFDSLL--NSNEDPVMMLRHLSASIDKK------NGAIGASKDSSVQGDENAARLTGD 1617
                  L   +S +DP+++L+ +  S+ +K      + A   +  + VQ  E+ A     
Sbjct: 533  ELVGRRLFSHDSIDDPILILQQIGDSLSRKEKIGDNHPAEENTSQNGVQRSESIANEDPC 592

Query: 1616 LPQFSVGSNDENQELKDKMVPIDNNSCAPLGNMTSVETLTWSSKVSSSVGKLPEENISL- 1440
             P+F+ G + E  E    MV ID +        +  E      K+       P EN  L 
Sbjct: 593  PPEFTDGFSLEVHE--KSMVSIDCSQYVFPSQHSQSEEKFVLYKLDR-----PHENCFLE 645

Query: 1439 -----IGKGKEQVDRDTELLEKNFETSHIVXXXXXXXXXXXDTKQRTKTSCNSSN----K 1287
                 I K  E+ + +T    ++ +   I+             ++      +S N     
Sbjct: 646  NACFPINKDTEKNNGNTRACCEDTDEGGIMKHNISCNSLDNCREEIVGQGEDSQNPQIDN 705

Query: 1286 EPCVHDVHVIDQSNSYNQISGGKEARMPKSSTPEIHMIKEAE---TSVLQRVTAIRDTPT 1116
            E  +HDVHVI     +     G +        P + + KE     TS   +        +
Sbjct: 706  ESKIHDVHVITNEVLFCDEGKGGDGE-SHLLNPIVDISKELPLVCTSGFSKNDGSSGHSS 764

Query: 1115 PSNMDVGVGINRSTSDTTSR-LPPIVPKGKSVVSDLRRNSLSAVDNERLKLDIEIGRLRE 939
             S  + G  ++RS+ D+    LP     G S    LRRNS+S VDNE+LK++ E+G L E
Sbjct: 765  KSQWEPGFNVSRSSLDSIRELLPSNGACGNSSFPKLRRNSMSVVDNEKLKIENEVGWLTE 824

Query: 938  RLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQASLPLPSSK 759
            +LK +  GREKLN+S+EH+ERE  QL+LLE+IAHQL+ +R L EP    RQ+SLP  SSK
Sbjct: 825  KLKAIDAGREKLNISLEHREREKFQLRLLEEIAHQLRAIRHLTEPKIGGRQSSLPPLSSK 884

Query: 758  VITKKRRCRSVSSG 717
               KKRRCRSVS G
Sbjct: 885  DNLKKRRCRSVSWG 898


>ref|XP_004134377.1| PREDICTED: uncharacterized protein LOC101204513 [Cucumis sativus]
            gi|449487622|ref|XP_004157718.1| PREDICTED:
            uncharacterized LOC101204513 [Cucumis sativus]
          Length = 724

 Score =  323 bits (829), Expect = 2e-85
 Identities = 267/783 (34%), Positives = 384/783 (49%), Gaps = 24/783 (3%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            MAC+A+  W+ +GLV AF+DL I++LLL AS++ F  SKFL +FGL LPCPCDGLF    
Sbjct: 1    MACEAIKLWTFNGLVAAFLDLGIAFLLLSASSLVFFTSKFLALFGLCLPCPCDGLFG-NL 59

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEFQD-QNCNMNFRLVGEK--ESDVIELE 2649
            S + CF KL +D  + K+S+V  S R KFP +   D   C     LV E+  + D +ELE
Sbjct: 60   SSDHCFQKLLVDRSSRKISSVVHSTREKFPLDSLLDGPKCCSKSMLVHERNVKGDRVELE 119

Query: 2648 GEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSGLR 2469
            GEAS SS    R P   V       +   V  +    G + R  +     +  ++   L 
Sbjct: 120  GEASGSSSFKIRSPQAMV-----YGDYPSVNELHCGDGGDRRKVISASSYVISQADVELE 174

Query: 2468 HRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHSGN 2289
               +   +   + +  + D  F       E   S           D+ + D+ +      
Sbjct: 175  DLSRSPSSFSGFGNDNTEDDGFFSVDSGDEREDSSDNSDQYKVFPDLELDDSCDEKICAE 234

Query: 2288 YHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSVETGQSSSDDKYTIXXXX 2109
              E   +    G      +    +E +TI       ++L+ ++E  QS     Y      
Sbjct: 235  MCE--ASVAEAGNSCRRELRLDGNESDTI-------KQLEQALEEEQSVRAALYL----- 280

Query: 2108 XXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRLIEEKS 1929
                            L+KER +         AMI RLQEEKA+IEM++RQY+R+IEEK+
Sbjct: 281  ---------------ELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKT 325

Query: 1928 AYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGMLDSQAFD--- 1758
            AYDAEEM+ILKEI++RRERE HFLEKE+E  R       E         MLDS+      
Sbjct: 326  AYDAEEMSILKEILVRREREMHFLEKEIEALRTSFF---EYDGVG--VDMLDSEVTPPRA 380

Query: 1757 -SLLNSNEDPVMMLRHLSASIDKKNGAIGASK--------DSSVQGDE--NAARLTGDL- 1614
             S     EDP + + +   S+  +  ++G+ K          S+  DE  +AA+  G L 
Sbjct: 381  PSFTYPTEDPCINIFNKKHSLQHEIPSVGSQKLTFEFGEESPSIGADETADAAKARGMLL 440

Query: 1613 ---PQFSVGSNDENQELKDKMVPIDNNSCAPLGNMTSVETLTWSSKVSSSVGKLPEENIS 1443
               P    GS + + EL+ K +  D N     G +T +E    S++ S+++GK+ E+   
Sbjct: 441  LQVPDIYKGSEEIDYELQGKDMVEDENLYVVPGKVTELEPYLQSNE-SNALGKV-EKCTE 498

Query: 1442 LIGKGKE--QVDRDTELLEKNFETSHIVXXXXXXXXXXXDTKQRTKTSCNSSNKEPCVHD 1269
            LI   +E  +V  D     K     H                QRT+   + +N +P +HD
Sbjct: 499  LIADEQEVHEVSYDGLAFAKTTLPCH----------EKNGDHQRTRDLYSVNNTDPHLHD 548

Query: 1268 VHVIDQSNSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQRVTAIRDTPTPSNMDVGVG 1089
            +HV++     ++     EA    S  P ++            +    D+P+ S +   + 
Sbjct: 549  IHVVE-----DEAKTSNEAVDNASEEPLVNGTSN--------IPGKCDSPSFSLLQNELD 595

Query: 1088 INRSTSDTTSRLPPIV-PKGKSVVSDLRRNSLSAVDNERLKLDIEIGRLRERLKIVQEGR 912
              RS+SD + R PPI   +  S+ S LRRNS+SAVD ER K+  E+  LR RLKIVQEGR
Sbjct: 596  FTRSSSDASGRFPPIARSRSHSMRSQLRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEGR 655

Query: 911  EKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQASLPLPSSKVITKKRRCR 732
            EKL  SVEHKE+E+ Q +LLE+I +Q +E+RQL +PGKA  QA LP PSSK ++KKR  R
Sbjct: 656  EKLKFSVEHKEKESNQFQLLENITNQHREIRQLTDPGKASLQAPLP-PSSKDVSKKRCWR 714

Query: 731  SVS 723
            S S
Sbjct: 715  SSS 717


>ref|XP_004300993.1| PREDICTED: uncharacterized protein LOC101300919 [Fragaria vesca
            subsp. vesca]
          Length = 869

 Score =  289 bits (740), Expect = 5e-75
 Identities = 243/727 (33%), Positives = 361/727 (49%), Gaps = 43/727 (5%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            MACQ +++W+LSGLVGAF+DLSI+Y+L CASA+AF  SKFL +FGL LPCPCDGLF   P
Sbjct: 1    MACQMIHSWTLSGLVGAFLDLSIAYMLFCASALAFFTSKFLDLFGLSLPCPCDGLFG-NP 59

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEFQDQNCNMNFRLVGEKESDVIELEGEA 2640
              N+CF K  +  P+EK+  VQL ++SKFPF+     + +++ +   + ES   E EG+A
Sbjct: 60   KNNYCFQKQLVQGPSEKIGRVQLQLKSKFPFDVMWSGDPHLHAKCKSDHESGHFEFEGDA 119

Query: 2639 SCSSVSDSRKPSNFVRIKGELNERN-EVGRMKLLGGKEGRFDLKGKEIMHQKSRSGLRHR 2463
            SCSS SD +  S   R     NE++ E G +KL    E R D KGK++  ++   GLR  
Sbjct: 120  SCSSFSDGKGLSGVGRGSVSGNEQSCESGAVKL----EERLDHKGKKVGGRRPSHGLRRC 175

Query: 2462 RKG-SFNHGKYSSRASYDPRFSKAQGDPESP----------------------------- 2373
            R G S + GK  S +S D   S +Q D  +P                             
Sbjct: 176  RTGASVDFGKLFSVSSCDVVQSDSQ-DMVTPINSRRPFHGMQRRRKGCSVDYAKVFSVSP 234

Query: 2372 -PSISKGGNGLAADDIMVLDNGNGSHSGNYHEDATTAMRFGRRHSNSINESPDEDETIAK 2196
                  G   +A     + + GN         D   A +  R     +NE   + ++I  
Sbjct: 235  YDMFQSGARDIANPPSSLSNVGNQGAEVPSSSDGMEAPKPERVSRLELNEHVGKTKSIEN 294

Query: 2195 NVSFLRELKGSV-ETGQSSSDDKYTIXXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXX 2019
            + S +     +  E     S+DK  +                    L+KER +       
Sbjct: 295  DASSVENFGANEHEKPAYDSNDKTMVRVLEQALDEEHTARTALYYELEKERSAAATAADE 354

Query: 2018 XXAMIQRLQEEKAAIEMESRQYKRLIEEKSAYDAEEMNILKEIVIRREREKHFLEKEVEV 1839
              AMI RLQEEKA+IEME+RQY+R+I+EKS YDAEEMNILKEI++RREREKHFLEKEVE 
Sbjct: 355  AMAMILRLQEEKASIEMEARQYQRMIQEKSIYDAEEMNILKEILLRREREKHFLEKEVEY 414

Query: 1838 YRQMLRLGNEQ--QATENIAGMLDSQAFDSLLNSNEDPVMMLRHLSASIDKKNGAIGASK 1665
            YR++L  GN+Q      N+A    +Q+   L   +++ ++ML   S SI +K+    A+ 
Sbjct: 415  YRKIL-FGNDQVDDDMHNVAA-TGAQSISRL--PSKELLLMLERTSDSISEKSKLKMANS 470

Query: 1664 DS-----SVQGDENAARLTGDLPQFSVGSNDENQELKDKMVPIDNNSCAPLGNMTSVETL 1500
             S     S         L  +LP   +   D ++++   ++   ++    L NM  +   
Sbjct: 471  TSDYGAPSSDSQNRTLDLGTELPVPDLEEGDSSKQIDMHLLQSVDSHSHILNNMYEINHK 530

Query: 1499 TWSSKVSSSVGKLPEENISLIGKGKEQVDRDTELLEKNFETSHIVXXXXXXXXXXXDTKQ 1320
                 + S    + E+ +S   +GKE    + E LE   +T+              +  Q
Sbjct: 531  VQEKGMVS----VDEKRVS---QGKEVQVLNAEGLELVEKTN--PPNEHELDVAGSNVDQ 581

Query: 1319 RTKTSCNSS-NKEPCVHDVHVI-DQSNSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQ 1146
             +K   +S+ N EP VHDVHVI ++S+ YN+          KS+     ++  A   + +
Sbjct: 582  GSKDPLSSTPNTEPHVHDVHVIVEKSDFYNE----------KSAEKSEQLLANATLDISE 631

Query: 1145 RVTAIRDTPTPSNMDVGVGINRSTSDTTSRLPPI-VPKGKSVVSDLRRNSLSAVDNERLK 969
             + +I  + T   +  GVG     SDT S L  +   +GK++  D+RRNS+S+VD E+ K
Sbjct: 632  TLKSITGSGTEHKVH-GVG-----SDTGSELSIVDGSQGKALPYDMRRNSMSSVDYEKSK 685

Query: 968  LDIEIGR 948
            +  ++ R
Sbjct: 686  IQNKLHR 692



 Score =  133 bits (334), Expect = 6e-28
 Identities = 71/101 (70%), Positives = 83/101 (82%)
 Frame = -2

Query: 1019 SDLRRNSLSAVDNERLKLDIEIGRLRERLKIVQEGREKLNLSVEHKERENLQLKLLEDIA 840
            S +RRNS+SAVD ER KLD E+  LRERL+IVQEGREKLN SV H+ERE +Q KLLEDIA
Sbjct: 764  SVMRRNSMSAVDYERCKLDNEVEWLRERLRIVQEGREKLNFSVGHREREKVQRKLLEDIA 823

Query: 839  HQLQELRQLKEPGKAVRQASLPLPSSKVITKKRRCRSVSSG 717
             QLQE+RQL EPGK   QA+LP PSSKV++KKRR R++S G
Sbjct: 824  SQLQEIRQLTEPGKVKCQAALPPPSSKVMSKKRRWRTLSLG 864


>gb|EOX93928.1| Uncharacterized protein TCM_002927 [Theobroma cacao]
          Length = 649

 Score =  287 bits (734), Expect = 3e-74
 Identities = 234/767 (30%), Positives = 352/767 (45%), Gaps = 8/767 (1%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGL-YLPCPCDGLFFIQ 2823
            M  + + +W+L GL+ AF+D++++Y LLC S + F A KF  VFGL YLPCPC G F  Q
Sbjct: 1    MVSRTIPSWTLFGLIRAFLDVAVAYFLLCGSTLGFFAWKFYHVFGLYYLPCPCTGFFGYQ 60

Query: 2822 PSRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFN--EFQDQNCNMNFRLVGEKE--SDVIE 2655
             S N C+HKL I++PA K+ +VQ    ++FPFN   F DQ CN+N + + +++  + VIE
Sbjct: 61   NS-NLCWHKLLIEWPARKIYSVQKLALNRFPFNLVWFNDQECNLNAKYIKDRKFGNGVIE 119

Query: 2654 LEGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSG 2475
             +GEA CSS      PS                R++ +  KE  +D KGK+I++QK +SG
Sbjct: 120  SDGEA-CSS-----SPSGL--------------RLRTMVDKESGYDAKGKKIINQKQKSG 159

Query: 2474 LRHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHS 2295
            +R  R+ +F +GK SS       FS A                  ++ +  +   + S  
Sbjct: 160  IRRCRRAAFGYGK-SSPVLLSGNFSSAVAGVSCSSYNGGETRSEISEHLGPVSEIDDSFP 218

Query: 2294 GNYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSVETGQSSSDDKYTIXX 2115
             N +    T    G  H    +   ++  T  K ++     K  +     + D+   I  
Sbjct: 219  DNKNNQTGTDGGDGTWHGFEFSNGEEKVSTSMKKINCNTNGKLGI-----TGDEANRIRM 273

Query: 2114 XXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRLIEE 1935
                              L+KER +         AMI RLQE+KA+IEME+ QY+R+IEE
Sbjct: 274  LEQALEEEKAAYAALYLELEKERAAAATAADEAMAMILRLQEDKASIEMEAMQYQRMIEE 333

Query: 1934 KSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGMLDSQAFDS 1755
            K AYD EEMNILKEI++RRE+E H LEKEVE YRQM  L + QQ  +    +   Q    
Sbjct: 334  KFAYDEEEMNILKEILVRREKENHLLEKEVEAYRQMNILEDLQQEHDLSYNLSKGQQTPL 393

Query: 1754 L-LNSNEDPVMMLRHLSASIDKKNGAIGASKDSSVQGDENAARLTGDLPQFSVGSNDENQ 1578
            + +  +EDP++M+  +  S   +   +G  K SS      A          +V    + +
Sbjct: 394  VSVGLDEDPLLMMNQMGNSGYTRKKEVG--KGSSWPSKNEAPSAGKRSHTVAVNLAGKGK 451

Query: 1577 ELKDKMVPIDNNSCAPLGNMTSVETLTWSSKVSSSVGKLPEENISLIGKGKEQVDRDTEL 1398
               D  +     +     N+ S+E  + S +   S  +L ++                  
Sbjct: 452  AQVDDAIVCQAIATKAAQNVCSIEKTSLSEEGLESNAELGDQ------------------ 493

Query: 1397 LEKNFETSHIVXXXXXXXXXXXDTKQRTKTSCNSSNKEPCVHDVHVIDQSNSYNQISGGK 1218
            L  N   S +                         + EP ++DVHV+D +    +    K
Sbjct: 494  LGSNLHNSTL-------------------------DMEPDIYDVHVVDDTLDIPREENIK 528

Query: 1217 EARMPKSSTPEIHMIKEAETSVLQRVTAIRDTPTPSNMDVGVGINRSTSDTTSRLPPIVP 1038
            E+ +P  S  +                                                 
Sbjct: 529  ESTLPTFSASD------------------------------------------------- 539

Query: 1037 KGKSVVSDLRRNSLSAVDNERLKLDIEIGRLRERLKIVQEGREKLNLSVEHKERENLQLK 858
              K+ + D  R+S  AV NERL++D EI RLR RL++VQ  +EKL  S + +ER + QLK
Sbjct: 540  -HKNSLCDSGRSSFCAVSNERLEIDAEIERLRGRLQVVQGEKEKLTFSADQRERLDTQLK 598

Query: 857  LLEDIAHQLQELRQLKEPGKAVRQASLP--LPSSKVITKKRRCRSVS 723
            L+E++ +QL+E +QLKEP   V+Q+S+P    SSKV + +R CRS S
Sbjct: 599  LIEEMVNQLREFQQLKEP---VQQSSVPPLASSSKVSSNRRCCRSAS 642


>ref|XP_002266466.2| PREDICTED: uncharacterized protein LOC100250255 [Vitis vinifera]
          Length = 588

 Score =  281 bits (720), Expect = 1e-72
 Identities = 189/556 (33%), Positives = 299/556 (53%), Gaps = 19/556 (3%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            MACQ +++W+  GLVGA++DL+I+YLLLC S +AF ASKFL  FGL LPCPC+G FF  P
Sbjct: 1    MACQEIHSWTFGGLVGAYLDLAIAYLLLCGSTLAFFASKFLSFFGLCLPCPCNG-FFGNP 59

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEF--QDQNCNMNFRLVGEKESD--VIEL 2652
            + + C  K  +DYP E++S+VQL ++SKFPF+     + + + N++L+  + SD   + L
Sbjct: 60   NGDNCLQKFLVDYPTERISSVQLCVKSKFPFDSVWANEGSPHPNWKLLKGRNSDDGAVGL 119

Query: 2651 EGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSGL 2472
            EGEASCSS  D  +  +    K  ++     G M     KEG+ D KGK + +Q+ ++G+
Sbjct: 120  EGEASCSSFWDVMRSPDIAG-KDSISRNGSCGVMNTPALKEGKSDTKGKRVSNQRPKTGV 178

Query: 2471 RHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHSG 2292
            R RR+ + +HGK+SS +S+DP    A     SP S+S+       D+++ +    G  + 
Sbjct: 179  RRRRRSAVDHGKFSSVSSFDPPRLDAPSGLRSPSSVSE------TDELVPILIDLGERA- 231

Query: 2291 NYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSVETGQS-SSDDKYTIXX 2115
                           H   +NE  DED+   K+ S   E+K +     S + + + T+  
Sbjct: 232  --------------LHGIKLNEHIDEDKPSEKDASSAEEVKCNARGKLSFNGNTENTVRV 277

Query: 2114 XXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRLIEE 1935
                              L+KER +         AMI R+QEEKA+IEME+RQ++R+IEE
Sbjct: 278  LEQALEEEHAARAALYHELEKERSAAASAADEAMAMILRIQEEKASIEMEARQFQRIIEE 337

Query: 1934 KSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGN--EQQATENIAGMLDSQAF 1761
            KSAYDAEEMN+LKEI++RREREKHFLEKEVE YR + +LG   + ++ ++     D +  
Sbjct: 338  KSAYDAEEMNLLKEILLRREREKHFLEKEVEAYRHLDKLGKTADHESKDHCCSAFDIEPR 397

Query: 1760 DSLLNSNEDPVMMLRHLSASIDKKNGAIGASKDSSVQGDENAARLTGDLPQFSVGSNDEN 1581
               ++  +    +    + S  ++   I A  D  V+   +  +  G +  F + S  E+
Sbjct: 398  VHDVHVIDHESNLCNEANESKIEQTPDIPAKLDPPVEA--SLIQRIGVVCDFPMMSTLES 455

Query: 1580 QELKDKMVPIDNNSCAPLGN------MTSVETLTWSS------KVSSSVGKLPEENISLI 1437
            +   D+      N   PLG       ++ +   + SS      K+ + V +L  E + ++
Sbjct: 456  ETNNDQSFSDITNGLPPLGGSRGKALLSDMRRHSISSVDNERLKIETEVERL-RERLRIV 514

Query: 1436 GKGKEQVDRDTELLEK 1389
             +G+E+++   E  E+
Sbjct: 515  QEGREKLNFSVEHRER 530



 Score =  199 bits (505), Expect = 9e-48
 Identities = 106/197 (53%), Positives = 146/197 (74%), Gaps = 1/197 (0%)
 Frame = -2

Query: 1304 CNSSNKEPCVHDVHVIDQSNSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQRVTAIRD 1125
            C++ + EP VHDVHVID  +  N  +   E+++ ++      +    E S++QR+  + D
Sbjct: 389  CSAFDIEPRVHDVHVIDHES--NLCNEANESKIEQTPDIPAKLDPPVEASLIQRIGVVCD 446

Query: 1124 TPTPSNMDVGVGINRSTSDTTSRLPPIV-PKGKSVVSDLRRNSLSAVDNERLKLDIEIGR 948
             P  S ++     ++S SD T+ LPP+   +GK+++SD+RR+S+S+VDNERLK++ E+ R
Sbjct: 447  FPMMSTLESETNNDQSFSDITNGLPPLGGSRGKALLSDMRRHSISSVDNERLKIETEVER 506

Query: 947  LRERLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQASLPLP 768
            LRERL+IVQEGREKLN SVEH+EREN+QL+LLEDIA QL+E+RQL EPGKAVRQASLP P
Sbjct: 507  LRERLRIVQEGREKLNFSVEHRERENIQLQLLEDIASQLREIRQLTEPGKAVRQASLPPP 566

Query: 767  SSKVITKKRRCRSVSSG 717
            SSKV++KKRR RS S G
Sbjct: 567  SSKVMSKKRRWRSASLG 583


>gb|ESW06083.1| hypothetical protein PHAVU_010G018600g [Phaseolus vulgaris]
          Length = 703

 Score =  267 bits (683), Expect = 2e-68
 Identities = 244/803 (30%), Positives = 364/803 (45%), Gaps = 44/803 (5%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            MA Q  ++W+L GL+GAF+DL ++Y LLCASA AF  S++  +FGL LPCPC G F  + 
Sbjct: 1    MASQETHSWTLGGLIGAFIDLVLAYFLLCASAFAFCVSEWFRIFGLSLPCPCKGSFGYRD 60

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNE--FQDQNCNMNFRLVGEKESD--VIEL 2652
            SR FC HKL +++P+ K+ ++Q+    +FPF+       + + N ++V E+  D  V+EL
Sbjct: 61   SR-FCVHKLLLEWPSRKICSIQVMAIKRFPFDLVWIHGHSFSANDKVVAERIHDHRVVEL 119

Query: 2651 EGEASCSSVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSGL 2472
            E EASCSS S S   S FV                    +E  ++ KGK+ M  K RSG+
Sbjct: 120  EDEASCSSCS-SPHFSPFV-------------------DRENVYNAKGKKAMSTKRRSGI 159

Query: 2471 RHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNG--------LAADDIMVLD 2316
            R RR+GS + GK SS    D      Q D    P +   G G         +   + V+D
Sbjct: 160  RRRRRGSSDPGKVSSAVPLD----NLQSDVVLTPLLPFDGRGKTNATTSPTSGKGVSVVD 215

Query: 2315 NGNGSHSGNYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSVETGQSSSD 2136
              +     +  E    +  F    + S+ +SP +D+ +     ++  +  +V+  ++  D
Sbjct: 216  GEDDQTCHDLDEKTCHSYEF----NGSMVDSPGQDKRLLSLEHYMDNVCDNVQIDRNEED 271

Query: 2135 DKYTIXXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQ 1956
                                     LDKER S         AMI RLQEEKA++EME RQ
Sbjct: 272  ---RFKMLEMALEEEKAAYTALYLELDKERASAATAADETMAMILRLQEEKASLEMEMRQ 328

Query: 1955 YKRLIEEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGML 1776
            Y+R+IEE+ AYD EEM+IL+EI+IRRERE HFLE+E+E YRQ                 +
Sbjct: 329  YQRMIEERVAYDEEEMDILQEILIRRERENHFLEEELESYRQ-----------------M 371

Query: 1775 DSQAFDSLLNSNEDPVMMLRHLSASIDKKNGAIGASKDSSVQGDENAARLTGDLPQFSVG 1596
            DS+  D L                              + VQ D+      G  P  SV 
Sbjct: 372  DSKGSDQLYGK---------------------------AMVQHDQ-----CGQRPPISVE 399

Query: 1595 SNDENQELKDKMVPIDNNSCAPLGNMTSVETLTWSSKVSSSVGKLPEENISLIGKGKEQV 1416
            +  E+Q  K  MVP D     P     SVET                E+ S I  GK  V
Sbjct: 400  TY-EDQSCK-AMVPHDQRVQRP---PISVETY---------------EDQSCISYGKAMV 439

Query: 1415 DRD-----TELLEKNFETSHIVXXXXXXXXXXXDTKQRTKTSCNSSN----------KEP 1281
              D     T +  + +E    +            +      +C +S           K+ 
Sbjct: 440  QHDLCGQKTPISVETYEDRSTISFFKKDEITDISSSYMVSQTCTNSEDGEEPDKSIEKKA 499

Query: 1280 CVHD----------VHVIDQSNSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQ----- 1146
             +HD          + V+D     + +   +E     SS+    +  E  T  L+     
Sbjct: 500  QMHDKLRRFFYDTDLDVLDIHVIEDNVEQREEENEKLSSSSLCSVTSEITTRYLEFGSRK 559

Query: 1145 --RVTAIRDTPTPSNMDVGVGINRSTSDTTSRLPPIVPKGKSVVSDLRRNSLSAVDNERL 972
                 A+R+    + ++ G  ++  +S     L     +  ++  D   +S SA +NE+L
Sbjct: 560  VRSNNALRNCRRINKIENGNSVDGPSSQL---LMLSNSRSNTLPFDYGSDSSSAFENEKL 616

Query: 971  KLDIEIGRLRERLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAV 792
            ++  EI  L ERL+ V+ G++KL+   E+ E    +LKLLEDIA+ LQ+++ ++ P   V
Sbjct: 617  RIGNEIEILGERLRKVKYGKQKLSSFQENGESLKGRLKLLEDIANNLQKIKNMRNP---V 673

Query: 791  RQASLPLPSSKVITKKRRCRSVS 723
            R ASLP  SSKV  +KRR +SV+
Sbjct: 674  RGASLPPSSSKVSLRKRRSQSVN 696


>ref|XP_002518099.1| hypothetical protein RCOM_1019660 [Ricinus communis]
            gi|223542695|gb|EEF44232.1| hypothetical protein
            RCOM_1019660 [Ricinus communis]
          Length = 571

 Score =  267 bits (683), Expect = 2e-68
 Identities = 203/551 (36%), Positives = 285/551 (51%), Gaps = 48/551 (8%)
 Frame = -2

Query: 2255 GRRHSNSINESPDEDET-IAKNVSFLRELKGSVETGQSS-SDDKYTIXXXXXXXXXXXXX 2082
            GR    S+++ P +D+    K+  +  +LK +VE   S  +D+K+TI             
Sbjct: 7    GRESPGSVSKEPTKDKKPTEKDGLYADDLKFNVEGELSDGNDEKHTIRVLEQALEEEQAA 66

Query: 2081 XXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRLIEEKSAYDAEEMNI 1902
                   LDKER +         AMI RLQ EKA+IEME+RQY+R+IEEKSAYD EEMNI
Sbjct: 67   HSALYLELDKERSAAATAADEAMAMILRLQGEKASIEMEARQYQRMIEEKSAYDFEEMNI 126

Query: 1901 LKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGMLDSQAFDSLLNSNEDPVMM 1722
            LKEI++RRE+EKHFLEKEVE YRQM+    +  +     G    +   SLL S+EDPV+M
Sbjct: 127  LKEILLRREKEKHFLEKEVETYRQMIFGSEQLDSDAQDIGTTRGRRASSLLYSSEDPVLM 186

Query: 1721 LRHLSASIDKKNGAIGASKDS-----SVQGDENAARLTGDLPQFSVGSNDE--------- 1584
            L+ +S S  +       +K S     S+   +++     +LP   +   D          
Sbjct: 187  LQKISESFYETEHVENTNKFSECEVTSIVSQDHSLAFGKELPIPELDGADSSKQGCIPRE 246

Query: 1583 ------------NQELKDKMVPIDNNSCAPLG----------------NMTSVETLTWSS 1488
                        N E+ +K   I + S                     N+ + E  T S 
Sbjct: 247  PSVNKYHCLSGGNGEINEKFEEIRSLSWERSSFSQETDMQSLEMHTEINLPTAEGYTSSE 306

Query: 1487 KVSSSVGKLPEE-NISLIGKGKEQVDRDTELLEKNFETSHIVXXXXXXXXXXXDTKQRTK 1311
            + ++SVG++ ++ ++    +G     +  ++ +  F                 D+K+  K
Sbjct: 307  RFNTSVGEIKQQGDVISTSQGSPNSIQPCDVTKNIFPYG------------CDDSKKYDK 354

Query: 1310 TSCNS-SNKEPCVHDVHVIDQS-NSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQRVT 1137
             S N  S+  PCVHDVHVID   N  +++ G    R   S    +   K   + V+ R  
Sbjct: 355  DSSNGLSDTGPCVHDVHVIDDKLNLCDEVRGN--GRETLSEIVSLDFPKSCNSPVMNR-- 410

Query: 1136 AIRDTPTPSNMDVGVGINRSTSDTTSRLPPI-VPKGKSVVSDLRRNSLSAVDNERLKLDI 960
              R T           I+RS S+ TS LPP    + K +VSDLRRNS+SAVD ER K+D 
Sbjct: 411  --RQTEQH--------ISRSCSEITSGLPPRGFLQRKPLVSDLRRNSMSAVDYERFKIDN 460

Query: 959  EIGRLRERLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQAS 780
            E+G LRERL+I+QEGREKLN+S+EH+E+EN+QL LLE+I  QLQE+RQL EPG AVRQ S
Sbjct: 461  EVGWLRERLRIIQEGREKLNISLEHREKENVQLLLLENIVSQLQEIRQLTEPGNAVRQVS 520

Query: 779  LPLPSSKVITK 747
            LP PS K + +
Sbjct: 521  LPPPSCKGVAR 531


>gb|EMJ28544.1| hypothetical protein PRUPE_ppa002653mg [Prunus persica]
          Length = 648

 Score =  262 bits (670), Expect = 7e-67
 Identities = 179/446 (40%), Positives = 245/446 (54%), Gaps = 9/446 (2%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            M CQ +++W  + LVGAF+DL+I+YLLLCA+A+AF  SKF+ VFGL LPCPCDG FF  P
Sbjct: 1    MTCQMIHSWRFNELVGAFLDLAIAYLLLCAAAVAFFTSKFVSVFGLCLPCPCDG-FFGTP 59

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEFQDQNCNMN--FRLVGEK--ESDVIEL 2652
             ++ CF + F D P EK+S VQ +++SKFPF+    +N N+N   + V E   E+   E 
Sbjct: 60   RKSHCFQRQFADVPCEKISAVQWAVKSKFPFDVLWSENSNINSKSKFVDETYYENGHFEF 119

Query: 2651 EGEASCSSVSDSRKPSNFVRIKGELNERN-EVGRMKLLGGKEGRFDLKGKEIMHQKSRSG 2475
            EGEASCSS+S+ R            N+++ E G   L  GKE  F+LK K++  ++ R  
Sbjct: 120  EGEASCSSLSERRLLDMVESDSVAENDQSVEFGVANLETGKEQHFELKPKKVSGRRPRLR 179

Query: 2474 LRHRRK-GSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSH 2298
             R RR+ GS ++G   S +SYD  +S A     SP SI                  N + 
Sbjct: 180  RRRRRRGGSVDYGNPVSVSSYDVFYSDAGDISTSPSSI------------------NCTE 221

Query: 2297 SGNYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSVETGQS---SSDDKY 2127
            +  Y     +  R          ES DE +   K+ S + +       G+     S++  
Sbjct: 222  APTYISSPDSVSR------PEFKESMDETKPTGKDGSVVED--SGCNAGEKLGFDSNETT 273

Query: 2126 TIXXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKR 1947
            T+                    L+KER +         AMI RLQEEKA+IEME+RQY+R
Sbjct: 274  TVRVLEQALEEEHATRAALYLELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQR 333

Query: 1946 LIEEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAGMLDSQ 1767
            +IEEKSAYDAEEMNILKEI++RREREKHFL KEVE +RQ+     E Q   ++  +  +Q
Sbjct: 334  MIEEKSAYDAEEMNILKEILVRREREKHFLVKEVEEFRQI--FFGEDQVDFDMHDVATTQ 391

Query: 1766 AFDSLLNSNEDPVMMLRHLSASIDKK 1689
            A    L S+ED V ML+  S SI ++
Sbjct: 392  AQKPALRSSEDQVPMLQWASESITEE 417



 Score =  144 bits (364), Expect = 2e-31
 Identities = 78/135 (57%), Positives = 102/135 (75%), Gaps = 1/135 (0%)
 Frame = -2

Query: 1127 DTPTPSNMDVGVGINRSTSDTTSRLPPI-VPKGKSVVSDLRRNSLSAVDNERLKLDIEIG 951
            D PT + +     I R +SDT S L  +  P+G+S+ SD+RRNS+SA+D ER K+D E+ 
Sbjct: 506  DIPTTTGLKTQRKIQRVSSDTASVLLTMGCPRGRSLPSDMRRNSMSAIDYERFKIDNEVE 565

Query: 950  RLRERLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVRQASLPL 771
            RLRERL+IVQEGREKLN SV H+ERE +QL+LLEDIA QL+E++QL EPGKA  QA +  
Sbjct: 566  RLRERLRIVQEGREKLNFSVGHRERERIQLQLLEDIASQLREIQQLTEPGKAECQAGMLP 625

Query: 770  PSSKVITKKRRCRSV 726
            P SKV++KKRR R++
Sbjct: 626  PLSKVMSKKRRWRTL 640


>ref|XP_002518101.1| conserved hypothetical protein [Ricinus communis]
            gi|223542697|gb|EEF44234.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 641

 Score =  248 bits (632), Expect = 2e-62
 Identities = 167/413 (40%), Positives = 232/413 (56%), Gaps = 7/413 (1%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            M C A+  W+ SGLVGAF+DLSI++ LLCASA+A+ ASKFL  FGL LPCPC+G F I  
Sbjct: 1    MPCLAIRRWTFSGLVGAFLDLSITFFLLCASALAYFASKFLAFFGLNLPCPCNGFFAIPD 60

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPF----NEFQDQNCNMNFRLVGEKESDVIEL 2652
            + N C  + F+DYP +K+S++Q S++SKFPF    N  Q ++   N+R + + E  V   
Sbjct: 61   ASNNCLQRQFVDYPLQKISSIQSSVKSKFPFDSIGNRSQWKSNLENYRNIVKNE--VAGS 118

Query: 2651 EGEASCSSVSDSRKPSNFVRIKGELNERNEV-GRMKLLGGKEGRFDLKGKEIMHQKSRSG 2475
            EGE+SC S S +R  ++      ++ E+  V G M L   KE +FD K K ++  +SR+ 
Sbjct: 119  EGESSCISSSVTRAENSRDGDLAKMKEKGFVMGAMNLQDVKERKFDCKWKGLLRHRSRNN 178

Query: 2474 LRHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHS 2295
            LR RRK   ++GK S  +S+   +S A+  P+SPP+  +  N    D +  L+       
Sbjct: 179  LRRRRK---DNGKLSQVSSFKSLWSDAE-TPQSPPARIR--NETCKDGMEPLNYRGTVSE 232

Query: 2294 GNYHE--DATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSVETGQSSSDDKYTI 2121
             N +E  D              I++  +  E + +N     E      T     + + TI
Sbjct: 233  VNCYEILDGKEGSVVDIGSKRKISQGFELYEPVDEN-----ETSDHENTSDLDGNARNTI 287

Query: 2120 XXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQYKRLI 1941
                                L+KER +         AMIQRLQ+EKA IEME+RQ +R+I
Sbjct: 288  RLLELALEEEHAARAVLYVELEKERSAAATAADEAMAMIQRLQKEKALIEMEARQCQRMI 347

Query: 1940 EEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQMLRLGNEQQATENIAG 1782
            EEK AYDAEEMNILKEI++RREREK+FLEKEVE YRQ   + NEQ   +N +G
Sbjct: 348  EEKYAYDAEEMNILKEILLRREREKYFLEKEVEAYRQ--AICNEQFEADNTSG 398



 Score =  119 bits (299), Expect = 7e-24
 Identities = 79/191 (41%), Positives = 109/191 (57%), Gaps = 2/191 (1%)
 Frame = -2

Query: 1322 QRTKTSCNSSNKEPCVHDVHVI-DQSNSYNQISGGKEARMPKSSTPEIHMIKEAETSVLQ 1146
            Q T +  + +  +  +HD+HVI DQS+   Q+   K  ++  +++   H  K        
Sbjct: 474  QTTTSLVHKTVDKESIHDIHVIEDQSSMSKQVIEDKNKQLLANASTLTHTAK-------- 525

Query: 1145 RVTAIRDTPTPSNMDVGVGINRSTSDTTSRLPPI-VPKGKSVVSDLRRNSLSAVDNERLK 969
                               I R++S   S LPPI   +G+S+ S++RR S+SA D ER K
Sbjct: 526  -------------------IGRTSSIIPSGLPPIGSSRGRSMRSEMRRKSMSAFDAERYK 566

Query: 968  LDIEIGRLRERLKIVQEGREKLNLSVEHKERENLQLKLLEDIAHQLQELRQLKEPGKAVR 789
            +D EI  LRERL+ VQEGREKL  S   KE E  QL++LEDI +QLQE+RQL EPGKA+R
Sbjct: 567  IDNEIIWLRERLRSVQEGREKLKFSKGSKEGEKTQLQMLEDITNQLQEIRQLTEPGKALR 626

Query: 788  QASLPLPSSKV 756
            +ASLP  +S V
Sbjct: 627  RASLPPLTSNV 637


>ref|XP_002868347.1| hypothetical protein ARALYDRAFT_355453 [Arabidopsis lyrata subsp.
            lyrata] gi|297314183|gb|EFH44606.1| hypothetical protein
            ARALYDRAFT_355453 [Arabidopsis lyrata subsp. lyrata]
          Length = 644

 Score =  240 bits (612), Expect = 4e-60
 Identities = 224/728 (30%), Positives = 336/728 (46%), Gaps = 16/728 (2%)
 Frame = -2

Query: 2999 MACQAMYTWSLSGLVGAFVDLSISYLLLCASAIAFTASKFLGVFGLYLPCPCDGLFFIQP 2820
            M CQ + +W+  GLV AFVDLS+++ LLCAS++ +  SKFLG+FGL LPCPCDGL F +P
Sbjct: 1    MRCQEVRSWTFKGLVAAFVDLSVAFSLLCASSLVYVTSKFLGLFGLALPCPCDGL-FSEP 59

Query: 2819 SRNFCFHKLFIDYPAEKVSNVQLSIRSKFPFNEFQDQNCNMNFRLVGEK---ESDVIELE 2649
             +  CF +   + P +K+S+VQ S+R++ PF+    +   +N   VG+K   E   +ELE
Sbjct: 60   HK--CFQESLGNLPVKKISSVQRSVRNRTPFDSILCK--EVNGGCVGKKRKGERKRVELE 115

Query: 2648 GEASCS-SVSDSRKPSNFVRIKGELNERNEVGRMKLLGGKEGRFDLKGKEIMHQKSRSGL 2472
             E S + SV    K S F  +  +               K+G F +K K +   +S  G 
Sbjct: 116  DEVSSTPSVGKIEKASGFDLLTAQ-------------SLKKGSFKVKSKRLSFHRSPYGF 162

Query: 2471 RHRRKGSFNHGKYSSRASYDPRFSKAQGDPESPPSISKGGNGLAADDIMVLDNGNGSHSG 2292
            ++  +GS  H               ++G  ES     K  N    D ++V        SG
Sbjct: 163  KNHFQGSLGH-------------KNSEGSNESVMEYLKDVN--ENDPLLV----KWKDSG 203

Query: 2291 NYHEDATTAMRFGRRHSNSINESPDEDETIAKNVSFLRELKGSVETGQSSSDD------- 2133
               ED +           S++ S    E  A+N    R+   + E   SS  D       
Sbjct: 204  TTLEDVSL--------RKSVSLSSVGCEAGAQNKQPERKFSWAGEGTCSSPVDLTYSGMT 255

Query: 2132 KYTIXXXXXXXXXXXXXXXXXXXXLDKERISXXXXXXXXXAMIQRLQEEKAAIEMESRQY 1953
            + TI                    L+KER +          MI RLQEEKA+IEME+RQY
Sbjct: 256  QKTIEILEQVLAEERAARASLALELEKERNAAASAADEALGMILRLQEEKASIEMEARQY 315

Query: 1952 KRLIEEKSAYDAEEMNILKEIVIRREREKHFLEKEVEVYRQM-LRLGNEQQATENIAGML 1776
            +R+IEEKSA+DAEEM+ILKEI++RREREKHFLEKEV+ YRQM L         ++    +
Sbjct: 316  QRMIEEKSAFDAEEMSILKEILLRREREKHFLEKEVDTYRQMFLETEQPHNTPDSKPARI 375

Query: 1775 DSQAFDSLLNSNEDPVMMLRHLSASIDKKNGAIGASKDSSVQGDENAARLTGDLPQFSVG 1596
            +       +  + D +      +A +           DS +    N + L GD  +  V 
Sbjct: 376  ERLQTPQQITESWDDME-----TADVSFGFEIFTNQMDSRLLAHGNKSVLPGDYSE--VD 428

Query: 1595 SNDENQELKDKMVPIDNNSCAPLGNMTSVETLTWSSKVSSSVG-KLPEENISLIGKGKEQ 1419
            ++DEN+   D+      +   P       +     +K+ S V  K  EE ++      E 
Sbjct: 429  NDDENKNGVDQSPERGQSRSEPF------DVHHEKAKLLSDVELKEREEGVTSF---PEL 479

Query: 1418 VDRDTEL-LEKNF-ETSHIVXXXXXXXXXXXDTKQRTKTSCNSSNKEPCVHDVHVIDQSN 1245
            V R +++ + KN  E SH +                          +  VHD+HV+   +
Sbjct: 480  VSRTSDITVTKNLGEESHDI--------------------------DGHVHDIHVVTDED 513

Query: 1244 SYNQISGGKEARMPKSSTPEIHMIKEAETSVLQRVTAIRDTPTPSNMDVGVGINRSTSDT 1065
            +  Q+           + P  H I                       D+ +  ++S SDT
Sbjct: 514  NKAQL-----------NVPFDHAIS----------------------DLKLDRSQSVSDT 540

Query: 1064 TSRLPPIVPKGKSVVS-DLRRNSLSAVDNERLKLDIEIGRLRERLKIVQEGREKLNLSVE 888
            +  LPP    GKS +S ++RRNS+SA+D ERLK++ E+G LR RL+ VQ+GREK++ S +
Sbjct: 541  SYVLPP----GKSNMSPNMRRNSMSAIDYERLKIESEVGLLRGRLRAVQKGREKISFSSK 596

Query: 887  HKERENLQ 864
             + +  +Q
Sbjct: 597  EQSKSQIQ 604

