BLASTX nr result

ID: Cinnamomum23_contig00005052 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00005052
         (2644 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010261088.1| PREDICTED: uncharacterized protein LOC104599...   796   0.0  
ref|XP_010243331.1| PREDICTED: uncharacterized protein LOC104587...   777   0.0  
ref|XP_010939550.1| PREDICTED: uncharacterized protein LOC105058...   767   0.0  
ref|XP_010937059.1| PREDICTED: uncharacterized protein LOC105056...   765   0.0  
ref|XP_002271060.2| PREDICTED: uncharacterized protein LOC100249...   763   0.0  
emb|CAN82910.1| hypothetical protein VITISV_015279 [Vitis vinifera]   763   0.0  
ref|XP_007043029.1| Emb:CAB89363.1 [Theobroma cacao] gi|50870696...   761   0.0  
ref|XP_008804473.1| PREDICTED: uncharacterized protein LOC103717...   759   0.0  
gb|KHG22844.1| putative WRKY transcription factor 19 -like prote...   748   0.0  
ref|XP_012462950.1| PREDICTED: uncharacterized protein LOC105782...   748   0.0  
ref|XP_010916986.1| PREDICTED: uncharacterized protein LOC105041...   742   0.0  
ref|XP_002513710.1| conserved hypothetical protein [Ricinus comm...   739   0.0  
ref|XP_008784519.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   736   0.0  
ref|XP_011004973.1| PREDICTED: uncharacterized protein LOC105111...   732   0.0  
ref|XP_012079959.1| PREDICTED: uncharacterized protein LOC105640...   731   0.0  
ref|XP_011092401.1| PREDICTED: uncharacterized protein LOC105172...   731   0.0  
ref|XP_002304079.2| hypothetical protein POPTR_0003s01750g [Popu...   731   0.0  
ref|XP_010103840.1| hypothetical protein L484_024142 [Morus nota...   729   0.0  
ref|XP_009586702.1| PREDICTED: uncharacterized protein LOC104084...   729   0.0  
ref|XP_009586701.1| PREDICTED: uncharacterized protein LOC104084...   729   0.0  

>ref|XP_010261088.1| PREDICTED: uncharacterized protein LOC104599999 [Nelumbo nucifera]
          Length = 641

 Score =  796 bits (2057), Expect = 0.0
 Identities = 405/627 (64%), Positives = 472/627 (75%), Gaps = 8/627 (1%)
 Frame = -3

Query: 2030 KLENMADTTLTLDYVGYGAIKRFKTPESGNYESNLAHSSQASDDGCRLVLGLGPSPSCYA 1851
            K +N +DT L LD++GYG  K  ++ +SGN   NL+ +  A DDGC LVLGLGP+PS Y 
Sbjct: 19   KNDNFSDTALCLDFLGYGTKKISRSRDSGNNHCNLS-AQVAPDDGCGLVLGLGPTPSYYC 77

Query: 1850 DDYYPVGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQCEYDPMS 1671
            DDY+  G+ KN GS  +L + L SD DS +LKLGL +G+ +     E+  + Q   +   
Sbjct: 78   DDYHRYGSDKNTGSANILGEKLSSDGDSGILKLGLPQGTGDALSSDENLFADQGNVNGSF 137

Query: 1670 LQNQWLVNGH---IPIVDEGSTSAKRNSGGYMPALLLAPSLENRN-VPETRELLEHGTGA 1503
             ++Q     +   IPIVDEGSTSAK+ SGGYMP+LLLAP     N + ET+ LLE GT +
Sbjct: 138  HRSQMSAEENRISIPIVDEGSTSAKK-SGGYMPSLLLAPRQYTGNFLMETQSLLELGTRS 196

Query: 1502 GA--YHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGASGLC 1329
                  +Q SPEPS  TDS+ G  S P+TS  SSE  +HHPK+C+FDGCSKGARGASGLC
Sbjct: 197  HPPPAPVQLSPEPSVITDST-GRRSEPVTSLTSSEHRTHHPKKCKFDGCSKGARGASGLC 255

Query: 1328 IAHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXX 1149
            IAHGGGQRC K GC KGAES+TA+CKAHGGGRRCQHLGCTKSAEGKTDFCIA        
Sbjct: 256  IAHGGGQRCHKPGCNKGAESKTAYCKAHGGGRRCQHLGCTKSAEGKTDFCIAHGGGRRCG 315

Query: 1148 XXXCTKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQG 969
               CTKAARGKSG CI+HGGGKRCK EGC+RSAEGQAGLCISHGGGRRCQ+ GCTKGAQG
Sbjct: 316  QPGCTKAARGKSGRCIKHGGGKRCKAEGCTRSAEGQAGLCISHGGGRRCQYSGCTKGAQG 375

Query: 968  STMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVA 789
            STM+CKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRCL+ GGGICPKSVHGGTDYCVA
Sbjct: 376  STMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFQGGGICPKSVHGGTDYCVA 435

Query: 788  HGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAWG 609
            HGGGKRCAVP CTKSARGRTDCCVRHGGGKRC++ENCGKSAQGSTDFCKAHGGGKRC W 
Sbjct: 436  HGGGKRCAVPSCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGKRCTWR 495

Query: 608  QGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVSESTVGSSSDNNFYGGA 429
            QG C+KFARG+SGLCAAH ++   ++   +GGMI   LFRGLVS STVGSS DN +   A
Sbjct: 496  QGTCEKFARGKSGLCAAHSSMVQDRDTS-KGGMIGPGLFRGLVSVSTVGSSMDNGYSSSA 554

Query: 428  -SAVSNCVESSVNTNTRQ-LIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXXXXXXXF 255
             SA+S+C++S  N   RQ L+PPQVLVPLSMK SS+S ++                   F
Sbjct: 555  VSAISDCIDSPENPAKRQHLLPPQVLVPLSMKFSSSSGILGAEREEGRSNQGGGARSLDF 614

Query: 254  ALPEGRVHGGILMSLLKGDMKNIDNGL 174
             +PEGRVHGG L+SLL G +KN  +G+
Sbjct: 615  MVPEGRVHGGGLLSLLGGSLKNAIDGI 641


>ref|XP_010243331.1| PREDICTED: uncharacterized protein LOC104587418 [Nelumbo nucifera]
          Length = 640

 Score =  777 bits (2006), Expect = 0.0
 Identities = 396/628 (63%), Positives = 460/628 (73%), Gaps = 9/628 (1%)
 Frame = -3

Query: 2030 KLENMADTTLTLDYVGYGAIKRFKTPESGNYESNLAHSSQASDDGCRLVLGLGPSPSCYA 1851
            K +N  +T L LD++GYG+ K  ++ +SG+    ++      DDG  LVLGLGP+PS   
Sbjct: 18   KRDNFGNTALHLDFLGYGSKKTAQSGDSGSNHHKIS-VQVPPDDGYHLVLGLGPTPSSCH 76

Query: 1850 DDYYPVGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQCEYDPMS 1671
            D  Y VG  K      +  Q L SD DS +LKLGL  G+ +  G+ ++  S+Q       
Sbjct: 77   D--YSVGINKRKEIANIFGQGLSSDGDSGILKLGLPLGTGDTLGVCQNPFSVQSHLSTSH 134

Query: 1670 LQNQWLVNGH---IPIVDEGSTSAKRNSGGYMPALLLAPSLE-NRNVPETRELLEHGTGA 1503
              NQ  +  +   IPIVDEGSTSAK+ SGGYMP+LLLAP       V +T+ LLE GT +
Sbjct: 135  HLNQMPIEENRLSIPIVDEGSTSAKK-SGGYMPSLLLAPRAPVMEKVVKTQPLLELGTRS 193

Query: 1502 GAYHLQF---SPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGASGL 1332
              +H  F   SPEP+  TD S G  S P+T+  SSE  + HPK+C+FDGCSKGARGASGL
Sbjct: 194  NTHHSHFPQLSPEPTVTTDYSTGTHSEPVTAMVSSEHRASHPKKCKFDGCSKGARGASGL 253

Query: 1331 CIAHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXX 1152
            CIAHGGGQRCQK GC KGAES+TA+CKAHGGGRRCQ LGCTKSAEGKTD+CIA       
Sbjct: 254  CIAHGGGQRCQKPGCNKGAESKTAYCKAHGGGRRCQRLGCTKSAEGKTDYCIAHGGGRRC 313

Query: 1151 XXXXCTKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQ 972
                CTKAARGKSG CIRHGGGKRC VEGC+RSAEGQAGLCISHGGGRRCQ PGCTKGAQ
Sbjct: 314  GQPGCTKAARGKSGRCIRHGGGKRCNVEGCTRSAEGQAGLCISHGGGRRCQFPGCTKGAQ 373

Query: 971  GSTMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCV 792
            GSTM+CKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRCL+ GGGICPKSVHGGTDYCV
Sbjct: 374  GSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFQGGGICPKSVHGGTDYCV 433

Query: 791  AHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAW 612
            AHGGGKRCAVPGCTKSARGRTDCCV+HGGGKRC++ENCGKSAQGSTDFCKAHGGGKRC W
Sbjct: 434  AHGGGKRCAVPGCTKSARGRTDCCVKHGGGKRCKFENCGKSAQGSTDFCKAHGGGKRCTW 493

Query: 611  GQGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVSESTVGSSSDNNF-YG 435
            GQG C+KFARG+SGLCAAH ++   +N   +G MI   LFRGLV  STVGSS +N +   
Sbjct: 494  GQGTCEKFARGKSGLCAAHSSMVQDRNTD-KGNMIGPGLFRGLVPVSTVGSSMENEYSSS 552

Query: 434  GASAVSNCVESSVNTNTRQ-LIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXXXXXXX 258
            G SA+S+C +S  N   RQ LIPPQVLVPLSMKSSS+S ++                   
Sbjct: 553  GVSAISDCTDSPENPAKRQHLIPPQVLVPLSMKSSSSSGMLGAEREEEGSSRVGGTRNFD 612

Query: 257  FALPEGRVHGGILMSLLKGDMKNIDNGL 174
            F +PEGRVHGG L+SLL G +KN  +G+
Sbjct: 613  FMVPEGRVHGGGLLSLLGGSLKNTIDGI 640


>ref|XP_010939550.1| PREDICTED: uncharacterized protein LOC105058355 [Elaeis guineensis]
            gi|743849164|ref|XP_010939551.1| PREDICTED:
            uncharacterized protein LOC105058355 [Elaeis guineensis]
            gi|743849168|ref|XP_010939552.1| PREDICTED:
            uncharacterized protein LOC105058355 [Elaeis guineensis]
            gi|743849171|ref|XP_010939553.1| PREDICTED:
            uncharacterized protein LOC105058355 [Elaeis guineensis]
          Length = 653

 Score =  767 bits (1981), Expect = 0.0
 Identities = 408/669 (60%), Positives = 466/669 (69%), Gaps = 22/669 (3%)
 Frame = -3

Query: 2117 MDLNATDSS--SARKVQLISPYADKHISDHRKLENMADTTLTLDYVGYGAIKRFKTPESG 1944
            MDLN T+        + + S     HIS   K +N+ DTTL LD +GYG     +   S 
Sbjct: 1    MDLNMTEVFLYGTEPLMITSGTQTAHISTSGKNDNLGDTTLHLDCLGYGINNTCRRVHS- 59

Query: 1943 NYESNLAHSSQASDDGCRLVLGLGPSPSCYADDYYPVGTVKNNGSVPVLSQWLPSDSDSE 1764
              +SN   SSQ  DDGC+LVLGLGP+P+ Y+ DYYP G  K   S+ + SQ  P   D+ 
Sbjct: 60   --QSNSDLSSQ--DDGCKLVLGLGPTPNIYSADYYPSGINKAKQSMSLFSQRFPGP-DAA 114

Query: 1763 MLKLGLSRGSAE-VPGLQESSASLQCEYDPMSLQNQWLVNGHIPIVDEGSTSAKRNSGGY 1587
            MLKLGLSRG++E +P   ES+           L         +PIVDEGSTSAKR SGGY
Sbjct: 115  MLKLGLSRGNSESIPAPAESNRETPSPAAKRCLP--------VPIVDEGSTSAKRKSGGY 166

Query: 1586 MPALLLAPSLENRNV----PETRELLEHGTGAGAY-------HLQFSPEPSAPTDSSVGA 1440
            MP+LL AP LE+ N     PET ELL+ G+    +        LQFSPEPSA TD S+ A
Sbjct: 167  MPSLLFAPRLESLNTMETAPETAELLDQGSEGSTHKHHSSHHELQFSPEPSATTDGSLSA 226

Query: 1439 ISGPI---TSGASSERWSHHPKRCRFDGCSKGARGASGLCIAHGGGQRCQKTGCTKGAES 1269
            +S P+   +  A +    HH K+CRF+GCSKGARG+SGLCI HGGGQRCQK GC KGAES
Sbjct: 227  MSEPMIAASEAADTRSHHHHLKKCRFNGCSKGARGSSGLCIGHGGGQRCQKPGCNKGAES 286

Query: 1268 RTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXXXXXCTKAARGKSGLCIRHGG 1089
            RTA+CKAHGGGRRCQ LGCTKSAEGKTDFCIA           CTKAARGKSGLCIRHGG
Sbjct: 287  RTAYCKAHGGGRRCQQLGCTKSAEGKTDFCIAHGGGRRCAHAGCTKAARGKSGLCIRHGG 346

Query: 1088 GKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQGSTMHCKAHGGGRRCVFAGCT 909
            GKRC VEGC+RSAEGQAGLCISHGGGRRCQ  GC KGAQGSTM CKAHGGG+RC+F GCT
Sbjct: 347  GKRCTVEGCTRSAEGQAGLCISHGGGRRCQFTGCCKGAQGSTMFCKAHGGGKRCIFEGCT 406

Query: 908  KGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVAHGGGKRCAVPGCTKSARGRT 729
            KGAEGSTPLCKGHGGGKRC ++GGG+CPKSVHGGTD+CVAHGGGKRCAVP CTKSARGRT
Sbjct: 407  KGAEGSTPLCKGHGGGKRCRFEGGGVCPKSVHGGTDFCVAHGGGKRCAVPACTKSARGRT 466

Query: 728  DCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAWGQGKCDKFARGRSGLCAAHGN 549
            DCCVRHGGGKRC++E CGKSAQGSTDFCKAHGGGKRC WGQG C+KFARG+SGLCAAHG+
Sbjct: 467  DCCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCTWGQG-CEKFARGKSGLCAAHGS 525

Query: 548  IFLGQ---NAGRRGGMIDTSLFRGLVS-ESTVGSSSDNNFYGGA-SAVSNCVESSVNTNT 384
            + L Q    AG+ G MI   LF G+VS  +TVGSS DN     A S + +CVES  +   
Sbjct: 526  LVLSQQEREAGKSGSMIGPGLFHGIVSISTTVGSSIDNEHPSSAMSTLWDCVESQESLGR 585

Query: 383  RQLIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXXXXXXXFALPEGRVHGGILMSLLK 204
             QLIP QVLVPLSMKS S S  + D                 F +PEGRVHGG LMSLL 
Sbjct: 586  MQLIPSQVLVPLSMKSPSQSSGLTD---VGRDGAGSQKKSFGFVIPEGRVHGGGLMSLLG 642

Query: 203  GDMKNIDNG 177
             ++ +   G
Sbjct: 643  RNLNDALGG 651


>ref|XP_010937059.1| PREDICTED: uncharacterized protein LOC105056536 [Elaeis guineensis]
          Length = 652

 Score =  765 bits (1975), Expect = 0.0
 Identities = 401/668 (60%), Positives = 470/668 (70%), Gaps = 21/668 (3%)
 Frame = -3

Query: 2117 MDLNATDSSSARKVQLISPYADK--HISDHRKLENMADTTLTLDYVGYGAIKRFKTPESG 1944
            MDLN T+     K  L+S    +  HIS   K +N+ DT L LD +G G     +T    
Sbjct: 1    MDLNMTEVFLHGKESLMSGSVIQTAHISTCGKNDNLGDTALCLDCLGNGINNTNRT---- 56

Query: 1943 NYESNLAHSSQASDDGCRLVLGLGPSPSCYADDYYPVGTVKNNGSVPVLSQWLPSDSDSE 1764
               + ++    + DDGC+LVLGLGP+P+ Y  D++P G  K   S+ +L+Q   S  D+ 
Sbjct: 57   -VHTQISCDLSSQDDGCKLVLGLGPTPNTYCADHHPSGIHKTRPSMSLLAQH-SSGPDAA 114

Query: 1763 MLKLGLSRGSAE-VPGLQESSASLQCEYDPMSLQNQWLVNGHIPIVDEGSTSAKRNSGGY 1587
            MLKLGLS G+ E +  L +++          SL         +PIVDEGSTSAKR SGGY
Sbjct: 115  MLKLGLSGGNLETIVALPQNNHKTHSPPAKRSLP--------VPIVDEGSTSAKRRSGGY 166

Query: 1586 MPALLLAPSLEN----RNVPETRELLEHGTGAGA-------YHLQFSPEPSAPTDSSVGA 1440
            MP LL AP L++    R  PETR+LL+ G+G          + LQ SPEPS  TD S+GA
Sbjct: 167  MPFLLFAPRLDSLDRMRIAPETRDLLDQGSGGSTRKHHIPHHQLQLSPEPSVTTDGSLGA 226

Query: 1439 IS-GPITSGASSERWSHH--PKRCRFDGCSKGARGASGLCIAHGGGQRCQKTGCTKGAES 1269
            IS   I + A+++  +HH  PK+CRF GCSKGARGASGLCIAHGGGQRCQK GC KGAES
Sbjct: 227  ISEAMIAAAATTDMRTHHQHPKKCRFTGCSKGARGASGLCIAHGGGQRCQKPGCNKGAES 286

Query: 1268 RTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXXXXXCTKAARGKSGLCIRHGG 1089
            RTA+CKAHGGGRRCQ LGCTKSAEGKTDFCI            CTKAARGKSGLCIRHGG
Sbjct: 287  RTAYCKAHGGGRRCQQLGCTKSAEGKTDFCIGHGGGRRCVHPGCTKAARGKSGLCIRHGG 346

Query: 1088 GKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQGSTMHCKAHGGGRRCVFAGCT 909
            GKRC VEGC+RSAEGQAGLCISHGGGRRCQ+PGC KGAQGST  CKAHGGGRRC+F GCT
Sbjct: 347  GKRCTVEGCTRSAEGQAGLCISHGGGRRCQYPGCGKGAQGSTKFCKAHGGGRRCIFEGCT 406

Query: 908  KGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVAHGGGKRCAVPGCTKSARGRT 729
            KGAEGSTPLCKGHGGGKRCL++GGG+CPKSVHGGT++CVAHGGGKRC VPGCTKSARGRT
Sbjct: 407  KGAEGSTPLCKGHGGGKRCLFEGGGVCPKSVHGGTNFCVAHGGGKRCTVPGCTKSARGRT 466

Query: 728  DCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAWGQGKCDKFARGRSGLCAAHGN 549
            DCCVRHGGGKRC+++ CGKSAQGSTDFCKAHGGGKRC W QG C+KFARGRSGLCAAHG+
Sbjct: 467  DCCVRHGGGKRCKFDGCGKSAQGSTDFCKAHGGGKRCLWSQG-CEKFARGRSGLCAAHGS 525

Query: 548  IFLGQ---NAGRRGGMIDTSLFRGLVSESTVGSSSDN-NFYGGASAVSNCVESSVNTNTR 381
            + L     NAG+ G MI   LF+G+VS  T   SS N +   G S +S+ VES  +   +
Sbjct: 526  LMLSPQECNAGKSGSMIGPGLFQGIVSTFTAAKSSINEHSSSGMSTISDSVESEESMGRK 585

Query: 380  QLIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXXXXXXXFALPEGRVHGGILMSLLKG 201
            QLIPPQVLVPLSMKS S S +++D                   +PEGRVHGG LM+LL G
Sbjct: 586  QLIPPQVLVPLSMKSPSPSSVLMD----AGREEGGSHQDFGLVVPEGRVHGGGLMTLLGG 641

Query: 200  DMKNIDNG 177
            D+KN  +G
Sbjct: 642  DLKNAFDG 649


>ref|XP_002271060.2| PREDICTED: uncharacterized protein LOC100249189 [Vitis vinifera]
          Length = 653

 Score =  763 bits (1971), Expect = 0.0
 Identities = 393/646 (60%), Positives = 468/646 (72%), Gaps = 23/646 (3%)
 Frame = -3

Query: 2048 HISDHRKLENMADTTLTLDYVGYGAIKRFKTPESGNYESNLAHSSQASDDGCRLVLGLGP 1869
            H  +  K +N  DTTL+L+  G+G     +   + N  S     S   DDGCRLVLGLGP
Sbjct: 12   HTCEFIKNDNFGDTTLSLNCFGFGGSNTARIVNTRN--SLGVKPSNPPDDGCRLVLGLGP 69

Query: 1868 SPSCYADDYYPVGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQC 1689
            +P+ Y DDYY V   K+ GS  +  + LPS+ DS +LKLG S G  E  GL + S S+Q 
Sbjct: 70   TPNTYCDDYYHVDVNKSKGSATMYPKRLPSEVDS-ILKLGPSGGVGEFLGL-DCSVSVQT 127

Query: 1688 EYDPMSLQNQWLVNGH---IPIVDEGSTSAKRNSGGYMPALLLAPSLENRNVPETRELLE 1518
            + +     NQ   + +   IP+VDEGSTSAK+ SGGYMP+LLLAP ++ +   +T+EL E
Sbjct: 128  DVNSSCHPNQVSDDDNRVLIPVVDEGSTSAKK-SGGYMPSLLLAPRMDRKVSMQTQELFE 186

Query: 1517 HGTGAGAYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGAS 1338
             GT +  +  Q SPEPSA TD S G IS   T+  SS+  +++PK+C+F  C+KGARGAS
Sbjct: 187  LGTKSHHHLSQLSPEPSATTDYSTGTISESATAVTSSDHRNNNPKKCKFMDCTKGARGAS 246

Query: 1337 GLCIAHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXX 1158
            GLCI HGGGQRCQK GC KGAESRTA+CKAHGGGRRCQ LGCTKSAEGKT+FCIA     
Sbjct: 247  GLCIGHGGGQRCQKPGCNKGAESRTAYCKAHGGGRRCQQLGCTKSAEGKTNFCIAHGGGR 306

Query: 1157 XXXXXXC-TKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTK 981
                    TKAARGKSGLCI+HGGGKRCK+EGC+RSAEGQAGLCISHGGGRRCQ+ GCTK
Sbjct: 307  RCGHPAGCTKAARGKSGLCIKHGGGKRCKIEGCTRSAEGQAGLCISHGGGRRCQYQGCTK 366

Query: 980  GAQGSTMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTD 801
            GAQGSTM CKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRCL+DGGGICPKSVHGGT+
Sbjct: 367  GAQGSTMFCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGTN 426

Query: 800  YCVAHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKR 621
            +CVAHGGGKRC+VPGCTKSARGRTDCCV+HGGGKRC++ENCGKSAQGSTDFCKAHGGGKR
Sbjct: 427  FCVAHGGGKRCSVPGCTKSARGRTDCCVKHGGGKRCKFENCGKSAQGSTDFCKAHGGGKR 486

Query: 620  CAWGQGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLV--SESTVGSSSDN 447
            C+WG+GKC+KFARG+SGLCAAH ++ + +   ++GGMI   LF GLV  + ST GSS DN
Sbjct: 487  CSWGEGKCEKFARGKSGLCAAHSSL-VQERETKKGGMIGPGLFHGLVPTATSTGGSSFDN 545

Query: 446  NFYGGASAVSNCVESSVNTNTR---QLIPPQVLVPLSMKSSST-SRLM------------ 315
            N   G S +S+C+ S    + R   QLIPPQVLVPLSMKSSS+ SRL+            
Sbjct: 546  NSSSGVSVISDCINSLEKASKRRQQQLIPPQVLVPLSMKSSSSYSRLVSAERQEEASHGG 605

Query: 314  -VDXXXXXXXXXXXXXXXXXFALPEGRVHGGILMSLLKGDMKNIDN 180
             +                    +PEGRVHGG LMS+L G++KN  N
Sbjct: 606  GIGGSSSNNTAGGKSFNMMMMMIPEGRVHGGGLMSMLGGNLKNACN 651


>emb|CAN82910.1| hypothetical protein VITISV_015279 [Vitis vinifera]
          Length = 692

 Score =  763 bits (1971), Expect = 0.0
 Identities = 393/646 (60%), Positives = 468/646 (72%), Gaps = 23/646 (3%)
 Frame = -3

Query: 2048 HISDHRKLENMADTTLTLDYVGYGAIKRFKTPESGNYESNLAHSSQASDDGCRLVLGLGP 1869
            H  +  K +N  DTTL+L+  G+G     +   + N  S     S   DDGCRLVLGLGP
Sbjct: 12   HTCEFIKNDNFGDTTLSLNCFGFGGSNTARIVNTRN--SLGVKPSNPPDDGCRLVLGLGP 69

Query: 1868 SPSCYADDYYPVGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQC 1689
            +P+ Y DDYY V   K+ GS  +  + LPS+ DS +LKLG S G  E  GL + S S+Q 
Sbjct: 70   TPNTYCDDYYHVDVNKSKGSATMYPKRLPSEVDS-ILKLGPSGGVGEFLGL-DXSVSVQT 127

Query: 1688 EYDPMSLQNQWLVNGH---IPIVDEGSTSAKRNSGGYMPALLLAPSLENRNVPETRELLE 1518
            + +     NQ   + +   IP+VDEGSTSAK+ SGGYMP+LLLAP ++ +   +T+EL E
Sbjct: 128  DVNSSCHPNQVSDDDNRVLIPVVDEGSTSAKK-SGGYMPSLLLAPRMDRKVSMQTQELFE 186

Query: 1517 HGTGAGAYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGAS 1338
             GT +  +  Q SPEPSA TD S G IS   T+  SS+  +++PK+C+F  C+KGARGAS
Sbjct: 187  LGTKSHHHLSQLSPEPSATTDYSTGTISESATAVTSSDHRNNNPKKCKFMDCTKGARGAS 246

Query: 1337 GLCIAHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXX 1158
            GLCI HGGGQRCQK GC KGAESRTA+CKAHGGGRRCQ LGCTKSAEGKT+FCIA     
Sbjct: 247  GLCIGHGGGQRCQKPGCNKGAESRTAYCKAHGGGRRCQQLGCTKSAEGKTNFCIAHGGGR 306

Query: 1157 XXXXXXC-TKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTK 981
                    TKAARGKSGLCI+HGGGKRCK+EGC+RSAEGQAGLCISHGGGRRCQ+ GCTK
Sbjct: 307  RCGHPAGCTKAARGKSGLCIKHGGGKRCKIEGCTRSAEGQAGLCISHGGGRRCQYQGCTK 366

Query: 980  GAQGSTMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTD 801
            GAQGSTM CKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRCL+DGGGICPKSVHGGT+
Sbjct: 367  GAQGSTMFCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGTN 426

Query: 800  YCVAHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKR 621
            +CVAHGGGKRC+VPGCTKSARGRTDCCV+HGGGKRC++ENCGKSAQGSTDFCKAHGGGKR
Sbjct: 427  FCVAHGGGKRCSVPGCTKSARGRTDCCVKHGGGKRCKFENCGKSAQGSTDFCKAHGGGKR 486

Query: 620  CAWGQGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLV--SESTVGSSSDN 447
            C+WG+GKC+KFARG+SGLCAAH ++ + +   ++GGMI   LF GLV  + ST GSS DN
Sbjct: 487  CSWGEGKCEKFARGKSGLCAAHSSL-VQERETKKGGMIGPGLFHGLVPTATSTGGSSFDN 545

Query: 446  NFYGGASAVSNCVESSVNTNTR---QLIPPQVLVPLSMKSSST-SRLM------------ 315
            N   G S +S+C+ S    + R   QLIPPQVLVPLSMKSSS+ SRL+            
Sbjct: 546  NSSSGVSVISDCINSLEKASKRRQQQLIPPQVLVPLSMKSSSSYSRLVSAERQEEASHGG 605

Query: 314  -VDXXXXXXXXXXXXXXXXXFALPEGRVHGGILMSLLKGDMKNIDN 180
             +                    +PEGRVHGG LMS+L G++KN  N
Sbjct: 606  GIGGSNSNNTAGGKSFNMMMMMIPEGRVHGGGLMSMLGGNLKNACN 651


>ref|XP_007043029.1| Emb:CAB89363.1 [Theobroma cacao] gi|508706964|gb|EOX98860.1|
            Emb:CAB89363.1 [Theobroma cacao]
          Length = 644

 Score =  761 bits (1966), Expect = 0.0
 Identities = 394/641 (61%), Positives = 469/641 (73%), Gaps = 16/641 (2%)
 Frame = -3

Query: 2048 HISDHRKLENMADTTLTLDYVGYGAIKRFKTPESGNYESNL-AHSSQASDDGCRLVLGLG 1872
            H+S+  K EN  DTTL L+++GYG   + +    G+ +SNL A  S A DDGCRLVLGLG
Sbjct: 11   HVSELSKNENFGDTTLCLNFLGYGGSNKARF---GSTQSNLHADLSNAPDDGCRLVLGLG 67

Query: 1871 PSPSCYADDYYPVGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQ 1692
            P+PS Y ++YY VG  KN  +    +Q L  + DS +LKLGLS G+ E   L E S S +
Sbjct: 68   PTPSVYCNNYYNVGLNKNKSTGAFFTQGLSPEDDS-ILKLGLSGGTKESMSLLECSLSTE 126

Query: 1691 CEYDPMSLQNQWLVNGH--IPIVDEGSTSAKRNSGGYMPALLLAPSLEN-RNVPETRELL 1521
             +   M L NQ   +    IP+VDEGSTSAK+ SGGYMP+LLLAP +++ + + +TREL 
Sbjct: 127  TDTS-MPLSNQVSADSRLSIPVVDEGSTSAKK-SGGYMPSLLLAPRMDSGKGLVQTRELF 184

Query: 1520 EHGTGAGAYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGA 1341
            + G  +  + L  S EPSA TD S   +S   T+  S +  + + K+C+F GC+KGARGA
Sbjct: 185  QFGAKSHCHQLHRSCEPSAQTDFSGDTLSEQTTTMTSLDNRTSNSKKCKFAGCTKGARGA 244

Query: 1340 SGLCIAHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXX 1161
            SGLCI HGGGQRCQK GC KGAESRTA+CKAHGGGRRCQHLGCTKSAEGKT+FCIA    
Sbjct: 245  SGLCIGHGGGQRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG 304

Query: 1160 XXXXXXXC-TKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCT 984
                     TKAARGKSGLCIRHGGGKRCKVEGC+RSAEGQAGLCISHGGGRRCQ   CT
Sbjct: 305  RRCGFPGGCTKAARGKSGLCIRHGGGKRCKVEGCTRSAEGQAGLCISHGGGRRCQFQECT 364

Query: 983  KGAQGSTMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGT 804
            KG+QGSTM+CKAHGGG+RC+FAGCT+GAEGSTPLCKGHGGGKRCLY+GGGICPKSVHGGT
Sbjct: 365  KGSQGSTMYCKAHGGGKRCIFAGCTRGAEGSTPLCKGHGGGKRCLYNGGGICPKSVHGGT 424

Query: 803  DYCVAHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGK 624
            ++CVAHGGGKRC VPGCTKSARGRTDCCVRHGGGKRC++ENCGKSAQGSTDFCKAHGGGK
Sbjct: 425  NFCVAHGGGKRCVVPGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGK 484

Query: 623  RCAWGQGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVSE-STVGSSSD- 450
            RC+WG+GKC+KFARGRSGLCAAH ++   + A + GG+I   +F GLVS  ST GSS D 
Sbjct: 485  RCSWGEGKCEKFARGRSGLCAAHSSMVQEREASK-GGLIAPGVFHGLVSAGSTTGSSVDY 543

Query: 449  NNFYGGASAVSNCVESSVNTNTRQ-LIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXX 273
            N+   G S +S+C++S      RQ LIPPQVLVPLSMKSSS+   ++             
Sbjct: 544  NHSSSGTSVISDCIDSLEKPARRQHLIPPQVLVPLSMKSSSSYSSLLSAEKQVEGRNGYG 603

Query: 272  XXXXXFA--------LPEGRVHGGILMSLLKGDMKNIDNGL 174
                           +PEGRVHGG LMSLL G++KN  +G+
Sbjct: 604  MGIGGGVGNESFNFMIPEGRVHGGGLMSLLGGNLKNPIDGI 644


>ref|XP_008804473.1| PREDICTED: uncharacterized protein LOC103717758 [Phoenix dactylifera]
          Length = 659

 Score =  759 bits (1960), Expect = 0.0
 Identities = 403/670 (60%), Positives = 466/670 (69%), Gaps = 23/670 (3%)
 Frame = -3

Query: 2117 MDLNATDS--SSARKVQLISPYADKHISDHRKLENMADTTLTLDYVGYGAIKRFKTPESG 1944
            MDLN T+        + + S     HIS   + +N+ DTTL LD +G G     +   S 
Sbjct: 1    MDLNMTEVFLHGTEPLMITSVTQTAHISTCGRNDNLGDTTLRLDCLGNGINNTCRRVHS- 59

Query: 1943 NYESNLAHSSQASDDGCRLVLGLGPSPSCYADDYYPVGTVKNNGSVPVLSQWLPSDSDSE 1764
              +SN   SSQ  DDGCRLVLGLGP+P+ Y+ DYYP G  K   S+ +  Q  P   D+ 
Sbjct: 60   --QSNSDLSSQ--DDGCRLVLGLGPTPNSYSADYYPSGINKAKPSMSLFGQCFPGP-DAA 114

Query: 1763 MLKLGLSRGSAE-VPGLQESS-ASLQCEYDPMSLQNQWLVNGHIPIVDEGSTSAKRNSGG 1590
            MLKLGLS G++E +P L +S+  +L  +  P     + L    +PIVDEGSTSAKR SGG
Sbjct: 115  MLKLGLSGGNSETIPALAQSNHETLSPQVSPGPPAKRSLP---VPIVDEGSTSAKRKSGG 171

Query: 1589 YMPALLLAPSLENRNV----PETRELLEHGTGAGAY-------HLQFSPEPSAPTDSSVG 1443
            YMP+LL AP L + N     PET ELL+ G+    +        LQFSPE SA TD S+ 
Sbjct: 172  YMPSLLFAPRLGSLNTKETTPETTELLDLGSDGSTHKHHSSHHQLQFSPESSATTDGSMS 231

Query: 1442 AISGPITS---GASSERWSHHPKRCRFDGCSKGARGASGLCIAHGGGQRCQKTGCTKGAE 1272
            AIS P+ +    A +    HHPK+CRF+GCS+GARG+SGLCI HGGGQRCQK GC KGAE
Sbjct: 232  AISEPMIADVVAADTTSHHHHPKKCRFNGCSRGARGSSGLCIGHGGGQRCQKPGCNKGAE 291

Query: 1271 SRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXXXXXCTKAARGKSGLCIRHG 1092
            SRTA+CKAHGGGRRCQ LGCTKSAEGKTDFCIA           CTKAARGKSGLCIRHG
Sbjct: 292  SRTAYCKAHGGGRRCQRLGCTKSAEGKTDFCIAHGGGRRCGNPGCTKAARGKSGLCIRHG 351

Query: 1091 GGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQGSTMHCKAHGGGRRCVFAGC 912
            GGKRC VEGC+RSAEGQAGLCISHGGGRRC  PGC KGAQGSTM CKAHGGG+RC+F GC
Sbjct: 352  GGKRCTVEGCTRSAEGQAGLCISHGGGRRCHFPGCCKGAQGSTMFCKAHGGGKRCIFEGC 411

Query: 911  TKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVAHGGGKRCAVPGCTKSARGR 732
            TKGAEGSTPLCKGHGGGKRC ++GGG+CPKSVHGGTD+CVAHGGGKRCAVPGCTKSARGR
Sbjct: 412  TKGAEGSTPLCKGHGGGKRCRFEGGGVCPKSVHGGTDFCVAHGGGKRCAVPGCTKSARGR 471

Query: 731  TDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAWGQGKCDKFARGRSGLCAAHG 552
            TDCCV+HGGGKRC++  CGKSAQGSTDFCKAHGGGKRC W QG C+KFARG+SGLCAAHG
Sbjct: 472  TDCCVKHGGGKRCKFLGCGKSAQGSTDFCKAHGGGKRCTWDQG-CEKFARGKSGLCAAHG 530

Query: 551  NIFLGQ---NAGRRGGMIDTSLFRGLVSESTVGSSSDNNFYGGA--SAVSNCVESSVNTN 387
            ++ L Q    AG  G MI   LF G+VS ST   SS +N +  +  S +S+C ES  +  
Sbjct: 531  SLMLSQQEREAGNSGSMIGPGLFHGIVSTSTTVRSSIDNEHPSSVMSTISDCAESQESMR 590

Query: 386  TRQLIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXXXXXXXFALPEGRVHGGILMSLL 207
               LIPPQVLVPLSMKSSS S  + D                 F +PEGRVHGG LMSLL
Sbjct: 591  RMHLIPPQVLVPLSMKSSSPSSGLTD---VDRDGAGSQEKSFGFVIPEGRVHGGGLMSLL 647

Query: 206  KGDMKNIDNG 177
              ++ N   G
Sbjct: 648  GRNLNNAFGG 657


>gb|KHG22844.1| putative WRKY transcription factor 19 -like protein [Gossypium
            arboreum]
          Length = 645

 Score =  748 bits (1931), Expect = 0.0
 Identities = 389/641 (60%), Positives = 460/641 (71%), Gaps = 17/641 (2%)
 Frame = -3

Query: 2045 ISDHRKLENMADTTLTLDYVGYGAIKRFKTPESGNYESNL-AHSSQASDDGCRLVLGLGP 1869
            +S+  K EN  DTTL L+++G+G   +      G+ +S+L    S ASDDGCRLVLGLGP
Sbjct: 12   VSELSKNENFGDTTLRLNFLGHGGSNK---AGFGSTQSDLHIDLSSASDDGCRLVLGLGP 68

Query: 1868 SPSCYADDYYPVGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQC 1689
            +PS Y +DY+ VG  KN  +  + +  L S  D+ +LKLGLS G+     L E S S   
Sbjct: 69   TPSVYCNDYHNVGLNKNKSTAALFTPGL-SPEDNSILKLGLSGGTKGSMNLLERSLSTDT 127

Query: 1688 EYDPMSLQNQWLVNGH---IPIVDEGSTSAKRNSGGYMPALLLAPSLEN-RNVPETRELL 1521
            +   +   NQ+   G    IP VDEGSTSAK+ SGGYMP+LLLAP +++ + + +T EL 
Sbjct: 128  DLS-VHFSNQFSAEGSQLSIPFVDEGSTSAKK-SGGYMPSLLLAPRMDSGKALVQTHELF 185

Query: 1520 EHGTGAGAYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGA 1341
            + G  + ++    S EPS  TD SV  IS   T+  SS+  + + K+C+F GC KGARGA
Sbjct: 186  QFGAKSRSHQFYQSCEPSTQTDFSVDTISEQTTTITSSDNRTSNSKKCKFAGCFKGARGA 245

Query: 1340 SGLCIAHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXX 1161
            +GLCI HGGGQRCQK GC KGAESRT FCKAHGGGRRCQHLGCTKSAEGKTDFCIA    
Sbjct: 246  TGLCIGHGGGQRCQKAGCNKGAESRTVFCKAHGGGRRCQHLGCTKSAEGKTDFCIAHGGG 305

Query: 1160 XXXXXXXC-TKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCT 984
                     TKAARGKSGLCIRHGGGKRCKVEGC+RSAEGQAGLCISHGGGRRCQ P CT
Sbjct: 306  RRCGFSGGCTKAARGKSGLCIRHGGGKRCKVEGCTRSAEGQAGLCISHGGGRRCQFPACT 365

Query: 983  KGAQGSTMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGT 804
            KGAQGSTM CKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRCLY+GGGICPKSVHGGT
Sbjct: 366  KGAQGSTMFCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLYNGGGICPKSVHGGT 425

Query: 803  DYCVAHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGK 624
            ++CVAHGGGKRC VPGCTKSARGRTDCCVRHGGGKRC++ENCGKSAQGSTD CKAHGGGK
Sbjct: 426  NFCVAHGGGKRCVVPGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDLCKAHGGGK 485

Query: 623  RCAWGQGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVS-ESTVGSSSDN 447
            RC+WG+GKC+KFARGRSGLCAAH ++ L +    +GG+I   +F GLVS  ST GSSS+N
Sbjct: 486  RCSWGEGKCEKFARGRSGLCAAHSSM-LQERQASKGGLIAPGVFHGLVSASSTTGSSSNN 544

Query: 446  NFYG-GASAVSNCVESSVNTNTRQ-LIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXX 273
            N    G S +S+C++S      RQ LIPPQVLVP SMKSS++    +             
Sbjct: 545  NHSSSGNSVISDCIDSPDKPVKRQQLIPPQVLVPPSMKSSASYSSFLSSEQQDEGINRHG 604

Query: 272  XXXXXFA--------LPEGRVHGGILMSLLKGDMKNIDNGL 174
                           +PEGRVHGG LMSLL G++KN  +G+
Sbjct: 605  NHNAGGVGNTSFDFLIPEGRVHGGGLMSLLGGNLKNPIDGI 645


>ref|XP_012462950.1| PREDICTED: uncharacterized protein LOC105782634 [Gossypium raimondii]
            gi|823260459|ref|XP_012462951.1| PREDICTED:
            uncharacterized protein LOC105782634 [Gossypium
            raimondii] gi|823260461|ref|XP_012462952.1| PREDICTED:
            uncharacterized protein LOC105782634 [Gossypium
            raimondii] gi|763812857|gb|KJB79709.1| hypothetical
            protein B456_013G063300 [Gossypium raimondii]
            gi|763812858|gb|KJB79710.1| hypothetical protein
            B456_013G063300 [Gossypium raimondii]
          Length = 645

 Score =  748 bits (1930), Expect = 0.0
 Identities = 384/637 (60%), Positives = 455/637 (71%), Gaps = 17/637 (2%)
 Frame = -3

Query: 2048 HISDHRKLENMADTTLTLDYVGYGAIKRFKTPESGNYESNL-AHSSQASDDGCRLVLGLG 1872
            H+S+  K EN  DTTL L+++G+G   +      G+ +S+L    S A DDGCRLVLGLG
Sbjct: 11   HVSELSKNENFGDTTLRLNFLGHGGSNK---AGFGSTQSDLHVDLSSAPDDGCRLVLGLG 67

Query: 1871 PSPSCYADDYYPVGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQ 1692
            P+PS Y +DY+ VG  KN  +  + +  L S  D+ +LKLGLS G+     L E S S +
Sbjct: 68   PTPSVYCNDYHNVGLNKNKSTAALFTPGL-SPEDNSILKLGLSGGTKGSMNLLERSLSTE 126

Query: 1691 CEYDPMSLQNQWLVNGH---IPIVDEGSTSAKRNSGGYMPALLLAPSLENRNVP-ETREL 1524
             +   +   NQ+   G    IP VDEGSTSAK+ SGGYMP+LLLAP +++     +T EL
Sbjct: 127  TDVS-VHFSNQFSAEGSQLSIPFVDEGSTSAKK-SGGYMPSLLLAPRMDSGKASVQTHEL 184

Query: 1523 LEHGTGAGAYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARG 1344
             + G  + ++    S E S  TD SV  IS   T+  SS+  + + K+C+F GC KGARG
Sbjct: 185  FQFGAKSHSHQFHLSCEHSTQTDFSVDTISEQTTTITSSDYRTSNSKKCKFAGCFKGARG 244

Query: 1343 ASGLCIAHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXX 1164
            A+GLCI HGGGQRCQK GC KGAESRT FCKAHGGGRRCQHLGCTKSAEGKTDFCIA   
Sbjct: 245  ATGLCIGHGGGQRCQKAGCNKGAESRTVFCKAHGGGRRCQHLGCTKSAEGKTDFCIAHGG 304

Query: 1163 XXXXXXXXC-TKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGC 987
                      TKAARGKSGLCIRHGGGKRCKVEGC+RSAEGQAGLCISHGGGRRCQ P C
Sbjct: 305  GRRCGFSGGCTKAARGKSGLCIRHGGGKRCKVEGCTRSAEGQAGLCISHGGGRRCQFPAC 364

Query: 986  TKGAQGSTMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGG 807
            TKGAQGSTM CKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRCLY+GGGICPKSVHGG
Sbjct: 365  TKGAQGSTMFCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLYNGGGICPKSVHGG 424

Query: 806  TDYCVAHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGG 627
            T++CVAHGGGKRC VPGCTKSARGRTDCCVRHGGGKRC++ENCGKSAQGSTD CKAHGGG
Sbjct: 425  TNFCVAHGGGKRCVVPGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDLCKAHGGG 484

Query: 626  KRCAWGQGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVSESTVGSSSDN 447
            KRC+WG+GKC+KFARGRSGLCAAH ++ L +    +GG+I   +F GLVS ++   SS N
Sbjct: 485  KRCSWGEGKCEKFARGRSGLCAAHSSM-LQERQASKGGLIAPGVFHGLVSATSTTGSSSN 543

Query: 446  NFYG--GASAVSNCVESSVNTNTRQ-LIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXX 276
            N++   G S +S+C++S      RQ LIPPQVLVPLSMKSS++    +            
Sbjct: 544  NYHSSSGNSVISDCIDSPDKPVERQQLIPPQVLVPLSMKSSASYSSFLSSEQQDEGINRH 603

Query: 275  XXXXXXFA--------LPEGRVHGGILMSLLKGDMKN 189
                            +PEGRVHGG LMSLL G++KN
Sbjct: 604  GNHIAGGVGNTSFDFLIPEGRVHGGGLMSLLGGNLKN 640


>ref|XP_010916986.1| PREDICTED: uncharacterized protein LOC105041692 [Elaeis guineensis]
          Length = 649

 Score =  742 bits (1915), Expect = 0.0
 Identities = 401/667 (60%), Positives = 454/667 (68%), Gaps = 20/667 (2%)
 Frame = -3

Query: 2117 MDLNATDSSSARKVQLISPYADKHI--SDHRKLENMADTTLTLDYVGYGAIKRFKT--PE 1950
            MD    +  SA K  L+S    +    S H      +DT L L+ +GY   K  KT  P 
Sbjct: 1    MDFTMANLLSAGKRPLMSSTVTESARNSMHGDTNGFSDTALHLNCLGYDTSKTSKTVYPR 60

Query: 1949 SGNYESNLAHSSQASDDGCRLVLGLGPSPSCYADDYYPVGTVKNNGSVPVLSQWLPSDSD 1770
            + N  S       A DDGCRLVLGLGP+P+    DY   G      S    SQ   S+SD
Sbjct: 61   NNNDLS-------AQDDGCRLVLGLGPTPTTCFSDYSTAGVSNTKESATFTSQSGGSESD 113

Query: 1769 SEMLKLGLSRGSAEVPGLQESSASLQCEYDPMSLQNQWLVNGHIPIVDEGSTSAKRNSGG 1590
            S ML+LGLSRGS E   + +   +     +      + L+   IP+VDE STSAKRNSGG
Sbjct: 114  SAMLELGLSRGSVEPMAMVDGCPNFSHHQNQSPPIEKRLL---IPVVDENSTSAKRNSGG 170

Query: 1589 YMPALLLAPSLEN----RNVPETRELLEHGTGAGAYH--------LQFSPEPSAPTDSSV 1446
            +MPALL AP LEN    +  PET  LL+ G+G  A H        LQ +PEP A TDSS+
Sbjct: 171  HMPALLFAPMLENADCTKGSPETCNLLDLGSGINANHHHHLLQHELQVTPEPIAATDSSM 230

Query: 1445 GAISGPITSGASSERWSHHPKRCRFDGCSKGARGASGLCIAHGGGQRCQKTGCTKGAESR 1266
               SG   SG   +R  H PK+C F+GCSKGARGAS LCIAHGGGQRCQK GC KGAESR
Sbjct: 231  DITSGCPPSG---QRTRHQPKKCMFNGCSKGARGASRLCIAHGGGQRCQKPGCNKGAESR 287

Query: 1265 TAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXXXXXCTKAARGKSGLCIRHGGG 1086
            TAFCKAHGGGRRCQ LGCTKSAEGKT+FCIA           CTKAARGKSGLCIRHGGG
Sbjct: 288  TAFCKAHGGGRRCQMLGCTKSAEGKTEFCIAHGGGRRCGHPWCTKAARGKSGLCIRHGGG 347

Query: 1085 KRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQGSTMHCKAHGGGRRCVFAGCTK 906
            KRC VEGC+RSAEGQAG CISHGGGRRCQ PGC KGAQGST +CKAHGGG+RC+F GCTK
Sbjct: 348  KRCTVEGCTRSAEGQAGRCISHGGGRRCQCPGCGKGAQGSTRYCKAHGGGKRCIFEGCTK 407

Query: 905  GAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVAHGGGKRCAVPGCTKSARGRTD 726
            GAEGSTPLCKGHGGGKRCL++GGG+CPKSVHGGT++CVAHGGGKRCA+PGCTKSARGRTD
Sbjct: 408  GAEGSTPLCKGHGGGKRCLFEGGGVCPKSVHGGTNFCVAHGGGKRCAMPGCTKSARGRTD 467

Query: 725  CCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAWGQGKCDKFARGRSGLCAAHGNI 546
             CVRHGGGKRC +E CGKSAQGSTDFCKAHGGGKRC+WG G CDKFARGRSGLCAAHG++
Sbjct: 468  HCVRHGGGKRCHFEGCGKSAQGSTDFCKAHGGGKRCSWGSG-CDKFARGRSGLCAAHGSM 526

Query: 545  FLGQ--NAGRRGGMIDTSLFRGLVSESTVG--SSSDNNFYGGASAVSNCVESSVNTNTRQ 378
               Q   AGR G MI   LF+G+VS S +G  S   +    GASAVS+C ES VN   + 
Sbjct: 527  KARQESEAGRSGSMIGPGLFQGIVSVSAMGGISMDKDCSTSGASAVSDCTESPVNGQRQL 586

Query: 377  LIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXXXXXXXFALPEGRVHGGILMSLLKGD 198
            LIPPQVLVP+S+KS S+S   V                  F +PEGRVHGG LMSLL G 
Sbjct: 587  LIPPQVLVPVSVKSPSSSSASV--------GVGSLAKTSGFMIPEGRVHGGNLMSLLGGS 638

Query: 197  MKNIDNG 177
             K   +G
Sbjct: 639  FKTAVDG 645


>ref|XP_002513710.1| conserved hypothetical protein [Ricinus communis]
            gi|223547161|gb|EEF48657.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 646

 Score =  739 bits (1909), Expect = 0.0
 Identities = 393/650 (60%), Positives = 458/650 (70%), Gaps = 25/650 (3%)
 Frame = -3

Query: 2048 HISDHRKLENMADTTLTLDYVGYGA--IKRFKTPESGNYESNL-AHSSQASDDGCRLVLG 1878
            H S+  K +N  DTTL L+ + YG   +  F+  +S     NL    +   DDGC+LVLG
Sbjct: 12   HKSELPKSDNFGDTTLRLNCLSYGGTNMNGFECTQS-----NLKVDFTNGPDDGCKLVLG 66

Query: 1877 LGPSPSCYADDYYPVGTVKNNGSVP--VLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESS 1704
            LGP+P+ Y DDYY +   K  GS    VL + L SD DS +L+LGLS G+      +E+ 
Sbjct: 67   LGPTPTAYCDDYYSMRFNKTKGSTAAAVLHRGLSSDGDS-ILQLGLSGGT------KEAL 119

Query: 1703 ASLQCEYDPMSLQNQWL--VNGH-----IPIVDEGSTSAKRNSGGYMPALLLAPSLENRN 1545
            + L+C +    +    L   +GH     IP+VDEGSTSAK+ SGGYMP+LLLAP ++   
Sbjct: 120  SELECSFLETDISTPILNQFSGHEDRFLIPVVDEGSTSAKK-SGGYMPSLLLAPRMDGAK 178

Query: 1544 VP-ETRELLEHGTGAGAYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFD 1368
            V  E  E L+ G      H Q     SA TD S+G IS   T+  S +R   +PK+C+F 
Sbjct: 179  VSLEGEEFLQFGAAKSQSH-QLIHGTSASTDISMGTISEQATTATSVDRKISNPKKCKFF 237

Query: 1367 GCSKGARGASGLCIAHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKT 1188
            GCSKGARGA GLCI HGGGQRCQK GC KGAESRTA+CKAHGGGRRCQHLGCTKSAEGKT
Sbjct: 238  GCSKGARGALGLCIGHGGGQRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKT 297

Query: 1187 DFCIAXXXXXXXXXXXC-TKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGG 1011
            DFCIA             TKAARGKSGLCI+HGGGKRCKV+GCSRSAEGQAGLCISHGGG
Sbjct: 298  DFCIAHGGGRRCGFGGGCTKAARGKSGLCIKHGGGKRCKVDGCSRSAEGQAGLCISHGGG 357

Query: 1010 RRCQHPGCTKGAQGSTMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGI 831
            RRCQ+ GCTKGAQGSTMHCKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRCLYDGGGI
Sbjct: 358  RRCQYEGCTKGAQGSTMHCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGI 417

Query: 830  CPKSVHGGTDYCVAHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTD 651
            CPKSVHGGT++CVAHGGGKRC VPGCTKSARGRTDCCV+HGGGKRC++ENCGKSAQGSTD
Sbjct: 418  CPKSVHGGTNFCVAHGGGKRCVVPGCTKSARGRTDCCVKHGGGKRCKFENCGKSAQGSTD 477

Query: 650  FCKAHGGGKRCAWGQGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVSE- 474
            FCKAHGGGKRC WG+GKC+KFARGRSGLCAAH ++ L Q + + G +I   LF+GLVS  
Sbjct: 478  FCKAHGGGKRCTWGEGKCEKFARGRSGLCAAHSSMVLEQGSNK-GSLIGPGLFQGLVSAA 536

Query: 473  STVGSSSDNNFYG-GASAVSNCVESSVNTNTRQ-LIPPQVLVPLSMKSSSTSRLMVDXXX 300
            S  GSS DNN+   G SAVS+C +S      RQ LIP QVLVP SMKSSS+    ++   
Sbjct: 537  SNAGSSIDNNYSSSGISAVSDCTDSLGKPTKRQHLIPAQVLVPPSMKSSSSYSSFLNAEK 596

Query: 299  XXXXXXXXXXXXXXFA--------LPEGRVHGGILMSLLKGDMKNIDNGL 174
                           +         PEGRVHGG LMSL  G++KN  +G+
Sbjct: 597  QEEGRNEYSAGAGSTSRVTSFDYMAPEGRVHGGGLMSLFGGNLKNAIDGI 646


>ref|XP_008784519.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103703444
            [Phoenix dactylifera]
          Length = 658

 Score =  736 bits (1899), Expect = 0.0
 Identities = 391/670 (58%), Positives = 453/670 (67%), Gaps = 23/670 (3%)
 Frame = -3

Query: 2117 MDLNATDSSSARK--VQLISPYADKHISDHRKLENMADTTLTLDYVGYGAIKRFKTPESG 1944
            MDLN T+     K   +  S     HIS   K +N  DT L LD +G G     +T    
Sbjct: 1    MDLNMTEVFLHGKEPCRSTSVIQTAHISTCGKKDNSGDTVLRLDCLGRGINNTSRT---- 56

Query: 1943 NYESNLAHSSQASDDGCRLVLGLGPSPSCYADDYYPVGTVKNNGSVPVLSQWLPSDSDSE 1764
               + +     + DDGCRLVLGLGP+P+    DY+P G  K   S+ + S    S  D+ 
Sbjct: 57   -VHTQIGCDISSQDDGCRLVLGLGPTPNTSCADYHPSGINKTEPSMSLRS----SGPDAA 111

Query: 1763 MLKLGLSRGSAE-VPGLQESSASLQCEYDPMSLQNQWLVNGHIPIVDEGSTSAKRNSGGY 1587
            MLKLGLSRG++E +  L +++   Q      SL         +P  DEGSTSAKR SGGY
Sbjct: 112  MLKLGLSRGNSETILALAQNNHETQSPPAERSLP--------VPFADEGSTSAKRKSGGY 163

Query: 1586 MPALLLAPSLENRNV----PETRELLEHGTGAGAY-------HLQFSPEPSAPTDSSVGA 1440
            +P+LL AP L   +     PETR+LL+ G+G   +        LQ SPEPS  TD S+GA
Sbjct: 164  VPSLLFAPRLGRLDTMKIAPETRDLLDQGSGWSTHKHHIPHHQLQLSPEPSVTTDGSLGA 223

Query: 1439 ISGPITSGASSERWS---HHPKRCRFDGCSKGARGASGLCIAHGGGQRCQKTGCTKGAES 1269
            +S P+ + A++       HHPK+CRF GCSKGARGASGLCIAHGGGQRCQK GC KGAES
Sbjct: 224  LSEPMIADAATADTRIHRHHPKKCRFTGCSKGARGASGLCIAHGGGQRCQKPGCNKGAES 283

Query: 1268 RTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXXXXXCTKAARGKSGLCIRHGG 1089
            RTA+CKAHGGGRRCQ LGCTKSAEGKTDFCIA           CTKAARGKSGLCIRHGG
Sbjct: 284  RTAYCKAHGGGRRCQQLGCTKSAEGKTDFCIAHGGGRRCGHPGCTKAARGKSGLCIRHGG 343

Query: 1088 GKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQGSTMHCKAHGGGRRCVFAGCT 909
            GKRC +EGCSRSAEGQAGLCISHGGGRRCQ+PGC KGAQGST  CKAHGGG+RC+F GCT
Sbjct: 344  GKRCTIEGCSRSAEGQAGLCISHGGGRRCQYPGCGKGAQGSTKFCKAHGGGKRCIFEGCT 403

Query: 908  KGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVAHGGGKRCAVPGCTKSARGRT 729
            KGAEGSTPLCKGHGGGKRCL++GGGICPKSVHGGT++CVAHGGGKRCAVPGCTKSARGRT
Sbjct: 404  KGAEGSTPLCKGHGGGKRCLFEGGGICPKSVHGGTNFCVAHGGGKRCAVPGCTKSARGRT 463

Query: 728  DCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKR-----CAWGQGKCDKFARGRSGLC 564
            DCCVRHGGGKRC +E CGKSAQGSTD CKAHGGGKR              KFARGRS LC
Sbjct: 464  DCCVRHGGGKRCMFEGCGKSAQGSTDLCKAHGGGKRXXXXXXXXXXXXXXKFARGRSSLC 523

Query: 563  AAHGNIFLGQNAGRRGGMIDTSLFRGLVSES-TVGSSSDNNFYGGASAVSNCVESSVNTN 387
            AAHG++ L Q     G MI   LF+G+VS S  V S  D +   G S++S+C+ES  +  
Sbjct: 524  AAHGSLMLLQQERESGSMIGRGLFQGIVSTSAAVKSDIDEHSSSGMSSISDCIESQESMG 583

Query: 386  TRQLIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXXXXXXXFALPEGRVHGGILMSLL 207
             RQLIPPQVLVPLS+ S S S  ++D                  A+PEGRVHGG LM+LL
Sbjct: 584  RRQLIPPQVLVPLSVNSPSPSSGLMD---AGREGVGSHEKGFGLAVPEGRVHGGGLMTLL 640

Query: 206  KGDMKNIDNG 177
             GD+KN  +G
Sbjct: 641  GGDLKNAFDG 650


>ref|XP_011004973.1| PREDICTED: uncharacterized protein LOC105111348 [Populus euphratica]
            gi|743921810|ref|XP_011004974.1| PREDICTED:
            uncharacterized protein LOC105111348 [Populus euphratica]
          Length = 642

 Score =  732 bits (1889), Expect = 0.0
 Identities = 385/631 (61%), Positives = 449/631 (71%), Gaps = 12/631 (1%)
 Frame = -3

Query: 2030 KLENMADTTLTLDYVGYGAIKRFKTPESGNYESNLAHSSQASDDGCRLVLGLGPSPSCYA 1851
            K +   DT L+L+ +GYG      T   G   +     S  SDDGC+LVLGLGP+PS Y 
Sbjct: 18   KNDCFGDTALSLNCLGYGGGS--STSAEGAQNNLKVDFSNGSDDGCKLVLGLGPTPSAYF 75

Query: 1850 DDYYPVGTVKNNG--SVPVLSQWLPSDSDSEMLKLGLSRGSAE-VPGLQESSASLQCEYD 1680
            DD Y +G  K  G  S  +  + L S+SDS +LKLGLSRG  E + GL  S +       
Sbjct: 76   DDCYCLGVNKKKGLDSAAIFPKGLLSESDS-ILKLGLSRGDKEALSGLDYSISETDTNTP 134

Query: 1679 PMSLQNQWLVNGHIPIVDEGSTSAKRNSGGYMPALLLAPSLENRNVPETRELLEHGTGAG 1500
             ++  +   +   IP+VDEGSTSAK+ SGGYM +LLLAP ++ R  P   ELL  GT + 
Sbjct: 135  MLNQISDDEIRSLIPVVDEGSTSAKK-SGGYMTSLLLAPRMDARKAPSQTELLNFGTRSN 193

Query: 1499 AYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGASGLCIAH 1320
             +  Q S E SA TD S+G +S    S  SS+  + +PK+C+F GCSKGARGASGLCI H
Sbjct: 194  -HQFQLSNELSANTDFSMGIMSEQAISTTSSDHRTSNPKKCKFLGCSKGARGASGLCIGH 252

Query: 1319 GGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXXXXX 1140
            GGGQRCQK GC+KGAESRTA+CK HGGGRRCQHLGCTKSAEGKTD CIA           
Sbjct: 253  GGGQRCQKPGCSKGAESRTAYCKVHGGGRRCQHLGCTKSAEGKTDLCIAHGGGRRCGFPG 312

Query: 1139 C-TKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQGST 963
              TKAARGKSGLCIRHGGGKRCKVE C+RSAEGQAGLCISHGGGRRC+H GCTKGAQGST
Sbjct: 313  GCTKAARGKSGLCIRHGGGKRCKVEDCTRSAEGQAGLCISHGGGRRCEHQGCTKGAQGST 372

Query: 962  MHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVAHG 783
             +CKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRC++DGGGICPKSVHGGT++CVAHG
Sbjct: 373  GYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCMFDGGGICPKSVHGGTNFCVAHG 432

Query: 782  GGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAWGQG 603
            GGKRC VPGCTKSARGRTDCCVRHGGGKRC+ ENCGKSAQGSTDFCKAHGGGKRC WG+G
Sbjct: 433  GGKRCVVPGCTKSARGRTDCCVRHGGGKRCKVENCGKSAQGSTDFCKAHGGGKRCTWGEG 492

Query: 602  KCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVS-ESTVGSSSDNNF-YGGA 429
            KC+KFARG+SGLCAAH ++   + A R  G+I   LF GLVS  ST G + D+N  Y G 
Sbjct: 493  KCEKFARGKSGLCAAHSSMAQERQANRT-GLIRPGLFHGLVSAASTAGCTIDSNHSYSGV 551

Query: 428  SAVSNCVESSVNTNTR-QLIPPQVLVPLSMK--SSSTSRLMVD---XXXXXXXXXXXXXX 267
            SAVS+C +S      R  LIPPQVLVP SMK  SS TS ++ D                 
Sbjct: 552  SAVSDCSDSLEKPAKRLHLIPPQVLVPHSMKATSSFTSFMIADKLEEGTNGYGATSGGKK 611

Query: 266  XXXFALPEGRVHGGILMSLLKGDMKNIDNGL 174
               + +PEGRVHGG LMSL  G+++N  +G+
Sbjct: 612  NFDYLVPEGRVHGGGLMSLFGGNLRNAIDGV 642


>ref|XP_012079959.1| PREDICTED: uncharacterized protein LOC105640258 [Jatropha curcas]
            gi|643741545|gb|KDP46973.1| hypothetical protein
            JCGZ_02409 [Jatropha curcas]
          Length = 647

 Score =  731 bits (1887), Expect = 0.0
 Identities = 388/636 (61%), Positives = 445/636 (69%), Gaps = 19/636 (2%)
 Frame = -3

Query: 2024 ENMADTTLTLDYVGYGAIKRFKTPESGNYESNLAHSSQASDDGCRLVLGLGPSPSCYADD 1845
            EN  DTTL L+ +G G  +   T       S     S   DD C+LVLGLGP+P+ Y DD
Sbjct: 20   ENFGDTTLGLNCLGNG--RSNMTGFENTQISIKVDLSNNLDDSCKLVLGLGPTPTAYCDD 77

Query: 1844 YYPVGTVKNNGSVPVLS--QWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQCEYDPMS 1671
            Y  +   KN GS P  +  Q L S +DS  LKLGLS G+ E   L     SL        
Sbjct: 78   YCSMRFSKNKGSTPAATSPQGLSSIADST-LKLGLSGGTKEA--LSNLECSLLETDSNTP 134

Query: 1670 LQNQWLVNGH----IPIVDEGSTSAKRNSGGYMPALLLAPSLENRNVPE-TRELLEHGTG 1506
            L NQ +   H    IP+VDEGSTSAK+ SGGYMP+LLLAP ++   V     E L+ G  
Sbjct: 135  LLNQ-VAGDHNRFSIPVVDEGSTSAKK-SGGYMPSLLLAPRMDAGKVSSHAEEFLQFGAK 192

Query: 1505 AGAYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGASGLCI 1326
            +  + LQ + EPSA TD S   +S   TS  S +R + +PK+C+F GCSKGARGASGLCI
Sbjct: 193  SRCHQLQLNHEPSATTDFSAATVSENETSATSIDRKTSNPKKCKFFGCSKGARGASGLCI 252

Query: 1325 AHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXXX 1146
             HGGGQRCQK GC KG+ESRTA+CKAHGGGRRCQHLGCTKSAEGKTDFCIA         
Sbjct: 253  GHGGGQRCQKAGCNKGSESRTAYCKAHGGGRRCQHLGCTKSAEGKTDFCIAHGGGRRCGF 312

Query: 1145 XXC-TKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQG 969
                TKAARGKSGLCIRHGGGKRCKVEGC+RSAEGQAGLCISHGGGRRCQ+ GCTKGAQG
Sbjct: 313  AGGCTKAARGKSGLCIRHGGGKRCKVEGCTRSAEGQAGLCISHGGGRRCQYQGCTKGAQG 372

Query: 968  STMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVA 789
            STM CKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGT++CVA
Sbjct: 373  STMFCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTNFCVA 432

Query: 788  HGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAWG 609
            HGGGKRC VPGCTKSARGRTDCCVRHGGGKRC++ENCGKSAQGSTDFCKAHGGGKRC WG
Sbjct: 433  HGGGKRCVVPGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGKRCNWG 492

Query: 608  QGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVSE-STVGSSSDNNFYG- 435
            +GKC+KFARG+SGLCAAH ++ + +   ++G +I   LF GLVS  S  GSS +NN+   
Sbjct: 493  EGKCEKFARGKSGLCAAHSSM-VQERGSKKGNLIGPGLFHGLVSAVSNAGSSINNNYSSS 551

Query: 434  GASAVSNCVESSVNTNTRQ-LIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXXXXXXX 258
            G SAVS+C +S      RQ LIP QVLVP SMKSSS+    +                  
Sbjct: 552  GISAVSDCTDSLEKPAKRQHLIPAQVLVPSSMKSSSSYSSFLSAEKQEEDRNGYGTGIAS 611

Query: 257  FA--------LPEGRVHGGILMSLLKGDMKNIDNGL 174
                      +PEGRVHGG LMSL  G++KN  +G+
Sbjct: 612  IGRITNFDYMVPEGRVHGGGLMSLFAGNLKNAIDGV 647


>ref|XP_011092401.1| PREDICTED: uncharacterized protein LOC105172591 isoform X1 [Sesamum
            indicum] gi|747089522|ref|XP_011092403.1| PREDICTED:
            uncharacterized protein LOC105172591 isoform X1 [Sesamum
            indicum] gi|747089524|ref|XP_011092404.1| PREDICTED:
            uncharacterized protein LOC105172591 isoform X1 [Sesamum
            indicum] gi|747089526|ref|XP_011092405.1| PREDICTED:
            uncharacterized protein LOC105172591 isoform X1 [Sesamum
            indicum]
          Length = 636

 Score =  731 bits (1886), Expect = 0.0
 Identities = 375/631 (59%), Positives = 446/631 (70%), Gaps = 9/631 (1%)
 Frame = -3

Query: 2039 DHRKLENMADTTLTLDYVGYGAIKRFKTPESGNYESNLAHSSQASDDGCRLVLGLGPSPS 1860
            D  K  N+ DTTL LD  GYG  +     +S          + A DDGC+LVLGLGP+PS
Sbjct: 9    DFTKNGNVGDTTLRLDGFGYGTRETAPYRDSQTNIGGRGICTSAPDDGCKLVLGLGPTPS 68

Query: 1859 CYADDYYPVGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQCEYD 1680
             Y+ +Y P       G   +L+Q L S  DS  LKLGLS G+     + E SAS+   ++
Sbjct: 69   AYSGEYSPSRGSGTKGMTSILNQGLSSQHDST-LKLGLSGGTDAFSNVLEFSASMHSSFN 127

Query: 1679 PMSLQNQWLVNGH---IPIVDEGSTSAKRNSGGYMPALLLAPSLENRNVPETRELLEHGT 1509
                 +Q   +G+   IP+VDEGSTSAK+ SGGYM  L+LAP +EN  +    +  E G 
Sbjct: 128  APRQPDQVSSDGNRIAIPVVDEGSTSAKK-SGGYMTRLILAPRMENAEIVLANKSFELGA 186

Query: 1508 GAGAYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGASGLC 1329
             +     Q S EPSA +  S+ AIS P ++  SS+    + K+C+F GC+KGARGA+GLC
Sbjct: 187  KSHCQISQLSSEPSAVSSYSMSAISEPGSAAVSSDHKGSNTKKCKFAGCTKGARGATGLC 246

Query: 1328 IAHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXX 1149
            I HGGG+RCQK GC KGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIA        
Sbjct: 247  IGHGGGERCQKPGCNKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAHGGGRRCG 306

Query: 1148 XXXC-TKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQ 972
                 TKAARGKSGLCIRHGGGKRCKVEGC+RSAEGQ GLCISHGGGRRCQ  GCTKGAQ
Sbjct: 307  HPGGCTKAARGKSGLCIRHGGGKRCKVEGCTRSAEGQVGLCISHGGGRRCQFLGCTKGAQ 366

Query: 971  GSTMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCV 792
            GSTM CKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRC++DGGGICPKSVHGGT++CV
Sbjct: 367  GSTMFCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCMFDGGGICPKSVHGGTNFCV 426

Query: 791  AHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAW 612
            AHGGGKRC+VPGCTKSARGRTDCCVRHGGGKRC++ENCGKSAQGSTDFCKAHGGGKRC+W
Sbjct: 427  AHGGGKRCSVPGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGKRCSW 486

Query: 611  GQGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVSESTVGSSSDNNFYGG 432
            G+GKC+KFARG+SGLCAAH ++  G+   ++ GMI   LFRGLV  ++   SS +N    
Sbjct: 487  GEGKCEKFARGKSGLCAAHTSLLQGRET-KKEGMIGPGLFRGLVPATSAARSSFDNTCSS 545

Query: 431  A--SAVSNCVESSVNTNTR-QLIPPQVLVPLSMKSSSTS--RLMVDXXXXXXXXXXXXXX 267
            +  S +S+ ++S      R QLIPPQVLVPLSMK+SS S      +              
Sbjct: 546  SSVSVISDSIDSLEKPAKRQQLIPPQVLVPLSMKASSFSPRPQQAETSTGGDTRSGDGRK 605

Query: 266  XXXFALPEGRVHGGILMSLLKGDMKNIDNGL 174
               F +PEGRVHGG L+SLL G++ N  +G+
Sbjct: 606  SLDFMVPEGRVHGGGLLSLLGGNLNNAIDGI 636


>ref|XP_002304079.2| hypothetical protein POPTR_0003s01750g [Populus trichocarpa]
            gi|550342152|gb|EEE79058.2| hypothetical protein
            POPTR_0003s01750g [Populus trichocarpa]
          Length = 642

 Score =  731 bits (1886), Expect = 0.0
 Identities = 387/631 (61%), Positives = 445/631 (70%), Gaps = 12/631 (1%)
 Frame = -3

Query: 2030 KLENMADTTLTLDYVGYGAIKRFKTPESGNYESNLAHSSQASDDGCRLVLGLGPSPSCYA 1851
            K +   DT L+L+ +GYG      T   G   +     S  SDDGC+LVLGLGP+PS Y 
Sbjct: 18   KNDCFGDTALSLNCLGYGGSS--STNAEGAQNNLKVDFSNGSDDGCKLVLGLGPTPSAYF 75

Query: 1850 DDYYPVGTVKNNG--SVPVLSQWLPSDSDSEMLKLGLSRGSAE-VPGLQESSASLQCEYD 1680
            DD Y +G  K  G  S  +    L S+SDS +LKLGLS G  E + GL  S +       
Sbjct: 76   DDCYCLGVNKKKGLDSAVIFPMGLLSESDS-ILKLGLSGGDKEALSGLDYSISETDTNTP 134

Query: 1679 PMSLQNQWLVNGHIPIVDEGSTSAKRNSGGYMPALLLAPSLENRNVPETRELLEHGTGAG 1500
             ++  +       IP+VDEGSTSAK+ SGGYM +LLLAP ++ R  P   ELL  GT + 
Sbjct: 135  MLNQISDDDSRSLIPVVDEGSTSAKK-SGGYMTSLLLAPRMDVRKAPSQTELLNFGTRSN 193

Query: 1499 AYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGASGLCIAH 1320
             +  Q S E SA TD S+G +S    S  SS+  + +PK+C+F GCSKGARGASGLCI H
Sbjct: 194  -HQFQLSHELSANTDFSMGIMSEQAISTTSSDHRTSNPKKCKFLGCSKGARGASGLCIGH 252

Query: 1319 GGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXXXXX 1140
            GGGQRCQK GC KGAESRTA+CK HGGGRRCQHLGCTKSAEGKTD CIA           
Sbjct: 253  GGGQRCQKPGCNKGAESRTAYCKVHGGGRRCQHLGCTKSAEGKTDLCIAHGGGRRCGFPG 312

Query: 1139 C-TKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQGST 963
              TKAARGKSGLCIRHGGGKRCKVE C+RSAEGQAGLCISHGGGRRC+H GCTKGAQGST
Sbjct: 313  GCTKAARGKSGLCIRHGGGKRCKVEDCTRSAEGQAGLCISHGGGRRCEHQGCTKGAQGST 372

Query: 962  MHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVAHG 783
             +CKAHGGG+RC+FAGCTKGAEGSTPLCKGHGGGKRC++DGGGICPKSVHGGT++CVAHG
Sbjct: 373  GYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCMFDGGGICPKSVHGGTNFCVAHG 432

Query: 782  GGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAWGQG 603
            GGKRC VPGCTKSARGRTDCCVRHGGGKRC+ +NCGKSAQGSTDFCKAHGGGKRC WG+G
Sbjct: 433  GGKRCVVPGCTKSARGRTDCCVRHGGGKRCRVDNCGKSAQGSTDFCKAHGGGKRCTWGEG 492

Query: 602  KCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVS-ESTVGSSSDNNF-YGGA 429
            KC+KFARG+SGLCAAH ++   + A R  G+I   LF GLVS  ST GSS DNN  Y G 
Sbjct: 493  KCEKFARGKSGLCAAHSSMVQEREANRT-GLIRPGLFHGLVSAASTAGSSIDNNHSYSGV 551

Query: 428  SAVSNCVESSVNTNTR-QLIPPQVLVPLSMK--SSSTSRLMVD---XXXXXXXXXXXXXX 267
            SAVS+C +S      R  LIPPQVLVP SMK  SS TS +  D                 
Sbjct: 552  SAVSDCSDSLEKPAKRLHLIPPQVLVPHSMKATSSFTSFMNADNLEEGTNGYGATSGGKK 611

Query: 266  XXXFALPEGRVHGGILMSLLKGDMKNIDNGL 174
               + +PEGRVHGG LMSL  G+++N  NG+
Sbjct: 612  NFDYLVPEGRVHGGGLMSLFGGNLRNAINGV 642


>ref|XP_010103840.1| hypothetical protein L484_024142 [Morus notabilis]
            gi|587909368|gb|EXB97281.1| hypothetical protein
            L484_024142 [Morus notabilis]
          Length = 631

 Score =  729 bits (1882), Expect = 0.0
 Identities = 386/635 (60%), Positives = 455/635 (71%), Gaps = 16/635 (2%)
 Frame = -3

Query: 2030 KLENMADTTLTLDYVGYGAIKRFKTPESGNYESNLAHS-SQASDDGCRLVLGLGPSPSCY 1854
            K +N  DTTL L+  G+G            Y+SN+  S S A DDGCRLVLGLGP+PS Y
Sbjct: 18   KNDNSRDTTLCLNCPGFGG------SHLSRYQSNVGVSFSNAPDDGCRLVLGLGPTPSAY 71

Query: 1853 ADDY--YPVGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSAS-LQCEY 1683
            ++DY  +P+   K   +V  L    PSD DS +L+LGLS GS E   +  SS + +   Y
Sbjct: 72   SNDYHNFPLKRSKELSTVQPLG--FPSDGDS-ILQLGLSGGSKETSTISVSSETDVNTSY 128

Query: 1682 DPMSL---QNQWLVNGHIPIVDEGSTSAKRNSGGYMPALLLAPSLENRNVP-ETRELLEH 1515
             P  +   +NQ L    IP+VDEGSTSAK+ SGGYMP+LLLAP ++  N+  E+R  LE 
Sbjct: 129  IPGQVSPRENQHL----IPVVDEGSTSAKK-SGGYMPSLLLAPKMDGFNISFESRGPLER 183

Query: 1514 GTGAGAYHLQFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGASG 1335
             + +     Q S EPS   + SV  +S   T+G +S+  + +PKRC F GC+KGARGASG
Sbjct: 184  QSKS-----QLS-EPSLSVEYSVDTMSEQETAGTNSDLRTSNPKRCNFLGCTKGARGASG 237

Query: 1334 LCIAHGGGQRCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXX 1155
            LCI HGGGQRCQK GC KGAESRTA+CKAHGGG+RC HLGCTKSAEGKTD+CIA      
Sbjct: 238  LCIGHGGGQRCQKPGCNKGAESRTAYCKAHGGGKRCLHLGCTKSAEGKTDYCIAHGGGRR 297

Query: 1154 XXXXXCTKAARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGA 975
                 CTKAARG+SGLCIRHGGGKRCK+EGC+RSAEGQAGLCISHGGGRRCQ  GC+KGA
Sbjct: 298  CGHPRCTKAARGRSGLCIRHGGGKRCKIEGCARSAEGQAGLCISHGGGRRCQFQGCSKGA 357

Query: 974  QGSTMHCKAHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYC 795
            QGSTM CKAHGGG+RC+F GCTKGAEGSTPLCKGHGGGKRCL+DGGGICPKSVHGGT++C
Sbjct: 358  QGSTMFCKAHGGGKRCIFQGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGTNFC 417

Query: 794  VAHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCA 615
            VAHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRC+YENCGKSAQGSTDFCKAHGGGKRC 
Sbjct: 418  VAHGGGKRCAVPGCTKSARGRTDCCVRHGGGKRCKYENCGKSAQGSTDFCKAHGGGKRCN 477

Query: 614  WGQGKCDKFARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLVSE-STVGSSSDNNFY 438
            WG+GKC+KFARG+SGLCAAH ++   +   + GG+I   LF GLVS  ST   SS+N+  
Sbjct: 478  WGEGKCEKFARGKSGLCAAHSSMAQERELSK-GGLIGPRLFHGLVSAASTTAGSSNNHST 536

Query: 437  GGASAVSNCVES-SVNTNTRQLIPPQVLVPLSMKSSSTSRLMVDXXXXXXXXXXXXXXXX 261
             G S VS+C+ S       RQLIPPQVLVPLSMKSSS+   +++                
Sbjct: 537  SGISVVSDCINSLEKRAKRRQLIPPQVLVPLSMKSSSSYSNILNAEKAEGNRFDIGVGSS 596

Query: 260  XFA------LPEGRVHGGILMSLLKGDMKNIDNGL 174
                     +PEGRVHGG LMSL  G++ N  +G+
Sbjct: 597  SGRKSFDFDIPEGRVHGGPLMSLFGGNLNNAIDGI 631


>ref|XP_009586702.1| PREDICTED: uncharacterized protein LOC104084522 isoform X2 [Nicotiana
            tomentosiformis]
          Length = 634

 Score =  729 bits (1881), Expect = 0.0
 Identities = 373/626 (59%), Positives = 452/626 (72%), Gaps = 17/626 (2%)
 Frame = -3

Query: 2015 ADTTLTLDYVGYGAIKRFKTPESGNYESNLAHSSQASDDGCRLVLGLGPSPSCYADDYYP 1836
            +DTTL LD  GYG  +  +   SG    +      A DDG +LVLGLGP+P+ ++DDYYP
Sbjct: 12   SDTTLRLDCFGYGGNEFIRFGGSGTSSRSHVMQQNAVDDGFKLVLGLGPTPTMFSDDYYP 71

Query: 1835 VGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQCEYDPMSLQNQW 1656
             G  KN G   +L+Q L S+ DS +LKLGLS  + E+    + SA  Q         +Q 
Sbjct: 72   GGANKNKGFTALLNQGLSSEGDS-ILKLGLSGSTDEISNALDFSAITQSTAGAPHHLDQL 130

Query: 1655 LVNGH---IPIVDEGSTSAKRNSGGYMPALLLAPSLENRNVP-ETRELLEHGTGAGAYHL 1488
               G+   IP++DEGST+AK+ SGGYMP+LLLAP +EN  +  + +ELLE G  +  +  
Sbjct: 131  SSGGNRPAIPVLDEGSTTAKK-SGGYMPSLLLAPRIENNQLSFQNKELLELGAKSHFHLP 189

Query: 1487 QFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGASGLCIAHGGGQ 1308
            + S EPS  +D S+  +S P T  A+S R + +PKRC+F GC KGARGA+GLCI HGGGQ
Sbjct: 190  ELSSEPSCISDYSMSTMSEPTTM-ATSSRKTTNPKRCKFPGCPKGARGATGLCIGHGGGQ 248

Query: 1307 RCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXXXXXC-TK 1131
            RCQK GC KGAESRTA+CKAHGGG+RC HLGCTKSAEGKTD+CIA             T+
Sbjct: 249  RCQKPGCNKGAESRTAYCKAHGGGKRCDHLGCTKSAEGKTDYCIAHGGGRRCSFTGGCTR 308

Query: 1130 AARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQGSTMHCK 951
            AARG+SGLCI+HGGGKRC VEGC+RSAEG+ GLCISHGGGRRCQ+P C KGAQGST++CK
Sbjct: 309  AARGRSGLCIKHGGGKRCSVEGCTRSAEGKVGLCISHGGGRRCQYPSCAKGAQGSTLYCK 368

Query: 950  AHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVAHGGGKR 771
            AHGGG+RC+FAGCTKGAEGSTPLCK HGGGKRCL+DGGGICPKSVHGGT++CVAHGGGKR
Sbjct: 369  AHGGGKRCIFAGCTKGAEGSTPLCKAHGGGKRCLFDGGGICPKSVHGGTNFCVAHGGGKR 428

Query: 770  CAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAWGQGKCDK 591
            CAVPGCTKSARGRTDCCV+HGGGKRC++ENCGKSAQGSTDFCKAHGGGKRC+WG+GKC+K
Sbjct: 429  CAVPGCTKSARGRTDCCVKHGGGKRCKFENCGKSAQGSTDFCKAHGGGKRCSWGEGKCEK 488

Query: 590  FARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLV-SESTVGSSSDNNFYGGASAVSN 414
            FARGR GLCAAH ++  G+NA  +GGMI   LF GLV + S V ++ + N    +S++ +
Sbjct: 489  FARGRGGLCAAHSSLLHGRNA-NKGGMIGPGLFHGLVPAASPVKTTFEKN--NRSSSMVS 545

Query: 413  CVESSVNT-----NTRQLIPPQVLVPLSMKSSS------TSRLMVDXXXXXXXXXXXXXX 267
             V  SV++       +QLIPPQVLVPLSMK+ S      TS    D              
Sbjct: 546  MVSDSVHSLGKPAERQQLIPPQVLVPLSMKALSTCSNPLTSEKQDDRSTDLGIGRSNTNN 605

Query: 266  XXXFALPEGRVHGGILMSLLKGDMKN 189
               F +PEGRVHGG LMSLL G++KN
Sbjct: 606  NFEFVVPEGRVHGGGLMSLLGGNLKN 631


>ref|XP_009586701.1| PREDICTED: uncharacterized protein LOC104084522 isoform X1 [Nicotiana
            tomentosiformis]
          Length = 642

 Score =  729 bits (1881), Expect = 0.0
 Identities = 373/626 (59%), Positives = 452/626 (72%), Gaps = 17/626 (2%)
 Frame = -3

Query: 2015 ADTTLTLDYVGYGAIKRFKTPESGNYESNLAHSSQASDDGCRLVLGLGPSPSCYADDYYP 1836
            +DTTL LD  GYG  +  +   SG    +      A DDG +LVLGLGP+P+ ++DDYYP
Sbjct: 20   SDTTLRLDCFGYGGNEFIRFGGSGTSSRSHVMQQNAVDDGFKLVLGLGPTPTMFSDDYYP 79

Query: 1835 VGTVKNNGSVPVLSQWLPSDSDSEMLKLGLSRGSAEVPGLQESSASLQCEYDPMSLQNQW 1656
             G  KN G   +L+Q L S+ DS +LKLGLS  + E+    + SA  Q         +Q 
Sbjct: 80   GGANKNKGFTALLNQGLSSEGDS-ILKLGLSGSTDEISNALDFSAITQSTAGAPHHLDQL 138

Query: 1655 LVNGH---IPIVDEGSTSAKRNSGGYMPALLLAPSLENRNVP-ETRELLEHGTGAGAYHL 1488
               G+   IP++DEGST+AK+ SGGYMP+LLLAP +EN  +  + +ELLE G  +  +  
Sbjct: 139  SSGGNRPAIPVLDEGSTTAKK-SGGYMPSLLLAPRIENNQLSFQNKELLELGAKSHFHLP 197

Query: 1487 QFSPEPSAPTDSSVGAISGPITSGASSERWSHHPKRCRFDGCSKGARGASGLCIAHGGGQ 1308
            + S EPS  +D S+  +S P T  A+S R + +PKRC+F GC KGARGA+GLCI HGGGQ
Sbjct: 198  ELSSEPSCISDYSMSTMSEPTTM-ATSSRKTTNPKRCKFPGCPKGARGATGLCIGHGGGQ 256

Query: 1307 RCQKTGCTKGAESRTAFCKAHGGGRRCQHLGCTKSAEGKTDFCIAXXXXXXXXXXXC-TK 1131
            RCQK GC KGAESRTA+CKAHGGG+RC HLGCTKSAEGKTD+CIA             T+
Sbjct: 257  RCQKPGCNKGAESRTAYCKAHGGGKRCDHLGCTKSAEGKTDYCIAHGGGRRCSFTGGCTR 316

Query: 1130 AARGKSGLCIRHGGGKRCKVEGCSRSAEGQAGLCISHGGGRRCQHPGCTKGAQGSTMHCK 951
            AARG+SGLCI+HGGGKRC VEGC+RSAEG+ GLCISHGGGRRCQ+P C KGAQGST++CK
Sbjct: 317  AARGRSGLCIKHGGGKRCSVEGCTRSAEGKVGLCISHGGGRRCQYPSCAKGAQGSTLYCK 376

Query: 950  AHGGGRRCVFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGICPKSVHGGTDYCVAHGGGKR 771
            AHGGG+RC+FAGCTKGAEGSTPLCK HGGGKRCL+DGGGICPKSVHGGT++CVAHGGGKR
Sbjct: 377  AHGGGKRCIFAGCTKGAEGSTPLCKAHGGGKRCLFDGGGICPKSVHGGTNFCVAHGGGKR 436

Query: 770  CAVPGCTKSARGRTDCCVRHGGGKRCQYENCGKSAQGSTDFCKAHGGGKRCAWGQGKCDK 591
            CAVPGCTKSARGRTDCCV+HGGGKRC++ENCGKSAQGSTDFCKAHGGGKRC+WG+GKC+K
Sbjct: 437  CAVPGCTKSARGRTDCCVKHGGGKRCKFENCGKSAQGSTDFCKAHGGGKRCSWGEGKCEK 496

Query: 590  FARGRSGLCAAHGNIFLGQNAGRRGGMIDTSLFRGLV-SESTVGSSSDNNFYGGASAVSN 414
            FARGR GLCAAH ++  G+NA  +GGMI   LF GLV + S V ++ + N    +S++ +
Sbjct: 497  FARGRGGLCAAHSSLLHGRNA-NKGGMIGPGLFHGLVPAASPVKTTFEKN--NRSSSMVS 553

Query: 413  CVESSVNT-----NTRQLIPPQVLVPLSMKSSS------TSRLMVDXXXXXXXXXXXXXX 267
             V  SV++       +QLIPPQVLVPLSMK+ S      TS    D              
Sbjct: 554  MVSDSVHSLGKPAERQQLIPPQVLVPLSMKALSTCSNPLTSEKQDDRSTDLGIGRSNTNN 613

Query: 266  XXXFALPEGRVHGGILMSLLKGDMKN 189
               F +PEGRVHGG LMSLL G++KN
Sbjct: 614  NFEFVVPEGRVHGGGLMSLLGGNLKN 639

