BLASTX nr result

ID: Cinnamomum24_contig00026487

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum24_contig00026487
         (1139 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010918465.1| PREDICTED: serine/arginine repetitive matrix...   241   8e-61
ref|XP_010274903.1| PREDICTED: cyclin-dependent kinase 13-like [...   238   5e-60
ref|XP_008806680.1| PREDICTED: uncharacterized protein LOC103719...   223   2e-55
ref|XP_010933318.1| PREDICTED: serine/arginine repetitive matrix...   219   4e-54
ref|XP_010274904.1| PREDICTED: cyclin-dependent kinase 13-like [...   214   8e-53
ref|XP_006429727.1| hypothetical protein CICLE_v10011149mg [Citr...   194   1e-46
ref|XP_007032151.1| Uncharacterized protein isoform 2 [Theobroma...   192   3e-46
ref|XP_007032150.1| Uncharacterized protein isoform 1 [Theobroma...   192   3e-46
ref|XP_009389233.1| PREDICTED: serine/arginine repetitive matrix...   192   6e-46
gb|KDO64284.1| hypothetical protein CISIN_1g004639mg [Citrus sin...   192   6e-46
ref|XP_011096915.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   189   5e-45
ref|XP_011030171.1| PREDICTED: uncharacterized protein LOC105129...   187   1e-44
ref|XP_002318801.2| hypothetical protein POPTR_0012s12820g [Popu...   186   3e-44
ref|XP_002530557.1| hypothetical protein RCOM_0303940 [Ricinus c...   180   2e-42
gb|KHN41015.1| hypothetical protein glysoja_013357 [Glycine soja]     179   3e-42
gb|KHN18623.1| hypothetical protein glysoja_033730 [Glycine soja]     179   3e-42
ref|XP_003518355.1| PREDICTED: dentin sialophosphoprotein-like [...   179   3e-42
ref|XP_002263918.1| PREDICTED: uncharacterized protein LOC100261...   178   6e-42
emb|CAN76723.1| hypothetical protein VITISV_042980 [Vitis vinifera]   178   6e-42
gb|KHG20408.1| hypothetical protein F383_04684 [Gossypium arboreum]   178   8e-42

>ref|XP_010918465.1| PREDICTED: serine/arginine repetitive matrix protein 1-like [Elaeis
            guineensis]
          Length = 692

 Score =  241 bits (615), Expect = 8e-61
 Identities = 166/355 (46%), Positives = 205/355 (57%), Gaps = 20/355 (5%)
 Frame = -2

Query: 1138 QKPHENAAAQTPKHKERNIRTVEIPH--ERTSNSSRGIREQLMTCRSMEQQQEPEIVEAA 965
            QK  EN A    K ++RN   VE+ +  + T+  +  +REQL+ C++ EQQ E EI E A
Sbjct: 350  QKASENIA-NLRKSEKRNGGAVEVSNGVKSTNVITSSLREQLVNCQAKEQQMEHEIREGA 408

Query: 964  ADSAKAGLPKRSEVSAENPGAENLIPQTITXXXXXXXXR-DLDLVMGLNPDTLLNNPSSY 788
                  G  K  E    + G E+L P TIT          D +  + LNPD  LN P+SY
Sbjct: 409  LQVK--GASKDGEAHLTSNGVESLHPITITRTRSSRRSSRDFENALDLNPDNHLN-PASY 465

Query: 787  ASLLLEDIQNYHCQNVAFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEPNRP 608
             SLLLEDI NYH +N AFSLP CVSKACSILEAVADLNS  S N        K SE +R 
Sbjct: 466  TSLLLEDIHNYHQKNTAFSLPACVSKACSILEAVADLNSSCSEN--------KSSEADRS 517

Query: 607  NCYNNSISLNGRFNRK-LMGKDPFVESEVVANDDDLMKPSLHKYVTVREPVRDMEQQESA 431
            N  N++ SLNGRF R+ L+ K PFVESE+V  DD LM+PSLHKY++VR+   ++E QESA
Sbjct: 518  N--NDNDSLNGRFGRRGLVPKGPFVESEIVVKDD-LMEPSLHKYISVRDLGGEVEPQESA 574

Query: 430  GSNSFLGHHWSSSWEPNSAESTDRWTSSLSYTGEEVEQSEATGSFEAGAGRLKRG--ARX 257
            GSNSF+G  WSSSWEPNS +STDR+ +S S  G+E EQ +     +       RG   R 
Sbjct: 575  GSNSFIGQPWSSSWEPNSVDSTDRYWTSQSINGDEAEQQQQQSMPDVARNSEDRGRRLRS 634

Query: 256  XXXXSMLPATSLSG-KKRECD-------------GGNGKQGLASFSLAPVATAAA 134
                + LP T  SG KK E D             GG+GK G +     PVA AAA
Sbjct: 635  GSCTNSLPTTMPSGSKKIEVDHHRPLHRGGSSLGGGSGKAGGSRSLSLPVAAAAA 689


>ref|XP_010274903.1| PREDICTED: cyclin-dependent kinase 13-like [Nelumbo nucifera]
          Length = 763

 Score =  238 bits (608), Expect = 5e-60
 Identities = 159/349 (45%), Positives = 213/349 (61%), Gaps = 26/349 (7%)
 Frame = -2

Query: 1102 KHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAADSAKAGLPKRSE- 926
            K  E++ R VE+P        +G  ++  + ++ E+QQ+ +  ++ A     G+P ++  
Sbjct: 431  KPDEKSNRVVELP--------QGDTDRRESSKAKEEQQKIDEEQSGAK----GIPVKANE 478

Query: 925  --VSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASLLLEDIQNYH 752
              V+    G+ENL PQTIT        RDLD+ +G NPD+ LN P+SYASLLLEDIQN+H
Sbjct: 479  VAVTVVATGSENLKPQTITRSRSSRRSRDLDIALGFNPDSHLN-PNSYASLLLEDIQNFH 537

Query: 751  CQN--VAFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEPNRPNCYNNSISLN 578
             QN   AFSLP CV+KACSILEAVADLNSCTSSN+S AFSE+KIS       ++ ++S N
Sbjct: 538  QQNNSTAFSLPACVTKACSILEAVADLNSCTSSNLSCAFSEDKIS--GADGSHSKNVSSN 595

Query: 577  GRF-NRKLMGKDPFVESEVVANDDDLMKPSLHKYVTVREPV--RDMEQQESAGSNSFLGH 407
                 R+++ K+PF+ESEVV + DDLM+PSLHKYVTVR  V   +M++QES+GSNSF+G 
Sbjct: 596  FHLGKRRMVAKEPFLESEVVVS-DDLMEPSLHKYVTVRRGVAGEEMDEQESSGSNSFVGQ 654

Query: 406  HW-SSSWEPNSAESTDRWTSSLSYTGEEVEQSEATGSF--------EAGAGRLKRGAR-- 260
            HW +SSWEPNSA+ST+RWTS  +Y  E V++     S         EA       GAR  
Sbjct: 655  HWAASSWEPNSADSTERWTSQSNYGDEVVDKEREPSSLGIENKAISEAAVHGAVVGARRL 714

Query: 259  --XXXXXSMLPATSLSGKKRECD-----GGNGKQGLASFSLAPVATAAA 134
                   + LP  + S K RE D      G G+ G A     P+ TAA+
Sbjct: 715  RHSGNNNNYLPTATASAKSREFDYHQMGSGLGRIG-ARGQTIPIVTAAS 762


>ref|XP_008806680.1| PREDICTED: uncharacterized protein LOC103719283 [Phoenix dactylifera]
          Length = 685

 Score =  223 bits (569), Expect = 2e-55
 Identities = 165/350 (47%), Positives = 201/350 (57%), Gaps = 26/350 (7%)
 Frame = -2

Query: 1138 QKPHENAAAQTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAAD 959
            QK  EN A    K ++RN   VE+  + T+  +  +REQ M  R+ EQQ EPEI E A  
Sbjct: 356  QKASENIA-NLRKSEQRNGGAVEV--KGTNVITNFVREQPMNRRAKEQQMEPEIGEGALQ 412

Query: 958  SAKAGLPKRSEVSAENPGAEN-LIPQTITXXXXXXXXR-DLDLVMGLNPDTLLNNPSSYA 785
                G  K  E    + G E+ L P+TIT          D D  + LNPD  L  P+SY 
Sbjct: 413  VK--GASKDGEAHLTSNGVESSLNPRTITRTRSSRRSSRDFDHALDLNPDNHLI-PTSYT 469

Query: 784  SLLLEDIQNYHCQNVAFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEPNRPN 605
            SLLLEDI NYH +N AFSLP CVSKACSILEAVADL S        + SE K S+ +R N
Sbjct: 470  SLLLEDIHNYHQKNTAFSLPACVSKACSILEAVADLKS--------SCSENKSSDADRSN 521

Query: 604  CYNNSISLNGRFNRK-LMGKDPFVESEVVANDDDLMKPSLHKYVTVRE-PVRDMEQQESA 431
              N+  SL+GRF R+ L+ K PFVESE+V   DDLM+PSLHKYV+VR+    +ME QESA
Sbjct: 522  --NDDGSLDGRFGRRGLVPKGPFVESEIVVK-DDLMEPSLHKYVSVRDLGGGEMEPQESA 578

Query: 430  GSNSFLGHHWSSSWEPNSAESTDRWTSSLSYTGEEVEQS--------EATGSFEAGAGRL 275
            GSNSF+G  WSS WEPNS +STDR+ +S S  G+EVEQ         E   + EA   RL
Sbjct: 579  GSNSFIGQPWSSPWEPNSGDSTDRYWTSQSINGDEVEQQQQQQQSTREVARNSEARGRRL 638

Query: 274  KRGARXXXXXSMLPATSLSG-KKRECD-------------GGNGKQGLAS 167
            + G+      + LP T  SG KKRE D             GG+GK   AS
Sbjct: 639  RDGS----TTNSLPTTMSSGSKKRELDHHRALHRGGSGFGGGSGKAAAAS 684


>ref|XP_010933318.1| PREDICTED: serine/arginine repetitive matrix protein 2-like [Elaeis
            guineensis]
          Length = 703

 Score =  219 bits (557), Expect = 4e-54
 Identities = 163/363 (44%), Positives = 208/363 (57%), Gaps = 29/363 (7%)
 Frame = -2

Query: 1135 KPHENAAAQTPKHKERNIRTVEIPHERTSNS--SRGIREQLMTCRSMEQQ-QEPEIVEAA 965
            K  EN A Q  K ++R    VE+ +   SN+  S  +REQL+ C + ++Q +EPE+V   
Sbjct: 363  KSSENGA-QMRKSEQRAGGAVEVSNGLRSNNVISTSVREQLIRCHAKDRQLEEPEMVVKG 421

Query: 964  ADSAKAGLPKRSEVSAENPGAENLIPQTITXXXXXXXXR-DLDLVMGLNPDTLLNNPSSY 788
            A     G    + V  E+P      P+TIT          D D  +G NPD  LN P+SY
Sbjct: 422  AVQPMDGEAPSTGVGVESPN-----PRTITRSRSLKRSSRDFDHALGPNPDGHLN-PTSY 475

Query: 787  ASLLLEDIQNYHCQNVAFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEPNRP 608
             SLLL+DI NYH Q+ AFSLP CVSKACSILEAVADLNS  S N S   S  + S+    
Sbjct: 476  TSLLLQDIHNYHHQSTAFSLPACVSKACSILEAVADLNSSCSENKS---SHAEGSD---- 528

Query: 607  NCYNNSISLNGRFNRK-LMGKDPFVESEVVANDDDLMKPSLHKYVTVREPVRDMEQQESA 431
               +N+ SL+GRF R+ ++ K PFVESE++  DD LM+PSLHKYV+VR+   +M+ QESA
Sbjct: 529  ---HNNGSLHGRFGRRGVVSKAPFVESEIIVKDD-LMEPSLHKYVSVRDFGGEMDPQESA 584

Query: 430  GSNSFLGHHWSSSWEPNSAESTDRWTSSLSYTGEEVEQ--------SEATGSFEAGAGRL 275
            GSNSF+G  WSS WEPNS +STDR  +S S  GEEVEQ         E     EA   RL
Sbjct: 585  GSNSFIGQPWSSPWEPNSIDSTDRDWTSHSNNGEEVEQLQQQQESMPEIALHSEARGRRL 644

Query: 274  KRGARXXXXXSMLPATSLSG-KKRECD--------------GGNGKQGLASFSLA-PVAT 143
            + G+      + LP T  SG KKRE D              GG+GK G  + S + PVA 
Sbjct: 645  RGGS----CSNSLPTTMSSGRKKREFDHHHHQRRRGRSGFGGGSGKAGRTTRSSSLPVAA 700

Query: 142  AAA 134
            A++
Sbjct: 701  ASS 703


>ref|XP_010274904.1| PREDICTED: cyclin-dependent kinase 13-like [Nelumbo nucifera]
          Length = 739

 Score =  214 bits (546), Expect = 8e-53
 Identities = 139/281 (49%), Positives = 181/281 (64%), Gaps = 9/281 (3%)
 Frame = -2

Query: 1114 AQTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAADSAKAGLPK 935
            +Q  K  E++ R VE+P  + +N  R      M+ R+ E QQ+  IVE   ++   G+P 
Sbjct: 428  SQMQKPDEKSNRVVELP--QGANDRR--ENNPMSSRAKEGQQK--IVEEQTEAK--GIPA 479

Query: 934  RSEVSA---ENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASLLLEDI 764
            ++ V A      G E+  PQT+T        RDLD+ +G NPD+ LN P+SY SLLLEDI
Sbjct: 480  KANVVAVIVAATGDESQKPQTLTRSRSSRRSRDLDIALGFNPDSHLN-PNSYTSLLLEDI 538

Query: 763  QNYHCQN--VAFSLPPCVSKACSILEAVADLNSCTSSNIS-PAFSEEKISEPNRPNCYNN 593
            QN+H QN    FSLP CV+KACSILEAVADLNSC+SSN+S  AFSE K SE +  +  N 
Sbjct: 539  QNFHQQNNNTVFSLPACVTKACSILEAVADLNSCSSSNLSCAAFSENKTSEADGSHRKNV 598

Query: 592  SISLNGRFNRKLMGKDPFVESEVVANDDDLMKPSLHKYVTVREPVR--DMEQQESAGSNS 419
            S   +    R++  K+P +ESE+V   DDL +PSLHKYVTVR  V   +MEQQES+GSNS
Sbjct: 599  SSDFH-LGKRRMEAKEPILESEMVVR-DDLTEPSLHKYVTVRRGVTGGEMEQQESSGSNS 656

Query: 418  FLGHHW-SSSWEPNSAESTDRWTSSLSYTGEEVEQSEATGS 299
            F+ H+W +SSWEPNSA+ST+RWTS  SY  E V++     S
Sbjct: 657  FVSHNWAASSWEPNSADSTERWTSQSSYGDEVVDKEREPSS 697


>ref|XP_006429727.1| hypothetical protein CICLE_v10011149mg [Citrus clementina]
            gi|568855457|ref|XP_006481321.1| PREDICTED:
            serine/arginine repetitive matrix protein 2-like [Citrus
            sinensis] gi|557531784|gb|ESR42967.1| hypothetical
            protein CICLE_v10011149mg [Citrus clementina]
          Length = 740

 Score =  194 bits (492), Expect = 1e-46
 Identities = 144/349 (41%), Positives = 190/349 (54%), Gaps = 19/349 (5%)
 Frame = -2

Query: 1123 NAAAQTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAADSAKAG 944
            N   Q P HK       +  +   S+         +T  ++ ++++ +I+E      KA 
Sbjct: 415  NVLYQAPIHKPNAENIAQGTNNHKSSCRGTTLNNKVTGANITEKEQRQILE----EDKAQ 470

Query: 943  LPKRSEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASLLLEDI 764
            LP  +  +      E+  PQT+T        RDLDL   LNP+TLLN   SY +LLLEDI
Sbjct: 471  LPMTANAAVVT---ESQKPQTLTRTRSSRRSRDLDL--DLNPETLLNPTPSYTALLLEDI 525

Query: 763  QNYHCQNV-AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEK----ISEPNRPNCY 599
            QN+H ++  + SLP CV+KACSILEAVADLNS TSSN+S AFSE++      + N  N Y
Sbjct: 526  QNFHQKSTPSVSLPACVTKACSILEAVADLNSTTSSNLSCAFSEDRKPPSADQSNNKNAY 585

Query: 598  NNSISLNGRFNRKLMGKDPFVESEVVANDDDLMKPSLHKYVTVREPVR-----DMEQQES 434
            N S  +N    +    KDPFVESEV+A DDDLM+PS H+YVTVR         DM+ QES
Sbjct: 586  NFSAGVNLVGKKMTEAKDPFVESEVLA-DDDLMEPSFHRYVTVRRGGSELGGVDMDGQES 644

Query: 433  AGSNSFLG----HHW--SSSWEPNSAESTDRWTSSLSYTGEEVEQSEATGSFEAGAGRLK 272
            +GSNSF+G     +W  SSSWEPNSA+STDRWTS  +      E+ ++   F+  A    
Sbjct: 645  SGSNSFVGCTTQQNWTSSSSWEPNSADSTDRWTSRSNMK----EEDQSPLGFQRQAMSEA 700

Query: 271  RGARXXXXXSMLPATSLSGKKRECD---GGNGKQGLASFSLAPVATAAA 134
             G               SGK+R+ D    GN +  +A      VATAAA
Sbjct: 701  AGCEATKN-----RKGFSGKRRDTDYQQNGNWRGRVA------VATAAA 738


>ref|XP_007032151.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508711180|gb|EOY03077.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 718

 Score =  192 bits (489), Expect = 3e-46
 Identities = 132/270 (48%), Positives = 162/270 (60%), Gaps = 25/270 (9%)
 Frame = -2

Query: 910  PGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNN-PSSYASLLLEDIQNYHCQN--V 740
            PGAEN  PQT+T        RDLDL    NP+TLLN  PSSY +LLLEDIQN+H  N   
Sbjct: 455  PGAENPKPQTLTRSRSSRRSRDLDL----NPETLLNPIPSSYTTLLLEDIQNFHQTNNPP 510

Query: 739  AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEK---ISEPNRPNCYNNSISLNGRF 569
            +FSLP CVSKACSILEAVADLNS TSSN+S AFSE++    ++ +  N YN ++      
Sbjct: 511  SFSLPSCVSKACSILEAVADLNSTTSSNLSCAFSEDRKGLSTDESSKNGYNATVG----- 565

Query: 568  NRKLMGKDPFVESEVVANDDDLMKPSLHKYVTVREPV----RDMEQQESAGSNSFLG--- 410
             +    +DPFVESEVV   DDLM+PS HKYVTVR        DME+QES+GSNSF+G   
Sbjct: 566  RKMAETRDPFVESEVVGR-DDLMEPSFHKYVTVRRGATLGGTDMEEQESSGSNSFVGSGQ 624

Query: 409  -HHWS---SSWEPNSAESTDRWTSSLSYTGEE-----VEQSEATGSFEAGAGRLKRGARX 257
              HW    SSWEPNSA+STDRWTS      E+       Q +A    ++G+  +K   R 
Sbjct: 625  QQHWGFSPSSWEPNSADSTDRWTSRTKSREEDHSSSLEPQRQALAEPQSGSD-IKNSTR- 682

Query: 256  XXXXSMLPATSLSGKKRECD---GGNGKQG 176
                       LSG++R+ D    G G+ G
Sbjct: 683  ---------KGLSGRRRDVDLQHAGIGRAG 703


>ref|XP_007032150.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508711179|gb|EOY03076.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 785

 Score =  192 bits (489), Expect = 3e-46
 Identities = 132/270 (48%), Positives = 162/270 (60%), Gaps = 25/270 (9%)
 Frame = -2

Query: 910  PGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNN-PSSYASLLLEDIQNYHCQN--V 740
            PGAEN  PQT+T        RDLDL    NP+TLLN  PSSY +LLLEDIQN+H  N   
Sbjct: 522  PGAENPKPQTLTRSRSSRRSRDLDL----NPETLLNPIPSSYTTLLLEDIQNFHQTNNPP 577

Query: 739  AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEK---ISEPNRPNCYNNSISLNGRF 569
            +FSLP CVSKACSILEAVADLNS TSSN+S AFSE++    ++ +  N YN ++      
Sbjct: 578  SFSLPSCVSKACSILEAVADLNSTTSSNLSCAFSEDRKGLSTDESSKNGYNATVG----- 632

Query: 568  NRKLMGKDPFVESEVVANDDDLMKPSLHKYVTVREPV----RDMEQQESAGSNSFLG--- 410
             +    +DPFVESEVV   DDLM+PS HKYVTVR        DME+QES+GSNSF+G   
Sbjct: 633  RKMAETRDPFVESEVVGR-DDLMEPSFHKYVTVRRGATLGGTDMEEQESSGSNSFVGSGQ 691

Query: 409  -HHWS---SSWEPNSAESTDRWTSSLSYTGEE-----VEQSEATGSFEAGAGRLKRGARX 257
              HW    SSWEPNSA+STDRWTS      E+       Q +A    ++G+  +K   R 
Sbjct: 692  QQHWGFSPSSWEPNSADSTDRWTSRTKSREEDHSSSLEPQRQALAEPQSGSD-IKNSTR- 749

Query: 256  XXXXSMLPATSLSGKKRECD---GGNGKQG 176
                       LSG++R+ D    G G+ G
Sbjct: 750  ---------KGLSGRRRDVDLQHAGIGRAG 770


>ref|XP_009389233.1| PREDICTED: serine/arginine repetitive matrix protein 2 [Musa
            acuminata subsp. malaccensis]
          Length = 671

 Score =  192 bits (487), Expect = 6e-46
 Identities = 144/355 (40%), Positives = 175/355 (49%), Gaps = 34/355 (9%)
 Frame = -2

Query: 1138 QKPHENAAAQTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAAD 959
            QK  EN+   +    +RN    E      SN       Q+M+CR+ E + E  + E A  
Sbjct: 341  QKTSENSIRASKSSSQRNGSAAEFTSATRSNDV-----QVMSCRAKELETEAAVAEEAIA 395

Query: 958  SAKAGLPKRSEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASL 779
             A + + +     + N G E+ I +TI+         DLD    LN +  LN P+SYAS 
Sbjct: 396  KASSKVTE-----SPNLGVESHILKTISGTRSSR---DLDHPSELNQEAFLN-PNSYASS 446

Query: 778  LLEDIQNYHCQ--NVAFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEPNRPN 605
            LLEDI NY  Q    +FSLP CVSKACSILEAVADLNS +S N                 
Sbjct: 447  LLEDIHNYQQQLPKASFSLPACVSKACSILEAVADLNSASSENNG--------------- 491

Query: 604  CYNNSISLNGRFNRK-LMGKDPFVESEVVANDDDLMKPSLHKYVTVREPVR-DMEQQESA 431
                  SLNGR  R+    K PFVESE+V  DD L++PSLHKYVTVR+  R D+E QESA
Sbjct: 492  ------SLNGRHQRRGSASKVPFVESEIVVKDD-LLEPSLHKYVTVRDMRRSDVEPQESA 544

Query: 430  GSNSFLGHHWSSSWEPNSAESTDRWTSSLSYTGEEVEQSEATGSFEAGA----------- 284
            GSNSF+G  WSS WEPNS +STDR+ +S S  GEEVE+ EA  S  +             
Sbjct: 545  GSNSFMGQPWSSGWEPNSVDSTDRYQNSRSIDGEEVEEEEANQSPVSHGSRYHQQPVPEV 604

Query: 283  -------GRLKRGARXXXXXSMLPATSLSGKKREC------------DGGNGKQG 176
                   GR  RG          PA S    KRE             D G GK G
Sbjct: 605  VREPETRGRRSRGVSGNSSNHRSPAKSSKNSKRELHHRVQLHRTGSGDSGGGKPG 659


>gb|KDO64284.1| hypothetical protein CISIN_1g004639mg [Citrus sinensis]
          Length = 740

 Score =  192 bits (487), Expect = 6e-46
 Identities = 144/349 (41%), Positives = 189/349 (54%), Gaps = 19/349 (5%)
 Frame = -2

Query: 1123 NAAAQTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAADSAKAG 944
            N   Q   HK       +  +   S+S        +T  ++ ++++ +I+E      KA 
Sbjct: 415  NVLYQATIHKPNAENIAQGTNNHKSSSRGTTLNNKVTGANITEKEQRQILE----EDKAQ 470

Query: 943  LPKRSEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASLLLEDI 764
            LP  +  +      E+  PQT+T        RDLDL   LNP+TLLN   SY +LLLEDI
Sbjct: 471  LPMTANAAVVT---ESQKPQTLTRTRSSRRSRDLDL--DLNPETLLNPAPSYTALLLEDI 525

Query: 763  QNYHCQNV-AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEK----ISEPNRPNCY 599
            QN+H ++  + SLP CV+KACSILEAVADLNS TSSN+S AFSE +      + N  N Y
Sbjct: 526  QNFHQKSTPSVSLPACVTKACSILEAVADLNSTTSSNLSCAFSENRKPPSADQSNNKNAY 585

Query: 598  NNSISLNGRFNRKLMGKDPFVESEVVANDDDLMKPSLHKYVTVREPVR-----DMEQQES 434
            N S  +N    +    KDPFVESEV+A DDDLM+PS H+YVTVR         DM+ QES
Sbjct: 586  NFSAGVNLVGKKMTEAKDPFVESEVLA-DDDLMEPSFHRYVTVRRGGSELGGVDMDGQES 644

Query: 433  AGSNSFLG----HHW--SSSWEPNSAESTDRWTSSLSYTGEEVEQSEATGSFEAGAGRLK 272
            +GSNSF+G     +W  SSSWEPNSA+STDRWTS  +      E+ ++   F+  A    
Sbjct: 645  SGSNSFVGCTTQQNWTSSSSWEPNSADSTDRWTSRSNMK----EEDQSPLGFQRQAMSEA 700

Query: 271  RGARXXXXXSMLPATSLSGKKRECD---GGNGKQGLASFSLAPVATAAA 134
             G               SGK+R+ D    GN +  +A      VATAAA
Sbjct: 701  AGCEATKN-----RKGFSGKRRDTDYQQNGNWRGRVA------VATAAA 738


>ref|XP_011096915.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105175967
            [Sesamum indicum]
          Length = 687

 Score =  189 bits (479), Expect = 5e-45
 Identities = 145/352 (41%), Positives = 182/352 (51%), Gaps = 26/352 (7%)
 Frame = -2

Query: 1123 NAAAQTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAADSAKAG 944
            N  +Q P  K  +        E   NSS  +    + C++ EQQ   E ++A       G
Sbjct: 356  NVISQAPVQKPNSENKTVQGAENRINSSSSLAN--VNCKNQEQQLMGEEMKALY----GG 409

Query: 943  LPKRSEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLN-NPSSYASLLLED 767
            +     ++  + G EN  P  +         RDLD+    NP+TL N N SSY +LLLED
Sbjct: 410  IAGHVALNVISSGPENPKPHAVARSRSSRLSRDLDI----NPETLSNPNTSSYTALLLED 465

Query: 766  IQNYHCQN---VAFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEP-----NR 611
            IQN+H +N    AFSLPPCV+KACSILEAVADLNS TSSN+S  FSE++   P     N+
Sbjct: 466  IQNFHQKNNPPPAFSLPPCVTKACSILEAVADLNSSTSSNLSSVFSEDRRRNPVTEQMNK 525

Query: 610  PNCYNNSISLNGRFNRKLMGKDPFVESEVVANDDDLMKPSLHKYVTVREPVR---DMEQQ 440
             N   +S   N     K   KDP +ESE++A  DDLM+PS HKYVTVR       D+E++
Sbjct: 526  SNENKSSSGANLVGKTKPEIKDPILESEIIAT-DDLMEPSFHKYVTVRRGTSGGDDLEEE 584

Query: 439  ESAGSNSFLG--HHW--SSSWEPNSAESTDRWTSSLSYTGEEVEQSEATG-----SFEAG 287
            ES+GSNSF+G    W  SS WEPNSAESTDRW SS S +          G       E G
Sbjct: 585  ESSGSNSFVGSQQQWVSSSCWEPNSAESTDRWMSSSSRSASRQNDVSPVGFQRHAVSEPG 644

Query: 286  AGRLKRGARXXXXXSMLPATSLSGKKRECDG-----GNGKQGLASFSLAPVA 146
             G  + G R       + AT    KKR+ D      G GK G      AP A
Sbjct: 645  RGFDESGKR-------MSAT----KKRDSDHQQNGIGRGKIGTRGPHAAPAA 685


>ref|XP_011030171.1| PREDICTED: uncharacterized protein LOC105129689 [Populus euphratica]
          Length = 753

 Score =  187 bits (476), Expect = 1e-44
 Identities = 136/319 (42%), Positives = 173/319 (54%), Gaps = 33/319 (10%)
 Frame = -2

Query: 1123 NAAAQTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAADSAKAG 944
            N   QTP  K+ + +          N+   ++     C SM   +     E   + AK  
Sbjct: 443  NPLNQTPMKKQNSEK----------NNRVNVQVANYRCSSMASLENKLSKEQQMEEAKGH 492

Query: 943  LPKRSEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASLLLEDI 764
            LP  + V   + G E L PQ +T        RDLDL    NP+TLLN   SY +LLLEDI
Sbjct: 493  LPVTTNVV--DLGGECLKPQALTRSRSARRSRDLDL----NPETLLNPTPSYTALLLEDI 546

Query: 763  QNYHCQNV--AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEPNRPNCYNNS 590
            QN+H +N   +FSLP CV+KACSILEAVADLNS TSSN+S AFS+++IS P        +
Sbjct: 547  QNFHQKNTPPSFSLPACVTKACSILEAVADLNSTTSSNLSCAFSDDRISPP--------A 598

Query: 589  ISLNGRFNRKL-MGKDPFVESEVVANDDDLMKPSLHKYVTVREP-----VRDMEQQESAG 428
            ++      +KL   KDPFVESEV+A+ DDLM+PS HKYVTVR         DM+ QES+G
Sbjct: 599  VAAVNLVGKKLPEAKDPFVESEVIAS-DDLMEPSFHKYVTVRRGGGTLCGEDMDGQESSG 657

Query: 427  SNSFLGHHW------SSSWEPNSAESTDRWTS------------------SLSYTGEEVE 320
            SNSF+G         +SSWEPNSA+STDRW+S                   L  TG +VE
Sbjct: 658  SNSFVGGSHQHLGLSTSSWEPNSADSTDRWSSRSNTRDEDDKSPLGYQKHGLPETGRDVE 717

Query: 319  QS-EATGSFEAGAGRLKRG 266
            Q+  A      G GR + G
Sbjct: 718  QARRAFSGQRTGIGRGRHG 736


>ref|XP_002318801.2| hypothetical protein POPTR_0012s12820g [Populus trichocarpa]
            gi|550327002|gb|EEE97021.2| hypothetical protein
            POPTR_0012s12820g [Populus trichocarpa]
          Length = 754

 Score =  186 bits (472), Expect = 3e-44
 Identities = 134/319 (42%), Positives = 173/319 (54%), Gaps = 33/319 (10%)
 Frame = -2

Query: 1123 NAAAQTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAADSAKAG 944
            N   QTP  K+ + +          N+   ++     C SM   +     E   + AK  
Sbjct: 444  NPLNQTPMKKQNSEK----------NNRVNVQVANYRCSSMASLENKLSKEQQMEEAKGH 493

Query: 943  LPKRSEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASLLLEDI 764
             P  + V   + G E+L PQ +T        RDLDL    NP+TLLN   SY +LLLEDI
Sbjct: 494  PPVTTNVV--DLGGESLKPQALTRSRSARRSRDLDL----NPETLLNPTPSYTALLLEDI 547

Query: 763  QNYHCQNV--AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEPNRPNCYNNS 590
            QN+H +N   +FSLP CV+KACSILEAVADLNS TSSN+S AFS+++IS P        +
Sbjct: 548  QNFHQKNTPPSFSLPACVTKACSILEAVADLNSTTSSNLSCAFSDDRISPP--------A 599

Query: 589  ISLNGRFNRKL-MGKDPFVESEVVANDDDLMKPSLHKYVTVREP-----VRDMEQQESAG 428
            ++      +KL   KDPFVESE++A+ DDLM+PS HKYVTVR         DM+ QES+G
Sbjct: 600  VAAVNLVGKKLPEAKDPFVESEIIAS-DDLMEPSFHKYVTVRRGGGTLCGEDMDGQESSG 658

Query: 427  SNSFLGHHW------SSSWEPNSAESTDRWTS------------------SLSYTGEEVE 320
            SNSF+G         +SSWEPNSA+STDRW+S                   L  TG +VE
Sbjct: 659  SNSFVGGSQQHLGLSTSSWEPNSADSTDRWSSRSNTRDEDDKSPLGYQKHGLPETGRDVE 718

Query: 319  QS-EATGSFEAGAGRLKRG 266
            Q+  A      G GR + G
Sbjct: 719  QARRAFSGQRTGIGRGRHG 737


>ref|XP_002530557.1| hypothetical protein RCOM_0303940 [Ricinus communis]
            gi|223529895|gb|EEF31825.1| hypothetical protein
            RCOM_0303940 [Ricinus communis]
          Length = 725

 Score =  180 bits (457), Expect = 2e-42
 Identities = 119/230 (51%), Positives = 146/230 (63%), Gaps = 22/230 (9%)
 Frame = -2

Query: 937  KRSEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASLLLEDIQN 758
            K   V+AE  GA+ L PQT+         RDLD     NP+T LN   SY +LLLEDIQN
Sbjct: 460  KEQTVTAEASGAD-LKPQTVARSRSARRSRDLDF----NPETSLNPNPSYTALLLEDIQN 514

Query: 757  YHCQNV-------AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEPNRPNCY 599
            +H ++        +FS+P CV+KACSI+EAVADLNS TSSN+S AFS+EK S        
Sbjct: 515  FHQKSTNTNTNTPSFSVPACVTKACSIVEAVADLNSTTSSNLSCAFSDEKRSP------- 567

Query: 598  NNSISLNGRFNRKL-MGKDPFVESEVVANDDDLMKPSLHKYVTVR--------EPVRDME 446
              +  ++    +KL  GKDPFVESEV+ N DDLM+PS HKYVTVR          V DM+
Sbjct: 568  --TTVVSNLVGKKLEEGKDPFVESEVLVN-DDLMEPSFHKYVTVRRGGNGKGTSSVEDMD 624

Query: 445  QQESAGSNSFLG---HHW---SSSWEPNSAESTDRWTSSLSYTGEEVEQS 314
             QES+GSNSF+G    HW   +SSWEPNSA+STDRWTS  S T +E E+S
Sbjct: 625  GQESSGSNSFVGSSQQHWGYSTSSWEPNSADSTDRWTSR-SNTRDEEEKS 673


>gb|KHN41015.1| hypothetical protein glysoja_013357 [Glycine soja]
          Length = 675

 Score =  179 bits (455), Expect = 3e-42
 Identities = 141/357 (39%), Positives = 187/357 (52%), Gaps = 33/357 (9%)
 Frame = -2

Query: 1138 QKPHENAAAQTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAAD 959
            Q P+ N+  Q  K K      ++ P+ R +   +G+    + C++ EQ +E E       
Sbjct: 359  QSPYSNSKVQQNKPKIE-AEAIQKPNGRVA-LEKGVS---VNCKTKEQHEEEE------- 406

Query: 958  SAKAGLPKRSEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASL 779
             +   +    + +A + G +NL PQ +T        RDLD           N  +SYASL
Sbjct: 407  -SSVPISAVVKTTAVSSGVDNLKPQGLTRSRSSRRSRDLDT----------NATNSYASL 455

Query: 778  LLEDIQNYHCQNV--------AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKIS 623
            LLEDIQN+H +N         + SLP C++K CSILEAVADLNS TSSN    F+E+K S
Sbjct: 456  LLEDIQNFHQKNTQQQQQQPSSVSLPACLNKVCSILEAVADLNSTTSSN----FTEDKRS 511

Query: 622  EPNRPNCYNNSISLNGRFNRKLMG--KDPFVESEVVANDDDLMKPSLHKYVTVRE----P 461
                P+   ++I  +  + +K+ G  KDPFVESE VA  DD+M+PSLHKYVTV+      
Sbjct: 512  ----PSTQQSNIRNDEYYGKKVAGSNKDPFVESE-VAVSDDVMEPSLHKYVTVKRGGGVV 566

Query: 460  VRDMEQQESAGSNSFL------GHHW------SSSWEPNSAESTDRWTSSLSYTGEEVEQ 317
            V DME QES+GSNSF        HHW      SSSWEPNSA+STD WTSS   + EE  Q
Sbjct: 567  VEDMEDQESSGSNSFTVSSSSGQHHWGNNISCSSSWEPNSADSTDCWTSSRLSSREEEAQ 626

Query: 316  SEATG---SFEAGAGRLKRGARXXXXXSMLPATSLSGKKRECD----GGNGKQGLAS 167
                G   S  + A + K+G              L+ K+RECD    GG G+  L S
Sbjct: 627  KTPLGLGCSLSSEAKKKKKG--------------LNSKRRECDHEHSGGIGRGRLGS 669


>gb|KHN18623.1| hypothetical protein glysoja_033730 [Glycine soja]
          Length = 725

 Score =  179 bits (455), Expect = 3e-42
 Identities = 139/360 (38%), Positives = 183/360 (50%), Gaps = 27/360 (7%)
 Frame = -2

Query: 1132 PHENAAAQTPKHKERNIRTVEIPHERTSNSSRGIREQLMT--CRSMEQQQEPEIVEAAAD 959
            PH  A   + K + R  +  E    + +N SR   ++ M   C++  QQ+E   V+++  
Sbjct: 387  PHSTANNSSSKVQNRPKKEFETEANQKTNGSRTALDKGMNVNCKTKVQQEEDVKVQSSIT 446

Query: 958  S---AKAGLPKRSEVSAENPGAENLIPQ-TITXXXXXXXXRDLDLVMGLNPDTLLNNPSS 791
                 K  +P         PG +NL P  T+T        RDLDL    NP+ LLN P S
Sbjct: 447  DNVVVKTMVP---------PGVDNLKPPYTLTRSRSSRQSRDLDL----NPEALLNPPQS 493

Query: 790  YASLLLEDIQNYHCQNV-AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEK-ISEP 617
            YASLLLEDIQN+H +N    SLP CV+KACSILEAVADLNS    N   A      ++  
Sbjct: 494  YASLLLEDIQNFHQKNTPPVSLPACVTKACSILEAVADLNSNAGLNFCGAEDRRSPLAFQ 553

Query: 616  NRPNCYNNSISLNGRFNRKLMGKDPFVESEVVANDDDLMKPSLHKYVTVREPVR----DM 449
               N YN S++ +    R+   +DP VES ++ NDDD+M+ SLHKYVTV         DM
Sbjct: 554  CSRNDYNVSLTTHDYGKREPDAEDPVVESMLLFNDDDVMEQSLHKYVTVNRGGLLGGVDM 613

Query: 448  EQQESAGSNSFL----GHHW---SSSWEPNSAESTDRWTSSLSYTGEEVEQSEATGSFEA 290
            + QES+GSNSF       HW   SSSWEP+S ES D WTS  +Y+ EE ++    G   +
Sbjct: 614  DDQESSGSNSFTVSSGQQHWGVSSSSWEPSSVESKDCWTSRSNYSKEEGQKLGLEGRVAS 673

Query: 289  GAGRLKRGARXXXXXSMLPATSLSGKKRECDGGNGKQGLASFSLA--------PVATAAA 134
             AG    GA+            L+ ++RECD      G+    L         PV TAAA
Sbjct: 674  EAGLDAGGAK----------KKLNSQRRECDHHQHGSGIGRGRLGANKVLHNRPVVTAAA 723


>ref|XP_003518355.1| PREDICTED: dentin sialophosphoprotein-like [Glycine max]
            gi|947124684|gb|KRH72890.1| hypothetical protein
            GLYMA_02G239000 [Glycine max]
          Length = 678

 Score =  179 bits (455), Expect = 3e-42
 Identities = 141/357 (39%), Positives = 187/357 (52%), Gaps = 33/357 (9%)
 Frame = -2

Query: 1138 QKPHENAAAQTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAAD 959
            Q P+ N+  Q  K K      ++ P+ R +   +G+    + C++ EQ +E E       
Sbjct: 362  QSPYSNSKVQQNKPKIE-AEAIQKPNGRVA-LEKGVS---VNCKTKEQHEEEE------- 409

Query: 958  SAKAGLPKRSEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASL 779
             +   +    + +A + G +NL PQ +T        RDLD           N  +SYASL
Sbjct: 410  -SSVPISAVVKTTAVSSGVDNLKPQGLTRSRSSRRSRDLDT----------NATNSYASL 458

Query: 778  LLEDIQNYHCQNV--------AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKIS 623
            LLEDIQN+H +N         + SLP C++K CSILEAVADLNS TSSN    F+E+K S
Sbjct: 459  LLEDIQNFHQKNTQQQQQQPSSVSLPACLNKVCSILEAVADLNSTTSSN----FTEDKRS 514

Query: 622  EPNRPNCYNNSISLNGRFNRKLMG--KDPFVESEVVANDDDLMKPSLHKYVTVRE----P 461
                P+   ++I  +  + +K+ G  KDPFVESE VA  DD+M+PSLHKYVTV+      
Sbjct: 515  ----PSTQQSNIRNDEYYGKKVAGSNKDPFVESE-VAVSDDVMEPSLHKYVTVKRGGGVV 569

Query: 460  VRDMEQQESAGSNSFL------GHHW------SSSWEPNSAESTDRWTSSLSYTGEEVEQ 317
            V DME QES+GSNSF        HHW      SSSWEPNSA+STD WTSS   + EE  Q
Sbjct: 570  VEDMEDQESSGSNSFTVSSSSGQHHWGNNISCSSSWEPNSADSTDCWTSSRLSSREEEAQ 629

Query: 316  SEATG---SFEAGAGRLKRGARXXXXXSMLPATSLSGKKRECD----GGNGKQGLAS 167
                G   S  + A + K+G              L+ K+RECD    GG G+  L S
Sbjct: 630  KTPLGLGCSLSSEAKKKKKG--------------LNSKRRECDHEHSGGIGRGRLGS 672


>ref|XP_002263918.1| PREDICTED: uncharacterized protein LOC100261489 [Vitis vinifera]
          Length = 710

 Score =  178 bits (452), Expect = 6e-42
 Identities = 127/271 (46%), Positives = 162/271 (59%), Gaps = 9/271 (3%)
 Frame = -2

Query: 1111 QTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAADSAKAGLPKR 932
            Q P  K+ N   V +      +SSRG   Q++     E+  EP+ ++   +S +  +   
Sbjct: 398  QKPNMKDMNNGKVVVHGTNNRSSSRGKVFQVV-----EEAGEPKGLQPRTNSIETTIVVA 452

Query: 931  SEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASLLLEDIQNYH 752
            S       GAE+L PQ +T        RDLDL    NP+TLLN   SY +LLLEDIQN+H
Sbjct: 453  S-------GAESLKPQALTRTRSSRRSRDLDL----NPETLLNPTPSYTTLLLEDIQNFH 501

Query: 751  CQNV---AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEPNRPNCYNNSI-S 584
             +N    + SLP CVSKA SILEAVADLNSCTSSN S AFS+++    N    + NS+  
Sbjct: 502  QKNTTTPSISLPACVSKAHSILEAVADLNSCTSSNPSYAFSDDR---RNFTETHQNSMDD 558

Query: 583  LNGRFNRKLMGKDPF-VESEVVANDDDLMKPSLHKYVTVREPV----RDMEQQESAGSNS 419
             N    ++L  KDPF VESE+V   +DLM+PSLHKYVTV+        +ME+QES+GSNS
Sbjct: 559  KNPAGKKRLEAKDPFVVESEIVV-CNDLMEPSLHKYVTVKRGTIGGGGEMEEQESSGSNS 617

Query: 418  FLGHHWSSSWEPNSAESTDRWTSSLSYTGEE 326
            F+G     SWEPNSA+STD WTS  S T EE
Sbjct: 618  FVGVSQLHSWEPNSADSTDCWTSR-SNTREE 647


>emb|CAN76723.1| hypothetical protein VITISV_042980 [Vitis vinifera]
          Length = 685

 Score =  178 bits (452), Expect = 6e-42
 Identities = 127/271 (46%), Positives = 162/271 (59%), Gaps = 9/271 (3%)
 Frame = -2

Query: 1111 QTPKHKERNIRTVEIPHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAADSAKAGLPKR 932
            Q P  K+ N   V +      +SSRG   Q++     E+  EP+ ++   +S +  +   
Sbjct: 398  QKPNMKDMNNGKVVVHGSNNRSSSRGKVFQVV-----EEAGEPKGLQPRTNSIETTIVVA 452

Query: 931  SEVSAENPGAENLIPQTITXXXXXXXXRDLDLVMGLNPDTLLNNPSSYASLLLEDIQNYH 752
            S       GAE+L PQ +T        RDLDL    NP+TLLN   SY +LLLEDIQN+H
Sbjct: 453  S-------GAESLKPQALTRTRSSRRSRDLDL----NPETLLNLTPSYTTLLLEDIQNFH 501

Query: 751  CQNV---AFSLPPCVSKACSILEAVADLNSCTSSNISPAFSEEKISEPNRPNCYNNSI-S 584
             +N    + SLP CVSKA SILEAVADLNSCTSSN S AFS+++    N    + NS+  
Sbjct: 502  QKNTTTPSISLPACVSKAHSILEAVADLNSCTSSNPSYAFSDDR---RNFTETHQNSMDD 558

Query: 583  LNGRFNRKLMGKDPF-VESEVVANDDDLMKPSLHKYVTVREPV----RDMEQQESAGSNS 419
             N    ++L  KDPF VESE+V   +DLM+PSLHKYVTV+        +ME+QES+GSNS
Sbjct: 559  KNPAGKKRLEAKDPFVVESEIVV-CNDLMEPSLHKYVTVKRGTIGGGGEMEEQESSGSNS 617

Query: 418  FLGHHWSSSWEPNSAESTDRWTSSLSYTGEE 326
            F+G     SWEPNSA+STD WTS  S T EE
Sbjct: 618  FVGVSQLHSWEPNSADSTDCWTSR-SNTREE 647


>gb|KHG20408.1| hypothetical protein F383_04684 [Gossypium arboreum]
          Length = 647

 Score =  178 bits (451), Expect = 8e-42
 Identities = 129/286 (45%), Positives = 160/286 (55%), Gaps = 21/286 (7%)
 Frame = -2

Query: 1066 PHERTSNSSRGIREQLMTCRSMEQQQEPEIVEAAADSAKAGLPKRSEVSAENP--GAENL 893
            P   T+N  +G  +     +S  +      V+ +   A +      +V  + P  GA+NL
Sbjct: 377  PQSATTNKGQGGIK-----KSNVEMNHKASVQGSNHKAGSIATNVEDVKTQPPKAGADNL 431

Query: 892  IPQTITXXXXXXXXRDLDLVMGLNPDTLLNN-PSSYASLLLEDIQNYHCQN----VAFSL 728
             PQ +T        RDLDL    NP+ LLN  PSSY +LLLEDIQN+H  N      FSL
Sbjct: 432  KPQQLTRSRSSRRSRDLDL----NPEILLNPIPSSYTTLLLEDIQNFHQNNNNNPPQFSL 487

Query: 727  PPCVSKACSILEAVADLNSCTSSNISPAFSEEK--ISEPNRPNCYNNSISLNGRFNRKLM 554
            P CVSKACSILEAVADLNS TSSN+S A S+ K   ++ +  N YNN         RK+ 
Sbjct: 488  PACVSKACSILEAVADLNSTTSSNLSGALSDRKGPPTDDSNKNSYNNM-----TVGRKMT 542

Query: 553  -GKDPFVESEVVANDDDLMKPSLHKYVTVRE--PVRDMEQQESAGSNSFLG---HHW--- 401
               DPFVESEV+ + D LM+PS HKYVTVR      DME+QES+GSNS  G    HW   
Sbjct: 543  EAGDPFVESEVIGS-DHLMEPSFHKYVTVRRGGGGADMEEQESSGSNSIAGSGQQHWGFS 601

Query: 400  SSSWEPNSAESTDRWTSSLSYTGEEVEQS---EATGSFEAGAGRLK 272
            SSSWEPNSA+STDRW+S      E+       +     EAG+G  K
Sbjct: 602  SSSWEPNSADSTDRWSSRTKSRQEDYNSPLGLQRHAFAEAGSGMKK 647

