BLASTX nr result

ID: Papaver29_contig00006806 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver29_contig00006806
         (1002 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010266102.1| PREDICTED: large proline-rich protein bag6-B...   139   3e-30
ref|XP_010266100.1| PREDICTED: large proline-rich protein bag6-B...   139   3e-30
gb|KNA12098.1| hypothetical protein SOVF_129010 [Spinacia oleracea]   129   3e-27
gb|KRH06007.1| hypothetical protein GLYMA_17G261600 [Glycine max]     124   9e-26
gb|KHN48524.1| Large proline-rich protein BAG6 [Glycine soja]         124   9e-26
ref|XP_006601373.1| PREDICTED: large proline-rich protein BAG6-l...   124   9e-26
ref|XP_006601370.1| PREDICTED: large proline-rich protein BAG6-l...   124   9e-26
ref|XP_006601374.1| PREDICTED: large proline-rich protein BAG6-l...   124   2e-25
ref|XP_011099667.1| PREDICTED: large proline-rich protein BAG6 i...   123   3e-25
ref|XP_004291311.1| PREDICTED: large proline-rich protein BAG6 [...   122   4e-25
emb|CDP07318.1| unnamed protein product [Coffea canephora]            119   3e-24
ref|XP_011099675.1| PREDICTED: large proline-rich protein BAG6 i...   118   8e-24
ref|XP_007017960.1| Ubiquitin-like superfamily protein, putative...   116   2e-23
gb|KCW68033.1| hypothetical protein EUGRSUZ_F01713 [Eucalyptus g...   116   3e-23
ref|XP_010061138.1| PREDICTED: large proline-rich protein BAG6 [...   115   4e-23
gb|KCW68032.1| hypothetical protein EUGRSUZ_F01713 [Eucalyptus g...   115   7e-23
ref|XP_007017961.1| Ubiquitin-like superfamily protein, putative...   113   3e-22
ref|XP_012446475.1| PREDICTED: large proline-rich protein BAG6 i...   112   5e-22
gb|KRH17500.1| hypothetical protein GLYMA_14G223100 [Glycine max...   111   8e-22
gb|KRH17498.1| hypothetical protein GLYMA_14G223100 [Glycine max...   111   8e-22

>ref|XP_010266102.1| PREDICTED: large proline-rich protein bag6-B isoform X2 [Nelumbo
            nucifera]
          Length = 920

 Score =  139 bits (351), Expect = 3e-30
 Identities = 131/395 (33%), Positives = 182/395 (46%), Gaps = 63/395 (15%)
 Frame = -2

Query: 1001 TSSLFGGAAAT-QTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXG---------------- 873
            TSSLF GA+AT + N G   P GLGD PR++NIHIHA      G                
Sbjct: 376  TSSLFSGASATSRANPGNMGPFGLGDVPRNINIHIHAGTSLASGALSVGSRANAGESTHD 435

Query: 872  ----------RMLPVRSVLTTAVPSSRVPVDPSHTRSSNTTSI-----PRTEQSNPNAAS 738
                      R++PVR+V+  AVP S          ++N  S+      R++Q NPN ++
Sbjct: 436  GSGSGDSGPARVVPVRNVIAAAVPRSSA-------ETANVLSVIYPFHGRSQQINPNHSA 488

Query: 737  AAQGSNEGTPSV-------------GPSSASTIHP--MVSEINAQLRN------LVRDMG 621
              QGS+  +P+              G SS   IH   + S +++ L N       ++  G
Sbjct: 489  PIQGSSTSSPNSRQSDASGENQDPSGQSSTPIIHDSSVGSGVDSNLENHQPESMAIKGAG 548

Query: 620  GENVAPSGPSESPSNQGLATGSVSGDGAGNSQQERSHKD--DKSPCKEXXXXXXXXXXXX 447
            G   A SGP ++ + +G+ T           +Q  +++D  D S  +E            
Sbjct: 549  G--TALSGPKQNSAGEGIQTHH-------KHRQFNNNEDTIDNSSSREASSSNSGDGSAV 599

Query: 446  XXXATKQXXXXXXXXXXXXXXVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVP 267
                  +                 VP       LQPK +R RQ   Q ++ N   S  +P
Sbjct: 600  ----ASENVPRSSQSYDPPEGSNAVPLGLGLGGLQPK-RRSRQMRPQGRD-NSGISHAIP 653

Query: 266  TSNQNQQAITSGQQILQSLVARGSNA-RTDSNGVS-------DQIMGARPSAVPASNSQF 111
              N+NQ  I  GQ++LQSL+++ +NA R  +NG S        QIM + P     S+ Q 
Sbjct: 654  V-NENQSTIAGGQRVLQSLLSQSTNANRIGANGPSVQLPSVLGQIMESLPLQRQGSSGQV 712

Query: 110  DAAGMMSQVLNSPALNGLLSGVSEQAGMGSPDGLR 6
            DAAG MSQVLNSPALNGLL+GVSEQAG+GSP  LR
Sbjct: 713  DAAGAMSQVLNSPALNGLLAGVSEQAGIGSPAVLR 747


>ref|XP_010266100.1| PREDICTED: large proline-rich protein bag6-B isoform X1 [Nelumbo
            nucifera] gi|720032403|ref|XP_010266101.1| PREDICTED:
            large proline-rich protein bag6-B isoform X1 [Nelumbo
            nucifera]
          Length = 950

 Score =  139 bits (351), Expect = 3e-30
 Identities = 131/395 (33%), Positives = 182/395 (46%), Gaps = 63/395 (15%)
 Frame = -2

Query: 1001 TSSLFGGAAAT-QTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXG---------------- 873
            TSSLF GA+AT + N G   P GLGD PR++NIHIHA      G                
Sbjct: 406  TSSLFSGASATSRANPGNMGPFGLGDVPRNINIHIHAGTSLASGALSVGSRANAGESTHD 465

Query: 872  ----------RMLPVRSVLTTAVPSSRVPVDPSHTRSSNTTSI-----PRTEQSNPNAAS 738
                      R++PVR+V+  AVP S          ++N  S+      R++Q NPN ++
Sbjct: 466  GSGSGDSGPARVVPVRNVIAAAVPRSSA-------ETANVLSVIYPFHGRSQQINPNHSA 518

Query: 737  AAQGSNEGTPSV-------------GPSSASTIHP--MVSEINAQLRN------LVRDMG 621
              QGS+  +P+              G SS   IH   + S +++ L N       ++  G
Sbjct: 519  PIQGSSTSSPNSRQSDASGENQDPSGQSSTPIIHDSSVGSGVDSNLENHQPESMAIKGAG 578

Query: 620  GENVAPSGPSESPSNQGLATGSVSGDGAGNSQQERSHKD--DKSPCKEXXXXXXXXXXXX 447
            G   A SGP ++ + +G+ T           +Q  +++D  D S  +E            
Sbjct: 579  G--TALSGPKQNSAGEGIQTHH-------KHRQFNNNEDTIDNSSSREASSSNSGDGSAV 629

Query: 446  XXXATKQXXXXXXXXXXXXXXVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVP 267
                  +                 VP       LQPK +R RQ   Q ++ N   S  +P
Sbjct: 630  ----ASENVPRSSQSYDPPEGSNAVPLGLGLGGLQPK-RRSRQMRPQGRD-NSGISHAIP 683

Query: 266  TSNQNQQAITSGQQILQSLVARGSNA-RTDSNGVS-------DQIMGARPSAVPASNSQF 111
              N+NQ  I  GQ++LQSL+++ +NA R  +NG S        QIM + P     S+ Q 
Sbjct: 684  V-NENQSTIAGGQRVLQSLLSQSTNANRIGANGPSVQLPSVLGQIMESLPLQRQGSSGQV 742

Query: 110  DAAGMMSQVLNSPALNGLLSGVSEQAGMGSPDGLR 6
            DAAG MSQVLNSPALNGLL+GVSEQAG+GSP  LR
Sbjct: 743  DAAGAMSQVLNSPALNGLLAGVSEQAGIGSPAVLR 777


>gb|KNA12098.1| hypothetical protein SOVF_129010 [Spinacia oleracea]
          Length = 901

 Score =  129 bits (325), Expect = 3e-27
 Identities = 113/376 (30%), Positives = 164/376 (43%), Gaps = 44/376 (11%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGL-GDTPRHVNIHIHAXXXXXXG---------------- 873
            TS LF   A + TN G  VPVG+ G  PR++NIHIHA                       
Sbjct: 395  TSPLFNAPAVSATNTGTSVPVGIIGSAPRNINIHIHAGTPLAPLASAVGARTAGGEGTQG 454

Query: 872  -------------RMLPVRSVLTTAVPSSRVPVDPSHTRSSNTTSIPRTEQSNP-NAASA 735
                         R++ VR+V+  +VPS    +  S          P T  S P    S 
Sbjct: 455  EHGGVTAASDSGGRVISVRNVVAASVPSHTATIAVS----------PVTTASQPVGVVST 504

Query: 734  AQGSNEGTPSVGPSSASTIHPMVSEINAQLRNLVRDMGGENVAPSGPSESP----SNQGL 567
            +Q          P  A+++  ++S +N+Q+R+L+ +M G+N  PSG  ++P    S+ G 
Sbjct: 505  SQ----------PPDAASLSSVISHVNSQIRSLLENMRGDNQNPSGQQDNPVIPNSSAGS 554

Query: 566  ATGSVSGDGAGNSQQERSHKDDKSPCKEXXXXXXXXXXXXXXXATKQXXXXXXXXXXXXX 387
              G  + +   +S   +  + DK+  +E                 +              
Sbjct: 555  GGGREAHESHTSSLPRQQAEGDKTDRRENIQSDSSQSNE------ESPSCMSGDPAAKSI 608

Query: 386  XVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAITSGQQILQSLV 207
              K+VP        QPK +RGRQ   Q        SS     +Q+QQA  SGQQILQSL 
Sbjct: 609  GPKEVPLGLGGGL-QPK-RRGRQSQMQPVAGEAGPSSSF---DQSQQARISGQQILQSLA 663

Query: 206  ARGSNARTD---------SNGVSDQIMGARPSAVPASNSQFDAAGMMSQVLNSPALNGLL 54
            +R +  R+            G+   + G +      S+ QFD  G MSQ+L SPAL+GLL
Sbjct: 664  SRNTAGRSSLTNPGSGQTGQGIEQSVAGTQ--VTQGSDGQFDITGSMSQLLQSPALDGLL 721

Query: 53   SGVSEQAGMGSPDGLR 6
            SGV++QAG+GSP+ LR
Sbjct: 722  SGVAQQAGVGSPNVLR 737


>gb|KRH06007.1| hypothetical protein GLYMA_17G261600 [Glycine max]
          Length = 937

 Score =  124 bits (312), Expect = 9e-26
 Identities = 119/386 (30%), Positives = 171/386 (44%), Gaps = 54/386 (13%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPS-- 828
            TSSLFGG     T A +   +G+G+ PR+VNIHIHA        + P+ S + +   +  
Sbjct: 399  TSSLFGGPVPPSTPATLGT-IGIGNAPRNVNIHIHAGTS-----LAPIVSAIGSGANNGE 452

Query: 827  ---SRVPVDPSHTRSSNTTSIP------RTEQSNPNAASAAQGSNEG---TPSVGPSSAS 684
               S    +P    S +T  +P       T  S+P     +  +  G     S  PS ++
Sbjct: 453  GTRSEHRNEPGSGDSGSTRVLPVRNVIAATIPSHPPGVGISSSTQTGFGIPTSQPPSDSA 512

Query: 683  TIHPMVSEINAQLRNLVRDMGGENVAPSG---------PSESPS-----NQGLATGSVSG 546
            ++  +++EIN++LRN+V +M G+N  PSG         PS S S     N+   T  ++G
Sbjct: 513  SLSSVLAEINSRLRNVVGNMHGDNTVPSGQMESNSRDLPSGSESRPATVNEQRDTMDMNG 572

Query: 545  DGAGN-----------------------SQQERSHKDDK--SPCKEXXXXXXXXXXXXXX 441
             GA +                       S  ER    DK  S                  
Sbjct: 573  FGATSASSVGCTSESEVQKLQTKAVQTSSNDERDVLVDKFVSSSSNQDLRSCSSGETIVK 632

Query: 440  XATKQXXXXXXXXXXXXXXVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTS 261
               +Q               K  P       L+ K +R R      K A+D +SS   ++
Sbjct: 633  PEKEQDVPAVSERQNVTEPAKAAPLGLGVGGLERK-RRTRLQPPVSKGADDRSSSS--SA 689

Query: 260  NQNQQAITSGQQILQSLVARGSNART-DSNGVSDQIMGARPSAVPASNSQFDAAGMMSQV 84
            NQNQQ  T GQ ILQ+L + GS   + ++NG S +       ++P+S+   D AG+MSQ 
Sbjct: 690  NQNQQTRTDGQHILQTLASHGSGLNSRNANGPSQR-------SLPSSDRPIDVAGLMSQA 742

Query: 83   LNSPALNGLLSGVSEQAGMGSPDGLR 6
            L SPALNGLL GVS+Q G+ SPDGLR
Sbjct: 743  LRSPALNGLLEGVSQQTGVDSPDGLR 768


>gb|KHN48524.1| Large proline-rich protein BAG6 [Glycine soja]
          Length = 940

 Score =  124 bits (312), Expect = 9e-26
 Identities = 119/386 (30%), Positives = 171/386 (44%), Gaps = 54/386 (13%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPS-- 828
            TSSLFGG     T A +   +G+G+ PR+VNIHIHA        + P+ S + +   +  
Sbjct: 399  TSSLFGGPVPPSTPATLGT-IGIGNAPRNVNIHIHAGTS-----LAPIVSAIGSGANNGE 452

Query: 827  ---SRVPVDPSHTRSSNTTSIP------RTEQSNPNAASAAQGSNEG---TPSVGPSSAS 684
               S    +P    S +T  +P       T  S+P     +  +  G     S  PS ++
Sbjct: 453  GTRSEHRNEPGSGDSGSTRVLPVRNVIAATIPSHPPGVGISSSTQTGFGIPTSQPPSDSA 512

Query: 683  TIHPMVSEINAQLRNLVRDMGGENVAPSG---------PSESPS-----NQGLATGSVSG 546
            ++  +++EIN++LRN+V +M G+N  PSG         PS S S     N+   T  ++G
Sbjct: 513  SLSSVLAEINSRLRNVVGNMHGDNTVPSGQMESNSRDLPSGSESRPATVNEQRDTMDMNG 572

Query: 545  DGAGN-----------------------SQQERSHKDDK--SPCKEXXXXXXXXXXXXXX 441
             GA +                       S  ER    DK  S                  
Sbjct: 573  FGATSASSVGCTSESEVQKLQTKAVQTSSNDERDVLVDKFVSSSSNQDLRSCSSGETIVK 632

Query: 440  XATKQXXXXXXXXXXXXXXVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTS 261
               +Q               K  P       L+ K +R R      K A+D +SS   ++
Sbjct: 633  PEKEQDVPAVSERQNVTEPAKAAPLGLGVGGLERK-RRTRLQPPVSKGADDRSSSS--SA 689

Query: 260  NQNQQAITSGQQILQSLVARGSNART-DSNGVSDQIMGARPSAVPASNSQFDAAGMMSQV 84
            NQNQQ  T GQ ILQ+L + GS   + ++NG S +       ++P+S+   D AG+MSQ 
Sbjct: 690  NQNQQTRTDGQHILQTLASHGSGLNSRNANGPSQR-------SLPSSDRPIDVAGLMSQA 742

Query: 83   LNSPALNGLLSGVSEQAGMGSPDGLR 6
            L SPALNGLL GVS+Q G+ SPDGLR
Sbjct: 743  LRSPALNGLLEGVSQQTGVDSPDGLR 768


>ref|XP_006601373.1| PREDICTED: large proline-rich protein BAG6-like isoform X4 [Glycine
            max] gi|947056548|gb|KRH06001.1| hypothetical protein
            GLYMA_17G261600 [Glycine max] gi|947056549|gb|KRH06002.1|
            hypothetical protein GLYMA_17G261600 [Glycine max]
            gi|947056550|gb|KRH06003.1| hypothetical protein
            GLYMA_17G261600 [Glycine max]
          Length = 936

 Score =  124 bits (312), Expect = 9e-26
 Identities = 119/386 (30%), Positives = 171/386 (44%), Gaps = 54/386 (13%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPS-- 828
            TSSLFGG     T A +   +G+G+ PR+VNIHIHA        + P+ S + +   +  
Sbjct: 395  TSSLFGGPVPPSTPATLGT-IGIGNAPRNVNIHIHAGTS-----LAPIVSAIGSGANNGE 448

Query: 827  ---SRVPVDPSHTRSSNTTSIP------RTEQSNPNAASAAQGSNEG---TPSVGPSSAS 684
               S    +P    S +T  +P       T  S+P     +  +  G     S  PS ++
Sbjct: 449  GTRSEHRNEPGSGDSGSTRVLPVRNVIAATIPSHPPGVGISSSTQTGFGIPTSQPPSDSA 508

Query: 683  TIHPMVSEINAQLRNLVRDMGGENVAPSG---------PSESPS-----NQGLATGSVSG 546
            ++  +++EIN++LRN+V +M G+N  PSG         PS S S     N+   T  ++G
Sbjct: 509  SLSSVLAEINSRLRNVVGNMHGDNTVPSGQMESNSRDLPSGSESRPATVNEQRDTMDMNG 568

Query: 545  DGAGN-----------------------SQQERSHKDDK--SPCKEXXXXXXXXXXXXXX 441
             GA +                       S  ER    DK  S                  
Sbjct: 569  FGATSASSVGCTSESEVQKLQTKAVQTSSNDERDVLVDKFVSSSSNQDLRSCSSGETIVK 628

Query: 440  XATKQXXXXXXXXXXXXXXVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTS 261
               +Q               K  P       L+ K +R R      K A+D +SS   ++
Sbjct: 629  PEKEQDVPAVSERQNVTEPAKAAPLGLGVGGLERK-RRTRLQPPVSKGADDRSSSS--SA 685

Query: 260  NQNQQAITSGQQILQSLVARGSNART-DSNGVSDQIMGARPSAVPASNSQFDAAGMMSQV 84
            NQNQQ  T GQ ILQ+L + GS   + ++NG S +       ++P+S+   D AG+MSQ 
Sbjct: 686  NQNQQTRTDGQHILQTLASHGSGLNSRNANGPSQR-------SLPSSDRPIDVAGLMSQA 738

Query: 83   LNSPALNGLLSGVSEQAGMGSPDGLR 6
            L SPALNGLL GVS+Q G+ SPDGLR
Sbjct: 739  LRSPALNGLLEGVSQQTGVDSPDGLR 764


>ref|XP_006601370.1| PREDICTED: large proline-rich protein BAG6-like isoform X1 [Glycine
            max] gi|571539936|ref|XP_006601371.1| PREDICTED: large
            proline-rich protein BAG6-like isoform X2 [Glycine max]
            gi|571539940|ref|XP_006601372.1| PREDICTED: large
            proline-rich protein BAG6-like isoform X3 [Glycine max]
            gi|947056551|gb|KRH06004.1| hypothetical protein
            GLYMA_17G261600 [Glycine max] gi|947056552|gb|KRH06005.1|
            hypothetical protein GLYMA_17G261600 [Glycine max]
            gi|947056553|gb|KRH06006.1| hypothetical protein
            GLYMA_17G261600 [Glycine max]
          Length = 940

 Score =  124 bits (312), Expect = 9e-26
 Identities = 119/386 (30%), Positives = 171/386 (44%), Gaps = 54/386 (13%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPS-- 828
            TSSLFGG     T A +   +G+G+ PR+VNIHIHA        + P+ S + +   +  
Sbjct: 399  TSSLFGGPVPPSTPATLGT-IGIGNAPRNVNIHIHAGTS-----LAPIVSAIGSGANNGE 452

Query: 827  ---SRVPVDPSHTRSSNTTSIP------RTEQSNPNAASAAQGSNEG---TPSVGPSSAS 684
               S    +P    S +T  +P       T  S+P     +  +  G     S  PS ++
Sbjct: 453  GTRSEHRNEPGSGDSGSTRVLPVRNVIAATIPSHPPGVGISSSTQTGFGIPTSQPPSDSA 512

Query: 683  TIHPMVSEINAQLRNLVRDMGGENVAPSG---------PSESPS-----NQGLATGSVSG 546
            ++  +++EIN++LRN+V +M G+N  PSG         PS S S     N+   T  ++G
Sbjct: 513  SLSSVLAEINSRLRNVVGNMHGDNTVPSGQMESNSRDLPSGSESRPATVNEQRDTMDMNG 572

Query: 545  DGAGN-----------------------SQQERSHKDDK--SPCKEXXXXXXXXXXXXXX 441
             GA +                       S  ER    DK  S                  
Sbjct: 573  FGATSASSVGCTSESEVQKLQTKAVQTSSNDERDVLVDKFVSSSSNQDLRSCSSGETIVK 632

Query: 440  XATKQXXXXXXXXXXXXXXVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTS 261
               +Q               K  P       L+ K +R R      K A+D +SS   ++
Sbjct: 633  PEKEQDVPAVSERQNVTEPAKAAPLGLGVGGLERK-RRTRLQPPVSKGADDRSSSS--SA 689

Query: 260  NQNQQAITSGQQILQSLVARGSNART-DSNGVSDQIMGARPSAVPASNSQFDAAGMMSQV 84
            NQNQQ  T GQ ILQ+L + GS   + ++NG S +       ++P+S+   D AG+MSQ 
Sbjct: 690  NQNQQTRTDGQHILQTLASHGSGLNSRNANGPSQR-------SLPSSDRPIDVAGLMSQA 742

Query: 83   LNSPALNGLLSGVSEQAGMGSPDGLR 6
            L SPALNGLL GVS+Q G+ SPDGLR
Sbjct: 743  LRSPALNGLLEGVSQQTGVDSPDGLR 768


>ref|XP_006601374.1| PREDICTED: large proline-rich protein BAG6-like isoform X5 [Glycine
            max]
          Length = 931

 Score =  124 bits (310), Expect = 2e-25
 Identities = 121/393 (30%), Positives = 168/393 (42%), Gaps = 61/393 (15%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXG----------------- 873
            TSSLFGG     T A +   +G+G+ PR+VNIHIHA                        
Sbjct: 399  TSSLFGGPVPPSTPATLGT-IGIGNAPRNVNIHIHAAIGSGANNGEGTRSEHRNEPGSGD 457

Query: 872  ----RMLPVRSVLTTAVPSSRVPVDPSHTRSSNTTSIPRTEQSNPNAASAAQGSNEGTPS 705
                R+LPVR+V+   +PS      P    SS+T +      S P               
Sbjct: 458  SGSTRVLPVRNVIAATIPSH----PPGVGISSSTQTGFGIPTSQP--------------- 498

Query: 704  VGPSSASTIHPMVSEINAQLRNLVRDMGGENVAPSG---------PSESPS-----NQGL 567
              PS ++++  +++EIN++LRN+V +M G+N  PSG         PS S S     N+  
Sbjct: 499  --PSDSASLSSVLAEINSRLRNVVGNMHGDNTVPSGQMESNSRDLPSGSESRPATVNEQR 556

Query: 566  ATGSVSGDGAGN-----------------------SQQERSHKDDK--SPCKEXXXXXXX 462
             T  ++G GA +                       S  ER    DK  S           
Sbjct: 557  DTMDMNGFGATSASSVGCTSESEVQKLQTKAVQTSSNDERDVLVDKFVSSSSNQDLRSCS 616

Query: 461  XXXXXXXXATKQXXXXXXXXXXXXXXVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDAT 282
                      +Q               K  P       L+ K +R R      K A+D +
Sbjct: 617  SGETIVKPEKEQDVPAVSERQNVTEPAKAAPLGLGVGGLERK-RRTRLQPPVSKGADDRS 675

Query: 281  SSGVPTSNQNQQAITSGQQILQSLVARGSNART-DSNGVSDQIMGARPSAVPASNSQFDA 105
            SS   ++NQNQQ  T GQ ILQ+L + GS   + ++NG S +       ++P+S+   D 
Sbjct: 676  SSS--SANQNQQTRTDGQHILQTLASHGSGLNSRNANGPSQR-------SLPSSDRPIDV 726

Query: 104  AGMMSQVLNSPALNGLLSGVSEQAGMGSPDGLR 6
            AG+MSQ L SPALNGLL GVS+Q G+ SPDGLR
Sbjct: 727  AGLMSQALRSPALNGLLEGVSQQTGVDSPDGLR 759


>ref|XP_011099667.1| PREDICTED: large proline-rich protein BAG6 isoform X1 [Sesamum
            indicum]
          Length = 910

 Score =  123 bits (308), Expect = 3e-25
 Identities = 112/371 (30%), Positives = 162/371 (43%), Gaps = 39/371 (10%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPSS- 825
            TSSLFG ++    +     PVG+   PR+VNIHIH         + P+ S L    P+  
Sbjct: 389  TSSLFGASSGVSPSPMAMGPVGVASIPRNVNIHIHTGAS-----LAPMVSQLGNRAPNGE 443

Query: 824  -----RVPVDPS----HTRSSNTTSIPRTEQSNPNAASAAQGSNEGTPSVGPSSASTIHP 672
                 R  V  S      R S+   +     S P   S +     GT       +++I  
Sbjct: 444  GTQEERANVSESSDFGQARGSSGVGVTANVVSQPAMVSISGALAPGTGVQQTPDSNSISS 503

Query: 671  MVSEINAQLRNLVRDMGGENVAPSGPSESPSNQGLATGSVSGDGAGNSQQERSHKDDKSP 492
            +V+EINAQ+R+L+ +M  +  APSG +E  +NQ    GS    G   S+Q  +     SP
Sbjct: 504  VVAEINAQMRSLLSNMQNDQ-APSGQTEDSANQDQPVGS----GENTSRQRTAEASHTSP 558

Query: 491  -------------CKEXXXXXXXXXXXXXXXATKQXXXXXXXXXXXXXXVKD-------- 375
                         C +                +                  D        
Sbjct: 559  HVSMMDDQKAQTACNQPDIKGKGAGVGSVSEPSISSEGGSDKRPTASDRGDDNDNADSPS 618

Query: 374  -VPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAITSGQQILQSLVA-- 204
             +P       LQPK +RGRQ  +Q K+ +++++S     +QNQQ+   GQ +LQSL +  
Sbjct: 619  GIPLGLGLGGLQPK-RRGRQQKAQTKDTDNSSAS-----SQNQQSRAVGQHVLQSLASLP 672

Query: 203  -RGSNARTDSNGVSDQIMGARPSAVPASNS----QFDAAGMMSQVLNSPALNGLLSGVSE 39
             RG+     S   SD   G    +VP ++     Q D A  MSQ+L+SPAL+GLLSGVS+
Sbjct: 673  TRGNRNPLPSRQSSDIARGGGGQSVPTASQNADGQGDVADAMSQILHSPALDGLLSGVSQ 732

Query: 38   QAGMGSPDGLR 6
            Q G+G+PD LR
Sbjct: 733  QTGVGTPDMLR 743


>ref|XP_004291311.1| PREDICTED: large proline-rich protein BAG6 [Fragaria vesca subsp.
            vesca]
          Length = 931

 Score =  122 bits (306), Expect = 4e-25
 Identities = 112/374 (29%), Positives = 159/374 (42%), Gaps = 61/374 (16%)
 Frame = -2

Query: 944  PVGLGDTPRHVNIHIHAXXXXXXG--------------------------RMLPVRSVLT 843
            PVG+G  PR+VNIHIHA                                 R+LPVR+V+ 
Sbjct: 424  PVGIGSAPRNVNIHIHAGTSLSALGARGSNGEGMQGEHRNGPGSRDSGAVRVLPVRNVIA 483

Query: 842  TAVPSSRVPVDPSH-TRSSNTTSIPRTEQSNPNAASAAQGSNEGTPSVGPSSASTIHPMV 666
            T +PSS+  +  S  T+  +  S+P+                       P S S++  +V
Sbjct: 484  TTIPSSQTGISMSSATQPGSGVSVPQ-----------------------PPSDSSLSSIV 520

Query: 665  SEINAQLRNLVRDMGGENVAPSGPS----ESPS----------NQGLATGSVSGDGAGNS 528
            +E+N+Q+RNLV +  G +   SG +    ++PS          N+ L+   V+G    N+
Sbjct: 521  AELNSQIRNLVGNNQGNDAVQSGQAVPNVQNPSAGIESRNNTGNEQLSNSDVNGGLQSNA 580

Query: 527  QQERSHKDDK--------SPCKEXXXXXXXXXXXXXXXAT---KQXXXXXXXXXXXXXXV 381
               RS  + +         P K+                    K                
Sbjct: 581  SLPRSTSESEVQKASGSVPPLKDDSKFQARDSLSSGQMPCQDDKGNTSQTAAKQGMTEGA 640

Query: 380  KDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAITSGQQILQSLVAR 201
            K VP       ++ K ++GRQ  +  +N++  T+S   +SNQNQQ +TS QQ+LQSL  R
Sbjct: 641  KAVPLGLGLGMMERK-RQGRQQKTPQENSDSGTTSS--SSNQNQQ-VTSAQQLLQSLATR 696

Query: 200  ---GSNARTDSNGVSD------QIMGARPSAVPASNSQFDAAGMMSQVLNSPALNGLLSG 48
               GS   T             Q+   R S V     Q D   +MSQVL SPALNGLL+G
Sbjct: 697  STAGSRVSTIDTPARQAAPNVGQVRDGRSSGVQGPGGQVDMGSVMSQVLQSPALNGLLTG 756

Query: 47   VSEQAGMGSPDGLR 6
            VSEQ G+GSPD LR
Sbjct: 757  VSEQTGVGSPDALR 770


>emb|CDP07318.1| unnamed protein product [Coffea canephora]
          Length = 936

 Score =  119 bits (299), Expect = 3e-24
 Identities = 115/384 (29%), Positives = 165/384 (42%), Gaps = 52/384 (13%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXG------RMLPVRSVLTT 840
            TSSLFG +A+  +N G   PVG+G+  RHVNIHIH              RM      L  
Sbjct: 395  TSSLFGSSASAPSNPGAFGPVGIGNISRHVNIHIHTGTPLGPFVSGVGARMNNGEGTLGE 454

Query: 839  AVPSSRVPVDPSHTRSSNTTSIPRTEQSNPNAASAAQGSNEGTPSVGPSSASTIHPM--- 669
               +     + + +R    T++  T      A  A  G+ E  PSVG S    + P+   
Sbjct: 455  RA-NGTASGESAQSRVQGVTNVNTTAVPLRPAVVAVSGTLE--PSVGVSLPPDLFPLSTV 511

Query: 668  VSEINAQLRNLVRDMGGENVAPSGPS--------------ESPSNQ--GLATGSVSGD-- 543
            V E+N+Q+RN V ++ G + A S  S              E  SN+   +++G   G+  
Sbjct: 512  VPEVNSQIRNFVGNIRGGHQASSESSTVQERAVGAAAAGDEGRSNEQNNISSGHGFGETS 571

Query: 542  ----GAGNSQQERSHKDDKSPCKEXXXXXXXXXXXXXXXA---------------TKQXX 420
                G  N+  + +      P                  +               T +  
Sbjct: 572  QLFPGVSNTTNQETQPSGHQPSNSKDSGVAVNPKYEPSSSSLGGSNEPSSTPVVVTVEGA 631

Query: 419  XXXXXXXXXXXXVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAI 240
                            P       LQPK +R RQ  SQ    +++ SS + TSNQ +Q  
Sbjct: 632  SSSSQAMDTTGGSSTTPLGLGLGSLQPK-RRSRQSRSQ----SNSGSSSLVTSNQTEQPR 686

Query: 239  TSGQQILQSL--VARGSNARTDSNGVSDQ----IMGARPSAVPASNSQFDAAGMMSQVLN 78
             +GQQ+LQSL  +A  SN  T ++G   Q    ++ + P A   ++ QFD    MSQVL 
Sbjct: 687  IAGQQVLQSLASLAARSNGNTQASGQLSQPAGVVVDSLPPAEENADGQFDIGNAMSQVLQ 746

Query: 77   SPALNGLLSGVSEQAGMGSPDGLR 6
            SPALNGLL+GVS+Q G+GSP+ LR
Sbjct: 747  SPALNGLLAGVSQQTGIGSPNALR 770


>ref|XP_011099675.1| PREDICTED: large proline-rich protein BAG6 isoform X2 [Sesamum
            indicum]
          Length = 907

 Score =  118 bits (295), Expect = 8e-24
 Identities = 110/371 (29%), Positives = 159/371 (42%), Gaps = 39/371 (10%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPSS- 825
            TSSLFG ++    +     PVG+   PR+VNIHIH         + P+ S L    P+  
Sbjct: 389  TSSLFGASSGVSPSPMAMGPVGVASIPRNVNIHIHTGAS-----LAPMVSQLGNRAPNGE 443

Query: 824  -----RVPVDPS----HTRSSNTTSIPRTEQSNPNAASAAQGSNEGTPSVGPSSASTIHP 672
                 R  V  S      R S+   +     S P   S +     GT       +++I  
Sbjct: 444  GTQEERANVSESSDFGQARGSSGVGVTANVVSQPAMVSISGALAPGTGVQQTPDSNSISS 503

Query: 671  MVSEINAQLRNLVRDMGGENVAPSGPSESPSNQGLATGSVSGDGAGNSQQERSHKDDKSP 492
            +V+EINAQ+R+L+ +M  +      PSE  +NQ    GS    G   S+Q  +     SP
Sbjct: 504  VVAEINAQMRSLLSNMQNDQA----PSEDSANQDQPVGS----GENTSRQRTAEASHTSP 555

Query: 491  -------------CKEXXXXXXXXXXXXXXXATKQXXXXXXXXXXXXXXVKD-------- 375
                         C +                +                  D        
Sbjct: 556  HVSMMDDQKAQTACNQPDIKGKGAGVGSVSEPSISSEGGSDKRPTASDRGDDNDNADSPS 615

Query: 374  -VPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAITSGQQILQSLVA-- 204
             +P       LQPK +RGRQ  +Q K+ +++++S     +QNQQ+   GQ +LQSL +  
Sbjct: 616  GIPLGLGLGGLQPK-RRGRQQKAQTKDTDNSSAS-----SQNQQSRAVGQHVLQSLASLP 669

Query: 203  -RGSNARTDSNGVSDQIMGARPSAVPASNS----QFDAAGMMSQVLNSPALNGLLSGVSE 39
             RG+     S   SD   G    +VP ++     Q D A  MSQ+L+SPAL+GLLSGVS+
Sbjct: 670  TRGNRNPLPSRQSSDIARGGGGQSVPTASQNADGQGDVADAMSQILHSPALDGLLSGVSQ 729

Query: 38   QAGMGSPDGLR 6
            Q G+G+PD LR
Sbjct: 730  QTGVGTPDMLR 740


>ref|XP_007017960.1| Ubiquitin-like superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|508723288|gb|EOY15185.1| Ubiquitin-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 914

 Score =  116 bits (291), Expect = 2e-23
 Identities = 110/359 (30%), Positives = 158/359 (44%), Gaps = 27/359 (7%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPSSR 822
            TSSLF G+ +  +N     PVG+G  PRH+NIHIHA        + P+ S +     S+ 
Sbjct: 401  TSSLFSGSHSP-SNPPTLGPVGVGTAPRHINIHIHAGTA-----LAPIISAVGNRT-SNG 453

Query: 821  VPVDPSHTRSSNTTSIPRTEQSNPNAASAAQGSNEGTPSVGPSSA--STIHPMVSEINAQ 648
              V      ++ + S+      N  AA+          S   S+   S+I  +V+E+N++
Sbjct: 454  EGVQGERGNNAGSGSMRVLPVRNVLAAAVPARPTGAVSSAAQSAPTDSSISSIVAEVNSR 513

Query: 647  LRNLVRDMGGENVAPSGPSESPSNQGLATGSVSGDGAGNS--------------QQERSH 510
            LRN V +M G N   SG  +         G+V+  GAG+S              + +  H
Sbjct: 514  LRNFVSNMQGGNQVASGNGQP--------GNVAVSGAGDSSVALPADILQTEEQKSQPQH 565

Query: 509  KDDKSPCKEXXXXXXXXXXXXXXXA--------TKQXXXXXXXXXXXXXXVKDVPXXXXX 354
             +  +   E                         K                K VP     
Sbjct: 566  AEGSNNIMESGVSSKDVSTGTVECPPSSSGELLVKSEDPSGSVLRSGEDNAKAVPLGLGL 625

Query: 353  XXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAITSGQQILQSLVARGSNART--- 183
              L+ K KR +Q  S V   +  T+S   + +QN    T+GQQILQSLV+R S+      
Sbjct: 626  GGLERK-KRIKQTKSPVSTGDSGTTSS--SLDQNLSVRTTGQQILQSLVSRSSSVNRVEH 682

Query: 182  DSNGVSDQIMGARPSAVPASNSQFDAAGMMSQVLNSPALNGLLSGVSEQAGMGSPDGLR 6
            D++  +  +  +R S    S+ Q DAA  +SQVL SPALNGLL+GVSEQ G+GSPD  R
Sbjct: 683  DASPSNPGVQSSRLSGGQGSDDQLDAANAVSQVLQSPALNGLLAGVSEQTGVGSPDVFR 741


>gb|KCW68033.1| hypothetical protein EUGRSUZ_F01713 [Eucalyptus grandis]
          Length = 938

 Score =  116 bits (290), Expect = 3e-23
 Identities = 113/372 (30%), Positives = 159/372 (42%), Gaps = 40/372 (10%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPSSR 822
            T+SLFGG            PVG+G  PRH+NIHIHA        + P+ S L +   +S 
Sbjct: 406  TNSLFGGPVPPSNPVAFG-PVGIGSPPRHINIHIHAGGSS----LAPIVSALGSRASNSE 460

Query: 821  VPVDPSHTRSSNTTSIPRTEQSNPNAASA-----------AQGSNEGTPSVGPSSAST-I 678
              +      + +  S P    S+  AAS            A  SN       P+S S   
Sbjct: 461  GMLGQRGDTTGSGGSGPSHPASSVAAASFLARPSGLGGSHAPISNVAVSMTQPNSESAPA 520

Query: 677  HPMVSEINAQLRNLVRDMG-GENVAPSGPSESPSNQGLATGSVSGD-------------- 543
              ++SEIN++LRNL  ++  G N   S      + +  A GS                  
Sbjct: 521  SSLISEINSRLRNLAGNLQEGRNQMQSAAQAELNVENPAVGSAEATVQSDNVAVDRVGEP 580

Query: 542  --GAGNSQQERSHK--DDKSPCKEXXXXXXXXXXXXXXXATKQXXXXXXXXXXXXXXVKD 375
               +G S++E  HK  D  S  +                   +                 
Sbjct: 581  AVSSGASREESEHKSMDVTSRGEPSCPSGSSPTCSSEDAIIVKEGAASHNEEPDVCGESH 640

Query: 374  VPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAITSGQQILQSLVARGS 195
            VP       L+ K +RGRQ     K  + ++S+ +   +QNQ   TSGQQILQSL+++G 
Sbjct: 641  VPKGLGLGSLERK-RRGRQSKPPAKEDSGSSSASL---DQNQLIRTSGQQILQSLLSQGL 696

Query: 194  NA-RTDSNG--------VSDQIMGARPSAVPASNSQFDAAGMMSQVLNSPALNGLLSGVS 42
            +A R ++N          S Q+  +        + Q D  G+MSQVL+SPALNGLL+GVS
Sbjct: 697  SADRAEANNQPLGASAVASAQVTESLSQGQQGPDGQVDMGGLMSQVLHSPALNGLLAGVS 756

Query: 41   EQAGMGSPDGLR 6
            EQ G+GSPD LR
Sbjct: 757  EQTGVGSPDALR 768


>ref|XP_010061138.1| PREDICTED: large proline-rich protein BAG6 [Eucalyptus grandis]
            gi|629102565|gb|KCW68034.1| hypothetical protein
            EUGRSUZ_F01713 [Eucalyptus grandis]
          Length = 939

 Score =  115 bits (289), Expect = 4e-23
 Identities = 113/373 (30%), Positives = 159/373 (42%), Gaps = 41/373 (10%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPSSR 822
            T+SLFGG            PVG+G  PRH+NIHIHA        + P+ S L +   +S 
Sbjct: 406  TNSLFGGPVPPSNPVAFG-PVGIGSPPRHINIHIHAGGSS----LAPIVSALGSRASNSE 460

Query: 821  VPVDPSHTRSSNTTSIPRTEQSNPNAASA-----------AQGSNEGTPSVGPSSAST-I 678
              +      + +  S P    S+  AAS            A  SN       P+S S   
Sbjct: 461  GMLGQRGDTTGSGGSGPSHPASSVAAASFLARPSGLGGSHAPISNVAVSMTQPNSESAPA 520

Query: 677  HPMVSEINAQLRNLVRDMG-GENVAPSGPSESPSNQGLATGSVSGD-------------- 543
              ++SEIN++LRNL  ++  G N   S      + +  A GS                  
Sbjct: 521  SSLISEINSRLRNLAGNLQEGRNQMQSAAQAELNVENPAVGSAEATVQSDNVAVDRVGEP 580

Query: 542  --GAGNSQQERSHK---DDKSPCKEXXXXXXXXXXXXXXXATKQXXXXXXXXXXXXXXVK 378
               +G S++E  HK   D  S  +                   +                
Sbjct: 581  AVSSGASREESEHKESMDVTSRGEPSCPSGSSPTCSSEDAIIVKEGAASHNEEPDVCGES 640

Query: 377  DVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAITSGQQILQSLVARG 198
             VP       L+ K +RGRQ     K  + ++S+ +   +QNQ   TSGQQILQSL+++G
Sbjct: 641  HVPKGLGLGSLERK-RRGRQSKPPAKEDSGSSSASL---DQNQLIRTSGQQILQSLLSQG 696

Query: 197  SNA-RTDSNG--------VSDQIMGARPSAVPASNSQFDAAGMMSQVLNSPALNGLLSGV 45
             +A R ++N          S Q+  +        + Q D  G+MSQVL+SPALNGLL+GV
Sbjct: 697  LSADRAEANNQPLGASAVASAQVTESLSQGQQGPDGQVDMGGLMSQVLHSPALNGLLAGV 756

Query: 44   SEQAGMGSPDGLR 6
            SEQ G+GSPD LR
Sbjct: 757  SEQTGVGSPDALR 769


>gb|KCW68032.1| hypothetical protein EUGRSUZ_F01713 [Eucalyptus grandis]
          Length = 930

 Score =  115 bits (287), Expect = 7e-23
 Identities = 115/366 (31%), Positives = 155/366 (42%), Gaps = 34/366 (9%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGR----MLPVRSVLTTAV 834
            T+SLFGG            PVG+G  PRH+NIHIHA            ML  R   T + 
Sbjct: 406  TNSLFGGPVPPSNPVAFG-PVGIGSPPRHINIHIHAGALGSRASNSEGMLGQRGDTTGSG 464

Query: 833  PSSRVPVDPSHTRSSNTTSIPRTEQSNPNAASAAQGSNEGTPSVGPSSAST-IHPMVSEI 657
             S      PSH  SS   +      S     S A  SN       P+S S     ++SEI
Sbjct: 465  GSG-----PSHPASSVAAASFLARPSGLGG-SHAPISNVAVSMTQPNSESAPASSLISEI 518

Query: 656  NAQLRNLVRDMG-GENVAPSGPSESPSNQGLATGSVSGD----------------GAGNS 528
            N++LRNL  ++  G N   S      + +  A GS                     +G S
Sbjct: 519  NSRLRNLAGNLQEGRNQMQSAAQAELNVENPAVGSAEATVQSDNVAVDRVGEPAVSSGAS 578

Query: 527  QQERSHK---DDKSPCKEXXXXXXXXXXXXXXXATKQXXXXXXXXXXXXXXVKDVPXXXX 357
            ++E  HK   D  S  +                   +                 VP    
Sbjct: 579  REESEHKESMDVTSRGEPSCPSGSSPTCSSEDAIIVKEGAASHNEEPDVCGESHVPKGLG 638

Query: 356  XXXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAITSGQQILQSLVARGSNA-RTD 180
               L+ K +RGRQ     K  + ++S+ +   +QNQ   TSGQQILQSL+++G +A R +
Sbjct: 639  LGSLERK-RRGRQSKPPAKEDSGSSSASL---DQNQLIRTSGQQILQSLLSQGLSADRAE 694

Query: 179  SNG--------VSDQIMGARPSAVPASNSQFDAAGMMSQVLNSPALNGLLSGVSEQAGMG 24
            +N          S Q+  +        + Q D  G+MSQVL+SPALNGLL+GVSEQ G+G
Sbjct: 695  ANNQPLGASAVASAQVTESLSQGQQGPDGQVDMGGLMSQVLHSPALNGLLAGVSEQTGVG 754

Query: 23   SPDGLR 6
            SPD LR
Sbjct: 755  SPDALR 760


>ref|XP_007017961.1| Ubiquitin-like superfamily protein, putative isoform 2 [Theobroma
            cacao] gi|508723289|gb|EOY15186.1| Ubiquitin-like
            superfamily protein, putative isoform 2 [Theobroma cacao]
          Length = 879

 Score =  113 bits (282), Expect = 3e-22
 Identities = 110/357 (30%), Positives = 155/357 (43%), Gaps = 25/357 (7%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPSSR 822
            TSSLF G+ +  +N     PVG+G  PRH+NIHIHA        + P+ S +     S+ 
Sbjct: 401  TSSLFSGSHSP-SNPPTLGPVGVGTAPRHINIHIHAGTA-----LAPIISAVGNRT-SNG 453

Query: 821  VPVDPSHTRSSNTTSIPRTEQSNPNAASAAQGSNEGTPSVGPSSA--STIHPMVSEINAQ 648
              V      ++ + S+      N  AA+          S   S+   S+I  +V+E+N++
Sbjct: 454  EGVQGERGNNAGSGSMRVLPVRNVLAAAVPARPTGAVSSAAQSAPTDSSISSIVAEVNSR 513

Query: 647  LRNLVRDMGGENVAPSGPSESPSN---QGLATGSVS-----------------GDGAGNS 528
            LRN V +M G N   SG  + P N    G    SV+                  +G+ N 
Sbjct: 514  LRNFVSNMQGGNQVASGNGQ-PGNVAVSGAGDSSVALPADILQTEEQKSQPQHAEGSNNI 572

Query: 527  QQERSHKDDKSPCKEXXXXXXXXXXXXXXXATKQXXXXXXXXXXXXXXVKDVPXXXXXXX 348
             +      D S                     K                K VP       
Sbjct: 573  MESGVSSKDVST-----GTVECPPSSSGELLVKSEDPSGSVLRSGEDNAKAVPLGLGLGG 627

Query: 347  LQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAITSGQQILQSLVARGSNART---DS 177
            L+ K  R +Q  S V   +  T+S   + +QN    T+GQQILQSLV+R S+      D+
Sbjct: 628  LERK--RIKQTKSPVSTGDSGTTSS--SLDQNLSVRTTGQQILQSLVSRSSSVNRVEHDA 683

Query: 176  NGVSDQIMGARPSAVPASNSQFDAAGMMSQVLNSPALNGLLSGVSEQAGMGSPDGLR 6
            +  +  +  +R S    S+ Q DAA  +SQVL SPALNGLL+GVSEQ G+GSPD  R
Sbjct: 684  SPSNPGVQSSRLSGGQGSDDQLDAANAVSQVLQSPALNGLLAGVSEQTGVGSPDVFR 740


>ref|XP_012446475.1| PREDICTED: large proline-rich protein BAG6 isoform X2 [Gossypium
            raimondii] gi|763792712|gb|KJB59708.1| hypothetical
            protein B456_009G268000 [Gossypium raimondii]
          Length = 880

 Score =  112 bits (280), Expect = 5e-22
 Identities = 112/354 (31%), Positives = 164/354 (46%), Gaps = 22/354 (6%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVL--TTAVPS 828
            T SLF G+ +   +  +  PVG+G+ PRH+NIHIH         + PV S +   T    
Sbjct: 392  TRSLFSGSHSPSNSLTVG-PVGVGNAPRHINIHIHPGTA-----LSPVVSAVGNRTNNGE 445

Query: 827  SRVPVDPSHTRSSNTTSIP--RTEQSNPNAASAAQGSNEGTPSVGPSSASTIHPMVSEIN 654
             R     ++  S +   +P   T  S P A + A  S+    ++  SS S+   MV+EIN
Sbjct: 446  GRQGERGNNAGSGSMRVLPVRNTVASAPQARATAAMSSAAQSALTESSLSS---MVAEIN 502

Query: 653  AQLRNLVRDMGGENVAPSGPSESPSNQGLATGSVSGDGAG-----------NSQQERSH- 510
            +++R+LV  M G+N   SG S+ P+N      S +GD               SQ E +  
Sbjct: 503  SRIRDLV-SMQGDNQDASG-SQQPNNM---VASGAGDSTVALPANLETEELKSQPEHAEG 557

Query: 509  KDDKSPCKEXXXXXXXXXXXXXXXAT-----KQXXXXXXXXXXXXXXVKDVPXXXXXXXL 345
            ++D +   E               ++     K                K VP       L
Sbjct: 558  RNDNTESGESSQDISLGTVGCPPSSSGEPLVKLEDPSGSAPRSSEENAKPVPLGLGLGGL 617

Query: 344  QPKLKRGRQGSSQVKNANDATSSGVPTSNQNQQAITSGQQILQSLVARGSNA-RTDSNGV 168
            + K +R +   S +   N   SS +   +QN  A  +GQQILQSL ++ S+  R DS+  
Sbjct: 618  ERK-RRVKPTKSSIAGVNGTASSSL---DQNLSARMAGQQILQSLASQSSSLNRVDSSSG 673

Query: 167  SDQIMGARPSAVPASNSQFDAAGMMSQVLNSPALNGLLSGVSEQAGMGSPDGLR 6
            +  ++G+R S    S+ Q  AA  +SQVL SPALNGLL+GVS+Q G GSPD  R
Sbjct: 674  NQGVLGSRLSGGQGSDDQLAAANAVSQVLQSPALNGLLAGVSQQTGAGSPDDFR 727


>gb|KRH17500.1| hypothetical protein GLYMA_14G223100 [Glycine max]
            gi|947068358|gb|KRH17501.1| hypothetical protein
            GLYMA_14G223100 [Glycine max]
          Length = 922

 Score =  111 bits (278), Expect = 8e-22
 Identities = 111/382 (29%), Positives = 165/382 (43%), Gaps = 50/382 (13%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPS-- 828
            TSSLFGG     T A +   +G+G+ PR+VNIHIHA        + P+ S + +   +  
Sbjct: 399  TSSLFGGPVPPSTPATLGT-IGIGNAPRNVNIHIHAGTS-----LAPIVSAIGSRENNGE 452

Query: 827  ---SRVPVDPSHTRSSNTTSIP------RTEQSNPNAASAAQGSNEG---TPSVGPSSAS 684
               S    +P    S +T  +P       T  S+P     +  +  G   + S  PS ++
Sbjct: 453  GTRSEHHNEPGSGDSGSTRVLPVRNVIAATIPSHPPGVGVSSSTQTGFGISTSQPPSDSA 512

Query: 683  TIHPMVSEINAQLRNLVRDM--------------------GGENVAPSGPSESPSNQGLA 564
            ++  +++EIN++LRN+V +M                    G E+  P+   +  +     
Sbjct: 513  SLSSVLAEINSRLRNVVGNMQGDNTVPSGQMESNSRDLSSGSESRPPTVNKQQDTVDVNG 572

Query: 563  TGSVSGDGAGNSQQERSHKD-------------DK--SPCKEXXXXXXXXXXXXXXXATK 429
             G++S    G + +    K              DK  S                     +
Sbjct: 573  FGAISASSVGCTSESEVQKVQTEAVQTSSNVLVDKFVSSSSNQDLQSCSSGETIVKPEIE 632

Query: 428  QXXXXXXXXXXXXXXVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQ 249
            Q               K  P       L+ K +R R      K A+D +SS   + NQNQ
Sbjct: 633  QDVLAVSERQNVTEPAKAAPLGLGVGGLERK-RRTRLQPPVSKGADDGSSSS--SVNQNQ 689

Query: 248  QAITSGQQILQSLVARGSNART-DSNGVSDQIMGARPSAVPASNSQFDAAGMMSQVLNSP 72
            Q  T GQ ILQ+L + GS   + ++NG S      RP  +P+S+   D AG+MSQ L+SP
Sbjct: 690  QTRTDGQHILQTLASHGSGLNSRNANGPSQ-----RP--LPSSDRPIDVAGLMSQALHSP 742

Query: 71   ALNGLLSGVSEQAGMGSPDGLR 6
            ALNGLL GVS+Q G+ SPDGLR
Sbjct: 743  ALNGLLEGVSQQTGVDSPDGLR 764


>gb|KRH17498.1| hypothetical protein GLYMA_14G223100 [Glycine max]
            gi|947068356|gb|KRH17499.1| hypothetical protein
            GLYMA_14G223100 [Glycine max]
          Length = 918

 Score =  111 bits (278), Expect = 8e-22
 Identities = 111/382 (29%), Positives = 165/382 (43%), Gaps = 50/382 (13%)
 Frame = -2

Query: 1001 TSSLFGGAAATQTNAGIPVPVGLGDTPRHVNIHIHAXXXXXXGRMLPVRSVLTTAVPS-- 828
            TSSLFGG     T A +   +G+G+ PR+VNIHIHA        + P+ S + +   +  
Sbjct: 395  TSSLFGGPVPPSTPATLGT-IGIGNAPRNVNIHIHAGTS-----LAPIVSAIGSRENNGE 448

Query: 827  ---SRVPVDPSHTRSSNTTSIP------RTEQSNPNAASAAQGSNEG---TPSVGPSSAS 684
               S    +P    S +T  +P       T  S+P     +  +  G   + S  PS ++
Sbjct: 449  GTRSEHHNEPGSGDSGSTRVLPVRNVIAATIPSHPPGVGVSSSTQTGFGISTSQPPSDSA 508

Query: 683  TIHPMVSEINAQLRNLVRDM--------------------GGENVAPSGPSESPSNQGLA 564
            ++  +++EIN++LRN+V +M                    G E+  P+   +  +     
Sbjct: 509  SLSSVLAEINSRLRNVVGNMQGDNTVPSGQMESNSRDLSSGSESRPPTVNKQQDTVDVNG 568

Query: 563  TGSVSGDGAGNSQQERSHKD-------------DK--SPCKEXXXXXXXXXXXXXXXATK 429
             G++S    G + +    K              DK  S                     +
Sbjct: 569  FGAISASSVGCTSESEVQKVQTEAVQTSSNVLVDKFVSSSSNQDLQSCSSGETIVKPEIE 628

Query: 428  QXXXXXXXXXXXXXXVKDVPXXXXXXXLQPKLKRGRQGSSQVKNANDATSSGVPTSNQNQ 249
            Q               K  P       L+ K +R R      K A+D +SS   + NQNQ
Sbjct: 629  QDVLAVSERQNVTEPAKAAPLGLGVGGLERK-RRTRLQPPVSKGADDGSSSS--SVNQNQ 685

Query: 248  QAITSGQQILQSLVARGSNART-DSNGVSDQIMGARPSAVPASNSQFDAAGMMSQVLNSP 72
            Q  T GQ ILQ+L + GS   + ++NG S      RP  +P+S+   D AG+MSQ L+SP
Sbjct: 686  QTRTDGQHILQTLASHGSGLNSRNANGPSQ-----RP--LPSSDRPIDVAGLMSQALHSP 738

Query: 71   ALNGLLSGVSEQAGMGSPDGLR 6
            ALNGLL GVS+Q G+ SPDGLR
Sbjct: 739  ALNGLLEGVSQQTGVDSPDGLR 760


Top