BLASTX nr result

ID: Cornus23_contig00015228 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00015228
         (1214 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010658710.1| PREDICTED: uncharacterized protein LOC100245...   436   e-119
ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3...   410   e-111
ref|XP_009793129.1| PREDICTED: uncharacterized protein LOC104240...   409   e-111
ref|XP_008446786.1| PREDICTED: OTU domain-containing protein At3...   409   e-111
ref|XP_009603537.1| PREDICTED: uncharacterized protein LOC104098...   406   e-110
ref|XP_007220473.1| hypothetical protein PRUPE_ppa008484mg [Prun...   402   e-109
ref|XP_008232087.1| PREDICTED: OTU domain-containing protein At3...   400   e-109
ref|XP_007010219.1| Cysteine proteinases superfamily protein iso...   394   e-107
ref|XP_004307032.1| PREDICTED: OTU domain-containing protein At3...   392   e-106
ref|XP_004250001.1| PREDICTED: uncharacterized protein LOC101253...   390   e-105
ref|XP_007010220.1| Cysteine proteinases superfamily protein iso...   389   e-105
ref|XP_006360486.1| PREDICTED: uncharacterized protein LOC102606...   386   e-104
ref|XP_010032108.1| PREDICTED: OTU domain-containing protein At3...   384   e-104
ref|XP_002316423.1| hypothetical protein POPTR_0010s24050g [Popu...   381   e-103
ref|XP_012456105.1| PREDICTED: uncharacterized protein LOC105777...   379   e-102
ref|XP_010243634.1| PREDICTED: uncharacterized protein LOC104587...   378   e-102
ref|XP_011024271.1| PREDICTED: uncharacterized protein LOC105125...   378   e-102
ref|XP_008345443.1| PREDICTED: uncharacterized protein LOC103408...   377   e-102
gb|KHG26701.1| hypothetical protein F383_04817 [Gossypium arboreum]   376   e-101
emb|CDO99851.1| unnamed protein product [Coffea canephora]            374   e-100

>ref|XP_010658710.1| PREDICTED: uncharacterized protein LOC100245448 [Vitis vinifera]
           gi|296090402|emb|CBI40221.3| unnamed protein product
           [Vitis vinifera]
          Length = 317

 Score =  436 bits (1120), Expect = e-119
 Identities = 217/317 (68%), Positives = 248/317 (78%), Gaps = 14/317 (4%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLSFAHGSVAHHRL-------VGSPVRLVGG--DQPRRHHSSACR 813
           MLGVLCAR KPWILA+LSF HGS  HH L       +G+P++  GG  D  RRHHS ACR
Sbjct: 1   MLGVLCARHKPWILATLSFVHGSATHHHLHLNHHHLLGTPIQFNGGGDDHRRRHHSRACR 60

Query: 812 LGGSRGGAASIWHAILPSGGGRRGNIRPAF---HQGEGSWNVAWDVRPARWLHWPDSVWL 642
            G S GGAASIWHAILPSGG RR ++RPA     +GEGSWNVAWD RPARWLH PDS WL
Sbjct: 61  QGSSGGGAASIWHAILPSGGDRRSSLRPALLHDQKGEGSWNVAWDARPARWLHRPDSAWL 120

Query: 641 LFGVCPCLSAQLDLPDVNTDSASGDVKIDSCGTAVISSDENDEGN--YRITGVTADGRCL 468
           LFGVC CL A LD  DV+ +  + D KI+ C      SDEN+  +  YR+TGV ADGRCL
Sbjct: 121 LFGVCACL-APLDSFDVDNEVVAVDDKIEGCNQVNEISDENNNSSADYRVTGVPADGRCL 179

Query: 467 FRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDDDFDAYVKRMQ 288
           FRAIAH ACLR+GEEAPDE RQ ELAD+LRAQVVDELL+RREETEWFI+ +FDAYVKR+Q
Sbjct: 180 FRAIAHSACLRSGEEAPDENRQTELADDLRAQVVDELLKRREETEWFIEGNFDAYVKRIQ 239

Query: 287 QPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKSPIKVLFHGYG 108
           QPYVWGGEPEL+MASHV+K  ISVFM+ RSSG L NIANYG+EY+ D +SPI VLFHGYG
Sbjct: 240 QPYVWGGEPELIMASHVLKMPISVFMIGRSSGDLKNIANYGKEYRIDNESPINVLFHGYG 299

Query: 107 HYDILDAISDQTFQKVE 57
           HYDIL+  SD ++QK+E
Sbjct: 300 HYDILETFSDHSYQKLE 316


>ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
           sativus] gi|700197033|gb|KGN52210.1| hypothetical
           protein Csa_5G615810 [Cucumis sativus]
          Length = 313

 Score =  410 bits (1054), Expect = e-111
 Identities = 210/317 (66%), Positives = 247/317 (77%), Gaps = 15/317 (4%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLS-FAHGSVAHHR-------LVGSPVRLVGGDQPRRHHSSACRL 810
           MLGVLCARPKPWIL SLS F HGS  +H        LV SP++    D+ +RHHSSAC+L
Sbjct: 1   MLGVLCARPKPWILVSLSNFIHGSAVYHHHHHQSRLLVQSPIQF---DRRQRHHSSACKL 57

Query: 809 GGSRGGAASIWHAILPSGGGRRGNI-RPAFH----QGEGSWNVAWDVRPARWLHWPDSVW 645
            G  GGAASIWHAI+PSG G   N+ RPA H    +GEGSWNVAWD RPARWLH PDS W
Sbjct: 58  AG--GGAASIWHAIMPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSAW 115

Query: 644 LLFGVCPCLSAQLDLPDVNTDSASGDVKIDSCGTAVISSDENDEGN--YRITGVTADGRC 471
           LLFGVC C+ A LD  D + ++ S D K + C ++    ++NDE +  YR+TGV ADGRC
Sbjct: 116 LLFGVCACI-APLDWVDASHEAVSLDQKKEVCESSGPEFNQNDESSADYRVTGVLADGRC 174

Query: 470 LFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDDDFDAYVKRM 291
           LFRAIAH ACLR+GEEAPD+ RQRELADELRA+VVDELL+RR+ETEW+I+ DFDAYVKR+
Sbjct: 175 LFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDAYVKRI 234

Query: 290 QQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKSPIKVLFHGY 111
           QQP+VWGGEPELLMASHV+KT ISVFM +RSS GL+NIA YG+EYQK E+SPI VLFHGY
Sbjct: 235 QQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQKGEESPINVLFHGY 294

Query: 110 GHYDILDAISDQTFQKV 60
           GHYDIL+  SD+   K+
Sbjct: 295 GHYDILETSSDKVSLKL 311


>ref|XP_009793129.1| PREDICTED: uncharacterized protein LOC104240043 [Nicotiana
           sylvestris]
          Length = 328

 Score =  409 bits (1052), Expect = e-111
 Identities = 214/327 (65%), Positives = 252/327 (77%), Gaps = 24/327 (7%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLSF--AHGSV--AHHRLVGSPVR--LVGGDQP--RRHHSSACRL 810
           MLGVLCARPKPW+ ASLS   AHGS   A++RL+G+P +  LVGG     RRHHSS CRL
Sbjct: 1   MLGVLCARPKPWLFASLSLSHAHGSAPAAYNRLIGTPTKSVLVGGSDQLQRRHHSSHCRL 60

Query: 809 GGS--RGGAASIWHAILPSGGGRRGNIR--PAFH--------QGEGSWNVAWDVRPARWL 666
           G S  RGGAASIWHAILP+G   +   R    FH        +GEGSWNVAWD RPARWL
Sbjct: 61  GASVNRGGAASIWHAILPAGRRNKDVKRRNTVFHHHHYELAKKGEGSWNVAWDTRPARWL 120

Query: 665 HWPDSVWLLFGVCPCLSA-QLDLPDVNTDSASGDVKIDS-CGTAVISSDENDEG--NYRI 498
           H PDS WLLFGVC CL+A  LDLPD N+D  +    +     +  ++SDE D    NY +
Sbjct: 121 HNPDSAWLLFGVCSCLAAPSLDLPDSNSDVVAPIENMSQGFSSNTVNSDEADRNSANYTV 180

Query: 497 TGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDD 318
           TGV ADGRCLFRAIAH+ACLRNGE APDE RQRELADELRAQVVDELL+RR+E EWFI+ 
Sbjct: 181 TGVPADGRCLFRAIAHMACLRNGEGAPDENRQRELADELRAQVVDELLKRRKEAEWFIEG 240

Query: 317 DFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKS 138
           DFDAYV+R+++PYVWGGEPELLMASHV+K+ ISV+MVDRSSG L+NI+NYGEEY+K+ ++
Sbjct: 241 DFDAYVERIEKPYVWGGEPELLMASHVLKSPISVYMVDRSSGSLINISNYGEEYRKEGEN 300

Query: 137 PIKVLFHGYGHYDILDAISDQTFQKVE 57
           PI VLFHGYGHYDIL+ IS++  QK+E
Sbjct: 301 PINVLFHGYGHYDILETISEKGHQKLE 327


>ref|XP_008446786.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
           melo]
          Length = 313

 Score =  409 bits (1050), Expect = e-111
 Identities = 210/317 (66%), Positives = 246/317 (77%), Gaps = 15/317 (4%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLS-FAHGSVAHHR-------LVGSPVRLVGGDQPRRHHSSACRL 810
           MLGVLCARPKPWIL SLS F HGS  +H        LV SP++    D+ +RHHSSAC+L
Sbjct: 1   MLGVLCARPKPWILVSLSNFIHGSAVYHHHHHQSRLLVQSPIQF---DRRQRHHSSACKL 57

Query: 809 GGSRGGAASIWHAILPSGGGRRGNI-RPAFH----QGEGSWNVAWDVRPARWLHWPDSVW 645
            G  GGAASIWHAILPSG G   N+ RPA H    +GEGSWNVAWD RPARWLH PDS W
Sbjct: 58  AG--GGAASIWHAILPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSAW 115

Query: 644 LLFGVCPCLSAQLDLPDVNTDSASGDVKIDSCGTAVISSDENDEGN--YRITGVTADGRC 471
           LLFGVC C+ A LD  D + ++ S D K + C ++    ++NDE +  YR+TGV ADGRC
Sbjct: 116 LLFGVCACI-APLDWVDASHEAVSLDQKKEVCESSGPEFNQNDESSADYRVTGVLADGRC 174

Query: 470 LFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDDDFDAYVKRM 291
           LFRAIAH ACLR+GEEAPD+ RQRELADELRA+VVDELL+RR+ETEW+I+ DFDAYVKR+
Sbjct: 175 LFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDAYVKRI 234

Query: 290 QQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKSPIKVLFHGY 111
           QQP+VWGGEPELLMASHV+KT ISVFM +RSS GL+NIA YG+EYQ  E+SPI VLFHGY
Sbjct: 235 QQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQMGEESPINVLFHGY 294

Query: 110 GHYDILDAISDQTFQKV 60
           GHYDIL+  SD+   K+
Sbjct: 295 GHYDILETSSDKVSLKL 311


>ref|XP_009603537.1| PREDICTED: uncharacterized protein LOC104098494 [Nicotiana
           tomentosiformis]
          Length = 328

 Score =  406 bits (1043), Expect = e-110
 Identities = 213/327 (65%), Positives = 251/327 (76%), Gaps = 24/327 (7%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLSF--AHGSV--AHHRLVGSPVR--LVGGDQP--RRHHSSACRL 810
           MLGVLCARPKPW+ ASLS   AHGS   A++RL+G+P +  LVGG     RRHHSS CRL
Sbjct: 1   MLGVLCARPKPWLFASLSLSHAHGSAPAAYNRLIGTPTKSVLVGGSDQLQRRHHSSHCRL 60

Query: 809 GGS--RGGAASIWHAILPSGGGRRGNIR--PAFH--------QGEGSWNVAWDVRPARWL 666
           G S  RGGAASIWHAILP+G   +   R    FH        +GEGSWNVAWD RPARWL
Sbjct: 61  GASVNRGGAASIWHAILPAGRRNKDVKRRNTVFHHHHYVLAKKGEGSWNVAWDTRPARWL 120

Query: 665 HWPDSVWLLFGVCPCLSAQ-LDLPDVNTDSASG-DVKIDSCGTAVISSDENDEG--NYRI 498
           H PDS WLLFGVC CL+A  LDLPD N++  +  + K     +  ++SDE D    NY +
Sbjct: 121 HNPDSAWLLFGVCSCLAAPTLDLPDSNSEVVAPIENKSQGFSSNTVNSDEVDRNSANYTV 180

Query: 497 TGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDD 318
           TGV ADGRCLFRAIAH+ACLRNGE APDE RQRELADELRAQVVDELL+RR+E EWFI+ 
Sbjct: 181 TGVPADGRCLFRAIAHMACLRNGEGAPDENRQRELADELRAQVVDELLKRRKEAEWFIEG 240

Query: 317 DFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKS 138
           DFDAYV+R+++PYVWGGEPELLMASHV+K+ ISV+MVDRSSG L+NI+NYGEEY+K+ ++
Sbjct: 241 DFDAYVERIEKPYVWGGEPELLMASHVLKSPISVYMVDRSSGSLINISNYGEEYRKEGEN 300

Query: 137 PIKVLFHGYGHYDILDAISDQTFQKVE 57
           PI VLFHGYGHYDIL+ IS +  Q +E
Sbjct: 301 PINVLFHGYGHYDILETISAKGHQNLE 327


>ref|XP_007220473.1| hypothetical protein PRUPE_ppa008484mg [Prunus persica]
           gi|462416935|gb|EMJ21672.1| hypothetical protein
           PRUPE_ppa008484mg [Prunus persica]
          Length = 329

 Score =  402 bits (1032), Expect = e-109
 Identities = 210/328 (64%), Positives = 244/328 (74%), Gaps = 26/328 (7%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLS-FAHGSVAHHR---LVGSPVRLV---------GGDQPRRHHS 825
           MLG LCAR K WI++SLS FAHGS A H+   L    + L+         G +  R HHS
Sbjct: 1   MLGFLCARRKTWIVSSLSSFAHGSAAAHQSRLLQAHTLPLIHQQIASFSCGFETRRHHHS 60

Query: 824 SACRLGGSRG-GAASIWHAILPSGGGRRGNI--RPAFH---QGEGSWNVAWDVRPARWLH 663
           SAC+LG + G GAASIWHA+LPS   RR     RPA H   +GEGSWN AWD RPARWLH
Sbjct: 61  SACQLGSACGTGAASIWHALLPSSCNRRSRDLRRPAIHYELKGEGSWNAAWDARPARWLH 120

Query: 662 WPDSVWLLFGVCPCLSAQLDLPDVNTDSASGDVKIDS-------CGTAVISSDENDEGNY 504
            PDS WLLFGVC CL A +D  D +T   +  V  ++       C  A   ++ +   +Y
Sbjct: 121 RPDSAWLLFGVCNCL-APIDWADDSTPDGNDGVSNENAESFDSKCSAAPDQNNIDSSADY 179

Query: 503 RITGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFI 324
           R+TGV ADGRCLFRAIAHVACLRNGEEAPDE RQR+LADELRAQVVDELL+RREETEWFI
Sbjct: 180 RVTGVPADGRCLFRAIAHVACLRNGEEAPDENRQRDLADELRAQVVDELLKRREETEWFI 239

Query: 323 DDDFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDE 144
           + DFDAYVKR+QQPYVWGGEPELLMASHV+KT ISVFM+DRSS GLVNIANYGEEY+K+E
Sbjct: 240 EGDFDAYVKRLQQPYVWGGEPELLMASHVLKTPISVFMIDRSSAGLVNIANYGEEYRKEE 299

Query: 143 KSPIKVLFHGYGHYDILDAISDQTFQKV 60
           + PI VLFHGYGHYDILD+ S+Q+ +K+
Sbjct: 300 EKPINVLFHGYGHYDILDSFSEQSLKKL 327


>ref|XP_008232087.1| PREDICTED: OTU domain-containing protein At3g57810-like [Prunus
           mume]
          Length = 329

 Score =  400 bits (1029), Expect = e-109
 Identities = 209/328 (63%), Positives = 244/328 (74%), Gaps = 26/328 (7%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLS-FAHGSVAHHR---LVGSPVRLV---------GGDQPRRHHS 825
           MLG LCAR K WI++SLS FAHGS A H+   L    + L+         G +  R HHS
Sbjct: 1   MLGFLCARRKTWIVSSLSSFAHGSAAAHQSRLLQAHTLPLIHQQIASFSCGFETRRHHHS 60

Query: 824 SACRLGGSRG-GAASIWHAILPSGGGRRGNI--RPAFH---QGEGSWNVAWDVRPARWLH 663
           SAC+LG + G GAASIWHA+LPS   RR     RPA H   +GEGSWN AWD RPARWLH
Sbjct: 61  SACQLGSACGTGAASIWHALLPSSCNRRSRDLRRPAIHYELKGEGSWNAAWDARPARWLH 120

Query: 662 WPDSVWLLFGVCPCLSAQLDLPDVNTDSASGDVKIDS-------CGTAVISSDENDEGNY 504
            PDS WLLFGVC CL A +D  D +T   +  V  ++       C  A   ++ +   +Y
Sbjct: 121 RPDSAWLLFGVCNCL-APIDWADDSTPDGNDGVSNENAESFDSKCSAASDQNNIDSSADY 179

Query: 503 RITGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFI 324
           R+TGV ADGRCLFRAIAHVACLRNGEEAPDE RQR+LADELRAQVVDELL+RREETEWFI
Sbjct: 180 RVTGVPADGRCLFRAIAHVACLRNGEEAPDENRQRDLADELRAQVVDELLKRREETEWFI 239

Query: 323 DDDFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDE 144
           + DFDAYVKR+QQPYVWGGEPELLMASHV+KT ISVFM+DRSS GLVNIANYGE+Y+K+E
Sbjct: 240 EGDFDAYVKRLQQPYVWGGEPELLMASHVLKTPISVFMIDRSSAGLVNIANYGEDYRKEE 299

Query: 143 KSPIKVLFHGYGHYDILDAISDQTFQKV 60
           + PI VLFHGYGHYDILD+ S+Q+ +K+
Sbjct: 300 EKPINVLFHGYGHYDILDSFSEQSLKKL 327


>ref|XP_007010219.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma
           cacao] gi|508727132|gb|EOY19029.1| Cysteine proteinases
           superfamily protein isoform 1 [Theobroma cacao]
          Length = 327

 Score =  394 bits (1013), Expect = e-107
 Identities = 207/330 (62%), Positives = 241/330 (73%), Gaps = 28/330 (8%)
 Frame = -1

Query: 965 MLGVLCARP-KPWILASLSF-AHGSVAHH----RLVGSPVR---LVGGDQPRRHHSSACR 813
           MLGVLCARP KPWIL SLS  AHG +A H    RLV  P     L   D+  RHHS+ACR
Sbjct: 1   MLGVLCARPPKPWILNSLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHHSTACR 60

Query: 812 LGGSRGGAASIWHAILPSGGG----RRGNI-RPAFHQGEGSWNVAWDVRPARWLHWPDSV 648
           LGGS GGAASIWHAILP GGG    RRG + +    +GEGSWNVAWD RPARWLH PDS 
Sbjct: 61  LGGSDGGAASIWHAILPCGGGGGGRRRGEVWKNVERKGEGSWNVAWDARPARWLHRPDSA 120

Query: 647 WLLFGVCPCLSAQLDLPDVNTDS--------------ASGDVKIDSCGTAVISSDENDEG 510
           WLLFGVC CL+  ++  DVN D+               S D K  S  ++V ++D     
Sbjct: 121 WLLFGVCACLAPMIEFVDVNPDADDKIEGAELNLVSRLSADEKSSSSSSSVAAAD----- 175

Query: 509 NYRITGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEW 330
           N ++TGV ADGRCLFRAIAH ACLR+GE+APDE  QRELADELRAQVV+ELL+RREETEW
Sbjct: 176 NCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVVNELLKRREETEW 235

Query: 329 FIDDDFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQK 150
           FI+ DFDAYVK +QQPYVWGGEPE+LMASHV+KT ISV+M+ RSS  L  IA YGEEYQK
Sbjct: 236 FIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNLTKIAKYGEEYQK 295

Query: 149 DEKSPIKVLFHGYGHYDILDAISDQTFQKV 60
           D+++PI VLFHGYGHYDIL+++ +Q   +V
Sbjct: 296 DKENPINVLFHGYGHYDILESLPEQNCAQV 325


>ref|XP_004307032.1| PREDICTED: OTU domain-containing protein At3g57810-like [Fragaria
           vesca subsp. vesca]
          Length = 324

 Score =  392 bits (1008), Expect = e-106
 Identities = 209/323 (64%), Positives = 241/323 (74%), Gaps = 21/323 (6%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLS-FAHGSVAHH--RLVGSPV------RLVGGDQPRRHHSSACR 813
           MLG LCAR K WI++SLS FAHG  A H  R+V SP+         G  + R HH+S+C+
Sbjct: 1   MLGFLCARRKTWIVSSLSSFAHGPAAIHQSRIVHSPLIQHQFTNFSGETRGRHHHNSSCQ 60

Query: 812 LGGSRGG--AASIWHAILPSGG--GRRGNIRPAFH---QGEGSWNVAWDVRPARWLHWPD 654
           LG + GG  AASIWHAILPS G   RR   RPA H   +GEGSWN A D RPARWLH PD
Sbjct: 61  LGSACGGGAAASIWHAILPSSGLWRRRDLRRPAIHYELKGEGSWNAALDARPARWLHRPD 120

Query: 653 SVWLLFGVCPCLSAQLDLPDVNTDSASGDV---KIDSCGT--AVISSDENDEGNYRITGV 489
           S WLLFGVC CL A +D       + + +V   K ++C +  ++ S  + +  +YR+TGV
Sbjct: 121 SAWLLFGVCNCL-APIDWGSTTNSTTNDEVSNNKTEACDSKSSITSDVQLETPDYRVTGV 179

Query: 488 TADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDDDFD 309
            ADGRCLFRAIAHVACLRNGEE PDE RQRELADELRAQVVDELL+RREETEWFI+ DFD
Sbjct: 180 LADGRCLFRAIAHVACLRNGEEPPDENRQRELADELRAQVVDELLKRREETEWFIEGDFD 239

Query: 308 AYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKSPIK 129
           AYVKR+QQPYVWGGEPELLMASHV K  ISV+MVDRSSGGLVNIA YGEEY K E+ PI 
Sbjct: 240 AYVKRIQQPYVWGGEPELLMASHVKKAPISVYMVDRSSGGLVNIAKYGEEYGKQEEKPIN 299

Query: 128 VLFHGYGHYDILDAISDQTFQKV 60
           VLFHGYGHYDIL++ S+Q+ QKV
Sbjct: 300 VLFHGYGHYDILESFSEQSLQKV 322


>ref|XP_004250001.1| PREDICTED: uncharacterized protein LOC101253339 [Solanum
            lycopersicum]
          Length = 338

 Score =  390 bits (1001), Expect = e-105
 Identities = 215/341 (63%), Positives = 250/341 (73%), Gaps = 38/341 (11%)
 Frame = -1

Query: 965  MLGVLCARPKPWILASL--SFAHGSV--AHHRLVG------SPVRLVGGD---------- 846
            MLGVLCARPKPW+ ASL  S AHGS    + RL+       S + L+ G           
Sbjct: 1    MLGVLCARPKPWLFASLCLSHAHGSTPSGYSRLIPTNTANKSSLLLISGGGGGGGGGIGV 60

Query: 845  QPRRHHSSACRLGGSR---GGAASIWHAILPSGG------GRRGNIRPAFH-----QGEG 708
              RR+HSS CR+  S    GGAASIWHAILP+G        RR N     H     +GEG
Sbjct: 61   DQRRNHSSHCRIASSVNRVGGAASIWHAILPAGRRNKKDINRRNNTVFKHHYELAKKGEG 120

Query: 707  SWNVAWDVRPARWLHWPDSVWLLFGVCPCLSA-QLDL-PDVNTDSASGDVKIDSCGTAVI 534
            SWNV WD RPARWLH PDS WLLFGVC CL+A  LDL PD N+D A   V ID   +AV 
Sbjct: 121  SWNVNWDSRPARWLHNPDSAWLLFGVCSCLAAPSLDLLPDANSDVA---VPIDK-QSAVN 176

Query: 533  SSDENDEG--NYRITGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDE 360
            SSDE+D+   NYR+TGV ADGRCLFRAIAH+ACLRNGEEAPDE RQRELADELRAQVVDE
Sbjct: 177  SSDEDDQNSANYRVTGVPADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDE 236

Query: 359  LLRRREETEWFIDDDFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVN 180
            LL+RR+E EWFI+ DFDAYV+R+++PYVWGGEPELLMASHV+K+ ISV+MVDRSSG L+N
Sbjct: 237  LLKRRKEAEWFIEGDFDAYVERIEKPYVWGGEPELLMASHVLKSAISVYMVDRSSGSLIN 296

Query: 179  IANYGEEYQKDEKSPIKVLFHGYGHYDILDAISDQTFQKVE 57
            I+NYGEEY+K+ +SPI VLFHGYGHYDIL+ I ++  QK+E
Sbjct: 297  ISNYGEEYRKEGESPINVLFHGYGHYDILETIPEKIHQKLE 337


>ref|XP_007010220.1| Cysteine proteinases superfamily protein isoform 2 [Theobroma
           cacao] gi|508727133|gb|EOY19030.1| Cysteine proteinases
           superfamily protein isoform 2 [Theobroma cacao]
          Length = 330

 Score =  389 bits (999), Expect = e-105
 Identities = 207/333 (62%), Positives = 241/333 (72%), Gaps = 31/333 (9%)
 Frame = -1

Query: 965 MLGVLCARP-KPWILASLSF-AHGSVAHH----RLVGSPVR---LVGGDQPRRHHSSACR 813
           MLGVLCARP KPWIL SLS  AHG +A H    RLV  P     L   D+  RHHS+ACR
Sbjct: 1   MLGVLCARPPKPWILNSLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHHSTACR 60

Query: 812 LGGSRGGAASIWHAILPSGGG----RRGNI-RPAFHQGEGSWNVAWDVRPARWLHWPDSV 648
           LGGS GGAASIWHAILP GGG    RRG + +    +GEGSWNVAWD RPARWLH PDS 
Sbjct: 61  LGGSDGGAASIWHAILPCGGGGGGRRRGEVWKNVERKGEGSWNVAWDARPARWLHRPDSA 120

Query: 647 WLLFGVCPCLSAQLDLPDVNTDS--------------ASGDVKIDSCGTAVISSDENDEG 510
           WLLFGVC CL+  ++  DVN D+               S D K  S  ++V ++D     
Sbjct: 121 WLLFGVCACLAPMIEFVDVNPDADDKIEGAELNLVSRLSADEKSSSSSSSVAAAD----- 175

Query: 509 NYRITGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQV---VDELLRRREE 339
           N ++TGV ADGRCLFRAIAH ACLR+GE+APDE  QRELADELRAQV   V+ELL+RREE
Sbjct: 176 NCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVSLVVNELLKRREE 235

Query: 338 TEWFIDDDFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEE 159
           TEWFI+ DFDAYVK +QQPYVWGGEPE+LMASHV+KT ISV+M+ RSS  L  IA YGEE
Sbjct: 236 TEWFIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNLTKIAKYGEE 295

Query: 158 YQKDEKSPIKVLFHGYGHYDILDAISDQTFQKV 60
           YQKD+++PI VLFHGYGHYDIL+++ +Q   +V
Sbjct: 296 YQKDKENPINVLFHGYGHYDILESLPEQNCAQV 328


>ref|XP_006360486.1| PREDICTED: uncharacterized protein LOC102606023 isoform X1 [Solanum
            tuberosum]
          Length = 338

 Score =  386 bits (992), Expect = e-104
 Identities = 213/341 (62%), Positives = 247/341 (72%), Gaps = 38/341 (11%)
 Frame = -1

Query: 965  MLGVLCARPKPWILASL--SFAHGSV--AHHRLVG------SPVRLVGGD---------- 846
            MLGVLCARPKPW+ ASL  S AHGS    + RL+       S + L+ G           
Sbjct: 1    MLGVLCARPKPWLFASLCLSHAHGSTPSGYSRLIATNTANKSSLLLISGGGSGGGGGTGV 60

Query: 845  QPRRHHSSACRLGGS---RGGAASIWHAILPSGG------GRRGNIRPAFH-----QGEG 708
              RR+HS  CR+  S    GGAASIWHAILP+G        RR N     H     +GEG
Sbjct: 61   DQRRNHSIHCRIASSVNRGGGAASIWHAILPAGRRNKKDINRRNNTVFKHHYELAKKGEG 120

Query: 707  SWNVAWDVRPARWLHWPDSVWLLFGVCPCLSA-QLDL-PDVNTDSASGDVKIDSCGTAVI 534
            SWNV WD RPARWLH PDS WLLFGVC CL+A  LDL PD N D A   V ID   + V 
Sbjct: 121  SWNVNWDSRPARWLHNPDSAWLLFGVCSCLAAPSLDLLPDANFDVA---VPIDK-QSVVN 176

Query: 533  SSDENDEG--NYRITGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDE 360
            SSDE+D+   NYR+TGV ADGRCLFRAIAH+ACLRNGEEAPDE RQRELADELRAQVVDE
Sbjct: 177  SSDEDDQNSANYRVTGVPADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDE 236

Query: 359  LLRRREETEWFIDDDFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVN 180
            LL+RR+E EWFI+ DFDAYV+R+++PYVWGGEPELLMASHV+K+ ISV+MVDRSSG L+N
Sbjct: 237  LLKRRKEAEWFIEGDFDAYVERIEKPYVWGGEPELLMASHVLKSSISVYMVDRSSGSLIN 296

Query: 179  IANYGEEYQKDEKSPIKVLFHGYGHYDILDAISDQTFQKVE 57
            I+NYGEEY+K+ +SPI VLFHGYGHYDIL+ I ++  QK+E
Sbjct: 297  ISNYGEEYRKEGESPINVLFHGYGHYDILETIPEKIHQKLE 337


>ref|XP_010032108.1| PREDICTED: OTU domain-containing protein At3g57810-like [Eucalyptus
           grandis] gi|629085145|gb|KCW51502.1| hypothetical
           protein EUGRSUZ_J01018 [Eucalyptus grandis]
          Length = 314

 Score =  384 bits (986), Expect = e-104
 Identities = 200/318 (62%), Positives = 233/318 (73%), Gaps = 19/318 (5%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLSFAHGSVAHH--RLV---GSPVRL-VGGDQP----RRHHSSAC 816
           MLGVLCARPKPWILAS  F+H S AHH  RL     +  RL +  D P    RRHHSS+C
Sbjct: 1   MLGVLCARPKPWILASC-FSHASAAHHCGRLAWVSAAAARLQLAADSPDRWRRRHHSSSC 59

Query: 815 RLGGSRG-----GAASIWHAILPSGGG----RRGNIRPAFHQGEGSWNVAWDVRPARWLH 663
           RLGG+       G ASIWHAILPSG G    R    R    +GEGSWNVAWD RPARWLH
Sbjct: 60  RLGGASSCAHPCGVASIWHAILPSGEGDPPRRMDQPRRPVFRGEGSWNVAWDARPARWLH 119

Query: 662 WPDSVWLLFGVCPCLSAQLDLPDVNTDSASGDVKIDSCGTAVISSDENDEGNYRITGVTA 483
            PDS WLLFGVC CL A +D  + + +    + +++   +  +   +    +YR+TGV A
Sbjct: 120 RPDSAWLLFGVCACL-APVDAAEPSREEVVPEARVEDRDS--LDEAKRSSPDYRVTGVLA 176

Query: 482 DGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDDDFDAY 303
           DGRCLFRAIAH ACLR GE APD+ RQRELADELRAQVV ELL+RREETEW I+ DFDAY
Sbjct: 177 DGRCLFRAIAHCACLRKGEAAPDDNRQRELADELRAQVVAELLKRREETEWAIEGDFDAY 236

Query: 302 VKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKSPIKVL 123
           ++R+QQPYVWGGEPELLMASHV+KT ISVFMVDRSSG LVN+A YGEEY+KDE+ PI VL
Sbjct: 237 IERIQQPYVWGGEPELLMASHVLKTPISVFMVDRSSGNLVNVAKYGEEYRKDEEIPINVL 296

Query: 122 FHGYGHYDILDAISDQTF 69
           FHGYGHYDIL++   Q++
Sbjct: 297 FHGYGHYDILESFPGQSY 314


>ref|XP_002316423.1| hypothetical protein POPTR_0010s24050g [Populus trichocarpa]
           gi|222865463|gb|EEF02594.1| hypothetical protein
           POPTR_0010s24050g [Populus trichocarpa]
          Length = 318

 Score =  381 bits (978), Expect = e-103
 Identities = 200/320 (62%), Positives = 225/320 (70%), Gaps = 17/320 (5%)
 Frame = -1

Query: 965 MLGVLCARPKP-WILASLSFAHGSVAHHRLVGSPVRLV-----GGDQPRRHHSSACRLGG 804
           MLGVLCARPKP WIL SL F H  + HH    S  RL           RRHHS+ C    
Sbjct: 1   MLGVLCARPKPNWILNSL-FTHFHLNHHHHHNSNNRLSLHLSGSSTAARRHHSNLCSADS 59

Query: 803 SRGGAASIWHAILPSGGGRRGNIRPAFHQGEGSWNVAWDVRPARWLHWPDSVWLLFGVCP 624
             GGAA+IWH I P+   RR   R    +GEGSWN AWD RPARWLH PDS WLLFGVC 
Sbjct: 60  GCGGAAAIWHVIQPADWRRRTERRSV--RGEGSWNAAWDGRPARWLHRPDSAWLLFGVCA 117

Query: 623 CLSAQLD-LPDVNTDSA----------SGDVKIDSCGTAVISSDENDEGNYRITGVTADG 477
           CL+  ++ L DVN               GD+   S      +SD     +Y++TGV ADG
Sbjct: 118 CLAPAIEFLSDVNNIDDVDHQEKERIDGGDLNASSDDAKQDNSDATVGSDYKVTGVLADG 177

Query: 476 RCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDDDFDAYVK 297
           RCLFRAIAH+ACLRNGEEAPDE RQRELADELRAQVVDELL+RREETEWFI+ DFDAYVK
Sbjct: 178 RCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREETEWFIEGDFDAYVK 237

Query: 296 RMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKSPIKVLFH 117
           R+QQPYVWGGEPELLMASHV+KT ISVFM DR++G LVNI NYGEEYQKDE +PI VLFH
Sbjct: 238 RIQQPYVWGGEPELLMASHVLKTMISVFMRDRTTGNLVNIVNYGEEYQKDEVNPINVLFH 297

Query: 116 GYGHYDILDAISDQTFQKVE 57
           GYGHYDIL+    Q++QK +
Sbjct: 298 GYGHYDILETTPGQSYQKAD 317


>ref|XP_012456105.1| PREDICTED: uncharacterized protein LOC105777394 [Gossypium
           raimondii] gi|763806450|gb|KJB73388.1| hypothetical
           protein B456_011G230700 [Gossypium raimondii]
          Length = 319

 Score =  379 bits (974), Expect = e-102
 Identities = 207/322 (64%), Positives = 242/322 (75%), Gaps = 26/322 (8%)
 Frame = -1

Query: 965 MLGVLCARP-KPWILASLSF-AHG-SVAHH---RLVGSPVR---LVGGDQPRRHHSSACR 813
           MLGVLCARP KPWIL SLS  AHG S AHH   RL+  P     L   ++  RHHS+ACR
Sbjct: 1   MLGVLCARPPKPWILNSLSLIAHGGSAAHHHENRLLHWPSHFADLSAANRRCRHHSTACR 60

Query: 812 LGG-SRGGAASIWHAILPSGGGR----RGNI-RPAFHQGEGSWNVAWDVRPARWLHWPDS 651
           LGG S GGAASIWHAILP GG R    RG++ +    +GEGSWNV+WD RPARWL   DS
Sbjct: 61  LGGGSEGGAASIWHAILPCGGDRGVKNRGDVWKNVERKGEGSWNVSWDARPARWLR-SDS 119

Query: 650 VWLLFGVCPCLSAQL--DLPDVNTDSASGDVKIDSCGTAVISSDENDEG---------NY 504
            WLLFGVC CL+     +  DVN D+   D K D    A ++SDEN            NY
Sbjct: 120 AWLLFGVCACLAPMPMDEFDDVNLDA---DNKTD----ASLNSDENSSNHLSSVAAADNY 172

Query: 503 RITGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFI 324
           ++TG+ ADGRCLFRAIAH ACLR+GEEAPDE RQRELADELRAQVV+ELL+RREETEWFI
Sbjct: 173 KVTGILADGRCLFRAIAHGACLRSGEEAPDENRQRELADELRAQVVNELLKRREETEWFI 232

Query: 323 DDDFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDE 144
           + DFDAYVK +QQPYVWGGEPELLMASHV+KT+ISV+M+ RSSG L+NIA YGEEYQK++
Sbjct: 233 EGDFDAYVKEIQQPYVWGGEPELLMASHVLKTRISVYMIHRSSGNLINIAKYGEEYQKEK 292

Query: 143 KSPIKVLFHGYGHYDILDAISD 78
           ++PI VLFHGYGHYDIL+++ +
Sbjct: 293 ENPINVLFHGYGHYDILESLPE 314


>ref|XP_010243634.1| PREDICTED: uncharacterized protein LOC104587637 [Nelumbo nucifera]
          Length = 312

 Score =  378 bits (971), Expect = e-102
 Identities = 196/313 (62%), Positives = 228/313 (72%), Gaps = 10/313 (3%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLSFAHGSVAHHRLVG---SPVRLVGGDQPRRHHSSACRLGGSRG 795
           MLGVLC RPKPW L++LS+A  ++ HHR      SP+   G D  RRHHSS CRL G   
Sbjct: 1   MLGVLCLRPKPWTLSALSYAQVAI-HHRFTERSISPISCSGVDHWRRHHSSECRLSGIGN 59

Query: 794 GAASIWHAILPSGGGRRGN--IRPAF---HQGEGSWNVAWDVRPARWLHWPDSVWLLFGV 630
           G ASIWHAILPSGG  R +  +RPAF   H+GEGSWNVA DVRPARWLH  DS WLLFGV
Sbjct: 60  GTASIWHAILPSGGRARSDSTLRPAFQYVHKGEGSWNVALDVRPARWLHGSDSAWLLFGV 119

Query: 629 CPCLSAQLDLPDVNTDSASGDVKIDSCGTAVISSDENDEG--NYRITGVTADGRCLFRAI 456
           C CL A LD    +  +   D        A I  +  D    +YR+TGV ADGRCLFRA+
Sbjct: 120 CNCL-APLDCCRESLSAPVDDGACSELENAEIRLNGEDTSLIDYRVTGVLADGRCLFRAV 178

Query: 455 AHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDDDFDAYVKRMQQPYV 276
           AH ACL++GEEAPDETRQRELAD+LRA+V DELL+RR+E EWF++ DFDAYVK +QQP  
Sbjct: 179 AHGACLKSGEEAPDETRQRELADDLRARVADELLKRRKEIEWFVEGDFDAYVKSIQQPNS 238

Query: 275 WGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKSPIKVLFHGYGHYDI 96
           WGGEPELLMASHV++T ISVFM DRSSG L+NIA YG+EY +D ++PIKVLFHGYGHYD 
Sbjct: 239 WGGEPELLMASHVLRTPISVFMRDRSSGSLINIAIYGQEYAQDNENPIKVLFHGYGHYDA 298

Query: 95  LDAISDQTFQKVE 57
           L+   D   QKVE
Sbjct: 299 LETFLDGNNQKVE 311


>ref|XP_011024271.1| PREDICTED: uncharacterized protein LOC105125498 [Populus
           euphratica]
          Length = 320

 Score =  378 bits (970), Expect = e-102
 Identities = 200/319 (62%), Positives = 225/319 (70%), Gaps = 18/319 (5%)
 Frame = -1

Query: 965 MLGVLCARPKP-WILASL-SFAHGSVAHHRLVGSPVRLV-----GGDQPRRHHSSACRLG 807
           MLGVLCARPKP WIL SL +  H +  HH+   S  RL           RRHHSS C   
Sbjct: 1   MLGVLCARPKPNWILNSLFTHFHLNHHHHQHHNSNNRLSLHLSGSSTAARRHHSSLCSAD 60

Query: 806 GSRGGAASIWHAILPSGGGRRGNIRPAFHQGEGSWNVAWDVRPARWLHWPDSVWLLFGVC 627
              GGAA+IWH I P+   RR   R    +GEGSWN AWD RPARWLH PDS WLLFGVC
Sbjct: 61  SGCGGAAAIWHVIQPADWRRRTERRSV--RGEGSWNAAWDGRPARWLHRPDSAWLLFGVC 118

Query: 626 PCLSAQLD-LPDVNTDSA----------SGDVKIDSCGTAVISSDENDEGNYRITGVTAD 480
            C++  ++ L DVN               GD+   S      SSD     +Y++TGV AD
Sbjct: 119 ACVTPAIEFLSDVNNIDDVDHQEKERIDGGDLNASSDDARQDSSDSTVGSDYKVTGVLAD 178

Query: 479 GRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDDDFDAYV 300
           GRCLFRAIAH+ACLRNGEEAPDE RQRELADELRAQVVDELL+RREETEWFI+ DFDAYV
Sbjct: 179 GRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREETEWFIEGDFDAYV 238

Query: 299 KRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKSPIKVLF 120
           KR+QQPYVWGGEPELLMASHV+KT ISVFM DR++G LVNI NYGEEYQKDE +PI VLF
Sbjct: 239 KRIQQPYVWGGEPELLMASHVLKTMISVFMRDRTTGNLVNIVNYGEEYQKDEVNPINVLF 298

Query: 119 HGYGHYDILDAISDQTFQK 63
           HGYGHYDIL+    Q++QK
Sbjct: 299 HGYGHYDILETTPGQSYQK 317


>ref|XP_008345443.1| PREDICTED: uncharacterized protein LOC103408366 [Malus domestica]
          Length = 323

 Score =  377 bits (969), Expect = e-102
 Identities = 206/326 (63%), Positives = 232/326 (71%), Gaps = 24/326 (7%)
 Frame = -1

Query: 965 MLGVLCARPKPWILASLS-FAHGSVAHHR---LVGSPVRLVG--------GDQPRRHHSS 822
           MLG LCAR K WI+ SLS F HGS A H+   L+ +  R +         G + RR+HSS
Sbjct: 1   MLGFLCARRKTWIVTSLSAFTHGSAAAHKTRLLLQTHSRPIHQQIANFSCGFENRRYHSS 60

Query: 821 ACRLGGSRG-GAASIWHAILPSGGGRRGN--IRPAFH-QGEGSWNVAWDVRPARWLHWPD 654
              LG   G GAASIWH +LPS G  R     RPA H + EGSWN AWD RPARWLH PD
Sbjct: 61  P--LGSDCGAGAASIWHGLLPSAGNSRSRNLCRPAIHYEREGSWNAAWDARPARWLHRPD 118

Query: 653 SVWLLFGVCPCL--------SAQLDLPDVNTDSASGDVKIDSCGTAVISSDENDEGNYRI 498
           S WLLFGV  CL        SA  +  DV       D K  S    +I S   D   YR+
Sbjct: 119 SAWLLFGVRSCLAPTNWAVDSAPGEXNDVYNSKTDCDSKSSSSPENIIDSSAAD---YRV 175

Query: 497 TGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDD 318
           TGV ADGRCLFRAIAHVACLRNGEEAPDE RQR+LADELR+QVVDELL+RR+ETEWFI+ 
Sbjct: 176 TGVLADGRCLFRAIAHVACLRNGEEAPDENRQRDLADELRSQVVDELLKRRKETEWFIEG 235

Query: 317 DFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKS 138
           DFDAYVKR+QQPYVWGGEPELLMASHV+KT ISVFMVDRSS GLVNIA YGEEYQK+E+ 
Sbjct: 236 DFDAYVKRLQQPYVWGGEPELLMASHVLKTPISVFMVDRSSSGLVNIAKYGEEYQKEEEK 295

Query: 137 PIKVLFHGYGHYDILDAISDQTFQKV 60
           PI VLFHGYGHYDIL++ S+Q+ QK+
Sbjct: 296 PINVLFHGYGHYDILESFSEQSLQKL 321


>gb|KHG26701.1| hypothetical protein F383_04817 [Gossypium arboreum]
          Length = 319

 Score =  376 bits (965), Expect = e-101
 Identities = 205/322 (63%), Positives = 241/322 (74%), Gaps = 26/322 (8%)
 Frame = -1

Query: 965 MLGVLCARP-KPWILASLSF-AHG-SVAHH---RLVGSPVRL--VGGDQPR-RHHSSACR 813
           MLGVLC RP KPWIL SLS  AHG S AHH   RL+  P     +  D  R RHHS+ACR
Sbjct: 1   MLGVLCTRPPKPWILNSLSLIAHGGSAAHHHENRLLHWPSHFADLSADNRRCRHHSTACR 60

Query: 812 LGG-SRGGAASIWHAILPSGGGR----RGNI-RPAFHQGEGSWNVAWDVRPARWLHWPDS 651
           LGG S GGAASIWHAILP GG R    RG++ +    +GEGSWNV+WD RPARWL  PDS
Sbjct: 61  LGGGSEGGAASIWHAILPCGGDRGVKNRGDVWKNVERKGEGSWNVSWDARPARWLR-PDS 119

Query: 650 VWLLFGVCPCLSAQL--DLPDVNTDSASGDVKIDSCGTAVISSDENDEG---------NY 504
            WLLFGVC CL+     +  DVN D+   D K D    A ++SDE             N+
Sbjct: 120 AWLLFGVCACLAPMPMDEFDDVNLDA---DNKTD----ASLNSDEKSSNHLSSVAAADNF 172

Query: 503 RITGVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFI 324
           ++TG+ ADGRCLFRAIAH ACLR+GEEAPDE RQRELADELRAQVV+ELL+RREETEW+I
Sbjct: 173 KVTGILADGRCLFRAIAHGACLRSGEEAPDENRQRELADELRAQVVNELLKRREETEWYI 232

Query: 323 DDDFDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDE 144
           + DFDAYVK +QQPYVWGGEPELLMASHV+KT+ISV+M+ RSSG L+NIA YGEEYQK++
Sbjct: 233 EGDFDAYVKEIQQPYVWGGEPELLMASHVLKTRISVYMIHRSSGNLINIAKYGEEYQKEK 292

Query: 143 KSPIKVLFHGYGHYDILDAISD 78
           ++PI VLFHGYGHYDIL+++ +
Sbjct: 293 ENPINVLFHGYGHYDILESLPE 314


>emb|CDO99851.1| unnamed protein product [Coffea canephora]
          Length = 337

 Score =  374 bits (959), Expect = e-100
 Identities = 198/337 (58%), Positives = 242/337 (71%), Gaps = 23/337 (6%)
 Frame = -1

Query: 965  MLGVLCARPKPWILASL--SFAHGSVA---HHRLVGSPVR---LVGGDQPRRHHSSACRL 810
            ML  LCARPK W+  +L  S AH S A   H+RL+GSP+    +V     RRHHSS+CRL
Sbjct: 1    MLSALCARPKSWLFTALFLSHAHSSAAALVHNRLIGSPLLKSVVVANADQRRHHSSSCRL 60

Query: 809  --GGSRGGAASIWHAILPSGGG------RRGNIRPAFH-----QGEGSWNVAWDVRPARW 669
                ++GGAASIWHAILP+G G       + N+    H     +GEGSWNVAWD RPARW
Sbjct: 61   VDTSAQGGAASIWHAILPAGDGDLDLHRTKRNVLVHHHDELMNKGEGSWNVAWDARPARW 120

Query: 668  LHWPDSVWLLFGVCPCLSAQ-LDLPDVNTDSASGDVKIDSCGTAVISSDENDE-GNYRIT 495
            LH  DS WLLFGVC CL+A  L L   +++   G+        A ++  EN +  N+R+T
Sbjct: 121  LHNRDSAWLLFGVCACLAAPPLPLLADSSEFVDGETDEFRHEAAAMTVVENGKCANFRVT 180

Query: 494  GVTADGRCLFRAIAHVACLRNGEEAPDETRQRELADELRAQVVDELLRRREETEWFIDDD 315
            GV ADGRCLFRAIAHVA LR GE  PDE RQRELADELRA VV+ELL+RR++ EWFI+ D
Sbjct: 181  GVPADGRCLFRAIAHVAWLRKGESVPDENRQRELADELRALVVEELLKRRKDAEWFIEGD 240

Query: 314  FDAYVKRMQQPYVWGGEPELLMASHVVKTQISVFMVDRSSGGLVNIANYGEEYQKDEKSP 135
            FDAYV+R+++PYVWGGEPELLMASHV+K  ISVFM+DRSSG L+NIA YGEEY+KDE+SP
Sbjct: 241  FDAYVERIEKPYVWGGEPELLMASHVLKAPISVFMIDRSSGNLINIAKYGEEYKKDEESP 300

Query: 134  IKVLFHGYGHYDILDAISDQTFQKVEA*TSSCRSEVQ 24
            I +LFHGYGHYDI+D +S + +QKVE   S  RS ++
Sbjct: 301  INILFHGYGHYDIVDVVS-EGYQKVEGGISESRSSLE 336


Top