BLASTX nr result

ID: Wisteria21_contig00010191 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Wisteria21_contig00010191
         (1573 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004496177.1| PREDICTED: OTU domain-containing protein At3...   483   e-133
dbj|BAE71258.1| hypothetical protein [Trifolium pratense]             446   e-122
ref|XP_003536306.1| PREDICTED: uncharacterized protein LOC100793...   439   e-120
gb|KOM35649.1| hypothetical protein LR48_Vigan02g179900 [Vigna a...   434   e-118
ref|XP_014512510.1| PREDICTED: uncharacterized protein LOC106771...   431   e-118
ref|XP_013469378.1| OTU-like cysteine protease [Medicago truncat...   431   e-118
ref|XP_003556279.1| PREDICTED: OTU domain-containing protein At3...   429   e-117
ref|XP_007143828.1| hypothetical protein PHAVU_007G105100g [Phas...   423   e-115
ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3...   397   e-107
ref|XP_008446786.1| PREDICTED: OTU domain-containing protein At3...   395   e-107
ref|XP_010658710.1| PREDICTED: uncharacterized protein LOC100245...   387   e-104
ref|XP_007010219.1| Cysteine proteinases superfamily protein iso...   379   e-102
ref|XP_007010220.1| Cysteine proteinases superfamily protein iso...   374   e-100
ref|XP_010032108.1| PREDICTED: OTU domain-containing protein At3...   373   e-100
ref|XP_012456105.1| PREDICTED: uncharacterized protein LOC105777...   370   2e-99
ref|XP_007220473.1| hypothetical protein PRUPE_ppa008484mg [Prun...   370   2e-99
gb|KHG26701.1| hypothetical protein F383_04817 [Gossypium arboreum]   369   3e-99
ref|XP_009793129.1| PREDICTED: uncharacterized protein LOC104240...   369   3e-99
ref|XP_008232087.1| PREDICTED: OTU domain-containing protein At3...   369   4e-99
ref|XP_009603537.1| PREDICTED: uncharacterized protein LOC104098...   366   3e-98

>ref|XP_004496177.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cicer
            arietinum]
          Length = 313

 Score =  483 bits (1243), Expect = e-133
 Identities = 243/319 (76%), Positives = 254/319 (79%), Gaps = 2/319 (0%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRRHHSS 1140
            MLGVLCATR +PWIFSFLH SA+HH ARLAH T   S    RF     D  FA RRHHSS
Sbjct: 1    MLGVLCATRSRPWIFSFLHSSASHHAARLAHCTVACSSLSTRF-----DATFAARRHHSS 55

Query: 1139 ACKLRGSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHRP 960
            AC+L+  GGGAASIWH I PCGGDGFRRGVV V HDH+LKGEGSWNVAWDARPARWLHR 
Sbjct: 56   ACELQ-LGGGAASIWHAIRPCGGDGFRRGVVTVQHDHDLKGEGSWNVAWDARPARWLHRS 114

Query: 959  DSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXE--GREMKGVECDKEREDEVSTDYR 786
            DSAWLLFGVCACLA                        GREMK  E DKER DE+S DYR
Sbjct: 115  DSAWLLFGVCACLAPPVIADVDLEAPPTPAINTDENSEGREMKYAEGDKERNDELSADYR 174

Query: 785  VTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREETEWFIE 606
            VTGVLADGRCLFRAIAHGACL NGEEAPNENRQ ELADELRA+V EELLKRR+ETEWFIE
Sbjct: 175  VTGVLADGRCLFRAIAHGACLNNGEEAPNENRQRELADELRARVAEELLKRRKETEWFIE 234

Query: 605  GDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNGKE 426
            GDFDAYV RI+Q YVWGGEPELLMASHVLKTPI VFMRD SSIDLVNIAKYGEEY N KE
Sbjct: 235  GDFDAYVNRIRQTYVWGGEPELLMASHVLKTPIYVFMRDASSIDLVNIAKYGEEYMNDKE 294

Query: 425  ISINVLFHGYGHYDILETL 369
            ISINVLFH +GHY+ILETL
Sbjct: 295  ISINVLFHRHGHYEILETL 313


>dbj|BAE71258.1| hypothetical protein [Trifolium pratense]
          Length = 326

 Score =  446 bits (1148), Expect = e-122
 Identities = 232/331 (70%), Positives = 254/331 (76%), Gaps = 10/331 (3%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFLHGSAAHH--VARLAHGTAYSSLS-PPRFSRLGQDVAFAGRRH 1149
            MLGVLCATR +PWIFSFLH S++HH   ARLAH T  SS S  P F        F+ RR+
Sbjct: 1    MLGVLCATRSRPWIFSFLHHSSSHHHHTARLAHITVASSSSLSPTF--------FSARRN 52

Query: 1148 HSSACKLR-GSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARW 972
            HSS CKL+  +GGGAASIWH IMPCGGDGF+RG   VHHDHELKGEGSWNVAWDARPARW
Sbjct: 53   HSSQCKLQISAGGGAASIWHAIMPCGGDGFQRGAFMVHHDHELKGEGSWNVAWDARPARW 112

Query: 971  LHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXE------GREMKGVECDKERE 810
            LHR DSAWLLFGV A LA                     +      G E+K  E DK   
Sbjct: 113  LHRSDSAWLLFGVRAWLAPPPVIVDVDPEVPLPTSVISPDEISRSEGLEIKDAESDKPN- 171

Query: 809  DEVSTDYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRR 630
            DE+S+DYRVTGVLADGRCLFRA+AHGACL+NGEEAPNENRQ ELADELRA+V EELLKRR
Sbjct: 172  DELSSDYRVTGVLADGRCLFRALAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRR 231

Query: 629  EETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYG 450
            +ETEWFIEGDFD YV RIQQ +VWGGEPELLMASHVLKTPI VFMRD +SIDLVNIAKYG
Sbjct: 232  KETEWFIEGDFDTYVTRIQQSFVWGGEPELLMASHVLKTPIFVFMRDPNSIDLVNIAKYG 291

Query: 449  EEYRNGKEISINVLFHGYGHYDILETLSPKL 357
            EEY N + ISINVLFH +GHY++LETL PKL
Sbjct: 292  EEYMNDEGISINVLFHRHGHYELLETLCPKL 322


>ref|XP_003536306.1| PREDICTED: uncharacterized protein LOC100793001 [Glycine max]
            gi|947086009|gb|KRH34730.1| hypothetical protein
            GLYMA_10G202000 [Glycine max]
          Length = 296

 Score =  439 bits (1130), Expect = e-120
 Identities = 230/320 (71%), Positives = 246/320 (76%), Gaps = 1/320 (0%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRRHHSS 1140
            MLGVLCATRPKPW+ S +H  A+  + RL H     S SPP             RR HS+
Sbjct: 1    MLGVLCATRPKPWLLSLVHVHAS--LPRLPHSPLSPSASPPP------------RRRHST 46

Query: 1139 ACKLRGSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHRP 960
            ACKL  SGG AASIWH IMP G DG RRGVVAVH   +LKGEGSWNVAWDARPARWLHRP
Sbjct: 47   ACKLFLSGGAAASIWHAIMPRGDDGLRRGVVAVH---DLKGEGSWNVAWDARPARWLHRP 103

Query: 959  DSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKER-EDEVSTDYRV 783
            DSAWLLFGVCACLA                        E  G+  DKER EDEVS DYRV
Sbjct: 104  DSAWLLFGVCACLAPPPGCVDADTNSAGIAVD------ESCGL-LDKEREEDEVSADYRV 156

Query: 782  TGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREETEWFIEG 603
            TGV ADGRCLFRAIAHGACLRNGE+AP+ENRQ ELADELRA+VV+ELLKRREETEWFIEG
Sbjct: 157  TGVPADGRCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEG 216

Query: 602  DFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNGKEI 423
            DFD Y++RIQQPYVWGGEPELLMASHVLKTPISVFMRDT S++LVNIAKYGEEYRN K+I
Sbjct: 217  DFDTYLQRIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDI 276

Query: 422  SINVLFHGYGHYDILETLSP 363
            SINVLFHGYGHYDILETL P
Sbjct: 277  SINVLFHGYGHYDILETLRP 296


>gb|KOM35649.1| hypothetical protein LR48_Vigan02g179900 [Vigna angularis]
          Length = 290

 Score =  434 bits (1115), Expect = e-118
 Identities = 227/318 (71%), Positives = 241/318 (75%), Gaps = 1/318 (0%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRRHHSS 1140
            MLGVLCATRPKPW+FS +H S      RL H +     SPPR             RHHSS
Sbjct: 1    MLGVLCATRPKPWLFSLVHASPP----RLPHASVSLLASPPR-------------RHHSS 43

Query: 1139 ACKLRGSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHRP 960
            ACKL GS GGA SIWH IMP  GDGFRRGVVAVH   +LKGEGSWNVAWD RPARWLHR 
Sbjct: 44   ACKLFGSAGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRS 100

Query: 959  DSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDEVSTDYRVT 780
            DSAWLLFGVCACLA                        E  GV  DKE + +VS DYRVT
Sbjct: 101  DSAWLLFGVCACLAPPGCVDAVTDSDAVAAD-------ESCGV-LDKELKVDVSADYRVT 152

Query: 779  GVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREETEWFIEGD 600
            GV ADGRCLFRAIAHGACLRNGE+AP+ENRQ ELADELRA+VV+ELLKRREETEWFIEGD
Sbjct: 153  GVPADGRCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGD 212

Query: 599  FDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNGK-EI 423
            FD YVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+YRN K E 
Sbjct: 213  FDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYRNDKEEN 272

Query: 422  SINVLFHGYGHYDILETL 369
            SINVLFHGYGHYDILE++
Sbjct: 273  SINVLFHGYGHYDILESV 290


>ref|XP_014512510.1| PREDICTED: uncharacterized protein LOC106771118 [Vigna radiata var.
            radiata]
          Length = 290

 Score =  431 bits (1109), Expect = e-118
 Identities = 226/318 (71%), Positives = 240/318 (75%), Gaps = 1/318 (0%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRRHHSS 1140
            MLGVLCATRPKPW+FS +H S      RL H +     SPPR             RHHSS
Sbjct: 1    MLGVLCATRPKPWLFSLVHASPP----RLPHASVSLLASPPR-------------RHHSS 43

Query: 1139 ACKLRGSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHRP 960
            ACKL GS GGA SIWH IMP  GDGFRRGVVAVH   +LKGEGSWNVAWD RPARWLHR 
Sbjct: 44   ACKLFGSAGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRS 100

Query: 959  DSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDEVSTDYRVT 780
            DSAWLLFGVCACLA                        E  GV  DKE + +VS DYRVT
Sbjct: 101  DSAWLLFGVCACLAPPGCVDAVTDSDAVAAD-------ESCGV-LDKELKVDVSADYRVT 152

Query: 779  GVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREETEWFIEGD 600
            GV ADGRCLFRAIAHGACLRNGE+AP+ENRQ ELADELRA+VV+ELLKRREETEWFIEGD
Sbjct: 153  GVPADGRCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGD 212

Query: 599  FDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNGK-EI 423
            FD YVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+Y N K E 
Sbjct: 213  FDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYMNDKEEN 272

Query: 422  SINVLFHGYGHYDILETL 369
            SINVLFHGYGHYDILE++
Sbjct: 273  SINVLFHGYGHYDILESV 290


>ref|XP_013469378.1| OTU-like cysteine protease [Medicago truncatula]
            gi|657404800|gb|KEH43416.1| OTU-like cysteine protease
            [Medicago truncatula]
          Length = 305

 Score =  431 bits (1109), Expect = e-118
 Identities = 221/322 (68%), Positives = 239/322 (74%), Gaps = 5/322 (1%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRRHHSS 1140
            MLGVLCATR +PWIFS  H    HH  RL+H T                + F  RRHHS+
Sbjct: 1    MLGVLCATRSRPWIFSSHHH---HHAFRLSHATV-------------APLTFPARRHHST 44

Query: 1139 ACKLR--GSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLH 966
            AC      +GGGAASIWH I PCGGDGFR G V +HHDHELKGEGSWNVAWDARPARWLH
Sbjct: 45   ACNNLQISTGGGAASIWHAITPCGGDGFRTGGVMLHHDHELKGEGSWNVAWDARPARWLH 104

Query: 965  RPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXE---GREMKGVECDKEREDEVST 795
            R DSAWLLFGVCACLA                     E   GREMK  E   ER+DE++ 
Sbjct: 105  RSDSAWLLFGVCACLAPPVVLDVDPEAAAPTPAVFPNESSEGREMKD-ELSDERDDELNA 163

Query: 794  DYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREETEW 615
            DYRVTGVLADGRCLFRAIAHGACL+NGEEAPNE+RQ ELADELR +V EELL RR+ETEW
Sbjct: 164  DYRVTGVLADGRCLFRAIAHGACLKNGEEAPNESRQRELADELRVKVAEELLNRRKETEW 223

Query: 614  FIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 435
            FIEGDFD YV RIQQ YVWGGEPELLMASHVLKTPI VFMRD SS+DLVNIAKYGEEY N
Sbjct: 224  FIEGDFDTYVTRIQQTYVWGGEPELLMASHVLKTPIYVFMRDASSMDLVNIAKYGEEYMN 283

Query: 434  GKEISINVLFHGYGHYDILETL 369
             +EISINVLFH +GHY++LETL
Sbjct: 284  DEEISINVLFHRHGHYELLETL 305


>ref|XP_003556279.1| PREDICTED: OTU domain-containing protein At3g57810-like [Glycine max]
            gi|734312743|gb|KHN00921.1| OTU domain-containing protein
            [Glycine soja] gi|947042330|gb|KRG92054.1| hypothetical
            protein GLYMA_20G188400 [Glycine max]
          Length = 294

 Score =  429 bits (1104), Expect = e-117
 Identities = 226/320 (70%), Positives = 241/320 (75%), Gaps = 3/320 (0%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRRHHSS 1140
            MLGVLCATR KPW+FS +H S    + RL+H     S SPP             RR HS+
Sbjct: 1    MLGVLCATRSKPWLFSLVHAS----LPRLSHAPLSPSASPPP------------RRRHST 44

Query: 1139 ACKLRGSGGGAASIWHVIMPC--GGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLH 966
            ACKL  S GGAASIWH IMP     DGFRRGVVA H   ++KGEGSWNVAWDARPARWLH
Sbjct: 45   ACKLFLSAGGAASIWHAIMPRVNDDDGFRRGVVAFH---DMKGEGSWNVAWDARPARWLH 101

Query: 965  RPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKERED-EVSTDY 789
            RPDSAWLLFGVCACLA                               DKERE+ EVS DY
Sbjct: 102  RPDSAWLLFGVCACLAPPSSCVDADTNTDAIAVDESCR-------LLDKEREEYEVSADY 154

Query: 788  RVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREETEWFI 609
            RVTGV ADGRCLFRAIAHGACLRNGE+AP+ENRQ ELADELRA+VV+EL+KRREETEWFI
Sbjct: 155  RVTGVPADGRCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELMKRREETEWFI 214

Query: 608  EGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNGK 429
            EGDFD YV+RIQQPYVWGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGEEYRN K
Sbjct: 215  EGDFDTYVQRIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEEYRNDK 274

Query: 428  EISINVLFHGYGHYDILETL 369
            EISINVLFHGYGHYDILETL
Sbjct: 275  EISINVLFHGYGHYDILETL 294


>ref|XP_007143828.1| hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris]
            gi|561017018|gb|ESW15822.1| hypothetical protein
            PHAVU_007G105100g [Phaseolus vulgaris]
          Length = 305

 Score =  423 bits (1088), Expect = e-115
 Identities = 229/334 (68%), Positives = 243/334 (72%), Gaps = 2/334 (0%)
 Frame = -3

Query: 1367 VRRI-TKPAHDSHLHSSMLGVLCATRPKPWIFSFLHGSAAHHVARLAHGTAYSSLSPPRF 1191
            +RRI T PAHDS   S MLGVLCATRP+PW+FS +H S    + RL H +   S SPPR 
Sbjct: 1    MRRIGTNPAHDS-FSSPMLGVLCATRPRPWLFSHVHAS----LPRLVHASVSLSASPPR- 54

Query: 1190 SRLGQDVAFAGRRHHSSACKLRGSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEG 1011
                        RHHSSACK+ GS GGAASIWH IMP  GD FRRGVV VH   +LKGEG
Sbjct: 55   ------------RHHSSACKIFGSAGGAASIWHAIMPRSGDRFRRGVVPVH---DLKGEG 99

Query: 1010 SWNVAWDARPARWLHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGV 831
            SWNVAWD RPARWLHRPDSAWLLFGVCACLA                        E  GV
Sbjct: 100  SWNVAWDTRPARWLHRPDSAWLLFGVCACLAPPGCVDVVTDFEAVAVD-------ESCGV 152

Query: 830  ECDKEREDEVSTDYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVV 651
               K        DYRVTGV ADGRCLFRAIAHG CLRNGE+AP+EN Q ELADELRA+VV
Sbjct: 153  L--KVEASADYADYRVTGVPADGRCLFRAIAHGDCLRNGEKAPDENCQRELADELRAKVV 210

Query: 650  EELLKRREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDL 471
            +ELLKRREETEWFIEGDFD YVKRIQQP+VWGGEPELLMASHVLKTPISVFMR T S+ L
Sbjct: 211  DELLKRREETEWFIEGDFDTYVKRIQQPFVWGGEPELLMASHVLKTPISVFMRATGSVGL 270

Query: 470  VNIAKYGEEYRNGK-EISINVLFHGYGHYDILET 372
            VNIAKYGEEYRN K E SINVLFHGYGHYDILET
Sbjct: 271  VNIAKYGEEYRNDKEENSINVLFHGYGHYDILET 304


>ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
            sativus] gi|700197033|gb|KGN52210.1| hypothetical protein
            Csa_5G615810 [Cucumis sativus]
          Length = 313

 Score =  397 bits (1020), Expect = e-107
 Identities = 211/325 (64%), Positives = 236/325 (72%), Gaps = 4/325 (1%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIF----SFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRR 1152
            MLGVLCA RPKPWI     +F+HGSA +H     H +     SP +F R         +R
Sbjct: 1    MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH--HQSRLLVQSPIQFDRR--------QR 49

Query: 1151 HHSSACKLRGSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARW 972
            HHSSACKL  +GGGAASIWH IMP G            H HE KGEGSWNVAWDARPARW
Sbjct: 50   HHSSACKL--AGGGAASIWHAIMPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARW 107

Query: 971  LHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDEVSTD 792
            LHRPDSAWLLFGVCAC+A                        E  G E ++   DE S D
Sbjct: 108  LHRPDSAWLLFGVCACIAPLDWVDASHEAVSLDQKKEVC---ESSGPEFNQN--DESSAD 162

Query: 791  YRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREETEWF 612
            YRVTGVLADGRCLFRAIAHGACLR+GEEAP+++RQ ELADELRA+VV+ELLKRR+ETEW+
Sbjct: 163  YRVTGVLADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWY 222

Query: 611  IEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNG 432
            IEGDFDAYVKRIQQP+VWGGEPELLMASHVLKTPISVFMR+ SS  L+NIAKYG+EY+ G
Sbjct: 223  IEGDFDAYVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQKG 282

Query: 431  KEISINVLFHGYGHYDILETLSPKL 357
            +E  INVLFHGYGHYDILET S K+
Sbjct: 283  EESPINVLFHGYGHYDILETSSDKV 307


>ref|XP_008446786.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
            melo]
          Length = 313

 Score =  395 bits (1015), Expect = e-107
 Identities = 210/325 (64%), Positives = 236/325 (72%), Gaps = 4/325 (1%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIF----SFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRR 1152
            MLGVLCA RPKPWI     +F+HGSA +H     H +     SP +F R         +R
Sbjct: 1    MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH--HQSRLLVQSPIQFDRR--------QR 49

Query: 1151 HHSSACKLRGSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARW 972
            HHSSACKL  +GGGAASIWH I+P G            H HE KGEGSWNVAWDARPARW
Sbjct: 50   HHSSACKL--AGGGAASIWHAILPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARW 107

Query: 971  LHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDEVSTD 792
            LHRPDSAWLLFGVCAC+A                        E  G E ++   DE S D
Sbjct: 108  LHRPDSAWLLFGVCACIAPLDWVDASHEAVSLDQKKEVC---ESSGPEFNQN--DESSAD 162

Query: 791  YRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREETEWF 612
            YRVTGVLADGRCLFRAIAHGACLR+GEEAP+++RQ ELADELRA+VV+ELLKRR+ETEW+
Sbjct: 163  YRVTGVLADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWY 222

Query: 611  IEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNG 432
            IEGDFDAYVKRIQQP+VWGGEPELLMASHVLKTPISVFMR+ SS  L+NIAKYG+EY+ G
Sbjct: 223  IEGDFDAYVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQMG 282

Query: 431  KEISINVLFHGYGHYDILETLSPKL 357
            +E  INVLFHGYGHYDILET S K+
Sbjct: 283  EESPINVLFHGYGHYDILETSSDKV 307


>ref|XP_010658710.1| PREDICTED: uncharacterized protein LOC100245448 [Vitis vinifera]
            gi|296090402|emb|CBI40221.3| unnamed protein product
            [Vitis vinifera]
          Length = 317

 Score =  387 bits (993), Expect = e-104
 Identities = 211/324 (65%), Positives = 229/324 (70%), Gaps = 6/324 (1%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIF---SFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRRH 1149
            MLGVLCA R KPWI    SF+HGSA HH   L H     +  P +F+  G D     RRH
Sbjct: 1    MLGVLCA-RHKPWILATLSFVHGSATHHHLHLNHHHLLGT--PIQFNGGGDDHR---RRH 54

Query: 1148 HSSACKLRGSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWL 969
            HS AC+   SGGGAASIWH I+P GGD  RR  +     H+ KGEGSWNVAWDARPARWL
Sbjct: 55   HSRACRQGSSGGGAASIWHAILPSGGD--RRSSLRPALLHDQKGEGSWNVAWDARPARWL 112

Query: 968  HRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDE---VS 798
            HRPDSAWLLFGVCACLA                        +++G     E  DE    S
Sbjct: 113  HRPDSAWLLFGVCACLAPLDSFDVDNEVVAVDD--------KIEGCNQVNEISDENNNSS 164

Query: 797  TDYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREETE 618
             DYRVTGV ADGRCLFRAIAH ACLR+GEEAP+ENRQ ELAD+LRAQVV+ELLKRREETE
Sbjct: 165  ADYRVTGVPADGRCLFRAIAHSACLRSGEEAPDENRQTELADDLRAQVVDELLKRREETE 224

Query: 617  WFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYR 438
            WFIEG+FDAYVKRIQQPYVWGGEPEL+MASHVLK PISVFM   SS DL NIA YG+EYR
Sbjct: 225  WFIEGNFDAYVKRIQQPYVWGGEPELIMASHVLKMPISVFMIGRSSGDLKNIANYGKEYR 284

Query: 437  NGKEISINVLFHGYGHYDILETLS 366
               E  INVLFHGYGHYDILET S
Sbjct: 285  IDNESPINVLFHGYGHYDILETFS 308


>ref|XP_007010219.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao]
            gi|508727132|gb|EOY19029.1| Cysteine proteinases
            superfamily protein isoform 1 [Theobroma cacao]
          Length = 327

 Score =  379 bits (974), Expect = e-102
 Identities = 208/331 (62%), Positives = 229/331 (69%), Gaps = 14/331 (4%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFL----HGSAA--HHVARLAHGTAYSSLSPPRFSRLGQDVAFAG 1158
            MLGVLCA  PKPWI + L    HG  A  HH +RL          P  F+ L  D     
Sbjct: 1    MLGVLCARPPKPWILNSLSLIAHGGLAAHHHDSRLVEW-------PTHFADLSADDRRC- 52

Query: 1157 RRHHSSACKLRGSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPA 978
             RHHS+AC+L GS GGAASIWH I+PCGG G  R    V  + E KGEGSWNVAWDARPA
Sbjct: 53   -RHHSTACRLGGSDGGAASIWHAILPCGGGGGGRRRGEVWKNVERKGEGSWNVAWDARPA 111

Query: 977  RWLHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGV---ECDKERED 807
            RWLHRPDSAWLLFGVCACLA                     EG E+  V     D++   
Sbjct: 112  RWLHRPDSAWLLFGVCACLA-----PMIEFVDVNPDADDKIEGAELNLVSRLSADEKSSS 166

Query: 806  EVST-----DYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEEL 642
              S+     + +VTGVLADGRCLFRAIAHGACLR+GE+AP+EN Q ELADELRAQVV EL
Sbjct: 167  SSSSVAAADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVVNEL 226

Query: 641  LKRREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNI 462
            LKRREETEWFIEGDFDAYVK IQQPYVWGGEPE+LMASHVLKTPISV+M   SS +L  I
Sbjct: 227  LKRREETEWFIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNLTKI 286

Query: 461  AKYGEEYRNGKEISINVLFHGYGHYDILETL 369
            AKYGEEY+  KE  INVLFHGYGHYDILE+L
Sbjct: 287  AKYGEEYQKDKENPINVLFHGYGHYDILESL 317


>ref|XP_007010220.1| Cysteine proteinases superfamily protein isoform 2 [Theobroma cacao]
            gi|508727133|gb|EOY19030.1| Cysteine proteinases
            superfamily protein isoform 2 [Theobroma cacao]
          Length = 330

 Score =  374 bits (960), Expect = e-100
 Identities = 208/334 (62%), Positives = 229/334 (68%), Gaps = 17/334 (5%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFL----HGSAA--HHVARLAHGTAYSSLSPPRFSRLGQDVAFAG 1158
            MLGVLCA  PKPWI + L    HG  A  HH +RL          P  F+ L  D     
Sbjct: 1    MLGVLCARPPKPWILNSLSLIAHGGLAAHHHDSRLVEW-------PTHFADLSADDRRC- 52

Query: 1157 RRHHSSACKLRGSGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPA 978
             RHHS+AC+L GS GGAASIWH I+PCGG G  R    V  + E KGEGSWNVAWDARPA
Sbjct: 53   -RHHSTACRLGGSDGGAASIWHAILPCGGGGGGRRRGEVWKNVERKGEGSWNVAWDARPA 111

Query: 977  RWLHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGV---ECDKERED 807
            RWLHRPDSAWLLFGVCACLA                     EG E+  V     D++   
Sbjct: 112  RWLHRPDSAWLLFGVCACLA-----PMIEFVDVNPDADDKIEGAELNLVSRLSADEKSSS 166

Query: 806  EVST-----DYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQ---VV 651
              S+     + +VTGVLADGRCLFRAIAHGACLR+GE+AP+EN Q ELADELRAQ   VV
Sbjct: 167  SSSSVAAADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVSLVV 226

Query: 650  EELLKRREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDL 471
             ELLKRREETEWFIEGDFDAYVK IQQPYVWGGEPE+LMASHVLKTPISV+M   SS +L
Sbjct: 227  NELLKRREETEWFIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNL 286

Query: 470  VNIAKYGEEYRNGKEISINVLFHGYGHYDILETL 369
              IAKYGEEY+  KE  INVLFHGYGHYDILE+L
Sbjct: 287  TKIAKYGEEYQKDKENPINVLFHGYGHYDILESL 320


>ref|XP_010032108.1| PREDICTED: OTU domain-containing protein At3g57810-like [Eucalyptus
            grandis] gi|629085145|gb|KCW51502.1| hypothetical protein
            EUGRSUZ_J01018 [Eucalyptus grandis]
          Length = 314

 Score =  373 bits (957), Expect = e-100
 Identities = 208/330 (63%), Positives = 226/330 (68%), Gaps = 14/330 (4%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFS--FLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRRHH 1146
            MLGVLCA RPKPWI +  F H SAAHH  RLA    + S +  R            RRHH
Sbjct: 1    MLGVLCA-RPKPWILASCFSHASAAHHCGRLA----WVSAAAARLQLAADSPDRWRRRHH 55

Query: 1145 SSACKLRGSGG-----GAASIWHVIMPCG-GDGFRRGVVAVHHDHELKGEGSWNVAWDAR 984
            SS+C+L G+       G ASIWH I+P G GD  RR  +        +GEGSWNVAWDAR
Sbjct: 56   SSSCRLGGASSCAHPCGVASIWHAILPSGEGDPPRR--MDQPRRPVFRGEGSWNVAWDAR 113

Query: 983  PARWLHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDE 804
            PARWLHRPDSAWLLFGVCACLA                       RE    E   E  D 
Sbjct: 114  PARWLHRPDSAWLLFGVCACLA---------------PVDAAEPSREEVVPEARVEDRDS 158

Query: 803  V------STDYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEEL 642
            +      S DYRVTGVLADGRCLFRAIAH ACLR GE AP++NRQ ELADELRAQVV EL
Sbjct: 159  LDEAKRSSPDYRVTGVLADGRCLFRAIAHCACLRKGEAAPDDNRQRELADELRAQVVAEL 218

Query: 641  LKRREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNI 462
            LKRREETEW IEGDFDAY++RIQQPYVWGGEPELLMASHVLKTPISVFM D SS +LVN+
Sbjct: 219  LKRREETEWAIEGDFDAYIERIQQPYVWGGEPELLMASHVLKTPISVFMVDRSSGNLVNV 278

Query: 461  AKYGEEYRNGKEISINVLFHGYGHYDILET 372
            AKYGEEYR  +EI INVLFHGYGHYDILE+
Sbjct: 279  AKYGEEYRKDEEIPINVLFHGYGHYDILES 308


>ref|XP_012456105.1| PREDICTED: uncharacterized protein LOC105777394 [Gossypium raimondii]
            gi|763806450|gb|KJB73388.1| hypothetical protein
            B456_011G230700 [Gossypium raimondii]
          Length = 319

 Score =  370 bits (950), Expect = 2e-99
 Identities = 205/331 (61%), Positives = 232/331 (70%), Gaps = 14/331 (4%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFL----HGSAA--HHVARLAHGTAYSSLSPPRFSRLGQDVAFAG 1158
            MLGVLCA  PKPWI + L    HG +A  HH  RL H        P  F+    D++ A 
Sbjct: 1    MLGVLCARPPKPWILNSLSLIAHGGSAAHHHENRLLHW-------PSHFA----DLSAAN 49

Query: 1157 RR--HHSSACKLRG-SGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDA 987
            RR  HHS+AC+L G S GGAASIWH I+PCGGD   +    V  + E KGEGSWNV+WDA
Sbjct: 50   RRCRHHSTACRLGGGSEGGAASIWHAILPCGGDRGVKNRGDVWKNVERKGEGSWNVSWDA 109

Query: 986  RPARWLHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKERED 807
            RPARWL R DSAWLLFGVCACLA                       +    +  D+   +
Sbjct: 110  RPARWL-RSDSAWLLFGVCACLAPMPMDEFDDVNLDAD-------NKTDASLNSDENSSN 161

Query: 806  EVST-----DYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEEL 642
             +S+     +Y+VTG+LADGRCLFRAIAHGACLR+GEEAP+ENRQ ELADELRAQVV EL
Sbjct: 162  HLSSVAAADNYKVTGILADGRCLFRAIAHGACLRSGEEAPDENRQRELADELRAQVVNEL 221

Query: 641  LKRREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNI 462
            LKRREETEWFIEGDFDAYVK IQQPYVWGGEPELLMASHVLKT ISV+M   SS +L+NI
Sbjct: 222  LKRREETEWFIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTRISVYMIHRSSGNLINI 281

Query: 461  AKYGEEYRNGKEISINVLFHGYGHYDILETL 369
            AKYGEEY+  KE  INVLFHGYGHYDILE+L
Sbjct: 282  AKYGEEYQKEKENPINVLFHGYGHYDILESL 312


>ref|XP_007220473.1| hypothetical protein PRUPE_ppa008484mg [Prunus persica]
            gi|462416935|gb|EMJ21672.1| hypothetical protein
            PRUPE_ppa008484mg [Prunus persica]
          Length = 329

 Score =  370 bits (949), Expect = 2e-99
 Identities = 205/326 (62%), Positives = 223/326 (68%), Gaps = 8/326 (2%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFS----FLHGSAAHHVARL--AHGTAYSSLSPPRFSRLGQDVAFAG 1158
            MLG LCA R K WI S    F HGSAA H +RL  AH           FS       F  
Sbjct: 1    MLGFLCARR-KTWIVSSLSSFAHGSAAAHQSRLLQAHTLPLIHQQIASFS-----CGFET 54

Query: 1157 RRHH-SSACKLRGS-GGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDAR 984
            RRHH SSAC+L  + G GAASIWH ++P   +   R +      +ELKGEGSWN AWDAR
Sbjct: 55   RRHHHSSACQLGSACGTGAASIWHALLPSSCNRRSRDLRRPAIHYELKGEGSWNAAWDAR 114

Query: 983  PARWLHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDE 804
            PARWLHRPDSAWLLFGVC CLA                     E  + K      +   +
Sbjct: 115  PARWLHRPDSAWLLFGVCNCLAPIDWADDSTPDGNDGVSNENAESFDSKCSAAPDQNNID 174

Query: 803  VSTDYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREE 624
             S DYRVTGV ADGRCLFRAIAH ACLRNGEEAP+ENRQ +LADELRAQVV+ELLKRREE
Sbjct: 175  SSADYRVTGVPADGRCLFRAIAHVACLRNGEEAPDENRQRDLADELRAQVVDELLKRREE 234

Query: 623  TEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEE 444
            TEWFIEGDFDAYVKR+QQPYVWGGEPELLMASHVLKTPISVFM D SS  LVNIA YGEE
Sbjct: 235  TEWFIEGDFDAYVKRLQQPYVWGGEPELLMASHVLKTPISVFMIDRSSAGLVNIANYGEE 294

Query: 443  YRNGKEISINVLFHGYGHYDILETLS 366
            YR  +E  INVLFHGYGHYDIL++ S
Sbjct: 295  YRKEEEKPINVLFHGYGHYDILDSFS 320


>gb|KHG26701.1| hypothetical protein F383_04817 [Gossypium arboreum]
          Length = 319

 Score =  369 bits (948), Expect = 3e-99
 Identities = 202/329 (61%), Positives = 230/329 (69%), Gaps = 12/329 (3%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFL----HGSAA--HHVARLAHGTAYSSLSPPRFSRLGQDVAFAG 1158
            MLGVLC   PKPWI + L    HG +A  HH  RL H        P  F+ L  D     
Sbjct: 1    MLGVLCTRPPKPWILNSLSLIAHGGSAAHHHENRLLHW-------PSHFADLSADNRRC- 52

Query: 1157 RRHHSSACKLRG-SGGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARP 981
             RHHS+AC+L G S GGAASIWH I+PCGGD   +    V  + E KGEGSWNV+WDARP
Sbjct: 53   -RHHSTACRLGGGSEGGAASIWHAILPCGGDRGVKNRGDVWKNVERKGEGSWNVSWDARP 111

Query: 980  ARWLHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDEV 801
            ARWL RPDSAWLLFGVCACLA                       +    +  D++  + +
Sbjct: 112  ARWL-RPDSAWLLFGVCACLAPMPMDEFDDVNLDAD-------NKTDASLNSDEKSSNHL 163

Query: 800  ST-----DYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLK 636
            S+     +++VTG+LADGRCLFRAIAHGACLR+GEEAP+ENRQ ELADELRAQVV ELLK
Sbjct: 164  SSVAAADNFKVTGILADGRCLFRAIAHGACLRSGEEAPDENRQRELADELRAQVVNELLK 223

Query: 635  RREETEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 456
            RREETEW+IEGDFDAYVK IQQPYVWGGEPELLMASHVLKT ISV+M   SS +L+NIAK
Sbjct: 224  RREETEWYIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTRISVYMIHRSSGNLINIAK 283

Query: 455  YGEEYRNGKEISINVLFHGYGHYDILETL 369
            YGEEY+  KE  INVLFHGYGHYDILE+L
Sbjct: 284  YGEEYQKEKENPINVLFHGYGHYDILESL 312


>ref|XP_009793129.1| PREDICTED: uncharacterized protein LOC104240043 [Nicotiana
            sylvestris]
          Length = 328

 Score =  369 bits (948), Expect = 3e-99
 Identities = 203/327 (62%), Positives = 225/327 (68%), Gaps = 7/327 (2%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRRHHSS 1140
            MLGVLCA RPKPW+F+ L  S AH  A  A+      +  P  S L        RRHHSS
Sbjct: 1    MLGVLCA-RPKPWLFASLSLSHAHGSAPAAYNRL---IGTPTKSVLVGGSDQLQRRHHSS 56

Query: 1139 ACKLRGS--GGGAASIWHVIMPCGG---DGFRRGVVAVHHDHEL--KGEGSWNVAWDARP 981
             C+L  S   GGAASIWH I+P G    D  RR  V  HH +EL  KGEGSWNVAWD RP
Sbjct: 57   HCRLGASVNRGGAASIWHAILPAGRRNKDVKRRNTVFHHHHYELAKKGEGSWNVAWDTRP 116

Query: 980  ARWLHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDEV 801
            ARWLH PDSAWLLFGVC+CLA                     +G     V  D+   D  
Sbjct: 117  ARWLHNPDSAWLLFGVCSCLAAPSLDLPDSNSDVVAPIENMSQGFSSNTVNSDEA--DRN 174

Query: 800  STDYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREET 621
            S +Y VTGV ADGRCLFRAIAH ACLRNGE AP+ENRQ ELADELRAQVV+ELLKRR+E 
Sbjct: 175  SANYTVTGVPADGRCLFRAIAHMACLRNGEGAPDENRQRELADELRAQVVDELLKRRKEA 234

Query: 620  EWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEY 441
            EWFIEGDFDAYV+RI++PYVWGGEPELLMASHVLK+PISV+M D SS  L+NI+ YGEEY
Sbjct: 235  EWFIEGDFDAYVERIEKPYVWGGEPELLMASHVLKSPISVYMVDRSSGSLINISNYGEEY 294

Query: 440  RNGKEISINVLFHGYGHYDILETLSPK 360
            R   E  INVLFHGYGHYDILET+S K
Sbjct: 295  RKEGENPINVLFHGYGHYDILETISEK 321


>ref|XP_008232087.1| PREDICTED: OTU domain-containing protein At3g57810-like [Prunus mume]
          Length = 329

 Score =  369 bits (947), Expect = 4e-99
 Identities = 204/326 (62%), Positives = 223/326 (68%), Gaps = 8/326 (2%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFS----FLHGSAAHHVARL--AHGTAYSSLSPPRFSRLGQDVAFAG 1158
            MLG LCA R K WI S    F HGSAA H +RL  AH           FS       F  
Sbjct: 1    MLGFLCARR-KTWIVSSLSSFAHGSAAAHQSRLLQAHTLPLIHQQIASFS-----CGFET 54

Query: 1157 RRHH-SSACKLRGS-GGGAASIWHVIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDAR 984
            RRHH SSAC+L  + G GAASIWH ++P   +   R +      +ELKGEGSWN AWDAR
Sbjct: 55   RRHHHSSACQLGSACGTGAASIWHALLPSSCNRRSRDLRRPAIHYELKGEGSWNAAWDAR 114

Query: 983  PARWLHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDE 804
            PARWLHRPDSAWLLFGVC CLA                     E  + K      +   +
Sbjct: 115  PARWLHRPDSAWLLFGVCNCLAPIDWADDSTPDGNDGVSNENAESFDSKCSAASDQNNID 174

Query: 803  VSTDYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREE 624
             S DYRVTGV ADGRCLFRAIAH ACLRNGEEAP+ENRQ +LADELRAQVV+ELLKRREE
Sbjct: 175  SSADYRVTGVPADGRCLFRAIAHVACLRNGEEAPDENRQRDLADELRAQVVDELLKRREE 234

Query: 623  TEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEE 444
            TEWFIEGDFDAYVKR+QQPYVWGGEPELLMASHVLKTPISVFM D SS  LVNIA YGE+
Sbjct: 235  TEWFIEGDFDAYVKRLQQPYVWGGEPELLMASHVLKTPISVFMIDRSSAGLVNIANYGED 294

Query: 443  YRNGKEISINVLFHGYGHYDILETLS 366
            YR  +E  INVLFHGYGHYDIL++ S
Sbjct: 295  YRKEEEKPINVLFHGYGHYDILDSFS 320


>ref|XP_009603537.1| PREDICTED: uncharacterized protein LOC104098494 [Nicotiana
            tomentosiformis]
          Length = 328

 Score =  366 bits (940), Expect = 3e-98
 Identities = 202/327 (61%), Positives = 224/327 (68%), Gaps = 7/327 (2%)
 Frame = -3

Query: 1319 MLGVLCATRPKPWIFSFLHGSAAHHVARLAHGTAYSSLSPPRFSRLGQDVAFAGRRHHSS 1140
            MLGVLCA RPKPW+F+ L  S AH  A  A+      +  P  S L        RRHHSS
Sbjct: 1    MLGVLCA-RPKPWLFASLSLSHAHGSAPAAYNRL---IGTPTKSVLVGGSDQLQRRHHSS 56

Query: 1139 ACKLRGS--GGGAASIWHVIMPCGG---DGFRRGVVAVHHDHEL--KGEGSWNVAWDARP 981
             C+L  S   GGAASIWH I+P G    D  RR  V  HH + L  KGEGSWNVAWD RP
Sbjct: 57   HCRLGASVNRGGAASIWHAILPAGRRNKDVKRRNTVFHHHHYVLAKKGEGSWNVAWDTRP 116

Query: 980  ARWLHRPDSAWLLFGVCACLAXXXXXXXXXXXXXXXXXXXXXEGREMKGVECDKEREDEV 801
            ARWLH PDSAWLLFGVC+CLA                     +G     V  D+   D  
Sbjct: 117  ARWLHNPDSAWLLFGVCSCLAAPTLDLPDSNSEVVAPIENKSQGFSSNTVNSDEV--DRN 174

Query: 800  STDYRVTGVLADGRCLFRAIAHGACLRNGEEAPNENRQMELADELRAQVVEELLKRREET 621
            S +Y VTGV ADGRCLFRAIAH ACLRNGE AP+ENRQ ELADELRAQVV+ELLKRR+E 
Sbjct: 175  SANYTVTGVPADGRCLFRAIAHMACLRNGEGAPDENRQRELADELRAQVVDELLKRRKEA 234

Query: 620  EWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEY 441
            EWFIEGDFDAYV+RI++PYVWGGEPELLMASHVLK+PISV+M D SS  L+NI+ YGEEY
Sbjct: 235  EWFIEGDFDAYVERIEKPYVWGGEPELLMASHVLKSPISVYMVDRSSGSLINISNYGEEY 294

Query: 440  RNGKEISINVLFHGYGHYDILETLSPK 360
            R   E  INVLFHGYGHYDILET+S K
Sbjct: 295  RKEGENPINVLFHGYGHYDILETISAK 321

