BLASTX nr result

ID: Glycyrrhiza36_contig00017515 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza36_contig00017515
         (1220 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_004496177.1 PREDICTED: OTU domain-containing protein At3g5781...   408   e-139
BAE71258.1 hypothetical protein [Trifolium pratense]                  395   e-133
XP_013469378.1 OTU-like cysteine protease [Medicago truncatula] ...   380   e-128
XP_003536306.1 PREDICTED: uncharacterized protein LOC100793001 [...   374   e-125
XP_017413456.1 PREDICTED: uncharacterized protein LOC108324995 [...   366   e-123
XP_003556279.1 PREDICTED: OTU domain-containing protein At3g5781...   365   e-122
XP_014512510.1 PREDICTED: uncharacterized protein LOC106771118 [...   364   e-122
XP_007143828.1 hypothetical protein PHAVU_007G105100g [Phaseolus...   362   e-121
XP_016177333.1 PREDICTED: uncharacterized protein LOC107619558 [...   362   e-120
XP_015941210.1 PREDICTED: uncharacterized protein LOC107466718 [...   360   e-120
XP_004142455.1 PREDICTED: OTU domain-containing protein At3g5781...   327   e-107
XP_016900257.1 PREDICTED: OTU domain-containing protein At3g5781...   325   e-106
XP_019459096.1 PREDICTED: uncharacterized protein LOC109359045 [...   323   e-105
XP_010032108.1 PREDICTED: OTU domain-containing protein At3g5781...   316   e-103
OMO50984.1 Ovarian tumor, otubain [Corchorus olitorius]               315   e-102
OMO98833.1 Ovarian tumor, otubain [Corchorus capsularis]              314   e-102
XP_018845374.1 PREDICTED: uncharacterized protein LOC109009371 [...   313   e-101
KHN37847.1 OTU domain-containing protein [Glycine soja]               310   e-101
GAU40884.1 hypothetical protein TSUD_40590 [Trifolium subterraneum]   310   e-101
EOY19029.1 Cysteine proteinases superfamily protein isoform 1 [T...   310   e-100

>XP_004496177.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cicer
            arietinum]
          Length = 313

 Score =  408 bits (1048), Expect = e-139
 Identities = 216/316 (68%), Positives = 235/316 (74%), Gaps = 7/316 (2%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFLHASS---AARLAHGT-AYSSASPRFSRPGHDGARRQHSSSCE 986
            MLGVLCATR +PWIFSFLH+S+   AARLAH T A SS S RF       ARR HSS+CE
Sbjct: 1    MLGVLCATRSRPWIFSFLHSSASHHAARLAHCTVACSSLSTRFDATF--AARRHHSSACE 58

Query: 985  LRGXXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 806
            L+       SIWHAI PCGGDGFRRGVV V HDH+LKGEGSWNVAWDARPARWLH  DSA
Sbjct: 59   LQ-LGGGAASIWHAIRPCGGDGFRRGVVTVQHDHDLKGEGSWNVAWDARPARWLHRSDSA 117

Query: 805  WLLFGVCDCLXXXXXXXXXXXXXXXXXXES---SEGREVKVAECDSKEQDDEVSSDYRVT 635
            WLLFGVC CL                   +   SEGRE+K AE D KE++DE+S+DYRVT
Sbjct: 118  WLLFGVCACLAPPVIADVDLEAPPTPAINTDENSEGREMKYAEGD-KERNDELSADYRVT 176

Query: 634  GVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWFLEGD 455
            GVLADGRCLFRA+AHGACL NGE APNENRQ ELAD                 EWF+EGD
Sbjct: 177  GVLADGRCLFRAIAHGACLNNGEEAPNENRQRELADELRARVAEELLKRRKETEWFIEGD 236

Query: 454  FDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEIS 275
            FDAYV RI+Q + WGGEPELLMASHVLKTPI VFMRD SSIDLVNIAKYGEEY ND+EIS
Sbjct: 237  FDAYVNRIRQTYVWGGEPELLMASHVLKTPIYVFMRDASSIDLVNIAKYGEEYMNDKEIS 296

Query: 274  INVLFHRYGHYDILET 227
            INVLFHR+GHY+ILET
Sbjct: 297  INVLFHRHGHYEILET 312


>BAE71258.1 hypothetical protein [Trifolium pratense]
          Length = 326

 Score =  395 bits (1014), Expect = e-133
 Identities = 214/330 (64%), Positives = 230/330 (69%), Gaps = 13/330 (3%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFLHASSA-----ARLAHGTAYSSASPRFSRPGHDGARRQHSSSC 989
            MLGVLCATR +PWIFSFLH SS+     ARLAH T  SS+S     P    ARR HSS C
Sbjct: 1    MLGVLCATRSRPWIFSFLHHSSSHHHHTARLAHITVASSSS---LSPTFFSARRNHSSQC 57

Query: 988  ELR-GXXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPD 812
            +L+        SIWHAIMPCGGDGF+RG   VHHDHELKGEGSWNVAWDARPARWLH  D
Sbjct: 58   KLQISAGGGAASIWHAIMPCGGDGFQRGAFMVHHDHELKGEGSWNVAWDARPARWLHRSD 117

Query: 811  SAWLLFGVCDCL-------XXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVS 653
            SAWLLFGV   L                           SEG E+K AE D  + +DE+S
Sbjct: 118  SAWLLFGVRAWLAPPPVIVDVDPEVPLPTSVISPDEISRSEGLEIKDAESD--KPNDELS 175

Query: 652  SDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXE 473
            SDYRVTGVLADGRCLFRALAHGACLKNGE APNENRQ ELAD                 E
Sbjct: 176  SDYRVTGVLADGRCLFRALAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETE 235

Query: 472  WFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYR 293
            WF+EGDFD YV RIQQ F WGGEPELLMASHVLKTPI VFMRD +SIDLVNIAKYGEEY 
Sbjct: 236  WFIEGDFDTYVTRIQQSFVWGGEPELLMASHVLKTPIFVFMRDPNSIDLVNIAKYGEEYM 295

Query: 292  NDEEISINVLFHRYGHYDILETS*PKLPKK 203
            NDE ISINVLFHR+GHY++LET  PKL +K
Sbjct: 296  NDEGISINVLFHRHGHYELLETLCPKLSQK 325


>XP_013469378.1 OTU-like cysteine protease [Medicago truncatula] KEH43416.1 OTU-like
            cysteine protease [Medicago truncatula]
          Length = 305

 Score =  380 bits (976), Expect = e-128
 Identities = 199/315 (63%), Positives = 217/315 (68%), Gaps = 6/315 (1%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELR-- 980
            MLGVLCATR +PWIFS  H   A RL+H T      P         ARR HS++C     
Sbjct: 1    MLGVLCATRSRPWIFSSHHHHHAFRLSHATVAPLTFP---------ARRHHSTACNNLQI 51

Query: 979  GXXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWL 800
                   SIWHAI PCGGDGFR G V +HHDHELKGEGSWNVAWDARPARWLH  DSAWL
Sbjct: 52   STGGGAASIWHAITPCGGDGFRTGGVMLHHDHELKGEGSWNVAWDARPARWLHRSDSAWL 111

Query: 799  LFGVCDCLXXXXXXXXXXXXXXXXXXE----SSEGREVKVAECDSKEQDDEVSSDYRVTG 632
            LFGVC CL                       SSEGRE+K    D  E+DDE+++DYRVTG
Sbjct: 112  LFGVCACLAPPVVLDVDPEAAAPTPAVFPNESSEGREMKDELSD--ERDDELNADYRVTG 169

Query: 631  VLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWFLEGDF 452
            VLADGRCLFRA+AHGACLKNGE APNE+RQ ELAD                 EWF+EGDF
Sbjct: 170  VLADGRCLFRAIAHGACLKNGEEAPNESRQRELADELRVKVAEELLNRRKETEWFIEGDF 229

Query: 451  DAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISI 272
            D YV RIQQ + WGGEPELLMASHVLKTPI VFMRD SS+DLVNIAKYGEEY NDEEISI
Sbjct: 230  DTYVTRIQQTYVWGGEPELLMASHVLKTPIYVFMRDASSMDLVNIAKYGEEYMNDEEISI 289

Query: 271  NVLFHRYGHYDILET 227
            NVLFHR+GHY++LET
Sbjct: 290  NVLFHRHGHYELLET 304


>XP_003536306.1 PREDICTED: uncharacterized protein LOC100793001 [Glycine max]
            KRH34730.1 hypothetical protein GLYMA_10G202000 [Glycine
            max]
          Length = 296

 Score =  374 bits (959), Expect = e-125
 Identities = 195/313 (62%), Positives = 219/313 (69%), Gaps = 1/313 (0%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFLHA-SSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRG 977
            MLGVLCATRPKPW+ S +H  +S  RL H     SASP          RR+HS++C+L  
Sbjct: 1    MLGVLCATRPKPWLLSLVHVHASLPRLPHSPLSPSASPP--------PRRRHSTACKLFL 52

Query: 976  XXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLL 797
                  SIWHAIMP G DG RRGVVAVH   +LKGEGSWNVAWDARPARWLH PDSAWLL
Sbjct: 53   SGGAAASIWHAIMPRGDDGLRRGVVAVH---DLKGEGSWNVAWDARPARWLHRPDSAWLL 109

Query: 796  FGVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSSDYRVTGVLADG 617
            FGVC CL                  + S G        D + ++DEVS+DYRVTGV ADG
Sbjct: 110  FGVCACLAPPPGCVDADTNSAGIAVDESCGL------LDKEREEDEVSADYRVTGVPADG 163

Query: 616  RCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWFLEGDFDAYVK 437
            RCLFRA+AHGACL+NGE AP+ENRQ ELAD                 EWF+EGDFD Y++
Sbjct: 164  RCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQ 223

Query: 436  RIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLFH 257
            RIQQP+ WGGEPELLMASHVLKTPISVFMRDT S++LVNIAKYGEEYRND++ISINVLFH
Sbjct: 224  RIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFH 283

Query: 256  RYGHYDILETS*P 218
             YGHYDILET  P
Sbjct: 284  GYGHYDILETLRP 296


>XP_017413456.1 PREDICTED: uncharacterized protein LOC108324995 [Vigna angularis]
            KOM35649.1 hypothetical protein LR48_Vigan02g179900
            [Vigna angularis] BAT94560.1 hypothetical protein
            VIGAN_08117300 [Vigna angularis var. angularis]
          Length = 290

 Score =  366 bits (940), Expect = e-123
 Identities = 197/310 (63%), Positives = 214/310 (69%), Gaps = 1/310 (0%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 974
            MLGVLCATRPKPW+FS +HAS   RL H +    ASP          RR HSS+C+L G 
Sbjct: 1    MLGVLCATRPKPWLFSLVHASPP-RLPHASVSLLASP---------PRRHHSSACKLFGS 50

Query: 973  XXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLF 794
                 SIWHAIMP  GDGFRRGVVAVH   +LKGEGSWNVAWD RPARWLH  DSAWLLF
Sbjct: 51   AGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRSDSAWLLF 107

Query: 793  GVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGR 614
            GVC CL                  ES    +        KE   +VS+DYRVTGV ADGR
Sbjct: 108  GVCACLAPPGCVDAVTDSDAVAADESCGVLD--------KELKVDVSADYRVTGVPADGR 159

Query: 613  CLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWFLEGDFDAYVKR 434
            CLFRA+AHGACL+NGE AP+ENRQ ELAD                 EWF+EGDFD YVKR
Sbjct: 160  CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKR 219

Query: 433  IQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND-EEISINVLFH 257
            IQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+YRND EE SINVLFH
Sbjct: 220  IQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYRNDKEENSINVLFH 279

Query: 256  RYGHYDILET 227
             YGHYDILE+
Sbjct: 280  GYGHYDILES 289


>XP_003556279.1 PREDICTED: OTU domain-containing protein At3g57810-like [Glycine max]
            KHN00921.1 OTU domain-containing protein [Glycine soja]
            KRG92054.1 hypothetical protein GLYMA_20G188400 [Glycine
            max]
          Length = 294

 Score =  365 bits (936), Expect = e-122
 Identities = 194/311 (62%), Positives = 216/311 (69%), Gaps = 2/311 (0%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 974
            MLGVLCATR KPW+FS +HAS   RL+H     SASP          RR+HS++C+L   
Sbjct: 1    MLGVLCATRSKPWLFSLVHAS-LPRLSHAPLSPSASPP--------PRRRHSTACKLFLS 51

Query: 973  XXXXXSIWHAIMPC--GGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWL 800
                 SIWHAIMP     DGFRRGVVA H   ++KGEGSWNVAWDARPARWLH PDSAWL
Sbjct: 52   AGGAASIWHAIMPRVNDDDGFRRGVVAFH---DMKGEGSWNVAWDARPARWLHRPDSAWL 108

Query: 799  LFGVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSSDYRVTGVLAD 620
            LFGVC CL                  + S          D + ++ EVS+DYRVTGV AD
Sbjct: 109  LFGVCACLAPPSSCVDADTNTDAIAVDES------CRLLDKEREEYEVSADYRVTGVPAD 162

Query: 619  GRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWFLEGDFDAYV 440
            GRCLFRA+AHGACL+NGE AP+ENRQ ELAD                 EWF+EGDFD YV
Sbjct: 163  GRCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELMKRREETEWFIEGDFDTYV 222

Query: 439  KRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLF 260
            +RIQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGEEYRND+EISINVLF
Sbjct: 223  QRIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEEYRNDKEISINVLF 282

Query: 259  HRYGHYDILET 227
            H YGHYDILET
Sbjct: 283  HGYGHYDILET 293


>XP_014512510.1 PREDICTED: uncharacterized protein LOC106771118 [Vigna radiata var.
            radiata]
          Length = 290

 Score =  364 bits (934), Expect = e-122
 Identities = 196/310 (63%), Positives = 213/310 (68%), Gaps = 1/310 (0%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 974
            MLGVLCATRPKPW+FS +HAS   RL H +    ASP          RR HSS+C+L G 
Sbjct: 1    MLGVLCATRPKPWLFSLVHASPP-RLPHASVSLLASP---------PRRHHSSACKLFGS 50

Query: 973  XXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLF 794
                 SIWHAIMP  GDGFRRGVVAVH   +LKGEGSWNVAWD RPARWLH  DSAWLLF
Sbjct: 51   AGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRSDSAWLLF 107

Query: 793  GVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGR 614
            GVC CL                  ES    +        KE   +VS+DYRVTGV ADGR
Sbjct: 108  GVCACLAPPGCVDAVTDSDAVAADESCGVLD--------KELKVDVSADYRVTGVPADGR 159

Query: 613  CLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWFLEGDFDAYVKR 434
            CLFRA+AHGACL+NGE AP+ENRQ ELAD                 EWF+EGDFD YVKR
Sbjct: 160  CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKR 219

Query: 433  IQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND-EEISINVLFH 257
            IQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+Y ND EE SINVLFH
Sbjct: 220  IQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYMNDKEENSINVLFH 279

Query: 256  RYGHYDILET 227
             YGHYDILE+
Sbjct: 280  GYGHYDILES 289


>XP_007143828.1 hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris]
            ESW15822.1 hypothetical protein PHAVU_007G105100g
            [Phaseolus vulgaris]
          Length = 305

 Score =  362 bits (928), Expect = e-121
 Identities = 200/322 (62%), Positives = 217/322 (67%), Gaps = 1/322 (0%)
 Frame = -2

Query: 1189 NPAHDSHLCTSSMLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGAR 1010
            NPAHDS   +S MLGVLCATRP+PW+FS +HAS   RL H +   SASP          R
Sbjct: 7    NPAHDSF--SSPMLGVLCATRPRPWLFSHVHAS-LPRLVHASVSLSASP---------PR 54

Query: 1009 RQHSSSCELRGXXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPAR 830
            R HSS+C++ G      SIWHAIMP  GD FRRGVV VH   +LKGEGSWNVAWD RPAR
Sbjct: 55   RHHSSACKIFGSAGGAASIWHAIMPRSGDRFRRGVVPVH---DLKGEGSWNVAWDTRPAR 111

Query: 829  WLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSS 650
            WLH PDSAWLLFGVC CL                  ES    +V+ A  D         +
Sbjct: 112  WLHRPDSAWLLFGVCACLAPPGCVDVVTDFEAVAVDESCGVLKVE-ASAD--------YA 162

Query: 649  DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEW 470
            DYRVTGV ADGRCLFRA+AHG CL+NGE AP+EN Q ELAD                 EW
Sbjct: 163  DYRVTGVPADGRCLFRAIAHGDCLRNGEKAPDENCQRELADELRAKVVDELLKRREETEW 222

Query: 469  FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 290
            F+EGDFD YVKRIQQPF WGGEPELLMASHVLKTPISVFMR T S+ LVNIAKYGEEYRN
Sbjct: 223  FIEGDFDTYVKRIQQPFVWGGEPELLMASHVLKTPISVFMRATGSVGLVNIAKYGEEYRN 282

Query: 289  D-EEISINVLFHRYGHYDILET 227
            D EE SINVLFH YGHYDILET
Sbjct: 283  DKEENSINVLFHGYGHYDILET 304


>XP_016177333.1 PREDICTED: uncharacterized protein LOC107619558 [Arachis ipaensis]
          Length = 327

 Score =  362 bits (929), Expect = e-120
 Identities = 202/328 (61%), Positives = 219/328 (66%), Gaps = 21/328 (6%)
 Frame = -2

Query: 1147 GVLCATRPKPWIFS--FLHAS---SAARLAHGTAYSSASPRFSRP-GHDGARRQHSSSCE 986
            GVLCATRPKPWI S   LHAS   S+ARL H      A P F +      ARR HSS+C 
Sbjct: 4    GVLCATRPKPWILSAAILHASLHHSSARLLH------APPLFPQLLRRTDARRHHSSACN 57

Query: 985  LRGXXXXXXS--IWHAIMPCGGDGF------RRGVVAVHH-DHELKGEGSWNVAWDARPA 833
              G      +  IWHAIMPCGG          RGVVAVHH DHELKGEGSWNVAWDARPA
Sbjct: 58   HGGDFGGGGAASIWHAIMPCGGGAGSGKKLRHRGVVAVHHHDHELKGEGSWNVAWDARPA 117

Query: 832  RWLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXE------SSEGREVKVAECDSKE 671
            RWLH PDSAWLLFGVC CL                         + EG+ VKV       
Sbjct: 118  RWLHRPDSAWLLFGVCACLAPPVSSVTDLEATPPATATVVNRDINPEGQGVKV------- 170

Query: 670  QDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXX 491
              D +SSDYRVTGVLADGRCLFRA+AHGACL+NGEAAP+E RQ ELAD            
Sbjct: 171  --DGLSSDYRVTGVLADGRCLFRAIAHGACLRNGEAAPDERRQRELADELRAQVVEELMK 228

Query: 490  XXXXXEWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 311
                 EWF+EGDFD YVKRIQQP+ WGGEPELLMASHVLKTPISVFMRDTSS+ LVNIAK
Sbjct: 229  RREETEWFIEGDFDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSLSLVNIAK 288

Query: 310  YGEEYRNDEEISINVLFHRYGHYDILET 227
            YGEEYRN++++ INVLFH YGHYDILET
Sbjct: 289  YGEEYRNEKDVCINVLFHGYGHYDILET 316


>XP_015941210.1 PREDICTED: uncharacterized protein LOC107466718 [Arachis duranensis]
          Length = 327

 Score =  360 bits (925), Expect = e-120
 Identities = 201/328 (61%), Positives = 219/328 (66%), Gaps = 21/328 (6%)
 Frame = -2

Query: 1147 GVLCATRPKPWIFS--FLHAS---SAARLAHGTAYSSASPRFSRP-GHDGARRQHSSSCE 986
            GVLCATRPKPWI S   LHAS   S+ARL H      A P F +       RR HSS+C 
Sbjct: 4    GVLCATRPKPWILSAAILHASLHHSSARLLH------APPLFPQLLRRTDTRRHHSSACN 57

Query: 985  LRGXXXXXXS--IWHAIMPCGGDGF------RRGVVAVHH-DHELKGEGSWNVAWDARPA 833
              G      +  IWHAIMPCGG          RGVVAVHH DHELKGEGSWNVAWDARPA
Sbjct: 58   HGGDFGGGGAASIWHAIMPCGGGAGSGKKLRHRGVVAVHHHDHELKGEGSWNVAWDARPA 117

Query: 832  RWLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXE------SSEGREVKVAECDSKE 671
            RWLH PDSAWLLFGVC CL                         ++EG+ VKV       
Sbjct: 118  RWLHRPDSAWLLFGVCACLAPPVSSVADLEATPPATATVVNRDMNTEGQGVKV------- 170

Query: 670  QDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXX 491
              D +SSDYRVTGVLADGRCLFRA+AHGACL+NGEAAP+E RQ ELAD            
Sbjct: 171  --DGLSSDYRVTGVLADGRCLFRAIAHGACLRNGEAAPDERRQRELADELRAQVVEELMK 228

Query: 490  XXXXXEWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 311
                 EWF+EGDFD YVKRIQQP+ WGGEPELLMASHVLKTPISVFMRDTSS+ LVNIAK
Sbjct: 229  RREETEWFIEGDFDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSLSLVNIAK 288

Query: 310  YGEEYRNDEEISINVLFHRYGHYDILET 227
            YGEEYRN++++ INVLFH YGHYDILET
Sbjct: 289  YGEEYRNEKDMCINVLFHGYGHYDILET 316


>XP_004142455.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
            sativus] KGN52210.1 hypothetical protein Csa_5G615810
            [Cucumis sativus]
          Length = 313

 Score =  327 bits (837), Expect = e-107
 Identities = 179/321 (55%), Positives = 205/321 (63%), Gaps = 4/321 (1%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIF----SFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCE 986
            MLGVLCA RPKPWI     +F+H S+     H   + S     S    D  +R HSS+C+
Sbjct: 1    MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH---HQSRLLVQSPIQFDRRQRHHSSACK 56

Query: 985  LRGXXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 806
            L G       IWHAIMP G            H HE KGEGSWNVAWDARPARWLH PDSA
Sbjct: 57   LAGGGAAS--IWHAIMPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSA 114

Query: 805  WLLFGVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSSDYRVTGVL 626
            WLLFGVC C+                     + +EV  +      Q+DE S+DYRVTGVL
Sbjct: 115  WLLFGVCACIAPLDWVDASHEAVSL-----DQKKEVCESSGPEFNQNDESSADYRVTGVL 169

Query: 625  ADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWFLEGDFDA 446
            ADGRCLFRA+AHGACL++GE AP+++RQ ELAD                 EW++EGDFDA
Sbjct: 170  ADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDA 229

Query: 445  YVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINV 266
            YVKRIQQPF WGGEPELLMASHVLKTPISVFMR+ SS  L+NIAKYG+EY+  EE  INV
Sbjct: 230  YVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQKGEESPINV 289

Query: 265  LFHRYGHYDILETS*PKLPKK 203
            LFH YGHYDILETS  K+  K
Sbjct: 290  LFHGYGHYDILETSSDKVSLK 310


>XP_016900257.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
            melo]
          Length = 313

 Score =  325 bits (832), Expect = e-106
 Identities = 178/321 (55%), Positives = 205/321 (63%), Gaps = 4/321 (1%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIF----SFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCE 986
            MLGVLCA RPKPWI     +F+H S+     H   + S     S    D  +R HSS+C+
Sbjct: 1    MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH---HQSRLLVQSPIQFDRRQRHHSSACK 56

Query: 985  LRGXXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 806
            L G       IWHAI+P G            H HE KGEGSWNVAWDARPARWLH PDSA
Sbjct: 57   LAGGGAAS--IWHAILPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSA 114

Query: 805  WLLFGVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSSDYRVTGVL 626
            WLLFGVC C+                     + +EV  +      Q+DE S+DYRVTGVL
Sbjct: 115  WLLFGVCACIAPLDWVDASHEAVSL-----DQKKEVCESSGPEFNQNDESSADYRVTGVL 169

Query: 625  ADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWFLEGDFDA 446
            ADGRCLFRA+AHGACL++GE AP+++RQ ELAD                 EW++EGDFDA
Sbjct: 170  ADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDA 229

Query: 445  YVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINV 266
            YVKRIQQPF WGGEPELLMASHVLKTPISVFMR+ SS  L+NIAKYG+EY+  EE  INV
Sbjct: 230  YVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQMGEESPINV 289

Query: 265  LFHRYGHYDILETS*PKLPKK 203
            LFH YGHYDILETS  K+  K
Sbjct: 290  LFHGYGHYDILETSSDKVSLK 310


>XP_019459096.1 PREDICTED: uncharacterized protein LOC109359045 [Lupinus
            angustifolius] OIW01500.1 hypothetical protein
            TanjilG_19426 [Lupinus angustifolius]
          Length = 319

 Score =  323 bits (828), Expect = e-105
 Identities = 181/318 (56%), Positives = 205/318 (64%), Gaps = 8/318 (2%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFLHASSAARLAHGTA--YSSASPRFSRPGHDGARRQHSSSCELR 980
            ML  LC TRPKP   S     +A+   H +A   + +S  F  PG DG RR HSS+C + 
Sbjct: 1    MLAALC-TRPKPSFLSSFFFQTASLHNHNSARFINGSSLHFYCPGGDGRRRHHSSACTIG 59

Query: 979  GXXXXXXSIWHAIMPCGGDGF----RRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPD 812
            G      SIWH ++P           R   A+ H HEL+GEGSWN AWDARP+RWLH PD
Sbjct: 60   GSCGGAASIWHVVLPERAGASICCDLRWRSALPH-HELRGEGSWNAAWDARPSRWLHRPD 118

Query: 811  SAWLLFGVCDCLXXXXXXXXXXXXXXXXXXESSEGR-EVKVAECDSKEQDDEVSSDYRVT 635
            SAWLLFGVC CL                  +S  G  ++K   CD   + +EVSS YR+T
Sbjct: 119  SAWLLFGVCACLAPPLLLADVNTEVPSAEHDSDGGGGDLKGPGCD---EQNEVSSAYRIT 175

Query: 634  GVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWFLEGD 455
            GVLADGRCLFRA+AHGACL NGE AP+ENRQ ELAD                 EWF+EGD
Sbjct: 176  GVLADGRCLFRAIAHGACLMNGEEAPDENRQRELADELRAQVVEELMKRREETEWFIEGD 235

Query: 454  FDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEY-RNDEEI 278
            FDAYV RIQQPF WGGEPELLMASHVLKTPISVFMRD SS DLVNIAKYGEEY   ++EI
Sbjct: 236  FDAYVTRIQQPFVWGGEPELLMASHVLKTPISVFMRDRSSGDLVNIAKYGEEYITKEKEI 295

Query: 277  SINVLFHRYGHYDILETS 224
            +INVLFH YGHYDILE S
Sbjct: 296  AINVLFHGYGHYDILEIS 313


>XP_010032108.1 PREDICTED: OTU domain-containing protein At3g57810 [Eucalyptus
            grandis] KCW51502.1 hypothetical protein EUGRSUZ_J01018
            [Eucalyptus grandis]
          Length = 314

 Score =  316 bits (809), Expect = e-103
 Identities = 179/320 (55%), Positives = 201/320 (62%), Gaps = 11/320 (3%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFS--FLHASSAARLAHGTAYSSASPRFSRPGHDG---ARRQHSSSC 989
            MLGVLCA RPKPWI +  F HAS+A         S+A+ R            RR HSSSC
Sbjct: 1    MLGVLCA-RPKPWILASCFSHASAAHHCGRLAWVSAAAARLQLAADSPDRWRRRHHSSSC 59

Query: 988  ELRGXXXXXXS-----IWHAIMPCG-GDGFRRGVVAVHHDHELKGEGSWNVAWDARPARW 827
             L G            IWHAI+P G GD  RR  +        +GEGSWNVAWDARPARW
Sbjct: 60   RLGGASSCAHPCGVASIWHAILPSGEGDPPRR--MDQPRRPVFRGEGSWNVAWDARPARW 117

Query: 826  LHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSSD 647
            LH PDSAWLLFGVC CL                        E +V + DS ++    S D
Sbjct: 118  LHRPDSAWLLFGVCACLAPVDAAEPSREEVVP---------EARVEDRDSLDEAKRSSPD 168

Query: 646  YRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWF 467
            YRVTGVLADGRCLFRA+AH ACL+ GEAAP++NRQ ELAD                 EW 
Sbjct: 169  YRVTGVLADGRCLFRAIAHCACLRKGEAAPDDNRQRELADELRAQVVAELLKRREETEWA 228

Query: 466  LEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND 287
            +EGDFDAY++RIQQP+ WGGEPELLMASHVLKTPISVFM D SS +LVN+AKYGEEYR D
Sbjct: 229  IEGDFDAYIERIQQPYVWGGEPELLMASHVLKTPISVFMVDRSSGNLVNVAKYGEEYRKD 288

Query: 286  EEISINVLFHRYGHYDILET 227
            EEI INVLFH YGHYDILE+
Sbjct: 289  EEIPINVLFHGYGHYDILES 308


>OMO50984.1 Ovarian tumor, otubain [Corchorus olitorius]
          Length = 327

 Score =  315 bits (806), Expect = e-102
 Identities = 173/321 (53%), Positives = 199/321 (61%), Gaps = 12/321 (3%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFL----HASSAARLAHGTAYSSASPRFSRPGHDGAR-RQHSSSC 989
            MLGVLCA  PKPWI + L    H   +A   H +      P F+    D  R R HS++C
Sbjct: 1    MLGVLCARPPKPWILNSLSLVAHGGGSAAHHHDSRLLHW-PHFAHISADNRRCRHHSTAC 59

Query: 988  ELRGXXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 809
             L G      SIWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH PDS
Sbjct: 60   RLGGSDGGAASIWHAILPCGGSGRGRKREEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119

Query: 808  AWLLFGVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSS------- 650
            AWLLFGVC CL                     EG E+      S ++   +SS       
Sbjct: 120  AWLLFGVCACLAPMIEFVDVNPETDDKI----EGAELISINGLSADEKSSISSSPVAAPD 175

Query: 649  DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEW 470
            +Y+VTGVLADGRCLFRA+AHGACL++GE AP+E RQ ELAD                 EW
Sbjct: 176  NYKVTGVLADGRCLFRAIAHGACLRSGEEAPDETRQRELADELRAQVVNELLKRREETEW 235

Query: 469  FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 290
            F+EGDFDAYVK IQQP+ WGGEPELLMASHVLKTPISV+M   SS +L+ IA YGEEY+ 
Sbjct: 236  FIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTPISVYMIHRSSRNLIKIADYGEEYQK 295

Query: 289  DEEISINVLFHRYGHYDILET 227
            D+E  INVLFH YGHYDILE+
Sbjct: 296  DKETPINVLFHGYGHYDILES 316


>OMO98833.1 Ovarian tumor, otubain [Corchorus capsularis]
          Length = 327

 Score =  314 bits (804), Expect = e-102
 Identities = 173/321 (53%), Positives = 199/321 (61%), Gaps = 12/321 (3%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFL----HASSAARLAHGTAYSSASPRFSRPGHDGAR-RQHSSSC 989
            MLGVLCA  PKPWI + L    H   +A   H +      P F+    D  R R HS++C
Sbjct: 1    MLGVLCARPPKPWILNSLSLVAHGGGSAAHHHDSRLLHW-PHFADLSADNRRCRHHSTAC 59

Query: 988  ELRGXXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 809
             L G      SIWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH PDS
Sbjct: 60   RLGGSDGGAASIWHAILPCGGSGRGRKREEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119

Query: 808  AWLLFGVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSS------- 650
            AWLLFGVC CL                     EG E+      S ++   +SS       
Sbjct: 120  AWLLFGVCACLAPMIEFVDVNPETDDKI----EGTELISINGLSADEKSSISSSPVAAPD 175

Query: 649  DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEW 470
            +Y+VTGVLADGRCLFRA+AHGACL++GE AP+E RQ ELAD                 EW
Sbjct: 176  NYKVTGVLADGRCLFRAIAHGACLRSGEEAPDETRQRELADELRAQVVNELLKRREETEW 235

Query: 469  FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 290
            F+EGDFDAYVK IQQP+ WGGEPELLMASHVLKTPISV+M   SS +L+ IA YGEEY+ 
Sbjct: 236  FIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTPISVYMIHRSSRNLIKIADYGEEYQK 295

Query: 289  DEEISINVLFHRYGHYDILET 227
            D+E  INVLFH YGHYDILE+
Sbjct: 296  DKETPINVLFHGYGHYDILES 316


>XP_018845374.1 PREDICTED: uncharacterized protein LOC109009371 [Juglans regia]
          Length = 328

 Score =  313 bits (803), Expect = e-101
 Identities = 182/342 (53%), Positives = 209/342 (61%), Gaps = 33/342 (9%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIF-----SFLHASSAARLAHGTAYSSASPRFSRPGHDG---ARRQHS 998
            MLGVLCA RPKPWI      SF+H S+A     G   S        PG +G    RR HS
Sbjct: 1    MLGVLCA-RPKPWILTSLSSSFVHGSAAHHHITGLRQS--------PGFNGDLKPRRHHS 51

Query: 997  SSCELRGXXXXXXS-IWHAIMPCGGDGFRRGVVAVHHD---HELKGEGSWNVAWDARPAR 830
            S+C + G      + IWHAIMPCG  G    ++   +     E +GEGSWNVAWDARPAR
Sbjct: 52   SACRIDGSFGGGAASIWHAIMPCGAAGHPSDLLLRRNAMLRRERRGEGSWNVAWDARPAR 111

Query: 829  WLHSPD-SAWLLFGVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKE----QD 665
            WLH PD SAWLLFGVC CL                        E K+  CDS +    ++
Sbjct: 112  WLHRPDYSAWLLFGVCACLAPLDFAFDDSPEAIVV--------EAKIEACDSIDSNANKN 163

Query: 664  DEV----------------SSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGEL 533
            DE+                S+DYRVTGVLADGRCLFRALAHGAC ++GE AP+ENRQ EL
Sbjct: 164  DEIDGFDAIYSNTSKPKEGSADYRVTGVLADGRCLFRALAHGACSRSGEEAPDENRQREL 223

Query: 532  ADXXXXXXXXXXXXXXXXXEWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVF 353
            AD                 EWF+EGDFDAYV+RIQQPF WGGEPELLMASHVLKTPISVF
Sbjct: 224  ADELRAQVVDELLKRRKETEWFIEGDFDAYVERIQQPFVWGGEPELLMASHVLKTPISVF 283

Query: 352  MRDTSSIDLVNIAKYGEEYRNDEEISINVLFHRYGHYDILET 227
            M++ SS  LVNIAKYGEEYR +E+  INVLFH YGHYD+LE+
Sbjct: 284  MKNRSSGRLVNIAKYGEEYRKEEDSPINVLFHGYGHYDLLES 325


>KHN37847.1 OTU domain-containing protein [Glycine soja]
          Length = 234

 Score =  310 bits (793), Expect = e-101
 Identities = 155/236 (65%), Positives = 173/236 (73%)
 Frame = -2

Query: 925 DGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLFGVCDCLXXXXXXXXXX 746
           DGFRRGVVA H   ++KGEGSWNVAWDARPARWLH PDSAWLLFGVC CL          
Sbjct: 8   DGFRRGVVAFH---DMKGEGSWNVAWDARPARWLHRPDSAWLLFGVCACLAPPSSCVDAD 64

Query: 745 XXXXXXXXESSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGE 566
                   + S          D + ++DEVS+DYRVTGV ADGRCLFRA+AHGACL+NGE
Sbjct: 65  TNTDAIAVDES------CRLLDKEREEDEVSADYRVTGVPADGRCLFRAIAHGACLRNGE 118

Query: 565 AAPNENRQGELADXXXXXXXXXXXXXXXXXEWFLEGDFDAYVKRIQQPFAWGGEPELLMA 386
            AP+ENRQ ELAD                 EWF+EGDFD Y++RIQQP+ WGGEPELLMA
Sbjct: 119 KAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQRIQQPYVWGGEPELLMA 178

Query: 385 SHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLFHRYGHYDILETS*P 218
           SHVLKTPISVFMRDT S++LVNIAKYGEEYRND++ISINVLFH YGHYDILET  P
Sbjct: 179 SHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFHGYGHYDILETLRP 234


>GAU40884.1 hypothetical protein TSUD_40590 [Trifolium subterraneum]
          Length = 266

 Score =  310 bits (794), Expect = e-101
 Identities = 167/272 (61%), Positives = 182/272 (66%), Gaps = 11/272 (4%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFLHASS----AARLAHGT-AYSSASPRFSRPGHDGARRQHSSSC 989
            MLGVLCATR +PWIFSFLH+SS     ARLAH T + SS  P FS      ARR HSS C
Sbjct: 1    MLGVLCATRSRPWIFSFLHSSSHHNHTARLAHATVSASSLCPTFS------ARRNHSSQC 54

Query: 988  ELRGXXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 809
            +L+       SIWHAIMPCGGDG ++G   VHHDHELKGEGSWNVAWDARPARWLH  DS
Sbjct: 55   KLQISTGGAASIWHAIMPCGGDGLQQGGFMVHHDHELKGEGSWNVAWDARPARWLHRSDS 114

Query: 808  AWLLFGVCDCLXXXXXXXXXXXXXXXXXXESSEG---REVK-VAECDSKEQDDEVSS--D 647
            AWLLFGVC CL                     E    RE+K + + +S +  DE+SS  D
Sbjct: 115  AWLLFGVCACLAPPVDVEAEVPPLTTSVISPDENYKRREIKDIKDAESDKPSDELSSEAD 174

Query: 646  YRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXEWF 467
            YRVTGVLADGRCLFRA+AHGACLKNGE APNENRQ ELAD                 EWF
Sbjct: 175  YRVTGVLADGRCLFRAIAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETEWF 234

Query: 466  LEGDFDAYVKRIQQPFAWGGEPELLMASHVLK 371
            +EGDFD YV RIQQ F WGGEPELLMASHVLK
Sbjct: 235  IEGDFDTYVTRIQQTFVWGGEPELLMASHVLK 266


>EOY19029.1 Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao]
          Length = 327

 Score =  310 bits (794), Expect = e-100
 Identities = 173/325 (53%), Positives = 199/325 (61%), Gaps = 16/325 (4%)
 Frame = -2

Query: 1153 MLGVLCATRPKPWIFSFLHASSAARLAHG--TAYSSASPRFSRPGH-------DGARRQH 1001
            MLGVLCA  PKPWI +     S + +AHG   A+   S     P H       D   R H
Sbjct: 1    MLGVLCARPPKPWILN-----SLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHH 55

Query: 1000 SSSCELRGXXXXXXSIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLH 821
            S++C L G      SIWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH
Sbjct: 56   STACRLGGSDGGAASIWHAILPCGGGGGGRRRGEVWKNVERKGEGSWNVAWDARPARWLH 115

Query: 820  SPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXESSEGREVKVAECDSKEQDDEVSS--- 650
             PDSAWLLFGVC CL                     EG E+ +    S ++    SS   
Sbjct: 116  RPDSAWLLFGVCACLAPMIEFVDVNPDADDKI----EGAELNLVSRLSADEKSSSSSSSV 171

Query: 649  ----DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXX 482
                + +VTGVLADGRCLFRA+AHGACL++GE AP+EN Q ELAD               
Sbjct: 172  AAADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVVNELLKRRE 231

Query: 481  XXEWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGE 302
              EWF+EGDFDAYVK IQQP+ WGGEPE+LMASHVLKTPISV+M   SS +L  IAKYGE
Sbjct: 232  ETEWFIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNLTKIAKYGE 291

Query: 301  EYRNDEEISINVLFHRYGHYDILET 227
            EY+ D+E  INVLFH YGHYDILE+
Sbjct: 292  EYQKDKENPINVLFHGYGHYDILES 316

