BLASTX nr result

ID: Glycyrrhiza32_contig00004487 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00004487
         (1220 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_004496177.1 PREDICTED: OTU domain-containing protein At3g5781...   408   e-139
BAE71258.1 hypothetical protein [Trifolium pratense]                  395   e-133
XP_013469378.1 OTU-like cysteine protease [Medicago truncatula] ...   380   e-128
XP_003536306.1 PREDICTED: uncharacterized protein LOC100793001 [...   374   e-125
XP_017413456.1 PREDICTED: uncharacterized protein LOC108324995 [...   366   e-123
XP_003556279.1 PREDICTED: OTU domain-containing protein At3g5781...   365   e-122
XP_014512510.1 PREDICTED: uncharacterized protein LOC106771118 [...   364   e-122
XP_007143828.1 hypothetical protein PHAVU_007G105100g [Phaseolus...   362   e-121
XP_016177333.1 PREDICTED: uncharacterized protein LOC107619558 [...   362   e-120
XP_015941210.1 PREDICTED: uncharacterized protein LOC107466718 [...   360   e-120
XP_004142455.1 PREDICTED: OTU domain-containing protein At3g5781...   327   e-107
XP_016900257.1 PREDICTED: OTU domain-containing protein At3g5781...   325   e-106
XP_019459096.1 PREDICTED: uncharacterized protein LOC109359045 [...   323   e-105
XP_010032108.1 PREDICTED: OTU domain-containing protein At3g5781...   316   e-103
OMO50984.1 Ovarian tumor, otubain [Corchorus olitorius]               315   e-102
OMO98833.1 Ovarian tumor, otubain [Corchorus capsularis]              314   e-102
XP_018845374.1 PREDICTED: uncharacterized protein LOC109009371 [...   313   e-101
KHN37847.1 OTU domain-containing protein [Glycine soja]               310   e-101
GAU40884.1 hypothetical protein TSUD_40590 [Trifolium subterraneum]   310   e-101
EOY19029.1 Cysteine proteinases superfamily protein isoform 1 [T...   310   e-100

>XP_004496177.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cicer
           arietinum]
          Length = 313

 Score =  408 bits (1048), Expect = e-139
 Identities = 214/316 (67%), Positives = 233/316 (73%), Gaps = 7/316 (2%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFSFLHASS---AARLAHGT-AYSSASPRFSRPGHDGARRQHSSSCE 235
           MLGVLCATR +PWIFSFLH+S+   AARLAH T A SS S RF       ARR HSS+CE
Sbjct: 1   MLGVLCATRSRPWIFSFLHSSASHHAARLAHCTVACSSLSTRFDATF--AARRHHSSACE 58

Query: 236 LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 415
           L+        IWHAI PCGGDGFRRGVV V HDH+LKGEGSWNVAWDARPARWLH  DSA
Sbjct: 59  LQ-LGGGAASIWHAIRPCGGDGFRRGVVTVQHDHDLKGEGSWNVAWDARPARWLHRSDSA 117

Query: 416 WLLFGVCDCLXXXXXXXXXXXXXXXXXXXS---SEGREVKVAECDSKEQDDEVSSDYRVT 586
           WLLFGVC CL                   +   SEGRE+K AE D KE++DE+S+DYRVT
Sbjct: 118 WLLFGVCACLAPPVIADVDLEAPPTPAINTDENSEGREMKYAEGD-KERNDELSADYRVT 176

Query: 587 GVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGD 766
           GVLADGRCLFRA+AHGACL NGE APNENRQ ELAD                  WF+EGD
Sbjct: 177 GVLADGRCLFRAIAHGACLNNGEEAPNENRQRELADELRARVAEELLKRRKETEWFIEGD 236

Query: 767 FDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEIS 946
           FDAYV RI+Q + WGGEPELLMASHVLKTPI VFMRD SSIDLVNIAKYGEEY ND+EIS
Sbjct: 237 FDAYVNRIRQTYVWGGEPELLMASHVLKTPIYVFMRDASSIDLVNIAKYGEEYMNDKEIS 296

Query: 947 INVLFHRYGHYDILET 994
           INVLFHR+GHY+ILET
Sbjct: 297 INVLFHRHGHYEILET 312


>BAE71258.1 hypothetical protein [Trifolium pratense]
          Length = 326

 Score =  395 bits (1014), Expect = e-133
 Identities = 212/330 (64%), Positives = 228/330 (69%), Gaps = 13/330 (3%)
 Frame = +2

Query: 68   MLGVLCATRPKPWIFSFLHASSA-----ARLAHGTAYSSASPRFSRPGHDGARRQHSSSC 232
            MLGVLCATR +PWIFSFLH SS+     ARLAH T  SS+S     P    ARR HSS C
Sbjct: 1    MLGVLCATRSRPWIFSFLHHSSSHHHHTARLAHITVASSSS---LSPTFFSARRNHSSQC 57

Query: 233  ELR-GXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPD 409
            +L+         IWHAIMPCGGDGF+RG   VHHDHELKGEGSWNVAWDARPARWLH  D
Sbjct: 58   KLQISAGGGAASIWHAIMPCGGDGFQRGAFMVHHDHELKGEGSWNVAWDARPARWLHRSD 117

Query: 410  SAWLLFGVCDCL-------XXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVS 568
            SAWLLFGV   L                           SEG E+K AE D  + +DE+S
Sbjct: 118  SAWLLFGVRAWLAPPPVIVDVDPEVPLPTSVISPDEISRSEGLEIKDAESD--KPNDELS 175

Query: 569  SDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXX 748
            SDYRVTGVLADGRCLFRALAHGACLKNGE APNENRQ ELAD                  
Sbjct: 176  SDYRVTGVLADGRCLFRALAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETE 235

Query: 749  WFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYR 928
            WF+EGDFD YV RIQQ F WGGEPELLMASHVLKTPI VFMRD +SIDLVNIAKYGEEY 
Sbjct: 236  WFIEGDFDTYVTRIQQSFVWGGEPELLMASHVLKTPIFVFMRDPNSIDLVNIAKYGEEYM 295

Query: 929  NDEEISINVLFHRYGHYDILETS*PKLPKK 1018
            NDE ISINVLFHR+GHY++LET  PKL +K
Sbjct: 296  NDEGISINVLFHRHGHYELLETLCPKLSQK 325


>XP_013469378.1 OTU-like cysteine protease [Medicago truncatula] KEH43416.1
           OTU-like cysteine protease [Medicago truncatula]
          Length = 305

 Score =  380 bits (976), Expect = e-128
 Identities = 197/315 (62%), Positives = 215/315 (68%), Gaps = 6/315 (1%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELR-- 241
           MLGVLCATR +PWIFS  H   A RL+H T      P         ARR HS++C     
Sbjct: 1   MLGVLCATRSRPWIFSSHHHHHAFRLSHATVAPLTFP---------ARRHHSTACNNLQI 51

Query: 242 GXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWL 421
                   IWHAI PCGGDGFR G V +HHDHELKGEGSWNVAWDARPARWLH  DSAWL
Sbjct: 52  STGGGAASIWHAITPCGGDGFRTGGVMLHHDHELKGEGSWNVAWDARPARWLHRSDSAWL 111

Query: 422 LFGVCDCLXXXXXXXXXXXXXXXXXXX----SSEGREVKVAECDSKEQDDEVSSDYRVTG 589
           LFGVC CL                       SSEGRE+K    D  E+DDE+++DYRVTG
Sbjct: 112 LFGVCACLAPPVVLDVDPEAAAPTPAVFPNESSEGREMKDELSD--ERDDELNADYRVTG 169

Query: 590 VLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDF 769
           VLADGRCLFRA+AHGACLKNGE APNE+RQ ELAD                  WF+EGDF
Sbjct: 170 VLADGRCLFRAIAHGACLKNGEEAPNESRQRELADELRVKVAEELLNRRKETEWFIEGDF 229

Query: 770 DAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISI 949
           D YV RIQQ + WGGEPELLMASHVLKTPI VFMRD SS+DLVNIAKYGEEY NDEEISI
Sbjct: 230 DTYVTRIQQTYVWGGEPELLMASHVLKTPIYVFMRDASSMDLVNIAKYGEEYMNDEEISI 289

Query: 950 NVLFHRYGHYDILET 994
           NVLFHR+GHY++LET
Sbjct: 290 NVLFHRHGHYELLET 304


>XP_003536306.1 PREDICTED: uncharacterized protein LOC100793001 [Glycine max]
            KRH34730.1 hypothetical protein GLYMA_10G202000 [Glycine
            max]
          Length = 296

 Score =  374 bits (959), Expect = e-125
 Identities = 193/313 (61%), Positives = 216/313 (69%), Gaps = 1/313 (0%)
 Frame = +2

Query: 68   MLGVLCATRPKPWIFSFLHA-SSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRG 244
            MLGVLCATRPKPW+ S +H  +S  RL H     SASP          RR+HS++C+L  
Sbjct: 1    MLGVLCATRPKPWLLSLVHVHASLPRLPHSPLSPSASPP--------PRRRHSTACKLFL 52

Query: 245  XXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLL 424
                   IWHAIMP G DG RRGVVAVH   +LKGEGSWNVAWDARPARWLH PDSAWLL
Sbjct: 53   SGGAAASIWHAIMPRGDDGLRRGVVAVH---DLKGEGSWNVAWDARPARWLHRPDSAWLL 109

Query: 425  FGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADG 604
            FGVC CL                    S G        D + ++DEVS+DYRVTGV ADG
Sbjct: 110  FGVCACLAPPPGCVDADTNSAGIAVDESCGL------LDKEREEDEVSADYRVTGVPADG 163

Query: 605  RCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVK 784
            RCLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD Y++
Sbjct: 164  RCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQ 223

Query: 785  RIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLFH 964
            RIQQP+ WGGEPELLMASHVLKTPISVFMRDT S++LVNIAKYGEEYRND++ISINVLFH
Sbjct: 224  RIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFH 283

Query: 965  RYGHYDILETS*P 1003
             YGHYDILET  P
Sbjct: 284  GYGHYDILETLRP 296


>XP_017413456.1 PREDICTED: uncharacterized protein LOC108324995 [Vigna angularis]
           KOM35649.1 hypothetical protein LR48_Vigan02g179900
           [Vigna angularis] BAT94560.1 hypothetical protein
           VIGAN_08117300 [Vigna angularis var. angularis]
          Length = 290

 Score =  366 bits (940), Expect = e-123
 Identities = 194/310 (62%), Positives = 211/310 (68%), Gaps = 1/310 (0%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 247
           MLGVLCATRPKPW+FS +HAS   RL H +    ASP          RR HSS+C+L G 
Sbjct: 1   MLGVLCATRPKPWLFSLVHASPP-RLPHASVSLLASP---------PRRHHSSACKLFGS 50

Query: 248 XXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLF 427
                 IWHAIMP  GDGFRRGVVAVH   +LKGEGSWNVAWD RPARWLH  DSAWLLF
Sbjct: 51  AGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRSDSAWLLF 107

Query: 428 GVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGR 607
           GVC CL                   S    +        KE   +VS+DYRVTGV ADGR
Sbjct: 108 GVCACLAPPGCVDAVTDSDAVAADESCGVLD--------KELKVDVSADYRVTGVPADGR 159

Query: 608 CLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKR 787
           CLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD YVKR
Sbjct: 160 CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKR 219

Query: 788 IQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND-EEISINVLFH 964
           IQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+YRND EE SINVLFH
Sbjct: 220 IQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYRNDKEENSINVLFH 279

Query: 965 RYGHYDILET 994
            YGHYDILE+
Sbjct: 280 GYGHYDILES 289


>XP_003556279.1 PREDICTED: OTU domain-containing protein At3g57810-like [Glycine
           max] KHN00921.1 OTU domain-containing protein [Glycine
           soja] KRG92054.1 hypothetical protein GLYMA_20G188400
           [Glycine max]
          Length = 294

 Score =  365 bits (936), Expect = e-122
 Identities = 192/311 (61%), Positives = 213/311 (68%), Gaps = 2/311 (0%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 247
           MLGVLCATR KPW+FS +HAS   RL+H     SASP          RR+HS++C+L   
Sbjct: 1   MLGVLCATRSKPWLFSLVHAS-LPRLSHAPLSPSASPP--------PRRRHSTACKLFLS 51

Query: 248 XXXXXXIWHAIMPC--GGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWL 421
                 IWHAIMP     DGFRRGVVA H   ++KGEGSWNVAWDARPARWLH PDSAWL
Sbjct: 52  AGGAASIWHAIMPRVNDDDGFRRGVVAFH---DMKGEGSWNVAWDARPARWLHRPDSAWL 108

Query: 422 LFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLAD 601
           LFGVC CL                    S          D + ++ EVS+DYRVTGV AD
Sbjct: 109 LFGVCACLAPPSSCVDADTNTDAIAVDES------CRLLDKEREEYEVSADYRVTGVPAD 162

Query: 602 GRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYV 781
           GRCLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD YV
Sbjct: 163 GRCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELMKRREETEWFIEGDFDTYV 222

Query: 782 KRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLF 961
           +RIQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGEEYRND+EISINVLF
Sbjct: 223 QRIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEEYRNDKEISINVLF 282

Query: 962 HRYGHYDILET 994
           H YGHYDILET
Sbjct: 283 HGYGHYDILET 293


>XP_014512510.1 PREDICTED: uncharacterized protein LOC106771118 [Vigna radiata var.
           radiata]
          Length = 290

 Score =  364 bits (934), Expect = e-122
 Identities = 193/310 (62%), Positives = 210/310 (67%), Gaps = 1/310 (0%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 247
           MLGVLCATRPKPW+FS +HAS   RL H +    ASP          RR HSS+C+L G 
Sbjct: 1   MLGVLCATRPKPWLFSLVHASPP-RLPHASVSLLASP---------PRRHHSSACKLFGS 50

Query: 248 XXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLF 427
                 IWHAIMP  GDGFRRGVVAVH   +LKGEGSWNVAWD RPARWLH  DSAWLLF
Sbjct: 51  AGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRSDSAWLLF 107

Query: 428 GVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGR 607
           GVC CL                   S    +        KE   +VS+DYRVTGV ADGR
Sbjct: 108 GVCACLAPPGCVDAVTDSDAVAADESCGVLD--------KELKVDVSADYRVTGVPADGR 159

Query: 608 CLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKR 787
           CLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD YVKR
Sbjct: 160 CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKR 219

Query: 788 IQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND-EEISINVLFH 964
           IQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+Y ND EE SINVLFH
Sbjct: 220 IQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYMNDKEENSINVLFH 279

Query: 965 RYGHYDILET 994
            YGHYDILE+
Sbjct: 280 GYGHYDILES 289


>XP_007143828.1 hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris]
           ESW15822.1 hypothetical protein PHAVU_007G105100g
           [Phaseolus vulgaris]
          Length = 305

 Score =  362 bits (928), Expect = e-121
 Identities = 197/322 (61%), Positives = 214/322 (66%), Gaps = 1/322 (0%)
 Frame = +2

Query: 32  NPAHDSHLCTSSMLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGAR 211
           NPAHDS   +S MLGVLCATRP+PW+FS +HAS   RL H +   SASP          R
Sbjct: 7   NPAHDSF--SSPMLGVLCATRPRPWLFSHVHAS-LPRLVHASVSLSASP---------PR 54

Query: 212 RQHSSSCELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPAR 391
           R HSS+C++ G       IWHAIMP  GD FRRGVV VH   +LKGEGSWNVAWD RPAR
Sbjct: 55  RHHSSACKIFGSAGGAASIWHAIMPRSGDRFRRGVVPVH---DLKGEGSWNVAWDTRPAR 111

Query: 392 WLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS 571
           WLH PDSAWLLFGVC CL                   S    +V+ A  D         +
Sbjct: 112 WLHRPDSAWLLFGVCACLAPPGCVDVVTDFEAVAVDESCGVLKVE-ASAD--------YA 162

Query: 572 DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 751
           DYRVTGV ADGRCLFRA+AHG CL+NGE AP+EN Q ELAD                  W
Sbjct: 163 DYRVTGVPADGRCLFRAIAHGDCLRNGEKAPDENCQRELADELRAKVVDELLKRREETEW 222

Query: 752 FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 931
           F+EGDFD YVKRIQQPF WGGEPELLMASHVLKTPISVFMR T S+ LVNIAKYGEEYRN
Sbjct: 223 FIEGDFDTYVKRIQQPFVWGGEPELLMASHVLKTPISVFMRATGSVGLVNIAKYGEEYRN 282

Query: 932 D-EEISINVLFHRYGHYDILET 994
           D EE SINVLFH YGHYDILET
Sbjct: 283 DKEENSINVLFHGYGHYDILET 304


>XP_016177333.1 PREDICTED: uncharacterized protein LOC107619558 [Arachis ipaensis]
          Length = 327

 Score =  362 bits (929), Expect = e-120
 Identities = 201/328 (61%), Positives = 217/328 (66%), Gaps = 21/328 (6%)
 Frame = +2

Query: 74  GVLCATRPKPWIFS--FLHAS---SAARLAHGTAYSSASPRFSRP-GHDGARRQHSSSCE 235
           GVLCATRPKPWI S   LHAS   S+ARL H      A P F +      ARR HSS+C 
Sbjct: 4   GVLCATRPKPWILSAAILHASLHHSSARLLH------APPLFPQLLRRTDARRHHSSACN 57

Query: 236 LRGXXXXXXX--IWHAIMPCGGDGF------RRGVVAVHH-DHELKGEGSWNVAWDARPA 388
             G         IWHAIMPCGG          RGVVAVHH DHELKGEGSWNVAWDARPA
Sbjct: 58  HGGDFGGGGAASIWHAIMPCGGGAGSGKKLRHRGVVAVHHHDHELKGEGSWNVAWDARPA 117

Query: 389 RWLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXX------SSEGREVKVAECDSKE 550
           RWLH PDSAWLLFGVC CL                         + EG+ VKV       
Sbjct: 118 RWLHRPDSAWLLFGVCACLAPPVSSVTDLEATPPATATVVNRDINPEGQGVKV------- 170

Query: 551 QDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXX 730
             D +SSDYRVTGVLADGRCLFRA+AHGACL+NGEAAP+E RQ ELAD            
Sbjct: 171 --DGLSSDYRVTGVLADGRCLFRAIAHGACLRNGEAAPDERRQRELADELRAQVVEELMK 228

Query: 731 XXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 910
                 WF+EGDFD YVKRIQQP+ WGGEPELLMASHVLKTPISVFMRDTSS+ LVNIAK
Sbjct: 229 RREETEWFIEGDFDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSLSLVNIAK 288

Query: 911 YGEEYRNDEEISINVLFHRYGHYDILET 994
           YGEEYRN++++ INVLFH YGHYDILET
Sbjct: 289 YGEEYRNEKDVCINVLFHGYGHYDILET 316


>XP_015941210.1 PREDICTED: uncharacterized protein LOC107466718 [Arachis
           duranensis]
          Length = 327

 Score =  360 bits (925), Expect = e-120
 Identities = 200/328 (60%), Positives = 217/328 (66%), Gaps = 21/328 (6%)
 Frame = +2

Query: 74  GVLCATRPKPWIFS--FLHAS---SAARLAHGTAYSSASPRFSRP-GHDGARRQHSSSCE 235
           GVLCATRPKPWI S   LHAS   S+ARL H      A P F +       RR HSS+C 
Sbjct: 4   GVLCATRPKPWILSAAILHASLHHSSARLLH------APPLFPQLLRRTDTRRHHSSACN 57

Query: 236 LRGXXXXXXX--IWHAIMPCGGDGF------RRGVVAVHH-DHELKGEGSWNVAWDARPA 388
             G         IWHAIMPCGG          RGVVAVHH DHELKGEGSWNVAWDARPA
Sbjct: 58  HGGDFGGGGAASIWHAIMPCGGGAGSGKKLRHRGVVAVHHHDHELKGEGSWNVAWDARPA 117

Query: 389 RWLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXX------SSEGREVKVAECDSKE 550
           RWLH PDSAWLLFGVC CL                         ++EG+ VKV       
Sbjct: 118 RWLHRPDSAWLLFGVCACLAPPVSSVADLEATPPATATVVNRDMNTEGQGVKV------- 170

Query: 551 QDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXX 730
             D +SSDYRVTGVLADGRCLFRA+AHGACL+NGEAAP+E RQ ELAD            
Sbjct: 171 --DGLSSDYRVTGVLADGRCLFRAIAHGACLRNGEAAPDERRQRELADELRAQVVEELMK 228

Query: 731 XXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 910
                 WF+EGDFD YVKRIQQP+ WGGEPELLMASHVLKTPISVFMRDTSS+ LVNIAK
Sbjct: 229 RREETEWFIEGDFDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSLSLVNIAK 288

Query: 911 YGEEYRNDEEISINVLFHRYGHYDILET 994
           YGEEYRN++++ INVLFH YGHYDILET
Sbjct: 289 YGEEYRNEKDMCINVLFHGYGHYDILET 316


>XP_004142455.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
            sativus] KGN52210.1 hypothetical protein Csa_5G615810
            [Cucumis sativus]
          Length = 313

 Score =  327 bits (837), Expect = e-107
 Identities = 178/321 (55%), Positives = 204/321 (63%), Gaps = 4/321 (1%)
 Frame = +2

Query: 68   MLGVLCATRPKPWIF----SFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCE 235
            MLGVLCA RPKPWI     +F+H S+     H   + S     S    D  +R HSS+C+
Sbjct: 1    MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH---HQSRLLVQSPIQFDRRQRHHSSACK 56

Query: 236  LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 415
            L G       IWHAIMP G            H HE KGEGSWNVAWDARPARWLH PDSA
Sbjct: 57   LAGGGAAS--IWHAIMPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSA 114

Query: 416  WLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVL 595
            WLLFGVC C+                     + +EV  +      Q+DE S+DYRVTGVL
Sbjct: 115  WLLFGVCACIAPLDWVDASHEAVSL-----DQKKEVCESSGPEFNQNDESSADYRVTGVL 169

Query: 596  ADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDA 775
            ADGRCLFRA+AHGACL++GE AP+++RQ ELAD                  W++EGDFDA
Sbjct: 170  ADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDA 229

Query: 776  YVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINV 955
            YVKRIQQPF WGGEPELLMASHVLKTPISVFMR+ SS  L+NIAKYG+EY+  EE  INV
Sbjct: 230  YVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQKGEESPINV 289

Query: 956  LFHRYGHYDILETS*PKLPKK 1018
            LFH YGHYDILETS  K+  K
Sbjct: 290  LFHGYGHYDILETSSDKVSLK 310


>XP_016900257.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
            melo]
          Length = 313

 Score =  325 bits (832), Expect = e-106
 Identities = 177/321 (55%), Positives = 204/321 (63%), Gaps = 4/321 (1%)
 Frame = +2

Query: 68   MLGVLCATRPKPWIF----SFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCE 235
            MLGVLCA RPKPWI     +F+H S+     H   + S     S    D  +R HSS+C+
Sbjct: 1    MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH---HQSRLLVQSPIQFDRRQRHHSSACK 56

Query: 236  LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 415
            L G       IWHAI+P G            H HE KGEGSWNVAWDARPARWLH PDSA
Sbjct: 57   LAGGGAAS--IWHAILPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSA 114

Query: 416  WLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVL 595
            WLLFGVC C+                     + +EV  +      Q+DE S+DYRVTGVL
Sbjct: 115  WLLFGVCACIAPLDWVDASHEAVSL-----DQKKEVCESSGPEFNQNDESSADYRVTGVL 169

Query: 596  ADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDA 775
            ADGRCLFRA+AHGACL++GE AP+++RQ ELAD                  W++EGDFDA
Sbjct: 170  ADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDA 229

Query: 776  YVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINV 955
            YVKRIQQPF WGGEPELLMASHVLKTPISVFMR+ SS  L+NIAKYG+EY+  EE  INV
Sbjct: 230  YVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQMGEESPINV 289

Query: 956  LFHRYGHYDILETS*PKLPKK 1018
            LFH YGHYDILETS  K+  K
Sbjct: 290  LFHGYGHYDILETSSDKVSLK 310


>XP_019459096.1 PREDICTED: uncharacterized protein LOC109359045 [Lupinus
           angustifolius] OIW01500.1 hypothetical protein
           TanjilG_19426 [Lupinus angustifolius]
          Length = 319

 Score =  323 bits (828), Expect = e-105
 Identities = 179/318 (56%), Positives = 202/318 (63%), Gaps = 8/318 (2%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFSFLHASSAARLAHGTA--YSSASPRFSRPGHDGARRQHSSSCELR 241
           ML  LC TRPKP   S     +A+   H +A   + +S  F  PG DG RR HSS+C + 
Sbjct: 1   MLAALC-TRPKPSFLSSFFFQTASLHNHNSARFINGSSLHFYCPGGDGRRRHHSSACTIG 59

Query: 242 GXXXXXXXIWHAIMPCGGDGF----RRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPD 409
           G       IWH ++P           R   A+ H HEL+GEGSWN AWDARP+RWLH PD
Sbjct: 60  GSCGGAASIWHVVLPERAGASICCDLRWRSALPH-HELRGEGSWNAAWDARPSRWLHRPD 118

Query: 410 SAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGR-EVKVAECDSKEQDDEVSSDYRVT 586
           SAWLLFGVC CL                   S  G  ++K   CD   + +EVSS YR+T
Sbjct: 119 SAWLLFGVCACLAPPLLLADVNTEVPSAEHDSDGGGGDLKGPGCD---EQNEVSSAYRIT 175

Query: 587 GVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGD 766
           GVLADGRCLFRA+AHGACL NGE AP+ENRQ ELAD                  WF+EGD
Sbjct: 176 GVLADGRCLFRAIAHGACLMNGEEAPDENRQRELADELRAQVVEELMKRREETEWFIEGD 235

Query: 767 FDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEY-RNDEEI 943
           FDAYV RIQQPF WGGEPELLMASHVLKTPISVFMRD SS DLVNIAKYGEEY   ++EI
Sbjct: 236 FDAYVTRIQQPFVWGGEPELLMASHVLKTPISVFMRDRSSGDLVNIAKYGEEYITKEKEI 295

Query: 944 SINVLFHRYGHYDILETS 997
           +INVLFH YGHYDILE S
Sbjct: 296 AINVLFHGYGHYDILEIS 313


>XP_010032108.1 PREDICTED: OTU domain-containing protein At3g57810 [Eucalyptus
           grandis] KCW51502.1 hypothetical protein EUGRSUZ_J01018
           [Eucalyptus grandis]
          Length = 314

 Score =  316 bits (809), Expect = e-103
 Identities = 178/320 (55%), Positives = 200/320 (62%), Gaps = 11/320 (3%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFS--FLHASSAARLAHGTAYSSASPRFSRPGHDG---ARRQHSSSC 232
           MLGVLCA RPKPWI +  F HAS+A         S+A+ R            RR HSSSC
Sbjct: 1   MLGVLCA-RPKPWILASCFSHASAAHHCGRLAWVSAAAARLQLAADSPDRWRRRHHSSSC 59

Query: 233 ELRGXXXXXXX-----IWHAIMPCG-GDGFRRGVVAVHHDHELKGEGSWNVAWDARPARW 394
            L G            IWHAI+P G GD  RR  +        +GEGSWNVAWDARPARW
Sbjct: 60  RLGGASSCAHPCGVASIWHAILPSGEGDPPRR--MDQPRRPVFRGEGSWNVAWDARPARW 117

Query: 395 LHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSD 574
           LH PDSAWLLFGVC CL                        E +V + DS ++    S D
Sbjct: 118 LHRPDSAWLLFGVCACLAPVDAAEPSREEVVP---------EARVEDRDSLDEAKRSSPD 168

Query: 575 YRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWF 754
           YRVTGVLADGRCLFRA+AH ACL+ GEAAP++NRQ ELAD                  W 
Sbjct: 169 YRVTGVLADGRCLFRAIAHCACLRKGEAAPDDNRQRELADELRAQVVAELLKRREETEWA 228

Query: 755 LEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND 934
           +EGDFDAY++RIQQP+ WGGEPELLMASHVLKTPISVFM D SS +LVN+AKYGEEYR D
Sbjct: 229 IEGDFDAYIERIQQPYVWGGEPELLMASHVLKTPISVFMVDRSSGNLVNVAKYGEEYRKD 288

Query: 935 EEISINVLFHRYGHYDILET 994
           EEI INVLFH YGHYDILE+
Sbjct: 289 EEIPINVLFHGYGHYDILES 308


>OMO50984.1 Ovarian tumor, otubain [Corchorus olitorius]
          Length = 327

 Score =  315 bits (806), Expect = e-102
 Identities = 171/321 (53%), Positives = 197/321 (61%), Gaps = 12/321 (3%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFSFL----HASSAARLAHGTAYSSASPRFSRPGHDGAR-RQHSSSC 232
           MLGVLCA  PKPWI + L    H   +A   H +      P F+    D  R R HS++C
Sbjct: 1   MLGVLCARPPKPWILNSLSLVAHGGGSAAHHHDSRLLHW-PHFAHISADNRRCRHHSTAC 59

Query: 233 ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 412
            L G       IWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH PDS
Sbjct: 60  RLGGSDGGAASIWHAILPCGGSGRGRKREEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119

Query: 413 AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS------- 571
           AWLLFGVC CL                     EG E+      S ++   +SS       
Sbjct: 120 AWLLFGVCACLAPMIEFVDVNPETDDKI----EGAELISINGLSADEKSSISSSPVAAPD 175

Query: 572 DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 751
           +Y+VTGVLADGRCLFRA+AHGACL++GE AP+E RQ ELAD                  W
Sbjct: 176 NYKVTGVLADGRCLFRAIAHGACLRSGEEAPDETRQRELADELRAQVVNELLKRREETEW 235

Query: 752 FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 931
           F+EGDFDAYVK IQQP+ WGGEPELLMASHVLKTPISV+M   SS +L+ IA YGEEY+ 
Sbjct: 236 FIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTPISVYMIHRSSRNLIKIADYGEEYQK 295

Query: 932 DEEISINVLFHRYGHYDILET 994
           D+E  INVLFH YGHYDILE+
Sbjct: 296 DKETPINVLFHGYGHYDILES 316


>OMO98833.1 Ovarian tumor, otubain [Corchorus capsularis]
          Length = 327

 Score =  314 bits (804), Expect = e-102
 Identities = 171/321 (53%), Positives = 197/321 (61%), Gaps = 12/321 (3%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFSFL----HASSAARLAHGTAYSSASPRFSRPGHDGAR-RQHSSSC 232
           MLGVLCA  PKPWI + L    H   +A   H +      P F+    D  R R HS++C
Sbjct: 1   MLGVLCARPPKPWILNSLSLVAHGGGSAAHHHDSRLLHW-PHFADLSADNRRCRHHSTAC 59

Query: 233 ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 412
            L G       IWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH PDS
Sbjct: 60  RLGGSDGGAASIWHAILPCGGSGRGRKREEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119

Query: 413 AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS------- 571
           AWLLFGVC CL                     EG E+      S ++   +SS       
Sbjct: 120 AWLLFGVCACLAPMIEFVDVNPETDDKI----EGTELISINGLSADEKSSISSSPVAAPD 175

Query: 572 DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 751
           +Y+VTGVLADGRCLFRA+AHGACL++GE AP+E RQ ELAD                  W
Sbjct: 176 NYKVTGVLADGRCLFRAIAHGACLRSGEEAPDETRQRELADELRAQVVNELLKRREETEW 235

Query: 752 FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 931
           F+EGDFDAYVK IQQP+ WGGEPELLMASHVLKTPISV+M   SS +L+ IA YGEEY+ 
Sbjct: 236 FIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTPISVYMIHRSSRNLIKIADYGEEYQK 295

Query: 932 DEEISINVLFHRYGHYDILET 994
           D+E  INVLFH YGHYDILE+
Sbjct: 296 DKETPINVLFHGYGHYDILES 316


>XP_018845374.1 PREDICTED: uncharacterized protein LOC109009371 [Juglans regia]
          Length = 328

 Score =  313 bits (803), Expect = e-101
 Identities = 181/342 (52%), Positives = 207/342 (60%), Gaps = 33/342 (9%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIF-----SFLHASSAARLAHGTAYSSASPRFSRPGHDG---ARRQHS 223
           MLGVLCA RPKPWI      SF+H S+A     G   S        PG +G    RR HS
Sbjct: 1   MLGVLCA-RPKPWILTSLSSSFVHGSAAHHHITGLRQS--------PGFNGDLKPRRHHS 51

Query: 224 SSCELRGXXXXXXX-IWHAIMPCGGDGFRRGVVAVHHD---HELKGEGSWNVAWDARPAR 391
           S+C + G        IWHAIMPCG  G    ++   +     E +GEGSWNVAWDARPAR
Sbjct: 52  SACRIDGSFGGGAASIWHAIMPCGAAGHPSDLLLRRNAMLRRERRGEGSWNVAWDARPAR 111

Query: 392 WLHSPD-SAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKE----QD 556
           WLH PD SAWLLFGVC CL                        E K+  CDS +    ++
Sbjct: 112 WLHRPDYSAWLLFGVCACLAPLDFAFDDSPEAIVV--------EAKIEACDSIDSNANKN 163

Query: 557 DEV----------------SSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGEL 688
           DE+                S+DYRVTGVLADGRCLFRALAHGAC ++GE AP+ENRQ EL
Sbjct: 164 DEIDGFDAIYSNTSKPKEGSADYRVTGVLADGRCLFRALAHGACSRSGEEAPDENRQREL 223

Query: 689 ADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVF 868
           AD                  WF+EGDFDAYV+RIQQPF WGGEPELLMASHVLKTPISVF
Sbjct: 224 ADELRAQVVDELLKRRKETEWFIEGDFDAYVERIQQPFVWGGEPELLMASHVLKTPISVF 283

Query: 869 MRDTSSIDLVNIAKYGEEYRNDEEISINVLFHRYGHYDILET 994
           M++ SS  LVNIAKYGEEYR +E+  INVLFH YGHYD+LE+
Sbjct: 284 MKNRSSGRLVNIAKYGEEYRKEEDSPINVLFHGYGHYDLLES 325


>KHN37847.1 OTU domain-containing protein [Glycine soja]
          Length = 234

 Score =  310 bits (793), Expect = e-101
 Identities = 154/236 (65%), Positives = 171/236 (72%)
 Frame = +2

Query: 296  DGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLFGVCDCLXXXXXXXXXX 475
            DGFRRGVVA H   ++KGEGSWNVAWDARPARWLH PDSAWLLFGVC CL          
Sbjct: 8    DGFRRGVVAFH---DMKGEGSWNVAWDARPARWLHRPDSAWLLFGVCACLAPPSSCVDAD 64

Query: 476  XXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGE 655
                      S          D + ++DEVS+DYRVTGV ADGRCLFRA+AHGACL+NGE
Sbjct: 65   TNTDAIAVDES------CRLLDKEREEDEVSADYRVTGVPADGRCLFRAIAHGACLRNGE 118

Query: 656  AAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMA 835
             AP+ENRQ ELAD                  WF+EGDFD Y++RIQQP+ WGGEPELLMA
Sbjct: 119  KAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQRIQQPYVWGGEPELLMA 178

Query: 836  SHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLFHRYGHYDILETS*P 1003
            SHVLKTPISVFMRDT S++LVNIAKYGEEYRND++ISINVLFH YGHYDILET  P
Sbjct: 179  SHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFHGYGHYDILETLRP 234


>GAU40884.1 hypothetical protein TSUD_40590 [Trifolium subterraneum]
          Length = 266

 Score =  310 bits (794), Expect = e-101
 Identities = 165/272 (60%), Positives = 180/272 (66%), Gaps = 11/272 (4%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFSFLHASS----AARLAHGT-AYSSASPRFSRPGHDGARRQHSSSC 232
           MLGVLCATR +PWIFSFLH+SS     ARLAH T + SS  P FS      ARR HSS C
Sbjct: 1   MLGVLCATRSRPWIFSFLHSSSHHNHTARLAHATVSASSLCPTFS------ARRNHSSQC 54

Query: 233 ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 412
           +L+        IWHAIMPCGGDG ++G   VHHDHELKGEGSWNVAWDARPARWLH  DS
Sbjct: 55  KLQISTGGAASIWHAIMPCGGDGLQQGGFMVHHDHELKGEGSWNVAWDARPARWLHRSDS 114

Query: 413 AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEG---REVK-VAECDSKEQDDEVSS--D 574
           AWLLFGVC CL                     E    RE+K + + +S +  DE+SS  D
Sbjct: 115 AWLLFGVCACLAPPVDVEAEVPPLTTSVISPDENYKRREIKDIKDAESDKPSDELSSEAD 174

Query: 575 YRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWF 754
           YRVTGVLADGRCLFRA+AHGACLKNGE APNENRQ ELAD                  WF
Sbjct: 175 YRVTGVLADGRCLFRAIAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETEWF 234

Query: 755 LEGDFDAYVKRIQQPFAWGGEPELLMASHVLK 850
           +EGDFD YV RIQQ F WGGEPELLMASHVLK
Sbjct: 235 IEGDFDTYVTRIQQTFVWGGEPELLMASHVLK 266


>EOY19029.1 Cysteine proteinases superfamily protein isoform 1 [Theobroma
           cacao]
          Length = 327

 Score =  310 bits (794), Expect = e-100
 Identities = 171/325 (52%), Positives = 197/325 (60%), Gaps = 16/325 (4%)
 Frame = +2

Query: 68  MLGVLCATRPKPWIFSFLHASSAARLAHG--TAYSSASPRFSRPGH-------DGARRQH 220
           MLGVLCA  PKPWI +     S + +AHG   A+   S     P H       D   R H
Sbjct: 1   MLGVLCARPPKPWILN-----SLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHH 55

Query: 221 SSSCELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLH 400
           S++C L G       IWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH
Sbjct: 56  STACRLGGSDGGAASIWHAILPCGGGGGGRRRGEVWKNVERKGEGSWNVAWDARPARWLH 115

Query: 401 SPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS--- 571
            PDSAWLLFGVC CL                     EG E+ +    S ++    SS   
Sbjct: 116 RPDSAWLLFGVCACLAPMIEFVDVNPDADDKI----EGAELNLVSRLSADEKSSSSSSSV 171

Query: 572 ----DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXX 739
               + +VTGVLADGRCLFRA+AHGACL++GE AP+EN Q ELAD               
Sbjct: 172 AAADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVVNELLKRRE 231

Query: 740 XXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGE 919
              WF+EGDFDAYVK IQQP+ WGGEPE+LMASHVLKTPISV+M   SS +L  IAKYGE
Sbjct: 232 ETEWFIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNLTKIAKYGE 291

Query: 920 EYRNDEEISINVLFHRYGHYDILET 994
           EY+ D+E  INVLFH YGHYDILE+
Sbjct: 292 EYQKDKENPINVLFHGYGHYDILES 316

