BLASTX nr result

ID: Glycyrrhiza29_contig00020455 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza29_contig00020455
         (1515 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_004496177.1 PREDICTED: OTU domain-containing protein At3g5781...   408   e-137
BAE71258.1 hypothetical protein [Trifolium pratense]                  395   e-132
XP_013469378.1 OTU-like cysteine protease [Medicago truncatula] ...   380   e-126
XP_003536306.1 PREDICTED: uncharacterized protein LOC100793001 [...   374   e-124
XP_017413456.1 PREDICTED: uncharacterized protein LOC108324995 [...   366   e-121
XP_003556279.1 PREDICTED: OTU domain-containing protein At3g5781...   365   e-121
XP_014512510.1 PREDICTED: uncharacterized protein LOC106771118 [...   364   e-120
XP_007143828.1 hypothetical protein PHAVU_007G105100g [Phaseolus...   362   e-119
XP_016177333.1 PREDICTED: uncharacterized protein LOC107619558 [...   362   e-119
XP_015941210.1 PREDICTED: uncharacterized protein LOC107466718 [...   360   e-118
XP_004142455.1 PREDICTED: OTU domain-containing protein At3g5781...   327   e-105
XP_016900257.1 PREDICTED: OTU domain-containing protein At3g5781...   325   e-105
XP_019459096.1 PREDICTED: uncharacterized protein LOC109359045 [...   323   e-104
XP_010032108.1 PREDICTED: OTU domain-containing protein At3g5781...   316   e-101
OMO50984.1 Ovarian tumor, otubain [Corchorus olitorius]               315   e-101
OMO98833.1 Ovarian tumor, otubain [Corchorus capsularis]              314   e-100
XP_018845374.1 PREDICTED: uncharacterized protein LOC109009371 [...   313   e-100
KHN37847.1 OTU domain-containing protein [Glycine soja]               310   e-100
GAU40884.1 hypothetical protein TSUD_40590 [Trifolium subterraneum]   310   e-100
EOY19029.1 Cysteine proteinases superfamily protein isoform 1 [T...   310   6e-99

>XP_004496177.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cicer
            arietinum]
          Length = 313

 Score =  408 bits (1048), Expect = e-137
 Identities = 214/316 (67%), Positives = 233/316 (73%), Gaps = 7/316 (2%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFLHASS---AARLAHGT-AYSSASPRFSRPGHDGARRQHSSSCE 530
            MLGVLCATR +PWIFSFLH+S+   AARLAH T A SS S RF       ARR HSS+CE
Sbjct: 1    MLGVLCATRSRPWIFSFLHSSASHHAARLAHCTVACSSLSTRFDATF--AARRHHSSACE 58

Query: 531  LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 710
            L+        IWHAI PCGGDGFRRGVV V HDH+LKGEGSWNVAWDARPARWLH  DSA
Sbjct: 59   LQ-LGGGAASIWHAIRPCGGDGFRRGVVTVQHDHDLKGEGSWNVAWDARPARWLHRSDSA 117

Query: 711  WLLFGVCDCLXXXXXXXXXXXXXXXXXXXS---SEGREVKVAECDSKEQDDEVSSDYRVT 881
            WLLFGVC CL                   +   SEGRE+K AE D KE++DE+S+DYRVT
Sbjct: 118  WLLFGVCACLAPPVIADVDLEAPPTPAINTDENSEGREMKYAEGD-KERNDELSADYRVT 176

Query: 882  GVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGD 1061
            GVLADGRCLFRA+AHGACL NGE APNENRQ ELAD                  WF+EGD
Sbjct: 177  GVLADGRCLFRAIAHGACLNNGEEAPNENRQRELADELRARVAEELLKRRKETEWFIEGD 236

Query: 1062 FDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEIS 1241
            FDAYV RI+Q + WGGEPELLMASHVLKTPI VFMRD SSIDLVNIAKYGEEY ND+EIS
Sbjct: 237  FDAYVNRIRQTYVWGGEPELLMASHVLKTPIYVFMRDASSIDLVNIAKYGEEYMNDKEIS 296

Query: 1242 INVLFHRYGHYDILET 1289
            INVLFHR+GHY+ILET
Sbjct: 297  INVLFHRHGHYEILET 312


>BAE71258.1 hypothetical protein [Trifolium pratense]
          Length = 326

 Score =  395 bits (1014), Expect = e-132
 Identities = 212/330 (64%), Positives = 228/330 (69%), Gaps = 13/330 (3%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFLHASSA-----ARLAHGTAYSSASPRFSRPGHDGARRQHSSSC 527
            MLGVLCATR +PWIFSFLH SS+     ARLAH T  SS+S     P    ARR HSS C
Sbjct: 1    MLGVLCATRSRPWIFSFLHHSSSHHHHTARLAHITVASSSS---LSPTFFSARRNHSSQC 57

Query: 528  ELR-GXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPD 704
            +L+         IWHAIMPCGGDGF+RG   VHHDHELKGEGSWNVAWDARPARWLH  D
Sbjct: 58   KLQISAGGGAASIWHAIMPCGGDGFQRGAFMVHHDHELKGEGSWNVAWDARPARWLHRSD 117

Query: 705  SAWLLFGVCDCL-------XXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVS 863
            SAWLLFGV   L                           SEG E+K AE D  + +DE+S
Sbjct: 118  SAWLLFGVRAWLAPPPVIVDVDPEVPLPTSVISPDEISRSEGLEIKDAESD--KPNDELS 175

Query: 864  SDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXX 1043
            SDYRVTGVLADGRCLFRALAHGACLKNGE APNENRQ ELAD                  
Sbjct: 176  SDYRVTGVLADGRCLFRALAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETE 235

Query: 1044 WFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYR 1223
            WF+EGDFD YV RIQQ F WGGEPELLMASHVLKTPI VFMRD +SIDLVNIAKYGEEY 
Sbjct: 236  WFIEGDFDTYVTRIQQSFVWGGEPELLMASHVLKTPIFVFMRDPNSIDLVNIAKYGEEYM 295

Query: 1224 NDEEISINVLFHRYGHYDILETS*PKLPKK 1313
            NDE ISINVLFHR+GHY++LET  PKL +K
Sbjct: 296  NDEGISINVLFHRHGHYELLETLCPKLSQK 325


>XP_013469378.1 OTU-like cysteine protease [Medicago truncatula] KEH43416.1 OTU-like
            cysteine protease [Medicago truncatula]
          Length = 305

 Score =  380 bits (976), Expect = e-126
 Identities = 197/315 (62%), Positives = 215/315 (68%), Gaps = 6/315 (1%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELR-- 536
            MLGVLCATR +PWIFS  H   A RL+H T      P         ARR HS++C     
Sbjct: 1    MLGVLCATRSRPWIFSSHHHHHAFRLSHATVAPLTFP---------ARRHHSTACNNLQI 51

Query: 537  GXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWL 716
                    IWHAI PCGGDGFR G V +HHDHELKGEGSWNVAWDARPARWLH  DSAWL
Sbjct: 52   STGGGAASIWHAITPCGGDGFRTGGVMLHHDHELKGEGSWNVAWDARPARWLHRSDSAWL 111

Query: 717  LFGVCDCLXXXXXXXXXXXXXXXXXXX----SSEGREVKVAECDSKEQDDEVSSDYRVTG 884
            LFGVC CL                       SSEGRE+K    D  E+DDE+++DYRVTG
Sbjct: 112  LFGVCACLAPPVVLDVDPEAAAPTPAVFPNESSEGREMKDELSD--ERDDELNADYRVTG 169

Query: 885  VLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDF 1064
            VLADGRCLFRA+AHGACLKNGE APNE+RQ ELAD                  WF+EGDF
Sbjct: 170  VLADGRCLFRAIAHGACLKNGEEAPNESRQRELADELRVKVAEELLNRRKETEWFIEGDF 229

Query: 1065 DAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISI 1244
            D YV RIQQ + WGGEPELLMASHVLKTPI VFMRD SS+DLVNIAKYGEEY NDEEISI
Sbjct: 230  DTYVTRIQQTYVWGGEPELLMASHVLKTPIYVFMRDASSMDLVNIAKYGEEYMNDEEISI 289

Query: 1245 NVLFHRYGHYDILET 1289
            NVLFHR+GHY++LET
Sbjct: 290  NVLFHRHGHYELLET 304


>XP_003536306.1 PREDICTED: uncharacterized protein LOC100793001 [Glycine max]
            KRH34730.1 hypothetical protein GLYMA_10G202000 [Glycine
            max]
          Length = 296

 Score =  374 bits (959), Expect = e-124
 Identities = 193/313 (61%), Positives = 216/313 (69%), Gaps = 1/313 (0%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFLHA-SSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRG 539
            MLGVLCATRPKPW+ S +H  +S  RL H     SASP          RR+HS++C+L  
Sbjct: 1    MLGVLCATRPKPWLLSLVHVHASLPRLPHSPLSPSASPP--------PRRRHSTACKLFL 52

Query: 540  XXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLL 719
                   IWHAIMP G DG RRGVVAVH   +LKGEGSWNVAWDARPARWLH PDSAWLL
Sbjct: 53   SGGAAASIWHAIMPRGDDGLRRGVVAVH---DLKGEGSWNVAWDARPARWLHRPDSAWLL 109

Query: 720  FGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADG 899
            FGVC CL                    S G        D + ++DEVS+DYRVTGV ADG
Sbjct: 110  FGVCACLAPPPGCVDADTNSAGIAVDESCGL------LDKEREEDEVSADYRVTGVPADG 163

Query: 900  RCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVK 1079
            RCLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD Y++
Sbjct: 164  RCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQ 223

Query: 1080 RIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLFH 1259
            RIQQP+ WGGEPELLMASHVLKTPISVFMRDT S++LVNIAKYGEEYRND++ISINVLFH
Sbjct: 224  RIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFH 283

Query: 1260 RYGHYDILETS*P 1298
             YGHYDILET  P
Sbjct: 284  GYGHYDILETLRP 296


>XP_017413456.1 PREDICTED: uncharacterized protein LOC108324995 [Vigna angularis]
            KOM35649.1 hypothetical protein LR48_Vigan02g179900
            [Vigna angularis] BAT94560.1 hypothetical protein
            VIGAN_08117300 [Vigna angularis var. angularis]
          Length = 290

 Score =  366 bits (940), Expect = e-121
 Identities = 194/310 (62%), Positives = 211/310 (68%), Gaps = 1/310 (0%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 542
            MLGVLCATRPKPW+FS +HAS   RL H +    ASP          RR HSS+C+L G 
Sbjct: 1    MLGVLCATRPKPWLFSLVHASPP-RLPHASVSLLASP---------PRRHHSSACKLFGS 50

Query: 543  XXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLF 722
                  IWHAIMP  GDGFRRGVVAVH   +LKGEGSWNVAWD RPARWLH  DSAWLLF
Sbjct: 51   AGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRSDSAWLLF 107

Query: 723  GVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGR 902
            GVC CL                   S    +        KE   +VS+DYRVTGV ADGR
Sbjct: 108  GVCACLAPPGCVDAVTDSDAVAADESCGVLD--------KELKVDVSADYRVTGVPADGR 159

Query: 903  CLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKR 1082
            CLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD YVKR
Sbjct: 160  CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKR 219

Query: 1083 IQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND-EEISINVLFH 1259
            IQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+YRND EE SINVLFH
Sbjct: 220  IQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYRNDKEENSINVLFH 279

Query: 1260 RYGHYDILET 1289
             YGHYDILE+
Sbjct: 280  GYGHYDILES 289


>XP_003556279.1 PREDICTED: OTU domain-containing protein At3g57810-like [Glycine max]
            KHN00921.1 OTU domain-containing protein [Glycine soja]
            KRG92054.1 hypothetical protein GLYMA_20G188400 [Glycine
            max]
          Length = 294

 Score =  365 bits (936), Expect = e-121
 Identities = 192/311 (61%), Positives = 213/311 (68%), Gaps = 2/311 (0%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 542
            MLGVLCATR KPW+FS +HAS   RL+H     SASP          RR+HS++C+L   
Sbjct: 1    MLGVLCATRSKPWLFSLVHAS-LPRLSHAPLSPSASPP--------PRRRHSTACKLFLS 51

Query: 543  XXXXXXIWHAIMPC--GGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWL 716
                  IWHAIMP     DGFRRGVVA H   ++KGEGSWNVAWDARPARWLH PDSAWL
Sbjct: 52   AGGAASIWHAIMPRVNDDDGFRRGVVAFH---DMKGEGSWNVAWDARPARWLHRPDSAWL 108

Query: 717  LFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLAD 896
            LFGVC CL                    S          D + ++ EVS+DYRVTGV AD
Sbjct: 109  LFGVCACLAPPSSCVDADTNTDAIAVDES------CRLLDKEREEYEVSADYRVTGVPAD 162

Query: 897  GRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYV 1076
            GRCLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD YV
Sbjct: 163  GRCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELMKRREETEWFIEGDFDTYV 222

Query: 1077 KRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLF 1256
            +RIQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGEEYRND+EISINVLF
Sbjct: 223  QRIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEEYRNDKEISINVLF 282

Query: 1257 HRYGHYDILET 1289
            H YGHYDILET
Sbjct: 283  HGYGHYDILET 293


>XP_014512510.1 PREDICTED: uncharacterized protein LOC106771118 [Vigna radiata var.
            radiata]
          Length = 290

 Score =  364 bits (934), Expect = e-120
 Identities = 193/310 (62%), Positives = 210/310 (67%), Gaps = 1/310 (0%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 542
            MLGVLCATRPKPW+FS +HAS   RL H +    ASP          RR HSS+C+L G 
Sbjct: 1    MLGVLCATRPKPWLFSLVHASPP-RLPHASVSLLASP---------PRRHHSSACKLFGS 50

Query: 543  XXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLF 722
                  IWHAIMP  GDGFRRGVVAVH   +LKGEGSWNVAWD RPARWLH  DSAWLLF
Sbjct: 51   AGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRSDSAWLLF 107

Query: 723  GVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGR 902
            GVC CL                   S    +        KE   +VS+DYRVTGV ADGR
Sbjct: 108  GVCACLAPPGCVDAVTDSDAVAADESCGVLD--------KELKVDVSADYRVTGVPADGR 159

Query: 903  CLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKR 1082
            CLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD YVKR
Sbjct: 160  CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKR 219

Query: 1083 IQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND-EEISINVLFH 1259
            IQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+Y ND EE SINVLFH
Sbjct: 220  IQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYMNDKEENSINVLFH 279

Query: 1260 RYGHYDILET 1289
             YGHYDILE+
Sbjct: 280  GYGHYDILES 289


>XP_007143828.1 hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris]
            ESW15822.1 hypothetical protein PHAVU_007G105100g
            [Phaseolus vulgaris]
          Length = 305

 Score =  362 bits (928), Expect = e-119
 Identities = 197/322 (61%), Positives = 214/322 (66%), Gaps = 1/322 (0%)
 Frame = +3

Query: 327  NPAHDSHLCTSSMLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGAR 506
            NPAHDS   +S MLGVLCATRP+PW+FS +HAS   RL H +   SASP          R
Sbjct: 7    NPAHDSF--SSPMLGVLCATRPRPWLFSHVHAS-LPRLVHASVSLSASP---------PR 54

Query: 507  RQHSSSCELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPAR 686
            R HSS+C++ G       IWHAIMP  GD FRRGVV VH   +LKGEGSWNVAWD RPAR
Sbjct: 55   RHHSSACKIFGSAGGAASIWHAIMPRSGDRFRRGVVPVH---DLKGEGSWNVAWDTRPAR 111

Query: 687  WLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS 866
            WLH PDSAWLLFGVC CL                   S    +V+ A  D         +
Sbjct: 112  WLHRPDSAWLLFGVCACLAPPGCVDVVTDFEAVAVDESCGVLKVE-ASAD--------YA 162

Query: 867  DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 1046
            DYRVTGV ADGRCLFRA+AHG CL+NGE AP+EN Q ELAD                  W
Sbjct: 163  DYRVTGVPADGRCLFRAIAHGDCLRNGEKAPDENCQRELADELRAKVVDELLKRREETEW 222

Query: 1047 FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 1226
            F+EGDFD YVKRIQQPF WGGEPELLMASHVLKTPISVFMR T S+ LVNIAKYGEEYRN
Sbjct: 223  FIEGDFDTYVKRIQQPFVWGGEPELLMASHVLKTPISVFMRATGSVGLVNIAKYGEEYRN 282

Query: 1227 D-EEISINVLFHRYGHYDILET 1289
            D EE SINVLFH YGHYDILET
Sbjct: 283  DKEENSINVLFHGYGHYDILET 304


>XP_016177333.1 PREDICTED: uncharacterized protein LOC107619558 [Arachis ipaensis]
          Length = 327

 Score =  362 bits (929), Expect = e-119
 Identities = 201/328 (61%), Positives = 217/328 (66%), Gaps = 21/328 (6%)
 Frame = +3

Query: 369  GVLCATRPKPWIFS--FLHAS---SAARLAHGTAYSSASPRFSRP-GHDGARRQHSSSCE 530
            GVLCATRPKPWI S   LHAS   S+ARL H      A P F +      ARR HSS+C 
Sbjct: 4    GVLCATRPKPWILSAAILHASLHHSSARLLH------APPLFPQLLRRTDARRHHSSACN 57

Query: 531  LRGXXXXXXX--IWHAIMPCGGDGF------RRGVVAVHH-DHELKGEGSWNVAWDARPA 683
              G         IWHAIMPCGG          RGVVAVHH DHELKGEGSWNVAWDARPA
Sbjct: 58   HGGDFGGGGAASIWHAIMPCGGGAGSGKKLRHRGVVAVHHHDHELKGEGSWNVAWDARPA 117

Query: 684  RWLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXX------SSEGREVKVAECDSKE 845
            RWLH PDSAWLLFGVC CL                         + EG+ VKV       
Sbjct: 118  RWLHRPDSAWLLFGVCACLAPPVSSVTDLEATPPATATVVNRDINPEGQGVKV------- 170

Query: 846  QDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXX 1025
              D +SSDYRVTGVLADGRCLFRA+AHGACL+NGEAAP+E RQ ELAD            
Sbjct: 171  --DGLSSDYRVTGVLADGRCLFRAIAHGACLRNGEAAPDERRQRELADELRAQVVEELMK 228

Query: 1026 XXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 1205
                  WF+EGDFD YVKRIQQP+ WGGEPELLMASHVLKTPISVFMRDTSS+ LVNIAK
Sbjct: 229  RREETEWFIEGDFDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSLSLVNIAK 288

Query: 1206 YGEEYRNDEEISINVLFHRYGHYDILET 1289
            YGEEYRN++++ INVLFH YGHYDILET
Sbjct: 289  YGEEYRNEKDVCINVLFHGYGHYDILET 316


>XP_015941210.1 PREDICTED: uncharacterized protein LOC107466718 [Arachis duranensis]
          Length = 327

 Score =  360 bits (925), Expect = e-118
 Identities = 200/328 (60%), Positives = 217/328 (66%), Gaps = 21/328 (6%)
 Frame = +3

Query: 369  GVLCATRPKPWIFS--FLHAS---SAARLAHGTAYSSASPRFSRP-GHDGARRQHSSSCE 530
            GVLCATRPKPWI S   LHAS   S+ARL H      A P F +       RR HSS+C 
Sbjct: 4    GVLCATRPKPWILSAAILHASLHHSSARLLH------APPLFPQLLRRTDTRRHHSSACN 57

Query: 531  LRGXXXXXXX--IWHAIMPCGGDGF------RRGVVAVHH-DHELKGEGSWNVAWDARPA 683
              G         IWHAIMPCGG          RGVVAVHH DHELKGEGSWNVAWDARPA
Sbjct: 58   HGGDFGGGGAASIWHAIMPCGGGAGSGKKLRHRGVVAVHHHDHELKGEGSWNVAWDARPA 117

Query: 684  RWLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXX------SSEGREVKVAECDSKE 845
            RWLH PDSAWLLFGVC CL                         ++EG+ VKV       
Sbjct: 118  RWLHRPDSAWLLFGVCACLAPPVSSVADLEATPPATATVVNRDMNTEGQGVKV------- 170

Query: 846  QDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXX 1025
              D +SSDYRVTGVLADGRCLFRA+AHGACL+NGEAAP+E RQ ELAD            
Sbjct: 171  --DGLSSDYRVTGVLADGRCLFRAIAHGACLRNGEAAPDERRQRELADELRAQVVEELMK 228

Query: 1026 XXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 1205
                  WF+EGDFD YVKRIQQP+ WGGEPELLMASHVLKTPISVFMRDTSS+ LVNIAK
Sbjct: 229  RREETEWFIEGDFDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSLSLVNIAK 288

Query: 1206 YGEEYRNDEEISINVLFHRYGHYDILET 1289
            YGEEYRN++++ INVLFH YGHYDILET
Sbjct: 289  YGEEYRNEKDMCINVLFHGYGHYDILET 316


>XP_004142455.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
            sativus] KGN52210.1 hypothetical protein Csa_5G615810
            [Cucumis sativus]
          Length = 313

 Score =  327 bits (837), Expect = e-105
 Identities = 178/321 (55%), Positives = 204/321 (63%), Gaps = 4/321 (1%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIF----SFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCE 530
            MLGVLCA RPKPWI     +F+H S+     H   + S     S    D  +R HSS+C+
Sbjct: 1    MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH---HQSRLLVQSPIQFDRRQRHHSSACK 56

Query: 531  LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 710
            L G       IWHAIMP G            H HE KGEGSWNVAWDARPARWLH PDSA
Sbjct: 57   LAGGGAAS--IWHAIMPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSA 114

Query: 711  WLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVL 890
            WLLFGVC C+                     + +EV  +      Q+DE S+DYRVTGVL
Sbjct: 115  WLLFGVCACIAPLDWVDASHEAVSL-----DQKKEVCESSGPEFNQNDESSADYRVTGVL 169

Query: 891  ADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDA 1070
            ADGRCLFRA+AHGACL++GE AP+++RQ ELAD                  W++EGDFDA
Sbjct: 170  ADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDA 229

Query: 1071 YVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINV 1250
            YVKRIQQPF WGGEPELLMASHVLKTPISVFMR+ SS  L+NIAKYG+EY+  EE  INV
Sbjct: 230  YVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQKGEESPINV 289

Query: 1251 LFHRYGHYDILETS*PKLPKK 1313
            LFH YGHYDILETS  K+  K
Sbjct: 290  LFHGYGHYDILETSSDKVSLK 310


>XP_016900257.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
            melo]
          Length = 313

 Score =  325 bits (832), Expect = e-105
 Identities = 177/321 (55%), Positives = 204/321 (63%), Gaps = 4/321 (1%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIF----SFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCE 530
            MLGVLCA RPKPWI     +F+H S+     H   + S     S    D  +R HSS+C+
Sbjct: 1    MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH---HQSRLLVQSPIQFDRRQRHHSSACK 56

Query: 531  LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 710
            L G       IWHAI+P G            H HE KGEGSWNVAWDARPARWLH PDSA
Sbjct: 57   LAGGGAAS--IWHAILPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSA 114

Query: 711  WLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVL 890
            WLLFGVC C+                     + +EV  +      Q+DE S+DYRVTGVL
Sbjct: 115  WLLFGVCACIAPLDWVDASHEAVSL-----DQKKEVCESSGPEFNQNDESSADYRVTGVL 169

Query: 891  ADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDA 1070
            ADGRCLFRA+AHGACL++GE AP+++RQ ELAD                  W++EGDFDA
Sbjct: 170  ADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDA 229

Query: 1071 YVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINV 1250
            YVKRIQQPF WGGEPELLMASHVLKTPISVFMR+ SS  L+NIAKYG+EY+  EE  INV
Sbjct: 230  YVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQMGEESPINV 289

Query: 1251 LFHRYGHYDILETS*PKLPKK 1313
            LFH YGHYDILETS  K+  K
Sbjct: 290  LFHGYGHYDILETSSDKVSLK 310


>XP_019459096.1 PREDICTED: uncharacterized protein LOC109359045 [Lupinus
            angustifolius] OIW01500.1 hypothetical protein
            TanjilG_19426 [Lupinus angustifolius]
          Length = 319

 Score =  323 bits (828), Expect = e-104
 Identities = 179/318 (56%), Positives = 202/318 (63%), Gaps = 8/318 (2%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFLHASSAARLAHGTA--YSSASPRFSRPGHDGARRQHSSSCELR 536
            ML  LC TRPKP   S     +A+   H +A   + +S  F  PG DG RR HSS+C + 
Sbjct: 1    MLAALC-TRPKPSFLSSFFFQTASLHNHNSARFINGSSLHFYCPGGDGRRRHHSSACTIG 59

Query: 537  GXXXXXXXIWHAIMPCGGDGF----RRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPD 704
            G       IWH ++P           R   A+ H HEL+GEGSWN AWDARP+RWLH PD
Sbjct: 60   GSCGGAASIWHVVLPERAGASICCDLRWRSALPH-HELRGEGSWNAAWDARPSRWLHRPD 118

Query: 705  SAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGR-EVKVAECDSKEQDDEVSSDYRVT 881
            SAWLLFGVC CL                   S  G  ++K   CD   + +EVSS YR+T
Sbjct: 119  SAWLLFGVCACLAPPLLLADVNTEVPSAEHDSDGGGGDLKGPGCD---EQNEVSSAYRIT 175

Query: 882  GVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGD 1061
            GVLADGRCLFRA+AHGACL NGE AP+ENRQ ELAD                  WF+EGD
Sbjct: 176  GVLADGRCLFRAIAHGACLMNGEEAPDENRQRELADELRAQVVEELMKRREETEWFIEGD 235

Query: 1062 FDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEY-RNDEEI 1238
            FDAYV RIQQPF WGGEPELLMASHVLKTPISVFMRD SS DLVNIAKYGEEY   ++EI
Sbjct: 236  FDAYVTRIQQPFVWGGEPELLMASHVLKTPISVFMRDRSSGDLVNIAKYGEEYITKEKEI 295

Query: 1239 SINVLFHRYGHYDILETS 1292
            +INVLFH YGHYDILE S
Sbjct: 296  AINVLFHGYGHYDILEIS 313


>XP_010032108.1 PREDICTED: OTU domain-containing protein At3g57810 [Eucalyptus
            grandis] KCW51502.1 hypothetical protein EUGRSUZ_J01018
            [Eucalyptus grandis]
          Length = 314

 Score =  316 bits (809), Expect = e-101
 Identities = 178/320 (55%), Positives = 200/320 (62%), Gaps = 11/320 (3%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFS--FLHASSAARLAHGTAYSSASPRFSRPGHDG---ARRQHSSSC 527
            MLGVLCA RPKPWI +  F HAS+A         S+A+ R            RR HSSSC
Sbjct: 1    MLGVLCA-RPKPWILASCFSHASAAHHCGRLAWVSAAAARLQLAADSPDRWRRRHHSSSC 59

Query: 528  ELRGXXXXXXX-----IWHAIMPCG-GDGFRRGVVAVHHDHELKGEGSWNVAWDARPARW 689
             L G            IWHAI+P G GD  RR  +        +GEGSWNVAWDARPARW
Sbjct: 60   RLGGASSCAHPCGVASIWHAILPSGEGDPPRR--MDQPRRPVFRGEGSWNVAWDARPARW 117

Query: 690  LHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSD 869
            LH PDSAWLLFGVC CL                        E +V + DS ++    S D
Sbjct: 118  LHRPDSAWLLFGVCACLAPVDAAEPSREEVVP---------EARVEDRDSLDEAKRSSPD 168

Query: 870  YRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWF 1049
            YRVTGVLADGRCLFRA+AH ACL+ GEAAP++NRQ ELAD                  W 
Sbjct: 169  YRVTGVLADGRCLFRAIAHCACLRKGEAAPDDNRQRELADELRAQVVAELLKRREETEWA 228

Query: 1050 LEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND 1229
            +EGDFDAY++RIQQP+ WGGEPELLMASHVLKTPISVFM D SS +LVN+AKYGEEYR D
Sbjct: 229  IEGDFDAYIERIQQPYVWGGEPELLMASHVLKTPISVFMVDRSSGNLVNVAKYGEEYRKD 288

Query: 1230 EEISINVLFHRYGHYDILET 1289
            EEI INVLFH YGHYDILE+
Sbjct: 289  EEIPINVLFHGYGHYDILES 308


>OMO50984.1 Ovarian tumor, otubain [Corchorus olitorius]
          Length = 327

 Score =  315 bits (806), Expect = e-101
 Identities = 171/321 (53%), Positives = 197/321 (61%), Gaps = 12/321 (3%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFL----HASSAARLAHGTAYSSASPRFSRPGHDGAR-RQHSSSC 527
            MLGVLCA  PKPWI + L    H   +A   H +      P F+    D  R R HS++C
Sbjct: 1    MLGVLCARPPKPWILNSLSLVAHGGGSAAHHHDSRLLHW-PHFAHISADNRRCRHHSTAC 59

Query: 528  ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 707
             L G       IWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH PDS
Sbjct: 60   RLGGSDGGAASIWHAILPCGGSGRGRKREEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119

Query: 708  AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS------- 866
            AWLLFGVC CL                     EG E+      S ++   +SS       
Sbjct: 120  AWLLFGVCACLAPMIEFVDVNPETDDKI----EGAELISINGLSADEKSSISSSPVAAPD 175

Query: 867  DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 1046
            +Y+VTGVLADGRCLFRA+AHGACL++GE AP+E RQ ELAD                  W
Sbjct: 176  NYKVTGVLADGRCLFRAIAHGACLRSGEEAPDETRQRELADELRAQVVNELLKRREETEW 235

Query: 1047 FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 1226
            F+EGDFDAYVK IQQP+ WGGEPELLMASHVLKTPISV+M   SS +L+ IA YGEEY+ 
Sbjct: 236  FIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTPISVYMIHRSSRNLIKIADYGEEYQK 295

Query: 1227 DEEISINVLFHRYGHYDILET 1289
            D+E  INVLFH YGHYDILE+
Sbjct: 296  DKETPINVLFHGYGHYDILES 316


>OMO98833.1 Ovarian tumor, otubain [Corchorus capsularis]
          Length = 327

 Score =  314 bits (804), Expect = e-100
 Identities = 171/321 (53%), Positives = 197/321 (61%), Gaps = 12/321 (3%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFL----HASSAARLAHGTAYSSASPRFSRPGHDGAR-RQHSSSC 527
            MLGVLCA  PKPWI + L    H   +A   H +      P F+    D  R R HS++C
Sbjct: 1    MLGVLCARPPKPWILNSLSLVAHGGGSAAHHHDSRLLHW-PHFADLSADNRRCRHHSTAC 59

Query: 528  ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 707
             L G       IWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH PDS
Sbjct: 60   RLGGSDGGAASIWHAILPCGGSGRGRKREEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119

Query: 708  AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS------- 866
            AWLLFGVC CL                     EG E+      S ++   +SS       
Sbjct: 120  AWLLFGVCACLAPMIEFVDVNPETDDKI----EGTELISINGLSADEKSSISSSPVAAPD 175

Query: 867  DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 1046
            +Y+VTGVLADGRCLFRA+AHGACL++GE AP+E RQ ELAD                  W
Sbjct: 176  NYKVTGVLADGRCLFRAIAHGACLRSGEEAPDETRQRELADELRAQVVNELLKRREETEW 235

Query: 1047 FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 1226
            F+EGDFDAYVK IQQP+ WGGEPELLMASHVLKTPISV+M   SS +L+ IA YGEEY+ 
Sbjct: 236  FIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTPISVYMIHRSSRNLIKIADYGEEYQK 295

Query: 1227 DEEISINVLFHRYGHYDILET 1289
            D+E  INVLFH YGHYDILE+
Sbjct: 296  DKETPINVLFHGYGHYDILES 316


>XP_018845374.1 PREDICTED: uncharacterized protein LOC109009371 [Juglans regia]
          Length = 328

 Score =  313 bits (803), Expect = e-100
 Identities = 181/342 (52%), Positives = 207/342 (60%), Gaps = 33/342 (9%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIF-----SFLHASSAARLAHGTAYSSASPRFSRPGHDG---ARRQHS 518
            MLGVLCA RPKPWI      SF+H S+A     G   S        PG +G    RR HS
Sbjct: 1    MLGVLCA-RPKPWILTSLSSSFVHGSAAHHHITGLRQS--------PGFNGDLKPRRHHS 51

Query: 519  SSCELRGXXXXXXX-IWHAIMPCGGDGFRRGVVAVHHD---HELKGEGSWNVAWDARPAR 686
            S+C + G        IWHAIMPCG  G    ++   +     E +GEGSWNVAWDARPAR
Sbjct: 52   SACRIDGSFGGGAASIWHAIMPCGAAGHPSDLLLRRNAMLRRERRGEGSWNVAWDARPAR 111

Query: 687  WLHSPD-SAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKE----QD 851
            WLH PD SAWLLFGVC CL                        E K+  CDS +    ++
Sbjct: 112  WLHRPDYSAWLLFGVCACLAPLDFAFDDSPEAIVV--------EAKIEACDSIDSNANKN 163

Query: 852  DEV----------------SSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGEL 983
            DE+                S+DYRVTGVLADGRCLFRALAHGAC ++GE AP+ENRQ EL
Sbjct: 164  DEIDGFDAIYSNTSKPKEGSADYRVTGVLADGRCLFRALAHGACSRSGEEAPDENRQREL 223

Query: 984  ADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVF 1163
            AD                  WF+EGDFDAYV+RIQQPF WGGEPELLMASHVLKTPISVF
Sbjct: 224  ADELRAQVVDELLKRRKETEWFIEGDFDAYVERIQQPFVWGGEPELLMASHVLKTPISVF 283

Query: 1164 MRDTSSIDLVNIAKYGEEYRNDEEISINVLFHRYGHYDILET 1289
            M++ SS  LVNIAKYGEEYR +E+  INVLFH YGHYD+LE+
Sbjct: 284  MKNRSSGRLVNIAKYGEEYRKEEDSPINVLFHGYGHYDLLES 325


>KHN37847.1 OTU domain-containing protein [Glycine soja]
          Length = 234

 Score =  310 bits (793), Expect = e-100
 Identities = 154/236 (65%), Positives = 171/236 (72%)
 Frame = +3

Query: 591  DGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLFGVCDCLXXXXXXXXXX 770
            DGFRRGVVA H   ++KGEGSWNVAWDARPARWLH PDSAWLLFGVC CL          
Sbjct: 8    DGFRRGVVAFH---DMKGEGSWNVAWDARPARWLHRPDSAWLLFGVCACLAPPSSCVDAD 64

Query: 771  XXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGE 950
                      S          D + ++DEVS+DYRVTGV ADGRCLFRA+AHGACL+NGE
Sbjct: 65   TNTDAIAVDES------CRLLDKEREEDEVSADYRVTGVPADGRCLFRAIAHGACLRNGE 118

Query: 951  AAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMA 1130
             AP+ENRQ ELAD                  WF+EGDFD Y++RIQQP+ WGGEPELLMA
Sbjct: 119  KAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQRIQQPYVWGGEPELLMA 178

Query: 1131 SHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLFHRYGHYDILETS*P 1298
            SHVLKTPISVFMRDT S++LVNIAKYGEEYRND++ISINVLFH YGHYDILET  P
Sbjct: 179  SHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFHGYGHYDILETLRP 234


>GAU40884.1 hypothetical protein TSUD_40590 [Trifolium subterraneum]
          Length = 266

 Score =  310 bits (794), Expect = e-100
 Identities = 165/272 (60%), Positives = 180/272 (66%), Gaps = 11/272 (4%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFLHASS----AARLAHGT-AYSSASPRFSRPGHDGARRQHSSSC 527
            MLGVLCATR +PWIFSFLH+SS     ARLAH T + SS  P FS      ARR HSS C
Sbjct: 1    MLGVLCATRSRPWIFSFLHSSSHHNHTARLAHATVSASSLCPTFS------ARRNHSSQC 54

Query: 528  ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 707
            +L+        IWHAIMPCGGDG ++G   VHHDHELKGEGSWNVAWDARPARWLH  DS
Sbjct: 55   KLQISTGGAASIWHAIMPCGGDGLQQGGFMVHHDHELKGEGSWNVAWDARPARWLHRSDS 114

Query: 708  AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEG---REVK-VAECDSKEQDDEVSS--D 869
            AWLLFGVC CL                     E    RE+K + + +S +  DE+SS  D
Sbjct: 115  AWLLFGVCACLAPPVDVEAEVPPLTTSVISPDENYKRREIKDIKDAESDKPSDELSSEAD 174

Query: 870  YRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWF 1049
            YRVTGVLADGRCLFRA+AHGACLKNGE APNENRQ ELAD                  WF
Sbjct: 175  YRVTGVLADGRCLFRAIAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETEWF 234

Query: 1050 LEGDFDAYVKRIQQPFAWGGEPELLMASHVLK 1145
            +EGDFD YV RIQQ F WGGEPELLMASHVLK
Sbjct: 235  IEGDFDTYVTRIQQTFVWGGEPELLMASHVLK 266


>EOY19029.1 Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao]
          Length = 327

 Score =  310 bits (794), Expect = 6e-99
 Identities = 171/325 (52%), Positives = 197/325 (60%), Gaps = 16/325 (4%)
 Frame = +3

Query: 363  MLGVLCATRPKPWIFSFLHASSAARLAHG--TAYSSASPRFSRPGH-------DGARRQH 515
            MLGVLCA  PKPWI +     S + +AHG   A+   S     P H       D   R H
Sbjct: 1    MLGVLCARPPKPWILN-----SLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHH 55

Query: 516  SSSCELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLH 695
            S++C L G       IWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH
Sbjct: 56   STACRLGGSDGGAASIWHAILPCGGGGGGRRRGEVWKNVERKGEGSWNVAWDARPARWLH 115

Query: 696  SPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS--- 866
             PDSAWLLFGVC CL                     EG E+ +    S ++    SS   
Sbjct: 116  RPDSAWLLFGVCACLAPMIEFVDVNPDADDKI----EGAELNLVSRLSADEKSSSSSSSV 171

Query: 867  ----DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXX 1034
                + +VTGVLADGRCLFRA+AHGACL++GE AP+EN Q ELAD               
Sbjct: 172  AAADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVVNELLKRRE 231

Query: 1035 XXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGE 1214
               WF+EGDFDAYVK IQQP+ WGGEPE+LMASHVLKTPISV+M   SS +L  IAKYGE
Sbjct: 232  ETEWFIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNLTKIAKYGE 291

Query: 1215 EYRNDEEISINVLFHRYGHYDILET 1289
            EY+ D+E  INVLFH YGHYDILE+
Sbjct: 292  EYQKDKENPINVLFHGYGHYDILES 316


Top