BLASTX nr result

ID: Glycyrrhiza30_contig00012439 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza30_contig00012439
         (1304 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_004496177.1 PREDICTED: OTU domain-containing protein At3g5781...   408   e-138
BAE71258.1 hypothetical protein [Trifolium pratense]                  395   e-133
XP_013469378.1 OTU-like cysteine protease [Medicago truncatula] ...   380   e-128
XP_003536306.1 PREDICTED: uncharacterized protein LOC100793001 [...   374   e-125
XP_017413456.1 PREDICTED: uncharacterized protein LOC108324995 [...   366   e-122
XP_003556279.1 PREDICTED: OTU domain-containing protein At3g5781...   365   e-122
XP_014512510.1 PREDICTED: uncharacterized protein LOC106771118 [...   364   e-121
XP_007143828.1 hypothetical protein PHAVU_007G105100g [Phaseolus...   362   e-120
XP_016177333.1 PREDICTED: uncharacterized protein LOC107619558 [...   362   e-120
XP_015941210.1 PREDICTED: uncharacterized protein LOC107466718 [...   360   e-119
XP_004142455.1 PREDICTED: OTU domain-containing protein At3g5781...   327   e-106
XP_016900257.1 PREDICTED: OTU domain-containing protein At3g5781...   325   e-106
XP_019459096.1 PREDICTED: uncharacterized protein LOC109359045 [...   323   e-105
XP_010032108.1 PREDICTED: OTU domain-containing protein At3g5781...   316   e-102
OMO50984.1 Ovarian tumor, otubain [Corchorus olitorius]               315   e-101
OMO98833.1 Ovarian tumor, otubain [Corchorus capsularis]              314   e-101
XP_018845374.1 PREDICTED: uncharacterized protein LOC109009371 [...   313   e-101
KHN37847.1 OTU domain-containing protein [Glycine soja]               310   e-101
GAU40884.1 hypothetical protein TSUD_40590 [Trifolium subterraneum]   310   e-101
EOY19029.1 Cysteine proteinases superfamily protein isoform 1 [T...   310   e-100

>XP_004496177.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cicer
            arietinum]
          Length = 313

 Score =  408 bits (1048), Expect = e-138
 Identities = 214/316 (67%), Positives = 233/316 (73%), Gaps = 7/316 (2%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFLHASS---AARLAHGT-AYSSASPRFSRPGHDGARRQHSSSCE 319
            MLGVLCATR +PWIFSFLH+S+   AARLAH T A SS S RF       ARR HSS+CE
Sbjct: 1    MLGVLCATRSRPWIFSFLHSSASHHAARLAHCTVACSSLSTRFDATF--AARRHHSSACE 58

Query: 320  LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 499
            L+        IWHAI PCGGDGFRRGVV V HDH+LKGEGSWNVAWDARPARWLH  DSA
Sbjct: 59   LQ-LGGGAASIWHAIRPCGGDGFRRGVVTVQHDHDLKGEGSWNVAWDARPARWLHRSDSA 117

Query: 500  WLLFGVCDCLXXXXXXXXXXXXXXXXXXXS---SEGREVKVAECDSKEQDDEVSSDYRVT 670
            WLLFGVC CL                   +   SEGRE+K AE D KE++DE+S+DYRVT
Sbjct: 118  WLLFGVCACLAPPVIADVDLEAPPTPAINTDENSEGREMKYAEGD-KERNDELSADYRVT 176

Query: 671  GVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGD 850
            GVLADGRCLFRA+AHGACL NGE APNENRQ ELAD                  WF+EGD
Sbjct: 177  GVLADGRCLFRAIAHGACLNNGEEAPNENRQRELADELRARVAEELLKRRKETEWFIEGD 236

Query: 851  FDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEIS 1030
            FDAYV RI+Q + WGGEPELLMASHVLKTPI VFMRD SSIDLVNIAKYGEEY ND+EIS
Sbjct: 237  FDAYVNRIRQTYVWGGEPELLMASHVLKTPIYVFMRDASSIDLVNIAKYGEEYMNDKEIS 296

Query: 1031 INVLFHRYGHYDILET 1078
            INVLFHR+GHY+ILET
Sbjct: 297  INVLFHRHGHYEILET 312


>BAE71258.1 hypothetical protein [Trifolium pratense]
          Length = 326

 Score =  395 bits (1014), Expect = e-133
 Identities = 212/330 (64%), Positives = 228/330 (69%), Gaps = 13/330 (3%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFLHASSA-----ARLAHGTAYSSASPRFSRPGHDGARRQHSSSC 316
            MLGVLCATR +PWIFSFLH SS+     ARLAH T  SS+S     P    ARR HSS C
Sbjct: 1    MLGVLCATRSRPWIFSFLHHSSSHHHHTARLAHITVASSSS---LSPTFFSARRNHSSQC 57

Query: 317  ELR-GXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPD 493
            +L+         IWHAIMPCGGDGF+RG   VHHDHELKGEGSWNVAWDARPARWLH  D
Sbjct: 58   KLQISAGGGAASIWHAIMPCGGDGFQRGAFMVHHDHELKGEGSWNVAWDARPARWLHRSD 117

Query: 494  SAWLLFGVCDCL-------XXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVS 652
            SAWLLFGV   L                           SEG E+K AE D  + +DE+S
Sbjct: 118  SAWLLFGVRAWLAPPPVIVDVDPEVPLPTSVISPDEISRSEGLEIKDAESD--KPNDELS 175

Query: 653  SDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXX 832
            SDYRVTGVLADGRCLFRALAHGACLKNGE APNENRQ ELAD                  
Sbjct: 176  SDYRVTGVLADGRCLFRALAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETE 235

Query: 833  WFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYR 1012
            WF+EGDFD YV RIQQ F WGGEPELLMASHVLKTPI VFMRD +SIDLVNIAKYGEEY 
Sbjct: 236  WFIEGDFDTYVTRIQQSFVWGGEPELLMASHVLKTPIFVFMRDPNSIDLVNIAKYGEEYM 295

Query: 1013 NDEEISINVLFHRYGHYDILETS*PKLPKK 1102
            NDE ISINVLFHR+GHY++LET  PKL +K
Sbjct: 296  NDEGISINVLFHRHGHYELLETLCPKLSQK 325


>XP_013469378.1 OTU-like cysteine protease [Medicago truncatula] KEH43416.1 OTU-like
            cysteine protease [Medicago truncatula]
          Length = 305

 Score =  380 bits (976), Expect = e-128
 Identities = 197/315 (62%), Positives = 215/315 (68%), Gaps = 6/315 (1%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELR-- 325
            MLGVLCATR +PWIFS  H   A RL+H T      P         ARR HS++C     
Sbjct: 1    MLGVLCATRSRPWIFSSHHHHHAFRLSHATVAPLTFP---------ARRHHSTACNNLQI 51

Query: 326  GXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWL 505
                    IWHAI PCGGDGFR G V +HHDHELKGEGSWNVAWDARPARWLH  DSAWL
Sbjct: 52   STGGGAASIWHAITPCGGDGFRTGGVMLHHDHELKGEGSWNVAWDARPARWLHRSDSAWL 111

Query: 506  LFGVCDCLXXXXXXXXXXXXXXXXXXX----SSEGREVKVAECDSKEQDDEVSSDYRVTG 673
            LFGVC CL                       SSEGRE+K    D  E+DDE+++DYRVTG
Sbjct: 112  LFGVCACLAPPVVLDVDPEAAAPTPAVFPNESSEGREMKDELSD--ERDDELNADYRVTG 169

Query: 674  VLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDF 853
            VLADGRCLFRA+AHGACLKNGE APNE+RQ ELAD                  WF+EGDF
Sbjct: 170  VLADGRCLFRAIAHGACLKNGEEAPNESRQRELADELRVKVAEELLNRRKETEWFIEGDF 229

Query: 854  DAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISI 1033
            D YV RIQQ + WGGEPELLMASHVLKTPI VFMRD SS+DLVNIAKYGEEY NDEEISI
Sbjct: 230  DTYVTRIQQTYVWGGEPELLMASHVLKTPIYVFMRDASSMDLVNIAKYGEEYMNDEEISI 289

Query: 1034 NVLFHRYGHYDILET 1078
            NVLFHR+GHY++LET
Sbjct: 290  NVLFHRHGHYELLET 304


>XP_003536306.1 PREDICTED: uncharacterized protein LOC100793001 [Glycine max]
            KRH34730.1 hypothetical protein GLYMA_10G202000 [Glycine
            max]
          Length = 296

 Score =  374 bits (959), Expect = e-125
 Identities = 193/313 (61%), Positives = 216/313 (69%), Gaps = 1/313 (0%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFLHA-SSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRG 328
            MLGVLCATRPKPW+ S +H  +S  RL H     SASP          RR+HS++C+L  
Sbjct: 1    MLGVLCATRPKPWLLSLVHVHASLPRLPHSPLSPSASPP--------PRRRHSTACKLFL 52

Query: 329  XXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLL 508
                   IWHAIMP G DG RRGVVAVH   +LKGEGSWNVAWDARPARWLH PDSAWLL
Sbjct: 53   SGGAAASIWHAIMPRGDDGLRRGVVAVH---DLKGEGSWNVAWDARPARWLHRPDSAWLL 109

Query: 509  FGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADG 688
            FGVC CL                    S G        D + ++DEVS+DYRVTGV ADG
Sbjct: 110  FGVCACLAPPPGCVDADTNSAGIAVDESCGL------LDKEREEDEVSADYRVTGVPADG 163

Query: 689  RCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVK 868
            RCLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD Y++
Sbjct: 164  RCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQ 223

Query: 869  RIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLFH 1048
            RIQQP+ WGGEPELLMASHVLKTPISVFMRDT S++LVNIAKYGEEYRND++ISINVLFH
Sbjct: 224  RIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFH 283

Query: 1049 RYGHYDILETS*P 1087
             YGHYDILET  P
Sbjct: 284  GYGHYDILETLRP 296


>XP_017413456.1 PREDICTED: uncharacterized protein LOC108324995 [Vigna angularis]
            KOM35649.1 hypothetical protein LR48_Vigan02g179900
            [Vigna angularis] BAT94560.1 hypothetical protein
            VIGAN_08117300 [Vigna angularis var. angularis]
          Length = 290

 Score =  366 bits (940), Expect = e-122
 Identities = 194/310 (62%), Positives = 211/310 (68%), Gaps = 1/310 (0%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 331
            MLGVLCATRPKPW+FS +HAS   RL H +    ASP          RR HSS+C+L G 
Sbjct: 1    MLGVLCATRPKPWLFSLVHASPP-RLPHASVSLLASP---------PRRHHSSACKLFGS 50

Query: 332  XXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLF 511
                  IWHAIMP  GDGFRRGVVAVH   +LKGEGSWNVAWD RPARWLH  DSAWLLF
Sbjct: 51   AGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRSDSAWLLF 107

Query: 512  GVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGR 691
            GVC CL                   S    +        KE   +VS+DYRVTGV ADGR
Sbjct: 108  GVCACLAPPGCVDAVTDSDAVAADESCGVLD--------KELKVDVSADYRVTGVPADGR 159

Query: 692  CLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKR 871
            CLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD YVKR
Sbjct: 160  CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKR 219

Query: 872  IQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND-EEISINVLFH 1048
            IQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+YRND EE SINVLFH
Sbjct: 220  IQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYRNDKEENSINVLFH 279

Query: 1049 RYGHYDILET 1078
             YGHYDILE+
Sbjct: 280  GYGHYDILES 289


>XP_003556279.1 PREDICTED: OTU domain-containing protein At3g57810-like [Glycine max]
            KHN00921.1 OTU domain-containing protein [Glycine soja]
            KRG92054.1 hypothetical protein GLYMA_20G188400 [Glycine
            max]
          Length = 294

 Score =  365 bits (936), Expect = e-122
 Identities = 192/311 (61%), Positives = 213/311 (68%), Gaps = 2/311 (0%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 331
            MLGVLCATR KPW+FS +HAS   RL+H     SASP          RR+HS++C+L   
Sbjct: 1    MLGVLCATRSKPWLFSLVHAS-LPRLSHAPLSPSASPP--------PRRRHSTACKLFLS 51

Query: 332  XXXXXXIWHAIMPC--GGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWL 505
                  IWHAIMP     DGFRRGVVA H   ++KGEGSWNVAWDARPARWLH PDSAWL
Sbjct: 52   AGGAASIWHAIMPRVNDDDGFRRGVVAFH---DMKGEGSWNVAWDARPARWLHRPDSAWL 108

Query: 506  LFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLAD 685
            LFGVC CL                    S          D + ++ EVS+DYRVTGV AD
Sbjct: 109  LFGVCACLAPPSSCVDADTNTDAIAVDES------CRLLDKEREEYEVSADYRVTGVPAD 162

Query: 686  GRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYV 865
            GRCLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD YV
Sbjct: 163  GRCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELMKRREETEWFIEGDFDTYV 222

Query: 866  KRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLF 1045
            +RIQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGEEYRND+EISINVLF
Sbjct: 223  QRIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEEYRNDKEISINVLF 282

Query: 1046 HRYGHYDILET 1078
            H YGHYDILET
Sbjct: 283  HGYGHYDILET 293


>XP_014512510.1 PREDICTED: uncharacterized protein LOC106771118 [Vigna radiata var.
            radiata]
          Length = 290

 Score =  364 bits (934), Expect = e-121
 Identities = 193/310 (62%), Positives = 210/310 (67%), Gaps = 1/310 (0%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 331
            MLGVLCATRPKPW+FS +HAS   RL H +    ASP          RR HSS+C+L G 
Sbjct: 1    MLGVLCATRPKPWLFSLVHASPP-RLPHASVSLLASP---------PRRHHSSACKLFGS 50

Query: 332  XXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLF 511
                  IWHAIMP  GDGFRRGVVAVH   +LKGEGSWNVAWD RPARWLH  DSAWLLF
Sbjct: 51   AGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRSDSAWLLF 107

Query: 512  GVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGR 691
            GVC CL                   S    +        KE   +VS+DYRVTGV ADGR
Sbjct: 108  GVCACLAPPGCVDAVTDSDAVAADESCGVLD--------KELKVDVSADYRVTGVPADGR 159

Query: 692  CLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKR 871
            CLFRA+AHGACL+NGE AP+ENRQ ELAD                  WF+EGDFD YVKR
Sbjct: 160  CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKR 219

Query: 872  IQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND-EEISINVLFH 1048
            IQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+Y ND EE SINVLFH
Sbjct: 220  IQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYMNDKEENSINVLFH 279

Query: 1049 RYGHYDILET 1078
             YGHYDILE+
Sbjct: 280  GYGHYDILES 289


>XP_007143828.1 hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris]
            ESW15822.1 hypothetical protein PHAVU_007G105100g
            [Phaseolus vulgaris]
          Length = 305

 Score =  362 bits (928), Expect = e-120
 Identities = 197/322 (61%), Positives = 214/322 (66%), Gaps = 1/322 (0%)
 Frame = +2

Query: 116  NPAHDSHLCTSSMLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGAR 295
            NPAHDS   +S MLGVLCATRP+PW+FS +HAS   RL H +   SASP          R
Sbjct: 7    NPAHDSF--SSPMLGVLCATRPRPWLFSHVHAS-LPRLVHASVSLSASP---------PR 54

Query: 296  RQHSSSCELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPAR 475
            R HSS+C++ G       IWHAIMP  GD FRRGVV VH   +LKGEGSWNVAWD RPAR
Sbjct: 55   RHHSSACKIFGSAGGAASIWHAIMPRSGDRFRRGVVPVH---DLKGEGSWNVAWDTRPAR 111

Query: 476  WLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS 655
            WLH PDSAWLLFGVC CL                   S    +V+ A  D         +
Sbjct: 112  WLHRPDSAWLLFGVCACLAPPGCVDVVTDFEAVAVDESCGVLKVE-ASAD--------YA 162

Query: 656  DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 835
            DYRVTGV ADGRCLFRA+AHG CL+NGE AP+EN Q ELAD                  W
Sbjct: 163  DYRVTGVPADGRCLFRAIAHGDCLRNGEKAPDENCQRELADELRAKVVDELLKRREETEW 222

Query: 836  FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 1015
            F+EGDFD YVKRIQQPF WGGEPELLMASHVLKTPISVFMR T S+ LVNIAKYGEEYRN
Sbjct: 223  FIEGDFDTYVKRIQQPFVWGGEPELLMASHVLKTPISVFMRATGSVGLVNIAKYGEEYRN 282

Query: 1016 D-EEISINVLFHRYGHYDILET 1078
            D EE SINVLFH YGHYDILET
Sbjct: 283  DKEENSINVLFHGYGHYDILET 304


>XP_016177333.1 PREDICTED: uncharacterized protein LOC107619558 [Arachis ipaensis]
          Length = 327

 Score =  362 bits (929), Expect = e-120
 Identities = 201/328 (61%), Positives = 217/328 (66%), Gaps = 21/328 (6%)
 Frame = +2

Query: 158  GVLCATRPKPWIFS--FLHAS---SAARLAHGTAYSSASPRFSRP-GHDGARRQHSSSCE 319
            GVLCATRPKPWI S   LHAS   S+ARL H      A P F +      ARR HSS+C 
Sbjct: 4    GVLCATRPKPWILSAAILHASLHHSSARLLH------APPLFPQLLRRTDARRHHSSACN 57

Query: 320  LRGXXXXXXX--IWHAIMPCGGDGF------RRGVVAVHH-DHELKGEGSWNVAWDARPA 472
              G         IWHAIMPCGG          RGVVAVHH DHELKGEGSWNVAWDARPA
Sbjct: 58   HGGDFGGGGAASIWHAIMPCGGGAGSGKKLRHRGVVAVHHHDHELKGEGSWNVAWDARPA 117

Query: 473  RWLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXX------SSEGREVKVAECDSKE 634
            RWLH PDSAWLLFGVC CL                         + EG+ VKV       
Sbjct: 118  RWLHRPDSAWLLFGVCACLAPPVSSVTDLEATPPATATVVNRDINPEGQGVKV------- 170

Query: 635  QDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXX 814
              D +SSDYRVTGVLADGRCLFRA+AHGACL+NGEAAP+E RQ ELAD            
Sbjct: 171  --DGLSSDYRVTGVLADGRCLFRAIAHGACLRNGEAAPDERRQRELADELRAQVVEELMK 228

Query: 815  XXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 994
                  WF+EGDFD YVKRIQQP+ WGGEPELLMASHVLKTPISVFMRDTSS+ LVNIAK
Sbjct: 229  RREETEWFIEGDFDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSLSLVNIAK 288

Query: 995  YGEEYRNDEEISINVLFHRYGHYDILET 1078
            YGEEYRN++++ INVLFH YGHYDILET
Sbjct: 289  YGEEYRNEKDVCINVLFHGYGHYDILET 316


>XP_015941210.1 PREDICTED: uncharacterized protein LOC107466718 [Arachis duranensis]
          Length = 327

 Score =  360 bits (925), Expect = e-119
 Identities = 200/328 (60%), Positives = 217/328 (66%), Gaps = 21/328 (6%)
 Frame = +2

Query: 158  GVLCATRPKPWIFS--FLHAS---SAARLAHGTAYSSASPRFSRP-GHDGARRQHSSSCE 319
            GVLCATRPKPWI S   LHAS   S+ARL H      A P F +       RR HSS+C 
Sbjct: 4    GVLCATRPKPWILSAAILHASLHHSSARLLH------APPLFPQLLRRTDTRRHHSSACN 57

Query: 320  LRGXXXXXXX--IWHAIMPCGGDGF------RRGVVAVHH-DHELKGEGSWNVAWDARPA 472
              G         IWHAIMPCGG          RGVVAVHH DHELKGEGSWNVAWDARPA
Sbjct: 58   HGGDFGGGGAASIWHAIMPCGGGAGSGKKLRHRGVVAVHHHDHELKGEGSWNVAWDARPA 117

Query: 473  RWLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXX------SSEGREVKVAECDSKE 634
            RWLH PDSAWLLFGVC CL                         ++EG+ VKV       
Sbjct: 118  RWLHRPDSAWLLFGVCACLAPPVSSVADLEATPPATATVVNRDMNTEGQGVKV------- 170

Query: 635  QDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXX 814
              D +SSDYRVTGVLADGRCLFRA+AHGACL+NGEAAP+E RQ ELAD            
Sbjct: 171  --DGLSSDYRVTGVLADGRCLFRAIAHGACLRNGEAAPDERRQRELADELRAQVVEELMK 228

Query: 815  XXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 994
                  WF+EGDFD YVKRIQQP+ WGGEPELLMASHVLKTPISVFMRDTSS+ LVNIAK
Sbjct: 229  RREETEWFIEGDFDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSLSLVNIAK 288

Query: 995  YGEEYRNDEEISINVLFHRYGHYDILET 1078
            YGEEYRN++++ INVLFH YGHYDILET
Sbjct: 289  YGEEYRNEKDMCINVLFHGYGHYDILET 316


>XP_004142455.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
            sativus] KGN52210.1 hypothetical protein Csa_5G615810
            [Cucumis sativus]
          Length = 313

 Score =  327 bits (837), Expect = e-106
 Identities = 178/321 (55%), Positives = 204/321 (63%), Gaps = 4/321 (1%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIF----SFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCE 319
            MLGVLCA RPKPWI     +F+H S+     H   + S     S    D  +R HSS+C+
Sbjct: 1    MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH---HQSRLLVQSPIQFDRRQRHHSSACK 56

Query: 320  LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 499
            L G       IWHAIMP G            H HE KGEGSWNVAWDARPARWLH PDSA
Sbjct: 57   LAGGGAAS--IWHAIMPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSA 114

Query: 500  WLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVL 679
            WLLFGVC C+                     + +EV  +      Q+DE S+DYRVTGVL
Sbjct: 115  WLLFGVCACIAPLDWVDASHEAVSL-----DQKKEVCESSGPEFNQNDESSADYRVTGVL 169

Query: 680  ADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDA 859
            ADGRCLFRA+AHGACL++GE AP+++RQ ELAD                  W++EGDFDA
Sbjct: 170  ADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDA 229

Query: 860  YVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINV 1039
            YVKRIQQPF WGGEPELLMASHVLKTPISVFMR+ SS  L+NIAKYG+EY+  EE  INV
Sbjct: 230  YVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQKGEESPINV 289

Query: 1040 LFHRYGHYDILETS*PKLPKK 1102
            LFH YGHYDILETS  K+  K
Sbjct: 290  LFHGYGHYDILETSSDKVSLK 310


>XP_016900257.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
            melo]
          Length = 313

 Score =  325 bits (832), Expect = e-106
 Identities = 177/321 (55%), Positives = 204/321 (63%), Gaps = 4/321 (1%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIF----SFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCE 319
            MLGVLCA RPKPWI     +F+H S+     H   + S     S    D  +R HSS+C+
Sbjct: 1    MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH---HQSRLLVQSPIQFDRRQRHHSSACK 56

Query: 320  LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 499
            L G       IWHAI+P G            H HE KGEGSWNVAWDARPARWLH PDSA
Sbjct: 57   LAGGGAAS--IWHAILPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSA 114

Query: 500  WLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVL 679
            WLLFGVC C+                     + +EV  +      Q+DE S+DYRVTGVL
Sbjct: 115  WLLFGVCACIAPLDWVDASHEAVSL-----DQKKEVCESSGPEFNQNDESSADYRVTGVL 169

Query: 680  ADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDA 859
            ADGRCLFRA+AHGACL++GE AP+++RQ ELAD                  W++EGDFDA
Sbjct: 170  ADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDA 229

Query: 860  YVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINV 1039
            YVKRIQQPF WGGEPELLMASHVLKTPISVFMR+ SS  L+NIAKYG+EY+  EE  INV
Sbjct: 230  YVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQMGEESPINV 289

Query: 1040 LFHRYGHYDILETS*PKLPKK 1102
            LFH YGHYDILETS  K+  K
Sbjct: 290  LFHGYGHYDILETSSDKVSLK 310


>XP_019459096.1 PREDICTED: uncharacterized protein LOC109359045 [Lupinus
            angustifolius] OIW01500.1 hypothetical protein
            TanjilG_19426 [Lupinus angustifolius]
          Length = 319

 Score =  323 bits (828), Expect = e-105
 Identities = 179/318 (56%), Positives = 202/318 (63%), Gaps = 8/318 (2%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFLHASSAARLAHGTA--YSSASPRFSRPGHDGARRQHSSSCELR 325
            ML  LC TRPKP   S     +A+   H +A   + +S  F  PG DG RR HSS+C + 
Sbjct: 1    MLAALC-TRPKPSFLSSFFFQTASLHNHNSARFINGSSLHFYCPGGDGRRRHHSSACTIG 59

Query: 326  GXXXXXXXIWHAIMPCGGDGF----RRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPD 493
            G       IWH ++P           R   A+ H HEL+GEGSWN AWDARP+RWLH PD
Sbjct: 60   GSCGGAASIWHVVLPERAGASICCDLRWRSALPH-HELRGEGSWNAAWDARPSRWLHRPD 118

Query: 494  SAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGR-EVKVAECDSKEQDDEVSSDYRVT 670
            SAWLLFGVC CL                   S  G  ++K   CD   + +EVSS YR+T
Sbjct: 119  SAWLLFGVCACLAPPLLLADVNTEVPSAEHDSDGGGGDLKGPGCD---EQNEVSSAYRIT 175

Query: 671  GVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGD 850
            GVLADGRCLFRA+AHGACL NGE AP+ENRQ ELAD                  WF+EGD
Sbjct: 176  GVLADGRCLFRAIAHGACLMNGEEAPDENRQRELADELRAQVVEELMKRREETEWFIEGD 235

Query: 851  FDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEY-RNDEEI 1027
            FDAYV RIQQPF WGGEPELLMASHVLKTPISVFMRD SS DLVNIAKYGEEY   ++EI
Sbjct: 236  FDAYVTRIQQPFVWGGEPELLMASHVLKTPISVFMRDRSSGDLVNIAKYGEEYITKEKEI 295

Query: 1028 SINVLFHRYGHYDILETS 1081
            +INVLFH YGHYDILE S
Sbjct: 296  AINVLFHGYGHYDILEIS 313


>XP_010032108.1 PREDICTED: OTU domain-containing protein At3g57810 [Eucalyptus
            grandis] KCW51502.1 hypothetical protein EUGRSUZ_J01018
            [Eucalyptus grandis]
          Length = 314

 Score =  316 bits (809), Expect = e-102
 Identities = 178/320 (55%), Positives = 200/320 (62%), Gaps = 11/320 (3%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFS--FLHASSAARLAHGTAYSSASPRFSRPGHDG---ARRQHSSSC 316
            MLGVLCA RPKPWI +  F HAS+A         S+A+ R            RR HSSSC
Sbjct: 1    MLGVLCA-RPKPWILASCFSHASAAHHCGRLAWVSAAAARLQLAADSPDRWRRRHHSSSC 59

Query: 317  ELRGXXXXXXX-----IWHAIMPCG-GDGFRRGVVAVHHDHELKGEGSWNVAWDARPARW 478
             L G            IWHAI+P G GD  RR  +        +GEGSWNVAWDARPARW
Sbjct: 60   RLGGASSCAHPCGVASIWHAILPSGEGDPPRR--MDQPRRPVFRGEGSWNVAWDARPARW 117

Query: 479  LHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSD 658
            LH PDSAWLLFGVC CL                        E +V + DS ++    S D
Sbjct: 118  LHRPDSAWLLFGVCACLAPVDAAEPSREEVVP---------EARVEDRDSLDEAKRSSPD 168

Query: 659  YRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWF 838
            YRVTGVLADGRCLFRA+AH ACL+ GEAAP++NRQ ELAD                  W 
Sbjct: 169  YRVTGVLADGRCLFRAIAHCACLRKGEAAPDDNRQRELADELRAQVVAELLKRREETEWA 228

Query: 839  LEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND 1018
            +EGDFDAY++RIQQP+ WGGEPELLMASHVLKTPISVFM D SS +LVN+AKYGEEYR D
Sbjct: 229  IEGDFDAYIERIQQPYVWGGEPELLMASHVLKTPISVFMVDRSSGNLVNVAKYGEEYRKD 288

Query: 1019 EEISINVLFHRYGHYDILET 1078
            EEI INVLFH YGHYDILE+
Sbjct: 289  EEIPINVLFHGYGHYDILES 308


>OMO50984.1 Ovarian tumor, otubain [Corchorus olitorius]
          Length = 327

 Score =  315 bits (806), Expect = e-101
 Identities = 171/321 (53%), Positives = 197/321 (61%), Gaps = 12/321 (3%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFL----HASSAARLAHGTAYSSASPRFSRPGHDGAR-RQHSSSC 316
            MLGVLCA  PKPWI + L    H   +A   H +      P F+    D  R R HS++C
Sbjct: 1    MLGVLCARPPKPWILNSLSLVAHGGGSAAHHHDSRLLHW-PHFAHISADNRRCRHHSTAC 59

Query: 317  ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 496
             L G       IWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH PDS
Sbjct: 60   RLGGSDGGAASIWHAILPCGGSGRGRKREEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119

Query: 497  AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS------- 655
            AWLLFGVC CL                     EG E+      S ++   +SS       
Sbjct: 120  AWLLFGVCACLAPMIEFVDVNPETDDKI----EGAELISINGLSADEKSSISSSPVAAPD 175

Query: 656  DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 835
            +Y+VTGVLADGRCLFRA+AHGACL++GE AP+E RQ ELAD                  W
Sbjct: 176  NYKVTGVLADGRCLFRAIAHGACLRSGEEAPDETRQRELADELRAQVVNELLKRREETEW 235

Query: 836  FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 1015
            F+EGDFDAYVK IQQP+ WGGEPELLMASHVLKTPISV+M   SS +L+ IA YGEEY+ 
Sbjct: 236  FIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTPISVYMIHRSSRNLIKIADYGEEYQK 295

Query: 1016 DEEISINVLFHRYGHYDILET 1078
            D+E  INVLFH YGHYDILE+
Sbjct: 296  DKETPINVLFHGYGHYDILES 316


>OMO98833.1 Ovarian tumor, otubain [Corchorus capsularis]
          Length = 327

 Score =  314 bits (804), Expect = e-101
 Identities = 171/321 (53%), Positives = 197/321 (61%), Gaps = 12/321 (3%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFL----HASSAARLAHGTAYSSASPRFSRPGHDGAR-RQHSSSC 316
            MLGVLCA  PKPWI + L    H   +A   H +      P F+    D  R R HS++C
Sbjct: 1    MLGVLCARPPKPWILNSLSLVAHGGGSAAHHHDSRLLHW-PHFADLSADNRRCRHHSTAC 59

Query: 317  ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 496
             L G       IWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH PDS
Sbjct: 60   RLGGSDGGAASIWHAILPCGGSGRGRKREEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119

Query: 497  AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS------- 655
            AWLLFGVC CL                     EG E+      S ++   +SS       
Sbjct: 120  AWLLFGVCACLAPMIEFVDVNPETDDKI----EGTELISINGLSADEKSSISSSPVAAPD 175

Query: 656  DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 835
            +Y+VTGVLADGRCLFRA+AHGACL++GE AP+E RQ ELAD                  W
Sbjct: 176  NYKVTGVLADGRCLFRAIAHGACLRSGEEAPDETRQRELADELRAQVVNELLKRREETEW 235

Query: 836  FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 1015
            F+EGDFDAYVK IQQP+ WGGEPELLMASHVLKTPISV+M   SS +L+ IA YGEEY+ 
Sbjct: 236  FIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTPISVYMIHRSSRNLIKIADYGEEYQK 295

Query: 1016 DEEISINVLFHRYGHYDILET 1078
            D+E  INVLFH YGHYDILE+
Sbjct: 296  DKETPINVLFHGYGHYDILES 316


>XP_018845374.1 PREDICTED: uncharacterized protein LOC109009371 [Juglans regia]
          Length = 328

 Score =  313 bits (803), Expect = e-101
 Identities = 181/342 (52%), Positives = 207/342 (60%), Gaps = 33/342 (9%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIF-----SFLHASSAARLAHGTAYSSASPRFSRPGHDG---ARRQHS 307
            MLGVLCA RPKPWI      SF+H S+A     G   S        PG +G    RR HS
Sbjct: 1    MLGVLCA-RPKPWILTSLSSSFVHGSAAHHHITGLRQS--------PGFNGDLKPRRHHS 51

Query: 308  SSCELRGXXXXXXX-IWHAIMPCGGDGFRRGVVAVHHD---HELKGEGSWNVAWDARPAR 475
            S+C + G        IWHAIMPCG  G    ++   +     E +GEGSWNVAWDARPAR
Sbjct: 52   SACRIDGSFGGGAASIWHAIMPCGAAGHPSDLLLRRNAMLRRERRGEGSWNVAWDARPAR 111

Query: 476  WLHSPD-SAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKE----QD 640
            WLH PD SAWLLFGVC CL                        E K+  CDS +    ++
Sbjct: 112  WLHRPDYSAWLLFGVCACLAPLDFAFDDSPEAIVV--------EAKIEACDSIDSNANKN 163

Query: 641  DEV----------------SSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGEL 772
            DE+                S+DYRVTGVLADGRCLFRALAHGAC ++GE AP+ENRQ EL
Sbjct: 164  DEIDGFDAIYSNTSKPKEGSADYRVTGVLADGRCLFRALAHGACSRSGEEAPDENRQREL 223

Query: 773  ADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVF 952
            AD                  WF+EGDFDAYV+RIQQPF WGGEPELLMASHVLKTPISVF
Sbjct: 224  ADELRAQVVDELLKRRKETEWFIEGDFDAYVERIQQPFVWGGEPELLMASHVLKTPISVF 283

Query: 953  MRDTSSIDLVNIAKYGEEYRNDEEISINVLFHRYGHYDILET 1078
            M++ SS  LVNIAKYGEEYR +E+  INVLFH YGHYD+LE+
Sbjct: 284  MKNRSSGRLVNIAKYGEEYRKEEDSPINVLFHGYGHYDLLES 325


>KHN37847.1 OTU domain-containing protein [Glycine soja]
          Length = 234

 Score =  310 bits (793), Expect = e-101
 Identities = 154/236 (65%), Positives = 171/236 (72%)
 Frame = +2

Query: 380  DGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLFGVCDCLXXXXXXXXXX 559
            DGFRRGVVA H   ++KGEGSWNVAWDARPARWLH PDSAWLLFGVC CL          
Sbjct: 8    DGFRRGVVAFH---DMKGEGSWNVAWDARPARWLHRPDSAWLLFGVCACLAPPSSCVDAD 64

Query: 560  XXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGE 739
                      S          D + ++DEVS+DYRVTGV ADGRCLFRA+AHGACL+NGE
Sbjct: 65   TNTDAIAVDES------CRLLDKEREEDEVSADYRVTGVPADGRCLFRAIAHGACLRNGE 118

Query: 740  AAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMA 919
             AP+ENRQ ELAD                  WF+EGDFD Y++RIQQP+ WGGEPELLMA
Sbjct: 119  KAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQRIQQPYVWGGEPELLMA 178

Query: 920  SHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLFHRYGHYDILETS*P 1087
            SHVLKTPISVFMRDT S++LVNIAKYGEEYRND++ISINVLFH YGHYDILET  P
Sbjct: 179  SHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFHGYGHYDILETLRP 234


>GAU40884.1 hypothetical protein TSUD_40590 [Trifolium subterraneum]
          Length = 266

 Score =  310 bits (794), Expect = e-101
 Identities = 165/272 (60%), Positives = 180/272 (66%), Gaps = 11/272 (4%)
 Frame = +2

Query: 152 MLGVLCATRPKPWIFSFLHASS----AARLAHGT-AYSSASPRFSRPGHDGARRQHSSSC 316
           MLGVLCATR +PWIFSFLH+SS     ARLAH T + SS  P FS      ARR HSS C
Sbjct: 1   MLGVLCATRSRPWIFSFLHSSSHHNHTARLAHATVSASSLCPTFS------ARRNHSSQC 54

Query: 317 ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 496
           +L+        IWHAIMPCGGDG ++G   VHHDHELKGEGSWNVAWDARPARWLH  DS
Sbjct: 55  KLQISTGGAASIWHAIMPCGGDGLQQGGFMVHHDHELKGEGSWNVAWDARPARWLHRSDS 114

Query: 497 AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEG---REVK-VAECDSKEQDDEVSS--D 658
           AWLLFGVC CL                     E    RE+K + + +S +  DE+SS  D
Sbjct: 115 AWLLFGVCACLAPPVDVEAEVPPLTTSVISPDENYKRREIKDIKDAESDKPSDELSSEAD 174

Query: 659 YRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWF 838
           YRVTGVLADGRCLFRA+AHGACLKNGE APNENRQ ELAD                  WF
Sbjct: 175 YRVTGVLADGRCLFRAIAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETEWF 234

Query: 839 LEGDFDAYVKRIQQPFAWGGEPELLMASHVLK 934
           +EGDFD YV RIQQ F WGGEPELLMASHVLK
Sbjct: 235 IEGDFDTYVTRIQQTFVWGGEPELLMASHVLK 266


>EOY19029.1 Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao]
          Length = 327

 Score =  310 bits (794), Expect = e-100
 Identities = 171/325 (52%), Positives = 197/325 (60%), Gaps = 16/325 (4%)
 Frame = +2

Query: 152  MLGVLCATRPKPWIFSFLHASSAARLAHG--TAYSSASPRFSRPGH-------DGARRQH 304
            MLGVLCA  PKPWI +     S + +AHG   A+   S     P H       D   R H
Sbjct: 1    MLGVLCARPPKPWILN-----SLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHH 55

Query: 305  SSSCELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLH 484
            S++C L G       IWHAI+PCGG G  R    V  + E KGEGSWNVAWDARPARWLH
Sbjct: 56   STACRLGGSDGGAASIWHAILPCGGGGGGRRRGEVWKNVERKGEGSWNVAWDARPARWLH 115

Query: 485  SPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS--- 655
             PDSAWLLFGVC CL                     EG E+ +    S ++    SS   
Sbjct: 116  RPDSAWLLFGVCACLAPMIEFVDVNPDADDKI----EGAELNLVSRLSADEKSSSSSSSV 171

Query: 656  ----DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXX 823
                + +VTGVLADGRCLFRA+AHGACL++GE AP+EN Q ELAD               
Sbjct: 172  AAADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVVNELLKRRE 231

Query: 824  XXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGE 1003
               WF+EGDFDAYVK IQQP+ WGGEPE+LMASHVLKTPISV+M   SS +L  IAKYGE
Sbjct: 232  ETEWFIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNLTKIAKYGE 291

Query: 1004 EYRNDEEISINVLFHRYGHYDILET 1078
            EY+ D+E  INVLFH YGHYDILE+
Sbjct: 292  EYQKDKENPINVLFHGYGHYDILES 316


Top