BLASTX nr result

ID: Cocculus23_contig00018430 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00018430
         (1401 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003632695.1| PREDICTED: OTU domain-containing protein At3...   338   3e-90
gb|EXC30911.1| OTU domain-containing protein [Morus notabilis]        333   1e-88
ref|XP_007028914.1| Cysteine proteinases superfamily protein iso...   326   1e-86
ref|XP_002534273.1| cysteine-type peptidase, putative [Ricinus c...   324   7e-86
ref|XP_004291162.1| PREDICTED: OTU domain-containing protein At3...   317   8e-84
ref|XP_007202322.1| hypothetical protein PRUPE_ppa008123mg [Prun...   317   8e-84
ref|XP_002323302.2| OTU-like cysteine protease family protein [P...   317   1e-83
ref|XP_004497941.1| PREDICTED: OTU domain-containing protein At3...   314   7e-83
emb|CAN60311.1| hypothetical protein VITISV_002512 [Vitis vinifera]   314   7e-83
ref|XP_006588483.1| PREDICTED: uncharacterized protein LOC100810...   311   6e-82
ref|XP_006490038.1| PREDICTED: OTU domain-containing protein At3...   308   4e-81
ref|XP_006421489.1| hypothetical protein CICLE_v10005351mg [Citr...   306   1e-80
ref|NP_001242273.1| uncharacterized protein LOC100810338 [Glycin...   306   1e-80
ref|XP_006421488.1| hypothetical protein CICLE_v10005351mg [Citr...   302   2e-79
ref|XP_007145652.1| hypothetical protein PHAVU_007G257000g [Phas...   301   4e-79
ref|XP_006381039.1| hypothetical protein POPTR_0006s05620g [Popu...   296   2e-77
ref|XP_002445909.1| hypothetical protein SORBIDRAFT_07g027850 [S...   295   4e-77
ref|XP_006850126.1| hypothetical protein AMTR_s00022p00229870 [A...   294   7e-77
ref|XP_007028911.1| Cysteine proteinases superfamily protein iso...   294   7e-77
ref|XP_007028913.1| Cysteine proteinases superfamily protein iso...   293   1e-76

>ref|XP_003632695.1| PREDICTED: OTU domain-containing protein At3g57810-like [Vitis
            vinifera]
          Length = 340

 Score =  338 bits (867), Expect = 3e-90
 Identities = 193/347 (55%), Positives = 231/347 (66%), Gaps = 5/347 (1%)
 Frame = -1

Query: 1203 MIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCCS----RGEFQPSYVAVTI 1036
            MI   PIST A+ +V L   V RQMS H   LVSQ  P+S  S     G  +P    +++
Sbjct: 1    MINCYPISTCARNIVRLSGCVQRQMSSHICSLVSQ-GPSSSFSFYFYTGHSKPKNTFMSV 59

Query: 1035 DGRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQNM 856
                  SS T   +FQG C  S                +  S  G   + L IS   QNM
Sbjct: 60   SETFSCSSITAFHTFQGSCFYSGLSKRRGSSRSLTVKSLIGSR-GPSKRSLNISLTCQNM 118

Query: 855  KVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDENDDM 676
             VRLL+PK   + KIK N GS S   G  SA G+ F L +C++ SEP ++E+++ + D  
Sbjct: 119  NVRLLVPKQGVLPKIKCNVGSVSWPQGCASA-GLMFALLVCYSSSEPVHAESAQKKEDKK 177

Query: 675  DDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLADE 496
             +    + N+ HGKKVYT+YSITGIPGDGRCLFRSV HGACLRSGKP PS S Q++LADE
Sbjct: 178  GE---CYTNS-HGKKVYTDYSITGIPGDGRCLFRSVVHGACLRSGKPAPSASCQRELADE 233

Query: 495  LRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMYD 316
            LRA V DEF++RR ETEWF+EGDFD YVSQ+RKPHVWGGEPELFMASHVL+MPITVYMYD
Sbjct: 234  LRAEVVDEFIRRRSETEWFIEGDFDTYVSQMRKPHVWGGEPELFMASHVLQMPITVYMYD 293

Query: 315  EKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIP-SKAAKSKL 178
            +   GLIAIAEY QEYGK +PIRVLYHGFGHY++LQIP  K AKS+L
Sbjct: 294  KDSGGLIAIAEYGQEYGKENPIRVLYHGFGHYESLQIPGKKGAKSRL 340


>gb|EXC30911.1| OTU domain-containing protein [Morus notabilis]
          Length = 893

 Score =  333 bits (854), Expect = 1e-88
 Identities = 186/352 (52%), Positives = 230/352 (65%), Gaps = 4/352 (1%)
 Frame = -1

Query: 1221 NSSYVNMIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCCS---RGEFQPSY 1051
            NS Y NMI    I    K + CL   +  +M      +VS+   +SCC     G  +  Y
Sbjct: 558  NSCYDNMIVCPSIGACTKSIACLSGNIQTEMGSKLCSVVSRRPYSSCCFCLYPGNSKTKY 617

Query: 1050 VAVTIDGRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISS 871
              +++     S+S   S +FQ   + SC               + S+      + L IS 
Sbjct: 618  AHLSVSKNHLSNS---SPTFQKSFVSSCFSTEKGRLWSLALKDLVSAAEPQRRR-LKISL 673

Query: 870  PHQNMKVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRD 691
             +  M +RLL+PK   + KI  N+G+          AG+  GL IC++ S+PA++E +R 
Sbjct: 674  ANTAMSIRLLVPKQRMLVKI--NSGT----------AGLLGGLLICYSSSKPAHAEVARS 721

Query: 690  ENDDMDDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQ 511
            ++D  DD DS++    HGKKVYT+YSITGIPGDGRCLFRSVAHGACLRSGKP PSESLQ+
Sbjct: 722  DDDSEDDCDSSYVKFSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPAPSESLQR 781

Query: 510  QLADELRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPIT 331
            +LAD LRA VADEF+KRREETEWFVEGDFD YV+Q+RKPHVWGGEPELFMASHVL MPIT
Sbjct: 782  ELADNLRARVADEFIKRREETEWFVEGDFDTYVAQMRKPHVWGGEPELFMASHVLLMPIT 841

Query: 330  VYMYDEKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIP-SKAAKSKL 178
            VYM+D    GLI IAEY QEYG  +PIRVLYHGFGHYDALQIP +KAAK++L
Sbjct: 842  VYMHDRDAGGLICIAEYGQEYGMENPIRVLYHGFGHYDALQIPGNKAAKARL 893


>ref|XP_007028914.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao]
            gi|590636687|ref|XP_007028915.1| Cysteine proteinases
            superfamily protein isoform 4 [Theobroma cacao]
            gi|590636690|ref|XP_007028916.1| Cysteine proteinases
            superfamily protein isoform 4 [Theobroma cacao]
            gi|508717519|gb|EOY09416.1| Cysteine proteinases
            superfamily protein isoform 4 [Theobroma cacao]
            gi|508717520|gb|EOY09417.1| Cysteine proteinases
            superfamily protein isoform 4 [Theobroma cacao]
            gi|508717521|gb|EOY09418.1| Cysteine proteinases
            superfamily protein isoform 4 [Theobroma cacao]
          Length = 340

 Score =  326 bits (836), Expect = 1e-86
 Identities = 183/347 (52%), Positives = 224/347 (64%), Gaps = 5/347 (1%)
 Frame = -1

Query: 1203 MIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCCS----RGEFQPSYVAVTI 1036
            M+  SPIST AK VV L   +G  +       V    P+S C      G  +  Y  +++
Sbjct: 1    MMVCSPISTCAKNVVHLRGHMGSSLCS-----VISCQPSSSCYYFSYSGHPKTKYTDLSV 55

Query: 1035 DGRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQNM 856
                  S A G  +FQ  C  S               I D +      + L IS P Q+M
Sbjct: 56   SYTTSGSPAVGYRAFQAGCFRSSRRSRKLQSLVVKESISDKTKQK---RQLEISWPGQSM 112

Query: 855  KVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDENDDM 676
            K++ LLPK   + K K   G  S   G  S  G+ FGL +C++ SEP ++EA+  + D  
Sbjct: 113  KMKFLLPKQGTLQKFKCTAGPISWSQGCASV-GLVFGLLVCYSSSEPVHAEAAGAKEDKQ 171

Query: 675  DDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLADE 496
            DD +S+ A   HGKKVYT+YS+ GIPGDGRC+FRSVAHGACLRSGK  PSE +Q++LAD+
Sbjct: 172  DDCESSHAKFSHGKKVYTDYSVIGIPGDGRCMFRSVAHGACLRSGKSAPSEHVQRELADD 231

Query: 495  LRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMYD 316
            LRA VADEF+KRR+ETEWFVEG+FD YVSQIRKPHVWGGEPELFMASHVL+MPITVYMYD
Sbjct: 232  LRAKVADEFIKRRKETEWFVEGNFDAYVSQIRKPHVWGGEPELFMASHVLQMPITVYMYD 291

Query: 315  EKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPS-KAAKSKL 178
            +   GLIAIAEY QEYG  +PIRVLYHGFGHYDALQ+   ++ KSKL
Sbjct: 292  KGAGGLIAIAEYGQEYGTENPIRVLYHGFGHYDALQMRGRRSGKSKL 338


>ref|XP_002534273.1| cysteine-type peptidase, putative [Ricinus communis]
            gi|223525596|gb|EEF28110.1| cysteine-type peptidase,
            putative [Ricinus communis]
          Length = 343

 Score =  324 bits (830), Expect = 7e-86
 Identities = 184/350 (52%), Positives = 227/350 (64%), Gaps = 8/350 (2%)
 Frame = -1

Query: 1203 MIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCC---SRGEFQPSYVAVTID 1033
            MI  SPIST A++VV L     + M      +VS     SCC    R     SY  ++I 
Sbjct: 1    MIVCSPISTYARKVVYL-SGCAQHMGSTIFNMVSNGQSTSCCFCSCRAHLSKSYARLSIS 59

Query: 1032 GRPPSSSA----TGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPH 865
                S S     T + +F G   GS                   +T G   K   +S  +
Sbjct: 60   KTFSSPSVGTCQTSNKNFSGS--GSAKQSGSWQSITVKGLF---NTRGPLKKHFNLSLAY 114

Query: 864  QNMKVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDEN 685
            QN+ +R  L K   ++KIK N GS S      S  G+  GL +C++ SEP  +EA+  E 
Sbjct: 115  QNLNMRFSLSKRGMLSKIKDNVGSISWAQECAST-GLICGLLVCYSSSEPTRAEAAAREK 173

Query: 684  DDMDDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQL 505
            D+ D+ D ++    HGK+VYT+YSITGIPGDGRCLFRSVAHGA LR+GKP PSESLQ++L
Sbjct: 174  DEEDNSDLSYVKFSHGKRVYTDYSITGIPGDGRCLFRSVAHGASLRTGKPAPSESLQREL 233

Query: 504  ADELRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVY 325
            AD+LRA VADEF++RR+ETEWF+EGDFD YV+Q+RKPHVWGGEPELFMASHVL+MPITVY
Sbjct: 234  ADDLRARVADEFIRRRQETEWFIEGDFDTYVAQMRKPHVWGGEPELFMASHVLKMPITVY 293

Query: 324  MYDEKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPS-KAAKSKL 178
            MYD+  RGLI+IAEY +EYGK++PIRVLYHGFGHYDALQIP  K  K KL
Sbjct: 294  MYDQNARGLISIAEYGEEYGKDNPIRVLYHGFGHYDALQIPGRKGGKPKL 343


>ref|XP_004291162.1| PREDICTED: OTU domain-containing protein At3g57810-like [Fragaria
            vesca subsp. vesca]
          Length = 343

 Score =  317 bits (812), Expect = 8e-84
 Identities = 186/351 (52%), Positives = 227/351 (64%), Gaps = 6/351 (1%)
 Frame = -1

Query: 1212 YVNMIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCCSR---GEFQPSYVAV 1042
            YVN +  + I+  A  VVC+   +  QM      +VS+   +S C R   G+    +  +
Sbjct: 10   YVNTVVGTHINQGANNVVCMSGCIEMQMGSKICSVVSRGASSSYCYRLQPGKSGNKFGTL 69

Query: 1041 TIDGRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGL--KWLGISSP 868
            ++    PS   TG     G C  SC                 S TV      K L IS  
Sbjct: 70   SLTKSRPSE--TGQTP-HGSCFRSCFSMDRGNSR--------SLTVNAKRTQKCLEISLA 118

Query: 867  HQNMKVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDE 688
             + MK R+L+P+   + KIK N G  S    G   AG+ FGL IC + SEPA++E +   
Sbjct: 119  CRGMKTRILVPRQGMLPKIKCNVGPMSWTQCG--YAGLMFGLLICNS-SEPAHAETTHKN 175

Query: 687  NDDMDDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQ 508
            +D  DD D +++   HGKKV+T+YSI GIPGDGRCLFRSVAHGACLR+GK  PS+SLQ++
Sbjct: 176  DDKEDDGDLSYS---HGKKVHTDYSIIGIPGDGRCLFRSVAHGACLRAGKSAPSQSLQRE 232

Query: 507  LADELRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITV 328
            LAD+LRA VADEF+KRREETEWFVEGDFD YVSQIRKPHVWGGEPEL MASHVL+MPITV
Sbjct: 233  LADDLRARVADEFIKRREETEWFVEGDFDTYVSQIRKPHVWGGEPELLMASHVLQMPITV 292

Query: 327  YMYDEKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPS-KAAKSKL 178
            YM+DEK  GLI IAEY QEYGK +PIRVLYHGFGHYDAL IP  ++ KS+L
Sbjct: 293  YMHDEKAGGLITIAEYGQEYGKENPIRVLYHGFGHYDALHIPGVRSGKSRL 343


>ref|XP_007202322.1| hypothetical protein PRUPE_ppa008123mg [Prunus persica]
            gi|462397853|gb|EMJ03521.1| hypothetical protein
            PRUPE_ppa008123mg [Prunus persica]
          Length = 344

 Score =  317 bits (812), Expect = 8e-84
 Identities = 181/339 (53%), Positives = 214/339 (63%)
 Frame = -1

Query: 1212 YVNMIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCCSRGEFQPSYVAVTID 1033
            +VN I   PI+   K VVCL      QM      +VS+   +SCC     Q       I 
Sbjct: 10   FVNTIVCPPINHSPKNVVCLSGCTQIQMGSKICSVVSRGASSSCCKG--LQTGKTGTKIF 67

Query: 1032 GRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQNMK 853
              P S +   ++       G+C                     G     L IS   + M 
Sbjct: 68   SLPLSKNRPTNIGQTSH--GNCFRFFFSKDSRSLTVNAGGPNKGS----LEISLACRGMN 121

Query: 852  VRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDENDDMD 673
             RLL+P+   + KIK N G  S   G  SA G+ FGL +C  CS PA++EA+  E D+ D
Sbjct: 122  TRLLVPRQGMLPKIKCNVGPVSWPQGCASA-GLIFGLLVC-NCSGPAHAEAAHRE-DEED 178

Query: 672  DFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLADEL 493
            D D ++     GKKVYT+YSI GIPGDGRCLFRSVAHGA LR+GK  P+ESLQ++LAD+L
Sbjct: 179  DNDLSYVKFSRGKKVYTDYSIIGIPGDGRCLFRSVAHGAYLRAGKAAPAESLQRELADDL 238

Query: 492  RATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMYDE 313
            RA VADEF+KRREETEWFVEGDFD YVSQIR+PHVWGGEPELFMASHVL+MPITVYMYDE
Sbjct: 239  RARVADEFIKRREETEWFVEGDFDTYVSQIRRPHVWGGEPELFMASHVLKMPITVYMYDE 298

Query: 312  KYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPSK 196
            K  GLI IAEY QEYGK +PI+VLYHGFGHYDAL+IP K
Sbjct: 299  KAGGLITIAEYGQEYGKENPIKVLYHGFGHYDALRIPGK 337


>ref|XP_002323302.2| OTU-like cysteine protease family protein [Populus trichocarpa]
            gi|550320875|gb|EEF05063.2| OTU-like cysteine protease
            family protein [Populus trichocarpa]
          Length = 342

 Score =  317 bits (811), Expect = 1e-83
 Identities = 179/346 (51%), Positives = 226/346 (65%), Gaps = 4/346 (1%)
 Frame = -1

Query: 1203 MIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCCSR---GEFQPSYVAVTID 1033
            MI  SPIST  K VV L  RV +QM      +VS     SCC     G  + SY  +++ 
Sbjct: 1    MIVCSPISTCVKNVVHLSSRV-QQMGSTILNVVSGGQTTSCCFSSYPGLSRSSYSRLSVS 59

Query: 1032 GRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQNMK 853
             +  S  +    + Q  C GS                V  S  G   +   IS P Q M 
Sbjct: 60   -KTFSCPSISYQTIQSNCFGSVLTKQRADLQSFSVKGVVRSR-GPLKRQFNISLPCQIMN 117

Query: 852  VRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDENDDMD 673
            +R  + K   ++KI  NTGS S   G  +  G+ FGL +C++ SEP ++EA+  +N++ D
Sbjct: 118  LRFSVSKQGVLSKINDNTGSISWSQGYPTT-GIIFGLLVCYSSSEPTHAEAATHKNEEED 176

Query: 672  DFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLADEL 493
            + + +     HGK+VY +YSI GIPGDGRCLFRSVAHGAC+RSGKP PSE+LQ++LAD+L
Sbjct: 177  NCNLSDIKFSHGKEVYRDYSIIGIPGDGRCLFRSVAHGACIRSGKPAPSENLQRELADDL 236

Query: 492  RATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMYDE 313
            R+ VADEF+KRREETEWF+EG+FD YVS+IRKPHVWGGEPEL MASHVL+MPITVYM D+
Sbjct: 237  RSKVADEFIKRREETEWFIEGNFDTYVSRIRKPHVWGGEPELLMASHVLKMPITVYMDDK 296

Query: 312  KYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIP-SKAAKSKL 178
               GLI+IAEY QEYGK DPIR++YHGFGHYDALQ P ++  KSKL
Sbjct: 297  NSGGLISIAEYGQEYGKEDPIRIIYHGFGHYDALQFPRTRGGKSKL 342


>ref|XP_004497941.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cicer
            arietinum]
          Length = 337

 Score =  314 bits (804), Expect = 7e-83
 Identities = 180/345 (52%), Positives = 220/345 (63%), Gaps = 8/345 (2%)
 Frame = -1

Query: 1188 PISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCCSRGEFQP-----SYVAVTIDGRP 1024
            P+S  +   V +  R    MS +   L S+    SC     F P     +YV ++I  +P
Sbjct: 6    PVSQSSISAVVVKGRTQLLMSSNICGLQSRGI--SCSFSSGFYPGKSGKNYVGLSICTKP 63

Query: 1023 PSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQNMKVRL 844
              S+  G  + +G  LGSC                  +++    K   IS   Q+M +RL
Sbjct: 64   SCSTVMGQ-TIRGGYLGSCCSKQRGSTQLF-------NSIVSRKKHREISLACQSMSMRL 115

Query: 843  LLPKHDKITKIKWNTGSGSRLYGGGSAAGVAF--GLSICFACSEPAYSEASRDENDDMDD 670
            L+PK   ++K+K N G   R+    S A V F  GL +C   SEPA++EA  +     DD
Sbjct: 116  LVPKQKMLSKVKCNVG---RINWPRSCASVGFIFGLFVCNLSSEPAHAEADYENRKRNDD 172

Query: 669  FDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLADELR 490
             D T     HGK+VYT+YS+ GIPGDGRCLFRSVAHGA LRSGKP PSE  Q++LAD+LR
Sbjct: 173  CDETNVKVSHGKQVYTDYSVIGIPGDGRCLFRSVAHGASLRSGKPPPSERFQRELADDLR 232

Query: 489  ATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMYDEK 310
            A VADEFVKRREETEWF+EGDFD Y+SQIRKPHVWGGEPELF+ASHVL+MPITVYMYD+ 
Sbjct: 233  AKVADEFVKRREETEWFIEGDFDSYISQIRKPHVWGGEPELFIASHVLQMPITVYMYDQD 292

Query: 309  YRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPS-KAAKSKL 178
              GLI+IAEY QEYGK +PIRVLYHGFGHYDAL IP  K  KS+L
Sbjct: 293  AGGLISIAEYGQEYGKENPIRVLYHGFGHYDALDIPKRKGPKSRL 337


>emb|CAN60311.1| hypothetical protein VITISV_002512 [Vitis vinifera]
          Length = 806

 Score =  314 bits (804), Expect = 7e-83
 Identities = 159/237 (67%), Positives = 187/237 (78%), Gaps = 1/237 (0%)
 Frame = -1

Query: 885  LGISSPHQNMKVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYS 706
            L IS   QNM VRLL+PK   + KIK N GS S   G  SA G+ F L +C++ SEP ++
Sbjct: 575  LNISLTCQNMNVRLLVPKQGVLPKIKCNVGSVSWPQGCASA-GLMFALLVCYSSSEPVHA 633

Query: 705  EASRDENDDMDDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPS 526
            E+++ + D   +    + N+ HGKKVYT+YSITGIPGDGRCLFRSV HGACLRSGKP PS
Sbjct: 634  ESAQKKEDKKGE---CYTNS-HGKKVYTDYSITGIPGDGRCLFRSVVHGACLRSGKPAPS 689

Query: 525  ESLQQQLADELRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVL 346
             S Q++LADELRA V DEF++RR ETEWF+EGDFD YVSQ+RKPHVWGGEPELFMASHVL
Sbjct: 690  ASCQRELADELRAEVVDEFIRRRSETEWFIEGDFDTYVSQMRKPHVWGGEPELFMASHVL 749

Query: 345  RMPITVYMYDEKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIP-SKAAKSKL 178
            +MPITVYMYD+   GLIAIAEY QEYGK +PIRVLYHGFGHY++LQIP  K AKS+L
Sbjct: 750  QMPITVYMYDKDSGGLIAIAEYGQEYGKENPIRVLYHGFGHYESLQIPGKKGAKSRL 806


>ref|XP_006588483.1| PREDICTED: uncharacterized protein LOC100810338 isoform X1 [Glycine
            max]
          Length = 339

 Score =  311 bits (796), Expect = 6e-82
 Identities = 168/301 (55%), Positives = 215/301 (71%), Gaps = 3/301 (0%)
 Frame = -1

Query: 1089 NSCCSRGEFQPSYVAVTIDGRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSS 910
            +S  S G+ + S+V +++  +   S+  G  + +G  LGSC                  S
Sbjct: 42   SSSLSPGKSEISHVGLSVCTKLSCSTVMGQ-TIRGGFLGSCCSKQRGNPRFF-------S 93

Query: 909  TVGCGLKWLGISSPHQNMKVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICF 730
            +V    ++  IS   Q + +RLL+PK + + K+K N GS S   G  S  G+ FGL +C 
Sbjct: 94   SVVPRKRYHEISLACQTINMRLLVPKQNMMRKVKCNLGSVSWPRGCASV-GLIFGLLVCN 152

Query: 729  ACSEPAYSEA-SRDEN--DDMDDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHG 559
              SEPA++E+ S +EN  DD ++++S     +HGKKVYT+YS+ GIPGDGRCLFRSVA G
Sbjct: 153  LSSEPAHAESHSENENRKDDCNEYESN-VKVLHGKKVYTDYSVIGIPGDGRCLFRSVARG 211

Query: 558  ACLRSGKPYPSESLQQQLADELRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGG 379
            ACLRSGKP P+ES+Q++LAD+LRA VADEF+KR+EETEWFVEGDFD YVSQIRKPHVWGG
Sbjct: 212  ACLRSGKPPPNESIQRELADDLRARVADEFIKRKEETEWFVEGDFDTYVSQIRKPHVWGG 271

Query: 378  EPELFMASHVLRMPITVYMYDEKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPS 199
            EPELF+ASHVL+MPITVYMYD+   GLI+IAEY QEYGK +PIRVLYHGFGHYDAL+IP 
Sbjct: 272  EPELFIASHVLQMPITVYMYDKDAGGLISIAEYGQEYGKENPIRVLYHGFGHYDALEIPR 331

Query: 198  K 196
            +
Sbjct: 332  R 332


>ref|XP_006490038.1| PREDICTED: OTU domain-containing protein At3g57810-like [Citrus
            sinensis]
          Length = 341

 Score =  308 bits (789), Expect = 4e-81
 Identities = 178/347 (51%), Positives = 223/347 (64%), Gaps = 5/347 (1%)
 Frame = -1

Query: 1203 MIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCCSR---GEFQPSYVAVTID 1033
            MI  + I   AK VV L  R   QM G+   +  +   +SCC     G+ + +Y  ++  
Sbjct: 1    MIVSTSICACAKNVVNLGGRFQGQMGGNICGVTYRGPSSSCCFHLCSGQSKKNYTGIS-- 58

Query: 1032 GRPPSSSATGSLS-FQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQNM 856
             R  SSS+   L  FQ  C                      S  G   + + IS    +M
Sbjct: 59   -RTISSSSLNVLQPFQATCFSLGLTKPRCNLQPLTIRSFIGSR-GSQKRHIEISLACHSM 116

Query: 855  KVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDENDDM 676
            K+RLL+P    + K+K N G      G  SA G+  GL +C++ S+ A++EA+ ++ D  
Sbjct: 117  KMRLLVPNQGVLPKLKLNAGPIDWPKGCASA-GLICGLLVCYSSSK-AHAEAADEKEDGE 174

Query: 675  DDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLADE 496
            +D+D +     HGKKVYT+YS+ GIPGDGRCLFR+VAHGACLR+GKP PS S+Q++LAD+
Sbjct: 175  EDYDLSNVKYSHGKKVYTDYSVIGIPGDGRCLFRAVAHGACLRAGKPAPSVSIQRELADD 234

Query: 495  LRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMYD 316
            LRA VADEF+KRREETEWF+EGDFD YVSQIRKPHVWGGEPEL MASHVLRMPITVY++D
Sbjct: 235  LRAKVADEFIKRREETEWFIEGDFDLYVSQIRKPHVWGGEPELLMASHVLRMPITVYIHD 294

Query: 315  EKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPS-KAAKSKL 178
            +   GLI+IAEY QEYGK  PIRVLYHGFGHYDALQIP  K   SKL
Sbjct: 295  KDAGGLISIAEYGQEYGKEKPIRVLYHGFGHYDALQIPGRKGGISKL 341


>ref|XP_006421489.1| hypothetical protein CICLE_v10005351mg [Citrus clementina]
            gi|557523362|gb|ESR34729.1| hypothetical protein
            CICLE_v10005351mg [Citrus clementina]
          Length = 341

 Score =  306 bits (784), Expect = 1e-80
 Identities = 173/336 (51%), Positives = 219/336 (65%), Gaps = 4/336 (1%)
 Frame = -1

Query: 1203 MIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCC---SRGEFQPSYVAVTID 1033
            MI  + I   AK VV L  R   QM G+   +  +   +SCC     G+ + +Y  ++  
Sbjct: 1    MIVSTSICACAKNVVNLGGRFQGQMGGNICGVTYRGPSSSCCFYLCSGQSKKNYAGIS-- 58

Query: 1032 GRPPSSSATGSLS-FQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQNM 856
             R  SSS+   L  FQ  C                      S  G   + + IS   ++M
Sbjct: 59   -RTISSSSLNVLQPFQATCFSPGLTKPRCNLRPLTIRSFIGSR-GSQKRHIEISLACRSM 116

Query: 855  KVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDENDDM 676
            K+RLL+P    + K+K N G      G  SA G+  GL +C++ S+ A++EA+ ++ D  
Sbjct: 117  KMRLLVPSQGVLPKLKLNAGPIDWPKGCASA-GLICGLLVCYSSSK-AHAEAADEKEDGE 174

Query: 675  DDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLADE 496
            +D+D +    +HGKKVYT+YS+ GIPGDGRCLFR+VAHGACLR+GKP PS S+Q++LAD+
Sbjct: 175  EDYDLSNVKYLHGKKVYTDYSVIGIPGDGRCLFRAVAHGACLRAGKPAPSVSIQRELADD 234

Query: 495  LRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMYD 316
            LRA VADEF+KRREETEWF+EGDFD YVSQIRKPHVWGGEPEL MASHVLRMPITVYM+D
Sbjct: 235  LRAKVADEFIKRREETEWFIEGDFDLYVSQIRKPHVWGGEPELLMASHVLRMPITVYMHD 294

Query: 315  EKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQ 208
            +   GLI+IAEY QEYGK  PIRVLYHGFGHYDALQ
Sbjct: 295  KDAGGLISIAEYGQEYGKEKPIRVLYHGFGHYDALQ 330


>ref|NP_001242273.1| uncharacterized protein LOC100810338 [Glycine max]
            gi|255645865|gb|ACU23423.1| unknown [Glycine max]
          Length = 339

 Score =  306 bits (784), Expect = 1e-80
 Identities = 166/301 (55%), Positives = 214/301 (71%), Gaps = 3/301 (0%)
 Frame = -1

Query: 1089 NSCCSRGEFQPSYVAVTIDGRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSS 910
            +S  S G+ + S+V +++  +   S+  G  + +G  LGSC                  S
Sbjct: 42   SSSLSPGKSEISHVGLSVCTKLSCSTVMGQ-TIRGGFLGSCCSKQRGNPRFF-------S 93

Query: 909  TVGCGLKWLGISSPHQNMKVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICF 730
            +V    ++  IS   Q + +RLL+PK + + K+K N GS S   G  S  G+ FGL +C 
Sbjct: 94   SVVPRKRYHEISLACQTINMRLLVPKQNMMRKVKCNLGSVSWPRGCASV-GLIFGLLVCN 152

Query: 729  ACSEPAYSEA-SRDEN--DDMDDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHG 559
              SEPA++E+ S +EN  DD ++++S     +HGKKVYT+YS+ GIPGDGRCLFRSVA G
Sbjct: 153  LSSEPAHAESHSENENRKDDCNEYESN-VKVLHGKKVYTDYSVIGIPGDGRCLFRSVARG 211

Query: 558  ACLRSGKPYPSESLQQQLADELRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGG 379
            ACLRSGKP P+ES+Q++LAD+LRA VADEF+KR+EETEWFVEGDFD YVSQIRKPHVWGG
Sbjct: 212  ACLRSGKPPPNESIQRELADDLRARVADEFIKRKEETEWFVEGDFDTYVSQIRKPHVWGG 271

Query: 378  EPELFMASHVLRMPITVYMYDEKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPS 199
            E ELF+ASHVL+MPITVYMYD+   GLI+IAEY Q+YGK +PIRVLYHGFGHYDAL+IP 
Sbjct: 272  ESELFIASHVLQMPITVYMYDKDAGGLISIAEYGQKYGKENPIRVLYHGFGHYDALEIPR 331

Query: 198  K 196
            +
Sbjct: 332  R 332


>ref|XP_006421488.1| hypothetical protein CICLE_v10005351mg [Citrus clementina]
            gi|557523361|gb|ESR34728.1| hypothetical protein
            CICLE_v10005351mg [Citrus clementina]
          Length = 311

 Score =  302 bits (774), Expect = 2e-79
 Identities = 171/336 (50%), Positives = 216/336 (64%), Gaps = 4/336 (1%)
 Frame = -1

Query: 1203 MIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCC---SRGEFQPSYVAVTID 1033
            MI  + I   AK VV L  R   QM G+   +  +   +SCC     G+ + +Y  ++  
Sbjct: 1    MIVSTSICACAKNVVNLGGRFQGQMGGNICGVTYRGPSSSCCFYLCSGQSKKNYAGIS-- 58

Query: 1032 GRPPSSSATGSLS-FQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQNM 856
             R  SSS+   L  FQ  C                                G++ P  +M
Sbjct: 59   -RTISSSSLNVLQPFQATCFSP-----------------------------GLTKP--SM 86

Query: 855  KVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDENDDM 676
            K+RLL+P    + K+K N G      G  SA G+  GL +C++ S+ A++EA+ ++ D  
Sbjct: 87   KMRLLVPSQGVLPKLKLNAGPIDWPKGCASA-GLICGLLVCYSSSK-AHAEAADEKEDGE 144

Query: 675  DDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLADE 496
            +D+D +    +HGKKVYT+YS+ GIPGDGRCLFR+VAHGACLR+GKP PS S+Q++LAD+
Sbjct: 145  EDYDLSNVKYLHGKKVYTDYSVIGIPGDGRCLFRAVAHGACLRAGKPAPSVSIQRELADD 204

Query: 495  LRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMYD 316
            LRA VADEF+KRREETEWF+EGDFD YVSQIRKPHVWGGEPEL MASHVLRMPITVYM+D
Sbjct: 205  LRAKVADEFIKRREETEWFIEGDFDLYVSQIRKPHVWGGEPELLMASHVLRMPITVYMHD 264

Query: 315  EKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQ 208
            +   GLI+IAEY QEYGK  PIRVLYHGFGHYDALQ
Sbjct: 265  KDAGGLISIAEYGQEYGKEKPIRVLYHGFGHYDALQ 300


>ref|XP_007145652.1| hypothetical protein PHAVU_007G257000g [Phaseolus vulgaris]
            gi|561018842|gb|ESW17646.1| hypothetical protein
            PHAVU_007G257000g [Phaseolus vulgaris]
          Length = 339

 Score =  301 bits (772), Expect = 4e-79
 Identities = 165/302 (54%), Positives = 212/302 (70%), Gaps = 4/302 (1%)
 Frame = -1

Query: 1071 GEFQPSYVAVTIDGRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGL 892
            GE + ++V +++  +   S+  G  + +G  LGSC                  S+V    
Sbjct: 48   GESEINHVDLSVCTKLSCSTVMGQ-TIRGGFLGSCCSKQRGNTQFF-------SSVVPRK 99

Query: 891  KWLGISSPHQNMKVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPA 712
            ++  IS   Q++ +RL LPK   + K+K N G  S   G  S  G+ FGL +C + SEPA
Sbjct: 100  RYHEISLACQSVNMRLFLPKQKLLHKVKRNFGPVSWPRGCASV-GLIFGLLVCSSSSEPA 158

Query: 711  YSEA-SRDEN--DDMDDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSG 541
            ++E+ S +EN  DD + ++S      HGKKVYT+YS+ GIPGDGRCLFRSV+ GACLRSG
Sbjct: 159  HAESHSENENRKDDCNQYESN-VKVSHGKKVYTDYSVIGIPGDGRCLFRSVSRGACLRSG 217

Query: 540  KPYPSESLQQQLADELRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFM 361
            KP P+ES+Q++LAD+LRA VADEF+KRREETEWF+EGDFD Y+S IRKPHVWGGEPELF+
Sbjct: 218  KPPPTESVQRELADDLRARVADEFIKRREETEWFIEGDFDTYISHIRKPHVWGGEPELFI 277

Query: 360  ASHVLRMPITVYMYDEKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIP-SKAAKS 184
            ASHVL+MPITVYMYD++  GLI+IAEY QEYGK +PIRVLYHGFGHYDAL+IP  K  K 
Sbjct: 278  ASHVLQMPITVYMYDKEAGGLISIAEYGQEYGKENPIRVLYHGFGHYDALEIPIRKGPKP 337

Query: 183  KL 178
            +L
Sbjct: 338  RL 339


>ref|XP_006381039.1| hypothetical protein POPTR_0006s05620g [Populus trichocarpa]
            gi|550335541|gb|ERP58836.1| hypothetical protein
            POPTR_0006s05620g [Populus trichocarpa]
          Length = 338

 Score =  296 bits (757), Expect = 2e-77
 Identities = 171/337 (50%), Positives = 216/337 (64%), Gaps = 5/337 (1%)
 Frame = -1

Query: 1203 MIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCC-----SRGEFQPSYVAVT 1039
            MI  S I+T  K VV L  RV +QM      +VS+    S C     SR     S ++V+
Sbjct: 1    MIVCSAINTCVKNVVHLSGRV-QQMGSTILNVVSRGQSTSRCFSLYPSRSRSNYSRLSVS 59

Query: 1038 IDGRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQN 859
                 PS S     +    C GS                V +S  G   +   IS P QN
Sbjct: 60   KTFSCPSISFH---TLHRNCFGSDSIKQRYNLVSLTVKGVVNSG-GPLKRQFNISLPSQN 115

Query: 858  MKVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDENDD 679
            M +R  + K   + KIK N GS S      +  G+ FGL +C++ SEP ++E++  +N +
Sbjct: 116  MALRFSVSKRGLLAKIKGNVGSVS-CSQRHTTTGIFFGLLVCYSSSEPTHAESATRKNKE 174

Query: 678  MDDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLAD 499
             D  +S+     HGK+VYT+YSI G+PGDGRCLFRSVAHGACLR GK  PSESLQ++LAD
Sbjct: 175  EDICNSSDIKFSHGKEVYTDYSIIGVPGDGRCLFRSVAHGACLRFGKRAPSESLQRELAD 234

Query: 498  ELRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMY 319
            +LR+ VADEF+KRRE+TEWF+EG+FD YVSQ+RKPHVWGGEPEL MASHVL+MPITVYM+
Sbjct: 235  DLRSNVADEFIKRREDTEWFIEGNFDSYVSQMRKPHVWGGEPELLMASHVLKMPITVYMH 294

Query: 318  DEKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQ 208
            D+  RGLI+IAEY QEYG  +PIRV+Y+GFGHYDALQ
Sbjct: 295  DKNARGLISIAEYGQEYGVENPIRVIYNGFGHYDALQ 331


>ref|XP_002445909.1| hypothetical protein SORBIDRAFT_07g027850 [Sorghum bicolor]
           gi|241942259|gb|EES15404.1| hypothetical protein
           SORBIDRAFT_07g027850 [Sorghum bicolor]
          Length = 309

 Score =  295 bits (754), Expect = 4e-77
 Identities = 147/235 (62%), Positives = 178/235 (75%), Gaps = 1/235 (0%)
 Frame = -1

Query: 882 GISSPHQNMKVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSIC-FACSEPAYS 706
           G+S+   ++ V+L +P H+K ++I WN  +     GG +A G+ FG S+   AC+E    
Sbjct: 80  GLSTREGSLSVKLDIPSHEK-SRIGWNWKNMHHKIGG-AAGGLCFGFSVTGLACAEVPVI 137

Query: 705 EASRDENDDMDDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPS 526
                   D  +  S+  ++ HGKKVYT+YS+TGIPGDGRCLFRSV HGAC+RSG+P P+
Sbjct: 138 RIK-----DNAETSSSSTSSTHGKKVYTDYSVTGIPGDGRCLFRSVVHGACIRSGRPIPN 192

Query: 525 ESLQQQLADELRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVL 346
           E LQ++LADELRA VADEFVKRREETEWFVEGDFD YVS IR+PHVWGGEPELFMASHVL
Sbjct: 193 EDLQRKLADELRAMVADEFVKRREETEWFVEGDFDTYVSHIREPHVWGGEPELFMASHVL 252

Query: 345 RMPITVYMYDEKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPSKAAKSK 181
           +MPITVYM DE   GLIAIAEY Q+YGK DPI+VLYHGFGHYDALQIP+K    +
Sbjct: 253 QMPITVYMRDEDAGGLIAIAEYGQQYGKEDPIQVLYHGFGHYDALQIPAKVGSKR 307


>ref|XP_006850126.1| hypothetical protein AMTR_s00022p00229870 [Amborella trichopoda]
           gi|548853724|gb|ERN11707.1| hypothetical protein
           AMTR_s00022p00229870 [Amborella trichopoda]
          Length = 244

 Score =  294 bits (752), Expect = 7e-77
 Identities = 145/233 (62%), Positives = 179/233 (76%), Gaps = 2/233 (0%)
 Frame = -1

Query: 891 KWLGISSPHQNMKVRLLLPKH--DKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSE 718
           K+LG S+   N+ ++L +  H    ++KI +     +R  GG SA  +AFG  +C A  E
Sbjct: 12  KYLGFSTICHNVNLKLSVTSHLSQSVSKISFLVKPRTRSRGGISAL-MAFGACVCCAHPE 70

Query: 717 PAYSEASRDENDDMDDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGK 538
              +E+   END   + D +   +VHGK VYT+YS+TGIPGDGRC+FRSVAHGACLRSGK
Sbjct: 71  QVKAESPVFENDHDSECDPSSVKSVHGKNVYTDYSVTGIPGDGRCMFRSVAHGACLRSGK 130

Query: 537 PYPSESLQQQLADELRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMA 358
           P P+ES+Q+++ADELRA VAD+FVKRR +TEWF+EGDFD YVSQIRKPHVWGGEPEL MA
Sbjct: 131 PPPNESVQREMADELRARVADQFVKRRSDTEWFIEGDFDTYVSQIRKPHVWGGEPELLMA 190

Query: 357 SHVLRMPITVYMYDEKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPS 199
           SHVL+MPITVYM+D+ Y GLIAIAEY QEYGK+DPI VLYHG+GHY+ALQ  S
Sbjct: 191 SHVLQMPITVYMHDDNYGGLIAIAEYGQEYGKDDPICVLYHGYGHYEALQFGS 243


>ref|XP_007028911.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao]
            gi|590636674|ref|XP_007028912.1| Cysteine proteinases
            superfamily protein isoform 1 [Theobroma cacao]
            gi|508717516|gb|EOY09413.1| Cysteine proteinases
            superfamily protein isoform 1 [Theobroma cacao]
            gi|508717517|gb|EOY09414.1| Cysteine proteinases
            superfamily protein isoform 1 [Theobroma cacao]
          Length = 317

 Score =  294 bits (752), Expect = 7e-77
 Identities = 173/347 (49%), Positives = 208/347 (59%), Gaps = 5/347 (1%)
 Frame = -1

Query: 1203 MIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCCS----RGEFQPSYVAVTI 1036
            M+  SPIST AK VV L   +G  +       V    P+S C      G  +  Y  +++
Sbjct: 1    MMVCSPISTCAKNVVHLRGHMGSSLCS-----VISCQPSSSCYYFSYSGHPKTKYTDLSV 55

Query: 1035 DGRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQNM 856
                  S A G  +FQ  C  S               I D +      + L IS P Q+M
Sbjct: 56   SYTTSGSPAVGYRAFQAGCFRSSRRSRKLQSLVVKESISDKTKQK---RQLEISWPGQSM 112

Query: 855  KVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDENDDM 676
            K++ LLPK   + K K   G  S                           EA+  + D  
Sbjct: 113  KMKFLLPKQGTLQKFKCTAGPISWS------------------------QEAAGAKEDKQ 148

Query: 675  DDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLADE 496
            DD +S+ A   HGKKVYT+YS+ GIPGDGRC+FRSVAHGACLRSGK  PSE +Q++LAD+
Sbjct: 149  DDCESSHAKFSHGKKVYTDYSVIGIPGDGRCMFRSVAHGACLRSGKSAPSEHVQRELADD 208

Query: 495  LRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMYD 316
            LRA VADEF+KRR+ETEWFVEG+FD YVSQIRKPHVWGGEPELFMASHVL+MPITVYMYD
Sbjct: 209  LRAKVADEFIKRRKETEWFVEGNFDAYVSQIRKPHVWGGEPELFMASHVLQMPITVYMYD 268

Query: 315  EKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPS-KAAKSKL 178
            +   GLIAIAEY QEYG  +PIRVLYHGFGHYDALQ+   ++ KSKL
Sbjct: 269  KGAGGLIAIAEYGQEYGTENPIRVLYHGFGHYDALQMRGRRSGKSKL 315


>ref|XP_007028913.1| Cysteine proteinases superfamily protein isoform 3 [Theobroma cacao]
            gi|508717518|gb|EOY09415.1| Cysteine proteinases
            superfamily protein isoform 3 [Theobroma cacao]
          Length = 324

 Score =  293 bits (750), Expect = 1e-76
 Identities = 173/355 (48%), Positives = 210/355 (59%), Gaps = 7/355 (1%)
 Frame = -1

Query: 1203 MIGLSPISTPAKQVVCLCERVGRQMSGHARFLVSQFTPNSCCS----RGEFQPSYVAVTI 1036
            M+  SPIST AK VV L   +G  +       V    P+S C      G  +  Y  +++
Sbjct: 1    MMVCSPISTCAKNVVHLRGHMGSSLCS-----VISCQPSSSCYYFSYSGHPKTKYTDLSV 55

Query: 1035 DGRPPSSSATGSLSFQGCCLGSCXXXXXXXXXXXXXXIVDSSTVGCGLKWLGISSPHQNM 856
                  S A G  +FQ  C  S               I D +      + L IS P Q+M
Sbjct: 56   SYTTSGSPAVGYRAFQAGCFRSSRRSRKLQSLVVKESISDKTKQK---RQLEISWPGQSM 112

Query: 855  KVRLLLPKHDKITKIKWNTGSGSRLYGGGSAAGVAFGLSICFACSEPAYSEASRDENDDM 676
            K++ LLPK   + K K   G  S                           EA+  + D  
Sbjct: 113  KMKFLLPKQGTLQKFKCTAGPISWS------------------------QEAAGAKEDKQ 148

Query: 675  DDFDSTFANTVHGKKVYTNYSITGIPGDGRCLFRSVAHGACLRSGKPYPSESLQQQLADE 496
            DD +S+ A   HGKKVYT+YS+ GIPGDGRC+FRSVAHGACLRSGK  PSE +Q++LAD+
Sbjct: 149  DDCESSHAKFSHGKKVYTDYSVIGIPGDGRCMFRSVAHGACLRSGKSAPSEHVQRELADD 208

Query: 495  LRATVADEFVKRREETEWFVEGDFDHYVSQIRKPHVWGGEPELFMASHVLRMPITVYMYD 316
            LRA VADEF+KRR+ETEWFVEG+FD YVSQIRKPHVWGGEPELFMASHVL+MPITVYMYD
Sbjct: 209  LRAKVADEFIKRRKETEWFVEGNFDAYVSQIRKPHVWGGEPELFMASHVLQMPITVYMYD 268

Query: 315  EKYRGLIAIAEYSQEYGKNDPIRVLYHGFGHYDALQIPSKAAKS---KL*HVFQE 160
            +   GLIAIAEY QEYG  +PIRVLYHGFGHYDALQ+  + + +    L H F+E
Sbjct: 269  KGAGGLIAIAEYGQEYGTENPIRVLYHGFGHYDALQMRGRRSVTVDHHLIHAFEE 323

