BLASTX nr result

ID: Jatropha_contig00031726 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00031726
         (667 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002525147.1| cysteine protease, putative [Ricinus communi...   295   1e-77
gb|EMJ15443.1| hypothetical protein PRUPE_ppa023515mg [Prunus pe...   260   2e-67
gb|ESR59982.1| hypothetical protein CICLE_v10015835mg [Citrus cl...   251   1e-64
ref|XP_004513101.1| PREDICTED: KDEL-tailed cysteine endopeptidas...   249   4e-64
ref|XP_004505522.1| PREDICTED: KDEL-tailed cysteine endopeptidas...   245   7e-63
ref|XP_004307286.1| PREDICTED: oryzain beta chain-like [Fragaria...   245   9e-63
ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula] gi...   241   1e-61
gb|EOX94180.1| Cysteine proteinases superfamily protein, putativ...   241   2e-61
ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula] gi...   236   4e-60
emb|CBI21941.3| unnamed protein product [Vitis vinifera]              233   5e-59
ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidas...   233   5e-59
ref|XP_004243398.1| PREDICTED: oryzain alpha chain-like [Solanum...   228   9e-58
ref|XP_006306197.1| hypothetical protein CARUB_v10011832mg [Caps...   227   2e-57
ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arab...   226   3e-57
ref|NP_563764.1| Cysteine proteinases superfamily protein [Arabi...   226   5e-57
ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like...   226   6e-57
gb|ESQ36293.1| hypothetical protein EUTSA_v10008102mg [Eutrema s...   224   2e-56
ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed ...   224   2e-56
ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidas...   221   1e-55
ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidas...   221   1e-55

>ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
           gi|223535606|gb|EEF37274.1| cysteine protease, putative
           [Ricinus communis]
          Length = 347

 Score =  295 bits (754), Expect = 1e-77
 Identities = 143/220 (65%), Positives = 171/220 (77%), Gaps = 2/220 (0%)
 Frame = +1

Query: 13  LKKAGXXXXXXXXXXXHTMADLEIHSLPRYN-PKAMRKRYDRWLKHHGRKYHNKDEYYLR 189
           +K AG            ++A  EIHSLP  + P AM+ RYD+WL+ +GRKY  KDEY LR
Sbjct: 7   IKNAGLMLITLCTLWIPSIARSEIHSLPIDSAPTAMKVRYDKWLEQYGRKYDTKDEYLLR 66

Query: 190 FGIYQSNIQFIDYINAQKLPYELRDNKFADLTNDEFKSIYLGFQANKHGRKKPSHEHGNS 369
           FGIY SNIQFI+YIN+Q L ++L DNKFADLTNDEF SIYLG+Q   + R+  SH H NS
Sbjct: 67  FGIYHSNIQFIEYINSQNLSFKLTDNKFADLTNDEFNSIYLGYQIRSYKRRNLSHMHENS 126

Query: 370 SDLPTSVDWIKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDT 549
           +DLP +VDW + GAVTP+KDQGQCGSCWAFSAVAAVEGINKIKTG LVSLSEQELVDCD 
Sbjct: 127 TDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGNLVSLSEQELVDCDV 186

Query: 550 HNDSQGCNGGFMETAFTYIQK-QGLSIEDDYPYEGKDGNC 666
           + D++GCNGGFME AFT+I+   GL+ E+DYPY+G DG+C
Sbjct: 187 NGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPYKGTDGSC 226


>gb|EMJ15443.1| hypothetical protein PRUPE_ppa023515mg [Prunus persica]
          Length = 343

 Score =  260 bits (665), Expect = 2e-67
 Identities = 118/190 (62%), Positives = 152/190 (80%)
 Frame = +1

Query: 97  RYNPKAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFA 276
           R +PKAM++RY+RWL+ +GR Y N++E   RFG+Y+SNI+F+D++N+Q L Y+L DNKFA
Sbjct: 35  RTDPKAMKERYERWLQKYGRIYKNREEAAYRFGVYKSNIEFVDFVNSQNLSYKLTDNKFA 94

Query: 277 DLTNDEFKSIYLGFQANKHGRKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWA 456
           D+TN EF   ++GFQ   H + K S++     +LPT+VDW K GAVTP+K+QGQCGSCWA
Sbjct: 95  DITNLEFTKTFMGFQTRSHPKTKFSYD--KDEELPTAVDWRKHGAVTPIKNQGQCGSCWA 152

Query: 457 FSAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQGLSIEDD 636
           FSAVAAVEGIN+IKTGKLVSLSEQELVDCD    ++GCNGG+ME AF++I+  GLS E D
Sbjct: 153 FSAVAAVEGINQIKTGKLVSLSEQELVDCDVKTGNEGCNGGYMEKAFSFIKDNGLSTEKD 212

Query: 637 YPYEGKDGNC 666
           YPY+G DG C
Sbjct: 213 YPYKGSDGIC 222


>gb|ESR59982.1| hypothetical protein CICLE_v10015835mg [Citrus clementina]
          Length = 341

 Score =  251 bits (642), Expect = 1e-64
 Identities = 118/191 (61%), Positives = 153/191 (80%), Gaps = 1/191 (0%)
 Frame = +1

Query: 97  RYNPKAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFA 276
           +Y+P++M +R++ WLK + R+Y ++DE+  RFGIY SN+Q+IDYIN+Q L ++L DNKFA
Sbjct: 32  KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFA 91

Query: 277 DLTNDEFKSIYLGFQANKHGRKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWA 456
           DL+N+EF S YLG+    +  + PS ++     LP SVDW KEGAVTPVK+QGQCGSCWA
Sbjct: 92  DLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKNQGQCGSCWA 148

Query: 457 FSAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQK-QGLSIED 633
           FSAVAAVEGINK+KTGKLVSLSEQELVDCD ++++QGCNGG+ME AF +I K  G++ ED
Sbjct: 149 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 208

Query: 634 DYPYEGKDGNC 666
           DYPY GK+  C
Sbjct: 209 DYPYRGKNDRC 219


>ref|XP_004513101.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like [Cicer
           arietinum]
          Length = 353

 Score =  249 bits (637), Expect = 4e-64
 Identities = 119/197 (60%), Positives = 149/197 (75%), Gaps = 2/197 (1%)
 Frame = +1

Query: 82  IHSLPRYNPKAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELR 261
           +H     +P+ M+KRY+ WLK HGR Y N++E+ +RF IYQSN++FI++ N+Q   Y+L 
Sbjct: 39  MHKNVSTDPEVMKKRYETWLKRHGRHYRNREEFEVRFDIYQSNVEFIEFYNSQNYSYKLT 98

Query: 262 DNKFADLTNDEFKSIYLGFQANKHGRKK-PSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQ 438
           DN+FADLTN+EFKS YLG+        +   H+HG   DLP ++DW K+GAVT VKDQG+
Sbjct: 99  DNRFADLTNEEFKSTYLGYLPRLRVETEFMYHQHG---DLPKNIDWRKKGAVTHVKDQGR 155

Query: 439 CGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQ- 615
           CGSCWAFSAVAAVEGINKIKTGKLVSLSEQEL+DCDT + ++GC GG ME AF+YI+K  
Sbjct: 156 CGSCWAFSAVAAVEGINKIKTGKLVSLSEQELIDCDTKSGNEGCEGGDMEIAFSYIKKHG 215

Query: 616 GLSIEDDYPYEGKDGNC 666
           GL    DYPYEG DG C
Sbjct: 216 GLDSSKDYPYEGTDGKC 232


>ref|XP_004505522.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like [Cicer
           arietinum]
          Length = 343

 Score =  245 bits (626), Expect = 7e-63
 Identities = 122/204 (59%), Positives = 147/204 (72%), Gaps = 2/204 (0%)
 Frame = +1

Query: 61  HTMADLEIHSLPRYNPKAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQ 240
           H  +D+E+          +RKR+  WLK HGRKY + +E+ +RFG+YQ+N+++I  IN Q
Sbjct: 33  HKSSDIEV----------LRKRFQGWLKRHGRKYKDNEEWEVRFGVYQANVEYIKCINLQ 82

Query: 241 KLPYELRDNKFADLTNDEFKSIYLGFQANKHGRKKPSH-EHGNSSDLPTSVDWIKEGAVT 417
           K  Y L DNKFADLTN+EF+S Y+G     H      + EHG   D+P S DW KEGAVT
Sbjct: 83  KNSYNLTDNKFADLTNEEFRSTYMGLSTRLHSHTGFRYDEHG---DIPDSKDWRKEGAVT 139

Query: 418 PVKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAF 597
            VKDQGQCGSCWAFSAVAAVEGI+KIKTG+LVSLSEQELVDCD  N +QGC GG METAF
Sbjct: 140 EVKDQGQCGSCWAFSAVAAVEGIHKIKTGELVSLSEQELVDCDVKNGNQGCEGGLMETAF 199

Query: 598 TYIQKQ-GLSIEDDYPYEGKDGNC 666
           T+I K  GL+ E +YPYEG DG C
Sbjct: 200 TFIVKNGGLTTEKEYPYEGVDGTC 223


>ref|XP_004307286.1| PREDICTED: oryzain beta chain-like [Fragaria vesca subsp. vesca]
          Length = 344

 Score =  245 bits (625), Expect = 9e-63
 Identities = 112/185 (60%), Positives = 144/185 (77%)
 Frame = +1

Query: 112 AMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFADLTND 291
           AM++RY+RWL  + R+Y N+DE+  RFGIYQSN++ ID+IN+Q L Y+L DN FAD+TN 
Sbjct: 41  AMKERYERWLAKYDRRYKNRDEWEHRFGIYQSNVELIDFINSQNLSYKLTDNVFADMTNQ 100

Query: 292 EFKSIYLGFQANKHGRKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAFSAVA 471
           EF + +LGF+A ++ + K  +E      LPT+VDW K G+VTPV+DQG+CGSCWAFSAVA
Sbjct: 101 EFTTTHLGFRAGRNPKTKFRYE--GMKGLPTTVDWRKNGSVTPVRDQGRCGSCWAFSAVA 158

Query: 472 AVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQGLSIEDDYPYEG 651
           AVEG++KI TGKLV LSEQELVDCD +  +QGC GGFME AF YI+K G++ + DYPY G
Sbjct: 159 AVEGLHKINTGKLVPLSEQELVDCDVNTGNQGCRGGFMENAFDYIRKYGITTQKDYPYTG 218

Query: 652 KDGNC 666
            DG C
Sbjct: 219 SDGTC 223


>ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
           gi|358347207|ref|XP_003637651.1| Cysteine proteinase
           [Medicago truncatula] gi|355503586|gb|AES84789.1|
           Cysteine proteinase [Medicago truncatula]
           gi|355508601|gb|AES89743.1| Cysteine proteinase
           [Medicago truncatula]
          Length = 345

 Score =  241 bits (616), Expect = 1e-61
 Identities = 118/197 (59%), Positives = 142/197 (72%), Gaps = 3/197 (1%)
 Frame = +1

Query: 85  HSLPRYNPKAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRD 264
           H     + +AM+KR+D W+K HGRKY + DE  +RFGIYQ+N+Q+I   NAQK  Y L D
Sbjct: 32  HKQKSSDVEAMKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTD 91

Query: 265 NKFADLTNDEFKSIYLGFQAN--KHGRKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQ 438
           NKFADLTN+EF+S Y+G       H       EHG   DLP S DW KEGAVT + DQGQ
Sbjct: 92  NKFADLTNEEFQSTYMGLSTRLRSHNTGFRYDEHG---DLPESKDWRKEGAVTEIMDQGQ 148

Query: 439 CGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTY-IQKQ 615
           CG CWAF+AVAAVEGINKIK+GKL+SLSEQEL+DCD  + +QGC GG META+T+ I+  
Sbjct: 149 CGGCWAFAAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENG 208

Query: 616 GLSIEDDYPYEGKDGNC 666
           GL+ E DYPYEG DG C
Sbjct: 209 GLTTEQDYPYEGVDGTC 225


>gb|EOX94180.1| Cysteine proteinases superfamily protein, putative [Theobroma
           cacao]
          Length = 340

 Score =  241 bits (614), Expect = 2e-61
 Identities = 109/189 (57%), Positives = 148/189 (78%), Gaps = 1/189 (0%)
 Frame = +1

Query: 103 NPKAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFADL 282
           +P+ +++RY+RWL  HGR+Y +K+E  LRFGIY+SN +FID IN+Q L ++L DNKFAD+
Sbjct: 31  DPENIQERYERWLVQHGRQYKDKEEMTLRFGIYKSNSEFIDSINSQNLSFKLTDNKFADM 90

Query: 283 TNDEFKSIYLGFQANKHGRKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAFS 462
           TN EF+S YLG  + +  R+    +H    +L T +DW ++GAVTP+KDQGQCGSCWAFS
Sbjct: 91  TNAEFRSAYLGSWSRRSPRESDEFQHDKHYNLSTYIDWREKGAVTPIKDQGQCGSCWAFS 150

Query: 463 AVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQ-GLSIEDDY 639
           AVAA+EGI KIKTG+L SLSEQEL+DCD +N++QGC GG+ME A+ +I K  G++ E++Y
Sbjct: 151 AVAAIEGIGKIKTGELTSLSEQELIDCDVNNENQGCKGGYMEKAYEFIIKNGGITTEENY 210

Query: 640 PYEGKDGNC 666
           PY G+DG C
Sbjct: 211 PYIGEDGIC 219


>ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
           gi|355501702|gb|AES82905.1| Cysteine proteinase
           [Medicago truncatula]
          Length = 338

 Score =  236 bits (602), Expect = 4e-60
 Identities = 112/198 (56%), Positives = 149/198 (75%), Gaps = 2/198 (1%)
 Frame = +1

Query: 79  EIHSLPRYNPKAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYEL 258
           EIH+    NP  M+KRY+ WLK +GR Y +++E+ +RF IYQSN+Q+I++ N+Q   Y+L
Sbjct: 23  EIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIYQSNVQYIEFYNSQNYSYKL 82

Query: 259 RDNKFADLTNDEFKSIYLGFQANKHGRKK-PSHEHGNSSDLPTSVDWIKEGAVTPVKDQG 435
            DN+FAD+TN+EFKS YLG+      + +   H+HG   +LP S+DW K+GAVT VKDQG
Sbjct: 83  IDNRFADITNEEFKSTYLGYLPRFRVQTEFRYHKHG---ELPKSIDWRKKGAVTHVKDQG 139

Query: 436 QCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQ 615
           +CGSCWAFSAVAAVEGINKIKT  LVSLSEQ+L+DCD  + ++GC GG M  AF YI+K 
Sbjct: 140 RCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKSGNEGCEGGDMYIAFNYIKKH 199

Query: 616 -GLSIEDDYPYEGKDGNC 666
            G++   +YPY+G+DGNC
Sbjct: 200 GGIATAKEYPYKGRDGNC 217


>emb|CBI21941.3| unnamed protein product [Vitis vinifera]
          Length = 354

 Score =  233 bits (593), Expect = 5e-59
 Identities = 109/186 (58%), Positives = 142/186 (76%), Gaps = 2/186 (1%)
 Frame = +1

Query: 115 MRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFADLTNDE 294
           M KRY+RWL  HGR+Y N+DE+   FGIYQSN++FI+YINAQ   + L DN+FAD+TN+E
Sbjct: 37  MEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNEE 96

Query: 295 FKSIYLGFQANKHGRK-KPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAFSAVA 471
           +K++Y+G   ++  RK + S +   S  LP SVDW K GAVTPV++QG+CGSCWAFS VA
Sbjct: 97  YKALYMGLGTSETSRKNQSSFKRERSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVA 156

Query: 472 AVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYI-QKQGLSIEDDYPYE 648
           AVEGINKI+TGKLVSLSEQEL+DCD  + ++GCNGG+M  AF +I Q  G++   +YPY 
Sbjct: 157 AVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYI 216

Query: 649 GKDGNC 666
           G+ G C
Sbjct: 217 GEQGIC 222


>ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
          Length = 358

 Score =  233 bits (593), Expect = 5e-59
 Identities = 109/186 (58%), Positives = 142/186 (76%), Gaps = 2/186 (1%)
 Frame = +1

Query: 115 MRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFADLTNDE 294
           M KRY+RWL  HGR+Y N+DE+   FGIYQSN++FI+YINAQ   + L DN+FAD+TN+E
Sbjct: 41  MEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNEE 100

Query: 295 FKSIYLGFQANKHGRK-KPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAFSAVA 471
           +K++Y+G   ++  RK + S +   S  LP SVDW K GAVTPV++QG+CGSCWAFS VA
Sbjct: 101 YKALYMGLGTSETSRKNQSSFKRERSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVA 160

Query: 472 AVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYI-QKQGLSIEDDYPYE 648
           AVEGINKI+TGKLVSLSEQEL+DCD  + ++GCNGG+M  AF +I Q  G++   +YPY 
Sbjct: 161 AVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYI 220

Query: 649 GKDGNC 666
           G+ G C
Sbjct: 221 GEQGIC 226


>ref|XP_004243398.1| PREDICTED: oryzain alpha chain-like [Solanum lycopersicum]
          Length = 342

 Score =  228 bits (582), Expect = 9e-58
 Identities = 109/183 (59%), Positives = 137/183 (74%), Gaps = 2/183 (1%)
 Frame = +1

Query: 124 RYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFADLTNDEFKS 303
           RY  W+K + RKY N+ E+ +RFGIYQSNIQFID+ N+  L Y L DN FAD+TN EF S
Sbjct: 35  RYQNWVKKYRRKYQNEAEWNMRFGIYQSNIQFIDFFNSLNLSYSLTDNAFADMTNREFNS 94

Query: 304 IYLGFQANKHGRKKPSHEHG-NSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAFSAVAAVE 480
           IYLG++     +K  ++    + S LP  VDW K+G VTP+KDQ  CGSCWAFSAVAA+E
Sbjct: 95  IYLGYEKPIQEQKDINNVTSYDISTLPIGVDWRKDGVVTPIKDQKSCGSCWAFSAVAAIE 154

Query: 481 GINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQ-GLSIEDDYPYEGKD 657
           GINKIKTGKLVSLSEQ+L+DCD ++D+QGCNGGFME+AF YI +  G++   +YPY GK+
Sbjct: 155 GINKIKTGKLVSLSEQQLMDCDVYSDNQGCNGGFMESAFDYIMENGGITTSKNYPYIGKE 214

Query: 658 GNC 666
             C
Sbjct: 215 QKC 217


>ref|XP_006306197.1| hypothetical protein CARUB_v10011832mg [Capsella rubella]
           gi|482574908|gb|EOA39095.1| hypothetical protein
           CARUB_v10011832mg [Capsella rubella]
          Length = 344

 Score =  227 bits (579), Expect = 2e-57
 Identities = 107/192 (55%), Positives = 143/192 (74%), Gaps = 3/192 (1%)
 Frame = +1

Query: 100 YNP-KAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFA 276
           Y+P K +++R++ WLK H + Y  KDE+ LRFGIYQSN+Q IDYIN+  LP++L DN+FA
Sbjct: 33  YDPHKTLKQRFENWLKTHSKLYGGKDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFA 92

Query: 277 DLTNDEFKSIYLGFQANKHGRKKPSHEHGN-SSDLPTSVDWIKEGAVTPVKDQGQCGSCW 453
           D+TN EFK+ +LG   +    +K      + + D+P +VDW K+GAVTP+++QG+CG CW
Sbjct: 93  DMTNSEFKAHFLGLNTSSLRIQKNQRPVCDPAGDVPAAVDWRKQGAVTPIRNQGKCGGCW 152

Query: 454 AFSAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQ-GLSIE 630
           AFSAVAA+EGINKIKTG LVSLSEQ+L+DCD    ++GC+GG METAF YI+   GL  +
Sbjct: 153 AFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEYIKTNGGLVTQ 212

Query: 631 DDYPYEGKDGNC 666
            DYPY G +G C
Sbjct: 213 TDYPYTGIEGTC 224


>ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata] gi|297335438|gb|EFH65855.1| hypothetical protein
           ARALYDRAFT_887827 [Arabidopsis lyrata subsp. lyrata]
          Length = 343

 Score =  226 bits (577), Expect = 3e-57
 Identities = 105/194 (54%), Positives = 147/194 (75%), Gaps = 5/194 (2%)
 Frame = +1

Query: 100 YNP-KAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFA 276
           Y+P K +++R+++WLK H + Y  +DE+ LRFGIYQSN+Q IDYIN+  LP++L DN+FA
Sbjct: 33  YDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFA 92

Query: 277 DLTNDEFKSIYLGFQANK---HGRKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGS 447
           D+TN EFK+ +LG   +    H +++P  +   + ++P +VDW  +GAVTP+++QG+CG 
Sbjct: 93  DMTNSEFKAHFLGLNTSSLRLHKKQRPVCDP--AGNVPDAVDWRTQGAVTPIRNQGKCGG 150

Query: 448 CWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQ-GLS 624
           CWAFSAVAA+EGINKIKTG LVSLSEQ+L+DCD    ++GC+GG METAF +I+   GL+
Sbjct: 151 CWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSNGGLT 210

Query: 625 IEDDYPYEGKDGNC 666
            E DYPY G +G C
Sbjct: 211 TETDYPYTGIEGTC 224


>ref|NP_563764.1| Cysteine proteinases superfamily protein [Arabidopsis thaliana]
           gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity
           to a cysteine endopeptidase 1 from Phaseolus vulgaris
           gb|U52970 and is a member of the papain cysteine
           protease family PF|00112 [Arabidopsis thaliana]
           gi|332189848|gb|AEE27969.1| Cysteine proteinases
           superfamily protein [Arabidopsis thaliana]
          Length = 343

 Score =  226 bits (576), Expect = 5e-57
 Identities = 105/194 (54%), Positives = 147/194 (75%), Gaps = 5/194 (2%)
 Frame = +1

Query: 100 YNP-KAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFA 276
           Y+P K +++R+++WLK H + Y  +DE+ LRFGIYQSN+Q IDYIN+  LP++L DN+FA
Sbjct: 33  YDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFA 92

Query: 277 DLTNDEFKSIYLGFQANK---HGRKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGS 447
           D+TN EFK+ +LG   +    H +++P  +   + ++P +VDW  +GAVTP+++QG+CG 
Sbjct: 93  DMTNSEFKAHFLGLNTSSLRLHKKQRPVCDP--AGNVPDAVDWRTQGAVTPIRNQGKCGG 150

Query: 448 CWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQ-GLS 624
           CWAFSAVAA+EGINKIKTG LVSLSEQ+L+DCD    ++GC+GG METAF +I+   GL+
Sbjct: 151 CWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLA 210

Query: 625 IEDDYPYEGKDGNC 666
            E DYPY G +G C
Sbjct: 211 TETDYPYTGIEGTC 224


>ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 306

 Score =  226 bits (575), Expect = 6e-57
 Identities = 114/190 (60%), Positives = 138/190 (72%), Gaps = 6/190 (3%)
 Frame = +1

Query: 115 MRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFADLTNDE 294
           MR R++RWLK + R Y +K+E+ +RFGIYQ+N+++I+  N+Q+  Y L DNKFADLTN+E
Sbjct: 1   MRVRFERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQEXSYNLTDNKFADLTNEE 60

Query: 295 FKSIYLGFQANKHGRKKPS-----HEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAF 459
           F S YLGF      R  P      HEH    DLP S DW KEGAV+ +KDQG CGSCWAF
Sbjct: 61  FVSPYLGFGT----RFLPHTGFMYHEH---EDLPESKDWRKEGAVSDIKDQGNCGSCWAF 113

Query: 460 SAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQ-GLSIEDD 636
           SAVAAVEGINKIK+GKLVSLSEQE  DCD  + +QGC GG M+TAF +I+K  GL+   D
Sbjct: 114 SAVAAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKD 173

Query: 637 YPYEGKDGNC 666
           YPYEG DG C
Sbjct: 174 YPYEGVDGTC 183


>gb|ESQ36293.1| hypothetical protein EUTSA_v10008102mg [Eutrema salsugineum]
          Length = 344

 Score =  224 bits (571), Expect = 2e-56
 Identities = 105/190 (55%), Positives = 143/190 (75%), Gaps = 4/190 (2%)
 Frame = +1

Query: 109 KAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFADLTN 288
           K ++KR+++WL+ HG  Y  KDE+ LRFGIYQSNIQ IDYIN+ +LP++L DN+FAD+TN
Sbjct: 37  KTLKKRFEKWLQTHGILYGGKDEWMLRFGIYQSNIQLIDYINSLQLPFKLADNRFADMTN 96

Query: 289 DEFKSIYLGFQANK---HGRKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAF 459
            EFK+ +LG   +    H    P  +   + ++  +VDW K+GAVTPV++QG+CG CWAF
Sbjct: 97  SEFKAHFLGLNTSSSRLHRNHMPVRDPA-AVNVSAAVDWRKQGAVTPVRNQGRCGGCWAF 155

Query: 460 SAVAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQ-GLSIEDD 636
           +AVAA+EGIN+IKTGKLVSLSEQ+L+DCDT   ++GC+GG META+ +I+   GL+ E D
Sbjct: 156 AAVAAIEGINQIKTGKLVSLSEQQLIDCDTGTYNKGCSGGLMETAYEFIKTNGGLTTETD 215

Query: 637 YPYEGKDGNC 666
           YPY   +G C
Sbjct: 216 YPYTAAEGTC 225


>ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP2-like [Glycine max]
          Length = 342

 Score =  224 bits (571), Expect = 2e-56
 Identities = 107/187 (57%), Positives = 140/187 (74%), Gaps = 2/187 (1%)
 Frame = +1

Query: 109 KAMRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFADLTN 288
           + MR RY+ WLK +G+KY NKDE+  RF IY++N+QFI+  N+Q   Y+L DNKF DLTN
Sbjct: 38  EVMRMRYESWLKKYGQKYRNKDEWEFRFEIYRANVQFIEVYNSQNYSYKLMDNKFVDLTN 97

Query: 289 DEFKSIYLGFQANKHGRKKPSHE-HGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAFSA 465
           +EF+ +YL +Q   H + +  ++ HG   DLP  +DW   GAVT +KDQG CGSCW+FSA
Sbjct: 98  EEFRRMYLVYQPRSHLQTRFMYQKHG---DLPKRIDWRTRGAVTXIKDQGHCGSCWSFSA 154

Query: 466 VAAVEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQ-GLSIEDDYP 642
           VA VE INKIKTGKLVSLSEQ+L+DCD  N ++GCNGG MET FT+I K+ GL+ + +YP
Sbjct: 155 VATVEDINKIKTGKLVSLSEQQLIDCDNRNGNEGCNGGHMET-FTFITKRGGLTTDKNYP 213

Query: 643 YEGKDGN 663
           Y+G DG+
Sbjct: 214 YQGSDGD 220


>ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
           [Cucumis sativus]
          Length = 314

 Score =  221 bits (563), Expect = 1e-55
 Identities = 101/184 (54%), Positives = 138/184 (75%)
 Frame = +1

Query: 115 MRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFADLTNDE 294
           ++ RY +W+  +GR+Y +++E+  RF IYQ+N+Q+ID  N+    + L +N FADLTN+E
Sbjct: 15  IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEE 74

Query: 295 FKSIYLGFQANKHGRKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAFSAVAA 474
           FK+ YLG++            +GN  +LPT+VDW +EGAVTP+K+QGQCGSCWAFSAVAA
Sbjct: 75  FKATYLGYKTVSI--PDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAVAA 132

Query: 475 VEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQGLSIEDDYPYEGK 654
           VEGINKIK GKL+SLSEQELVDCD  + +QGCNGG+M  AF +I++ GL+ E +YPY+G 
Sbjct: 133 VEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFIKRTGLTTEIEYPYQGA 192

Query: 655 DGNC 666
           +  C
Sbjct: 193 ESAC 196


>ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
           sativus]
          Length = 317

 Score =  221 bits (563), Expect = 1e-55
 Identities = 101/184 (54%), Positives = 138/184 (75%)
 Frame = +1

Query: 115 MRKRYDRWLKHHGRKYHNKDEYYLRFGIYQSNIQFIDYINAQKLPYELRDNKFADLTNDE 294
           ++ RY +W+  +GR+Y +++E+  RF IYQ+N+Q+ID  N+    + L +N FADLTN+E
Sbjct: 15  IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEE 74

Query: 295 FKSIYLGFQANKHGRKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAFSAVAA 474
           FK+ YLG++            +GN  +LPT+VDW +EGAVTP+K+QGQCGSCWAFSAVAA
Sbjct: 75  FKATYLGYKTVSI--PDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAVAA 132

Query: 475 VEGINKIKTGKLVSLSEQELVDCDTHNDSQGCNGGFMETAFTYIQKQGLSIEDDYPYEGK 654
           VEGINKIK GKL+SLSEQELVDCD  + +QGCNGG+M  AF +I++ GL+ E +YPY+G 
Sbjct: 133 VEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFIKRTGLTTEIEYPYQGA 192

Query: 655 DGNC 666
           +  C
Sbjct: 193 ESAC 196


Top