BLASTX nr result

ID: Sinomenium21_contig00001188 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00001188
         (1226 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002307688.2| cysteine protease family protein [Populus tr...   513   e-143
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          503   e-140
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   498   e-138
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   497   e-138
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   486   e-134
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   484   e-134
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   481   e-133
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   479   e-133
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   476   e-131
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   475   e-131
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   475   e-131
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   475   e-131
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   473   e-131
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  470   e-130
ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria...   470   e-130
ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [S...   468   e-129
ref|XP_006655467.1| PREDICTED: cysteine proteinase RD21a-like [O...   467   e-129
ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [A...   466   e-129
gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indi...   466   e-129
ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] g...   466   e-129

>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  513 bits (1320), Expect = e-143
 Identities = 232/327 (70%), Positives = 271/327 (82%)
 Frame = -1

Query: 1226 GFVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMD 1047
            G VGD+P SIDWRNKG VT VKDQG CGACWSFSATGAIEG+N+IVTGSLVSLSEQEL++
Sbjct: 109  GVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIE 168

Query: 1046 CDRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDV 867
            CD+SYN GCGGGLMDYAFQFV+NNHGID+E+DYPY+  D TCN++++KRRVVTID ++DV
Sbjct: 169  CDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDV 228

Query: 866  PSYNEKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVD 687
            P  NEK++L+AVA+QPVSVG+CGS+R FQ+YSKGIF+GPCSTSLDHAVLIVGYGSENGVD
Sbjct: 229  PENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVD 288

Query: 686  YWILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSL 507
            YWI+KNSWG  WGM GYMHMQRNSGN +GVCGINMLA                  T+C+L
Sbjct: 289  YWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNL 348

Query: 506  LSYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAG 327
            L+YC AGETCCC  +  GIC+SWKCCG+DSAVCCKD + CCP DYPVCDT  N C K AG
Sbjct: 349  LTYCAAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAG 408

Query: 326  NSTMVKGLEKKGSSWKFGGLDSLFEAW 246
            N+T ++ +E K +S KFG  +SL EAW
Sbjct: 409  NATRMEAIEGK-TSGKFGSWNSLPEAW 434


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  503 bits (1295), Expect = e-140
 Identities = 230/326 (70%), Positives = 260/326 (79%)
 Frame = -1

Query: 1223 FVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDC 1044
            FV DVP S+DWR  GAVT VKDQG CGACWSFSATGAIEG+N+IVTGSLVSLSEQEL+DC
Sbjct: 114  FVADVPASVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDC 173

Query: 1043 DRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVP 864
            D+SYN+GC GG+MDYAFQFV++NHGID+E+DYPYQ  DR+CN+ KLKR VVTIDG++DVP
Sbjct: 174  DKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVP 233

Query: 863  SYNEKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDY 684
              NEKE+LKAVA+QPVSVG+CGS+R FQLYSKGIF+GPCSTSLDHAVLIVGYGSENGVDY
Sbjct: 234  QNNEKELLKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDY 293

Query: 683  WILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLL 504
            WI+KNSWG  WGMDGYMHMQRNSG+  G+CGINMLA                  TRC L 
Sbjct: 294  WIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLF 353

Query: 503  SYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAGN 324
            ++CG GETCCC   + GICLSWKCC +DSAVCCKD   CCP DYPVCDT  N CLK  GN
Sbjct: 354  THCGEGETCCCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGN 413

Query: 323  STMVKGLEKKGSSWKFGGLDSLFEAW 246
            +T ++   K  SS KF    SL E W
Sbjct: 414  ATRIEKFAKNSSSGKFRSWSSLLEGW 439


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  498 bits (1281), Expect = e-138
 Identities = 225/328 (68%), Positives = 261/328 (79%), Gaps = 1/328 (0%)
 Frame = -1

Query: 1226 GFVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMD 1047
            G + DVP SIDWR KGAVT VKDQ  CGACW+FSATGAIEG+N+IVTGSLVSLSEQEL+D
Sbjct: 112  GTLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171

Query: 1046 CDRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDV 867
            CDRSYNSGCGGGLMDYA+QFV+ NHGID+EKDYPY+     CN+ KL R +VTIDG+ DV
Sbjct: 172  CDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231

Query: 866  PSYNEKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVD 687
            P  NEK++L+AV +QPVSVG+CGS+R FQLYS GIF+GPCSTSLDHAVLIVGY SENGVD
Sbjct: 232  PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVD 291

Query: 686  YWILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSL 507
            YWI+KNSWG++WGM+GYMHMQRN+GN  G+CGINMLA                  TRCSL
Sbjct: 292  YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSL 351

Query: 506  LSYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCL-KVA 330
            L+YC AGETCCCG  +LGICLSWKCCG  SAVCC DH  CCP +YP+CD+V +QCL +  
Sbjct: 352  LTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFT 411

Query: 329  GNSTMVKGLEKKGSSWKFGGLDSLFEAW 246
            GN T  + +E +GSSWKFG   S  + W
Sbjct: 412  GNVTAAEAIEMRGSSWKFGSWSSFIDVW 439


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  497 bits (1280), Expect = e-138
 Identities = 224/328 (68%), Positives = 262/328 (79%), Gaps = 1/328 (0%)
 Frame = -1

Query: 1226 GFVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMD 1047
            G + DVP SIDWR KGAVT VKDQ  CGACW+FSATGAIEG+N+IVTGSLVSLSEQEL+D
Sbjct: 112  GNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELID 171

Query: 1046 CDRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDV 867
            CDRSYNSGCGGGLMDYA+QFV+ NHGID+EKDYPY+     CN+ KL R +VTIDG+ DV
Sbjct: 172  CDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV 231

Query: 866  PSYNEKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVD 687
            P  NEK++L+AV +QPVSVG+CGS+R FQLYS GIF+GPCSTSLDHAVLI+GY SENGVD
Sbjct: 232  PENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVD 291

Query: 686  YWILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSL 507
            YWI+KNSWG++WGM+GYMHMQRN+GN  G+CGINMLA                  TRCSL
Sbjct: 292  YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSL 351

Query: 506  LSYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCL-KVA 330
            L+YC  GETCCCG  +LGICLSWKCCG  SAVCC DH  CCP +YP+CD+V +QCL ++ 
Sbjct: 352  LTYCAPGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLT 411

Query: 329  GNSTMVKGLEKKGSSWKFGGLDSLFEAW 246
            GN T  + +E +GSSWKFG   S  +AW
Sbjct: 412  GNVTAAEAIEMRGSSWKFGSWSSFIDAW 439


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  486 bits (1250), Expect = e-134
 Identities = 221/314 (70%), Positives = 256/314 (81%)
 Frame = -1

Query: 1214 DVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDCDRS 1035
            DVP+S+DWR KGAVT VKDQG CGACWSFSATGA+EG+NQI+TGSL+SLSEQEL+DCDRS
Sbjct: 113  DVPDSLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRS 172

Query: 1034 YNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVPSYN 855
            YNSGCGGGLMDYA+QFV++NHGID+E DYPYQ  D +C ++KL+R VVTIDG+ D+PS +
Sbjct: 173  YNSGCGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSND 232

Query: 854  EKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIL 675
            E ++L+AVA+QPVSVG+CGS+R FQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWI+
Sbjct: 233  EGKLLQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIV 292

Query: 674  KNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLLSYC 495
            KNSWGK+WGMDGYMHMQRNSGN EGVCGIN LA                  T+CS+L+ C
Sbjct: 293  KNSWGKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSC 352

Query: 494  GAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAGNSTM 315
             AGETCCC  + LG+CLSWKCCG+ SAVCCKD   CCP DYP+CDT  N CLK   N T 
Sbjct: 353  AAGETCCCAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTR 412

Query: 314  VKGLEKKGSSWKFG 273
             + LE + SS   G
Sbjct: 413  TEILENRSSSGSSG 426


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  484 bits (1245), Expect = e-134
 Identities = 222/324 (68%), Positives = 256/324 (79%)
 Frame = -1

Query: 1211 VPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDCDRSY 1032
            VP+S+DWR KGAVT VKDQG CGACWSFSATGA+EG+NQIVTG L+SLSEQEL+DCD+SY
Sbjct: 125  VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY 184

Query: 1031 NSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVPSYNE 852
            N+GC GGLMDYAF+FV+ NHGID+EKDYPYQE D TC ++KLK+RVVTID +  V S NE
Sbjct: 185  NAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNE 244

Query: 851  KEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILK 672
            K +++AVASQPVSVG+CGS+R FQLYS GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI+K
Sbjct: 245  KALMEAVASQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVK 304

Query: 671  NSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLLSYCG 492
            NSWGK+WGMDG+MHMQRN+GN EGVCGINMLA                  T+C+L +YC 
Sbjct: 305  NSWGKSWGMDGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCS 364

Query: 491  AGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAGNSTMV 312
            +GETCCC   L G+C SWKCC ++SAVCCKD   CCP DYPVCDT  + CLK  GN T +
Sbjct: 365  SGETCCCARTLFGLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEI 424

Query: 311  KGLEKKGSSWKFGGLDSLFEAWNM 240
            K   KK SS K G     FE W M
Sbjct: 425  KPFWKKNSSNKLG----RFEEWVM 444


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  481 bits (1238), Expect = e-133
 Identities = 219/327 (66%), Positives = 258/327 (78%)
 Frame = -1

Query: 1226 GFVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMD 1047
            G V D+P S+DWR KGAVT VKDQG CGACWSFSATGAIEG+N+IVTG+LVSLSEQEL+D
Sbjct: 110  GLVRDIPASMDWRTKGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVD 169

Query: 1046 CDRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDV 867
            CDRSYNSGC GGLMDYA+QFV++NHGID+E+DYPY   ++TCN+ K KRRVVTIDG+  V
Sbjct: 170  CDRSYNSGCEGGLMDYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGV 229

Query: 866  PSYNEKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVD 687
            P+ NE  +L+AVA QPVSVG+CGS+R FQLYSKGIF+GPCS+SLDHAVLIVGYGSENGVD
Sbjct: 230  PANNEDLLLQAVAKQPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVD 289

Query: 686  YWILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSL 507
            YWI+KNSWG  WGM+GY+HM RNSG+ +G+CGINMLA                  T+C L
Sbjct: 290  YWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDL 349

Query: 506  LSYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAG 327
             +YC AGETCCC  R+ GIC SWKCC +DSAVCCKD+  CCP DYPVCDT  +QCLK  G
Sbjct: 350  FTYCSAGETCCCTHRIFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVG 409

Query: 326  NSTMVKGLEKKGSSWKFGGLDSLFEAW 246
            N+T ++  EK+ S+ KF       E W
Sbjct: 410  NATRMEAFEKRHSTRKFSSWRPFVENW 436


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  479 bits (1234), Expect = e-133
 Identities = 214/316 (67%), Positives = 257/316 (81%)
 Frame = -1

Query: 1214 DVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDCDRS 1035
            D P S+DWR KGAVT VK+QG CGACWSFSATGA+EG+N+I TGSLVSLSEQEL+DCDRS
Sbjct: 120  DAPSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRS 179

Query: 1034 YNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVPSYN 855
            YN GCGGGLMDYAF+FV+ N GID+EKDYP++E + TCN+NKL+R VVTIDG+ D+P  +
Sbjct: 180  YNEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQND 239

Query: 854  EKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIL 675
            E ++LKAVA+QPVSVG+CGS R FQ YSKGIF+GPCST+LDHAVLIVGYGSENGVDYWI+
Sbjct: 240  EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWII 299

Query: 674  KNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLLSYC 495
            KNSWG +WG++GY+HMQRNSGN+EG+CGIN LA                  ++CS+ + C
Sbjct: 300  KNSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSC 359

Query: 494  GAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAGNSTM 315
            G GETCCCG + LGICLSWKCCG+DSAVCCKD   CCP DYP+CDT  N CLK   N+T+
Sbjct: 360  GQGETCCCGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATI 419

Query: 314  VKGLEKKGSSWKFGGL 267
            V+  +K+  + KFGGL
Sbjct: 420  VQQPQKEAFTGKFGGL 435


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  476 bits (1224), Expect = e-131
 Identities = 216/324 (66%), Positives = 256/324 (79%)
 Frame = -1

Query: 1211 VPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDCDRSY 1032
            VP+S+DWR KGAVT VKDQG CGACWSFSATGA+EG+NQIVTG L+SLSEQEL+DCD+SY
Sbjct: 118  VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY 177

Query: 1031 NSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVPSYNE 852
            N+GC GGLMDYAF+FV+ NHGID+EKDYPYQE D TC ++KLK++VVTID +  V S +E
Sbjct: 178  NAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDE 237

Query: 851  KEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILK 672
            K +++AVA+QPVSVG+CGS+R FQLYS+GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI+K
Sbjct: 238  KALMEAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVK 297

Query: 671  NSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLLSYCG 492
            NSWGK+WGMDG+MHMQRN+ N +GVCGINMLA                  T+C+L +YC 
Sbjct: 298  NSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCS 357

Query: 491  AGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAGNSTMV 312
            +GETCCC   L G+C SWKCC ++SAVCCKD   CCP DYPVCDT  + CLK  GN T +
Sbjct: 358  SGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAI 417

Query: 311  KGLEKKGSSWKFGGLDSLFEAWNM 240
            K   KK SS + G     FE W M
Sbjct: 418  KPFWKKNSSKQLG----RFEEWVM 437


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  475 bits (1223), Expect = e-131
 Identities = 213/316 (67%), Positives = 255/316 (80%)
 Frame = -1

Query: 1214 DVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDCDRS 1035
            D P S+DWR+KGAVT VK+QG CGACWSFSATGAIEG+N+I TGSLVSLSEQEL+DCDRS
Sbjct: 120  DAPSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRS 179

Query: 1034 YNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVPSYN 855
            YN GCGGGLMDYAF+FV+ N GID+EKDYP++E + TCN+NKL+RRVVTIDG+ D+P  +
Sbjct: 180  YNQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQND 239

Query: 854  EKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIL 675
            E ++LKAVA+QPVSVG+CGS R FQ YSKGIF+GPC T LDHAVLIVGYGSENG DYWI+
Sbjct: 240  EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWII 299

Query: 674  KNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLLSYC 495
            KNSWG +WG++GY+HMQRNSGN+EG+CG+N LA                  ++CS  + C
Sbjct: 300  KNSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSC 359

Query: 494  GAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAGNSTM 315
            G GETCCCG + LGICLSWKCCG+DSAVCCKD   CCP DYP+CDT  N CLK   N+T+
Sbjct: 360  GQGETCCCGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATI 419

Query: 314  VKGLEKKGSSWKFGGL 267
            V+  +K+  + KFGGL
Sbjct: 420  VQQPQKEPFTGKFGGL 435


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  475 bits (1223), Expect = e-131
 Identities = 215/298 (72%), Positives = 248/298 (83%)
 Frame = -1

Query: 1223 FVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDC 1044
            FVGDVP SIDWR KGAV+ VKDQG CGACWSFSATGAIEG+N+IVTGSLVSLSEQEL+DC
Sbjct: 115  FVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDC 174

Query: 1043 DRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVP 864
            DRSYN+GC GGLMDYA+QFV+ N+GID+E+DYPYQ  ++TCN+ KLKR VVTIDG+ DVP
Sbjct: 175  DRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVP 234

Query: 863  SYNEKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDY 684
              NEKE+LKAVA+QPVSVG+CGS+R FQLYSKGIF+GPCSTSLDHAVLIVGYGSENGVDY
Sbjct: 235  QNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDY 294

Query: 683  WILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLL 504
            WI+KNSWG +WG++GYM+M RNSGN +G+CGINMLA                  T+C L 
Sbjct: 295  WIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLF 354

Query: 503  SYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVA 330
            + CG GETCCC  R+ G+C SWKCC +DSAVCCKD + CCP DYPVCDT  N CLKV+
Sbjct: 355  TRCGEGETCCCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLKVS 412


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  475 bits (1222), Expect = e-131
 Identities = 216/324 (66%), Positives = 255/324 (78%)
 Frame = -1

Query: 1211 VPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDCDRSY 1032
            VP+S+DWR KGAVT VKDQG CGACWSFSATGA+EG+NQIVTG L+SLSEQEL+DCD+SY
Sbjct: 118  VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY 177

Query: 1031 NSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVPSYNE 852
            N+GC GGLMDYAF+FV+ NHGID+EKDYPYQE D TC ++KLK++VVTID +  V S +E
Sbjct: 178  NAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDE 237

Query: 851  KEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILK 672
            K +++AVA+QPVSVG+CGS+R FQLYS GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI+K
Sbjct: 238  KALMEAVAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVK 297

Query: 671  NSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLLSYCG 492
            NSWGK+WGMDG+MHMQRN+ N +GVCGINMLA                  T+C+L +YC 
Sbjct: 298  NSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCS 357

Query: 491  AGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAGNSTMV 312
            +GETCCC   L G+C SWKCC ++SAVCCKD   CCP DYPVCDT  + CLK  GN T +
Sbjct: 358  SGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAI 417

Query: 311  KGLEKKGSSWKFGGLDSLFEAWNM 240
            K   KK SS + G     FE W M
Sbjct: 418  KPFWKKNSSKQLG----RFEEWVM 437


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  473 bits (1218), Expect = e-131
 Identities = 218/326 (66%), Positives = 255/326 (78%), Gaps = 2/326 (0%)
 Frame = -1

Query: 1211 VPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDCDRSY 1032
            VP+S+DWR KGAVT VKDQG CGACWSFSATGA+EG+NQIVTG L+SLSEQEL+DCD+SY
Sbjct: 118  VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY 177

Query: 1031 NSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVPSYNE 852
            N+GC GGLMDYAF+FV+ NHGID+EKDYPYQE D TC ++KLK++VVTID +  V S +E
Sbjct: 178  NAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDE 237

Query: 851  KEILKAVASQPVSVGLCGSDRGFQLYSK--GIFSGPCSTSLDHAVLIVGYGSENGVDYWI 678
            K + +AVA+QPVSVG+CGS+R FQLYS+  GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI
Sbjct: 238  KALREAVAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWI 297

Query: 677  LKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLLSY 498
            +KNSWGK+WGMDG+MHMQRN+GN EG+CGINMLA                  T+C+L +Y
Sbjct: 298  VKNSWGKSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTY 357

Query: 497  CGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAGNST 318
            C AGETCCC   L G+C SWKCC ++SAVCC D   CCP DYPVCDT  + CLK  GN T
Sbjct: 358  CSAGETCCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFT 417

Query: 317  MVKGLEKKGSSWKFGGLDSLFEAWNM 240
             +K   KK SS K G     FE W M
Sbjct: 418  AIKPFWKKDSSNKLG----RFEGWVM 439


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  470 bits (1210), Expect = e-130
 Identities = 209/293 (71%), Positives = 241/293 (82%)
 Frame = -1

Query: 1214 DVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDCDRS 1035
            DVP S+DWR KGAVT VKDQG CGACW+FSATGAIEG+N+IVTGSLVSLSEQEL+DCD S
Sbjct: 114  DVPASLDWRKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTS 173

Query: 1034 YNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVPSYN 855
            YN+GC GGLMDYA+QFV++NHGID+E+DYPYQ  D++C + KLKRRVVTIDG+ DV   N
Sbjct: 174  YNAGCDGGLMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTDVAPNN 233

Query: 854  EKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIL 675
              ++L+AV +QPVSVG+CGS+R FQLYSKGIF+GPCSTSLDHAVLIVGY SENGVDYWI+
Sbjct: 234  GLQLLQAVVTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGVDYWIV 293

Query: 674  KNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLLSYC 495
            KNSWGK WGMDGY+HMQRN+GN +GVCGINMLA                  TRCS  + C
Sbjct: 294  KNSWGKQWGMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCSFFAQC 353

Query: 494  GAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLK 336
            G GETCCC WR LG+C SWKCCG++SAVCCKD + CCP DYP+CDT  N CLK
Sbjct: 354  GEGETCCCSWRFLGLCFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLK 406


>ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria italica]
          Length = 454

 Score =  470 bits (1210), Expect = e-130
 Identities = 212/325 (65%), Positives = 253/325 (77%)
 Frame = -1

Query: 1226 GFVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMD 1047
            G V  VP+++DWR KGAVT VK+QG CGACWSFSATGAIEG+N+I TGSLVSLSEQEL+D
Sbjct: 126  GGVAAVPDAVDWRKKGAVTKVKNQGSCGACWSFSATGAIEGINKIKTGSLVSLSEQELID 185

Query: 1046 CDRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDV 867
            CDRSYN+GCGGGLMDYAF+FV+ N GID+E DYPY++ D TCN+NKLKRRVVTIDG+ DV
Sbjct: 186  CDRSYNNGCGGGLMDYAFKFVIKNGGIDTEDDYPYRQADGTCNKNKLKRRVVTIDGYSDV 245

Query: 866  PSYNEKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVD 687
            PS  E  +L+AVA QPVSVG+CGS R FQLYS+GIF GPC TSLDHAVLIVGYGSE G D
Sbjct: 246  PSNKENLLLQAVAQQPVSVGICGSARAFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKD 305

Query: 686  YWILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSL 507
            YWI+KNSWG+ WGM GYMHM RN+G   G+CGINM+                   T+C+L
Sbjct: 306  YWIVKNSWGERWGMKGYMHMHRNTGASSGICGINMMPSFPTKTSPNPPPSPGPGPTKCNL 365

Query: 506  LSYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAG 327
            L+YC  G TCCC WR+LG+CLSW CCG+D+A+CCKD+  CCP DYP+CDTV  QCL+  G
Sbjct: 366  LTYCPEGSTCCCSWRVLGLCLSWSCCGLDNAICCKDNRYCCPHDYPICDTVRAQCLRANG 425

Query: 326  NSTMVKGLEKKGSSWKFGGLDSLFE 252
            N + ++G++KK S  K    + L E
Sbjct: 426  NFSGIEGIKKKQSFSKVPSWNGLLE 450


>ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
            gi|241945324|gb|EES18469.1| hypothetical protein
            SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  468 bits (1203), Expect = e-129
 Identities = 211/312 (67%), Positives = 251/312 (80%), Gaps = 1/312 (0%)
 Frame = -1

Query: 1226 GFVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMD 1047
            G +G VP+++DWR  GAVT VKDQG CGACWSFSATGA+EG+N+I TGSLVSLSEQEL+D
Sbjct: 134  GGLGAVPDALDWRENGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELID 193

Query: 1046 CDRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDV 867
            CDRSYNSGCGGGLMDYA++FVV N GID+E+DYPY+E D TCN+NKLK+R+VTIDG+ DV
Sbjct: 194  CDRSYNSGCGGGLMDYAYKFVVKNGGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDV 253

Query: 866  PSYNEKEILKAVASQPVSVGLCGSDRGFQLYS-KGIFSGPCSTSLDHAVLIVGYGSENGV 690
            PS  E  +L+AVA QPVSVG+CGS R FQLYS +GIF GPC TSLDHAVLIVGYGSE G 
Sbjct: 254  PSNKEDLLLQAVAQQPVSVGICGSARAFQLYSQQGIFDGPCPTSLDHAVLIVGYGSEGGK 313

Query: 689  DYWILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCS 510
            DYWI+KNSWG++WGM GYMHM RN+G+ +GVCGINM+A                  T+CS
Sbjct: 314  DYWIVKNSWGESWGMKGYMHMHRNTGDSKGVCGINMMASFPTKSSPNPPPSPGPGPTKCS 373

Query: 509  LLSYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVA 330
            LL+YC  G TCCC WR+LG CLSW CC +D+AVCCKD+ SCCP DYPVCDT    CLK +
Sbjct: 374  LLTYCPEGSTCCCSWRILGFCLSWSCCELDNAVCCKDNKSCCPHDYPVCDTDRGLCLKAS 433

Query: 329  GNSTMVKGLEKK 294
            GNS+ ++G+ +K
Sbjct: 434  GNSSAIEGIRRK 445


>ref|XP_006655467.1| PREDICTED: cysteine proteinase RD21a-like [Oryza brachyantha]
          Length = 377

 Score =  467 bits (1201), Expect = e-129
 Identities = 213/326 (65%), Positives = 252/326 (77%), Gaps = 1/326 (0%)
 Frame = -1

Query: 1226 GFVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMD 1047
            G VG VP+++DWR  G VT VKDQG CGACWSFSATGA+EG+N+I TGSL+SLSEQEL+D
Sbjct: 48   GGVGSVPDALDWRQSGVVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELID 107

Query: 1046 CDRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDV 867
            CDRSYN+GCGGGLMDYA++FVV N GID+E+DYPY+ETD TCN+NKLKRRVVTIDG+ DV
Sbjct: 108  CDRSYNTGCGGGLMDYAYKFVVKNGGIDTEEDYPYRETDGTCNKNKLKRRVVTIDGYKDV 167

Query: 866  PSYNEKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVD 687
            P+ NE  +L+AVA QPVSVG+CGS R FQLYSKGIF GPC TSLDHAVLIVGYGSE G D
Sbjct: 168  PANNEDLLLQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTSLDHAVLIVGYGSEGGKD 227

Query: 686  YWILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSL 507
            YWI+KNSWG++WGM GYMHM RN+GN  G+CGIN +                   T+CSL
Sbjct: 228  YWIVKNSWGESWGMKGYMHMHRNTGNSYGICGINQMPSFPTKTSPNPPPSPGPGPTKCSL 287

Query: 506  LSYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKV-A 330
            L+YC  G TCCC WR+LG+CLSW CC +DSA CCKD+  CCP DYP+CDT S +C K   
Sbjct: 288  LTYCPEGSTCCCSWRVLGLCLSWSCCELDSATCCKDNRYCCPHDYPICDTASRRCFKANN 347

Query: 329  GNSTMVKGLEKKGSSWKFGGLDSLFE 252
            GN ++++G  +K S  K   L  L E
Sbjct: 348  GNFSVMEGGSRKQSFSKVPSLGGLLE 373


>ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [Amborella trichopoda]
            gi|548841210|gb|ERN01273.1| hypothetical protein
            AMTR_s00002p00249780 [Amborella trichopoda]
          Length = 475

 Score =  466 bits (1200), Expect = e-129
 Identities = 213/323 (65%), Positives = 253/323 (78%)
 Frame = -1

Query: 1214 DVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMDCDRS 1035
            DVP S+DWR+KGAVT VKDQG CGACW+FSATGAIEG+N+IVTGSL+SLSEQE++DCD +
Sbjct: 157  DVPSSLDWRDKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLISLSEQEIIDCDTT 216

Query: 1034 YNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDVPSYN 855
            YNSGCGGGLMDYAF++V  NHGID+EKDYPY+E   +C ++K +R VVTIDG  D+PS +
Sbjct: 217  YNSGCGGGLMDYAFKWVTKNHGIDTEKDYPYREVQGSCIKDKAERHVVTIDGHTDIPSNS 276

Query: 854  EKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIL 675
            E  IL+AVA QPVSVG+CGS+R FQLYS GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI+
Sbjct: 277  EDLILQAVAKQPVSVGICGSERSFQLYSSGIFSGPCSTSLDHAVLIVGYGSKNGVDYWIV 336

Query: 674  KNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLLSYC 495
            KNSWG +WGMDGYMHM RNSG+ +GVCGINM+                    +CSLL+YC
Sbjct: 337  KNSWGTSWGMDGYMHMLRNSGDSQGVCGINMMPSYPTKSGANPPPSPPPGPVKCSLLTYC 396

Query: 494  GAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKVAGNSTM 315
             +G TCCC WR LGICLSW CC +D+AVCCKD   CCP DYPVC+T +  CLK +GN T 
Sbjct: 397  PSGNTCCCTWRFLGICLSWSCCDLDNAVCCKDGQYCCPQDYPVCNTATGYCLKGSGNWTE 456

Query: 314  VKGLEKKGSSWKFGGLDSLFEAW 246
            + GL+++ S   FGG    F  W
Sbjct: 457  MDGLKRRQS---FGG----FRPW 472


>gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score =  466 bits (1199), Expect = e-129
 Identities = 214/326 (65%), Positives = 252/326 (77%), Gaps = 1/326 (0%)
 Frame = -1

Query: 1226 GFVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMD 1047
            G VG VP+++DWR  GAVT VKDQG CGACWSFSATGA+EG+N+I TGSL+SLSEQEL+D
Sbjct: 120  GGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELID 179

Query: 1046 CDRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDV 867
            CDRSYNSGCGGGLMDYA++FVV N GID+E DYPY+ETD TCN+NKLKRRVVTIDG+ DV
Sbjct: 180  CDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRETDGTCNKNKLKRRVVTIDGYKDV 239

Query: 866  PSYNEKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVD 687
            P+ NE  +L+AVA QPVSVG+CGS R FQLYSKGIF GPC TSLDHA+LIVGYGSE G D
Sbjct: 240  PANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTSLDHAILIVGYGSEGGKD 299

Query: 686  YWILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSL 507
            YWI+KNSWG++WGM GYM+M RN+GN  GVCGIN +                   T+CSL
Sbjct: 300  YWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQMPSFPTKSSPNPPPSPGPGPTKCSL 359

Query: 506  LSYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKV-A 330
            L+YC  G TCCC WR+LG+CLSW CC +D+AVCCKD+  CCP DYPVCDT S +C K   
Sbjct: 360  LTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKDNRYCCPHDYPVCDTASQRCFKANN 419

Query: 329  GNSTMVKGLEKKGSSWKFGGLDSLFE 252
            GN ++++G  +K    K   L  L E
Sbjct: 420  GNFSVMEGGSRKQPFSKVPSLGGLLE 445


>ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] gi|48475189|gb|AAT44258.1|
            hypothetical protein [Oryza sativa Japonica Group]
            gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa
            Japonica Group]
          Length = 450

 Score =  466 bits (1199), Expect = e-129
 Identities = 214/326 (65%), Positives = 252/326 (77%), Gaps = 1/326 (0%)
 Frame = -1

Query: 1226 GFVGDVPESIDWRNKGAVTPVKDQGGCGACWSFSATGAIEGVNQIVTGSLVSLSEQELMD 1047
            G VG VP+++DWR  GAVT VKDQG CGACWSFSATGA+EG+N+I TGSL+SLSEQEL+D
Sbjct: 121  GGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELID 180

Query: 1046 CDRSYNSGCGGGLMDYAFQFVVNNHGIDSEKDYPYQETDRTCNRNKLKRRVVTIDGFIDV 867
            CDRSYNSGCGGGLMDYA++FVV N GID+E DYPY+ETD TCN+NKLKRRVVTIDG+ DV
Sbjct: 181  CDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRETDGTCNKNKLKRRVVTIDGYKDV 240

Query: 866  PSYNEKEILKAVASQPVSVGLCGSDRGFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVD 687
            P+ NE  +L+AVA QPVSVG+CGS R FQLYSKGIF GPC TSLDHA+LIVGYGSE G D
Sbjct: 241  PANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTSLDHAILIVGYGSEGGKD 300

Query: 686  YWILKNSWGKNWGMDGYMHMQRNSGNKEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSL 507
            YWI+KNSWG++WGM GYM+M RN+GN  GVCGIN +                   T+CSL
Sbjct: 301  YWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQMPSFPTKSSPNPPPSPGPGPTKCSL 360

Query: 506  LSYCGAGETCCCGWRLLGICLSWKCCGVDSAVCCKDHVSCCPPDYPVCDTVSNQCLKV-A 330
            L+YC  G TCCC WR+LG+CLSW CC +D+AVCCKD+  CCP DYPVCDT S +C K   
Sbjct: 361  LTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKDNRYCCPHDYPVCDTASQRCFKANN 420

Query: 329  GNSTMVKGLEKKGSSWKFGGLDSLFE 252
            GN ++++G  +K    K   L  L E
Sbjct: 421  GNFSVMEGGSRKQPFSKVPSLGGLLE 446

