BLASTX nr result
ID: Cocculus23_contig00003865
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cocculus23_contig00003865 (1468 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 539 e-150 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 534 e-149 ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr... 534 e-149 ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr... 533 e-149 ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C... 533 e-148 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 527 e-147 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 527 e-147 gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] 526 e-147 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 526 e-147 ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087... 525 e-146 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 525 e-146 ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [A... 516 e-143 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 516 e-143 ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 514 e-143 ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S... 511 e-142 ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps... 507 e-141 gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase... 507 e-141 ref|NP_001141813.1| uncharacterized protein LOC100273952 precurs... 506 e-140 ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria... 503 e-140 ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [S... 501 e-139 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 539 bits (1389), Expect = e-150 Identities = 255/393 (64%), Positives = 298/393 (75%), Gaps = 3/393 (0%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASM---VDRSKSLG 1296 +VF+DN FV +HNS NS+Y + LNA+ADLTHHEF+AS+LGLS AAS VDRS Sbjct: 52 KVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSSAASASLNVDRSN--- 108 Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116 + FV DVP S+DWRK GAVT VKDQ +CGACW+FS TGAIEGIN+IVTGSLVSLSE Sbjct: 109 -RQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSE 167 Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936 QEL+DCD+SYN+GC GG+MDYAF+FV+ NHGIDTEEDYPYQ DRSCN+ KLKR VVTID Sbjct: 168 QELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTID 227 Query: 935 GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGS 756 GY D+P +NE+ELLKAVA+QPVSVG+CGSER FQLYS GIF+GPC+TSLDHAV+IVGYGS Sbjct: 228 GYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGS 287 Query: 755 ENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXX 576 ENGVDYWI+KNSWG WGMDGYMHMQRN+G +G+CGINMLA Sbjct: 288 ENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGP 347 Query: 575 TRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQ 396 TRC L ++CG GETCCC + G +CLSWKCCELDSAVCCKD +CCP +YP+CDT Sbjct: 348 TRCDLFTHCGEGETCCCVHHIFG-ICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNI 406 Query: 395 CLRAAGNYSMVKPFEXXXXXXXXXXXXXLFEAW 297 CL+ GN + ++ F L E W Sbjct: 407 CLKHYGNATRIEKFAKNSSSGKFRSWSSLLEGW 439 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 534 bits (1376), Expect = e-149 Identities = 243/373 (65%), Positives = 297/373 (79%), Gaps = 1/373 (0%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAA-SMVDRSKSLGTK 1290 +VFEDN FV +HNS NS+Y + LNA+ADLTHHEF+ S+LGLS A ++ R+ + Sbjct: 51 KVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHRNLEI--- 107 Query: 1289 ARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQE 1110 +G VGD+P SIDWR KG VT VKDQ SCGACW+FS TGAIEGIN+IVTGSLVSLSEQE Sbjct: 108 --TGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQE 165 Query: 1109 LIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGY 930 LI+CD+SYN GCGGGLMDYAF+FV+ NHGIDTEEDYPY+A D +CN++++KRRVVTID Y Sbjct: 166 LIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKY 225 Query: 929 TDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSEN 750 D+P +NE++LL+AVA+QPVSVG+CGSER FQ+YS GIF+GPC+TSLDHAV+IVGYGSEN Sbjct: 226 VDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSEN 285 Query: 749 GVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTR 570 GVDYWI+KNSWG WGM GYMHMQRN+G+ QGVCGINMLA T+ Sbjct: 286 GVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTK 345 Query: 569 CSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCL 390 C+L++YC AGETCCC + G +C+SWKCC LDSAVCCKD ++CCPH+YP+CDT C Sbjct: 346 CNLLTYCAAGETCCCARKFFG-ICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCF 404 Query: 389 RAAGNYSMVKPFE 351 + AGN + ++ E Sbjct: 405 KRAGNATRMEAIE 417 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 534 bits (1375), Expect = e-149 Identities = 249/373 (66%), Positives = 295/373 (79%), Gaps = 1/373 (0%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKA 1287 ++FEDN AFV QHN++ NS++ + LNA+ADLTH EF+AS LG S A+ DR ++ ++ Sbjct: 51 KIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS 110 Query: 1286 RSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQEL 1107 G + DVP SIDWRKKGAVT VKDQASCGACWAFS TGAIEGIN+IVTGSLVSLSEQEL Sbjct: 111 -PGTLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169 Query: 1106 IDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYT 927 IDCDRSYNSGCGGGLMDYA++FV+KNHGIDTE+DYPY+ CN+ KL R +VTIDGY Sbjct: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229 Query: 926 DIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENG 747 D+P +NE++LL+AV +QPVSVG+CGSER FQLYS+GIF+GPC+TSLDHAV+IVGY SENG Sbjct: 230 DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENG 289 Query: 746 VDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRC 567 VDYWI+KNSWG+SWGM+GYMHMQRN G+ G+CGINMLA TRC Sbjct: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRC 349 Query: 566 SLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCL- 390 SL++YC AGETCCCG +LG +CLSWKCC SAVCC DH YCCP NYPICD+ QCL Sbjct: 350 SLLTYCAAGETCCCGSSILG-ICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408 Query: 389 RAAGNYSMVKPFE 351 R GN + + E Sbjct: 409 RFTGNVTAAEAIE 421 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 533 bits (1374), Expect = e-149 Identities = 246/372 (66%), Positives = 297/372 (79%), Gaps = 1/372 (0%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAA-SMVDRSKSLGTK 1290 ++F DN FV QHN +SNSTY + LNA+ADLTHHEF+AS+LGLS + S++ + +SLG Sbjct: 59 QIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHEFKASRLGLSAPSPSLMAKEQSLGVS 118 Query: 1289 ARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQE 1110 R VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSEQE Sbjct: 119 ERVRV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQE 176 Query: 1109 LIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGY 930 LIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTE+DYPYQ D +C ++KLK+RVVTID Y Sbjct: 177 LIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSY 236 Query: 929 TDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSEN 750 + S+NE+ L++AVASQPVSVG+CGSER FQLYS+GIFSGPC+TSLDHAV+IVGYGS+N Sbjct: 237 AGVASNNEKALMEAVASQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQN 296 Query: 749 GVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTR 570 GVDYWI+KNSWGKSWGMDG+MHMQRN G+ +GVCGINMLA T+ Sbjct: 297 GVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTK 356 Query: 569 CSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCL 390 C+L +YC +GETCCC L GL C SWKCCEL+SAVCCKD +CCP +YP+CDT CL Sbjct: 357 CNLFTYCSSGETCCCARTLFGL-CFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCL 415 Query: 389 RAAGNYSMVKPF 354 + GN++ +KPF Sbjct: 416 KKTGNFTEIKPF 427 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 533 bits (1372), Expect = e-148 Identities = 249/391 (63%), Positives = 297/391 (75%), Gaps = 1/391 (0%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKA 1287 ++FEDN AFV QHN++ NS++ + LNA+ADLTH EF+AS LG S A+ DR ++ ++ Sbjct: 51 KIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS 110 Query: 1286 RSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQEL 1107 G + DVP SIDWRKKGAVT VKDQASCGACWAFS TGAIEGIN+IVTGSLVSLSEQEL Sbjct: 111 -PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169 Query: 1106 IDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYT 927 IDCDRSYNSGCGGGLMDYA++FV+KNHGIDTE+DYPY+ CN+ KL R +VTIDGY Sbjct: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229 Query: 926 DIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENG 747 D+P +NE++LL+AV +QPVSVG+CGSER FQLYS+GIF+GPC+TSLDHAV+I+GY SENG Sbjct: 230 DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENG 289 Query: 746 VDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRC 567 VDYWI+KNSWG+SWGM+GYMHMQRN G+ G+CGINMLA TRC Sbjct: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRC 349 Query: 566 SLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCL- 390 SL++YC GETCCCG +LG +CLSWKCC SAVCC DH YCCP NYPICD+ QCL Sbjct: 350 SLLTYCAPGETCCCGSSILG-ICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408 Query: 389 RAAGNYSMVKPFEXXXXXXXXXXXXXLFEAW 297 R GN + + E +AW Sbjct: 409 RLTGNVTAAEAIEMRGSSWKFGSWSSFIDAW 439 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 527 bits (1358), Expect = e-147 Identities = 241/374 (64%), Positives = 297/374 (79%), Gaps = 3/374 (0%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAAS---MVDRSKSLG 1296 ++F+DN FV QHN ++N+TY + LNA+ADLTHHEF+AS+LGLSV+A M + +SLG Sbjct: 54 QIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLG 113 Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116 + VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSE Sbjct: 114 GSVK------VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSE 167 Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936 QELIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTE+DYPYQ D +C ++KLK++VVTID Sbjct: 168 QELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTID 227 Query: 935 GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGS 756 Y + S++E+ L++AVA+QPVSVG+CGSER FQLYS+GIFSGPC+TSLDHAV+IVGYGS Sbjct: 228 SYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGS 287 Query: 755 ENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXX 576 +NGVDYWI+KNSWGKSWGMDG+MHMQRN + GVCGINMLA Sbjct: 288 QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGP 347 Query: 575 TRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQ 396 T+C+L +YC +GETCCC L GL C SWKCCE++SAVCCKD +CCPH+YP+CDT Sbjct: 348 TKCNLFTYCSSGETCCCARELFGL-CFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSL 406 Query: 395 CLRAAGNYSMVKPF 354 CL+ GN++ +KPF Sbjct: 407 CLKKTGNFTAIKPF 420 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 527 bits (1357), Expect = e-147 Identities = 243/376 (64%), Positives = 299/376 (79%), Gaps = 5/376 (1%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAAS---MVDRSKSLG 1296 ++F+DN FV QHN ++N+TY + LNA+ADLTHHEF+AS+LGLSV+AS M + +SLG Sbjct: 54 QIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSASSLIMASKGQSLG 113 Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116 A+ VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSE Sbjct: 114 GNAK------VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSE 167 Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936 QELIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTE+DYPYQ D +C ++KLK++VVTID Sbjct: 168 QELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTID 227 Query: 935 GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYS--TGIFSGPCATSLDHAVVIVGY 762 Y + S++E+ L +AVA+QPVSVG+CGSER FQLYS +GIFSGPC+TSLDHAV+IVGY Sbjct: 228 SYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGY 287 Query: 761 GSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXX 582 GS+NGVDYWI+KNSWGKSWGMDG+MHMQRN G+ +G+CGINMLA Sbjct: 288 GSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPP 347 Query: 581 XXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQT 402 T+C+L +YC AGETCCC L GL C SWKCCE++SAVCC D +CCPH+YP+CDT Sbjct: 348 GPTKCNLFTYCSAGETCCCARNLFGL-CFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTR 406 Query: 401 KQCLRAAGNYSMVKPF 354 CL+ GN++ +KPF Sbjct: 407 SLCLKKTGNFTAIKPF 422 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 526 bits (1356), Expect = e-147 Identities = 248/359 (69%), Positives = 287/359 (79%) Frame = -3 Query: 1463 VFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKAR 1284 VFEDNLAFV QHN++ NS+Y + LNA+ADLTHHEF++S+LG S A ++ LG+K Sbjct: 53 VFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHHEFKSSRLGFSSA--LLSSLPKLGSKLL 110 Query: 1283 SGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELI 1104 + DVP S+DWRKKGAVT VKDQ SCGACWAFS TGAIEGIN+IVTGSLVSLSEQELI Sbjct: 111 D--LRDVPASLDWRKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 168 Query: 1103 DCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTD 924 DCD SYN+GC GGLMDYA++FV+ NHGIDTEEDYPYQA D+SC + KLKRRVVTIDGYTD Sbjct: 169 DCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTD 228 Query: 923 IPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGV 744 + +N +LL+AV +QPVSVG+CGSER FQLYS GIF+GPC+TSLDHAV+IVGY SENGV Sbjct: 229 VAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGV 288 Query: 743 DYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCS 564 DYWI+KNSWGK WGMDGY+HMQRN G+ QGVCGINMLA TRCS Sbjct: 289 DYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCS 348 Query: 563 LMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLR 387 + CG GETCCC +R LGL C SWKCC L+SAVCCKD I+CCP +YP+CDTQ CL+ Sbjct: 349 FFAQCGEGETCCCSWRFLGL-CFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLK 406 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 526 bits (1356), Expect = e-147 Identities = 241/374 (64%), Positives = 296/374 (79%), Gaps = 3/374 (0%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAAS---MVDRSKSLG 1296 ++F+DN FV QHN ++N+TY + LNA+ADLTHHEF+AS+LGLSV+A M + +SLG Sbjct: 54 QIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLG 113 Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116 + VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSE Sbjct: 114 GSVK------VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSE 167 Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936 QELIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTE+DYPYQ D +C ++KLK++VVTID Sbjct: 168 QELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTID 227 Query: 935 GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGS 756 Y + S++E+ L++AVA+QPVSVG+CGSER FQLYS GIFSGPC+TSLDHAV+IVGYGS Sbjct: 228 SYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGS 287 Query: 755 ENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXX 576 +NGVDYWI+KNSWGKSWGMDG+MHMQRN + GVCGINMLA Sbjct: 288 QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGP 347 Query: 575 TRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQ 396 T+C+L +YC +GETCCC L GL C SWKCCE++SAVCCKD +CCPH+YP+CDT Sbjct: 348 TKCNLFTYCSSGETCCCARELFGL-CFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSL 406 Query: 395 CLRAAGNYSMVKPF 354 CL+ GN++ +KPF Sbjct: 407 CLKKTGNFTAIKPF 420 >ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 525 bits (1353), Expect = e-146 Identities = 241/372 (64%), Positives = 292/372 (78%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKA 1287 +VFE+N AFV QHN + NS+Y + LNA+ADLTHHEF+AS+LGLS AA R Sbjct: 52 KVFEENYAFVTQHNGVGNSSYSLALNAFADLTHHEFKASRLGLSAAAIEGSRPNL----Q 107 Query: 1286 RSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQEL 1107 G V D+P S+DWR KGAVT VKDQ SCGACW+FS TGAIEGIN+IVTG+LVSLSEQEL Sbjct: 108 LPGLVRDIPASMDWRTKGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQEL 167 Query: 1106 IDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYT 927 +DCDRSYNSGC GGLMDYA++FV+ NHGID EEDYPY +++CN+ K KRRVVTIDGY Sbjct: 168 VDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYA 227 Query: 926 DIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENG 747 +P++NE+ LL+AVA QPVSVG+CGSER FQLYS GIF+GPC++SLDHAV+IVGYGSENG Sbjct: 228 GVPANNEDLLLQAVAKQPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENG 287 Query: 746 VDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRC 567 VDYWI+KNSWG WGM+GY+HM RN+GD +G+CGINMLA T+C Sbjct: 288 VDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKC 347 Query: 566 SLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLR 387 L +YC AGETCCC R+ G +C SWKCCELDSAVCCKD+ +CCP++YP+CDT+ QCL+ Sbjct: 348 DLFTYCSAGETCCCTHRIFG-ICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLK 406 Query: 386 AAGNYSMVKPFE 351 GN + ++ FE Sbjct: 407 RVGNATRMEAFE 418 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 525 bits (1351), Expect = e-146 Identities = 241/362 (66%), Positives = 293/362 (80%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKA 1287 ++FE+N FV +HNS NS+Y + LNA+ADLTHHEF+AS+LGLS ++ S+ Sbjct: 54 KIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGKLSRR--NFP 111 Query: 1286 RSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQEL 1107 FVGDVP SIDWRKKGAV+ VKDQ +CGACW+FS TGAIEGIN+IVTGSLVSLSEQEL Sbjct: 112 LHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQEL 171 Query: 1106 IDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYT 927 +DCDRSYN+GC GGLMDYA++FV++N+GIDTEEDYPYQA +++CN+ KLKR VVTIDGYT Sbjct: 172 VDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYT 231 Query: 926 DIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENG 747 D+P +NE+ELLKAVA+QPVSVG+CGSER FQLYS GIF+GPC+TSLDHAV+IVGYGSENG Sbjct: 232 DVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENG 291 Query: 746 VDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRC 567 VDYWI+KNSWG WG++GYM+M RN+G+ QG+CGINMLA T+C Sbjct: 292 VDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKC 351 Query: 566 SLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLR 387 L + CG GETCCC R+ GL C SWKCCELDSAVCCKD ++CCPH+YP+CDT+ CL+ Sbjct: 352 DLFTRCGEGETCCCTRRIFGL-CFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 Query: 386 AA 381 + Sbjct: 411 VS 412 >ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [Amborella trichopoda] gi|548841210|gb|ERN01273.1| hypothetical protein AMTR_s00002p00249780 [Amborella trichopoda] Length = 475 Score = 516 bits (1329), Expect = e-143 Identities = 236/366 (64%), Positives = 286/366 (78%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKA 1287 RVF DNL F+ +HN +NS Y VGLNA+ADLTHHEF+ +LGL + S + Sbjct: 95 RVFSDNLVFIREHNQRANSNYTVGLNAFADLTHHEFKIKRLGLCPSILRFSSSNFRSDQK 154 Query: 1286 RSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQEL 1107 + DVP S+DWR KGAVT VKDQ SCGACWAFS TGAIEGIN+IVTGSL+SLSEQE+ Sbjct: 155 KI----DVPSSLDWRDKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLISLSEQEI 210 Query: 1106 IDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYT 927 IDCD +YNSGCGGGLMDYAFK+V KNHGIDTE+DYPY+ + SC ++K +R VVTIDG+T Sbjct: 211 IDCDTTYNSGCGGGLMDYAFKWVTKNHGIDTEKDYPYREVQGSCIKDKAERHVVTIDGHT 270 Query: 926 DIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENG 747 DIPS++E+ +L+AVA QPVSVG+CGSER FQLYS+GIFSGPC+TSLDHAV+IVGYGS+NG Sbjct: 271 DIPSNSEDLILQAVAKQPVSVGICGSERSFQLYSSGIFSGPCSTSLDHAVLIVGYGSKNG 330 Query: 746 VDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRC 567 VDYWI+KNSWG SWGMDGYMHM RN+GD QGVCGINM+ +C Sbjct: 331 VDYWIVKNSWGTSWGMDGYMHMLRNSGDSQGVCGINMMPSYPTKSGANPPPSPPPGPVKC 390 Query: 566 SLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLR 387 SL++YC +G TCCC +R LG +CLSW CC+LD+AVCCKD YCCP +YP+C+T T CL+ Sbjct: 391 SLLTYCPSGNTCCCTWRFLG-ICLSWSCCDLDNAVCCKDGQYCCPQDYPVCNTATGYCLK 449 Query: 386 AAGNYS 369 +GN++ Sbjct: 450 GSGNWT 455 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 516 bits (1329), Expect = e-143 Identities = 244/363 (67%), Positives = 282/363 (77%) Frame = -3 Query: 1463 VFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKAR 1284 VF DN FV HN+L NS+Y + LN+YADLTHHEF+ S+LG S A R+ Sbjct: 52 VFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEFKVSRLGFSPAL----RNFRPVLPQE 107 Query: 1283 SGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELI 1104 DVP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQI+TGSL+SLSEQELI Sbjct: 108 PSLPRDVPDSLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELI 167 Query: 1103 DCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTD 924 DCDRSYNSGCGGGLMDYA++FV+ NHGIDTE DYPYQA D SC ++KL+R VVTIDGY D Sbjct: 168 DCDRSYNSGCGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYAD 227 Query: 923 IPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGV 744 IPS++E +LL+AVA+QPVSVG+CGSER FQLYS GIFSGPC+TSLDHAV+IVGYGSENGV Sbjct: 228 IPSNDEGKLLQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGV 287 Query: 743 DYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCS 564 DYWI+KNSWGKSWGMDGYMHMQRN+G+ +GVCGIN LA T+CS Sbjct: 288 DYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCS 347 Query: 563 LMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRA 384 +++ C AGETCCC + LGL CLSWKCC L SAVCCKD +CCP +YPICDT CL+ Sbjct: 348 ILTSCAAGETCCCAKKFLGL-CLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQ 406 Query: 383 AGN 375 N Sbjct: 407 TMN 409 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 514 bits (1324), Expect = e-143 Identities = 237/372 (63%), Positives = 297/372 (79%), Gaps = 3/372 (0%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGT-K 1290 +VFE+N A++ +HNS NS+Y +GLNAY+DLTHHEFR S LGLS +A+ R K G+ Sbjct: 51 KVFEENYAYITEHNSKENSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRGSGS 110 Query: 1289 ARSGFVGDV--PKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116 + +G + DV P S+DWR+KGAVT VK+Q SCGACW+FS TGA+EGIN+I TGSLVSLSE Sbjct: 111 SETGVLSDVDAPSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSE 170 Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936 QELIDCDRSYN GCGGGLMDYAF+FV+KN GIDTE+DYP++ + +CN+NKL+R VVTID Sbjct: 171 QELIDCDRSYNEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTID 230 Query: 935 GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGS 756 GYTDIP ++E++LLKAVA+QPVSVG+CGS R FQ YS GIF+GPC+T+LDHAV+IVGYGS Sbjct: 231 GYTDIPQNDEDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGS 290 Query: 755 ENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXX 576 ENGVDYWI+KNSWG SWG++GY+HMQRN+G+++G+CGIN LA Sbjct: 291 ENGVDYWIIKNSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGP 350 Query: 575 TRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQ 396 ++CS+ + CG GETCCCG + LG +CLSWKCC LDSAVCCKD +CCP +YPICDT Sbjct: 351 SKCSMFTSCGQGETCCCGSKFLG-ICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNL 409 Query: 395 CLRAAGNYSMVK 360 CL+ N ++V+ Sbjct: 410 CLKRMNNATIVQ 421 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 511 bits (1316), Expect = e-142 Identities = 237/372 (63%), Positives = 293/372 (78%), Gaps = 3/372 (0%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGT-K 1290 +VFE+N A++ +HNS NS+Y +GLNAY+DLTHHEFR S LGLS +A+ R K G+ Sbjct: 51 KVFEENYAYITEHNSKGNSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRGSGS 110 Query: 1289 ARSGFVGDV--PKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116 + +G + DV P S+DWR KGAVT VK+Q SCGACW+FS TGAIEGIN+I TGSLVSLSE Sbjct: 111 SAAGVLSDVDAPSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSE 170 Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936 QELIDCDRSYN GCGGGLMDYAF+FV+KN GIDTE+DYP++ + +CN+NKL+RRVVTID Sbjct: 171 QELIDCDRSYNQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTID 230 Query: 935 GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGS 756 GYTDIP ++E++LLKAVA+QPVSVG+CGS R FQ YS GIF+GPC T LDHAV+IVGYGS Sbjct: 231 GYTDIPQNDEDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGS 290 Query: 755 ENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXX 576 ENG DYWI+KNSWG SWG++GY+HMQRN+G+++G+CG+N LA Sbjct: 291 ENGFDYWIIKNSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGP 350 Query: 575 TRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQ 396 ++CS + CG GETCCCG + LG +CLSWKCC LDSAVCCKD +CCP +YPICDT Sbjct: 351 SKCSTFTSCGQGETCCCGLKFLG-ICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNL 409 Query: 395 CLRAAGNYSMVK 360 CL+ N ++V+ Sbjct: 410 CLKRMSNATIVQ 421 >ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] gi|482576142|gb|EOA40329.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] Length = 467 Score = 507 bits (1305), Expect = e-141 Identities = 241/402 (59%), Positives = 292/402 (72%), Gaps = 31/402 (7%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAAS---MVDRSKSLG 1296 ++F DN FV QHN ++N+TY + LNA+ADL H EF+ S+LGLSV+A M + KSLG Sbjct: 56 QIFRDNHDFVTQHNLITNATYSLSLNAFADLNHSEFKTSRLGLSVSAPSVIMASKGKSLG 115 Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116 + VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSE Sbjct: 116 GSVK------VPDSLDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSE 169 Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936 QELIDCD+SYN GC GGLMDYAF+FV+KN GIDTE+DYPYQ D +C ++KLK+RVV+ID Sbjct: 170 QELIDCDKSYNDGCNGGLMDYAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSID 229 Query: 935 GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYST---------------------- 822 Y + S+E+ LL+AVA+QPVSVG+CGSER FQLYS+ Sbjct: 230 SYAGVKPSDEKALLEAVAAQPVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLK 289 Query: 821 ------GIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDK 660 GIFSGPC+TSLDHAV+IVGYGS+NGVDYWI+KNSWGKSWGMDG+MHMQRN G+ Sbjct: 290 LYLMMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNS 349 Query: 659 QGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCC 480 QG+CGINMLA T+C+L +YC A ETCCC L GL CLSWKCC Sbjct: 350 QGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAAETCCCARNLFGL-CLSWKCC 408 Query: 479 ELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKPF 354 E++SAVCCKD +CCPH+YP+CDT CL+ GN++ +KPF Sbjct: 409 EIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPF 450 >gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135) [Arabidopsis thaliana] Length = 416 Score = 507 bits (1305), Expect = e-141 Identities = 236/370 (63%), Positives = 289/370 (78%), Gaps = 10/370 (2%) Frame = -3 Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAAS---MVDRSKSLG 1296 ++F+DN FV QHN ++N+TY + LNA+ADLTHHEF+AS+LGLSV+A M + +SLG Sbjct: 52 QIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLG 111 Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116 + VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSE Sbjct: 112 GSVK------VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSE 165 Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936 QELIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTE+DYPYQ D +C ++KLK++VVTID Sbjct: 166 QELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTID 225 Query: 935 GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYST-------GIFSGPCATSLDHAV 777 Y + S++E+ L++AVA+QPVSVG+CGSER FQLYS+ GIFSGPC+TSLDHAV Sbjct: 226 SYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAV 285 Query: 776 VIVGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXX 597 +IVGYGS+NGVDYWI+KNSWGKSWGMDG+MHMQRN + GVCGINMLA Sbjct: 286 LIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPP 345 Query: 596 XXXXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPI 417 T+C+L +YC +GETCCC L GL C SWKCCE++SAVCCKD +CCPH+YP+ Sbjct: 346 PPSPPGPTKCNLFTYCSSGETCCCARELFGL-CFSWKCCEIESAVCCKDGRHCCPHDYPV 404 Query: 416 CDTQTKQCLR 387 CDT CL+ Sbjct: 405 CDTTRSLCLK 414 >ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays] gi|194706024|gb|ACF87096.1| unknown [Zea mays] gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays] Length = 460 Score = 506 bits (1303), Expect = e-140 Identities = 240/381 (62%), Positives = 286/381 (75%), Gaps = 13/381 (3%) Frame = -3 Query: 1463 VFEDNLAFVDQHNSLSNS-------------TYKVGLNAYADLTHHEFRASKLGLSVAAS 1323 VF DN AFV HN+ + + +Y + LNA+ADLTH EFRA++LG +A Sbjct: 59 VFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLALNAFADLTHEEFRAARLG-RIAPG 117 Query: 1322 MVDRSKSLGTKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIV 1143 RS++ G VP ++DWRK GAVT VKDQ SCGACW+FS TGA+EGIN+I Sbjct: 118 AALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKVKDQGSCGACWSFSATGAMEGINKIK 177 Query: 1142 TGSLVSLSEQELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNK 963 TGSLVSLSEQELIDCDRSYNSGCGGGLMDYA+KFV+KN GIDTEEDYPY+ D +CN+NK Sbjct: 178 TGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVIKNGGIDTEEDYPYREADGTCNKNK 237 Query: 962 LKRRVVTIDGYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDH 783 LK+RVVTIDGYTD+PS+ E+ LL+AVA QPVSVG+CGS R FQLY GIF GPC TSLDH Sbjct: 238 LKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGICGSARAFQLYYQGIFDGPCPTSLDH 297 Query: 782 AVVIVGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXX 603 AV+IVGYGSE G DYWI+KNSWG+SWGM GYMHM RN GD +GVCGINM+A Sbjct: 298 AVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNTGDSKGVCGINMMASFPTKTSPN 357 Query: 602 XXXXXXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNY 423 T+CSL++YC G TCCC +R+LG CLSW CCELD+AVCCKD+ YCCPH+Y Sbjct: 358 PPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGF-CLSWSCCELDNAVCCKDNRYCCPHDY 416 Query: 422 PICDTQTKQCLRAAGNYSMVK 360 P+CDT QCL+A+GN+S ++ Sbjct: 417 PVCDTGRGQCLKASGNFSAIE 437 >ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria italica] Length = 454 Score = 503 bits (1296), Expect = e-140 Identities = 238/377 (63%), Positives = 282/377 (74%), Gaps = 9/377 (2%) Frame = -3 Query: 1463 VFEDNLAFVDQHNSLSNS------TYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKS 1302 VF DN AFV HN+ +N+ +Y + LNA+ADLTH EFRA++LG + +S Sbjct: 56 VFADNAAFVAAHNARANAVGGSPPSYTLALNAFADLTHEEFRAARLGRLAVGRVGATLRS 115 Query: 1301 LGTKARSGFVGDV---PKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSL 1131 G G G V P ++DWRKKGAVT VK+Q SCGACW+FS TGAIEGIN+I TGSL Sbjct: 116 AGAPVFGGLDGGVAAVPDAVDWRKKGAVTKVKNQGSCGACWSFSATGAIEGINKIKTGSL 175 Query: 1130 VSLSEQELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRR 951 VSLSEQELIDCDRSYN+GCGGGLMDYAFKFV+KN GIDTE+DYPY+ D +CN+NKLKRR Sbjct: 176 VSLSEQELIDCDRSYNNGCGGGLMDYAFKFVIKNGGIDTEDDYPYRQADGTCNKNKLKRR 235 Query: 950 VVTIDGYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVI 771 VVTIDGY+D+PS+ E LL+AVA QPVSVG+CGS R FQLYS GIF GPC TSLDHAV+I Sbjct: 236 VVTIDGYSDVPSNKENLLLQAVAQQPVSVGICGSARAFQLYSQGIFDGPCPTSLDHAVLI 295 Query: 770 VGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXX 591 VGYGSE G DYWI+KNSWG+ WGM GYMHM RN G G+CGINM+ Sbjct: 296 VGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTGASSGICGINMMPSFPTKTSPNPPPS 355 Query: 590 XXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICD 411 T+C+L++YC G TCCC +R+LGL CLSW CC LD+A+CCKD+ YCCPH+YPICD Sbjct: 356 PGPGPTKCNLLTYCPEGSTCCCSWRVLGL-CLSWSCCGLDNAICCKDNRYCCPHDYPICD 414 Query: 410 TQTKQCLRAAGNYSMVK 360 T QCLRA GN+S ++ Sbjct: 415 TVRAQCLRANGNFSGIE 431 >ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor] gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor] Length = 463 Score = 501 bits (1291), Expect = e-139 Identities = 239/378 (63%), Positives = 286/378 (75%), Gaps = 10/378 (2%) Frame = -3 Query: 1463 VFEDNLAFVDQHNSLSNS--------TYKVGLNAYADLTHHEFRASKLGLSVAASMVDRS 1308 VF DN AFV HN+ N+ +Y + LNA+ADLTH EFRA++LG A + RS Sbjct: 64 VFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFADLTHEEFRAARLGRIAAGAAALRS 123 Query: 1307 KSLGT-KARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSL 1131 + + G +G VP ++DWR+ GAVT VKDQ SCGACW+FS TGA+EGIN+I TGSL Sbjct: 124 PAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSL 183 Query: 1130 VSLSEQELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRR 951 VSLSEQELIDCDRSYNSGCGGGLMDYA+KFVVKN GIDTEEDYPY+ D +CN+NKLK+R Sbjct: 184 VSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEEDYPYREADGTCNKNKLKKR 243 Query: 950 VVTIDGYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYS-TGIFSGPCATSLDHAVV 774 +VTIDGY+D+PS+ E+ LL+AVA QPVSVG+CGS R FQLYS GIF GPC TSLDHAV+ Sbjct: 244 IVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSARAFQLYSQQGIFDGPCPTSLDHAVL 303 Query: 773 IVGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXX 594 IVGYGSE G DYWI+KNSWG+SWGM GYMHM RN GD +GVCGINM+A Sbjct: 304 IVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNTGDSKGVCGINMMASFPTKSSPNPPP 363 Query: 593 XXXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPIC 414 T+CSL++YC G TCCC +R+LG CLSW CCELD+AVCCKD+ CCPH+YP+C Sbjct: 364 SPGPGPTKCSLLTYCPEGSTCCCSWRILGF-CLSWSCCELDNAVCCKDNKSCCPHDYPVC 422 Query: 413 DTQTKQCLRAAGNYSMVK 360 DT CL+A+GN S ++ Sbjct: 423 DTDRGLCLKASGNSSAIE 440