BLASTX nr result
ID: Rauwolfia21_contig00008481
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rauwolfia21_contig00008481 (1882 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 612 e-172 ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S... 607 e-171 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 595 e-167 dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 585 e-164 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 583 e-164 ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis ... 582 e-163 ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr... 581 e-163 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 581 e-163 gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] 580 e-163 ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C... 578 e-162 ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr... 578 e-162 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 577 e-162 gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] 576 e-161 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 575 e-161 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 575 e-161 ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps... 556 e-155 gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase... 555 e-155 ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine... 552 e-154 ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F... 546 e-153 ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a... 545 e-152 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 612 bits (1578), Expect = e-172 Identities = 286/433 (66%), Positives = 339/433 (78%), Gaps = 1/433 (0%) Frame = +3 Query: 312 MSWLW-SFWTIVLLFHVSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYI 488 M+WL S ++L+F C+ S +DLFE WC++ GK YSSEQE++YR +VFE+NY YI Sbjct: 1 MNWLLPSLVLVLLIFQQPFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYI 60 Query: 489 TQHNSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDND 668 T+HNSK NS+YTL LNA++DLTHHEF+ +LGLS+SAND IRL +G + D D Sbjct: 61 TEHNSKENSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSETGVLSDVD 120 Query: 669 IPSSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSY 848 PSSLDWREKGAVT+VK+QGSCGACW+FSATGAMEGINKI TGSLVSLSEQELIDCDRSY Sbjct: 121 APSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRSY 180 Query: 849 NSGCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNE 1028 N GCGGGLMDYA+EFV+KN GIDTEKDY F+ R+GTC++NKL+RHVVTIDGY D+ N+E Sbjct: 181 NEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQNDE 240 Query: 1029 KELLQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVK 1208 +LL+AVA QPVSVGICGS RAFQ YS+GIF+GPCST+LDHAVLIVGY S+NGVDYWI+K Sbjct: 241 DKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWIIK 300 Query: 1209 NSWGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCP 1388 NSWGTSWG+NGY+H+ R++GN EG+CGIN LASYP KTS N KC++F+SC Sbjct: 301 NSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSCG 360 Query: 1389 GGETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLF 1568 GETCCC K LGIC SWKCC LDSAV YPICDT RNLCLK+ N+T+ Sbjct: 361 QGETCCCGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATIV 420 Query: 1569 KQLQNRGLVGDLG 1607 +Q Q G G Sbjct: 421 QQPQKEAFTGKFG 433 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 607 bits (1564), Expect = e-171 Identities = 282/433 (65%), Positives = 336/433 (77%), Gaps = 1/433 (0%) Frame = +3 Query: 312 MSWLW-SFWTIVLLFHVSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYI 488 M WL S ++L+F +C+ S +DLFE WC++ GK YSSEQE++YR +VFE+NY YI Sbjct: 1 MKWLLPSLVLVLLIFQQPLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYI 60 Query: 489 TQHNSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDND 668 T+HNSKGNS+YTL LNA++DLTHHEF+ +LGLS+SAND IRL A+G + D D Sbjct: 61 TEHNSKGNSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRGSGSSAAGVLSDVD 120 Query: 669 IPSSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSY 848 PSSLDWR+KGAVTNVK+QGSCGACW+FSATGA+EGINKI TGSLVSLSEQELIDCDRSY Sbjct: 121 APSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRSY 180 Query: 849 NSGCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNE 1028 N GCGGGLMDYA+EFV+KN GIDTEKDY F+ ++GTC++NKL+R VVTIDGY D+ N+E Sbjct: 181 NQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQNDE 240 Query: 1029 KELLQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVK 1208 +LL+AVA QPVSVGICGS RAFQ YS+GIF+GPC T LDHAVLIVGY S+NG DYWI+K Sbjct: 241 DKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWIIK 300 Query: 1209 NSWGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCP 1388 NSWGTSWG+NGY+H+ R++GN EG+CG+N LASYP KTS N KC+ F+SC Sbjct: 301 NSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSCG 360 Query: 1389 GGETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLF 1568 GETCCC K LGIC SWKCC LDSAV YPICDT RNLCLK+ N+T+ Sbjct: 361 QGETCCCGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATIV 420 Query: 1569 KQLQNRGLVGDLG 1607 +Q Q G G Sbjct: 421 QQPQKEPFTGKFG 433 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 595 bits (1535), Expect = e-167 Identities = 281/442 (63%), Positives = 347/442 (78%), Gaps = 1/442 (0%) Frame = +3 Query: 312 MSWLWSFWTIVLLFHVS-ICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYI 488 M++L+ F +L+ +S SSS + LFE WCKE+GK+Y+S++E+ +RL+VFEDNYD++ Sbjct: 1 MNFLYIFALTLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFV 60 Query: 489 TQHNSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDND 668 T+HNSKGNS+Y+L+LNAFADLTHHEFK LGLSA+ +L N + +G VGD Sbjct: 61 TKHNSKGNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHRNLEI-----TGVVGD-- 113 Query: 669 IPSSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSY 848 IP+S+DWR KG VTNVKDQGSCGACW+FSATGA+EGINKIVTGSLVSLSEQELI+CD+SY Sbjct: 114 IPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSY 173 Query: 849 NSGCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNE 1028 N GCGGGLMDYA++FV+ N GIDTE+DY ++ RDGTC+++++KR VVTID Y DV NNE Sbjct: 174 NDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNE 233 Query: 1029 KELLQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVK 1208 K+LLQAVAAQPVSVGICGSERAFQ+YS+GIF+GPCSTSLDHAVLIVGY S+NGVDYWIVK Sbjct: 234 KQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVK 293 Query: 1209 NSWGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCP 1388 NSWGT WG+ GY+H+ R++GNS+GVCGINMLASYPVKTS N KCNL + C Sbjct: 294 NSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCA 353 Query: 1389 GGETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLF 1568 GETCCC+ K GIC SWKCC LDSAV YP+CDT +N+C K+ GN+T Sbjct: 354 AGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRM 413 Query: 1569 KQLQNRGLVGDLGNWNSFFQKW 1634 + ++ + G G+WNS + W Sbjct: 414 EAIEGK-TSGKFGSWNSLPEAW 434 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 585 bits (1509), Expect = e-164 Identities = 273/427 (63%), Positives = 338/427 (79%), Gaps = 3/427 (0%) Frame = +3 Query: 369 SSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQHNSKGNSTYTLSLNAFAD 548 SSS A LFE WC+++GKTY+S++EKL+RL+VF+DNYD++T+HNS+GNS+YTLSLNAFAD Sbjct: 22 SSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFAD 81 Query: 549 LTHHEFKAKYLGLSASAN---DLIRLNRGLLPFGASGAVGDNDIPSSLDWREKGAVTNVK 719 LTHHEFKA LGLS++A+ ++ R NR + F A D+P+S+DWR+ GAVT VK Sbjct: 82 LTHHEFKASRLGLSSAASASLNVDRSNRQIPDFVA-------DVPASVDWRKNGAVTQVK 134 Query: 720 DQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYEFVV 899 DQG+CGACW+FSATGA+EGINKIVTGSLVSLSEQEL+DCD+SYN+GC GG+MDYA++FV+ Sbjct: 135 DQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVI 194 Query: 900 KNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKELLQAVAAQPVSVGIC 1079 N GIDTE+DY +QGRD +C++ KLKRHVVTIDGY DV NNEKELL+AVA QPVSVGIC Sbjct: 195 DNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGIC 254 Query: 1080 GSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVKNSWGTSWGLNGYVHIAR 1259 GSERAFQLYS+GIF+GPCSTSLDHAVLIVGY S+NGVDYWIVKNSWG+ WG++GY+H+ R Sbjct: 255 GSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQR 314 Query: 1260 SNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCPGGETCCCSWKLLGICFS 1439 ++G+S G+CGINMLASYP KTS N +C+LF+ C GETCCC + GIC S Sbjct: 315 NSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLS 374 Query: 1440 WKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLFKQLQNRGLVGDLGNWNS 1619 WKCC+LDSAV YP+CDT RN+CLK GN+T ++ G +W+S Sbjct: 375 WKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSGKFRSWSS 434 Query: 1620 FFQKWNL 1640 + W L Sbjct: 435 LLEGWIL 441 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 583 bits (1504), Expect = e-164 Identities = 279/436 (63%), Positives = 336/436 (77%) Frame = +3 Query: 315 SWLWSFWTIVLLFHVSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQ 494 ++ + F T+ LL + ++S ++LFE WC E+GK+YSS +EKLYRL VF DNY+++T Sbjct: 3 NYAFHFLTLFLLLFRPLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTH 62 Query: 495 HNSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDIP 674 HN+ NS+YTLSLN++ADLTHHEFK LG S + +R R +LP S D+P Sbjct: 63 HNNLDNSSYTLSLNSYADLTHHEFKVSRLGFSPA----LRNFRPVLPQEPSLP---RDVP 115 Query: 675 SSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNS 854 SLDWR+KGAVT VKDQGSCGACW+FSATGAMEGIN+I+TGSL+SLSEQELIDCDRSYNS Sbjct: 116 DSLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNS 175 Query: 855 GCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKE 1034 GCGGGLMDYAY+FV+ N GIDTE DY +Q RDG+C ++KL+R+VVTIDGY D+ N+E + Sbjct: 176 GCGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGK 235 Query: 1035 LLQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVKNS 1214 LLQAVAAQPVSVGICGSERAFQLYS+GIFSGPCSTSLDHAVLIVGY S+NGVDYWIVKNS Sbjct: 236 LLQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNS 295 Query: 1215 WGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCPGG 1394 WG SWG++GY+H+ R++GNSEGVCGIN LASYP KT+ N KC++ +SC G Sbjct: 296 WGKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAG 355 Query: 1395 ETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLFKQ 1574 ETCCC+ K LG+C SWKCC L SAV YPICDT RNLCLK+T N T + Sbjct: 356 ETCCCAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEI 415 Query: 1575 LQNRGLVGDLGNWNSF 1622 L+NR G G W+SF Sbjct: 416 LENRSSSGSSGTWSSF 431 >ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera] gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera] Length = 436 Score = 582 bits (1501), Expect = e-163 Identities = 265/442 (59%), Positives = 327/442 (73%) Frame = +3 Query: 315 SWLWSFWTIVLLFHVSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQ 494 SWLW+ ++L H S+ +S TADLFE WC++YGKTYSSE+EK RL+VFE+N+ ++TQ Sbjct: 3 SWLWAVSILILAVHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQ 62 Query: 495 HNSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDIP 674 HNS N++YTL+LNAFADLTHHEFKA LG S IR V + +P Sbjct: 63 HNSMANASYTLALNAFADLTHHEFKASRLGFSPGRAQSIR--------SVGTPVQELHVP 114 Query: 675 SSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNS 854 ++DWR+ GAVT VKDQG+CG CW+FS TGA+EGINKIVTGSLVSLSEQEL+DCDRSYNS Sbjct: 115 PAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNS 174 Query: 855 GCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKE 1034 GC GGLMDYAY+FV+KN+GID+E DY + G D C++ KLK+H+VTIDGY D+ PN+EK+ Sbjct: 175 GCEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQ 234 Query: 1035 LLQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVKNS 1214 LLQ VA QPVSVGICGSE+ FQLYS+G+++GPCS++LDHAVLIVGY +++GVD+WIVKNS Sbjct: 235 LLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNS 294 Query: 1215 WGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCPGG 1394 WG WG+ GY+H+ R+NG +EG+CGINMLASYP KTS N KC+ FSSC G Sbjct: 295 WGEHWGMRGYIHMLRNNGTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEG 354 Query: 1395 ETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLFKQ 1574 ETCCCSW+ +G+C SW CC SAV +PICDTKRN CLK GN T + Sbjct: 355 ETCCCSWRFIGVCLSWNCCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEV 414 Query: 1575 LQNRGLVGDLGNWNSFFQKWNL 1640 L+ RG G W+S WNL Sbjct: 415 LKRRGSSVKFGGWSSINDAWNL 436 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 581 bits (1498), Expect = e-163 Identities = 276/434 (63%), Positives = 331/434 (76%), Gaps = 1/434 (0%) Frame = +3 Query: 336 TIVLLFHVSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQHNSKGNS 515 +I+LL + + S +LFE WCK++GK YSSEQEK RL++FEDNY ++TQHN+ GNS Sbjct: 10 SILLLSSLPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69 Query: 516 TYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDIPSSLDWRE 695 ++TLSLNAFADLTH EFKA +LG SA++ D R + + G + D+P+S+DWR+ Sbjct: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ--SPGTL--RDVPASIDWRK 125 Query: 696 KGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLM 875 KGAVT VKDQ SCGACWAFSATGA+EGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLM Sbjct: 126 KGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLM 185 Query: 876 DYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKELLQAVAA 1055 DYAY+FV+KN GIDTEKDY ++G+ G C++ KL RH+VTIDGY+DV NNEK+LLQAV A Sbjct: 186 DYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA 245 Query: 1056 QPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVKNSWGTSWGL 1235 QPVSVGICGSERAFQLYS GIF+GPCSTSLDHAVLIVGYDS+NGVDYWI+KNSWG SWG+ Sbjct: 246 QPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 305 Query: 1236 NGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCPGGETCCCSW 1415 NGY+H+ R+ GNS G+CGINMLASYP KT N +C+L + C GETCCC Sbjct: 306 NGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGS 365 Query: 1416 KLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKK-TGNSTLFKQLQNRGL 1592 +LGIC SWKCC SAV YPICD+ R+ CL + TGN T + ++ RG Sbjct: 366 SILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIEMRGS 425 Query: 1593 VGDLGNWNSFFQKW 1634 G+W+SF W Sbjct: 426 SWKFGSWSSFIDVW 439 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 581 bits (1497), Expect = e-163 Identities = 278/417 (66%), Positives = 342/417 (82%), Gaps = 6/417 (1%) Frame = +3 Query: 312 MSWLWSFWTIVLLF-HVSICSSSLTAD---LFENWCKEYGKTYSSEQEKLYRLRVFEDNY 479 M++L + + I LLF ++SI S S ++D LFE+W KE+GKTY+S+++KLYR ++FE+NY Sbjct: 1 MNFLSALFLITLLFFNLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENY 60 Query: 480 DYITQHNSKGNSTYTLSLNAFADLTHHEFKAKYLGLSA--SANDLIRLNRGLLPFGASGA 653 +++ +HNS+GNS+YTLSLNAFADLTHHEFKA LGLSA ++ L R N L F Sbjct: 61 EFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGKLSRRNFPLHDF----- 115 Query: 654 VGDNDIPSSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELID 833 VGD +P S+DWR+KGAV+ VKDQG+CGACW+FSATGA+EGINKIVTGSLVSLSEQEL+D Sbjct: 116 VGD--VPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVD 173 Query: 834 CDRSYNSGCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDV 1013 CDRSYN+GC GGLMDYAY+FV++N GIDTE+DY +Q R+ TC++ KLKRHVVTIDGY DV Sbjct: 174 CDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDV 233 Query: 1014 TPNNEKELLQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVD 1193 NNEKELL+AVAAQPVSVGICGSERAFQLYS+GIF+GPCSTSLDHAVLIVGY S+NGVD Sbjct: 234 PQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVD 293 Query: 1194 YWIVKNSWGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNL 1373 YWIVKNSWGT WG+NGY+++ R++GNS+G+CGINMLAS+PVKTS N KC+L Sbjct: 294 YWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDL 353 Query: 1374 FSSCPGGETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLK 1544 F+ C GETCCC+ ++ G+CFSWKCC+LDSAV YP+CDTKRN+CLK Sbjct: 354 FTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 580 bits (1495), Expect = e-163 Identities = 279/442 (63%), Positives = 334/442 (75%) Frame = +3 Query: 315 SWLWSFWTIVLLFHVSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQ 494 S+L SF +L F S S S + LFE WC ++GK YSSE+EK YRL+VFE+NY ++TQ Sbjct: 7 SFLLSF---LLFFDPSFASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQ 63 Query: 495 HNSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDIP 674 HN GNS+Y+L+LNAFADLTHHEFKA LGLSA+A + R N L G V DIP Sbjct: 64 HNGVGNSSYSLALNAFADLTHHEFKASRLGLSAAAIEGSRPNLQL-----PGLV--RDIP 116 Query: 675 SSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNS 854 +S+DWR KGAVT VKDQGSCGACW+FSATGA+EGINKIVTG+LVSLSEQEL+DCDRSYNS Sbjct: 117 ASMDWRTKGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNS 176 Query: 855 GCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKE 1034 GC GGLMDYAY+FV+ N GID E+DY + GR+ TC++ K KR VVTIDGY V NNE Sbjct: 177 GCEGGLMDYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDL 236 Query: 1035 LLQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVKNS 1214 LLQAVA QPVSVGICGSERAFQLYS+GIF+GPCS+SLDHAVLIVGY S+NGVDYWIVKNS Sbjct: 237 LLQAVAKQPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNS 296 Query: 1215 WGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCPGG 1394 WGT WG+NGY+H+ R++G+S+G+CGINMLASYP KTS N KC+LF+ C G Sbjct: 297 WGTRWGMNGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAG 356 Query: 1395 ETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLFKQ 1574 ETCCC+ ++ GICFSWKCC+LDSAV YP+CDTK++ CLK+ GN+T + Sbjct: 357 ETCCCTHRIFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEA 416 Query: 1575 LQNRGLVGDLGNWNSFFQKWNL 1640 + R +W F + W L Sbjct: 417 FEKRHSTRKFSSWRPFVENWVL 438 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 578 bits (1490), Expect = e-162 Identities = 271/417 (64%), Positives = 323/417 (77%), Gaps = 1/417 (0%) Frame = +3 Query: 387 DLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQHNSKGNSTYTLSLNAFADLTHHEF 566 +LFE WCK++GK YSSEQEK RL++FEDNY ++TQHN+ GNS++TLSLNAFADLTH EF Sbjct: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86 Query: 567 KAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDIPSSLDWREKGAVTNVKDQGSCGACW 746 KA +LG SA++ D R + + G + D+P+S+DWR+KGAVT VKDQ SCGACW Sbjct: 87 KASFLGFSAASIDHDRRRNASVQ--SPGNL--RDVPASIDWRKKGAVTEVKDQASCGACW 142 Query: 747 AFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYEFVVKNKGIDTEK 926 AFSATGA+EGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY+FV+KN GIDTEK Sbjct: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202 Query: 927 DYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKELLQAVAAQPVSVGICGSERAFQLY 1106 DY ++G+ G C++ KL RH+VTIDGY+DV NNEK+LLQAV AQPVSVGICGSERAFQLY Sbjct: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262 Query: 1107 SRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVKNSWGTSWGLNGYVHIARSNGNSEGVC 1286 S GIF+GPCSTSLDHAVLI+GYDS+NGVDYWI+KNSWG SWG+NGY+H+ R+ GNS G+C Sbjct: 263 SSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322 Query: 1287 GINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCPGGETCCCSWKLLGICFSWKCCQLDSA 1466 GINMLASYP KT N +C+L + C GETCCC +LGIC SWKCC SA Sbjct: 323 GINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFSSA 382 Query: 1467 VXXXXXXXXXXXXYPICDTKRNLCLKK-TGNSTLFKQLQNRGLVGDLGNWNSFFQKW 1634 V YPICD+ R+ CL + TGN T + ++ RG G+W+SF W Sbjct: 383 VCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEMRGSSWKFGSWSSFIDAW 439 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 578 bits (1490), Expect = e-162 Identities = 275/430 (63%), Positives = 329/430 (76%) Frame = +3 Query: 345 LLFHVSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQHNSKGNSTYT 524 L F +S SS A+LF++WC +GKTY SE+E+ +R+++F DN+D++TQHN NSTY+ Sbjct: 21 LSFSISSSSSDDIAELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYS 80 Query: 525 LSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDIPSSLDWREKGA 704 LSLNAFADLTHHEFKA LGLSA + L+ + L G S V +P S+DWR+KGA Sbjct: 81 LSLNAFADLTHHEFKASRLGLSAPSPSLMAKEQSL---GVSERVRVK-VPDSVDWRKKGA 136 Query: 705 VTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 884 VTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQELIDCD+SYN+GC GGLMDYA Sbjct: 137 VTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYA 196 Query: 885 YEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKELLQAVAAQPV 1064 +EFV+KN GIDTEKDY +Q +DGTC ++KLK+ VVTID Y V NNEK L++AVA+QPV Sbjct: 197 FEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPV 256 Query: 1065 SVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVKNSWGTSWGLNGY 1244 SVGICGSERAFQLYS GIFSGPCSTSLDHAVLIVGY SQNGVDYWIVKNSWG SWG++G+ Sbjct: 257 SVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGF 316 Query: 1245 VHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCPGGETCCCSWKLL 1424 +H+ R+ GNSEGVCGINMLASYP+KT N KCNLF+ C GETCCC+ L Sbjct: 317 MHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLF 376 Query: 1425 GICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLFKQLQNRGLVGDL 1604 G+CFSWKCC+L+SAV YP+CDT ++LCLKKTGN T K + L Sbjct: 377 GLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKKNSSNKL 436 Query: 1605 GNWNSFFQKW 1634 G F++W Sbjct: 437 GR----FEEW 442 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 577 bits (1488), Expect = e-162 Identities = 274/439 (62%), Positives = 335/439 (76%), Gaps = 3/439 (0%) Frame = +3 Query: 327 SFWTIVLLFHVSICSSSLTAD---LFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQH 497 SF ++ F + + SSS + D LF++WC+++GKTY SE+E+ R+++F+DN+D++TQH Sbjct: 7 SFISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQH 66 Query: 498 NSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDIPS 677 N N+TY+LSLNAFADLTHHEFKA LGLS SA +I ++G G +P Sbjct: 67 NLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKG------QSLGGSVKVPD 120 Query: 678 SLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNSG 857 S+DWR+KGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQELIDCD+SYN+G Sbjct: 121 SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 180 Query: 858 CGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKEL 1037 C GGLMDYA+EFV+KN GIDTEKDY +Q RDGTC ++KLK+ VVTID Y V N+EK L Sbjct: 181 CNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240 Query: 1038 LQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVKNSW 1217 ++AVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGY SQNGVDYWIVKNSW Sbjct: 241 MEAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSW 300 Query: 1218 GTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCPGGE 1397 G SWG++G++H+ R+ NS+GVCGINMLASYP+KT N KCNLF+ C GE Sbjct: 301 GKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGE 360 Query: 1398 TCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLFKQL 1577 TCCC+ +L G+CFSWKCC+++SAV YP+CDT R+LCLKKTGN T K Sbjct: 361 TCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPF 420 Query: 1578 QNRGLVGDLGNWNSFFQKW 1634 + LG F++W Sbjct: 421 WKKNSSKQLGR----FEEW 435 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 576 bits (1485), Expect = e-161 Identities = 276/424 (65%), Positives = 329/424 (77%), Gaps = 2/424 (0%) Frame = +3 Query: 282 LFCLFSLSLQMSWLWSFWTIVLLFHVSICSSSL-TADLFENWCKEYGKTYSSEQEKLYRL 458 + CLF LSL +S H+S+ S SL ++ LFE WC+++G++YSSE+E+LYRL Sbjct: 3 ILCLFLLSLLLS-----------SHLSLSSPSLNSSQLFEAWCEKHGQSYSSEEERLYRL 51 Query: 459 RVFEDNYDYITQHNSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASA-NDLIRLNRGLLP 635 VFEDN ++TQHN+ GNS+YTLSLNAFADLTHHEFK+ LG S++ + L +L LL Sbjct: 52 TVFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHHEFKSSRLGFSSALLSSLPKLGSKLLD 111 Query: 636 FGASGAVGDNDIPSSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLS 815 D+P+SLDWR+KGAVTNVKDQGSCGACWAFSATGA+EGINKIVTGSLVSLS Sbjct: 112 L--------RDVPASLDWRKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLS 163 Query: 816 EQELIDCDRSYNSGCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTI 995 EQELIDCD SYN+GC GGLMDYAY+FV+ N GIDTE+DY +Q RD +C + KLKR VVTI Sbjct: 164 EQELIDCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTI 223 Query: 996 DGYRDVTPNNEKELLQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYD 1175 DGY DV PNN +LLQAV QPVSVGICGSERAFQLYS+GIF+GPCSTSLDHAVLIVGYD Sbjct: 224 DGYTDVAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYD 283 Query: 1176 SQNGVDYWIVKNSWGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXX 1355 S+NGVDYWIVKNSWG WG++GY+H+ R+ GNS+GVCGINMLASYP KTS N Sbjct: 284 SENGVDYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPG 343 Query: 1356 XVKCNLFSSCPGGETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNL 1535 +C+ F+ C GETCCCSW+ LG+CFSWKCC L+SAV YP+CDT+RN+ Sbjct: 344 PTRCSFFAQCGEGETCCCSWRFLGLCFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNV 403 Query: 1536 CLKK 1547 CLK+ Sbjct: 404 CLKE 407 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 575 bits (1482), Expect = e-161 Identities = 273/439 (62%), Positives = 334/439 (76%), Gaps = 3/439 (0%) Frame = +3 Query: 327 SFWTIVLLFHVSICSSSLTAD---LFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQH 497 SF ++ F + + SSS + D LF++WC+++GKTY SE+E+ R+++F+DN+D++TQH Sbjct: 7 SFISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQH 66 Query: 498 NSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDIPS 677 N N+TY+LSLNAFADLTHHEFKA LGLS SA +I ++G G +P Sbjct: 67 NLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKG------QSLGGSVKVPD 120 Query: 678 SLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNSG 857 S+DWR+KGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQELIDCD+SYN+G Sbjct: 121 SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 180 Query: 858 CGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKEL 1037 C GGLMDYA+EFV+KN GIDTEKDY +Q RDGTC ++KLK+ VVTID Y V N+EK L Sbjct: 181 CNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240 Query: 1038 LQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVKNSW 1217 ++AVAAQPVSVGICGSERAFQLYS GIFSGPCSTSLDHAVLIVGY SQNGVDYWIVKNSW Sbjct: 241 MEAVAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSW 300 Query: 1218 GTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCPGGE 1397 G SWG++G++H+ R+ NS+GVCGINMLASYP+KT N KCNLF+ C GE Sbjct: 301 GKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGE 360 Query: 1398 TCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLFKQL 1577 TCCC+ +L G+CFSWKCC+++SAV YP+CDT R+LCLKKTGN T K Sbjct: 361 TCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPF 420 Query: 1578 QNRGLVGDLGNWNSFFQKW 1634 + LG F++W Sbjct: 421 WKKNSSKQLGR----FEEW 435 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 575 bits (1482), Expect = e-161 Identities = 275/443 (62%), Positives = 333/443 (75%), Gaps = 2/443 (0%) Frame = +3 Query: 300 LSLQMSWLWSFWTIVLLFHVSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNY 479 +S+ S S LL S SS ++LF++WC+ +GKTY SE+E+ R+++F+DN+ Sbjct: 1 MSMSSSSFVSLTFFFLLLVSSPSSSDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNH 60 Query: 480 DYITQHNSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVG 659 D++TQHN N+TY+LSLNAFADLTHHEFKA LGLS SA+ LI ++G G Sbjct: 61 DFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSASSLIMASKG------QSLGG 114 Query: 660 DNDIPSSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCD 839 + +P S+DWR+KGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQELIDCD Sbjct: 115 NAKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCD 174 Query: 840 RSYNSGCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTP 1019 +SYN+GC GGLMDYA+EFV+KN GIDTEKDY +Q RDGTC ++KLK+ VVTID Y V Sbjct: 175 KSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKS 234 Query: 1020 NNEKELLQAVAAQPVSVGICGSERAFQLYSR--GIFSGPCSTSLDHAVLIVGYDSQNGVD 1193 N+EK L +AVAAQPVSVGICGSERAFQLYSR GIFSGPCSTSLDHAVLIVGY SQNGVD Sbjct: 235 NDEKALREAVAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVD 294 Query: 1194 YWIVKNSWGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNL 1373 YWIVKNSWG SWG++G++H+ R+ GNSEG+CGINMLASYP+KT N KCNL Sbjct: 295 YWIVKNSWGKSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNL 354 Query: 1374 FSSCPGGETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTG 1553 F+ C GETCCC+ L G+CFSWKCC+++SAV YP+CDT R+LCLKKTG Sbjct: 355 FTYCSAGETCCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTG 414 Query: 1554 NSTLFKQLQNRGLVGDLGNWNSF 1622 N T K + LG + + Sbjct: 415 NFTAIKPFWKKDSSNKLGRFEGW 437 >ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] gi|482576142|gb|EOA40329.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] Length = 467 Score = 556 bits (1433), Expect = e-155 Identities = 269/464 (57%), Positives = 329/464 (70%), Gaps = 28/464 (6%) Frame = +3 Query: 327 SFWTIVLLFHVSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQHNSK 506 +F+ ++L+ S SS ++LF++WC+ +GKTY+SE EK +R ++F DN+D++TQHN Sbjct: 12 TFFFLLLVSSSSSSSSDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQHNLI 71 Query: 507 GNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDIPSSLD 686 N+TY+LSLNAFADL H EFK LGLS SA +I ++G G +P SLD Sbjct: 72 TNATYSLSLNAFADLNHSEFKTSRLGLSVSAPSVIMASKG------KSLGGSVKVPDSLD 125 Query: 687 WREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 866 WR+KGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQELIDCD+SYN GC G Sbjct: 126 WRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNG 185 Query: 867 GLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKELLQA 1046 GLMDYA+EFV+KNKGIDTEKDY +Q RDGTC ++KLK+ VV+ID Y V P++EK LL+A Sbjct: 186 GLMDYAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEA 245 Query: 1047 VAAQPVSVGICGSERAFQLYS----------------------------RGIFSGPCSTS 1142 VAAQPVSVGICGSERAFQLYS +GIFSGPCSTS Sbjct: 246 VAAQPVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTS 305 Query: 1143 LDHAVLIVGYDSQNGVDYWIVKNSWGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKT 1322 LDHAVLIVGY SQNGVDYWIVKNSWG SWG++G++H+ R+ GNS+G+CGINMLASYP+KT Sbjct: 306 LDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKT 365 Query: 1323 SSNXXXXXXXXXVKCNLFSSCPGGETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXX 1502 N KCNLF+ C ETCCC+ L G+C SWKCC+++SAV Sbjct: 366 HPNPPPPSPPGPTKCNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPH 425 Query: 1503 XYPICDTKRNLCLKKTGNSTLFKQLQNRGLVGDLGNWNSFFQKW 1634 YP+CDT R+LCLKKTGN T K + LG F++W Sbjct: 426 DYPVCDTTRSLCLKKTGNFTAIKPFWKKNSSNKLGR----FEEW 465 >gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135) [Arabidopsis thaliana] Length = 416 Score = 555 bits (1429), Expect = e-155 Identities = 263/416 (63%), Positives = 322/416 (77%), Gaps = 10/416 (2%) Frame = +3 Query: 327 SFWTIVLLFHVSICSSSLTAD---LFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQH 497 SF ++ F + + SSS + D LF++WC+++GKTY SE+E+ R+++F+DN+D++TQH Sbjct: 5 SFISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQH 64 Query: 498 NSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDIPS 677 N N+TY+LSLNAFADLTHHEFKA LGLS SA +I ++G G +P Sbjct: 65 NLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKG------QSLGGSVKVPD 118 Query: 678 SLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNSG 857 S+DWR+KGAVTNVKDQGSCGACW+FSATGAMEGIN+IVTG L+SLSEQELIDCD+SYN+G Sbjct: 119 SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 178 Query: 858 CGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKEL 1037 C GGLMDYA+EFV+KN GIDTEKDY +Q RDGTC ++KLK+ VVTID Y V N+EK L Sbjct: 179 CNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 238 Query: 1038 LQAVAAQPVSVGICGSERAFQLYS-------RGIFSGPCSTSLDHAVLIVGYDSQNGVDY 1196 ++AVAAQPVSVGICGSERAFQLYS +GIFSGPCSTSLDHAVLIVGY SQNGVDY Sbjct: 239 MEAVAAQPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDY 298 Query: 1197 WIVKNSWGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLF 1376 WIVKNSWG SWG++G++H+ R+ NS+GVCGINMLASYP+KT N KCNLF Sbjct: 299 WIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLF 358 Query: 1377 SSCPGGETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLK 1544 + C GETCCC+ +L G+CFSWKCC+++SAV YP+CDT R+LCLK Sbjct: 359 TYCSSGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414 >ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max] Length = 439 Score = 552 bits (1423), Expect = e-154 Identities = 276/451 (61%), Positives = 325/451 (72%), Gaps = 8/451 (1%) Frame = +3 Query: 291 LFSLSLQMSWLWSFWTIVLLFHVSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFE 470 L+SLSL F +++LLF + S+S T++LFE WCKE+ KTYSSE+EKLYRL+VFE Sbjct: 4 LYSLSLLQ-----FLSLILLFTLFFLSASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFE 58 Query: 471 DNYDYITQHN-----SKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLP 635 DNY ++ QHN + NS+YTLSLNAFADLTHHEFK LGL + LL Sbjct: 59 DNYAFVAQHNQNANNNNNNSSYTLSLNAFADLTHHEFKTTRLGLPLT----------LLR 108 Query: 636 FGASGAVGDND---IPSSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLV 806 F D IPS +DWR+ GAVT VKDQ SCGACWAFSATGA+EGINKIVTGSLV Sbjct: 109 FKRPQNQQSRDLLHIPSQIDWRQSGAVTPVKDQASCGACWAFSATGAIEGINKIVTGSLV 168 Query: 807 SLSEQELIDCDRSYNSGCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHV 986 SLSEQELIDCD SYNSGCGGGLMD+AY+FV+ NKGIDTE DY +Q R +C ++KLKR Sbjct: 169 SLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRA 228 Query: 987 VTIDGYRDVTPNNEKELLQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIV 1166 VTI+ Y DV P+ E+E+L+AVA+QPVSVGICGSER FQLYS+GIF+GPCST LDHAVLIV Sbjct: 229 VTIEDYVDVPPS-EEEILKAVASQPVSVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIV 287 Query: 1167 GYDSQNGVDYWIVKNSWGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXX 1346 GY S+NGVDYWIVKNSWG WG+NGY+H+ R++GNS+G+CGIN LASYPVKT N Sbjct: 288 GYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNSKGICGINTLASYPVKTKPNPPIPP 347 Query: 1347 XXXXVKCNLFSSCPGGETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTK 1526 V+CNLF+ C GETCCC+ LGICFSWKCC L SAV YPICDT+ Sbjct: 348 PPGPVRCNLFTHCSEGETCCCAKSFLGICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTR 407 Query: 1527 RNLCLKKTGNSTLFKQLQNRGLVGDLGNWNS 1619 R CLK+T N T +N+ W S Sbjct: 408 RGQCLKRTANGTTTITSENQDFSHKSRGWKS 438 >ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp. vesca] Length = 441 Score = 546 bits (1408), Expect = e-153 Identities = 268/422 (63%), Positives = 320/422 (75%), Gaps = 7/422 (1%) Frame = +3 Query: 321 LWSFWTIVLLFH---VSICSSSLTADLFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYIT 491 L S T++LL +S SSS +++LFE WCK+YGK+YSS++EKLYRL +FE N +IT Sbjct: 5 LLSLLTLLLLLSHPCLSSSSSSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFIT 64 Query: 492 QHNSKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDNDI 671 QHN GNS+YTLSLN+F+DLTHHEFKA LG S + +RL R P + + Sbjct: 65 QHNDLGNSSYTLSLNSFSDLTHHEFKASRLGFSPT---FLRLYRKSDPKPSVV----RHV 117 Query: 672 PSSLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSY- 848 PSS+DWR+ GAVTNVKDQGSCGACW+FSATGA+EGINKIVTGSLVSLSEQELIDCDR Y Sbjct: 118 PSSIDWRKNGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYP 177 Query: 849 NSGCGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNE 1028 NSGC GGLMD A++F++ N GIDTE+DY +QG DGTC++ KLKRHVVTIDGY DV NNE Sbjct: 178 NSGCNGGLMDDAFQFIIDNNGIDTEEDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNE 237 Query: 1029 KELLQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVK 1208 ++LL+AVA QPVSVGI GS R FQ YS+GIF+GPCST+LDHAVLIVGY S+NGVDYWIVK Sbjct: 238 EQLLKAVATQPVSVGIAGSGREFQFYSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVK 297 Query: 1209 NSWGTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSN---XXXXXXXXXVKCNLFS 1379 NSWG +WG+NGY+HI R + NS+G+CGINMLASYP KT N KC+LFS Sbjct: 298 NSWGKNWGMNGYIHILRDHSNSKGLCGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFS 357 Query: 1380 SCPGGETCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNS 1559 C GETCCC+ K+LGIC SW+CC+ SAV YPICDT+RN CL+ GN Sbjct: 358 KCGVGETCCCARKILGICLSWRCCEFTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNL 417 Query: 1560 TL 1565 T+ Sbjct: 418 TM 419 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 545 bits (1405), Expect = e-152 Identities = 264/434 (60%), Positives = 310/434 (71%), Gaps = 4/434 (0%) Frame = +3 Query: 330 FWTIVLLFHVSICSSSLTAD---LFENWCKEYGKTYSSEQEKLYRLRVFEDNYDYITQHN 500 F + LL +S+ S D LF+ WCK++GKTY SEQEK YR VFEDNY ++ QHN Sbjct: 6 FMFLQLLLSLSLLSFVTAIDTSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHN 65 Query: 501 SKGNSTYTLSLNAFADLTHHEFKAKYLGLSASANDLIRLNRGLLPFGASGAVGDN-DIPS 677 GNS+YTLSLNAFADLTHHEFKA LGL S+ + NR F D +PS Sbjct: 66 QIGNSSYTLSLNAFADLTHHEFKATRLGLPPSSLLRFKFNR----FQDQQRSDDFLQVPS 121 Query: 678 SLDWREKGAVTNVKDQGSCGACWAFSATGAMEGINKIVTGSLVSLSEQELIDCDRSYNSG 857 +DWR+ GAV+ VKDQGSCGACW+FSATGA+EGINKIVTGSLVSLSEQEL+DCD +YNSG Sbjct: 122 EIDWRKNGAVSIVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSG 181 Query: 858 CGGGLMDYAYEFVVKNKGIDTEKDYGFQGRDGTCDRNKLKRHVVTIDGYRDVTPNNEKEL 1037 C GGLMDYAY+F++ N GIDTE+DY +Q R C ++KLKR VVTIDGY DV PN+EK+L Sbjct: 182 CDGGLMDYAYQFIIDNNGIDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKL 241 Query: 1038 LQAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYDSQNGVDYWIVKNSW 1217 L+AVA QPVSVGICGS RAFQLYS+GIF+GPCSTSLDHAVLIVGY S+NGVDYWIVKNSW Sbjct: 242 LKAVAVQPVSVGICGSARAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSW 301 Query: 1218 GTSWGLNGYVHIARSNGNSEGVCGINMLASYPVKTSSNXXXXXXXXXVKCNLFSSCPGGE 1397 G WG+NGY+H+ R+ +S G+CGINMLASYP KT N +KCNLF+ C GGE Sbjct: 302 GKYWGMNGYIHMLRNTDSSAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGE 361 Query: 1398 TCCCSWKLLGICFSWKCCQLDSAVXXXXXXXXXXXXYPICDTKRNLCLKKTGNSTLFKQL 1577 TCCC+ K LGICFSWKCC + SAV YP+CD CLK+ N T+ Sbjct: 362 TCCCAKKFLGICFSWKCCGVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTS 421 Query: 1578 QNRGLVGDLGNWNS 1619 +W S Sbjct: 422 DKEDPFHQTRDWRS 435