BLASTX nr result
ID: Catharanthus22_contig00021055
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00021055 (594 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EOX94180.1| Cysteine proteinases superfamily protein, putativ... 232 5e-59 ref|XP_002525147.1| cysteine protease, putative [Ricinus communi... 229 4e-58 sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44;... 219 4e-55 gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus... 219 4e-55 gb|EOY22313.1| Cysteine proteinases superfamily protein isoform ... 219 6e-55 ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arab... 218 1e-54 ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula] gi... 218 1e-54 ref|XP_004505522.1| PREDICTED: KDEL-tailed cysteine endopeptidas... 216 3e-54 ref|NP_563764.1| Cysteine proteinases superfamily protein [Arabi... 216 5e-54 dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis] 214 2e-53 ref|XP_006852216.1| hypothetical protein AMTR_s00049p00133800 [A... 213 2e-53 gb|ABR19828.1| cysteine proteinase [Elaeis guineensis] 213 3e-53 ref|XP_006411912.1| hypothetical protein EUTSA_v10025483mg [Eutr... 212 5e-53 ref|XP_006487026.1| PREDICTED: cysteine proteinase RD21a-like [C... 212 7e-53 ref|XP_006417940.1| hypothetical protein EUTSA_v10008102mg [Eutr... 211 9e-53 dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] 211 9e-53 ref|XP_006393620.1| hypothetical protein EUTSA_v10011464mg [Eutr... 211 1e-52 ref|XP_002523448.1| cysteine protease, putative [Ricinus communi... 211 1e-52 ref|XP_006422960.1| hypothetical protein CICLE_v10028338mg [Citr... 210 2e-52 ref|XP_004299172.1| PREDICTED: vignain-like [Fragaria vesca subs... 210 2e-52 >gb|EOX94180.1| Cysteine proteinases superfamily protein, putative [Theobroma cacao] Length = 340 Score = 232 bits (592), Expect = 5e-59 Identities = 105/173 (60%), Positives = 130/173 (75%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+L+DCD N +NQGC GG MEKA+++I+KNGG+T E +YPY G++G C K +N Sbjct: 168 SLSEQELIDCDVNNENQGCKGGYMEKAYEFIIKNGGITTEENYPYIGEDGICDEIKARNL 227 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 A +I+GY+TVP+ NE +L+ AV+ QPVSVAIDAGG FQLY GIF+GFCG +LNH Sbjct: 228 AVAISGYKTVPVNNERSLQDAVAHQPVSVAIDAGGYEFQLYSEGIFTGFCGNQLNHGVTV 287 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVKN 521 RKYW+VKNSWGT WGE GYIRM+R DK G CGIA+E SYPVK+ Sbjct: 288 VGYGEDGGRKYWLVKNSWGTSWGESGYIRMQRDFTDKRGICGIAMEASYPVKS 340 >ref|XP_002525147.1| cysteine protease, putative [Ricinus communis] gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis] Length = 347 Score = 229 bits (584), Expect = 4e-58 Identities = 108/173 (62%), Positives = 123/173 (71%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD N DN+GCNGG MEKAF +I GGLT E DYPY G +G C AK N+ Sbjct: 175 SLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPYKGTDGSCEKAKTDNH 234 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 A I GYETVP NE +LKVAVS QPVSVAIDA G FQLY G+FSG+CG +LNH Sbjct: 235 AVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGVFSGYCGIQLNHGVTI 294 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVKN 521 +KYW+VKNSWG WGE GYIRM+R D G CGIA+EPSYP+K+ Sbjct: 295 VGYGDNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGIAMEPSYPIKD 347 >sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor Length = 328 Score = 219 bits (558), Expect = 4e-55 Identities = 106/172 (61%), Positives = 125/172 (72%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD + NQGCNGGLM+ AF +I+KNGGL E DYPY G NG+C + + + Sbjct: 146 SLSEQELVDCDKSY-NQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLKNSR 204 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I+GYE VP K+ETALK AVS QPVSVAIDAGG FQ Y GIF+G CGT ++H Sbjct: 205 VVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHAVVA 264 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVK 518 YWIV+NSWGT WGEDGYIRMER + K+GKCGIA+E SYPVK Sbjct: 265 VGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIAIEASYPVK 316 >gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid cycling base population CrGC5, Peptide, 328 aa] Length = 328 Score = 219 bits (558), Expect = 4e-55 Identities = 106/172 (61%), Positives = 125/172 (72%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD + NQGCNGGLM+ AF +I+KNGGL E DYPY G NG+C + + + Sbjct: 146 SLSEQELVDCDKSY-NQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLKNSR 204 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I+GYE VP K+ETALK AVS QPVSVAIDAGG FQ Y GIF+G CGT ++H Sbjct: 205 VVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHAVVA 264 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVK 518 YWIV+NSWGT WGEDGYIRMER + K+GKCGIA+E SYPVK Sbjct: 265 VGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIAIEASYPVK 316 >gb|EOY22313.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] Length = 362 Score = 219 bits (557), Expect = 6e-55 Identities = 107/175 (61%), Positives = 128/175 (73%), Gaps = 1/175 (0%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD E+NQGCNGGLM+ AFD+I K GG+T ET+YPY ++G C +KE + Sbjct: 174 SLSEQELVDCD-TEENQGCNGGLMDIAFDFIQKKGGITTETNYPYEAEDGTCDVSKENSP 232 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNH-XXX 359 A SI+G+E VP NE AL AV+ QPVSVAIDAGG+ FQ Y G+F+G CGT LNH Sbjct: 233 AVSIDGHENVPANNEDALLKAVAHQPVSVAIDAGGMDFQFYSEGVFTGQCGTELNHGVAA 292 Query: 360 XXXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVKNA 524 KYWIVKNSWG EWGE G+IR+ERGI DK G CGIA+E SYP+KN+ Sbjct: 293 VGYGTTLDGTKYWIVKNSWGPEWGEKGFIRIERGIKDKKGLCGIAMESSYPIKNS 347 >ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp. lyrata] gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp. lyrata] Length = 343 Score = 218 bits (554), Expect = 1e-54 Identities = 101/172 (58%), Positives = 123/172 (71%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQQL+DCD N+GC+GGLME AF++I NGGLT ETDYPY G G C K KN Sbjct: 173 SLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQEKAKNK 232 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I GY+ V +NE +L++A + QPVSV IDAGG +FQLY G+F+ +CGT LNH Sbjct: 233 VVTIQGYQKVA-QNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTV 291 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVK 518 +KYWIVKNSWGT WGE+GYIRMERGI + TGKCGIA+ SYP++ Sbjct: 292 VGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPLQ 343 >ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula] gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula] gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula] gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula] Length = 345 Score = 218 bits (554), Expect = 1e-54 Identities = 100/171 (58%), Positives = 118/171 (69%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+L+DCD NQGC GGLME A+ +I++NGGLT E DYPY G +G C K +Y Sbjct: 174 SLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHY 233 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 AASI+GYE VP NE LK A + QPVSVAIDAGG FQ Y G+FSG CG +LNH Sbjct: 234 AASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTV 293 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPV 515 KYWIVKNSWG +WGE GYIRM+R + K G CGIA++ SYP+ Sbjct: 294 VGYGKETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344 >ref|XP_004505522.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like [Cicer arietinum] Length = 343 Score = 216 bits (551), Expect = 3e-54 Identities = 104/171 (60%), Positives = 118/171 (69%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD NQGC GGLME AF +IVKNGGLT E +YPY G +G C K ++ Sbjct: 172 SLSEQELVDCDVKNGNQGCEGGLMETAFTFIVKNGGLTTEKEYPYEGVDGTCNKEKAAHH 231 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 A SI+GYE VP +NE LK A S QPVSVAIDAGG FQLY G+FSG CG +LNH Sbjct: 232 AVSISGYEEVPAENEAKLKAAASHQPVSVAIDAGGYSFQLYSEGVFSGICGKQLNHAVTV 291 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPV 515 KYWIVKNSWGTEWGE GY++++R DK G CGIA SYPV Sbjct: 292 VGYGEVNSDKYWIVKNSWGTEWGESGYMKIKRDTFDKDGLCGIAKLASYPV 342 >ref|NP_563764.1| Cysteine proteinases superfamily protein [Arabidopsis thaliana] gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus vulgaris gb|U52970 and is a member of the papain cysteine protease family PF|00112 [Arabidopsis thaliana] gi|332189848|gb|AEE27969.1| Cysteine proteinases superfamily protein [Arabidopsis thaliana] Length = 343 Score = 216 bits (549), Expect = 5e-54 Identities = 99/172 (57%), Positives = 122/172 (70%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQQL+DCD N+GC+GGLME AF++I NGGL ETDYPY G G C K KN Sbjct: 173 SLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNK 232 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I GY+ V +NE +L++A + QPVSV IDAGG +FQLY G+F+ +CGT LNH Sbjct: 233 VVTIQGYQKVA-QNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTV 291 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVK 518 +KYWIVKNSWGT WGE+GYIRMERG+ + TGKCGIA+ SYP++ Sbjct: 292 VGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPLQ 343 >dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis] Length = 431 Score = 214 bits (544), Expect = 2e-53 Identities = 99/173 (57%), Positives = 122/173 (70%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD + N+GCNGGLM+ AF++I+KNGG+ E DYPY G +G+C ++ Sbjct: 172 SLSEQELVDCDTSY-NEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVDGRCDQTRKNAK 230 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I+ YE VP +E +LK A+S QP+SVAI+ GG FQLYD GIF G CGT L+H Sbjct: 231 VVTIDSYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVA 290 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVKN 521 + YWIVKNSWGT WGE GYIRMER I GKCGIA+EPSYP+KN Sbjct: 291 VGYGTENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPIKN 343 >ref|XP_006852216.1| hypothetical protein AMTR_s00049p00133800 [Amborella trichopoda] gi|548855820|gb|ERN13683.1| hypothetical protein AMTR_s00049p00133800 [Amborella trichopoda] Length = 359 Score = 213 bits (543), Expect = 2e-53 Identities = 102/175 (58%), Positives = 124/175 (70%), Gaps = 1/175 (0%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+L+DCD N+DNQGCNGGLM+ AFDYI KNGGLT E +YPY ++G C KE ++ Sbjct: 173 SLSEQELIDCD-NQDNQGCNGGLMDYAFDYIKKNGGLTTEENYPYVAEDGTCDANKENSH 231 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNH-XXX 359 I+G+E VP+ NE AL A + QPVSVAIDAGG FQ Y G+F G CGT L+H Sbjct: 232 VVVIDGHENVPVNNEDALLKATAHQPVSVAIDAGGEAFQFYSDGVFDGNCGTELDHGVAV 291 Query: 360 XXXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVKNA 524 KYWIVKNSWG EWGE GYIRM+RG+ + G CGIA+E SYP+K++ Sbjct: 292 VGYGTTVDGTKYWIVKNSWGPEWGEKGYIRMKRGVSAQEGLCGIAMEASYPIKSS 346 >gb|ABR19828.1| cysteine proteinase [Elaeis guineensis] Length = 469 Score = 213 bits (542), Expect = 3e-53 Identities = 98/173 (56%), Positives = 126/173 (72%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD NQGCNGGLM+ AF++I+ NGG+ + DYPY G++G C ++ + Sbjct: 186 SLSEQELVDCD-TYYNQGCNGGLMDYAFEFIISNGGIDTDEDYPYTGRDGSCDQYRKNAH 244 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I+ YE VPI +E +L+ AV+ QPVSVAI+AGG FQLY+ GIF+G+CGT L+H Sbjct: 245 VVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAGGRAFQLYESGIFTGYCGTELDHGVTA 304 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVKN 521 + YWIVKNSWG++WGE GYIRMER I TGKCGIA+E SYP+KN Sbjct: 305 IGYGSENGKYYWIVKNSWGSDWGESGYIRMERNINSATGKCGIAMEASYPIKN 357 >ref|XP_006411912.1| hypothetical protein EUTSA_v10025483mg [Eutrema salsugineum] gi|557113082|gb|ESQ53365.1| hypothetical protein EUTSA_v10025483mg [Eutrema salsugineum] Length = 376 Score = 212 bits (540), Expect = 5e-53 Identities = 107/174 (61%), Positives = 125/174 (71%), Gaps = 2/174 (1%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD + NQGCNGGLM+ AF +I+KNGGL E DYPY G NG+C + +KN Sbjct: 192 SLSEQELVDCDRSY-NQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSNGKCNSLVKKNS 250 Query: 183 -AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXX 359 SI+GYE VP KNE ALK AVS QPVSVAI+AGG +FQ Y GIF+G CGT ++H Sbjct: 251 RVVSIDGYEDVPTKNEMALKRAVSYQPVSVAIEAGGRVFQHYQSGIFTGKCGTNMDHAVV 310 Query: 360 XXXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGII-DKTGKCGIALEPSYPVK 518 YWIV+NSWG WGEDGYIRMER + K+GKCGIA+E SYPVK Sbjct: 311 AVGYGSENGVDYWIVRNSWGPRWGEDGYIRMERNLASSKSGKCGIAIEASYPVK 364 >ref|XP_006487026.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 472 Score = 212 bits (539), Expect = 7e-53 Identities = 97/172 (56%), Positives = 124/172 (72%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD + NQGCNGGLM+ AF +I+KNGG+ E DYPY +G C ++ + Sbjct: 184 SLSEQELVDCD-KQYNQGCNGGLMDYAFKFIIKNGGIDTEEDYPYKATDGSCDPNRKNAH 242 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I+GYE VP +E +L+ AV+ QPVSVAI+AGG+ FQLY+ G+F+G CGT L+H Sbjct: 243 VVTIDGYEDVPQNDEKSLQKAVASQPVSVAIEAGGMAFQLYNSGVFTGICGTELDHGVIA 302 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVK 518 YWIV+NSWG +WGE GYIRMER + KTGKCGIA+EPSYP+K Sbjct: 303 VGYGTDGHLDYWIVRNSWGPDWGESGYIRMERNVNTKTGKCGIAIEPSYPIK 354 >ref|XP_006417940.1| hypothetical protein EUTSA_v10008102mg [Eutrema salsugineum] gi|557095711|gb|ESQ36293.1| hypothetical protein EUTSA_v10008102mg [Eutrema salsugineum] Length = 344 Score = 211 bits (538), Expect = 9e-53 Identities = 97/172 (56%), Positives = 121/172 (70%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQQL+DCD N+GC+GGLME A+++I NGGLT ETDYPY G C K KN Sbjct: 174 SLSEQQLIDCDTGTYNKGCSGGLMETAYEFIKTNGGLTTETDYPYTAAEGTCDQEKAKNK 233 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I GY+ V +NE +L++A + QPVSV IDAGG +FQLY G+F+G+CG+ LNH Sbjct: 234 VVTIQGYQKVA-QNEASLRIAAAQQPVSVGIDAGGFIFQLYSSGVFTGYCGSNLNHAVTV 292 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVK 518 ++YWIVKNSWGT WGE+GYIRMERG +TGKCGIA+ SYP + Sbjct: 293 VGYGEEGGQQYWIVKNSWGTGWGEEGYIRMERGYSQETGKCGIAMLASYPTQ 344 >dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] Length = 461 Score = 211 bits (538), Expect = 9e-53 Identities = 97/173 (56%), Positives = 127/173 (73%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 S+SEQ+LV+CD + NQGCNGGLM+ AF++I+KNGG+ E DYPY G++G+C K+ Sbjct: 186 SVSEQELVNCDTSY-NQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNKKNAK 244 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I+ YE VP+ +E++LK AVS QPV+VAI+AGG FQ Y GIF+G CGT L+H Sbjct: 245 VVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDHGVLA 304 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVKN 521 + YW+VKNSWG EWGE GY++MER I DK+GKCGIA+E SYP+KN Sbjct: 305 AGYGTEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEASYPIKN 357 >ref|XP_006393620.1| hypothetical protein EUTSA_v10011464mg [Eutrema salsugineum] gi|557090198|gb|ESQ30906.1| hypothetical protein EUTSA_v10011464mg [Eutrema salsugineum] Length = 459 Score = 211 bits (537), Expect = 1e-52 Identities = 97/173 (56%), Positives = 124/173 (71%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD + N+GCNGGLM+ AF++I+ NGG+ +E DYPY G +G+C ++ Sbjct: 180 SLSEQELVDCDTSY-NEGCNGGLMDYAFEFIINNGGIDSEEDYPYKGVDGRCDQNRKNAK 238 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I+ YE P +E +LK A+S QP+SVAI+ GG FQLYD GIF G CGT+L+H Sbjct: 239 VVTIDSYEDAPTYSEESLKKALSNQPISVAIEGGGRAFQLYDSGIFDGVCGTQLDHGVVA 298 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVKN 521 + YWIV+NSWGT WGE GYIRMER I D +GKCGIA+EPSYP+KN Sbjct: 299 VGYGTENGKDYWIVRNSWGTSWGESGYIRMERNIKDSSGKCGIAIEPSYPIKN 351 >ref|XP_002523448.1| cysteine protease, putative [Ricinus communis] gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis] Length = 341 Score = 211 bits (537), Expect = 1e-52 Identities = 101/170 (59%), Positives = 119/170 (70%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD + ++QGC GGLM+ AF++I +NGGLT E +YPY G +G C T K N Sbjct: 170 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGND 229 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 AA I GYE VP +E AL AV+ QPVSVAIDA G FQ Y GG+F+G CGT L+H Sbjct: 230 AAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTA 289 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYP 512 KYW+VKNSWGT WGEDGYIRMER I K G CGIA++ SYP Sbjct: 290 VGYGTSDGTKYWLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYP 339 >ref|XP_006422960.1| hypothetical protein CICLE_v10028338mg [Citrus clementina] gi|557524894|gb|ESR36200.1| hypothetical protein CICLE_v10028338mg [Citrus clementina] Length = 479 Score = 210 bits (535), Expect = 2e-52 Identities = 97/172 (56%), Positives = 123/172 (71%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD + NQGCNGGLM+ AF +I+KNGG+ E DYPY +G C ++ + Sbjct: 191 SLSEQELVDCD-KQYNQGCNGGLMDYAFKFIIKNGGIDTEEDYPYKATDGSCDPNRKNAH 249 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNHXXXX 362 +I+GYE VP +E +L+ AV+ QPVSVAI+AGG+ FQLY G+F+G CGT L+H Sbjct: 250 VVTIDGYEDVPQNDEKSLQKAVASQPVSVAIEAGGMAFQLYKSGVFTGTCGTELDHGVIA 309 Query: 363 XXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVK 518 YWIV+NSWG +WGE GYIRMER + KTGKCGIA+EPSYP+K Sbjct: 310 VGYGTDGHLDYWIVRNSWGPDWGESGYIRMERNVNTKTGKCGIAIEPSYPIK 361 >ref|XP_004299172.1| PREDICTED: vignain-like [Fragaria vesca subsp. vesca] Length = 364 Score = 210 bits (535), Expect = 2e-52 Identities = 104/175 (59%), Positives = 123/175 (70%), Gaps = 1/175 (0%) Frame = +3 Query: 3 SLSEQQLVDCDYNEDNQGCNGGLMEKAFDYIVKNGGLTNETDYPYAGQNGQCATAKEKNY 182 SLSEQ+LVDCD E NQGCNGGLME AF++I + GGLT ET+YPY + +C AKE Sbjct: 175 SLSEQELVDCDTKE-NQGCNGGLMELAFEFIKQRGGLTTETNYPYKATDSKCNAAKENTP 233 Query: 183 AASINGYETVPIKNETALKVAVSMQPVSVAIDAGGLLFQLYDGGIFSGFCGTRLNH-XXX 359 A SI+G+E+VP NE L AV+ QP+SVAIDAGG FQ Y G++ G CGT L+H Sbjct: 234 AVSIDGHESVPANNEDELLKAVANQPISVAIDAGGPDFQFYSEGVYDGKCGTELDHGVAI 293 Query: 360 XXXXXXXXXRKYWIVKNSWGTEWGEDGYIRMERGIIDKTGKCGIALEPSYPVKNA 524 KYWIVKNSWG EWGE GYIRM RGI +K GKCGIA+E SYP+KN+ Sbjct: 294 VGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARGIAEKEGKCGIAMEASYPIKNS 348