BLASTX nr result
ID: Cephaelis21_contig00002844
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00002844 (1400 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] 325 2e-86 ref|XP_002298740.1| predicted protein [Populus trichocarpa] gi|2... 323 9e-86 gb|AEZ65082.1| cysteine protease [Carica papaya] 318 3e-84 ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine pro... 313 7e-83 ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|2... 313 9e-83 >dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus] Length = 461 Score = 325 bits (833), Expect = 2e-86 Identities = 153/307 (49%), Positives = 212/307 (69%), Gaps = 3/307 (0%) Frame = +3 Query: 3 FESWRVQNGRLYTAIGEKEKRFEIFKDNLKYITEQNSQGHPYELGLTKFVDLTNEEYIEK 182 +ESW V++G+ Y A+GEK++RF+IFKDNL++I E NS H Y+LGL KF DLTNEEY Sbjct: 52 YESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMT 111 Query: 183 YG--RGLDPAGPLRKNISNHLPLKAGVAAPYYVDWREQGAVTMVIDQGECGSCWAWATVG 356 Y + +D L K S+ ++G + P YVDWREQGAVT V DQG CGSCWA++T G Sbjct: 112 YTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTG 171 Query: 357 AIEGIYRIVNGPLVRLSAQELLDC-VPQSLGCGGGWPDRALDWIIRNGGIDTADDYPYQW 533 ++EG+ +IV G L+ +S QEL++C + GC GG D A ++II+NGGIDT +DYPY Sbjct: 172 SVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTG 231 Query: 534 YQQQCNDDKRLNNRVLTIDSYGRLPSNDEQTMMQGVALQPVIACVDIHSRNWQLYRGGIF 713 +C+ +K+ N +V+TIDSY +P NDE ++ + V+ QPV ++ R++Q Y GIF Sbjct: 232 KDGKCDKNKK-NAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIF 290 Query: 714 TADCGTQINHALIIIGYGTVSGYDYWLVKNTWGTNWGENGFMRILRNTYDRTGKCGIAML 893 T CGT ++H ++ GYGT G DYWLVKN+WG WGE G++++ RN D++GKCGIAM Sbjct: 291 TGSCGTALDHGVLAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAME 350 Query: 894 VYFPTIN 914 +P N Sbjct: 351 ASYPIKN 357 >ref|XP_002298740.1| predicted protein [Populus trichocarpa] gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa] Length = 455 Score = 323 bits (827), Expect = 9e-86 Identities = 163/307 (53%), Positives = 210/307 (68%), Gaps = 5/307 (1%) Frame = +3 Query: 3 FESWRVQNGRLYTAIGEKEKRFEIFKDNLKYITEQNSQGHP-YELGLTKFVDLTNEEYIE 179 +E W V++GR Y A+GEKE+RFEIFKDNLK+I E NS G+P Y+LGL KF DL+N+EY Sbjct: 25 YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYRS 84 Query: 180 KY-GRGLDPAGPLRKNISNHLPL-KAGVAAPYYVDWREQGAVTMVIDQGECGSCWAWATV 353 Y G +D G L + L K G P VDWRE+GAV V DQG+CGSCWA++TV Sbjct: 85 VYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGSCWAFSTV 144 Query: 354 GAIEGIYRIVNGPLVRLSAQELLDC-VPQSLGCGGGWPDRALDWIIRNGGIDTADDYPYQ 530 GA+EGI +IV G L LS QEL+DC +LGC GG D A D+II NGGIDT +DYPY+ Sbjct: 145 GAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGIDTEEDYPYK 204 Query: 531 WYQQQCNDDKRLNNRVLTIDSYGRLPSNDEQTMMQGVALQPVIACVDIHSRNWQLYRGGI 710 C D R N RV+TID Y +P NDE+++ + VA QPV ++ R +QLY+ G+ Sbjct: 205 AIDSMC-DPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQLYQSGV 263 Query: 711 FTADCGTQINHALIIIGYGTVSGYDYWLVKNTWGTNWGENGFMRILRNTYD-RTGKCGIA 887 FT CGTQ++H ++ +GYGT G DYW+V+N+WG WGENG++R+ R+ TGKCGIA Sbjct: 264 FTGSCGTQLDHGVVTVGYGTEHGVDYWIVRNSWGPAWGENGYIRMERDVASTETGKCGIA 323 Query: 888 MLVYFPT 908 M +PT Sbjct: 324 MEASYPT 330 >gb|AEZ65082.1| cysteine protease [Carica papaya] Length = 471 Score = 318 bits (814), Expect = 3e-84 Identities = 159/308 (51%), Positives = 211/308 (68%), Gaps = 6/308 (1%) Frame = +3 Query: 3 FESWRVQNGRLYTAIGEKEKRFEIFKDNLKYITEQNS-QGHPYELGLTKFVDLTNEEYIE 179 +E W V++G+ Y AIGEKE+RFEIFKDNL+++ EQNS G Y+LGLTKF DLTNEEY Sbjct: 52 YEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRA 111 Query: 180 KY-GRGLDPAGPLRKNISNHLPLKAGVA--APYYVDWREQGAVTMVIDQGECGSCWAWAT 350 Y G ++ LR S KAG P +VDWRE+GAVT V DQG+CGSCWA++T Sbjct: 112 MYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFST 171 Query: 351 VGAIEGIYRIVNGPLVRLSAQELLDC-VPQSLGCGGGWPDRALDWIIRNGGIDTADDYPY 527 VG++EGI +IV G L+ LS QEL+DC + GC GG D A ++II+NGGID+ DYPY Sbjct: 172 VGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSEADYPY 231 Query: 528 QWYQQQCNDDKRLNNRVLTIDSYGRLPSNDEQTMMQGVALQPVIACVDIHSRNWQLYRGG 707 + C D R N V+TID Y +P NDE+++ + VA QPV ++ R +QLY+ G Sbjct: 232 RASDNMC-DSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSG 290 Query: 708 IFTADCGTQINHALIIIGYGTVSGYDYWLVKNTWGTNWGENGFMRILRNTYDR-TGKCGI 884 +FT CGT ++H ++ +GYGT +G DYW+V+N+WG WGE+G++R+ RN TGKCGI Sbjct: 291 VFTGRCGTNLDHGVVAVGYGTENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGI 350 Query: 885 AMLVYFPT 908 AM +PT Sbjct: 351 AMEASYPT 358 >ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like [Glycine max] Length = 466 Score = 313 bits (802), Expect = 7e-83 Identities = 154/312 (49%), Positives = 212/312 (67%), Gaps = 8/312 (2%) Frame = +3 Query: 3 FESWRVQNGRLYTAIGEKEKRFEIFKDNLKYITEQNSQGHP-YELGLTKFVDLTNEEYIE 179 +E+W V++G+ Y A+GEKE+RF+IFKDNL++I E N G Y+LGL KF DLTNEEY Sbjct: 48 YEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEYRA 107 Query: 180 KYGRGLDPAGPLRKNI-----SNHLPLKAGVAAPYYVDWREQGAVTMVIDQGECGSCWAW 344 + G GP K ++ +AG P VDWRE+GAVT + DQG+CGSCWA+ Sbjct: 108 MF-LGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQCGSCWAF 166 Query: 345 ATVGAIEGIYRIVNGPLVRLSAQELLDCVP-QSLGCGGGWPDRALDWIIRNGGIDTADDY 521 +TVGA+EGI +IV G L LS QEL+DC ++GC GG D A ++I++NGGIDT +DY Sbjct: 167 STVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQNGGIDTEEDY 226 Query: 522 PYQWYQQQCNDDKRLNNRVLTIDSYGRLPSNDEQTMMQGVALQPVIACVDIHSRNWQLYR 701 PY C D R N RV+TID Y +P+NDE+++M+ VA QPV ++ +QLY+ Sbjct: 227 PYHAKDNTC-DPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAGGMEFQLYQ 285 Query: 702 GGIFTADCGTQINHALIIIGYGTVSGYDYWLVKNTWGTNWGENGFMRILRNTYD-RTGKC 878 G+FT CGT ++H ++ +GYGT +G DYWLV+N+WG+ WGENG++++ RN + TGKC Sbjct: 286 SGVFTGRCGTNLDHGVVAVGYGTENGTDYWLVRNSWGSAWGENGYIKLERNVQNTETGKC 345 Query: 879 GIAMLVYFPTIN 914 GIA+ +P N Sbjct: 346 GIAIEASYPIKN 357 >ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa] Length = 477 Score = 313 bits (801), Expect = 9e-83 Identities = 158/307 (51%), Positives = 209/307 (68%), Gaps = 5/307 (1%) Frame = +3 Query: 3 FESWRVQNGRLYTAIGEKEKRFEIFKDNLKYITEQNSQGHP-YELGLTKFVDLTNEEYIE 179 +E W V+ G+ Y A+GEKE+RFEIFKDNLK++ + NS G+P Y+LGL KF DL+NEEY Sbjct: 49 YEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRA 108 Query: 180 KY-GRGLDPAGPLRKNISNHLPL-KAGVAAPYYVDWREQGAVTMVIDQGECGSCWAWATV 353 Y G +D L + L K G P VDWRE+GAV V DQG+CGSCWA++TV Sbjct: 109 AYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTV 168 Query: 354 GAIEGIYRIVNGPLVRLSAQELLDCVP-QSLGCGGGWPDRALDWIIRNGGIDTADDYPYQ 530 GA+EGI +IV G L LS QEL+DC + GC GG D A ++I++NGGIDT +DYPY+ Sbjct: 169 GAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYK 228 Query: 531 WYQQQCNDDKRLNNRVLTIDSYGRLPSNDEQTMMQGVALQPVIACVDIHSRNWQLYRGGI 710 C D R N RV+TID Y +P NDE+++ + VA QPV ++ R +QLY+ G+ Sbjct: 229 AVDSMC-DPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGV 287 Query: 711 FTADCGTQINHALIIIGYGTVSGYDYWLVKNTWGTNWGENGFMRILRNTYD-RTGKCGIA 887 FT CGTQ++H ++ +GYGT +G DYW+V+N+WG WGENG++R+ RN TGKCGIA Sbjct: 288 FTGSCGTQLDHGVVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIA 347 Query: 888 MLVYFPT 908 M +PT Sbjct: 348 MEASYPT 354