BLASTX nr result

ID: Cephaelis21_contig00002844 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00002844
         (1400 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]               325   2e-86
ref|XP_002298740.1| predicted protein [Populus trichocarpa] gi|2...   323   9e-86
gb|AEZ65082.1| cysteine protease [Carica papaya]                      318   3e-84
ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine pro...   313   7e-83
ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|2...   313   9e-83

>dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  325 bits (833), Expect = 2e-86
 Identities = 153/307 (49%), Positives = 212/307 (69%), Gaps = 3/307 (0%)
 Frame = +3

Query: 3   FESWRVQNGRLYTAIGEKEKRFEIFKDNLKYITEQNSQGHPYELGLTKFVDLTNEEYIEK 182
           +ESW V++G+ Y A+GEK++RF+IFKDNL++I E NS  H Y+LGL KF DLTNEEY   
Sbjct: 52  YESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMT 111

Query: 183 YG--RGLDPAGPLRKNISNHLPLKAGVAAPYYVDWREQGAVTMVIDQGECGSCWAWATVG 356
           Y   + +D    L K  S+    ++G + P YVDWREQGAVT V DQG CGSCWA++T G
Sbjct: 112 YTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTG 171

Query: 357 AIEGIYRIVNGPLVRLSAQELLDC-VPQSLGCGGGWPDRALDWIIRNGGIDTADDYPYQW 533
           ++EG+ +IV G L+ +S QEL++C    + GC GG  D A ++II+NGGIDT +DYPY  
Sbjct: 172 SVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTG 231

Query: 534 YQQQCNDDKRLNNRVLTIDSYGRLPSNDEQTMMQGVALQPVIACVDIHSRNWQLYRGGIF 713
              +C+ +K+ N +V+TIDSY  +P NDE ++ + V+ QPV   ++   R++Q Y  GIF
Sbjct: 232 KDGKCDKNKK-NAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIF 290

Query: 714 TADCGTQINHALIIIGYGTVSGYDYWLVKNTWGTNWGENGFMRILRNTYDRTGKCGIAML 893
           T  CGT ++H ++  GYGT  G DYWLVKN+WG  WGE G++++ RN  D++GKCGIAM 
Sbjct: 291 TGSCGTALDHGVLAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAME 350

Query: 894 VYFPTIN 914
             +P  N
Sbjct: 351 ASYPIKN 357


>ref|XP_002298740.1| predicted protein [Populus trichocarpa] gi|222845998|gb|EEE83545.1|
           predicted protein [Populus trichocarpa]
          Length = 455

 Score =  323 bits (827), Expect = 9e-86
 Identities = 163/307 (53%), Positives = 210/307 (68%), Gaps = 5/307 (1%)
 Frame = +3

Query: 3   FESWRVQNGRLYTAIGEKEKRFEIFKDNLKYITEQNSQGHP-YELGLTKFVDLTNEEYIE 179
           +E W V++GR Y A+GEKE+RFEIFKDNLK+I E NS G+P Y+LGL KF DL+N+EY  
Sbjct: 25  YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYRS 84

Query: 180 KY-GRGLDPAGPLRKNISNHLPL-KAGVAAPYYVDWREQGAVTMVIDQGECGSCWAWATV 353
            Y G  +D  G L     +   L K G   P  VDWRE+GAV  V DQG+CGSCWA++TV
Sbjct: 85  VYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGSCWAFSTV 144

Query: 354 GAIEGIYRIVNGPLVRLSAQELLDC-VPQSLGCGGGWPDRALDWIIRNGGIDTADDYPYQ 530
           GA+EGI +IV G L  LS QEL+DC    +LGC GG  D A D+II NGGIDT +DYPY+
Sbjct: 145 GAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGIDTEEDYPYK 204

Query: 531 WYQQQCNDDKRLNNRVLTIDSYGRLPSNDEQTMMQGVALQPVIACVDIHSRNWQLYRGGI 710
                C D  R N RV+TID Y  +P NDE+++ + VA QPV   ++   R +QLY+ G+
Sbjct: 205 AIDSMC-DPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQLYQSGV 263

Query: 711 FTADCGTQINHALIIIGYGTVSGYDYWLVKNTWGTNWGENGFMRILRNTYD-RTGKCGIA 887
           FT  CGTQ++H ++ +GYGT  G DYW+V+N+WG  WGENG++R+ R+     TGKCGIA
Sbjct: 264 FTGSCGTQLDHGVVTVGYGTEHGVDYWIVRNSWGPAWGENGYIRMERDVASTETGKCGIA 323

Query: 888 MLVYFPT 908
           M   +PT
Sbjct: 324 MEASYPT 330


>gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score =  318 bits (814), Expect = 3e-84
 Identities = 159/308 (51%), Positives = 211/308 (68%), Gaps = 6/308 (1%)
 Frame = +3

Query: 3   FESWRVQNGRLYTAIGEKEKRFEIFKDNLKYITEQNS-QGHPYELGLTKFVDLTNEEYIE 179
           +E W V++G+ Y AIGEKE+RFEIFKDNL+++ EQNS  G  Y+LGLTKF DLTNEEY  
Sbjct: 52  YEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRA 111

Query: 180 KY-GRGLDPAGPLRKNISNHLPLKAGVA--APYYVDWREQGAVTMVIDQGECGSCWAWAT 350
            Y G  ++    LR   S     KAG     P +VDWRE+GAVT V DQG+CGSCWA++T
Sbjct: 112 MYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFST 171

Query: 351 VGAIEGIYRIVNGPLVRLSAQELLDC-VPQSLGCGGGWPDRALDWIIRNGGIDTADDYPY 527
           VG++EGI +IV G L+ LS QEL+DC    + GC GG  D A ++II+NGGID+  DYPY
Sbjct: 172 VGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSEADYPY 231

Query: 528 QWYQQQCNDDKRLNNRVLTIDSYGRLPSNDEQTMMQGVALQPVIACVDIHSRNWQLYRGG 707
           +     C D  R N  V+TID Y  +P NDE+++ + VA QPV   ++   R +QLY+ G
Sbjct: 232 RASDNMC-DSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSG 290

Query: 708 IFTADCGTQINHALIIIGYGTVSGYDYWLVKNTWGTNWGENGFMRILRNTYDR-TGKCGI 884
           +FT  CGT ++H ++ +GYGT +G DYW+V+N+WG  WGE+G++R+ RN     TGKCGI
Sbjct: 291 VFTGRCGTNLDHGVVAVGYGTENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGI 350

Query: 885 AMLVYFPT 908
           AM   +PT
Sbjct: 351 AMEASYPT 358


>ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score =  313 bits (802), Expect = 7e-83
 Identities = 154/312 (49%), Positives = 212/312 (67%), Gaps = 8/312 (2%)
 Frame = +3

Query: 3   FESWRVQNGRLYTAIGEKEKRFEIFKDNLKYITEQNSQGHP-YELGLTKFVDLTNEEYIE 179
           +E+W V++G+ Y A+GEKE+RF+IFKDNL++I E N  G   Y+LGL KF DLTNEEY  
Sbjct: 48  YEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEYRA 107

Query: 180 KYGRGLDPAGPLRKNI-----SNHLPLKAGVAAPYYVDWREQGAVTMVIDQGECGSCWAW 344
            +  G    GP  K       ++    +AG   P  VDWRE+GAVT + DQG+CGSCWA+
Sbjct: 108 MF-LGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQCGSCWAF 166

Query: 345 ATVGAIEGIYRIVNGPLVRLSAQELLDCVP-QSLGCGGGWPDRALDWIIRNGGIDTADDY 521
           +TVGA+EGI +IV G L  LS QEL+DC    ++GC GG  D A ++I++NGGIDT +DY
Sbjct: 167 STVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQNGGIDTEEDY 226

Query: 522 PYQWYQQQCNDDKRLNNRVLTIDSYGRLPSNDEQTMMQGVALQPVIACVDIHSRNWQLYR 701
           PY      C D  R N RV+TID Y  +P+NDE+++M+ VA QPV   ++     +QLY+
Sbjct: 227 PYHAKDNTC-DPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAGGMEFQLYQ 285

Query: 702 GGIFTADCGTQINHALIIIGYGTVSGYDYWLVKNTWGTNWGENGFMRILRNTYD-RTGKC 878
            G+FT  CGT ++H ++ +GYGT +G DYWLV+N+WG+ WGENG++++ RN  +  TGKC
Sbjct: 286 SGVFTGRCGTNLDHGVVAVGYGTENGTDYWLVRNSWGSAWGENGYIKLERNVQNTETGKC 345

Query: 879 GIAMLVYFPTIN 914
           GIA+   +P  N
Sbjct: 346 GIAIEASYPIKN 357


>ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|222849544|gb|EEE87091.1|
           predicted protein [Populus trichocarpa]
          Length = 477

 Score =  313 bits (801), Expect = 9e-83
 Identities = 158/307 (51%), Positives = 209/307 (68%), Gaps = 5/307 (1%)
 Frame = +3

Query: 3   FESWRVQNGRLYTAIGEKEKRFEIFKDNLKYITEQNSQGHP-YELGLTKFVDLTNEEYIE 179
           +E W V+ G+ Y A+GEKE+RFEIFKDNLK++ + NS G+P Y+LGL KF DL+NEEY  
Sbjct: 49  YEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRA 108

Query: 180 KY-GRGLDPAGPLRKNISNHLPL-KAGVAAPYYVDWREQGAVTMVIDQGECGSCWAWATV 353
            Y G  +D    L     +   L K G   P  VDWRE+GAV  V DQG+CGSCWA++TV
Sbjct: 109 AYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTV 168

Query: 354 GAIEGIYRIVNGPLVRLSAQELLDCVP-QSLGCGGGWPDRALDWIIRNGGIDTADDYPYQ 530
           GA+EGI +IV G L  LS QEL+DC    + GC GG  D A ++I++NGGIDT +DYPY+
Sbjct: 169 GAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYK 228

Query: 531 WYQQQCNDDKRLNNRVLTIDSYGRLPSNDEQTMMQGVALQPVIACVDIHSRNWQLYRGGI 710
                C D  R N RV+TID Y  +P NDE+++ + VA QPV   ++   R +QLY+ G+
Sbjct: 229 AVDSMC-DPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGV 287

Query: 711 FTADCGTQINHALIIIGYGTVSGYDYWLVKNTWGTNWGENGFMRILRNTYD-RTGKCGIA 887
           FT  CGTQ++H ++ +GYGT +G DYW+V+N+WG  WGENG++R+ RN     TGKCGIA
Sbjct: 288 FTGSCGTQLDHGVVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIA 347

Query: 888 MLVYFPT 908
           M   +PT
Sbjct: 348 MEASYPT 354


Top