BLASTX nr result

ID: Scutellaria24_contig00006637

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria24_contig00006637
         (1750 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2...   584   e-164
ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis th...   567   e-159
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   567   e-159
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   566   e-159
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   565   e-158

>ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1|
            predicted protein [Populus trichocarpa]
          Length = 436

 Score =  584 bits (1506), Expect = e-164
 Identities = 268/431 (62%), Positives = 339/431 (78%), Gaps = 1/431 (0%)
 Frame = -3

Query: 1595 IRLLWIWSFLNLILLFSQVPTSKTSSISDSFDLWCQEHGKSYSSEQERQHRLKVFEQNYE 1416
            +  L+I++   LI + S  P++ +S IS  F+ WC+EHGKSY+S++ER HRLKVFE NY+
Sbjct: 1    MNFLYIFALTLLISVLS--PSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYD 58

Query: 1415 LVLAHNARPNSSYTLSLNAFADLTNHEFRSKYLGLSPSHSDLLIRLNSRSPSNEGSDLVS 1236
             V  HN++ NSSY+L+LNAFADLT+HEF++  LGLS +  +L  R      + E + +V 
Sbjct: 59   FVTKHNSKGNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHR------NLEITGVVG 112

Query: 1235 ESDVPSSVDWRDKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCD 1056
              D+P+S+DWR+KG VT VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CD
Sbjct: 113  --DIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECD 170

Query: 1055 KSYNDGCGGGLMDYSYEFIIKNKGIDTEDDYPYRARDGTCDKNRLKRHIVTIDSYADLPS 876
            KSYNDGCGGGLMDY+++F+I N GIDTE+DYPYRARDGTC+K+R+KR +VTID Y D+P 
Sbjct: 171  KSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPE 230

Query: 875  KNEKKLQQAVATQPISVGICGSDSSFQLYSGGIFNGPCSTALDHAVLIVGYASQDGKDYW 696
             NEK+L QAVA QP+SVGICGS+ +FQ+YS GIF GPCST+LDHAVLIVGY S++G DYW
Sbjct: 231  NNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYW 290

Query: 695  IIKNSWGKSWGISGYMHMQRNTGNAEGLCGINLLAXXXXXXXXXXXXXXXXXXTRCNIFT 516
            I+KNSWG  WG+ GYMHMQRN+GN++G+CGIN+LA                  T+CN+ T
Sbjct: 291  IVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLT 350

Query: 515  YCASDETCCCTRSFLGICLNWKCCGAESAVCCKDRKHCCPHDYPICEPTKNLCLKQIGNS 336
            YCA+ ETCCC R F GIC++WKCCG +SAVCCKDR HCCPHDYP+C+  KN+C K+ GN+
Sbjct: 351  YCAAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNA 410

Query: 335  TLVMPI-GKTT 306
            T +  I GKT+
Sbjct: 411  TRMEAIEGKTS 421


>ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| xylem bark cysteine
            peptidase 3 [Arabidopsis thaliana]
          Length = 437

 Score =  567 bits (1461), Expect = e-159
 Identities = 261/427 (61%), Positives = 326/427 (76%), Gaps = 2/427 (0%)
 Frame = -3

Query: 1574 SFLNLILLFSQVPTSKTSS--ISDSFDLWCQEHGKSYSSEQERQHRLKVFEQNYELVLAH 1401
            SF++L   F  + +S +SS  IS+ FD WCQ+HGK+Y SE+ERQ R+++F+ N++ V  H
Sbjct: 7    SFISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQH 66

Query: 1400 NARPNSSYTLSLNAFADLTNHEFRSKYLGLSPSHSDLLIRLNSRSPSNEGSDLVSESDVP 1221
            N   N++Y+LSLNAFADLT+HEF++  LGLS S   +++       +++G  L     VP
Sbjct: 67   NLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVP 119

Query: 1220 SSVDWRDKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYND 1041
             SVDWR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN 
Sbjct: 120  DSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNA 179

Query: 1040 GCGGGLMDYSYEFIIKNKGIDTEDDYPYRARDGTCDKNRLKRHIVTIDSYADLPSKNEKK 861
            GC GGLMDY++EF+IKN GIDTE DYPY+ RDGTC K++LK+ +VTIDSYA + S +EK 
Sbjct: 180  GCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKA 239

Query: 860  LQQAVATQPISVGICGSDSSFQLYSGGIFNGPCSTALDHAVLIVGYASQDGKDYWIIKNS 681
            L +AVA QP+SVGICGS+ +FQLYS GIF+GPCST+LDHAVLIVGY SQ+G DYWI+KNS
Sbjct: 240  LMEAVAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNS 299

Query: 680  WGKSWGISGYMHMQRNTGNAEGLCGINLLAXXXXXXXXXXXXXXXXXXTRCNIFTYCASD 501
            WGKSWG+ G+MHMQRNT N++G+CGIN+LA                  T+CN+FTYC+S 
Sbjct: 300  WGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSG 359

Query: 500  ETCCCTRSFLGICLNWKCCGAESAVCCKDRKHCCPHDYPICEPTKNLCLKQIGNSTLVMP 321
            ETCCC R   G+C +WKCC  ESAVCCKD +HCCPHDYP+C+ T++LCLK+ GN T + P
Sbjct: 360  ETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKP 419

Query: 320  IGKTTLS 300
              K   S
Sbjct: 420  FWKKNSS 426


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  567 bits (1461), Expect = e-159
 Identities = 262/425 (61%), Positives = 328/425 (77%), Gaps = 4/425 (0%)
 Frame = -3

Query: 1574 SFLNLILLFSQVPTSKTSS--ISDSFDLWCQEHGKSYSSEQERQHRLKVFEQNYELVLAH 1401
            SF++L   F  + +S +SS  IS+ FD WCQ HGK+Y SE+ERQ R+++F+ N++ V  H
Sbjct: 7    SFVSLTFFFLLLVSSPSSSDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQH 66

Query: 1400 NARPNSSYTLSLNAFADLTNHEFRSKYLGLSPSHSDLLIRLNSRSPSNEGSDLVSESDVP 1221
            N   N++Y+LSLNAFADLT+HEF++  LGLS S S L++       +++G  L   + VP
Sbjct: 67   NLITNATYSLSLNAFADLTHHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVP 119

Query: 1220 SSVDWRDKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYND 1041
             SVDWR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN 
Sbjct: 120  DSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNA 179

Query: 1040 GCGGGLMDYSYEFIIKNKGIDTEDDYPYRARDGTCDKNRLKRHIVTIDSYADLPSKNEKK 861
            GC GGLMDY++EF+IKN GIDTE DYPY+ RDGTC K++LK+ +VTIDSYA + S +EK 
Sbjct: 180  GCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKA 239

Query: 860  LQQAVATQPISVGICGSDSSFQLYS--GGIFNGPCSTALDHAVLIVGYASQDGKDYWIIK 687
            L++AVA QP+SVGICGS+ +FQLYS   GIF+GPCST+LDHAVLIVGY SQ+G DYWI+K
Sbjct: 240  LREAVAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVK 299

Query: 686  NSWGKSWGISGYMHMQRNTGNAEGLCGINLLAXXXXXXXXXXXXXXXXXXTRCNIFTYCA 507
            NSWGKSWG+ G+MHMQRNTGN+EG+CGIN+LA                  T+CN+FTYC+
Sbjct: 300  NSWGKSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCS 359

Query: 506  SDETCCCTRSFLGICLNWKCCGAESAVCCKDRKHCCPHDYPICEPTKNLCLKQIGNSTLV 327
            + ETCCC R+  G+C +WKCC  ESAVCC D +HCCPHDYP+C+ T++LCLK+ GN T +
Sbjct: 360  AGETCCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAI 419

Query: 326  MPIGK 312
             P  K
Sbjct: 420  KPFWK 424


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  566 bits (1459), Expect = e-159
 Identities = 261/427 (61%), Positives = 326/427 (76%), Gaps = 2/427 (0%)
 Frame = -3

Query: 1574 SFLNLILLFSQVPTSKTSS--ISDSFDLWCQEHGKSYSSEQERQHRLKVFEQNYELVLAH 1401
            SF++L   F  + +S +SS  IS+ FD WCQ+HGK+Y SE+ERQ R+++F+ N++ V  H
Sbjct: 7    SFISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQH 66

Query: 1400 NARPNSSYTLSLNAFADLTNHEFRSKYLGLSPSHSDLLIRLNSRSPSNEGSDLVSESDVP 1221
            N   N++Y+LSLNAFADLT+HEF++  LGLS S   +++       +++G  L     VP
Sbjct: 67   NLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVP 119

Query: 1220 SSVDWRDKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYND 1041
             SVDWR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN 
Sbjct: 120  DSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNA 179

Query: 1040 GCGGGLMDYSYEFIIKNKGIDTEDDYPYRARDGTCDKNRLKRHIVTIDSYADLPSKNEKK 861
            GC GGLMDY++EF+IKN GIDTE DYPY+ RDGTC K++LK+ +VTIDSYA + S +EK 
Sbjct: 180  GCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKA 239

Query: 860  LQQAVATQPISVGICGSDSSFQLYSGGIFNGPCSTALDHAVLIVGYASQDGKDYWIIKNS 681
            L +AVA QP+SVGICGS+ +FQLYS GIF+GPCST+LDHAVLIVGY SQ+G DYWI+KNS
Sbjct: 240  LMEAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNS 299

Query: 680  WGKSWGISGYMHMQRNTGNAEGLCGINLLAXXXXXXXXXXXXXXXXXXTRCNIFTYCASD 501
            WGKSWG+ G+MHMQRNT N++G+CGIN+LA                  T+CN+FTYC+S 
Sbjct: 300  WGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSG 359

Query: 500  ETCCCTRSFLGICLNWKCCGAESAVCCKDRKHCCPHDYPICEPTKNLCLKQIGNSTLVMP 321
            ETCCC R   G+C +WKCC  ESAVCCKD +HCCPHDYP+C+ T++LCLK+ GN T + P
Sbjct: 360  ETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKP 419

Query: 320  IGKTTLS 300
              K   S
Sbjct: 420  FWKKNSS 426


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  565 bits (1457), Expect = e-158
 Identities = 266/417 (63%), Positives = 322/417 (77%)
 Frame = -3

Query: 1583 WIWSFLNLILLFSQVPTSKTSSISDSFDLWCQEHGKSYSSEQERQHRLKVFEQNYELVLA 1404
            + + FL L LL  + P S TS++S+ F++WC EHGKSYSS +E+ +RL VF  NYE V  
Sbjct: 4    YAFHFLTLFLLLFR-PLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTH 62

Query: 1403 HNARPNSSYTLSLNAFADLTNHEFRSKYLGLSPSHSDLLIRLNSRSPSNEGSDLVSESDV 1224
            HN   NSSYTLSLN++ADLT+HEF+   LG SP+  +    L  + PS          DV
Sbjct: 63   HNNLDNSSYTLSLNSYADLTHHEFKVSRLGFSPALRNFRPVL-PQEPS-------LPRDV 114

Query: 1223 PSSVDWRDKGAVTAVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYN 1044
            P S+DWR KGAVTAVKDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD+SYN
Sbjct: 115  PDSLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYN 174

Query: 1043 DGCGGGLMDYSYEFIIKNKGIDTEDDYPYRARDGTCDKNRLKRHIVTIDSYADLPSKNEK 864
             GCGGGLMDY+Y+F+I N GIDTE+DYPY+ARDG+C K++L+R++VTID YAD+PS +E 
Sbjct: 175  SGCGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEG 234

Query: 863  KLQQAVATQPISVGICGSDSSFQLYSGGIFNGPCSTALDHAVLIVGYASQDGKDYWIIKN 684
            KL QAVA QP+SVGICGS+ +FQLYS GIF+GPCST+LDHAVLIVGY S++G DYWI+KN
Sbjct: 235  KLLQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKN 294

Query: 683  SWGKSWGISGYMHMQRNTGNAEGLCGINLLAXXXXXXXXXXXXXXXXXXTRCNIFTYCAS 504
            SWGKSWG+ GYMHMQRN+GN+EG+CGIN LA                  T+C+I T CA+
Sbjct: 295  SWGKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAA 354

Query: 503  DETCCCTRSFLGICLNWKCCGAESAVCCKDRKHCCPHDYPICEPTKNLCLKQIGNST 333
             ETCCC + FLG+CL+WKCCG  SAVCCKD +HCCP DYPIC+  +NLCLKQ  N T
Sbjct: 355  GETCCCAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGT 411

