BLASTX nr result

ID: Bupleurum21_contig00034042 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00034042
         (1445 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002322693.1| predicted protein [Populus trichocarpa] gi|2...   637   e-180
ref|NP_567055.1| glycoside hydrolase family 28 protein / polygal...   604   e-170
emb|CAB41176.1| putative protein [Arabidopsis thaliana]               604   e-170
ref|XP_002878172.1| hypothetical protein ARALYDRAFT_324272 [Arab...   602   e-170
ref|XP_004161486.1| PREDICTED: exo-poly-alpha-D-galacturonosidas...   601   e-169

>ref|XP_002322693.1| predicted protein [Populus trichocarpa] gi|222867323|gb|EEF04454.1|
            predicted protein [Populus trichocarpa]
          Length = 445

 Score =  637 bits (1643), Expect = e-180
 Identities = 298/431 (69%), Positives = 346/431 (80%), Gaps = 3/431 (0%)
 Frame = +2

Query: 158  SVADYGTVGDGIHYDTAQIQAAIDDCSSHGGGR---VIFPPGTYLTATIFLKSGVVLEVQ 328
            SV D+G +GDGIHYDT  IQ+ I+ C +    +   V FPPG YLTATI LKS VVL +Q
Sbjct: 14   SVTDFGAIGDGIHYDTEAIQSTINSCPTTPPTKACHVNFPPGIYLTATIHLKSNVVLNIQ 73

Query: 329  KNATILGGTRLEDYPAENKRWYVVVAEEAXXXXXXXXXXXXXXXLKFVKRFDEKKNVMIS 508
            + AT+LGGT+LEDYP E  RWYVV+AE A               LKFVKRF+E+KNVM+S
Sbjct: 74   EGATLLGGTKLEDYPKEFNRWYVVLAENASDVGITGGGVVDGQGLKFVKRFNERKNVMVS 133

Query: 509  WNETGACLGDECRPRLVGFIRSKNVEIWNVNFTHPAYWCLHLVQCNNTSIHDVSIYGDFN 688
            WN TGACLGDECRPRLVGFI   NV++WNV  + PAYWCLH+VQC NT I DVSIYGDFN
Sbjct: 134  WNSTGACLGDECRPRLVGFIGCTNVKVWNVRLSEPAYWCLHIVQCLNTHISDVSIYGDFN 193

Query: 689  TPXXXXXXXXXXXXTLITRCNINTGDDAICPKTYEGALYNLTATNCWIQTKSSAIKLGSA 868
            +P            TLITRC+I+TGDDAICPKTY G +YNLTAT+CWI+TKSSAIKLGSA
Sbjct: 194  SPNNDGIDIEDSNNTLITRCHIDTGDDAICPKTYTGPIYNLTATDCWIRTKSSAIKLGSA 253

Query: 869  SWFDFKGLVFDNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRYYDPSWWGRAEPIYV 1048
            SWF+FKGLVFDNITIV+SHRGLGLQIRDGGNVSD+TFSNINI+TRYYDPSWWGRAEPIYV
Sbjct: 254  SWFEFKGLVFDNITIVDSHRGLGLQIRDGGNVSDITFSNINISTRYYDPSWWGRAEPIYV 313

Query: 1049 TTCPRNEDSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFMNVNLTYRRWTNYMD 1228
            TTCPR+  SK GSISN+ F+NIT  SENGVFLSG KGG+LSNL+F+N+NLT+RRWT Y  
Sbjct: 314  TTCPRHSSSKEGSISNLQFINITTNSENGVFLSGSKGGLLSNLRFINMNLTFRRWTTYPG 373

Query: 1229 GLIDYRPGCQGLVNHSSAGFLMEHIDGLEAVNVNMRWSEDHSGRWNTPLDFKPSSVNNIT 1408
            GL+DYRPGCQGLVNHS+AG +MEHI+G E  NVNMRWS+  +  W+ PLDF+PS+VNNI+
Sbjct: 374  GLVDYRPGCQGLVNHSAAGIIMEHIEGFEVENVNMRWSDYQNEPWDNPLDFRPSTVNNIS 433

Query: 1409 LINLHSGLFKQ 1441
             +N HS L+KQ
Sbjct: 434  FLNFHSALYKQ 444


>ref|NP_567055.1| glycoside hydrolase family 28 protein / polygalacturonase (pectinase)
            family protein [Arabidopsis thaliana]
            gi|332646180|gb|AEE79701.1| glycoside hydrolase family 28
            protein / polygalacturonase (pectinase) family protein
            [Arabidopsis thaliana]
          Length = 490

 Score =  604 bits (1558), Expect = e-170
 Identities = 287/446 (64%), Positives = 345/446 (77%), Gaps = 10/446 (2%)
 Frame = +2

Query: 128  IELLRSNLLFSVADYGTVGDGIHYDTAQIQAAIDDCSSHGGG-----RVIFPPGTYLTAT 292
            I+L   +L  SV D+G  GDGI+YDT+ IQ+ ID C+ H        RV+FP G YLTA 
Sbjct: 26   IQLPGDSLTLSVTDFGATGDGINYDTSAIQSTIDACNRHYTSFSSICRVVFPSGNYLTAK 85

Query: 293  IFLKSGVVLEVQKNATILGGTRLEDY-PAENKR-WYVVVAEEAXXXXXXXXXXXXXXXLK 466
            + L+SGV+L+V +NA +LGG R+EDY PAE    WYVVVA  A                K
Sbjct: 86   LHLRSGVILDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAIDGQGSK 145

Query: 467  FVKRFDEKKNVMISWNETGACLGDECRPRLVGFIRSKNVEIWNVNFTHPAYWCLHLVQCN 646
            FV RFDEKKNVM+SWN+TGACLGDECRPRLVGF+ S NVEIWN+    PAYWCLH+V+C 
Sbjct: 146  FVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSINVEIWNITLREPAYWCLHIVRCE 205

Query: 647  NTSIHDVSIYGDFNTPXXXXXXXXXXXXTLITRCNINTGDDAICPKTYEGALYNLTATNC 826
            NTS+HDVSI GDFNTP            T+ITRC+I+TGDDAICPKTY G LYNLTAT+C
Sbjct: 206  NTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNLTATDC 265

Query: 827  WIQTKSSAIKLGSASWFDFKGLVFDNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRY 1006
            WI+TKSSAIKLGSASWFDFKGLVFDNITI ESHRGLG+QIRDGGNVSDVTFSNINI+TRY
Sbjct: 266  WIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDVTFSNINISTRY 325

Query: 1007 YDPSWWGRAEPIYVTTCPRNEDSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFM 1186
            YDPSWWGRAEPIY+TTCPR+  +K GSISN+LFVNIT  SENGVFLSG   G+LS++KF 
Sbjct: 326  YDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITIDSENGVFLSGSPNGLLSDIKFK 385

Query: 1187 NVNLTYRRWTNYMDGLIDYRPGCQGLVNH-SSAGFLMEHIDGLEAVNVNMRWSEDH--SG 1357
            N+NLT+RRW+NY  GL+DYRPGCQGLVNH +++G +MEH++G    NV+++WS+D   + 
Sbjct: 386  NMNLTFRRWSNYSAGLVDYRPGCQGLVNHRATSGIIMEHVNGFRVENVDLKWSDDDDVNA 445

Query: 1358 RWNTPLDFKPSSVNNITLINLHSGLF 1435
             WN PL+F+PS+VNN++ +   SGL+
Sbjct: 446  AWNVPLEFRPSTVNNVSFVGFTSGLY 471


>emb|CAB41176.1| putative protein [Arabidopsis thaliana]
          Length = 614

 Score =  604 bits (1558), Expect = e-170
 Identities = 287/446 (64%), Positives = 345/446 (77%), Gaps = 10/446 (2%)
 Frame = +2

Query: 128  IELLRSNLLFSVADYGTVGDGIHYDTAQIQAAIDDCSSHGGG-----RVIFPPGTYLTAT 292
            I+L   +L  SV D+G  GDGI+YDT+ IQ+ ID C+ H        RV+FP G YLTA 
Sbjct: 26   IQLPGDSLTLSVTDFGATGDGINYDTSAIQSTIDACNRHYTSFSSICRVVFPSGNYLTAK 85

Query: 293  IFLKSGVVLEVQKNATILGGTRLEDY-PAENKR-WYVVVAEEAXXXXXXXXXXXXXXXLK 466
            + L+SGV+L+V +NA +LGG R+EDY PAE    WYVVVA  A                K
Sbjct: 86   LHLRSGVILDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAIDGQGSK 145

Query: 467  FVKRFDEKKNVMISWNETGACLGDECRPRLVGFIRSKNVEIWNVNFTHPAYWCLHLVQCN 646
            FV RFDEKKNVM+SWN+TGACLGDECRPRLVGF+ S NVEIWN+    PAYWCLH+V+C 
Sbjct: 146  FVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSINVEIWNITLREPAYWCLHIVRCE 205

Query: 647  NTSIHDVSIYGDFNTPXXXXXXXXXXXXTLITRCNINTGDDAICPKTYEGALYNLTATNC 826
            NTS+HDVSI GDFNTP            T+ITRC+I+TGDDAICPKTY G LYNLTAT+C
Sbjct: 206  NTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNLTATDC 265

Query: 827  WIQTKSSAIKLGSASWFDFKGLVFDNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRY 1006
            WI+TKSSAIKLGSASWFDFKGLVFDNITI ESHRGLG+QIRDGGNVSDVTFSNINI+TRY
Sbjct: 266  WIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDVTFSNINISTRY 325

Query: 1007 YDPSWWGRAEPIYVTTCPRNEDSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFM 1186
            YDPSWWGRAEPIY+TTCPR+  +K GSISN+LFVNIT  SENGVFLSG   G+LS++KF 
Sbjct: 326  YDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITIDSENGVFLSGSPNGLLSDIKFK 385

Query: 1187 NVNLTYRRWTNYMDGLIDYRPGCQGLVNH-SSAGFLMEHIDGLEAVNVNMRWSEDH--SG 1357
            N+NLT+RRW+NY  GL+DYRPGCQGLVNH +++G +MEH++G    NV+++WS+D   + 
Sbjct: 386  NMNLTFRRWSNYSAGLVDYRPGCQGLVNHRATSGIIMEHVNGFRVENVDLKWSDDDDVNA 445

Query: 1358 RWNTPLDFKPSSVNNITLINLHSGLF 1435
             WN PL+F+PS+VNN++ +   SGL+
Sbjct: 446  AWNVPLEFRPSTVNNVSFVGFTSGLY 471


>ref|XP_002878172.1| hypothetical protein ARALYDRAFT_324272 [Arabidopsis lyrata subsp.
            lyrata] gi|297324010|gb|EFH54431.1| hypothetical protein
            ARALYDRAFT_324272 [Arabidopsis lyrata subsp. lyrata]
          Length = 589

 Score =  602 bits (1552), Expect = e-170
 Identities = 285/436 (65%), Positives = 340/436 (77%), Gaps = 8/436 (1%)
 Frame = +2

Query: 128  IELLRSNLLFSVADYGTVGDGIHYDTAQIQAAIDDCSSHGGG-----RVIFPPGTYLTAT 292
            I+L   +L  SV D+G  GDGI+YDT+ +Q+ ID C+ H        RV FP G YLTA 
Sbjct: 25   IQLPGDSLALSVTDFGATGDGINYDTSAVQSTIDACNRHYTSSSSICRVTFPSGNYLTAK 84

Query: 293  IFLKSGVVLEVQKNATILGGTRLEDY-PAENKR-WYVVVAEEAXXXXXXXXXXXXXXXLK 466
            + L+SGVVL+V +NA +LGG R+EDY PAE    WYVVVA  A                K
Sbjct: 85   LHLRSGVVLDVTENAVLLGGPRIEDYYPAETSSDWYVVVANNATDVGITGGGAIDGQGSK 144

Query: 467  FVKRFDEKKNVMISWNETGACLGDECRPRLVGFIRSKNVEIWNVNFTHPAYWCLHLVQCN 646
            FV RFDEKKNVM+SWN+TGACLGDECRPRLVGF+ S+NVEIWN+    PAYWCLH+V+C 
Sbjct: 145  FVVRFDEKKNVMVSWNQTGACLGDECRPRLVGFVDSRNVEIWNITLREPAYWCLHIVRCE 204

Query: 647  NTSIHDVSIYGDFNTPXXXXXXXXXXXXTLITRCNINTGDDAICPKTYEGALYNLTATNC 826
            NTS+HDVSI GDFNTP            T+ITRC+I+TGDDAICPKTY G LYNLTAT+C
Sbjct: 205  NTSVHDVSILGDFNTPNNDGIDIEDSNNTVITRCHIDTGDDAICPKTYTGPLYNLTATDC 264

Query: 827  WIQTKSSAIKLGSASWFDFKGLVFDNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRY 1006
            WI+TKSSAIKLGSASWFDFKGLVFDNITI ESHRGLG+QIRDGGNVSD+TFSNINI+TRY
Sbjct: 265  WIRTKSSAIKLGSASWFDFKGLVFDNITIFESHRGLGMQIRDGGNVSDITFSNINISTRY 324

Query: 1007 YDPSWWGRAEPIYVTTCPRNEDSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFM 1186
            YDPSWWGRAEPIY+TTCPR+  +K GSISN+LFVNIT  SENGVFLSG   G+LS++KF 
Sbjct: 325  YDPSWWGRAEPIYITTCPRDSSAKEGSISNLLFVNITINSENGVFLSGSPNGLLSDIKFK 384

Query: 1187 NVNLTYRRWTNYMDGLIDYRPGCQGLVNHS-SAGFLMEHIDGLEAVNVNMRWSEDHSGRW 1363
            N+NLT RRW+NY  GL+DYRPGC+GLVNHS +AG +MEH++G    NV+++WS+D +  W
Sbjct: 385  NMNLTVRRWSNYSAGLVDYRPGCRGLVNHSATAGIIMEHVNGFSIENVDLKWSDDLNSGW 444

Query: 1364 NTPLDFKPSSVNNITL 1411
            N PL+F+PS+VNN++L
Sbjct: 445  NVPLEFRPSTVNNVSL 460


>ref|XP_004161486.1| PREDICTED: exo-poly-alpha-D-galacturonosidase-like [Cucumis sativus]
          Length = 463

 Score =  601 bits (1549), Expect = e-169
 Identities = 281/440 (63%), Positives = 338/440 (76%), Gaps = 1/440 (0%)
 Frame = +2

Query: 113  SASPIIELLRSNLLFSVADYGTVGDGIHYDTAQIQAAIDDCSSHGGGRVIFPPGTYLTAT 292
            SA P I LLR +  FSV D+G +GDG+HYDT  IQ+AI+ C +     V FPPGTYLTAT
Sbjct: 20   SAIPSIRLLRRSTSFSVTDFGAIGDGLHYDTTAIQSAINSCPAPSRCYVTFPPGTYLTAT 79

Query: 293  IFLKSGVVLEVQKNATILGGTRLEDYPAENKRWYVVVAEEAXXXXXXXXXXXXXXXLKFV 472
            I+L+SGVVL++Q  AT+L GT++EDYPA++ RW+ VVAE A               LKFV
Sbjct: 80   IWLRSGVVLDIQPGATVLAGTKMEDYPADSSRWFAVVAENASDVGISGGGTVDGQGLKFV 139

Query: 473  KRFDEKKNVMISWNETGACLGDECRPRLVGFIRSKNVEIWNVNFTHPAYWCLHLVQCNNT 652
            ++FD++KNVM+SWN+TGAC GDECRP LVGFI S  V + NV+F  PA+WCLHLV+C NT
Sbjct: 140  EKFDKRKNVMVSWNKTGACYGDECRPDLVGFIGSNKVRVSNVSFNQPAHWCLHLVRCENT 199

Query: 653  SIHDVSIYGDFNTPXXXXXXXXXXXXTLITRCNINTGDDAICPKTYEGALYNLTATNCWI 832
             I DVSIYGDF+TP            TLITRC I+TGDDAICPK+  G ++NLTATNCWI
Sbjct: 200  VIEDVSIYGDFDTPNNDGIDIEDSNNTLITRCRIDTGDDAICPKSSNGPVFNLTATNCWI 259

Query: 833  QTKSSAIKLGSASWFDFKGLVFDNITIVESHRGLGLQIRDGGNVSDVTFSNINITTRYYD 1012
            +TKSSAIKLGSASWF+F  ++FDN+TIV+SHRGL  Q+RDGG+ +D+TFSNINITTRYYD
Sbjct: 260  RTKSSAIKLGSASWFNFTRMLFDNLTIVDSHRGLAFQLRDGGSANDITFSNINITTRYYD 319

Query: 1013 PSWWGRAEPIYVTTCPRNEDSKSGSISNILFVNITATSENGVFLSGCKGGILSNLKFMNV 1192
            PSWWGRAEPIYVTTCPR+  SK GSISNI F+NITATSENGVFLSG K G+LSNL+F NV
Sbjct: 320  PSWWGRAEPIYVTTCPRDPGSKEGSISNIRFINITATSENGVFLSGSKSGVLSNLRFTNV 379

Query: 1193 NLTYRRWTNYMDGLIDYRPGCQGLVNHSSAGFLMEHIDGLEAVNVNMRWSEDH-SGRWNT 1369
             L Y+RWT Y  G+ DYRPGCQG V H  AG +MEHI+GL   NV+M W + + S +WN 
Sbjct: 380  KLRYKRWTKYGGGIADYRPGCQGFVKHGMAGMIMEHIEGLNLENVDMHWFDTNGSLQWNN 439

Query: 1370 PLDFKPSSVNNITLINLHSG 1429
            PLDF+PS+VNNI+  N HSG
Sbjct: 440  PLDFRPSTVNNISFFNFHSG 459


Top