BLASTX nr result

ID: Bupleurum21_contig00015002

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00015002
         (1280 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   342   9e-92
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   340   6e-91
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   330   6e-88
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       328   2e-87
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   326   7e-87

>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  342 bits (878), Expect = 9e-92
 Identities = 172/422 (40%), Positives = 253/422 (59%), Gaps = 4/422 (0%)
 Frame = +1

Query: 25   EEQFLKQKSRIHWLQHGDGNNRFFFNACKGRWNRNKIVAIQNDNGEIVSNHQDISQVAVA 204
            +E  LKQKSRI WL  GD N++FFF A K R  RNKIV +QND G+ ++ + +I      
Sbjct: 343  DESILKQKSRIQWLSLGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICN 402

Query: 205  YFQSLLGSEKPTTSFPSDLEL----PKISLEQGARLTAPVTSAEILQTLKSMSANKCPGP 372
            +++ LLG+         DL +     K+S    A+L  P+T  EI Q L  +   K PG 
Sbjct: 403  FYRRLLGTSSSQLE-AIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGL 461

Query: 373  DGFPPEFYIATWHIVGNEVVNGIISFFQDLAMPRSVNATTIALIPKCDAPSNMSQFRPIS 552
            DGF   F+  +W ++  E+  GI+ FF++  M + +N T + LIPK D   +   +RPI+
Sbjct: 462  DGFNSVFFKKSWLVIKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIA 521

Query: 553  CCNVLYKCISKLLASRLKPIMSYLISPCQGAFVPRRLIGDNILLAQSLCRNYHLNTGTPR 732
            CC+ LYK ISK+L  RL+ +++ ++   Q  F+P R IGDNILLA  L R Y+    +PR
Sbjct: 522  CCSTLYKIISKILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHVSPR 581

Query: 733  CAIKLDISKAFDSLSWNFLFAALQAMNFPAIFIEWIKQCVSTCMYSVKVNGSLEGFFKAE 912
            C IK+DI KA+DS+ W FL + L+ + FP++FI WI  CV T  YS+ +NG     F A+
Sbjct: 582  CVIKVDIRKAYDSVEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFDAQ 641

Query: 913  SGIRQGDPISPYLFVIAMEVLTACLRSSTSAGNFRFHWKCKAVDLTHIIFADDVMLFSYA 1092
             G+RQGDP+SP+LF ++ME L+ C+ +      F FH KC+ + LTH++FADD+++F+ A
Sbjct: 642  KGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARA 701

Query: 1093 DGPSVTALMDGLNLFTSISDLTLNPSKSMVFFGNVQQHIRDEIIHVTGFQYGSLPITYLG 1272
            D  S++ +M   N F+  S L  +  KS ++FG V     +++        GSLP  YLG
Sbjct: 702  DASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYLG 761

Query: 1273 LP 1278
            +P
Sbjct: 762  VP 763


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  340 bits (871), Expect = 6e-91
 Identities = 182/423 (43%), Positives = 249/423 (58%), Gaps = 5/423 (1%)
 Frame = +1

Query: 25   EEQFLKQKSRIHWLQHGDGNNRFFFNACKGRWNRNKIVAIQNDNGEIVSNHQDISQVAVA 204
            EE FLKQKS++HW+  GDGNN +F  A + R  RN I  I+  N E +   ++I   A  
Sbjct: 650  EEGFLKQKSKLHWMNVGDGNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAER 709

Query: 205  YFQSLLGSEKPTTSFPSDLELPKI-----SLEQGARLTAPVTSAEILQTLKSMSANKCPG 369
            +F   L  +       S  +L  +     S+     LT  VT  EI + L +M  NK PG
Sbjct: 710  FFNEFLNRQSGDFHGISVEDLRNLMSYRCSVTDQNILTREVTGEEIQKVLFAMPNNKSPG 769

Query: 370  PDGFPPEFYIATWHIVGNEVVNGIISFFQDLAMPRSVNATTIALIPKCDAPSNMSQFRPI 549
            PDG+  EF+ ATW + G + +  I SFF    +P+ +NAT +ALIPK D    M  +RPI
Sbjct: 770  PDGYTSEFFKATWSLTGPDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPI 829

Query: 550  SCCNVLYKCISKLLASRLKPIMSYLISPCQGAFVPRRLIGDNILLAQSLCRNYHLNTGTP 729
            SCCNVLYK ISK+LA+RLK ++   I   Q AFV  RL+ +N+LLA  L ++YH  + TP
Sbjct: 830  SCCNVLYKVISKILANRLKLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESVTP 889

Query: 730  RCAIKLDISKAFDSLSWNFLFAALQAMNFPAIFIEWIKQCVSTCMYSVKVNGSLEGFFKA 909
            RCA+K+DISKAFDS+ W FL   L+A+NFP  F  WIK C+ST  +SV+VNG L GFF +
Sbjct: 890  RCAMKIDISKAFDSVQWQFLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFGS 949

Query: 910  ESGIRQGDPISPYLFVIAMEVLTACLRSSTSAGNFRFHWKCKAVDLTHIIFADDVMLFSY 1089
              G+RQG  +SPYLFVI M VL+  +  +    N  +H KC+ + LTH+ FADD+M+F  
Sbjct: 950  SRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVD 1009

Query: 1090 ADGPSVTALMDGLNLFTSISDLTLNPSKSMVFFGNVQQHIRDEIIHVTGFQYGSLPITYL 1269
                S+  +++    F   S L ++  KS ++   V    R + +    F  G LP+ YL
Sbjct: 1010 GHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPVRYL 1069

Query: 1270 GLP 1278
            GLP
Sbjct: 1070 GLP 1072


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score =  330 bits (845), Expect = 6e-88
 Identities = 179/429 (41%), Positives = 252/429 (58%), Gaps = 11/429 (2%)
 Frame = +1

Query: 25   EEQFLKQKSRIHWLQHGDGNNRFFFNACKGRWNRNKIVAIQNDNGEIVSNHQDISQVAVA 204
            EE+FLKQ+S++HWL  GD NN+ F  A   R  +N I  I   +G + S  + I   A  
Sbjct: 241  EEKFLKQRSKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEH 300

Query: 205  YFQSLLGSEKPTTSFPSDLE----------LP-KISLEQGARLTAPVTSAEILQTLKSMS 351
            +F+  L         P+D E          LP + S      LT  V++ EI + + SM 
Sbjct: 301  HFREFL------QLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSAEEIHKVVFSMP 354

Query: 352  ANKCPGPDGFPPEFYIATWHIVGNEVVNGIISFFQDLAMPRSVNATTIALIPKCDAPSNM 531
             +K PGPDG+  EFY   W+I+G E +  I SFF    +P+ +N+T +ALIPK      M
Sbjct: 355  NDKSPGPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKEAKEM 414

Query: 532  SQFRPISCCNVLYKCISKLLASRLKPIMSYLISPCQGAFVPRRLIGDNILLAQSLCRNYH 711
              +RPISCCNVLYK ISK++A+RLK ++   I   Q AFV  RL+ +N+LLA  + ++YH
Sbjct: 415  KDYRPISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATEIVKDYH 474

Query: 712  LNTGTPRCAIKLDISKAFDSLSWNFLFAALQAMNFPAIFIEWIKQCVSTCMYSVKVNGSL 891
             ++ + RCA+K+DISKAFDS+ W FL   L+AMNFP  F  WI  C++T  +SV+VNG L
Sbjct: 475  KDSVSSRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSVQVNGEL 534

Query: 892  EGFFKAESGIRQGDPISPYLFVIAMEVLTACLRSSTSAGNFRFHWKCKAVDLTHIIFADD 1071
             G F +   +RQG  +SPYLFVI+M+VL+  L  +  A  F +H KC+A+ LTH+ FADD
Sbjct: 535  AGVFSSARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLSFADD 594

Query: 1072 VMLFSYADGPSVTALMDGLNLFTSISDLTLNPSKSMVFFGNVQQHIRDEIIHVTGFQYGS 1251
            +M+ S     S+  ++  L  F   S L ++  KS ++   VQ  +  EI+    F  G 
Sbjct: 595  LMILSDGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGK 654

Query: 1252 LPITYLGLP 1278
            LP+ YLGLP
Sbjct: 655  LPVRYLGLP 663


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  328 bits (841), Expect = 2e-87
 Identities = 177/424 (41%), Positives = 251/424 (59%), Gaps = 4/424 (0%)
 Frame = +1

Query: 19   STEEQFLKQKSRIHWLQHGDGNNRFFFNACKGRWNRNKIVAIQNDNGEIVSNHQDISQVA 198
            + EE F +QKSRI W   GDGN ++F      R + N I A+ + NG++V + + I  + 
Sbjct: 346  AAEESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLC 405

Query: 199  VAYFQSLLGSE-KPTTSFPSDLELP---KISLEQGARLTAPVTSAEILQTLKSMSANKCP 366
             +YF SLLG E  P     +D+ L    + S  Q   L +  ++ +I   L S+  NK  
Sbjct: 406  ASYFGSLLGDEVDPYLMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSC 465

Query: 367  GPDGFPPEFYIATWHIVGNEVVNGIISFFQDLAMPRSVNATTIALIPKCDAPSNMSQFRP 546
            GPDGF  EF+I +W IVG EV + I  FF    + +  NATTI LIPK   P+  S FRP
Sbjct: 466  GPDGFTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRP 525

Query: 547  ISCCNVLYKCISKLLASRLKPIMSYLISPCQGAFVPRRLIGDNILLAQSLCRNYHLNTGT 726
            ISC N LYK I++LL  RL+ ++S +IS  Q AF+P R + +N+LLA  L   Y+ +  +
Sbjct: 526  ISCLNTLYKVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNIS 585

Query: 727  PRCAIKLDISKAFDSLSWNFLFAALQAMNFPAIFIEWIKQCVSTCMYSVKVNGSLEGFFK 906
            PR  +K+D+ KAFDS+ W F+ AAL+A+  P  FI WI QC+ST  ++V +NG   GFFK
Sbjct: 586  PRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFFK 645

Query: 907  AESGIRQGDPISPYLFVIAMEVLTACLRSSTSAGNFRFHWKCKAVDLTHIIFADDVMLFS 1086
            +  G+RQGDP+SPYLFV+AME  +  L S   +G   +H K   + ++H++FADDVM+F 
Sbjct: 646  STKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMIFF 705

Query: 1087 YADGPSVTALMDGLNLFTSISDLTLNPSKSMVFFGNVQQHIRDEIIHVTGFQYGSLPITY 1266
                 S+  + + L+ F S S L +N  KS ++   + Q +        GF  G+LPI Y
Sbjct: 706  DGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQ-LESNANAAYGFPIGTLPIRY 764

Query: 1267 LGLP 1278
            LGLP
Sbjct: 765  LGLP 768


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  326 bits (836), Expect = 7e-87
 Identities = 177/429 (41%), Positives = 253/429 (58%), Gaps = 11/429 (2%)
 Frame = +1

Query: 25   EEQFLKQKSRIHWLQHGDGNNRFFFNACKGRWNRNKIVAIQNDNGEIVSNHQDISQVAVA 204
            EE++LKQKS++HW Q GD N + F  A   R   N I  I +++G + +   +I   A  
Sbjct: 353  EEKYLKQKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAER 412

Query: 205  YFQSLLGSEKPTTSFPSDLE----------LP-KISLEQGARLTAPVTSAEILQTLKSMS 351
            +F+  L         P+D E          LP + S      L  PVT+ EI + L  M 
Sbjct: 413  FFREFL------QLIPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFRMP 466

Query: 352  ANKCPGPDGFPPEFYIATWHIVGNEVVNGIISFFQDLAMPRSVNATTIALIPKCDAPSNM 531
            ++K PGPDG+  EF+ ATW I+G+E    + SFF    +P+ +N+T +ALIPK      M
Sbjct: 467  SDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREM 526

Query: 532  SQFRPISCCNVLYKCISKLLASRLKPIMSYLISPCQGAFVPRRLIGDNILLAQSLCRNYH 711
              +RPISCCNVLYK ISK++A+RLK ++   I+  Q AFV  RL+ +N+LLA  L ++YH
Sbjct: 527  KDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKDYH 586

Query: 712  LNTGTPRCAIKLDISKAFDSLSWNFLFAALQAMNFPAIFIEWIKQCVSTCMYSVKVNGSL 891
             +T + RCAIK+DISKAFDS+ W FL      + FP  FI WI  C++T  +SV+VNG L
Sbjct: 587  KDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNGEL 646

Query: 892  EGFFKAESGIRQGDPISPYLFVIAMEVLTACLRSSTSAGNFRFHWKCKAVDLTHIIFADD 1071
             G+F++  G+RQG  +SPYLFVI M+VL+  L  + +A +F +H KCK + LTH+ FADD
Sbjct: 647  AGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADD 706

Query: 1072 VMLFSYADGPSVTALMDGLNLFTSISDLTLNPSKSMVFFGNVQQHIRDEIIHVTGFQYGS 1251
            +M+ S     S+  ++   + F   S L ++  KS V+   +    R+E+     F  G 
Sbjct: 707  LMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQ 766

Query: 1252 LPITYLGLP 1278
            LP+ YLGLP
Sbjct: 767  LPVRYLGLP 775

