BLASTX nr result

ID: Bupleurum21_contig00026454 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00026454
         (1054 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   256   5e-66
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   254   2e-65
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       253   7e-65
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           249   6e-64
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   249   6e-64

>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  256 bits (655), Expect = 5e-66
 Identities = 140/329 (42%), Positives = 191/329 (58%), Gaps = 6/329 (1%)
 Frame = -1

Query: 994  EECFHKQKSRITWLKKGDSNTGYFFKQCLARWNSNKILSIKDDNDIVHQSHESIANVAVN 815
            EE F KQKS++ W+  GD N  YF K    R   N I  I+  N    Q+ E I   A  
Sbjct: 650  EEGFLKQKSKLHWMNVGDGNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAER 709

Query: 814  YFK----RKMGSVSNVASFPDDIVLPSIPTSKIQ--MLVAPVTDGEILCTLKSMKKGRSP 653
            +F     R+ G    + S  D   L S   S     +L   VT  EI   L +M   +SP
Sbjct: 710  FFNEFLNRQSGDFHGI-SVEDLRNLMSYRCSVTDQNILTREVTGEEIQKVLFAMPNNKSP 768

Query: 652  GPDGFQPEFFIAAWKIVGSDFVKGVKYFFNSLSMPKIINSAAIALVPKSSCPERISDYRP 473
            GPDG+  EFF A W + G DF+  ++ FF    +PK +N+  +AL+PK      + DYRP
Sbjct: 769  GPDGYTSEFFKATWSLTGPDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRP 828

Query: 472  ISCCNTIYKCISKILASRLKHVIGDLVSLNQAAFVSNRCMGDNIMLAQVFCKNYHIDKGQ 293
            ISCCN +YK ISKILA+RLK ++   +  NQ+AFV  R + +N++LA    K+YH +   
Sbjct: 829  ISCCNVLYKVISKILANRLKLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESVT 888

Query: 292  PRFAMKLDLFKAFDTVNWEFLLTILHRMNFPQLFVRWIQKCLSSAMLSVKINGGLEGFFN 113
            PR AMK+D+ KAFD+V W+FLL  L  +NFP+ F  WI+ C+S+A  SV++NG L GFF 
Sbjct: 889  PRCAMKIDISKAFDSVQWQFLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFG 948

Query: 112  AKMGLRQGDPISPYLFVIAMEAFTAIINK 26
            +  GLRQG  +SPYLFVI M   + +I++
Sbjct: 949  SSRGLRQGCALSPYLFVICMNVLSHMIDE 977


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  254 bits (649), Expect = 2e-65
 Identities = 136/343 (39%), Positives = 202/343 (58%), Gaps = 5/343 (1%)
 Frame = -1

Query: 1018 RFQKALCDEECFHKQKSRITWLKKGDSNTGYFFKQCLARWNSNKILSIKDDNDIVHQSHE 839
            R+ +    EE + KQKS++ W + GD NT  F +   AR   N I  I  ++ IV    +
Sbjct: 345  RWDRVAILEEKYLKQKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGD 404

Query: 838  SIANVAVNYFKRKMGSVSN----VASFPDDIVLP-SIPTSKIQMLVAPVTDGEILCTLKS 674
             I   A  +F+  +  + N    V       +LP     +  Q L+ PVT  EI   L  
Sbjct: 405  EIKAEAERFFREFLQLIPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFR 464

Query: 673  MKKGRSPGPDGFQPEFFIAAWKIVGSDFVKGVKYFFNSLSMPKIINSAAIALVPKSSCPE 494
            M   +SPGPDG+  EFF A W+I+G +F   V+ FF    +PK INS  +AL+PK +   
Sbjct: 465  MPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAR 524

Query: 493  RISDYRPISCCNTIYKCISKILASRLKHVIGDLVSLNQAAFVSNRCMGDNIMLAQVFCKN 314
             + DYRPISCCN +YK ISKI+A+RLK V+   ++ NQ+AFV +R + +N++LA    K+
Sbjct: 525  EMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKD 584

Query: 313  YHIDKGQPRFAMKLDLFKAFDTVNWEFLLTILHRMNFPQLFVRWIQKCLSSAMLSVKING 134
            YH D    R A+K+D+ KAFD+V W FL+ +   + FP+ F+ WI  C+++A  SV++NG
Sbjct: 585  YHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNG 644

Query: 133  GLEGFFNAKMGLRQGDPISPYLFVIAMEAFTAIINKETSSQDF 5
             L G+F +  GLRQG  +SPYLFVI M+  + +++K  +++ F
Sbjct: 645  ELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHF 687


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  253 bits (645), Expect = 7e-65
 Identities = 135/347 (38%), Positives = 206/347 (59%), Gaps = 4/347 (1%)
 Frame = -1

Query: 1042 STEKDLQYRFQKALCDEECFHKQKSRITWLKKGDSNTGYFFKQCLARWNSNKILSIKDDN 863
            S E + + ++      EE F +QKSRI+W  +GD NT YF +   AR +SN I ++ D N
Sbjct: 332  SFELEAERKWHILTAAEESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGN 391

Query: 862  DIVHQSHESIANVAVNYFKRKMGSVSN--VASFPDDIVLPSIPTSKIQM--LVAPVTDGE 695
              +  S E I ++  +YF   +G   +  +    D  +L S   S  Q+  L +  ++ +
Sbjct: 392  GKLVDSQEGILDLCASYFGSLLGDEVDPYLMEQNDMNLLLSYRCSPAQVCELESTFSNED 451

Query: 694  ILCTLKSMKKGRSPGPDGFQPEFFIAAWKIVGSDFVKGVKYFFNSLSMPKIINSAAIALV 515
            I   L S+ + +S GPDGF  EFFI +W IVG++    +K FF+S  + K  N+  I L+
Sbjct: 452  IRAALFSLPRNKSCGPDGFTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLI 511

Query: 514  PKSSCPERISDYRPISCCNTIYKCISKILASRLKHVIGDLVSLNQAAFVSNRCMGDNIML 335
            PK   P   SD+RPISC NT+YK I+++L  RL+ ++  ++S  Q+AF+  R + +N++L
Sbjct: 512  PKIVNPTCTSDFRPISCLNTLYKVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLL 571

Query: 334  AQVFCKNYHIDKGQPRFAMKLDLFKAFDTVNWEFLLTILHRMNFPQLFVRWIQKCLSSAM 155
            A      Y+     PR  +K+DL KAFD+V WEF++  L  +  P+ F+ WI +C+S+  
Sbjct: 572  ATDLVHGYNWSNISPRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPT 631

Query: 154  LSVKINGGLEGFFNAKMGLRQGDPISPYLFVIAMEAFTAIINKETSS 14
             +V INGG  GFF +  GLRQGDP+SPYLFV+AMEAF+ +++    S
Sbjct: 632  FTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYES 678


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  249 bits (637), Expect = 6e-64
 Identities = 123/339 (36%), Positives = 200/339 (58%), Gaps = 4/339 (1%)
 Frame = -1

Query: 1036 EKDLQYRFQKALCDEECFHKQKSRITWLKKGDSNTGYFFKQCLARWNSNKILSIKDDNDI 857
            E + Q ++    C EE F  Q+SR++W  +GDSNT YF +   +R + N I S+ D N +
Sbjct: 194  ELEAQRKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGL 253

Query: 856  VHQSHESIANVAVNYFKRKMGSVSNVASFPDD----IVLPSIPTSKIQMLVAPVTDGEIL 689
            +  S + I +  V Y++R +GS+ +  S   +    ++       +   L    TD EI 
Sbjct: 254  LIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRCSQDQCSELEKSFTDDEIK 313

Query: 688  CTLKSMKKGRSPGPDGFQPEFFIAAWKIVGSDFVKGVKYFFNSLSMPKIINSAAIALVPK 509
               KS+ + ++ GPDG+  EFF   W I+G + +  +  FF+S  + K  N+  + L+PK
Sbjct: 314  AAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPK 373

Query: 508  SSCPERISDYRPISCCNTIYKCISKILASRLKHVIGDLVSLNQAAFVSNRCMGDNIMLAQ 329
            +S    IS++RPISC NT+YK ISK+L SRL+ ++  ++  +Q+AF+  R + +N++LA 
Sbjct: 374  TSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLAT 433

Query: 328  VFCKNYHIDKGQPRFAMKLDLFKAFDTVNWEFLLTILHRMNFPQLFVRWIQKCLSSAMLS 149
                 Y+     PR  +K+DL KAFD+V WEF+   L  +  P+ ++ WI +C+++   +
Sbjct: 434  EMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFT 493

Query: 148  VKINGGLEGFFNAKMGLRQGDPISPYLFVIAMEAFTAII 32
            + +NG   GFF +  GLRQGDP+SPYLFV+AME F+ ++
Sbjct: 494  ISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLL 532


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  249 bits (637), Expect = 6e-64
 Identities = 123/339 (36%), Positives = 200/339 (58%), Gaps = 4/339 (1%)
 Frame = -1

Query: 1036 EKDLQYRFQKALCDEECFHKQKSRITWLKKGDSNTGYFFKQCLARWNSNKILSIKDDNDI 857
            E + Q ++    C EE F  Q+SR++W  +GDSNT YF +   +R + N I S+ D N +
Sbjct: 194  ELEAQRKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGL 253

Query: 856  VHQSHESIANVAVNYFKRKMGSVSNVASFPDD----IVLPSIPTSKIQMLVAPVTDGEIL 689
            +  S + I +  V Y++R +GS+ +  S   +    ++       +   L    TD EI 
Sbjct: 254  LIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRCSQDQCSELEKSFTDDEIK 313

Query: 688  CTLKSMKKGRSPGPDGFQPEFFIAAWKIVGSDFVKGVKYFFNSLSMPKIINSAAIALVPK 509
               KS+ + ++ GPDG+  EFF   W I+G + +  +  FF+S  + K  N+  + L+PK
Sbjct: 314  AAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSGQLLKQWNATTLVLIPK 373

Query: 508  SSCPERISDYRPISCCNTIYKCISKILASRLKHVIGDLVSLNQAAFVSNRCMGDNIMLAQ 329
            +S    IS++RPISC NT+YK ISK+L SRL+ ++  ++  +Q+AF+  R + +N++LA 
Sbjct: 374  TSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLAT 433

Query: 328  VFCKNYHIDKGQPRFAMKLDLFKAFDTVNWEFLLTILHRMNFPQLFVRWIQKCLSSAMLS 149
                 Y+     PR  +K+DL KAFD+V WEF+   L  +  P+ ++ WI +C+++   +
Sbjct: 434  EMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFT 493

Query: 148  VKINGGLEGFFNAKMGLRQGDPISPYLFVIAMEAFTAII 32
            + +NG   GFF +  GLRQGDP+SPYLFV+AME F+ ++
Sbjct: 494  ISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLL 532


Top