BLASTX nr result

ID: Bupleurum21_contig00005148 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00005148
         (1725 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   328   2e-87
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               315   2e-83
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   314   4e-83
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           313   7e-83
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   313   9e-83

>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  328 bits (842), Expect = 2e-87
 Identities = 189/589 (32%), Positives = 301/589 (51%), Gaps = 23/589 (3%)
 Frame = -1

Query: 1722 ICVTSCHYSIKLNGALEGFFPAASGVRQGDPLSPYLFVIAMQVFTACLNFCKASRDFTYH 1543
            IC+T+  +S+++NG L G+F ++ G+RQG  LSPYLFVI M V +  L+   A+R F YH
Sbjct: 631  ICITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYH 690

Query: 1542 WKCRDLDITHITFADDILLYCYGDAPSISILMDGITLFSGISGLRPNNLKSSIFFCNVKP 1363
             KC+ + +TH++FADD+++   G   SI  ++     F+  SGLR +  KS+++   +  
Sbjct: 691  PKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSA 750

Query: 1362 EIRNWAVGYTGFQLGELPMSYLGLPLLTSRLTMQQCLPLIMKLCARIQSWMNRFLSFSGR 1183
              RN       F  G+LP+ YLGLPL+T RL+   CLPL+ ++  RI SW +RFLS++GR
Sbjct: 751  TARNEVADRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGR 810

Query: 1182 LQLIKSVLFGIQGYWAFHLFLPKGVIKKIQSILSNFLWGGTANSTAQHKIAWSTCCLPLE 1003
            L LI SVL+ I  +W     LP+  I++++ + S FLW GT  ++ + KI+W   C P +
Sbjct: 811  LNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKD 870

Query: 1002 EGGLGIRDLMEWNTAAMIFRVWRTLKPSASLWIRWFHTYIIMNKPFWTMKCS-SFHSWGV 826
            EGGLG+R L E N    +  VW+ +  S SLW++W   +++ N  FW +K + S  SW  
Sbjct: 871  EGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIW 930

Query: 825  RKIMNARSIAMQHIVYSAGRNSEFLLWHDPWPGI-PLISRL-SCAVVSITESVSMARLHT 652
            +K++  R +A        G   +   W+D W  +  L+ R     ++ +  S  M     
Sbjct: 931  KKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEA 990

Query: 651  IMDGNSWRPHSSSHHLAIELRRMLASIQIADADRITW---NDIAKIKIS--HIWNSIRHV 487
              +    R  +  +++  +  +     +    D++ W   +D+ +   S    W+  R  
Sbjct: 991  WTNRRQRRHRNDVYNVIEDALKKSWDTRTETEDKVLWRGKSDVFRTTFSTRDTWHHTRST 1050

Query: 486  GPSPPWVHKVWSSYMIKKCSFLMWTALHQWLLTKDRMINFGMNVDPSCLLCQQHPESIVH 307
                PW   +W S+   K SF  W A H  L T DRMIN+   +   C+ CQ   E+  H
Sbjct: 1051 SARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLETRDH 1110

Query: 306  LFSTCSYSLSIMQNSPHPLTCSWSDYKNGTF--------------LIGNPEKQMEQMAIL 169
            LF TCS++  I           W D   G F              +  +   ++E     
Sbjct: 1111 LFFTCSFTSVI-----------WVDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRR 1159

Query: 168  YLAVA-MYLIWKERNARLHNSHPKSAASLAGEVKAVVREKKSAASLAGE 25
            Y+  A +Y++W+ERN R H   P +A+ L G +   +R + S+  L G+
Sbjct: 1160 YVFQATIYIVWRERNGRRHGEPPNTASQLVGWIDKQIRNQLSSICLKGD 1208


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  315 bits (808), Expect = 2e-83
 Identities = 196/587 (33%), Positives = 304/587 (51%), Gaps = 21/587 (3%)
 Frame = -1

Query: 1722 ICVTSCHYSIKLNGALEGFFPAASGVRQGDPLSPYLFVIAMQVFTACLNFCKASRDFTYH 1543
            +C+T+  +S+++NG L G+F +  G+RQG  LSPYLFVI M V +  L+     R F +H
Sbjct: 278  LCITTASFSVQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFH 337

Query: 1542 WKCRDLDITHITFADDILLYCYGDAPSISILMDGITLFSGISGLRPNNLKSSIFFCNVKP 1363
             KC+ L +TH++FADD+++   G   SI  +++    F   SGLR +  KS+++   V P
Sbjct: 338  PKCQRLGLTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSP 397

Query: 1362 EIRNWAVGYTGFQLGELPMSYLGLPLLTSRLTMQQCLPLIMKLCARIQSWMNRFLSFSGR 1183
             I+        F +G+LP+ YLGLPL+T RLT     PL+ ++  RI +W  RF SF+GR
Sbjct: 398  IIKQEIAAKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGR 457

Query: 1182 LQLIKSVLFGIQGYWAFHLFLPKGVIKKIQSILSNFLWGGTANSTAQHKIAWSTCCLPLE 1003
              LIKSVL+ I  +W     LP+  I++I  + S+FLW G+  S+ + KI+W   C P  
Sbjct: 458  FNLIKSVLWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKA 517

Query: 1002 EGGLGIRDLMEWNTAAMIFRVWRTLKPSASLWIRWFHTYIIMNKPFWTMKCS-SFHSWGV 826
            EGGLG+R+L E N  + +  VWR +  S SLW +W   Y+I  K  W++K S S  SW  
Sbjct: 518  EGGLGLRNLKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIW 577

Query: 825  RKIMNARSIAMQHIVYSAGRNSEFLLWHDPWPG----IPLISRLSCAVVSITESVSMARL 658
            RKI+  R +A        G       W+D W      I  +       + I    S+A  
Sbjct: 578  RKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVADA 637

Query: 657  HTIMDGNSWRPHSSSHHLAIELRRMLA--SIQIADA-DRITW---NDIAKIKIS--HIWN 502
             T     S R H +S  L  E+  M+A   I  +DA D + W   ND+ K   S    W+
Sbjct: 638  WT---RRSRRRHRTS--LLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTRDTWH 692

Query: 501  SIRHVGPSPPWVHKVWSSYMIKKCSFLMWTALHQWLLTKDRMI--NFGMNVDPSCLLCQQ 328
             I+    +  W   VW  +   K +   W A+H  L T DRM+  N   +V  +C+LC  
Sbjct: 693  LIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCTN 752

Query: 327  HPESIVHLFSTCSYSLSIMQNSPHPL-----TCSWSDYKNGTFLIGNPEKQMEQMAILYL 163
            + +++ HLF +CSY+ ++       +     +  WS     T +  + + ++E     Y+
Sbjct: 753  NSKTLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHLL--THISTHFQDRVEGFLTRYI 810

Query: 162  AVA-MYLIWKERNARLHNSHPKSAASLAGEVKAVVREKKSAASLAGE 25
              A +Y +W+ERN R H++ P + A++ G +    R + +    +G+
Sbjct: 811  FQATIYHVWRERNGRRHDAAPNTPATVIGWIDKQTRNQITIIRQSGD 857


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  314 bits (805), Expect = 4e-83
 Identities = 190/567 (33%), Positives = 289/567 (50%), Gaps = 20/567 (3%)
 Frame = -1

Query: 1722 ICVTSCHYSIKLNGALEGFFPAASGVRQGDPLSPYLFVIAMQVFTACLNFCKASRDFTYH 1543
            +C+ +  +S+++NG L GFF +  G+RQG  LSPYL+VI M V +  L+     +  +YH
Sbjct: 778  LCIGTASFSVQVNGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYH 837

Query: 1542 WKCRDLDITHITFADDILLYCYGDAPSISILMDGITLFSGISGLRPNNLKSSIFFCNVKP 1363
             +CR++++TH+ FADDI+++  G + SI   +     F+ +S L+ +  KS+IF   + P
Sbjct: 838  PRCRNMNLTHLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISP 897

Query: 1362 EIRNWAVGYTGFQLGELPMSYLGLPLLTSRLTMQQCLPLIMKLCARIQSWMNRFLSFSGR 1183
              +   +    F+LG LP+ YLGLPLLT R+T    LPL+ K+ ARI SW NRFLSF+GR
Sbjct: 898  NAKTSILQQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGR 957

Query: 1182 LQLIKSVLFGIQGYWAFHLFLPKGVIKKIQSILSNFLWGGTANSTAQHKIAWSTCCLPLE 1003
            LQLIKSVL  I  +W     LPK  +++I+ + S FLW G   +T + KIAWS  C   E
Sbjct: 958  LQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKE 1017

Query: 1002 EGGLGIRDLMEWNTAAMIFRVWRTLKPSASLWIRWFHTYIIMNKPFWTMK-CSSFHSWGV 826
            EGGLG++ L E N  +++  +WR L    SLW++W + ++I  + FW++K  +   SW  
Sbjct: 1018 EGGLGLKPLKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLW 1077

Query: 825  RKIMNARSIA--MQHIVYSAGRNSEFLLWHDPWPGIPLISRLSCAVVSITESVSMARLHT 652
            RKI+  R  A     +   +G  + F  WHD W   PL  RL   + S   ++ +   + 
Sbjct: 1078 RKILKQRDKARLFHRMEVRSGTFTSF--WHDHW--CPL-GRLHQHMGS-RGTIDLGIPNN 1131

Query: 651  IMDGNSWRPHSSSHHLAIELRRMLASIQIA------DADRITWND-----IAKIKISHIW 505
                     H    H A  L ++ + I++A      D DR  W        +    S  W
Sbjct: 1132 ATVAEVMNTHRRKRHRADFLNQIKSQIELARQDRSTDGDRSLWKQKEDTFKSSFSSSKTW 1191

Query: 504  NSIRHVGPSPPWVHKVWSSYMIKKCSFLMWTALHQWLLTKDRMINFGMNVDPSCLLCQQH 325
              IR +     W   VW S    K SF+ W A H  L T D++  +       C+ C + 
Sbjct: 1192 QQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGEE 1251

Query: 324  PESIVHLFSTCSYSLSIMQNSPHPL-----TCSWSDYKNGTFLIGNPEKQMEQMAILY-L 163
             E+  HLF +C YS  +  +    L       +W+       L+ +    +    + Y  
Sbjct: 1252 LETRDHLFFSCPYSSHVWFSLTKGLLNGRNILNWNLIT--PHLLDSSRPYLHVFTLRYAF 1309

Query: 162  AVAMYLIWKERNARLHNSHPKSAASLA 82
              +++ +W+ERN R H      AA LA
Sbjct: 1310 QASIHSLWRERNCRRHGETAIPAAKLA 1336


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  313 bits (803), Expect = 7e-83
 Identities = 182/552 (32%), Positives = 277/552 (50%), Gaps = 15/552 (2%)
 Frame = -1

Query: 1719 CVTSCHYSIKLNGALEGFFPAASGVRQGDPLSPYLFVIAMQVFTACLNFCKASRDFTYHW 1540
            C+T+  ++I +NGA  GFF +  G+RQGDPLSPYLFV+AM+VF+  L     S    YH 
Sbjct: 486  CITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHP 545

Query: 1539 KCRDLDITHITFADDILLYCYGDAPSISILMDGITLFSGISGLRPNNLKSSIFFCNVKPE 1360
            K  DL I+H+ FADD++++  G + S+  + + +  F+  SGL+ N  KS +F   +   
Sbjct: 546  KAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDLS 605

Query: 1359 IRNWAVGYTGFQLGELPMSYLGLPLLTSRLTMQQCLPLIMKLCARIQSWMNRFLSFSGRL 1180
             R  +  Y GF  G  P+ YLGLPL+  +L +    PL+ KL AR++SW+++ LSF+GR 
Sbjct: 606  ERITSAAY-GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRT 664

Query: 1179 QLIKSVLFGIQGYWAFHLFLPKGVIKKIQSILSNFLWGGTANSTAQHKIAWSTCCLPLEE 1000
            QLI SV+FG+  +W     LPKG IKKI+S+ S FLW G+ +     K++W  CCLP  E
Sbjct: 665  QLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSE 724

Query: 999  GGLGIRDLMEWNTAAMIFRVWRTLKPSASLWIRWFHTYIIMNKPFWTMKCSSFHSWGVRK 820
            GGLG R   EWN   ++  +W       SLW +W   + + +  FW +       W  + 
Sbjct: 725  GGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKM 784

Query: 819  IMNARSIAMQHIVYSAGRNSEFLLWHDPWPGI-PLISRLSCAVVSITESVSMARLHTIMD 643
            ++N R +A + I    G       W D W  + PLI  L             A++   +D
Sbjct: 785  LLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAID 844

Query: 642  GNSWRPHSSSHHLAIELRRMLASI----QIADADRITW----NDIAKIKISHIWNSIRHV 487
            G+ WR   S    A  +   LAS+     +  +D  +W     D      +  W  +R  
Sbjct: 845  GSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKTWEVLRPR 904

Query: 486  GPSPPWVHKVWSSYMIKKCSFLMWTALHQWLLTKDRMINFGMNVDPSCLLCQQHPESIVH 307
             P   W   VW    + K +F  WTA    L T+ R++++G+     C LC    E+  H
Sbjct: 905  RPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSFDTETRDH 964

Query: 306  LFSTCSYSLSI-----MQNSPHP-LTCSWSDYKNGTFLIGNPEKQMEQMAILYLAVAMYL 145
            L   C +S  +     ++  P   L C+W++  + T         + +  +  L V  Y 
Sbjct: 965  LLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSLLRKVVAQLVV--YN 1022

Query: 144  IWKERNARLHNS 109
            +W++RN  LH+S
Sbjct: 1023 LWRQRNLVLHSS 1034


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  313 bits (802), Expect = 9e-83
 Identities = 182/552 (32%), Positives = 277/552 (50%), Gaps = 15/552 (2%)
 Frame = -1

Query: 1719 CVTSCHYSIKLNGALEGFFPAASGVRQGDPLSPYLFVIAMQVFTACLNFCKASRDFTYHW 1540
            C+T+  ++I +NGA  GFF +  G+RQGDPLSPYLFV+AM+VF+  L     S    YH 
Sbjct: 486  CITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHP 545

Query: 1539 KCRDLDITHITFADDILLYCYGDAPSISILMDGITLFSGISGLRPNNLKSSIFFCNVKPE 1360
            K  DL I+H+ FADD++++  G + S+  + + +  F+  SGL+ N  KS +F   +   
Sbjct: 546  KAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDLS 605

Query: 1359 IRNWAVGYTGFQLGELPMSYLGLPLLTSRLTMQQCLPLIMKLCARIQSWMNRFLSFSGRL 1180
             R  +  Y GF  G  P+ YLGLPL+  +L +    PL+ KL AR++SW+++ LSF+GR 
Sbjct: 606  ERITSAAY-GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRT 664

Query: 1179 QLIKSVLFGIQGYWAFHLFLPKGVIKKIQSILSNFLWGGTANSTAQHKIAWSTCCLPLEE 1000
            QLI SV+FG+  +W     LPKG IKKI+S+ S FLW G+ +     K++W  CCLP  E
Sbjct: 665  QLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSE 724

Query: 999  GGLGIRDLMEWNTAAMIFRVWRTLKPSASLWIRWFHTYIIMNKPFWTMKCSSFHSWGVRK 820
            GGLG R   EWN   ++  +W       SLW +W   + + +  FW +       W  + 
Sbjct: 725  GGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKM 784

Query: 819  IMNARSIAMQHIVYSAGRNSEFLLWHDPWPGI-PLISRLSCAVVSITESVSMARLHTIMD 643
            ++N R +A + I    G       W D W  + PLI  L             A++   +D
Sbjct: 785  LLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAID 844

Query: 642  GNSWRPHSSSHHLAIELRRMLASI----QIADADRITW----NDIAKIKISHIWNSIRHV 487
            G+ WR   S    A  +   LAS+     +  +D  +W     D      +  W  +R  
Sbjct: 845  GSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKTWEVLRPR 904

Query: 486  GPSPPWVHKVWSSYMIKKCSFLMWTALHQWLLTKDRMINFGMNVDPSCLLCQQHPESIVH 307
             P   W   VW    + K +F  WTA    L T+ R++++G+     C LC    E+  H
Sbjct: 905  RPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSFDTETRDH 964

Query: 306  LFSTCSYSLSI-----MQNSPHP-LTCSWSDYKNGTFLIGNPEKQMEQMAILYLAVAMYL 145
            L   C +S  +     ++  P   L C+W++  + T         + +  +  L V  Y 
Sbjct: 965  LLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSLLRKVVAQLVV--YN 1022

Query: 144  IWKERNARLHNS 109
            +W++RN  LH+S
Sbjct: 1023 LWRQRNLVLHSS 1034

