BLASTX nr result

ID: Bupleurum21_contig00023128 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00023128
         (1577 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   385   e-104
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   374   e-101
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   371   e-100
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   369   e-100
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   369   2e-99

>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score =  385 bits (990), Expect = e-104
 Identities = 215/527 (40%), Positives = 310/527 (58%), Gaps = 8/527 (1%)
 Frame = +3

Query: 21   SWFVLTTKLKRVKQALKCLNNS-IGNVHLAVQEARNELYNLQNCIIGAPSDTQAIEERNL 197
            S F  + KLK +K  L+ L    +GN+    +EA   L   Q   +  PS +   EE   
Sbjct: 171  SLFRFSKKLKGLKPLLRNLGKERLGNLVKQTKEAFETLCQKQAMKMANPSPSSMQEENEA 230

Query: 198  MMKYQAALDSEENFLQQKSKVHWLQKGDGNNRFFFNYCRGRWNTNRIVGLQDP*GSLVTD 377
              K+      EE FL+Q+SK+HWL  GD NN+ F      R   N I  +    GS+ + 
Sbjct: 231  YAKWDHIAVLEEKFLKQRSKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQ 290

Query: 378  HHGIANIAVDYYRNLLGTEK------LVEPLPDLNLPSIPDDLGRSLIAPISSTEILRTL 539
               I   A  ++R  L           VE L DL      D     L   +S+ EI + +
Sbjct: 291  EEKIKTEAEHHFREFLQLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSAEEIHKVV 350

Query: 540  KSMKKNRSPGPDGFTPDFYIAVWDVVGSDLVNALHSFFDALDLPRQINATAISLIPKVDS 719
             SM  ++SPGPDG+T +FY   W+++G++ + A+ SFF    LP+ IN+T ++LIPK   
Sbjct: 351  FSMPNDKSPGPDGYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKE 410

Query: 720  PVEMHQFRPISCCNVLYKCITKILANRIKPVLKNLISFNQSAFIPSRSMGDNILLSQALC 899
              EM  +RPISCCNVLYK I+KI+ANR+K VL   I  NQSAF+  R + +N+LL+  + 
Sbjct: 411  AKEMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATEIV 470

Query: 900  RSYHLNKGAPRCMLKLDISKAFDSINWSFILNVLNAMHFPAKFTSWIRKCISTCMFSVKI 1079
            + YH +  + RC LK+DISKAFDS+ W F++NVL AM+FP +FT WI  CI+T  FSV++
Sbjct: 471  KDYHKDSVSSRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSVQV 530

Query: 1080 NGGLEGFFEGKSGVRQGDPLSPYLFVIAMEVLTCCL-ASVTSADFQFHPKCKDLKLSHLI 1256
            NG L G F     +RQG  LSPYLFVI+M+VL+  L  +V +  F +HPKC+ + L+HL 
Sbjct: 531  NGELAGVFSSARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLS 590

Query: 1257 FADDVLLFSHGDPRSVNTLLKGVDTFSRISGLHLNMQKSLIFFGNVRPADASAILQSSNL 1436
            FADD+++ S G  RS++ ++K +  F++ SGL ++M+KS ++   V+ +    I+Q  + 
Sbjct: 591  FADDLMILSDGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSF 650

Query: 1437 QRGSFPFSYLGIPLVTSRINAQLCTPLIMKLCARVNSWTGRFLSFGG 1577
              G  P  YLG+PLV+ R+ A  C PLI +L  ++ +WT RFLSF G
Sbjct: 651  DVGKLPVRYLGLPLVSKRLTASDCLPLIEQLRKKIEAWTSRFLSFAG 697


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  374 bits (959), Expect = e-101
 Identities = 199/530 (37%), Positives = 315/530 (59%), Gaps = 7/530 (1%)
 Frame = +3

Query: 9    VEGDSWFVLTTKLKRVKQALKCLN-NSIGNVHLAVQEARNELYNLQNCIIGAPSDTQAIE 185
            V G + + ++ KLK +K+ ++  + ++  ++    +EA + L   Q+ ++ +P  + A  
Sbjct: 171  VTGSAMYRVSVKLKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAI 230

Query: 186  ERNLMMKYQAALDSEENFLQQKSKVHWLQKGDGNNRFFFNYCRGRWNTNRIVGLQDP*GS 365
            E     K++   ++E +F  Q+S+V+WL++GD N+ +F      R + N I  L DP G 
Sbjct: 231  EAETQRKWRILAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGD 290

Query: 366  LVTDHHGIANIAVDYYRNLLGTEK---LVEPLPDLNLPSIPDDLGR--SLIAPISSTEIL 530
             +     + N  V+Y+++ LG+E+   L E     NL S      +  SL  P SS +I 
Sbjct: 291  RIEGQQNLENHCVEYFQSNLGSEQGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIK 350

Query: 531  RTLKSMKKNRSPGPDGFTPDFYIAVWDVVGSDLVNALHSFFDALDLPRQINATAISLIPK 710
                S+ +N++ GPDGF+P+F+ A W ++G ++  A+H FF +  L +Q NAT + LIPK
Sbjct: 351  NAFFSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPK 410

Query: 711  VDSPVEMHQFRPISCCNVLYKCITKILANRIKPVLKNLISFNQSAFIPSRSMGDNILLSQ 890
            + +   M  FRPISC N +YK I+K+L +R+K  L   IS +QSAF+P R   +N+LL+ 
Sbjct: 411  ITNASSMSDFRPISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLAT 470

Query: 891  ALCRSYHLNKGAPRCMLKLDISKAFDSINWSFILNVLNAMHFPAKFTSWIRKCISTCMFS 1070
             L   Y+    AP  MLK+D+ KAFDS+ W FI++ L A++ P KFT WI +C+ST  FS
Sbjct: 471  ELVHGYNKKNIAPSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFS 530

Query: 1071 VKINGGLEGFFEGKSGVRQGDPLSPYLFVIAMEVLTCCLAS-VTSADFQFHPKCKDLKLS 1247
            V +NG   G F    G+RQGDP+SPYLFV+AMEV +  L S  TS    +HPK   L++S
Sbjct: 531  VILNGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEIS 590

Query: 1248 HLIFADDVLLFSHGDPRSVNTLLKGVDTFSRISGLHLNMQKSLIFFGNVRPADASAILQS 1427
            HL+FADDV++F  G   S++ +++ ++ F+  SGL +N  K+ ++   +  +++ + + S
Sbjct: 591  HLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDS-MAS 649

Query: 1428 SNLQRGSFPFSYLGIPLVTSRINAQLCTPLIMKLCARVNSWTGRFLSFGG 1577
               + GS P  YLG+PL++ ++      PLI K+ AR NSW  R LSF G
Sbjct: 650  YGFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAG 699


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  371 bits (952), Expect = e-100
 Identities = 201/525 (38%), Positives = 299/525 (56%), Gaps = 8/525 (1%)
 Frame = +3

Query: 27   FVLTTKLKRVKQALKCL-NNSIGNVHLAVQEARNELYNLQNCIIGAPSDTQAIEERNLMM 203
            F  +  LK +K  ++ +  + +GN+     EA   L   Q+  +  PS     EE     
Sbjct: 285  FRFSKNLKGLKPKIRSMARDRLGNLSKKANEAYKILCAKQHVNLTNPSSMAMEEENAAYS 344

Query: 204  KYQAALDSEENFLQQKSKVHWLQKGDGNNRFFFNYCRGRWNTNRIVGLQDP*GSLVTDHH 383
            ++      EE +L+QKSK+HW Q GD N + F      R   N I  +    G + T   
Sbjct: 345  RWDRVAILEEKYLKQKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGD 404

Query: 384  GIANIAVDYYRNLLGTEK------LVEPLPDLNLPSIPDDLGRSLIAPISSTEILRTLKS 545
             I   A  ++R  L           +  L  L      D   +SLI P+++ EI + L  
Sbjct: 405  EIKAEAERFFREFLQLIPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFR 464

Query: 546  MKKNRSPGPDGFTPDFYIAVWDVVGSDLVNALHSFFDALDLPRQINATAISLIPKVDSPV 725
            M  ++SPGPDG+T +F+ A W+++G +   A+ SFF    LP+ IN+T ++LIPK     
Sbjct: 465  MPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAR 524

Query: 726  EMHQFRPISCCNVLYKCITKILANRIKPVLKNLISFNQSAFIPSRSMGDNILLSQALCRS 905
            EM  +RPISCCNVLYK I+KI+ANR+K VL   I+ NQSAF+  R + +N+LL+  L + 
Sbjct: 525  EMKDYRPISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKD 584

Query: 906  YHLNKGAPRCMLKLDISKAFDSINWSFILNVLNAMHFPAKFTSWIRKCISTCMFSVKING 1085
            YH +  + RC +K+DISKAFDS+ W F++NV   + FP +F  WI  CI+T  FSV++NG
Sbjct: 585  YHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNG 644

Query: 1086 GLEGFFEGKSGVRQGDPLSPYLFVIAMEVLTCCLASVTSA-DFQFHPKCKDLKLSHLIFA 1262
             L G+F+   G+RQG  LSPYLFVI M+VL+  L    +A  F +HPKCK + L+HL FA
Sbjct: 645  ELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFA 704

Query: 1263 DDVLLFSHGDPRSVNTLLKGVDTFSRISGLHLNMQKSLIFFGNVRPADASAILQSSNLQR 1442
            DD+++ S G  RS+  ++K  D F++ SGL ++++KS ++   +     + +        
Sbjct: 705  DDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSS 764

Query: 1443 GSFPFSYLGIPLVTSRINAQLCTPLIMKLCARVNSWTGRFLSFGG 1577
            G  P  YLG+PL+T R++   C PL+ ++  R+ SWT RFLS+ G
Sbjct: 765  GQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAG 809


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  369 bits (948), Expect = e-100
 Identities = 204/531 (38%), Positives = 296/531 (55%), Gaps = 8/531 (1%)
 Frame = +3

Query: 9    VEGDSWFVLTTKLKRVKQALKCLNNS-IGNVHLAVQEARNELYNLQNCIIGAPSDTQAIE 185
            V   + +  + KLK +K  L+ L    +G++    +EA   L   Q   +  PS     E
Sbjct: 576  VSTSALYRFSKKLKTLKPHLRELGKEKLGDLPKRTREAHILLCEKQATTLANPSQETIAE 635

Query: 186  ERNLMMKYQAALDSEENFLQQKSKVHWLQKGDGNNRFFFNYCRGRWNTNRIVGLQDP*GS 365
            E      +    + EE FL+QKSK+HW+  GDGNN +F    + R   N I  ++ P   
Sbjct: 636  ELKAYTDWTHLSELEEGFLKQKSKLHWMNVGDGNNSYFHKAAQVRKMRNSIREIRGPNAE 695

Query: 366  LVTDHHGIANIAVDYYRNLLGTEK------LVEPLPDLNLPSIPDDLGRSLIAPISSTEI 527
             +     I   A  ++   L  +        VE L +L            L   ++  EI
Sbjct: 696  TLQTSEEIKGEAERFFNEFLNRQSGDFHGISVEDLRNLMSYRCSVTDQNILTREVTGEEI 755

Query: 528  LRTLKSMKKNRSPGPDGFTPDFYIAVWDVVGSDLVNALHSFFDALDLPRQINATAISLIP 707
             + L +M  N+SPGPDG+T +F+ A W + G D + A+ SFF    LP+ +NAT ++LIP
Sbjct: 756  QKVLFAMPNNKSPGPDGYTSEFFKATWSLTGPDFIAAIQSFFVKGFLPKGLNATILALIP 815

Query: 708  KVDSPVEMHQFRPISCCNVLYKCITKILANRIKPVLKNLISFNQSAFIPSRSMGDNILLS 887
            K D  +EM  +RPISCCNVLYK I+KILANR+K +L + I  NQSAF+  R + +N+LL+
Sbjct: 816  KKDEAIEMKDYRPISCCNVLYKVISKILANRLKLLLPSFILQNQSAFVKERLLMENVLLA 875

Query: 888  QALCRSYHLNKGAPRCMLKLDISKAFDSINWSFILNVLNAMHFPAKFTSWIRKCISTCMF 1067
              L + YH     PRC +K+DISKAFDS+ W F+LN L A++FP  F  WI+ CIST  F
Sbjct: 876  TELVKDYHKESVTPRCAMKIDISKAFDSVQWQFLLNTLEALNFPETFRHWIKLCISTATF 935

Query: 1068 SVKINGGLEGFFEGKSGVRQGDPLSPYLFVIAMEVLTCCL-ASVTSADFQFHPKCKDLKL 1244
            SV++NG L GFF    G+RQG  LSPYLFVI M VL+  +  +    +  +HPKC+ + L
Sbjct: 936  SVQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKIGL 995

Query: 1245 SHLIFADDVLLFSHGDPRSVNTLLKGVDTFSRISGLHLNMQKSLIFFGNVRPADASAILQ 1424
            +HL FADD+++F  G   S+  ++     F+  SGL ++++KS I+   V  +D    L 
Sbjct: 996  THLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTLS 1055

Query: 1425 SSNLQRGSFPFSYLGIPLVTSRINAQLCTPLIMKLCARVNSWTGRFLSFGG 1577
            S     G  P  YLG+PL+T ++     +PLI  +  +++SWT R LS+ G
Sbjct: 1056 SFPFANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAG 1106


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  369 bits (946), Expect = 2e-99
 Identities = 189/518 (36%), Positives = 305/518 (58%), Gaps = 6/518 (1%)
 Frame = +3

Query: 42   KLKRVKQALKCLNNS-IGNVHLAVQEARNELYNLQNCIIGAPSDTQAIEERNLMMKYQAA 218
            +L+ VK+ALK  ++      H  V+E R +L  +Q     +       EE++L+ + +  
Sbjct: 280  RLQAVKRALKSFHSKKFSKAHCQVEELRRKLAAVQALPEVSQVSELQEEEKDLIAQLRKW 339

Query: 219  LDSEENFLQQKSKVHWLQKGDGNNRFFFNYCRGRWNTNRIVGLQDP*GSLVTDHHGIANI 398
               +E+ L+QKS++ WL  GD N++FFF   + R   N+IV LQ+  G  +T++  I N 
Sbjct: 340  STIDESILKQKSRIQWLSLGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNE 399

Query: 399  AVDYYRNLLGTEKLVEPLPDLNLPSIPDDLGRS----LIAPISSTEILRTLKSMKKNRSP 566
              ++YR LLGT        DL++  +   L  +    L+ PI+  EI + L  +   ++P
Sbjct: 400  ICNFYRRLLGTSSSQLEAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAP 459

Query: 567  GPDGFTPDFYIAVWDVVGSDLVNALHSFFDALDLPRQINATAISLIPKVDSPVEMHQFRP 746
            G DGF   F+   W V+  ++   +  FF+   + + IN TA++LIPK+D       +RP
Sbjct: 460  GLDGFNSVFFKKSWLVIKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRP 519

Query: 747  ISCCNVLYKCITKILANRIKPVLKNLISFNQSAFIPSRSMGDNILLSQALCRSYHLNKGA 926
            I+CC+ LYK I+KIL  R++ V+  ++   Q+ FIP R +GDNILL+  L R Y+    +
Sbjct: 520  IACCSTLYKIISKILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHVS 579

Query: 927  PRCMLKLDISKAFDSINWSFILNVLNAMHFPAKFTSWIRKCISTCMFSVKINGGLEGFFE 1106
            PRC++K+DI KA+DS+ W F+ ++L  + FP+ F  WI  C+ T  +S+ +NG     F+
Sbjct: 580  PRCVIKVDIRKAYDSVEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFD 639

Query: 1107 GKSGVRQGDPLSPYLFVIAMEVLTCCLASV-TSADFQFHPKCKDLKLSHLIFADDVLLFS 1283
             + G+RQGDPLSP+LF ++ME L+ C+ ++    +F FHPKC+ +KL+HL+FADD+L+F+
Sbjct: 640  AQKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFA 699

Query: 1284 HGDPRSVNTLLKGVDTFSRISGLHLNMQKSLIFFGNVRPADASAILQSSNLQRGSFPFSY 1463
              D  S++ ++   ++FS+ SGL  +++KS I+FG V   +A  +     +  GS PF Y
Sbjct: 700  RADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRY 759

Query: 1464 LGIPLVTSRINAQLCTPLIMKLCARVNSWTGRFLSFGG 1577
            LG+PL + ++N   C PLI K+  R   W    LS+ G
Sbjct: 760  LGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAG 797
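
Note on reading the values above: legacy BLAST prints very small E-values in a shortened form (e-104 stands for 1e-104), and the percentages in the Identities/Positives lines appear to be truncated rather than rounded (215/527 is about 40.8% but is shown as 40%). The following is a minimal Python sketch of how the summary rows could be read under those assumptions. The function names are hypothetical and are not part of BLAST or of any library.

```python
def parse_evalue(text):
    """Convert a legacy BLAST E-value string to a float.

    Assumption: a value printed without a mantissa, such as 'e-104',
    means 1e-104. Values like '2e-99' are parsed unchanged.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hit_line(line):
    """Parse one row of the 'Sequences producing significant alignments' table.

    Returns (sequence_id, bit_score, e_value), or None when the line does
    not look like a hit row. The description is skipped because it is
    truncated with '...' in the summary table.
    """
    fields = line.split()
    if len(fields) < 3 or "|" not in fields[0]:
        return None
    return fields[0], int(fields[-2]), parse_evalue(fields[-1])


def truncated_percent(matches, aligned_length):
    """Integer percentage, truncated the way the report appears to print it."""
    return matches * 100 // aligned_length


row = "gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   385   e-104"
print(parse_hit_line(row))
print(truncated_percent(215, 527))
```

Splitting on whitespace and taking the last two fields avoids having to parse the free-text description, which may itself contain numbers.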

