BLASTX nr result

ID: Atractylodes21_contig00024818

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00024818
         (1156 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ACB28472.1| polyprotein [Ananas comosus]                           360   e-105
gb|AAG51046.1|AC069473_8 gypsy/Ty-3 retroelement polyprotein; 69...   328   e-100
gb|AFJ66186.1| hypothetical protein 11M19.5 [Arabidopsis halleri]     338   e-100
gb|AAG51464.1|AC069160_10 gypsy/Ty3 element polyprotein, putativ...   315   4e-93
emb|CAN80132.1| hypothetical protein VITISV_012031 [Vitis vinifera]   311   3e-90

>gb|ACB28472.1| polyprotein [Ananas comosus]
          Length = 953

 Score =  360 bits (925), Expect(2) = e-105
 Identities = 178/365 (48%), Positives = 243/365 (66%), Gaps = 15/365 (4%)
 Frame = +1

Query: 1    VNGKPLHILIDSGSTHNFLDLLLAKKLGCSMEEMAPQAVTVADGNRIACQHKCKSFSWIM 180
            V  + +HILIDSGSTHNFLD  +A KLGC  E +    VTVADGN++     C++F W M
Sbjct: 18   VKNRRIHILIDSGSTHNFLDAAVAAKLGCCAENIPAVNVTVADGNKLISSSTCRAFKWKM 77

Query: 181  NKKHFTTDVMLISLGSCDMVLGVQWLSTLGQVTWDFKKLFMKFLLDGEQFSLKGIPSQKL 360
                F  +++L+ L  CDMVLGVQWL  LG + WDF KL M+F   G++  L+G     L
Sbjct: 78   QGLEFKANLLLLPLRGCDMVLGVQWLKQLGPILWDFSKLRMEFQFQGQKIVLRGSSGPSL 137

Query: 361  KVIEGEPSCKLLNTAAQLCLLQVATVDPTEPKRQHIQCPDD--------------QLKVL 498
            K+IEG+   K++     L  + + ++  T  +  HI   +D              QL++L
Sbjct: 138  KIIEGKQLKKMVLDDTALSAVHLCSIHATPQEGNHIATSEDAETTWSGLGKAYSQQLQLL 197

Query: 499  KEKFSMVFEDPSELPPCKDVFDHRIPLEAGSSPVNIRPYRYPLKQRDVIEQLVQEMYDRG 678
             E+ S +FE+P  LPP + + DH+IPL+ G++P+N+RPYRYP  Q+  IE+LVQEM  +G
Sbjct: 198  LEEHSDLFEEPQGLPPVR-LHDHKIPLKEGTNPINVRPYRYPAYQKTEIEKLVQEMLSQG 256

Query: 679  IIQNXXXXXXXXXXXXGKK-GTWRLCVDYRELNRRTIKNKFPIPVIEELIDELAGASVFS 855
            +I               KK G+WRLC+DYR LN  TIK+KFPIP+++EL+DEL+GA +FS
Sbjct: 257  VITPSNSPYSSPVVLVKKKDGSWRLCIDYRSLNDSTIKDKFPIPLVDELLDELSGAKLFS 316

Query: 856  KLDLRAGYHQLRVHPDDVFKTAFKTHTGHYEFLVMPFGLTNAPASFQGWMNNVFKPLLRK 1035
            +LDLR+GYHQ+R+H DD+ KTAF+TH GHYEFLVMPFGLTNAP++FQG MN++FKP LR+
Sbjct: 317  ELDLRSGYHQIRMHADDISKTAFRTHEGHYEFLVMPFGLTNAPSTFQGLMNHIFKPYLRR 376

Query: 1036 CVGVF 1050
             + VF
Sbjct: 377  FILVF 381



 Score = 50.4 bits (119), Expect(2) = e-105
 Identities = 21/40 (52%), Positives = 30/40 (75%)
 Frame = +2

Query: 1037 VLVFFDDILVYSRSKDEHWQHLEQVFELMRQNSMFAKMSK 1156
            +LVFFDDILVYS+  +EH  HL   F+++RQ+S+F +  K
Sbjct: 378  ILVFFDDILVYSKGVEEHLCHLRTTFQVLRQHSLFVRRKK 417


>gb|AAG51046.1|AC069473_8 gypsy/Ty-3 retroelement polyprotein; 69905-74404 [Arabidopsis
            thaliana] gi|10998138|dbj|BAB03109.1| retroelement pol
            polyprotein [Arabidopsis thaliana]
          Length = 1499

 Score =  328 bits (841), Expect(2) = e-100
 Identities = 173/356 (48%), Positives = 236/356 (66%), Gaps = 9/356 (2%)
 Frame = +1

Query: 10   KPLHILIDSGSTHNFLDLLLAKKLGCSMEEMAPQAVTVADGNRIACQHKCKSFSWIMNKK 189
            K + ILIDSGSTHNFLD   A KLGC ++      V+VADG ++  + K   FSW +   
Sbjct: 399  KIIFILIDSGSTHNFLDPNTAAKLGCKVDTAGLTRVSVADGRKLRVEGKVTDFSWKLQTT 458

Query: 190  HFTTDVMLISLGSCDMVLGVQWLSTLGQVTWDFKKLFMKFLLDGEQFSLKGIPSQKLKVI 369
             F +D++LI L   DMVLGVQWL TLG+++W+FKKL M+F  + ++  L G+ S  ++ +
Sbjct: 459  TFQSDILLIPLQGIDMVLGVQWLETLGRISWEFKKLEMRFKFNNQKVLLHGLTSGSVREV 518

Query: 370  EGEPSCKLLNTAAQLCLLQVATV-DPTEPKRQHIQCPDDQL-------KVLKEKFSMVFE 525
            + +   KL     QL +L V  V + TE +   I     +L       +VL E +  +F 
Sbjct: 519  KAQKLQKLQEDQVQLAMLCVQEVSESTEGELCTINALTSELGEESVVEEVLNE-YPDIFI 577

Query: 526  DPSELPPCKDVFDHRIPLEAGSSPVNIRPYRYPLKQRDVIEQLVQEMYDRGIIQNXXXXX 705
            +P+ LPP ++  +H+I L  GS+PVN RPYRY + Q++ I++LV+++   G +Q      
Sbjct: 578  EPTALPPFREKHNHKIKLLEGSNPVNQRPYRYSIHQKNEIDKLVEDLLTNGTVQASSSPY 637

Query: 706  XXXXXXXGKK-GTWRLCVDYRELNRRTIKNKFPIPVIEELIDELAGASVFSKLDLRAGYH 882
                    KK GTWRLCVDYRELN  T+K+ FPIP+IE+L+DEL GA +FSK+DLRAGYH
Sbjct: 638  ASPVVLVKKKDGTWRLCVDYRELNGMTVKDSFPIPLIEDLMDELGGAVIFSKIDLRAGYH 697

Query: 883  QLRVHPDDVFKTAFKTHTGHYEFLVMPFGLTNAPASFQGWMNNVFKPLLRKCVGVF 1050
            Q+R+ PDD+ KTAFKTH+GH+E+LVMPFGLTNAPA+FQG MN +FKP LRK V VF
Sbjct: 698  QVRMDPDDIQKTAFKTHSGHFEYLVMPFGLTNAPATFQGLMNFIFKPFLRKFVLVF 753



 Score = 63.9 bits (154), Expect(2) = e-100
 Identities = 30/40 (75%), Positives = 35/40 (87%)
 Frame = +2

Query: 1037 VLVFFDDILVYSRSKDEHWQHLEQVFELMRQNSMFAKMSK 1156
            VLVFFDDILVYS S +EH QHL+QVFE+MR N +FAK+SK
Sbjct: 750  VLVFFDDILVYSSSLEEHRQHLKQVFEVMRANKLFAKLSK 789


>gb|AFJ66186.1| hypothetical protein 11M19.5 [Arabidopsis halleri]
          Length = 1557

 Score =  338 bits (867), Expect(2) = e-100
 Identities = 172/357 (48%), Positives = 235/357 (65%), Gaps = 12/357 (3%)
 Frame = +1

Query: 16   LHILIDSGSTHNFLDLLLAKKLGCSMEEMAPQAVTVADGNRIACQHKCKSFSWIMNKKHF 195
            LHI +D GSTHNF+D+ +AK++ C +E   P  V  A G +     + K F+W M    F
Sbjct: 484  LHIFVDPGSTHNFIDIKVAKEINCKLEGTRPMTVDAALGGKTVTLFRSKDFTWRMQGYSF 543

Query: 196  TTDVMLISLGSCDMVLGVQWLSTLGQVTWDFKKLFMKFLLDGEQFSLKGIPSQKLKVIEG 375
            TT+V  + L   D+VLGVQWL+TLG + WDF  L M+F L+G ++ L+G      KVI+G
Sbjct: 544  TTEVRTLPLDHWDIVLGVQWLATLGPILWDFTYLRMEFTLNGAKYILRGTAKAGCKVIKG 603

Query: 376  EPSCKLLNTAAQLCLLQVATVDPTEPKR-----QHI------QCPDDQLKVLKEKFSMVF 522
                K+L+   Q+  LQ+   DP           HI         D  L+ L E +  +F
Sbjct: 604  NKLNKILSQEPQVAFLQL--YDPESATSVGASLSHIAVDESTSLSDATLQALLEAYEDLF 661

Query: 523  EDPSELPPCKDVFDHRIPLEAGSSPVNIRPYRYPLKQRDVIEQLVQEMYDRGIIQNXXXX 702
             +P+ LPP +  FDH+IP+EAG+SPV++RPYRY   Q+D+I+++V+EM  +GIIQN    
Sbjct: 662  IEPTGLPPFRKGFDHQIPVEAGASPVSLRPYRYNSIQKDIIDRMVREMLSQGIIQNSSSP 721

Query: 703  XXXXXXXXGKK-GTWRLCVDYRELNRRTIKNKFPIPVIEELIDELAGASVFSKLDLRAGY 879
                     KK G+WRLCVDYR +N++TIK+K+PIP++E+L+DEL G++ FSKLDLRAG+
Sbjct: 722  YASPVVLVKKKDGSWRLCVDYRGVNKQTIKDKYPIPLLEDLLDELGGSTYFSKLDLRAGF 781

Query: 880  HQLRVHPDDVFKTAFKTHTGHYEFLVMPFGLTNAPASFQGWMNNVFKPLLRKCVGVF 1050
            HQ+R+HP DV+KTAFKTH GHYE+LVMPFGLTN P +FQG MN VF+ + RK V VF
Sbjct: 782  HQIRMHPHDVYKTAFKTHAGHYEYLVMPFGLTNVPCTFQGLMNQVFRHIARKYVLVF 838



 Score = 53.5 bits (127), Expect(2) = e-100
 Identities = 24/40 (60%), Positives = 33/40 (82%)
 Frame = +2

Query: 1037 VLVFFDDILVYSRSKDEHWQHLEQVFELMRQNSMFAKMSK 1156
            VLVFFDDILVYS + ++H QHLE+VF ++R++ +F K SK
Sbjct: 835  VLVFFDDILVYSPTWEQHLQHLEEVFAVLRKHQLFLKPSK 874


>gb|AAG51464.1|AC069160_10 gypsy/Ty3 element polyprotein, putative [Arabidopsis thaliana]
          Length = 1447

 Score =  315 bits (806), Expect(2) = 4e-93
 Identities = 170/361 (47%), Positives = 232/361 (64%), Gaps = 11/361 (3%)
 Frame = +1

Query: 1    VNGKPLHILIDSGSTHNFLDLLLAKKLGCSMEEMAPQAVTVADGNRIACQHKCKSFSWIM 180
            V+ + L ILIDSGSTHNF+D  +A KLGC +E      V VADG ++    + K F+W +
Sbjct: 375  VDKRDLFILIDSGSTHNFIDSTVAAKLGCHVESAGLTKVAVADGRKLNVDGQIKGFTWKL 434

Query: 181  NKKHFTTDVMLISLGSCDMVLGVQWLSTLGQVTWDFKKLFMKFLLDGEQFSLKGIPSQKL 360
                F +D++LI L   DMVLGVQWL TLG+++W+FKKL M+F    ++  L GI +  +
Sbjct: 435  QSTTFQSDILLIPLQGVDMVLGVQWLETLGRISWEFKKLEMQFFYKNQRVWLHGIITGSV 494

Query: 361  KVIEGEPSCKLLNTAA-QLCLLQVATVDPTEPKRQHIQC---------PDDQLKVLKEKF 510
            + I+     KL  T A Q+ L  V   +    + Q I            +  ++ + E+F
Sbjct: 495  RDIKAH---KLQKTQADQIQLAMVCVREVVSDEEQEIGSISALTSDVVEESVVQNIVEEF 551

Query: 511  SMVFEDPSELPPCKDVFDHRIPLEAGSSPVNIRPYRYPLKQRDVIEQLVQEMYDRGIIQN 690
              VF +P++LPP ++  DH+I L  G++PVN RPYRY + Q+D I+++VQ+M   G IQ 
Sbjct: 552  PDVFAEPTDLPPFREKHDHKIKLLEGANPVNQRPYRYVVHQKDEIDKIVQDMIKSGTIQV 611

Query: 691  XXXXXXXXXXXXGKK-GTWRLCVDYRELNRRTIKNKFPIPVIEELIDELAGASVFSKLDL 867
                         KK GTWRLCVDY ELN  T+K++F IP+IE+L+DEL G+ VFSK+DL
Sbjct: 612  SSSPFASPVVLVKKKDGTWRLCVDYTELNGMTVKDRFLIPLIEDLMDELGGSVVFSKIDL 671

Query: 868  RAGYHQLRVHPDDVFKTAFKTHTGHYEFLVMPFGLTNAPASFQGWMNNVFKPLLRKCVGV 1047
            RAGYHQ+R+ PDD+ KTAFKTH GH+E+LVM FGLTNAPA+FQ  MN+VF+  LRK V V
Sbjct: 672  RAGYHQVRMDPDDIQKTAFKTHNGHFEYLVMLFGLTNAPATFQSLMNSVFRDFLRKFVLV 731

Query: 1048 F 1050
            F
Sbjct: 732  F 732



 Score = 54.7 bits (130), Expect(2) = 4e-93
 Identities = 26/40 (65%), Positives = 32/40 (80%)
 Frame = +2

Query: 1037 VLVFFDDILVYSRSKDEHWQHLEQVFELMRQNSMFAKMSK 1156
            VLVFFDDIL+YS S +EH +HL  VFE+MR + +FAK SK
Sbjct: 729  VLVFFDDILIYSSSIEEHKEHLRLVFEVMRLHKLFAKGSK 768


>emb|CAN80132.1| hypothetical protein VITISV_012031 [Vitis vinifera]
          Length = 1371

 Score =  311 bits (797), Expect(2) = 3e-90
 Identities = 158/358 (44%), Positives = 229/358 (63%), Gaps = 10/358 (2%)
 Frame = +1

Query: 7    GKPLHILIDSGSTHNFLDLLLAKKLGCSMEEMAPQAVTVADGNRIACQHKCKSFSWIMNK 186
            G+ L +LIDSGS+HNFL   +AK++ C  ++     VTVA+G+ + C   C  F W M  
Sbjct: 393  GRSLFVLIDSGSSHNFLSSKVAKRVDCCWQKARGIRVTVANGHELHCTALCSDFRWRMQG 452

Query: 187  KHFTTDVMLISLGSCDMVLGVQWLSTLGQVTWDFKKLFMKFLLDGEQFSLKGIP------ 348
            + F  +V ++ L + D++LG QWL+TLG ++W+F  L M F L+G+ + L+G        
Sbjct: 453  QEFIAEVYVLPLETYDLILGTQWLATLGDISWNFNTLQMGFELNGKPYLLQGKNKLQERM 512

Query: 349  ---SQKLKVIEGEPSCKLLNTAAQLCLLQVATVDPTEPKRQHIQCPDDQLKVLKEKFSMV 519
               + KLK +  +P    +   +   L  +   + T  +        ++L+ + + F+ V
Sbjct: 513  SPWADKLKGLVEQPGLFAIQDLSDATLWAIQVAENTHLEETLTPQQQEELQKMLQAFADV 572

Query: 520  FEDPSELPPCKDVFDHRIPLEAGSSPVNIRPYRYPLKQRDVIEQLVQEMYDRGIIQNXXX 699
            FE+P+ LPP +D +DH+I L+  + P+N RPYRY   Q+D IE+L+ EM   G+I+    
Sbjct: 573  FEEPTGLPPVRD-YDHQIDLKDEAGPINCRPYRYAAVQKDAIEKLIGEMLHAGVIRQSRS 631

Query: 700  XXXXXXXXXGKK-GTWRLCVDYRELNRRTIKNKFPIPVIEELIDELAGASVFSKLDLRAG 876
                      KK G+WRLCVDYR LN+ T+K+KFPIPVIEEL++EL G+++FSK+DLR+G
Sbjct: 632  PYASPVVLVKKKDGSWRLCVDYRALNQVTVKDKFPIPVIEELLEELGGSTIFSKIDLRSG 691

Query: 877  YHQLRVHPDDVFKTAFKTHTGHYEFLVMPFGLTNAPASFQGWMNNVFKPLLRKCVGVF 1050
            Y Q+R+H  DV KTAFKTH GHYEFLVMPFGLTNAP++FQ  MNN+F+P LRK + VF
Sbjct: 692  YWQIRMHEPDVPKTAFKTHEGHYEFLVMPFGLTNAPSTFQSLMNNIFQPYLRKFILVF 749



 Score = 48.5 bits (114), Expect(2) = 3e-90
 Identities = 20/40 (50%), Positives = 30/40 (75%)
 Frame = +2

Query: 1037 VLVFFDDILVYSRSKDEHWQHLEQVFELMRQNSMFAKMSK 1156
            +LVFFDDIL+YSRS  +H  HL    +++R+N ++AK +K
Sbjct: 746  ILVFFDDILIYSRSFSDHIHHLSIALQVLRENLLYAKSNK 785

