BLASTX nr result

ID: Atractylodes21_contig00002082 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00002082
         (2238 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI34193.3| unnamed protein product [Vitis vinifera]              306   1e-80
ref|XP_003591003.1| Trihelix transcription factor [Medicago trun...   285   4e-74
ref|XP_002874424.1| hypothetical protein ARALYDRAFT_489647 [Arab...   276   2e-71
dbj|BAE99307.1| GTL1 - like protein [Arabidopsis thaliana]            274   8e-71
ref|NP_568506.2| putative trihelix DNA-binding protein [Arabidop...   274   8e-71

>emb|CBI34193.3| unnamed protein product [Vitis vinifera]
          Length = 522

 Score =  306 bits (785), Expect = 1e-80
 Identities = 216/530 (40%), Positives = 277/530 (52%), Gaps = 28/530 (5%)
 Frame = -2

Query: 1787 MFDHGVPVEQFHQFXXXXXXXXXXXXSL-IQTPP--------ISTSTXXXXXXXXXXXXX 1635
            MFD GVP +QFHQF            +  +Q PP        +S+ST             
Sbjct: 1    MFD-GVPSDQFHQFVAAAAAAAASSTTSQLQPPPPSLSFPLHVSSSTFPSFDLYPSGSGG 59

Query: 1634 XXXXXXNHHQAL------FQSHHFVRTPTRDQFNGTD---VKVDQE-------IDINDSW 1503
                    HQ L         HH    P +D  +  +   V ++ E       +D+ + W
Sbjct: 60   GGGAAAAAHQPLQVPHLLHPLHHHSSAPHKDDQDKEENALVSINLEPQKERSMLDLINPW 119

Query: 1502 SNDEVIQLLRIKSSSENWFRDFTWDNVSSKLAELGYKRSAXXXXXXXXXETCRSFSSTIG 1323
            SNDEV+ LLRI+SS ENW+ DFTW++VS KLAE G+KRSA         E+ R F++T+ 
Sbjct: 120  SNDEVLALLRIRSSMENWYPDFTWEHVSRKLAEQGFKRSAEKCKEKFEQES-RYFNTTMN 178

Query: 1322 YNKNSSRYLISEELDEHLYNADHNTHHHDTIQSPENI--KHQEEAQEHEKRGGENHVAQG 1149
            Y+KN   Y    EL+E LY+ + + H  D  +  + +  K  EE +  E+      V   
Sbjct: 179  YSKN---YRFFSELEE-LYHGE-SPHQQDVAEKNQKVVEKPNEEDRSLEEDSRNETVVGN 233

Query: 1148 QKGHFEKDPDEIMVQPTQDNVHVTXXXXXXXXXXXXXXXKGLCVDMVHKMMAQQEEMHKK 969
                 EK  D+   +  + +                   KG C  +V KMMAQQEEMH K
Sbjct: 234  PCLETEKVEDKSKGKKRKRHTQ----------NKSFEMFKGFCEAVVSKMMAQQEEMHNK 283

Query: 968  LLEDMVNRDEENIKREEAWRQEEMERVKREIHIREHEQEMARDRQSTITEFLNKLTSFDQ 789
            LLEDMV RDEE   REEAW+++EM+R+ +EI IREHEQ +A DRQ+TI  FL K TS   
Sbjct: 284  LLEDMVKRDEEKTAREEAWKKQEMDRINKEIEIREHEQAIAGDRQATIIGFLKKFTS-SN 342

Query: 788  TIQLPMPLCEITKIPLS-DKITEDPNQEKATAKDDTGKRWPRDEVLALINIRSNVNNGIG 612
             ++ P    +    P S   IT+  +QE        GKRWPRDEVLALIN+R ++N    
Sbjct: 343  PLEAPTSSRKRPSAPTSFPSITDHRDQE-------LGKRWPRDEVLALINLRCSLNV--- 392

Query: 611  GNNEAEHQGPIGNNHXXXXXXXXXXXXXXXXSLWERISQKMLELGYKRSAKRCKEKWENI 432
              ++   +GP                      LWERISQ ML LGYKRSAKRCKEKWENI
Sbjct: 393  -EDKEGAKGP----------------------LWERISQGMLALGYKRSAKRCKEKWENI 429

Query: 431  NKYFRKTKDANKKRSLDSRTCPYYHQLSKLYNQEKQASSSNCVGPSSDEP 282
            NKYFRKTKD +KKRSLDSRTCPY+HQLS LY+Q         V PSS+ P
Sbjct: 430  NKYFRKTKDVSKKRSLDSRTCPYFHQLSTLYSQ------GTLVVPSSEAP 473


>ref|XP_003591003.1| Trihelix transcription factor [Medicago truncatula]
            gi|355480051|gb|AES61254.1| Trihelix transcription factor
            [Medicago truncatula]
          Length = 557

 Score =  285 bits (729), Expect = 4e-74
 Identities = 177/451 (39%), Positives = 242/451 (53%), Gaps = 50/451 (11%)
 Frame = -2

Query: 1511 DSWSNDEVIQLLRIKSSSENWFRDFTWDNVSSKLAELGYKRSAXXXXXXXXXETCRSFSS 1332
            D W+NDEV+ LL+I+SS E+WF DFTW++VS KLAE+GYKRSA         E+   F +
Sbjct: 102  DPWTNDEVLALLKIRSSMESWFPDFTWEHVSRKLAEVGYKRSAEKCKEKFEEES--RFFN 159

Query: 1331 TIGYNKNS--SRYLISEELDEHLYNADHNTHHHDTIQSPENIKHQEEAQEHEKRGGENHV 1158
             I +N+NS    +    EL+E +Y      ++ + +++ +  + Q++   HE+    + V
Sbjct: 160  NINHNQNSFGKNFRFVTELEE-VYQGGGGENNKNLVEAEKQNEVQDKMDPHEEDSRMDDV 218

Query: 1157 AQGQKGHFEKDPDEIMVQPTQDNVHVTXXXXXXXXXXXXXXXKGLCVDMVHKMMAQQEEM 978
               +K       +E++ + T ++                   KG C  +V KMM QQEEM
Sbjct: 219  LVSKKSE-----EEVVEKGTTND----EKKRKRSGDDRFEVFKGFCESVVKKMMDQQEEM 269

Query: 977  HKKLLEDMVNRDEENIKREEAWRQEEMERVKREIHIREHEQEMARDRQSTITEFLNKLTS 798
            H KL+EDMV RDEE   REEAW+++EME++ +E+ +  HEQ +A DRQ+ I +FLNK ++
Sbjct: 270  HNKLIEDMVKRDEEKFSREEAWKKQEMEKMNKELELMAHEQAIAGDRQAHIIQFLNKFST 329

Query: 797  F-------DQTIQLPMPLCEITKIPLSDKI-TEDPNQE---------------------- 708
                      + QL   L  +T    S  + +++PN E                      
Sbjct: 330  SANSSSLTSMSTQLQAYLATLTSNSSSSTLHSQNPNPETLKKTLQPIPENPSSTLPSSST 389

Query: 707  ------------------KATAKDDTGKRWPRDEVLALINIRSNVNNGIGGNNEAEHQGP 582
                               +  +DD G+RWP+DEVLALIN+R N       NN  E +G 
Sbjct: 390  TLVAQPRNNNPISSYSLISSGERDDIGRRWPKDEVLALINLRCN-------NNNEEKEGN 442

Query: 581  IGNNHXXXXXXXXXXXXXXXXSLWERISQKMLELGYKRSAKRCKEKWENINKYFRKTKDA 402
              N                   LWERISQ MLELGYKRSAKRCKEKWENINKYFRKTKDA
Sbjct: 443  SNNK----------------APLWERISQGMLELGYKRSAKRCKEKWENINKYFRKTKDA 486

Query: 401  NKKRSLDSRTCPYYHQLSKLYNQEKQASSSN 309
            N+KRSLDSRTCPY+H L+ LYNQ K    S+
Sbjct: 487  NRKRSLDSRTCPYFHLLTNLYNQGKLVLQSD 517


>ref|XP_002874424.1| hypothetical protein ARALYDRAFT_489647 [Arabidopsis lyrata subsp.
            lyrata] gi|297320261|gb|EFH50683.1| hypothetical protein
            ARALYDRAFT_489647 [Arabidopsis lyrata subsp. lyrata]
          Length = 606

 Score =  276 bits (706), Expect = 2e-71
 Identities = 197/578 (34%), Positives = 270/578 (46%), Gaps = 79/578 (13%)
 Frame = -2

Query: 1787 MFDHGVPVEQFHQFXXXXXXXXXXXXSLIQTP-----PISTSTXXXXXXXXXXXXXXXXX 1623
            MFD GVP EQ H+F                       P+S S+                 
Sbjct: 1    MFDGGVP-EQIHRFIASPPPPPPLPPHQPAAERSLPFPVSFSSFNTNHQAQHMLSLDRRK 59

Query: 1622 XXNHHQALFQSHHFVRT--PTRDQFNGTDVKVDQEIDINDS-WSNDEVIQLLRIKSSSEN 1452
              +HH      HH ++    T +    TD   D     +   W +DEV+ LLR +S+ EN
Sbjct: 60   IIHHHH--HHHHHDIKDGGATAEWIGHTDHDGDNHHHHHHHPWCSDEVLALLRFRSTVEN 117

Query: 1451 WFRDFTWDNVSSKLAELGYKRSAXXXXXXXXXETCRSFSST---------IG-YNKNSSR 1302
            WF +F W++ S KLAE+G+KRS          E  R F+           IG YN   + 
Sbjct: 118  WFPEFNWEHTSRKLAEVGFKRSPQECKEKFEEEERRYFNINNNNTNDHQHIGNYNNKGNN 177

Query: 1301 YLISEELDEHLYNA-----DHNTHHHDTIQSPENI---------KHQEEAQEHEKRGGE- 1167
            Y +  E++E  Y+      D+    +D++++  N+           +++ ++H++   E 
Sbjct: 178  YRVFSEVEE-FYDVSSEVGDNQNKRNDSVEAKGNVGETVTGQDLMEEDKLRDHDQGQVEE 236

Query: 1166 -------NHVAQGQKGHFEKDPDEIMVQPTQDNVHVTXXXXXXXXXXXXXXXKGLCVDMV 1008
                   N +  G+ G+ E+D            +                  KG C  +V
Sbjct: 237  TSMENKINSIEVGKVGNVEEDAKSSSSSSLMMIIKEKEKRKRKKEKERFGVLKGFCEGLV 296

Query: 1007 HKMMAQQEEMHKKLLEDMVNRDEENIKREEAWRQEEMERVKREIHIREHEQEMARDRQST 828
              M+AQQEEMHKKLLEDMV  +EE I REEAW+++E+ERV +E+ IR  EQ MA DR ++
Sbjct: 297  RNMIAQQEEMHKKLLEDMVKNEEEKIAREEAWKKQEIERVNKEVEIRVQEQAMASDRNTS 356

Query: 827  ITEFLNKLTSFD----------------------------QTIQLPMPLCEITKIPLS-D 735
            I +F++K T  D                            QTI   +P       PL+ D
Sbjct: 357  IIKFISKFTDHDLDVVENPTSLSQDSSSLTLPKTQGRRKFQTISSLLPQTLTPHNPLTHD 416

Query: 734  KI----------TEDPNQEKATAKDDTGKRWPRDEVLALINIRSNVNNGIGGNNEAEHQG 585
            K           T+ P   K+  K D GKRWP+DEVLALINIR     GI   N+ +H+ 
Sbjct: 417  KSLEPTKTLKTKTQTPKPPKSDDKSDLGKRWPKDEVLALINIR----RGISNMNDDDHKD 472

Query: 584  PIGNNHXXXXXXXXXXXXXXXXSLWERISQKMLELGYKRSAKRCKEKWENINKYFRKTKD 405
                                   LWERIS+KMLE+GYKRSAKRCKEKWENINKYFRKTKD
Sbjct: 473  E-----------NSLSSSSKAVPLWERISKKMLEIGYKRSAKRCKEKWENINKYFRKTKD 521

Query: 404  ANKKRSLDSRTCPYYHQLSKLYNQEKQASSSNCVGPSS 291
             NKKR LDSRTCPY+HQL+ LY+Q    +++     +S
Sbjct: 522  VNKKRPLDSRTCPYFHQLTALYSQSSTGTTTTATTATS 559


>dbj|BAE99307.1| GTL1 - like protein [Arabidopsis thaliana]
          Length = 619

 Score =  274 bits (700), Expect = 8e-71
 Identities = 177/472 (37%), Positives = 233/472 (49%), Gaps = 81/472 (17%)
 Frame = -2

Query: 1505 WSNDEVIQLLRIKSSSENWFRDFTWDNVSSKLAELGYKRSAXXXXXXXXXETCRSFSST- 1329
            W +DEV+ LLR +S+ ENWF +FTW++ S KLAE+G+KRS          E  R F+S  
Sbjct: 103  WCSDEVLALLRFRSTVENWFPEFTWEHTSRKLAEVGFKRSPQECKEKFEEEERRYFNSNN 162

Query: 1328 -----------IG-YNKNSSRYLISEELDEHLYNADHNTHHHDTIQSPEN---------- 1215
                       IG YN   + Y I  E++E  ++   N H    +   +N          
Sbjct: 163  NNNNNTNDHQHIGNYNNKGNNYRIFSEVEEFYHHGHDNEHVSSEVGDNQNKRTNLVEGKG 222

Query: 1214 --------------IKHQEEAQEHEK--RGGENHVAQGQKGHFEKDPDEIMVQPTQDNVH 1083
                          ++ Q++ Q  E       N +  G+ G+ E D            + 
Sbjct: 223  NVGETVQDLMAEDKLRDQDQGQVEEASMENQRNSIEVGKVGNVEDDAKSSSSSSLMMVMK 282

Query: 1082 VTXXXXXXXXXXXXXXXKGLCVDMVHKMMAQQEEMHKKLLEDMVNRDEENIKREEAWRQE 903
                             KG C  +V  M+AQQEEMHKKLLEDMV ++EE I REEAW+++
Sbjct: 283  EKKRKKRKKEKERFGVLKGFCEGLVRNMIAQQEEMHKKLLEDMVKKEEEKIAREEAWKKQ 342

Query: 902  EMERVKREIHIREHEQEMARDRQSTITEFLNKLTSFD----------------------- 792
            E+ERV +E+ IR  EQ MA DR + I +F++K T  D                       
Sbjct: 343  EIERVNKEVEIRAQEQAMASDRNTNIIKFISKFTDHDLDVVQNPTSPSQDSSSLALRKTQ 402

Query: 791  -------------QTIQLPMPLCEITKI--PLSDKITEDPNQE----KATAKDDTGKRWP 669
                         QT+  P  L  I K   P S K  +  NQ     K+  K D GKRWP
Sbjct: 403  GRRKFQTSSSLLPQTLT-PHNLLTIDKSLEPFSTKTLKPKNQNPKPPKSDDKSDLGKRWP 461

Query: 668  RDEVLALINIRSNVNNGIGGNNEAEHQGPIGNNHXXXXXXXXXXXXXXXXSLWERISQKM 489
            +DEVLALINIR +++N    +++ E+     +                   LWERIS+KM
Sbjct: 462  KDEVLALINIRRSISNMNDDDHKDENSLSTSSK---------------AVPLWERISKKM 506

Query: 488  LELGYKRSAKRCKEKWENINKYFRKTKDANKKRSLDSRTCPYYHQLSKLYNQ 333
            LE+GYKRSAKRCKEKWENINKYFRKTKD NKKR LDSRTCPY+HQL+ LY+Q
Sbjct: 507  LEIGYKRSAKRCKEKWENINKYFRKTKDVNKKRPLDSRTCPYFHQLTALYSQ 558


>ref|NP_568506.2| putative trihelix DNA-binding protein [Arabidopsis thaliana]
            gi|75244603|sp|Q8H181.1|GTL2_ARATH RecName: Full=Trihelix
            transcription factor GTL2; AltName: Full=GT2-LIKE protein
            2; Short=AtGTL2; AltName: Full=Trihelix DNA-binding
            protein GTL2 gi|23306422|gb|AAN17438.1| Unknown protein
            [Arabidopsis thaliana] gi|30725452|gb|AAP37748.1|
            At5g28300 [Arabidopsis thaliana]
            gi|332006404|gb|AED93787.1| putative trihelix DNA-binding
            protein [Arabidopsis thaliana]
          Length = 619

 Score =  274 bits (700), Expect = 8e-71
 Identities = 177/472 (37%), Positives = 233/472 (49%), Gaps = 81/472 (17%)
 Frame = -2

Query: 1505 WSNDEVIQLLRIKSSSENWFRDFTWDNVSSKLAELGYKRSAXXXXXXXXXETCRSFSST- 1329
            W +DEV+ LLR +S+ ENWF +FTW++ S KLAE+G+KRS          E  R F+S  
Sbjct: 103  WCSDEVLALLRFRSTVENWFPEFTWEHTSRKLAEVGFKRSPQECKEKFEEEERRYFNSNN 162

Query: 1328 -----------IG-YNKNSSRYLISEELDEHLYNADHNTHHHDTIQSPEN---------- 1215
                       IG YN   + Y I  E++E  ++   N H    +   +N          
Sbjct: 163  NNNNNTNDHQHIGNYNNKGNNYRIFSEVEEFYHHGHDNEHVSSEVGDNQNKRTNLVEGKG 222

Query: 1214 --------------IKHQEEAQEHEK--RGGENHVAQGQKGHFEKDPDEIMVQPTQDNVH 1083
                          ++ Q++ Q  E       N +  G+ G+ E D            + 
Sbjct: 223  NVGETVQDLMAEDKLRDQDQGQVEEASMENQRNSIEVGKVGNVEDDAKSSSSSSLMMIMK 282

Query: 1082 VTXXXXXXXXXXXXXXXKGLCVDMVHKMMAQQEEMHKKLLEDMVNRDEENIKREEAWRQE 903
                             KG C  +V  M+AQQEEMHKKLLEDMV ++EE I REEAW+++
Sbjct: 283  EKKRKKRKKEKERFGVLKGFCEGLVRNMIAQQEEMHKKLLEDMVKKEEEKIAREEAWKKQ 342

Query: 902  EMERVKREIHIREHEQEMARDRQSTITEFLNKLTSFD----------------------- 792
            E+ERV +E+ IR  EQ MA DR + I +F++K T  D                       
Sbjct: 343  EIERVNKEVEIRAQEQAMASDRNTNIIKFISKFTDHDLDVVQNPTSPSQDSSSLALRKTQ 402

Query: 791  -------------QTIQLPMPLCEITKI--PLSDKITEDPNQE----KATAKDDTGKRWP 669
                         QT+  P  L  I K   P S K  +  NQ     K+  K D GKRWP
Sbjct: 403  GRRKFQTSSSLLPQTLT-PHNLLTIDKSLEPFSTKTLKPKNQNPKPPKSDDKSDLGKRWP 461

Query: 668  RDEVLALINIRSNVNNGIGGNNEAEHQGPIGNNHXXXXXXXXXXXXXXXXSLWERISQKM 489
            +DEVLALINIR +++N    +++ E+     +                   LWERIS+KM
Sbjct: 462  KDEVLALINIRRSISNMNDDDHKDENSLSTSSK---------------AVPLWERISKKM 506

Query: 488  LELGYKRSAKRCKEKWENINKYFRKTKDANKKRSLDSRTCPYYHQLSKLYNQ 333
            LE+GYKRSAKRCKEKWENINKYFRKTKD NKKR LDSRTCPY+HQL+ LY+Q
Sbjct: 507  LEIGYKRSAKRCKEKWENINKYFRKTKDVNKKRPLDSRTCPYFHQLTALYSQ 558


Top