BLASTX nr result

ID: Atractylodes21_contig00033065

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00033065
         (1570 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002522374.1| hypothetical protein RCOM_0603630 [Ricinus c...   324   3e-86
ref|XP_002310176.1| predicted protein [Populus trichocarpa] gi|2...   275   2e-71
ref|XP_003548909.1| PREDICTED: uncharacterized protein LOC100818...   241   5e-61
ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana] ...   239   1e-60
emb|CAB62317.1| putative protein [Arabidopsis thaliana]               239   1e-60

>ref|XP_002522374.1| hypothetical protein RCOM_0603630 [Ricinus communis]
            gi|223538452|gb|EEF40058.1| hypothetical protein
            RCOM_0603630 [Ricinus communis]
          Length = 1720

 Score =  324 bits (831), Expect = 3e-86
 Identities = 193/533 (36%), Positives = 289/533 (54%), Gaps = 12/533 (2%)
 Frame = -3

Query: 1568 ALNELLDDSTSFYFTDFRVDQLTLRISNWSAPAFNWEVQGFHVTISPRVVE---GSGRSR 1398
            +LN+LLDD++ F F    +++LTLR SNWS PAFN EV+G +V +  R  E    S R+R
Sbjct: 47   SLNQLLDDASLFSFGGVTIEELTLRFSNWSVPAFNIEVRGVNVILVAREEEEERSSVRAR 106

Query: 1397 EPSEVLLEEKKKVLREIDPEGSALHDIMEKLADISLSRSQ-TTSLPMLILSYCCLQMCDI 1221
            + SE + EEKKK +   DPEG ALHD++EK+   + SR   TTSL  LIL +C LQ+ D 
Sbjct: 107  KSSEKVNEEKKKAVAGFDPEGGALHDVLEKILISTPSRKGFTTSLLNLILKHCHLQVFDT 166

Query: 1220 NLRLQLAISDDSFECLWEIEELNADSRLVEPQSFLRGYINSLFISSKESYFDLEIRGLEI 1041
             L++Q+ I +D   CL E++E N +S   E    LRG++   F   KE+   +  +GL I
Sbjct: 167  KLQVQVPILNDDLVCLLELKEFNGESEYFEHGCLLRGFLGVAFNPPKETSIVMNFKGLGI 226

Query: 1040 RLKNNDRIRPVCYATDIICSLKLSDLQLVELHCSIEELVTSFSPADXXXXXXXXXXXXXX 861
                ND+   V  +TD+   ++L+DLQL ++   +  L    SP D              
Sbjct: 227  GYWMNDKENSVVSSTDLFSCIRLNDLQLADISIRVPGLNLLLSPLDLLVLSVLGRLPLKE 286

Query: 860  XSPIRNGRQLWKETATKIRSLISTRRWSVWKLVSVVCLWLRYVHAWENLFLLVGYPMDIM 681
               +RNGRQLW+  A ++  + S  R S+  L   VC+WLRY++A+E+L   +GY    +
Sbjct: 287  PKHVRNGRQLWRLAANRLGYVTSFPRLSLHNLADFVCMWLRYLNAYEHLLSFIGYTQVNL 346

Query: 680  IKRSAVKMSKNQKFSKSFRCQWKVISEIEKEIPPAAIALAHRVVRCRTVKNVVPNED--- 510
            +KR ++ M +++ F  S +  W++IS  EKE+PP AIA A R+ R +   ++   ED   
Sbjct: 347  LKRPSIGMLRDKMFHSSVKQHWELISRTEKELPPEAIAQARRIARYKATLSIPQGEDSYK 406

Query: 509  ELPVTRYLEYFWKTCRLLGLIWSTVCSMFSSITHWAHSRNSFATHPNMK-RIGVLPTDSC 333
            E  V    + F K   LL   W+ +  +  S  H   S       P     +G++  D C
Sbjct: 407  EYSVRSQFQVFSKVLSLLVFTWNVIHRVVLSNIHAFLSIVFSRQEPKFDGHLGIISEDHC 466

Query: 332  PNLCYRLNLGKIFINISPDNTIPSVGKRTVSDRWVSHLDLLSFCLLIDTFIMVYKETICE 153
            P  C+ LN GK+ I     NTI +V K+  S   +S  D+ SFCL +D  ++VY + I E
Sbjct: 467  PQYCFLLNFGKVLITFCSGNTIHNVIKKLESHIGISLPDIHSFCLSLDALLLVYVDDIFE 526

Query: 152  DHLTFSCGSFKIMYSSAAGATTNKYG---YSLKGFQKPQVL-DSKTIIQGKPA 6
               + SCG  K+  SS  G T  +     +++KG ++     DSKT++QG+PA
Sbjct: 527  QSFSLSCGKLKVKTSSVTGDTATEGSSKHHTVKGNRERMTANDSKTVLQGEPA 579


>ref|XP_002310176.1| predicted protein [Populus trichocarpa] gi|222853079|gb|EEE90626.1|
            predicted protein [Populus trichocarpa]
          Length = 868

 Score =  275 bits (703), Expect = 2e-71
 Identities = 174/524 (33%), Positives = 276/524 (52%), Gaps = 10/524 (1%)
 Frame = -3

Query: 1547 DSTSFYFTDFRVDQLTLRISNWSAPAFNWEVQGFHVTISPRVVEGSGRSREPSEVLLEEK 1368
            +S+ F F +  VD L+ R SNWS+PA    ++G ++T+    V+  G  R   + L EEK
Sbjct: 53   ESSRFQFKEVTVDHLSFRFSNWSSPACKIGIRGVNITLLAGEVKEEGSLRRARK-LSEEK 111

Query: 1367 KKVLREIDPEGSALHDIMEKLADISLSRSQ-TTSLPMLILSYCCLQMCDINLRLQLAISD 1191
            KK +   DPEGSALH+++E++     SR+   TSL  L+L +C LQ+ D NL++Q    +
Sbjct: 112  KKAVAGFDPEGSALHNVLERILLNPPSRNWFKTSLLNLLLKHCHLQISDTNLQVQFPDLN 171

Query: 1190 DSFECLWEIEELNADSRLVEPQSFLRGYINSLFISSKESYFDLEIRGLEIRLKNNDRIRP 1011
            D+   L E+++ N +S   +P   LRG + ++F   K   F ++ RG     K  D+I  
Sbjct: 172  DAVVFLLELKDFNGESEHSDPGCLLRGVVGAVFKPLKVVSFVMDFRGFGFAYKMEDQINH 231

Query: 1010 VCYATDIICSLKLSDLQLVELHCSIEELVTSFSPADXXXXXXXXXXXXXXXSPIRNGRQL 831
            +   TD++  +KL+DL++ + +  + +L   FSP D                 +R+GRQL
Sbjct: 232  ISSFTDLLSCIKLNDLRVADFNIRVPKLSLLFSPLDLLVLSAFGKLSTKERKHVRSGRQL 291

Query: 830  WKETATKIRSLISTRRWSVWKLVSVVCLWLRYVHAWENLFLLVGYPMDIMIKRSAVKMSK 651
            WK  A ++  + S+ R S+ KLV  +CLWLRY +A+E L  L+GY  D ++K+S +K+S+
Sbjct: 292  WKLAANRLGYVPSSPRLSLHKLVDFICLWLRYQNAYEYLLSLLGYSADNLLKKSVIKLSE 351

Query: 650  NQKFSKSFRCQWKVISEIEKEIPPAAIALAHRVVRCRTVKNVVPNED---ELPVTRYLEY 480
            ++ F  S +  W  IS IEKE+P  AIA A R+ R R V N+   ++   E  + + +  
Sbjct: 352  DKMFLNSVKHNWGEISGIEKELPAEAIAQARRIARYRAVSNIQNGKNSFKESSMDKQVNV 411

Query: 479  FWKTCRLLGLIWSTVCSMFSSITHWAHSRNSFATHPNMK-RIGVLPTDSCPNLCYRLNLG 303
            F K   +  +IW+ +  +  SI H       F   P +    G    D     C+ LN G
Sbjct: 412  FSKILSVFIVIWNVMYKILLSILHCFFFIILFFQRPKLDWNPGNNSEDYSSRYCFLLNFG 471

Query: 302  KIFINISPDNTIPSVGKRTVSDRWVSHLDLLSFCLLIDTFIMVYKETICEDHLTFSCGSF 123
            KI +  S  +   +V +R  S   +S+ D+ SF L I   ++ Y + + E  L+ SCG  
Sbjct: 472  KILVTFSSTSKHKNVDERIESHTGISYSDIHSFSLSIHMLLLAYVDEVFEQSLSLSCGKL 531

Query: 122  KIMYSS----AAGATTNKYGYSLKGFQKPQVLDS-KTIIQGKPA 6
            K+  SS    A    + K  +S K  ++   +D  KTI+ GKPA
Sbjct: 532  KVKSSSVMETAIVDRSVKNPFSSKKVRRKGSVDKLKTILMGKPA 575


>ref|XP_003548909.1| PREDICTED: uncharacterized protein LOC100818143 [Glycine max]
          Length = 3602

 Score =  241 bits (614), Expect = 5e-61
 Identities = 166/532 (31%), Positives = 263/532 (49%), Gaps = 11/532 (2%)
 Frame = -3

Query: 1568 ALNELLDDSTSFYFTDFRVDQLTLRISNWSAPAFNWEVQGFHVTIS---PRVVEGSGRSR 1398
            ALN L       +F D  V++LTLR S W  PAF  E+ G  +  S   P   E + R R
Sbjct: 47   ALNRLFHSPAFLFFKDLSVERLTLRFSTWFPPAFTVELHGVRIVQSFEKPEAEECAARLR 106

Query: 1397 EPSEVLLEEKKKVLREIDPEGSALHDIMEKLADISLSRSQ-TTSLPMLILSYCCLQMCDI 1221
                   +  +K L  +DPEG +LHDI+E++   +  +   TTS   LIL  C L    I
Sbjct: 107  NSKYDCEDYLRKNLSALDPEGCSLHDILERILFAAPEKKDFTTSFWNLILKNCHLVAHCI 166

Query: 1220 NLRLQLAISDDSFECLWEIEELNADSRLVEPQSFLRGYINSLFISSKESYFDLEIRGLEI 1041
            ++ +QL + +D F C  EI+EL+  S+ V+ +  LRG+++S+FI  K+S   L+  G   
Sbjct: 167  HVEIQLPVLNDEFMCFGEIKELSVRSKYVDKKCLLRGFLSSVFIPMKDSTLVLKGVGFRA 226

Query: 1040 RLKNNDRIRPVCYATDIICSLKLSDLQLVELHCSIEELVTSFSPADXXXXXXXXXXXXXX 861
            RL   D    V  ++D+   +K  DL+L        ELV SFSP                
Sbjct: 227  RLVGKDHTGNVLLSSDMQIDIKFRDLKLASCTLCFPELVFSFSPDGISVCLLFLKLVSNN 286

Query: 860  XSPIRNGRQLWKETATKIRSLISTRRWSVWKLVSVVCLWLRYVHAWENLFLLVGYPMDIM 681
             +  R  R+LW+  A++I  +  T R S  +LV V+  W+ Y +A+EN+ LL+GY     
Sbjct: 287  YNQSRGARELWRIAASRIGHVTVTPRLSFHRLVGVIGQWIHYANAYENILLLIGYSTSHT 346

Query: 680  IKRSAVKMSKNQKFSKSFRCQWKVISEIEKEIPPAAIALAHRVVRCR-TVKNVVPNEDEL 504
             K+S  K+++N+    S    WK+IS+IEK++P   I+LA R+ R R  +K+ +   ++ 
Sbjct: 347  WKKSISKLTRNKLILSSASRHWKLISDIEKKLPVEGISLARRIARHRAALKDSINCHEDF 406

Query: 503  PVTRYLEYFWKTCRLLGLIWSTVCSMFSSITHWAHSRNSFATHPNMKR--IGVLPTDSCP 330
              T   ++F     LL  +W  + ++   + +   SR      P++    +  L  D C 
Sbjct: 407  VTTN--KFFRPFIFLLSFMWKLISTIIHCLVN-IFSREKIVQDPDIDGCCLESLIEDPCQ 463

Query: 329  NLCYRLNLGKIFINISPDNTI-PSVGKRTVSDRWVSHLDLLSFCLLIDTFIMVYKETICE 153
            + C+ LN GKI I +S  N I PSV ++  S   ++    LS C  ID  +++  + I E
Sbjct: 464  SCCFVLNFGKIIITVSQINEIDPSVYEKLQSLAGIACSAFLSICFCIDALLLISVKDIFE 523

Query: 152  DHLTFSCGSFKI---MYSSAAGATTNKYGYSLKGFQKPQVLDSKTIIQGKPA 6
              +  SCG  K+     + +  A T     S KG +K  +   ++I+  +PA
Sbjct: 524  QRIFLSCGQMKVESAPLTMSEEACTMDPLSSAKGNEKEGINHMESIMWVEPA 575


>ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332645140|gb|AEE78661.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 3072

 Score =  239 bits (611), Expect = 1e-60
 Identities = 180/536 (33%), Positives = 260/536 (48%), Gaps = 16/536 (2%)
 Frame = -3

Query: 1565 LNELLDDSTSFYFTDFRVDQLTLRISNWSAPAFNWEVQGFHVTISPRVVE--GSGRSREP 1392
            LN+L D+S +F F  F VDQL +  S WSAPA  +E++G +V +S R  +   S R R  
Sbjct: 49   LNQLFDES-NFQFEKFTVDQLVVSFSVWSAPAIKFEIRGVNVKLSARGTDEGSSRRKRAS 107

Query: 1391 SEVLLEEKKKVLREIDPEGSALHDIMEKLADISLSRSQT--TSLPMLILSYCCLQMCDIN 1218
            S+ +  E KKVL  IDP+G  LHDI+EK+   S S+     TS   LIL +  +Q+  IN
Sbjct: 108  SDTVANEIKKVLSSIDPKGCVLHDILEKMLGRSTSQISKLKTSFSNLILRHFRIQIHGIN 167

Query: 1217 LRLQLAISDDSFECLWEIEELNADSRLVEPQSFLRGYINSLFISSKESYFDLEIRGLEIR 1038
            +++ L  S D   CL EI EL +DS      S +R    ++    + S F L   G  I 
Sbjct: 168  VQVCLPGSSD-LSCLMEINELRSDSENFGNLSLVRSSAAAVLFPLRRSSFTLSCFGFNIG 226

Query: 1037 LKNNDRIRPVCYATDIICSLKLSDLQLVELHCSIEELVTSFSPADXXXXXXXXXXXXXXX 858
             K ++ I  +C    ++  + L +LQLV+L   + EL  SF P D               
Sbjct: 227  YKRDNEIVDLCGFDSLVMLITLHNLQLVDLVVRVPELSFSFRPTDLPVLMGLANLSSKDS 286

Query: 857  SPIRNGRQLWKETATKIRSLISTRRWSVWKLVSVVCLWLRYVHAWENLFLLVGYPMDIMI 678
            + +RNGR LWK  A +   +IS    S   LVSVV LWLRYV+A+E L  L GY   +  
Sbjct: 287  NYVRNGRYLWKVAARRTGLMISPHSVSFQNLVSVVILWLRYVNAYEYLLSLAGYSRKMPE 346

Query: 677  KRSAVKMSKNQKFSKSFRCQWKVISEIEKEIPPAAIALAHRVVR---CRTVKNVVPNEDE 507
            K    K S+N++   + R +W++I  IEKE+P  AIA A RV R   C   ++   + DE
Sbjct: 347  KSLLWKFSENKRHFVTARRKWEMICNIEKELPAEAIARARRVARYRACLNSQDADDDYDE 406

Query: 506  LPVTRYLEYFWKTCRLLGLIWSTVCSMFSSITHWAHSRNSFATHPNMKRIGVLPTDSCPN 327
              +  + +Y  KT  +L  IW  +   F SI  +    N   T              C +
Sbjct: 407  SSLYGHFKYLSKTTWVLAYIWRLISRTFWSIACFLW-LNKLLTQELQTDRNNEDDSECVS 465

Query: 326  LCYR--LNLGKIFINISPDNTIPSVGKRTVSDRWVSHLD--LLSFCLLIDTFIMVYKETI 159
            L +   +NLGK+ +   P+  I S      S     H+D  ++  CL +D F+++Y    
Sbjct: 466  LEFHAVVNLGKLSVTCYPEKIISSF---MTSKDSTGHVDSNIVMLCLSVDEFLVLYTVGC 522

Query: 158  CEDHLTFSCGSFKIMYSSAAGA-----TTNKYGYSLKGFQKPQVLDSKTIIQGKPA 6
               +L+ SCG  K+  SS         +T     S +G +K    D KTI+   PA
Sbjct: 523  LTQYLSASCGKLKVESSSFKNTSRFMKSTKDPSSSSEGNKKHMREDVKTILDMDPA 578


>emb|CAB62317.1| putative protein [Arabidopsis thaliana]
          Length = 3071

 Score =  239 bits (611), Expect = 1e-60
 Identities = 180/536 (33%), Positives = 260/536 (48%), Gaps = 16/536 (2%)
 Frame = -3

Query: 1565 LNELLDDSTSFYFTDFRVDQLTLRISNWSAPAFNWEVQGFHVTISPRVVE--GSGRSREP 1392
            LN+L D+S +F F  F VDQL +  S WSAPA  +E++G +V +S R  +   S R R  
Sbjct: 49   LNQLFDES-NFQFEKFTVDQLVVSFSVWSAPAIKFEIRGVNVKLSARGTDEGSSRRKRAS 107

Query: 1391 SEVLLEEKKKVLREIDPEGSALHDIMEKLADISLSRSQT--TSLPMLILSYCCLQMCDIN 1218
            S+ +  E KKVL  IDP+G  LHDI+EK+   S S+     TS   LIL +  +Q+  IN
Sbjct: 108  SDTVANEIKKVLSSIDPKGCVLHDILEKMLGRSTSQISKLKTSFSNLILRHFRIQIHGIN 167

Query: 1217 LRLQLAISDDSFECLWEIEELNADSRLVEPQSFLRGYINSLFISSKESYFDLEIRGLEIR 1038
            +++ L  S D   CL EI EL +DS      S +R    ++    + S F L   G  I 
Sbjct: 168  VQVCLPGSSD-LSCLMEINELRSDSENFGNLSLVRSSAAAVLFPLRRSSFTLSCFGFNIG 226

Query: 1037 LKNNDRIRPVCYATDIICSLKLSDLQLVELHCSIEELVTSFSPADXXXXXXXXXXXXXXX 858
             K ++ I  +C    ++  + L +LQLV+L   + EL  SF P D               
Sbjct: 227  YKRDNEIVDLCGFDSLVMLITLHNLQLVDLVVRVPELSFSFRPTDLPVLMGLANLSSKDS 286

Query: 857  SPIRNGRQLWKETATKIRSLISTRRWSVWKLVSVVCLWLRYVHAWENLFLLVGYPMDIMI 678
            + +RNGR LWK  A +   +IS    S   LVSVV LWLRYV+A+E L  L GY   +  
Sbjct: 287  NYVRNGRYLWKVAARRTGLMISPHSVSFQNLVSVVILWLRYVNAYEYLLSLAGYSRKMPE 346

Query: 677  KRSAVKMSKNQKFSKSFRCQWKVISEIEKEIPPAAIALAHRVVR---CRTVKNVVPNEDE 507
            K    K S+N++   + R +W++I  IEKE+P  AIA A RV R   C   ++   + DE
Sbjct: 347  KSLLWKFSENKRHFVTARRKWEMICNIEKELPAEAIARARRVARYRACLNSQDADDDYDE 406

Query: 506  LPVTRYLEYFWKTCRLLGLIWSTVCSMFSSITHWAHSRNSFATHPNMKRIGVLPTDSCPN 327
              +  + +Y  KT  +L  IW  +   F SI  +    N   T              C +
Sbjct: 407  SSLYGHFKYLSKTTWVLAYIWRLISRTFWSIACFLW-LNKLLTQELQTDRNNEDDSECVS 465

Query: 326  LCYR--LNLGKIFINISPDNTIPSVGKRTVSDRWVSHLD--LLSFCLLIDTFIMVYKETI 159
            L +   +NLGK+ +   P+  I S      S     H+D  ++  CL +D F+++Y    
Sbjct: 466  LEFHAVVNLGKLSVTCYPEKIISSF---MTSKDSTGHVDSNIVMLCLSVDEFLVLYTVGC 522

Query: 158  CEDHLTFSCGSFKIMYSSAAGA-----TTNKYGYSLKGFQKPQVLDSKTIIQGKPA 6
               +L+ SCG  K+  SS         +T     S +G +K    D KTI+   PA
Sbjct: 523  LTQYLSASCGKLKVESSSFKNTSRFMKSTKDPSSSSEGNKKHMREDVKTILDMDPA 578

