BLASTX nr result

ID: Atractylodes21_contig00024604 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00024604
         (1413 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261...   142   2e-31
ref|NP_200517.2| hydroxyproline-rich glycoprotein family protein...   109   2e-21
dbj|BAA97357.1| unnamed protein product [Arabidopsis thaliana]        109   2e-21
ref|XP_002864497.1| hypothetical protein ARALYDRAFT_495801 [Arab...    98   5e-18
ref|XP_003547490.1| PREDICTED: uncharacterized protein LOC100814...    96   2e-17

>ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261010 [Vitis vinifera]
          Length = 555

 Score =  142 bits (359), Expect = 2e-31
 Identities = 135/426 (31%), Positives = 173/426 (40%), Gaps = 10/426 (2%)
 Frame = -2

Query: 1370 MEQDPDSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTS 1191
            ME D DSP +FW  PP +T+  RR                               L+ TS
Sbjct: 1    MEGDGDSP-SFWPSPPPSTSIYRRRRPSPLLNPAVLIILLPILAMIVVFFAVPSFLNFTS 59

Query: 1190 HILKPGSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTSGDVADQGNDFVPGGVNSVQSS 1011
              L+P SV+ +WDSLN+ LVLFAI+CGV ARKND+ +         +  V  G +    S
Sbjct: 60   QFLRPNSVRKSWDSLNVLLVLFAILCGVFARKNDEKNDDVLENHGSSGSVVMGKSHESIS 119

Query: 1010 TQWVGFTDRK----AVTGG---LRRSSSSYPDLRQESLWDNGENRNRXXXXXDVDIYSSP 852
                 F+DRK     +  G   LRRSSSSYPDLRQESLW  G++R R     +V+ Y SP
Sbjct: 120  HSLFEFSDRKIYDPPIQSGSVRLRRSSSSYPDLRQESLWGAGDDRRRFFDDFEVNNYRSP 179

Query: 851  VSRYYNNLSRRTDRERENLYRSTPVSGDIRQQSRRSEADQGEFSDAKEIVVDTFEVSPNV 672
             S  Y    RR++ ER++                         S+ K I VDTF V  + 
Sbjct: 180  ASSDYVRRHRRSELERDD-------------------------SEVKVIPVDTFAVRSS- 213

Query: 671  PAFSEQPQSVKXXXXXXXXXXXXANSRTHSFRSVGRNVKVEVPRRIESDELDKVRSYXXX 492
            P+ S  P                   R  S+ +V R  K+      ++D+  K RS    
Sbjct: 214  PSPSPAPPRTPPPPPPPPPPIVQRKPR-RSYETVARKEKLS---NSDADQFKKSRS---- 265

Query: 491  XXXXXXXXXXPTEVRVQRSH---HKHKKLERKVSDATKEIATAISSLYNQXXXXXXXXXX 321
                      P   RV   H    K +K  R++  ATK+IAT   SLYNQ          
Sbjct: 266  PPAPPPPPPPPPPPRVPGGHLPEQKSRKSARRMGGATKDIATVFVSLYNQTRKKKKQRTK 325

Query: 320  NIXXXXXXXXXXXXXXXSLDVQEPHQSLAXXXXXXXXXXXPSMFQNLFKKGGKHKRIHSV 141
            NI                     P  +             PSM  NLF+KG K KRIHSV
Sbjct: 326  NIHENAVQ-------------SPPSATTPTPPPPPPPPPPPSMLHNLFRKGSKSKRIHSV 372

Query: 140  PATGSP 123
             A   P
Sbjct: 373  SAPPPP 378


>ref|NP_200517.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|332009460|gb|AED96843.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 575

 Score =  109 bits (273), Expect = 2e-21
 Identities = 129/459 (28%), Positives = 168/459 (36%), Gaps = 48/459 (10%)
 Frame = -2

Query: 1355 DSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTSHILKP 1176
            D PP  W Q   T   RRR                               LS TS IL+P
Sbjct: 3    DQPPLIWPQFDSTGYARRRSSIPAILVPAMIGVTSAAIFLVFVTFVVPTFLSVTSQILQP 62

Query: 1175 GSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTS-----------GDVADQGNDFVPGGV 1029
             SVK  WDS+N+ LV+FAI+CGVLAR+NDD  +S           G  A    +   G +
Sbjct: 63   ASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGGAVTNGEMTVGEI 122

Query: 1028 NSVQSST-----QWV------------------GFTDRKAVTGG--LRRSSSSYPDLRQE 924
            + + SS+     QW                    F+    VTG   LRRSSSSYPDLRQ 
Sbjct: 123  SKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPVTGNVPLRRSSSSYPDLRQG 182

Query: 923  SLWDNGENRNRXXXXXDVDIYSSPVSRYYNNLSRRTDRERENLYRSTPVSGDIRQQSRRS 744
               + G+ R R     ++D Y S  S  Y      +  E E                   
Sbjct: 183  VFRETGDRRFRFYDDFEIDKYRSQDSSSYQQFQNLSKTEIEE------------------ 224

Query: 743  EADQGEFSDAKEIVVDTFEVSPNVPAFSEQPQSVKXXXXXXXXXXXXANSRTHSFRSVGR 564
                 E S+ KEI +DTF V P+ P   + P +                 RTH  RSV  
Sbjct: 225  -----EESEPKEIQIDTFVVKPSSPP-QQPPATPPPPPPPPPVEVPQKPRRTH--RSVRN 276

Query: 563  NVKVEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRS-HHKHKKLERKVSDATK 387
                ++    +  E    R++             P +  +  +   K   L+R+ S+A K
Sbjct: 277  R---DLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQRRKSNAAK 333

Query: 386  EIATAISSLYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEP--HQSL-------- 237
            EI    +SLYNQ                             DV EP  +QSL        
Sbjct: 334  EIKMVFASLYNQGKKKKK------LQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSPPP 387

Query: 236  AXXXXXXXXXXXPSMFQNLFKKGGK-HKRIHSVPATGSP 123
                         S+F  LFKKG K +K+IHSVPA   P
Sbjct: 388  PPPPPPPPLRSSQSVFYGLFKKGVKSNKKIHSVPAPPPP 426


>dbj|BAA97357.1| unnamed protein product [Arabidopsis thaliana]
          Length = 607

 Score =  109 bits (273), Expect = 2e-21
 Identities = 129/459 (28%), Positives = 168/459 (36%), Gaps = 48/459 (10%)
 Frame = -2

Query: 1355 DSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTSHILKP 1176
            D PP  W Q   T   RRR                               LS TS IL+P
Sbjct: 3    DQPPLIWPQFDSTGYARRRSSIPAILVPAMIGVTSAAIFLVFVTFVVPTFLSVTSQILQP 62

Query: 1175 GSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTS-----------GDVADQGNDFVPGGV 1029
             SVK  WDS+N+ LV+FAI+CGVLAR+NDD  +S           G  A    +   G +
Sbjct: 63   ASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGGAVTNGEMTVGEI 122

Query: 1028 NSVQSST-----QWV------------------GFTDRKAVTGG--LRRSSSSYPDLRQE 924
            + + SS+     QW                    F+    VTG   LRRSSSSYPDLRQ 
Sbjct: 123  SKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPVTGNVPLRRSSSSYPDLRQG 182

Query: 923  SLWDNGENRNRXXXXXDVDIYSSPVSRYYNNLSRRTDRERENLYRSTPVSGDIRQQSRRS 744
               + G+ R R     ++D Y S  S  Y      +  E E                   
Sbjct: 183  VFRETGDRRFRFYDDFEIDKYRSQDSSSYQQFQNLSKTEIEE------------------ 224

Query: 743  EADQGEFSDAKEIVVDTFEVSPNVPAFSEQPQSVKXXXXXXXXXXXXANSRTHSFRSVGR 564
                 E S+ KEI +DTF V P+ P   + P +                 RTH  RSV  
Sbjct: 225  -----EESEPKEIQIDTFVVKPSSPP-QQPPATPPPPPPPPPVEVPQKPRRTH--RSVRN 276

Query: 563  NVKVEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRS-HHKHKKLERKVSDATK 387
                ++    +  E    R++             P +  +  +   K   L+R+ S+A K
Sbjct: 277  R---DLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQRRKSNAAK 333

Query: 386  EIATAISSLYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEP--HQSL-------- 237
            EI    +SLYNQ                             DV EP  +QSL        
Sbjct: 334  EIKMVFASLYNQGKKKKK------LQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSPPP 387

Query: 236  AXXXXXXXXXXXPSMFQNLFKKGGK-HKRIHSVPATGSP 123
                         S+F  LFKKG K +K+IHSVPA   P
Sbjct: 388  PPPPPPPPLRSSQSVFYGLFKKGVKSNKKIHSVPAPPPP 426


>ref|XP_002864497.1| hypothetical protein ARALYDRAFT_495801 [Arabidopsis lyrata subsp.
            lyrata] gi|297310332|gb|EFH40756.1| hypothetical protein
            ARALYDRAFT_495801 [Arabidopsis lyrata subsp. lyrata]
          Length = 566

 Score = 98.2 bits (243), Expect = 5e-18
 Identities = 124/460 (26%), Positives = 172/460 (37%), Gaps = 49/460 (10%)
 Frame = -2

Query: 1355 DSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTSHILKP 1176
            D PP  W Q   T    RR                               LS TS IL+P
Sbjct: 3    DQPPLIWPQFESTGYTHRRSPIPAMLVPAMIGVISAAIFLLFVNFVIPPFLSVTSQILQP 62

Query: 1175 GSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTSGDVADQGNDFVPGGVNSVQS------ 1014
             SVK  WDS+N+ LV+FAI+CGVLAR+NDD  +S  +     + V G V S +       
Sbjct: 63   SSVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGAVTSGEMTLGEIS 122

Query: 1013 ---------STQWV------------------GFTDRKAVTGG--LRRSSSSYPDLRQES 921
                     S QW                    F+    VTG   LRRS SSYPDLRQ  
Sbjct: 123  KISSSSSAVSEQWFDDVYDAERLKIYESVSSRSFSHGLPVTGTVPLRRSCSSYPDLRQGV 182

Query: 920  LWDNGENRNRXXXXXDVDIYSSPVSRYYNN--LSRRTDRERENLYRSTPVSGDIRQQSRR 747
              + G+ R                 R+Y++  +  R+  E +N                R
Sbjct: 183  FRETGDRR----------------FRFYDDFEIHNRSYEEFQN----------------R 210

Query: 746  SEADQGEFSDAKEIVVDTFEVSPNVPAFSEQPQSVKXXXXXXXXXXXXANSRTHSFRSVG 567
            S+ +  E S+ KEI +DTF V P+ P   +QP +               + +    R   
Sbjct: 211  SKIEIEEESEPKEIQIDTFVVKPSSP--PQQPPAPPTPPPPPPPPPVEVSQKP---RRTH 265

Query: 566  RNVK-VEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRSHHKHKKLERKVSDAT 390
            R+VK  ++   ++ +++   R++             P  +       K   L+R+ S+A 
Sbjct: 266  RSVKNRDIQENVKRNDIKFKRAF--QPPNPPPPPPPPPPLITATPPRKQGTLQRRKSNAA 323

Query: 389  KEIATAISSLYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEP--HQSL------- 237
            KEI    +SLYNQ                            +DV EP  +QSL       
Sbjct: 324  KEIKMVFASLYNQGKRKKK------IQKSKRKERIESSPVVVDVTEPPQYQSLIPPPSPP 377

Query: 236  -AXXXXXXXXXXXPSMFQNLFKKGGK-HKRIHSVPATGSP 123
                          S+F  LFKKG K +K+IHSVPA   P
Sbjct: 378  PPPPPPPPPPRTSQSVFYGLFKKGVKSNKKIHSVPAPPPP 417


>ref|XP_003547490.1| PREDICTED: uncharacterized protein LOC100814577 [Glycine max]
          Length = 563

 Score = 95.9 bits (237), Expect = 2e-17
 Identities = 107/393 (27%), Positives = 159/393 (40%), Gaps = 33/393 (8%)
 Frame = -2

Query: 1202 SHTSHILKPGS--VKSTWDSLNIFLVLFAIICGVLARK-NDDVSTSGDVADQGNDFVPGG 1032
            S  + +L+  S  VK++WDSLNI LV+FAI+CGV AR+ NDD  T  +   Q +D     
Sbjct: 40   SAVTRLLRSASSDVKTSWDSLNILLVVFAILCGVFARRNNDDEQTPNNHVHQHDDDAVSD 99

Query: 1031 VNSV------QSSTQWVGFTDRKAV--------------TGG----LRRSSSSYPDLRQE 924
             N+       +  +QW GF + + V              TGG    +RR+SSSYPDLRQ 
Sbjct: 100  RNAAFRRLHSEGQSQWFGFAEERKVYGNDTPLNQLQSLDTGGNRLRMRRNSSSYPDLRQ- 158

Query: 923  SLWDNGENRNRXXXXXDVDI---YSSPVSRYYNNLSRRTDRERENLYRSTPVSGDIRQQS 753
              W+ G++R +     D +I   + SP   ++  +  R         +  P S   +Q  
Sbjct: 159  --WETGDDRYKFRFYDDFEIDKPFRSPAREHFPAVEHR---------KRWPESPQPQQHQ 207

Query: 752  RRSEADQGEFSDAKEIVVDTFEVSPNVPAF-SEQPQSVKXXXXXXXXXXXXANSRTHSFR 576
             + +  + +    K+I VDTFE+ P+ P   S  P                   RTH  R
Sbjct: 208  HQYQQQEDQ---VKDIPVDTFEIRPSPPPVKSTSPPPPPPPPPPPPESARHNTRRTH--R 262

Query: 575  SVGRNVKVEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRSHHKHKKLERKVSD 396
             V R  + E+   ++  E   +RS               T           +K +R+ S 
Sbjct: 263  KVERERESEITVELDDHEFTTIRSPPPAPPTPPQPPSVKT--------RSERKSDRRKST 314

Query: 395  ATKEIATAISS-LYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEPHQSLAXXXXX 219
              +E+A   +S L NQ          N                  +  E   +       
Sbjct: 315  VKREMAMIWASVLSNQRKKKKKQKTKNNENQDPQYDD--------NADELTNNTTVPPPP 366

Query: 218  XXXXXXPSMFQNLFKKG-GKHKRIHSVPATGSP 123
                  PS+F +LF+KG GK K+IHSV A   P
Sbjct: 367  PPPPPLPSVFHSLFRKGLGKSKKIHSVSAPPPP 399


Top