BLASTX nr result
ID: Atractylodes21_contig00024604
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atractylodes21_contig00024604 (1413 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261... 142 2e-31 ref|NP_200517.2| hydroxyproline-rich glycoprotein family protein... 109 2e-21 dbj|BAA97357.1| unnamed protein product [Arabidopsis thaliana] 109 2e-21 ref|XP_002864497.1| hypothetical protein ARALYDRAFT_495801 [Arab... 98 5e-18 ref|XP_003547490.1| PREDICTED: uncharacterized protein LOC100814... 96 2e-17 >ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261010 [Vitis vinifera] Length = 555 Score = 142 bits (359), Expect = 2e-31 Identities = 135/426 (31%), Positives = 173/426 (40%), Gaps = 10/426 (2%) Frame = -2 Query: 1370 MEQDPDSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTS 1191 ME D DSP +FW PP +T+ RR L+ TS Sbjct: 1 MEGDGDSP-SFWPSPPPSTSIYRRRRPSPLLNPAVLIILLPILAMIVVFFAVPSFLNFTS 59 Query: 1190 HILKPGSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTSGDVADQGNDFVPGGVNSVQSS 1011 L+P SV+ +WDSLN+ LVLFAI+CGV ARKND+ + + V G + S Sbjct: 60 QFLRPNSVRKSWDSLNVLLVLFAILCGVFARKNDEKNDDVLENHGSSGSVVMGKSHESIS 119 Query: 1010 TQWVGFTDRK----AVTGG---LRRSSSSYPDLRQESLWDNGENRNRXXXXXDVDIYSSP 852 F+DRK + G LRRSSSSYPDLRQESLW G++R R +V+ Y SP Sbjct: 120 HSLFEFSDRKIYDPPIQSGSVRLRRSSSSYPDLRQESLWGAGDDRRRFFDDFEVNNYRSP 179 Query: 851 VSRYYNNLSRRTDRERENLYRSTPVSGDIRQQSRRSEADQGEFSDAKEIVVDTFEVSPNV 672 S Y RR++ ER++ S+ K I VDTF V + Sbjct: 180 ASSDYVRRHRRSELERDD-------------------------SEVKVIPVDTFAVRSS- 213 Query: 671 PAFSEQPQSVKXXXXXXXXXXXXANSRTHSFRSVGRNVKVEVPRRIESDELDKVRSYXXX 492 P+ S P R S+ +V R K+ ++D+ K RS Sbjct: 214 PSPSPAPPRTPPPPPPPPPPIVQRKPR-RSYETVARKEKLS---NSDADQFKKSRS---- 265 Query: 491 XXXXXXXXXXPTEVRVQRSH---HKHKKLERKVSDATKEIATAISSLYNQXXXXXXXXXX 321 P RV H K +K R++ ATK+IAT SLYNQ Sbjct: 266 PPAPPPPPPPPPPPRVPGGHLPEQKSRKSARRMGGATKDIATVFVSLYNQTRKKKKQRTK 325 Query: 320 NIXXXXXXXXXXXXXXXSLDVQEPHQSLAXXXXXXXXXXXPSMFQNLFKKGGKHKRIHSV 141 NI P + PSM NLF+KG K KRIHSV Sbjct: 326 NIHENAVQ-------------SPPSATTPTPPPPPPPPPPPSMLHNLFRKGSKSKRIHSV 372 Query: 140 PATGSP 123 A P Sbjct: 373 SAPPPP 378 >ref|NP_200517.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|332009460|gb|AED96843.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 575 Score = 109 bits (273), Expect = 2e-21 Identities = 129/459 (28%), Positives = 168/459 (36%), Gaps = 48/459 (10%) Frame = -2 Query: 1355 DSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTSHILKP 1176 D PP W Q T RRR LS TS IL+P Sbjct: 3 DQPPLIWPQFDSTGYARRRSSIPAILVPAMIGVTSAAIFLVFVTFVVPTFLSVTSQILQP 62 Query: 1175 GSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTS-----------GDVADQGNDFVPGGV 1029 SVK WDS+N+ LV+FAI+CGVLAR+NDD +S G A + G + Sbjct: 63 ASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGGAVTNGEMTVGEI 122 Query: 1028 NSVQSST-----QWV------------------GFTDRKAVTGG--LRRSSSSYPDLRQE 924 + + SS+ QW F+ VTG LRRSSSSYPDLRQ Sbjct: 123 SKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPVTGNVPLRRSSSSYPDLRQG 182 Query: 923 SLWDNGENRNRXXXXXDVDIYSSPVSRYYNNLSRRTDRERENLYRSTPVSGDIRQQSRRS 744 + G+ R R ++D Y S S Y + E E Sbjct: 183 VFRETGDRRFRFYDDFEIDKYRSQDSSSYQQFQNLSKTEIEE------------------ 224 Query: 743 EADQGEFSDAKEIVVDTFEVSPNVPAFSEQPQSVKXXXXXXXXXXXXANSRTHSFRSVGR 564 E S+ KEI +DTF V P+ P + P + RTH RSV Sbjct: 225 -----EESEPKEIQIDTFVVKPSSPP-QQPPATPPPPPPPPPVEVPQKPRRTH--RSVRN 276 Query: 563 NVKVEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRS-HHKHKKLERKVSDATK 387 ++ + E R++ P + + + K L+R+ S+A K Sbjct: 277 R---DLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQRRKSNAAK 333 Query: 386 EIATAISSLYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEP--HQSL-------- 237 EI +SLYNQ DV EP +QSL Sbjct: 334 EIKMVFASLYNQGKKKKK------LQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSPPP 387 Query: 236 AXXXXXXXXXXXPSMFQNLFKKGGK-HKRIHSVPATGSP 123 S+F LFKKG K +K+IHSVPA P Sbjct: 388 PPPPPPPPLRSSQSVFYGLFKKGVKSNKKIHSVPAPPPP 426 >dbj|BAA97357.1| unnamed protein product [Arabidopsis thaliana] Length = 607 Score = 109 bits (273), Expect = 2e-21 Identities = 129/459 (28%), Positives = 168/459 (36%), Gaps = 48/459 (10%) Frame = -2 Query: 1355 DSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTSHILKP 1176 D PP W Q T RRR LS TS IL+P Sbjct: 3 DQPPLIWPQFDSTGYARRRSSIPAILVPAMIGVTSAAIFLVFVTFVVPTFLSVTSQILQP 62 Query: 1175 GSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTS-----------GDVADQGNDFVPGGV 1029 SVK WDS+N+ LV+FAI+CGVLAR+NDD +S G A + G + Sbjct: 63 ASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGGAVTNGEMTVGEI 122 Query: 1028 NSVQSST-----QWV------------------GFTDRKAVTGG--LRRSSSSYPDLRQE 924 + + SS+ QW F+ VTG LRRSSSSYPDLRQ Sbjct: 123 SKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPVTGNVPLRRSSSSYPDLRQG 182 Query: 923 SLWDNGENRNRXXXXXDVDIYSSPVSRYYNNLSRRTDRERENLYRSTPVSGDIRQQSRRS 744 + G+ R R ++D Y S S Y + E E Sbjct: 183 VFRETGDRRFRFYDDFEIDKYRSQDSSSYQQFQNLSKTEIEE------------------ 224 Query: 743 EADQGEFSDAKEIVVDTFEVSPNVPAFSEQPQSVKXXXXXXXXXXXXANSRTHSFRSVGR 564 E S+ KEI +DTF V P+ P + P + RTH RSV Sbjct: 225 -----EESEPKEIQIDTFVVKPSSPP-QQPPATPPPPPPPPPVEVPQKPRRTH--RSVRN 276 Query: 563 NVKVEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRS-HHKHKKLERKVSDATK 387 ++ + E R++ P + + + K L+R+ S+A K Sbjct: 277 R---DLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQRRKSNAAK 333 Query: 386 EIATAISSLYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEP--HQSL-------- 237 EI +SLYNQ DV EP +QSL Sbjct: 334 EIKMVFASLYNQGKKKKK------LQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSPPP 387 Query: 236 AXXXXXXXXXXXPSMFQNLFKKGGK-HKRIHSVPATGSP 123 S+F LFKKG K +K+IHSVPA P Sbjct: 388 PPPPPPPPLRSSQSVFYGLFKKGVKSNKKIHSVPAPPPP 426 >ref|XP_002864497.1| hypothetical protein ARALYDRAFT_495801 [Arabidopsis lyrata subsp. lyrata] gi|297310332|gb|EFH40756.1| hypothetical protein ARALYDRAFT_495801 [Arabidopsis lyrata subsp. lyrata] Length = 566 Score = 98.2 bits (243), Expect = 5e-18 Identities = 124/460 (26%), Positives = 172/460 (37%), Gaps = 49/460 (10%) Frame = -2 Query: 1355 DSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTSHILKP 1176 D PP W Q T RR LS TS IL+P Sbjct: 3 DQPPLIWPQFESTGYTHRRSPIPAMLVPAMIGVISAAIFLLFVNFVIPPFLSVTSQILQP 62 Query: 1175 GSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTSGDVADQGNDFVPGGVNSVQS------ 1014 SVK WDS+N+ LV+FAI+CGVLAR+NDD +S + + V G V S + Sbjct: 63 SSVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGAVTSGEMTLGEIS 122 Query: 1013 ---------STQWV------------------GFTDRKAVTGG--LRRSSSSYPDLRQES 921 S QW F+ VTG LRRS SSYPDLRQ Sbjct: 123 KISSSSSAVSEQWFDDVYDAERLKIYESVSSRSFSHGLPVTGTVPLRRSCSSYPDLRQGV 182 Query: 920 LWDNGENRNRXXXXXDVDIYSSPVSRYYNN--LSRRTDRERENLYRSTPVSGDIRQQSRR 747 + G+ R R+Y++ + R+ E +N R Sbjct: 183 FRETGDRR----------------FRFYDDFEIHNRSYEEFQN----------------R 210 Query: 746 SEADQGEFSDAKEIVVDTFEVSPNVPAFSEQPQSVKXXXXXXXXXXXXANSRTHSFRSVG 567 S+ + E S+ KEI +DTF V P+ P +QP + + + R Sbjct: 211 SKIEIEEESEPKEIQIDTFVVKPSSP--PQQPPAPPTPPPPPPPPPVEVSQKP---RRTH 265 Query: 566 RNVK-VEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRSHHKHKKLERKVSDAT 390 R+VK ++ ++ +++ R++ P + K L+R+ S+A Sbjct: 266 RSVKNRDIQENVKRNDIKFKRAF--QPPNPPPPPPPPPPLITATPPRKQGTLQRRKSNAA 323 Query: 389 KEIATAISSLYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEP--HQSL------- 237 KEI +SLYNQ +DV EP +QSL Sbjct: 324 KEIKMVFASLYNQGKRKKK------IQKSKRKERIESSPVVVDVTEPPQYQSLIPPPSPP 377 Query: 236 -AXXXXXXXXXXXPSMFQNLFKKGGK-HKRIHSVPATGSP 123 S+F LFKKG K +K+IHSVPA P Sbjct: 378 PPPPPPPPPPRTSQSVFYGLFKKGVKSNKKIHSVPAPPPP 417 >ref|XP_003547490.1| PREDICTED: uncharacterized protein LOC100814577 [Glycine max] Length = 563 Score = 95.9 bits (237), Expect = 2e-17 Identities = 107/393 (27%), Positives = 159/393 (40%), Gaps = 33/393 (8%) Frame = -2 Query: 1202 SHTSHILKPGS--VKSTWDSLNIFLVLFAIICGVLARK-NDDVSTSGDVADQGNDFVPGG 1032 S + +L+ S VK++WDSLNI LV+FAI+CGV AR+ NDD T + Q +D Sbjct: 40 SAVTRLLRSASSDVKTSWDSLNILLVVFAILCGVFARRNNDDEQTPNNHVHQHDDDAVSD 99 Query: 1031 VNSV------QSSTQWVGFTDRKAV--------------TGG----LRRSSSSYPDLRQE 924 N+ + +QW GF + + V TGG +RR+SSSYPDLRQ Sbjct: 100 RNAAFRRLHSEGQSQWFGFAEERKVYGNDTPLNQLQSLDTGGNRLRMRRNSSSYPDLRQ- 158 Query: 923 SLWDNGENRNRXXXXXDVDI---YSSPVSRYYNNLSRRTDRERENLYRSTPVSGDIRQQS 753 W+ G++R + D +I + SP ++ + R + P S +Q Sbjct: 159 --WETGDDRYKFRFYDDFEIDKPFRSPAREHFPAVEHR---------KRWPESPQPQQHQ 207 Query: 752 RRSEADQGEFSDAKEIVVDTFEVSPNVPAF-SEQPQSVKXXXXXXXXXXXXANSRTHSFR 576 + + + + K+I VDTFE+ P+ P S P RTH R Sbjct: 208 HQYQQQEDQ---VKDIPVDTFEIRPSPPPVKSTSPPPPPPPPPPPPESARHNTRRTH--R 262 Query: 575 SVGRNVKVEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRSHHKHKKLERKVSD 396 V R + E+ ++ E +RS T +K +R+ S Sbjct: 263 KVERERESEITVELDDHEFTTIRSPPPAPPTPPQPPSVKT--------RSERKSDRRKST 314 Query: 395 ATKEIATAISS-LYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEPHQSLAXXXXX 219 +E+A +S L NQ N + E + Sbjct: 315 VKREMAMIWASVLSNQRKKKKKQKTKNNENQDPQYDD--------NADELTNNTTVPPPP 366 Query: 218 XXXXXXPSMFQNLFKKG-GKHKRIHSVPATGSP 123 PS+F +LF+KG GK K+IHSV A P Sbjct: 367 PPPPPLPSVFHSLFRKGLGKSKKIHSVSAPPPP 399