BLASTX nr result
ID: Atractylodes22_contig00008933
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00008933
         (2335 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261...   153   2e-34
ref|NP_200517.2| hydroxyproline-rich glycoprotein family protein...   122   3e-25
dbj|BAA97357.1| unnamed protein product [Arabidopsis thaliana]        122   3e-25
ref|XP_002864497.1| hypothetical protein ARALYDRAFT_495801 [Arab...   110   1e-21
ref|XP_002514089.1| conserved hypothetical protein [Ricinus comm...   104   9e-20

>ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261010 [Vitis vinifera]
          Length = 555

 Score = 153 bits (387), Expect = 2e-34
 Identities = 137/428 (32%), Positives = 176/428 (41%), Gaps = 10/428 (2%)
 Frame = -3

Query: 2279 MEQDPDSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTS 2100
            ME D DSP +FW PP +T+ RR L+ TS
Sbjct: 1    MEGDGDSP-SFWPSPPPSTSIYRRRRPSPLLNPAVLIILLPILAMIVVFFAVPSFLNFTS 59

Query: 2099 HILKPGSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTSGDVADQGNDFVPGGVNTVQSS 1920
            L+P SV+ +WDSLN+ LVLFAI+CGV ARKND+ + + V G + S
Sbjct: 60   QFLRPNSVRKSWDSLNVLLVLFAILCGVFARKNDEKNDDVLENHGSSGSVVMGKSHESIS 119

Query: 1919 TQWVGFTDRKA----VTGG---LRRSSSSYPDLRQESLWDNGENRSRFFDDFDVDIYSSP 1761
            F+DRK + G LRRSSSSYPDLRQESLW G++R RFFDDF+V+ Y SP
Sbjct: 120  HSLFEFSDRKIYDPPIQSGSVRLRRSSSSYPDLRQESLWGAGDDRRRFFDDFEVNNYRSP 179

Query: 1760 VSRYYNNLSRRTDRERENLYRSTTVSGDIRQQSRRSEADQGEFSDAKEIVVDTFEVSPNV 1581
            S Y RR++ ER++ S+ K I VDTF V +
Sbjct: 180  ASSDYVRRHRRSELERDD-------------------------SEVKVIPVDTFAVRSSP 214

Query: 1580 PAFSEQPQSVKXXXXXXXXXXXXXXANSRTHSFRSVGRNVKVEVPRRIESDELDKVRSYX 1401
            ++D+ K RS
Sbjct: 215  SPSPAPPRT----PPPPPPPPPPIVQRKPRRSYETVARKEKLS---NSDADQFKKSRS-- 265

Query: 1400 XXXXXXXXXXXXPTEVRVQRSH---HKHKKLERKVSDATKEIATAISSLYNQXXXXXXXX 1230
            P RV H K +K R++ ATK+IAT SLYNQ
Sbjct: 266  --PPAPPPPPPPPPPPRVPGGHLPEQKSRKSARRMGGATKDIATVFVSLYNQTRKKKKQR 323

Query: 1229 XXNIXXXXXXXXXXXXXXXSLDVQEPHQSLAXXXXXXXXXXXPSMFQNLFKKGGKHKRIH 1050
            NI P + PSM NLF+KG K KRIH
Sbjct: 324  TKNIHENAVQ-------------SPPSATTPTPPPPPPPPPPPSMLHNLFRKGSKSKRIH 370

Query: 1049 SVPATGSP 1026
            SV A P
Sbjct: 371  SVSAPPPP 378

 Score = 117 bits (292), Expect = 2e-23
 Identities = 73/167 (43%), Positives = 92/167 (55%), Gaps = 2/167 (1%)
 Frame = -3

Query: 755 RKTQAQXXXXXXXXXXXPDRRRSTSKGKPPLPTKASSYYDRDDFLQSGSQSPLIXXXXXX 576
           RKT PD R + GKPPLP + SS+Y+RDD + SG QSPLI
Sbjct: 393 RKTHIPPAPPTPPPPPPPDTSRRRAAGKPPLPARKSSFYNRDDNVNSGGQSPLIPMPPPP 452

Query: 575 XXXXXXXXXFELRGDFVRIRSTNSSVCSSPDREDGDLSSTVMXXXXXXXXXXXXXXXXXX 396
           + +RGDFVRIRST+SS CSSP+ +D DLSS
Sbjct: 453 PPFRMPELKYVVRGDFVRIRSTHSSRCSSPELDDVDLSSN----KSAMDGGDAIGATFCP 508

Query: 395 XPDVNAKADSFISRLKDEWRMEKINSIKGK--MG*SPLRSPNQTRTS 261
           PDVN KAD+FI+RL+ EWR+EKINS++ + +G +P SPN T TS
Sbjct: 509 SPDVNVKADTFIARLRGEWRLEKINSLRERKNVGLTPDPSPNPTHTS 555

>ref|NP_200517.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana]
 gi|332009460|gb|AED96843.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana]
          Length = 575

 Score = 122 bits (307), Expect = 3e-25
 Identities = 138/461 (29%), Positives = 175/461 (37%), Gaps = 48/461 (10%)
 Frame = -3

Query: 2264 DSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTSHILKP 2085
            D PP W Q T RRR LS TS IL+P
Sbjct: 3    DQPPLIWPQFDSTGYARRRSSIPAILVPAMIGVTSAAIFLVFVTFVVPTFLSVTSQILQP 62

Query: 2084 GSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTS-----GDVADQGNDFVPGGVNTV--- 1929
            SVK WDS+N+ LV+FAI+CGVLAR+NDD +S G+ + G V G TV
Sbjct: 63   ASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGGAVTNGEMTVGEI 122

Query: 1928 -----QSST---QWV------------------GFTDRKAVTGG--LRRSSSSYPDLRQE 1833
            SST QW F+ VTG LRRSSSSYPDLRQ
Sbjct: 123  SKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPVTGNVPLRRSSSSYPDLRQG 182

Query: 1832 SLWDNGENRSRFFDDFDVDIYSSPVSRYYNNLSRRTDRERENLYRSTTVSGDIRQQSRRS 1653
            + G+ R RF+DDF++D Y S S Y + E E
Sbjct: 183  VFRETGDRRFRFYDDFEIDKYRSQDSSSYQQFQNLSKTEIEE------------------ 224

Query: 1652 EADQGEFSDAKEIVVDTFEVSPNVPAFSEQPQSVKXXXXXXXXXXXXXXANSRTHSFRSV 1473
            E S+ KEI +DTF V P+ P +QP + RTH RSV
Sbjct: 225  -----EESEPKEIQIDTFVVKPSSP--PQQPPATPPPPPPPPPVEVPQKPR-RTH--RSV 274

Query: 1472 GRNVKVEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRS-HHKHKKLERKVSDA 1296
            ++ + E R++ P + + + K L+R+ S+A
Sbjct: 275  RNR---DLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQRRKSNA 331

Query: 1295 TKEIATAISSLYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEP--HQSL------ 1140
            KEI +SLYNQ DV EP +QSL
Sbjct: 332  AKEIKMVFASLYNQGKKKKK------LQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSP 385

Query: 1139 --AXXXXXXXXXXXPSMFQNLFKKGGK-HKRIHSVPATGSP 1026
            S+F LFKKG K +K+IHSVPA P
Sbjct: 386  PPPPPPPPPPLRSSQSVFYGLFKKGVKSNKKIHSVPAPPPP 426

 Score = 67.8 bits (164), Expect = 1e-08
 Identities = 47/138 (34%), Positives = 65/138 (47%), Gaps = 9/138 (6%)
 Frame = -3

Query: 692 RSTSKGKPPLPTKASSYYDRDDFLQSGSQSPLIXXXXXXXXXXXXXXXF---ELRGDFVR 522
           R G+PP PTK ++ + ++ G SPLI + GDF +
Sbjct: 441 RRVKSGRPPRPTKPKNFNEENN----GQGSPLIQITPPPPPPPPFRVPPLKYVVSGDFAK 496

Query: 521 IRSTNSSVCSSPDREDGD------LSSTVMXXXXXXXXXXXXXXXXXXXPDVNAKADSFI 360
           IRS SS CSSP+RE D L+ + PDV+ KAD+FI
Sbjct: 497 IRSNQSSRCSSPEREVFDIGWGLELTQSDGGVETKAAVSGGGMPGFCPSPDVDTKADNFI 556

Query: 359 SRLKDEWRMEKINSIKGK 306
           +RL+DEWR++KINS+ K
Sbjct: 557 ARLRDEWRLDKINSVNRK 574

>dbj|BAA97357.1| unnamed protein product [Arabidopsis thaliana]
          Length = 607

 Score = 122 bits (307), Expect = 3e-25
 Identities = 138/461 (29%), Positives = 175/461 (37%), Gaps = 48/461 (10%)
 Frame = -3

Query: 2264 DSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTSHILKP 2085
            D PP W Q T RRR LS TS IL+P
Sbjct: 3    DQPPLIWPQFDSTGYARRRSSIPAILVPAMIGVTSAAIFLVFVTFVVPTFLSVTSQILQP 62

Query: 2084 GSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTS-----GDVADQGNDFVPGGVNTV--- 1929
            SVK WDS+N+ LV+FAI+CGVLAR+NDD +S G+ + G V G TV
Sbjct: 63   ASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGGAVTNGEMTVGEI 122

Query: 1928 -----QSST---QWV------------------GFTDRKAVTGG--LRRSSSSYPDLRQE 1833
            SST QW F+ VTG LRRSSSSYPDLRQ
Sbjct: 123  SKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPVTGNVPLRRSSSSYPDLRQG 182

Query: 1832 SLWDNGENRSRFFDDFDVDIYSSPVSRYYNNLSRRTDRERENLYRSTTVSGDIRQQSRRS 1653
            + G+ R RF+DDF++D Y S S Y + E E
Sbjct: 183  VFRETGDRRFRFYDDFEIDKYRSQDSSSYQQFQNLSKTEIEE------------------ 224

Query: 1652 EADQGEFSDAKEIVVDTFEVSPNVPAFSEQPQSVKXXXXXXXXXXXXXXANSRTHSFRSV 1473
            E S+ KEI +DTF V P+ P +QP + RTH RSV
Sbjct: 225  -----EESEPKEIQIDTFVVKPSSP--PQQPPATPPPPPPPPPVEVPQKPR-RTH--RSV 274

Query: 1472 GRNVKVEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRS-HHKHKKLERKVSDA 1296
            ++ + E R++ P + + + K L+R+ S+A
Sbjct: 275  RNR---DLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQRRKSNA 331

Query: 1295 TKEIATAISSLYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEP--HQSL------ 1140
            KEI +SLYNQ DV EP +QSL
Sbjct: 332  AKEIKMVFASLYNQGKKKKK------LQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSP 385

Query: 1139 --AXXXXXXXXXXXPSMFQNLFKKGGK-HKRIHSVPATGSP 1026
            S+F LFKKG K +K+IHSVPA P
Sbjct: 386  PPPPPPPPPPLRSSQSVFYGLFKKGVKSNKKIHSVPAPPPP 426

 Score = 67.8 bits (164), Expect = 1e-08
 Identities = 47/138 (34%), Positives = 65/138 (47%), Gaps = 9/138 (6%)
 Frame = -3

Query: 692 RSTSKGKPPLPTKASSYYDRDDFLQSGSQSPLIXXXXXXXXXXXXXXXF---ELRGDFVR 522
           R G+PP PTK ++ + ++ G SPLI + GDF +
Sbjct: 441 RRVKSGRPPRPTKPKNFNEENN----GQGSPLIQITPPPPPPPPFRVPPLKYVVSGDFAK 496

Query: 521 IRSTNSSVCSSPDREDGD------LSSTVMXXXXXXXXXXXXXXXXXXXPDVNAKADSFI 360
           IRS SS CSSP+RE D L+ + PDV+ KAD+FI
Sbjct: 497 IRSNQSSRCSSPEREVFDIGWGLELTQSDGGVETKAAVSGGGMPGFCPSPDVDTKADNFI 556

Query: 359 SRLKDEWRMEKINSIKGK 306
           +RL+DEWR++KINS+ K
Sbjct: 557 ARLRDEWRLDKINSVNRK 574

>ref|XP_002864497.1| hypothetical protein ARALYDRAFT_495801 [Arabidopsis lyrata subsp. lyrata]
 gi|297310332|gb|EFH40756.1| hypothetical protein ARALYDRAFT_495801 [Arabidopsis lyrata subsp. lyrata]
          Length = 566

 Score = 110 bits (276), Expect = 1e-21
 Identities = 128/459 (27%), Positives = 173/459 (37%), Gaps = 46/459 (10%)
 Frame = -3

Query: 2264 DSPPTFWLQPPHTTNRRRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSHTSHILKP 2085
            D PP W Q T RR LS TS IL+P
Sbjct: 3    DQPPLIWPQFESTGYTHRRSPIPAMLVPAMIGVISAAIFLLFVNFVIPPFLSVTSQILQP 62

Query: 2084 GSVKSTWDSLNIFLVLFAIICGVLARKNDDVSTS-----GDVADQG-----NDFVPGGVN 1935
            SVK WDS+N+ LV+FAI+CGVLAR+NDD +S G+ + G + G ++
Sbjct: 63   SSVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGAVTSGEMTLGEIS 122

Query: 1934 TVQSST-----QWV------------------GFTDRKAVTG--GLRRSSSSYPDLRQES 1830
            + SS+ QW F+ VTG LRRS SSYPDLRQ
Sbjct: 123  KISSSSSAVSEQWFDDVYDAERLKIYESVSSRSFSHGLPVTGTVPLRRSCSSYPDLRQGV 182

Query: 1829 LWDNGENRSRFFDDFDVDIYSSPVSRYYNNLSRRTDRERENLYRSTTVSGDIRQQSRRSE 1650
            + G+ R RF+DDF++ +R Y R+ E E
Sbjct: 183  FRETGDRRFRFYDDFEIH------NRSYEEFQNRSKIEIE-------------------- 216

Query: 1649 ADQGEFSDAKEIVVDTFEVSPNVPAFSEQPQSVKXXXXXXXXXXXXXXANSRTHSFRSVG 1470
            E S+ KEI +DTF V P+ P + P RTH RSV
Sbjct: 217  ----EESEPKEIQIDTFVVKPSSPP-QQPPAPPTPPPPPPPPPVEVSQKPRRTH--RSVK 269

Query: 1469 RNVKVEVPRRIESDELDKVRSYXXXXXXXXXXXXXPTEVRVQRSHHKHKKLERKVSDATK 1290
            ++ ++ +++ R++ P + K L+R+ S+A K
Sbjct: 270  NR---DIQENVKRNDIKFKRAF--QPPNPPPPPPPPPPLITATPPRKQGTLQRRKSNAAK 324

Query: 1289 EIATAISSLYNQXXXXXXXXXXNIXXXXXXXXXXXXXXXSLDVQEP--HQSL-------- 1140
            EI +SLYNQ +DV EP +QSL
Sbjct: 325  EIKMVFASLYNQGKRKKK------IQKSKRKERIESSPVVVDVTEPPQYQSLIPPPSPPP 378

Query: 1139 AXXXXXXXXXXXPSMFQNLFKKGGK-HKRIHSVPATGSP 1026
            S+F LFKKG K +K+IHSVPA P
Sbjct: 379  PPPPPPPPPRTSQSVFYGLFKKGVKSNKKIHSVPAPPPP 417

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 47/138 (34%), Positives = 67/138 (48%), Gaps = 9/138 (6%)
 Frame = -3

Query: 692 RSTSKGKPPLPTKASSYYDRDDFLQSGSQSPLIXXXXXXXXXXXXXXXF---ELRGDFVR 522
           R + G+PP PTK +++ + ++ G SPLI + GDF +
Sbjct: 432 RRVNSGRPPRPTKPTNFNEENN----GQGSPLIQITPPPPPPPPFRVPPLKFVVSGDFAK 487

Query: 521 IRSTNSSVCSSPDREDGD------LSSTVMXXXXXXXXXXXXXXXXXXXPDVNAKADSFI 360
           IRS SS CSSP+RE D L+ + PDV+ KAD+FI
Sbjct: 488 IRSNQSSRCSSPEREVIDIGWGLELTQSDDGVKTKAAVGGGGMPGFCPSPDVDTKADNFI 547

Query: 359 SRLKDEWRMEKINSIKGK 306
           +RL+DEWR++KINS+ K
Sbjct: 548 ARLRDEWRLDKINSVNRK 565

>ref|XP_002514089.1| conserved hypothetical protein [Ricinus communis]
 gi|223546545|gb|EEF48043.1| conserved hypothetical protein [Ricinus communis]
          Length = 831

 Score = 104 bits (260), Expect = 9e-20
 Identities = 71/189 (37%), Positives = 101/189 (53%), Gaps = 11/189 (5%)
 Frame = -3

Query: 2093 LKPGSVKSTWDSLNIFLVLFAIICGVLARKNDDVST-SGDVADQGNDFVPGGVN------ 1935
            L+P +VK +WDSLN+FLVLFAI+CG+ AR+NDD S SGD ++ + N
Sbjct: 45   LRPSTVKKSWDSLNVFLVLFAILCGIFARRNDDDSAPSGDHSNSSSVLHNNSNNNKERDH 104

Query: 1934 TVQSSTQWVGFTDRKAVT--GGLRRSSSSYPDLRQESLWDNGE--NRSRFFDDFDVDIYS 1767
            V + + W+ + T L+RSSSSYPDLRQESLW +G+ +R RFFDDF++ +
Sbjct: 105  AVSNHSHWLDDNQFASATPMRRLKRSSSSYPDLRQESLWQSGDDIDRFRFFDDFELSKFR 164

Query: 1766 SPVSRYYNNLSRRTDRERENLYRSTTVSGDIRQQSRRSEADQGEFSDAKEIVVDTFEVSP 1587
            S S Y +N N+Y + +RS + + KEI VDT+ +
Sbjct: 165  S--SEYSHN----------NIY---------HHRRQRSYVFDQDSTTVKEIPVDTYVLRS 203

Query: 1586 NVPAFSEQP 1560
            + P P
Sbjct: 204  SPPKSPAPP 212

 Score = 85.1 bits (209), Expect = 8e-14
 Identities = 54/139 (38%), Positives = 77/139 (55%), Gaps = 3/139 (2%)
 Frame = -3

Query: 698 RRRSTSKGKPPLPTKASSYYDRDDFLQSGSQSPLIXXXXXXXXXXXXXXXFE--LRGDFV 525
           R + + G+PPLPT+ ++ ++ + SG QSPLI F+ ++GD+V
Sbjct: 380 RNHTATTGRPPLPTRVNNNNWYEENVNSGGQSPLIPMPPPPPPPPFRVPGFKFAVKGDYV 439

Query: 524 RIRSTNSSVCSSPDREDGDLSSTVMXXXXXXXXXXXXXXXXXXXPDVNAKADSFISRLKD 345
           ++RS +SS CSSP+ E+ D ST PDVN KADSFI+RL+
Sbjct: 440 KVRSAHSSRCSSPELEEVDRQST------DTVNMMEGGSVFCLSPDVNLKADSFIARLRG 493

Query: 344 EWRMEKINSIKGK-MG*SP 291
           EWR+EKINS+K + MG P
Sbjct: 494 EWRLEKINSLKNRSMGLGP 512