BLASTX nr result
ID: Atractylodes22_contig00016936
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atractylodes22_contig00016936 (1790 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAN72416.1| hypothetical protein VITISV_027905 [Vitis vinifera] 571 e-160 ref|XP_002320901.1| predicted protein [Populus trichocarpa] gi|2... 567 e-159 ref|XP_003626608.1| Pentatricopeptide repeat-containing protein ... 565 e-158 ref|XP_004138384.1| PREDICTED: pentatricopeptide repeat-containi... 511 e-142 ref|NP_001154199.1| uncharacterized protein [Arabidopsis thalian... 479 e-132 >emb|CAN72416.1| hypothetical protein VITISV_027905 [Vitis vinifera] Length = 422 Score = 571 bits (1471), Expect = e-160 Identities = 285/410 (69%), Positives = 333/410 (81%), Gaps = 5/410 (1%) Frame = -1 Query: 1418 SPSRIQKLIASQSDPLVAKEIFDLASNASPGFSHSYATFQSLILKLGRSRHFXXXXXXXX 1239 SPSR+QKLIASQSDPL+AKEIFDLAS P F HSY++F LILKLG +R F Sbjct: 13 SPSRVQKLIASQSDPLLAKEIFDLAS-LQPNFKHSYSSFHILILKLGWARQFSLMQDLLM 71 Query: 1238 XXXSDRRYTVTPSLFTHIIRIYGDANLPDQALKTFYTILEFNIKPRTKQLNVILEILVSQ 1059 S++ Y++ PSLF+ II IYG+ANLPDQALKTF+++L+F+ KP K LN +L++LVS Sbjct: 72 RLKSEQ-YSINPSLFSDIIEIYGEANLPDQALKTFHSMLQFHSKPLPKHLNXLLQLLVSH 130 Query: 1058 RNYVRPAFDLFKSAHRYDVSPNVESYNILMRAFCLNGDLSIAYNLFNQMPKRDIVPDVES 879 RNY+RPAFDLFKSAHRY VSP+ +SYNILM AFC NGDLSIAY LFNQM KRD+ PDVES Sbjct: 131 RNYIRPAFDLFKSAHRYGVSPDTKSYNILMSAFCFNGDLSIAYTLFNQMFKRDVAPDVES 190 Query: 878 YRILMQGLCRKSQVNRAVDLLEDMLNKGFVPDSLTYTTLLNSLCRKKKLREAYKLLCRMK 699 YRILMQGLCRKSQVNRAVDLLEDMLNKG+VPD+L+YTTLLNSLCRKKKL+EAYKLLCRMK Sbjct: 191 YRILMQGLCRKSQVNRAVDLLEDMLNKGYVPDALSYTTLLNSLCRKKKLKEAYKLLCRMK 250 Query: 698 VKGCNPDIVHYNTVILGFCRENRAHDACKVLEDMPSNGCLPNLVSYRTLVGGLCSQGLYD 519 VKGCNPDIVHYNTVILGFCRE R DACKVLEDMPSNGC PNL+SY TLV GLC QGLYD Sbjct: 251 VKGCNPDIVHYNTVILGFCREGRXLDACKVLEDMPSNGCSPNLMSYGTLVSGLCDQGLYD 310 Query: 518 EAKTYLNLMTSKGFSPHVSIWLVLISGLCNVGKIEEACSVLEGMLKSGEAPPLNTWMNIV 339 EAK Y+ M SKGFSPH S++ LI+G CNVGK+EEAC VL ML+ GEA TW+ I+ Sbjct: 311 EAKNYVEEMLSKGFSPHFSVFHALINGFCNVGKLEEACEVLXEMLRHGEAXHTETWVAII 370 Query: 338 TRICEVE-----TERLEEVLKVEIEPHTRIVEAGVDLQEYLVKKARANAK 204 RICEV+ +E LK+EI P+TR+VEAG+ L+EY+++K R ++ Sbjct: 371 PRICEVDKLVRMENIFDEXLKLEITPNTRLVEAGIGLEEYVIRKVRDKSR 420 >ref|XP_002320901.1| predicted protein [Populus trichocarpa] gi|222861674|gb|EEE99216.1| predicted protein [Populus trichocarpa] Length = 475 Score = 567 bits (1461), Expect = e-159 Identities = 282/409 (68%), Positives = 334/409 (81%), Gaps = 5/409 (1%) Frame = -1 Query: 1418 SPSRIQKLIASQSDPLVAKEIFDLASNASPGFSHSYATFQSLILKLGRSRHFXXXXXXXX 1239 SPSR+QKLIASQSDPL+AKEIFD AS P F HSY+++ LILKLGR+++F Sbjct: 66 SPSRVQKLIASQSDPLLAKEIFDYASR-QPNFQHSYSSYLILILKLGRAKYFSFIDDLLT 124 Query: 1238 XXXSDRRYTVTPSLFTHIIRIYGDANLPDQALKTFYTILEFNIKPRTKQLNVILEILVSQ 1059 + Y VTP+LF++II IYG+ANLPD+ALK FYTIL+F+ P K LN ILEILVS Sbjct: 125 DLK-SKNYPVTPTLFSYIINIYGEANLPDKALKIFYTILKFDCNPSPKHLNGILEILVSH 183 Query: 1058 RNYVRPAFDLFKSAHRYDVSPNVESYNILMRAFCLNGDLSIAYNLFNQMPKRDIVPDVES 879 +NY++PAFDLFK AH YDV PN +SYNIL+RAFCLNG +S+AY+LFNQM KRD++PDVES Sbjct: 184 QNYIKPAFDLFKDAHTYDVFPNTKSYNILIRAFCLNGQISMAYSLFNQMFKRDVMPDVES 243 Query: 878 YRILMQGLCRKSQVNRAVDLLEDMLNKGFVPDSLTYTTLLNSLCRKKKLREAYKLLCRMK 699 YRILMQ LCRKSQVN AVDLLEDMLNKG+VPD+L+YTTLLNSLCRKKKLREAYKLLCRMK Sbjct: 244 YRILMQALCRKSQVNGAVDLLEDMLNKGYVPDALSYTTLLNSLCRKKKLREAYKLLCRMK 303 Query: 698 VKGCNPDIVHYNTVILGFCRENRAHDACKVLEDMPSNGCLPNLVSYRTLVGGLCSQGLYD 519 VKGCNPDI+HYNTVILGFCRE RA DACKVLEDM SNGC+PNLVSYRTLVGGLC QG++D Sbjct: 304 VKGCNPDIIHYNTVILGFCREGRAMDACKVLEDMESNGCMPNLVSYRTLVGGLCDQGMFD 363 Query: 518 EAKTYLNLMTSKGFSPHVSIWLVLISGLCNVGKIEEACSVLEGMLKSGEAPPLNTWMNIV 339 EAK++L M KGFSPH ++ LI G CNVGKIEEAC V+E +LK GEAP TW+ +V Sbjct: 364 EAKSHLEEMMMKGFSPHFAVSNALIKGFCNVGKIEEACGVVEELLKHGEAPHTETWVMMV 423 Query: 338 TRICEVET-----ERLEEVLKVEIEPHTRIVEAGVDLQEYLVKKARANA 207 +RICEV+ E L++V KVE++ TRIVEAG+ L+EYL+K+ + A Sbjct: 424 SRICEVDDLQRIGEILDKVKKVELKGDTRIVEAGIGLEEYLIKRTQQKA 472 >ref|XP_003626608.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|87240852|gb|ABD32710.1| Tetratricopeptide-like helical [Medicago truncatula] gi|355501623|gb|AES82826.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 451 Score = 565 bits (1457), Expect = e-158 Identities = 283/425 (66%), Positives = 339/425 (79%), Gaps = 5/425 (1%) Frame = -1 Query: 1463 FHSSSTLLLNHNQLPSPSRIQKLIASQSDPLVAKEIFDLASNASPGFSHSYATFQSLILK 1284 FHSSS+ + + + SP+R+QKLIASQSDPL+AKEIFD AS P F H+Y+T+ LILK Sbjct: 29 FHSSSS---SSSPIGSPTRVQKLIASQSDPLLAKEIFDYAS-LQPNFRHNYSTYLILILK 84 Query: 1283 LGRSRHFXXXXXXXXXXXSDRRYTVTPSLFTHIIRIYGDANLPDQALKTFYTILEFNIKP 1104 GRS+HF S+ +TP+LF+++I+IYG+ANLPD+AL TFY +L+FNIKP Sbjct: 85 FGRSKHFSLLDDLLRRLKSESSQPITPTLFSYLIKIYGEANLPDKALNTFYIMLQFNIKP 144 Query: 1103 RTKQLNVILEILVSQRNYVRPAFDLFKSAHRYDVSPNVESYNILMRAFCLNGDLSIAYNL 924 TK LN IL+ILVS RNY+RPAFDLFK AH++ V P+ +SYNILMRAFCLNGD+SIAY L Sbjct: 145 LTKHLNRILDILVSHRNYLRPAFDLFKDAHKHGVFPDTKSYNILMRAFCLNGDISIAYTL 204 Query: 923 FNQMPKRDIVPDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGFVPDSLTYTTLLNSLCR 744 FN+M KRD+VPD++SYRILMQ LCRKSQVN AVDL EDMLNKGFVPDS TYTTLLNSLCR Sbjct: 205 FNKMFKRDVVPDIQSYRILMQALCRKSQVNGAVDLFEDMLNKGFVPDSFTYTTLLNSLCR 264 Query: 743 KKKLREAYKLLCRMKVKGCNPDIVHYNTVILGFCRENRAHDACKVLEDMPSNGCLPNLVS 564 KKKLREAYKLLCRMKVKGCNPDIVHYNTVILGFCRE RAHDACKV++DM +NGCLPNLVS Sbjct: 265 KKKLREAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRAHDACKVIDDMQANGCLPNLVS 324 Query: 563 YRTLVGGLCSQGLYDEAKTYLNLMTSKGFSPHVSIWLVLISGLCNVGKIEEACSVLEGML 384 YRTLV GLC G+ DEA Y+ M SKGFSPH ++ L+ G CNVG+IEEAC VL L Sbjct: 325 YRTLVNGLCHLGMLDEATKYVEEMLSKGFSPHFAVIHALVKGFCNVGRIEEACGVLTKSL 384 Query: 383 KSGEAPPLNTWMNIVTRICEVE-----TERLEEVLKVEIEPHTRIVEAGVDLQEYLVKKA 219 + EAP +TWM IV +ICEV+ LEEVLK+EI+ TRIV+AG+ L++YL++K Sbjct: 385 EHREAPHKDTWMIIVPQICEVDDGVKIDGVLEEVLKIEIKGDTRIVDAGIGLEDYLIRKI 444 Query: 218 RANAK 204 RA ++ Sbjct: 445 RAKSR 449 >ref|XP_004138384.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucumis sativus] gi|449499186|ref|XP_004160743.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucumis sativus] Length = 482 Score = 511 bits (1317), Expect = e-142 Identities = 263/448 (58%), Positives = 331/448 (73%), Gaps = 16/448 (3%) Frame = -1 Query: 1505 ILDCSVKTLKPVVFFHSSSTLLLN---HNQ--------LPSPSRIQKLIASQSDPLVAKE 1359 ++ S +P + H+ S L+ H Q + SP R+QKLIASQSDPL+AKE Sbjct: 32 LISSSSSLYQPHLNVHNESKFLITNVKHEQCEDQPDFSIGSPCRVQKLIASQSDPLLAKE 91 Query: 1358 IFDLASNASPGFSHSYATFQSLILKLGRSRHFXXXXXXXXXXXSDRRYTVTPSLFTHIIR 1179 IFD A P F S ++ LILKLGRS++F RRY VTP+ F++II+ Sbjct: 92 IFDYACR-QPHFRPSSSSLLVLILKLGRSKYFSLIDDLLLSFK-SRRYPVTPTAFSYIIK 149 Query: 1178 IYGDANLPDQALKTFYTILEFNIKPRTKQLNVILEILVSQRNYVRPAFDLFKSAHRYDVS 999 IYG+A+LPD+ALK FYT+++F P +KQLN ILEILVS RN++RPAFDLFK+A + V Sbjct: 150 IYGEADLPDKALKVFYTMIDFGCTPSSKQLNRILEILVSHRNFIRPAFDLFKNARHHGVL 209 Query: 998 PNVESYNILMRAFCLNGDLSIAYNLFNQMPKRDIVPDVESYRILMQGLCRKSQVNRAVDL 819 PN +SYNIL+RAFC NG++SIAY LFN+M +R+++PDVE+YR LMQGLCRK+QVN AVDL Sbjct: 210 PNTKSYNILIRAFCWNGNISIAYTLFNKMFERNVIPDVETYRTLMQGLCRKNQVNGAVDL 269 Query: 818 LEDMLNKGFVPDSLTYTTLLNSLCRKKKLREAYKLLCRMKVKGCNPDIVHYNTVILGFCR 639 LEDMLNKG++PD+L+Y TLLNSLCRKKKLREAYKLLCRMKVKGCNPDI HYNTVI+GFCR Sbjct: 270 LEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCR 329 Query: 638 ENRAHDACKVLEDMPSNGCLPNLVSYRTLVGGLCSQGLYDEAKTYLNLMTSKGFSPHVSI 459 E RA DACK+LEDM SNGCLPNLVSY +L GLC QG+++ AK Y+ MT KGF PH S+ Sbjct: 330 EGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFSV 389 Query: 458 WLVLISGLCNVGKIEEACSVLEGMLKSGEAPPLNTWMNIVTRICEVE-----TERLEEVL 294 L+ G ++G+I E+CSVLE MLK G+AP +TW I++ ICEVE E E++L Sbjct: 390 IHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKIL 449 Query: 293 KVEIEPHTRIVEAGVDLQEYLVKKARAN 210 K ++ TRIVEAG L EYL++K +A+ Sbjct: 450 KKDVRRDTRIVEAGTGLGEYLIRKLQAS 477 >ref|NP_001154199.1| uncharacterized protein [Arabidopsis thaliana] gi|223635643|sp|Q8LDU5.2|PP298_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g01400, mitochondrial; Flags: Precursor gi|332656621|gb|AEE82021.1| uncharacterized protein [Arabidopsis thaliana] Length = 466 Score = 479 bits (1232), Expect = e-132 Identities = 245/417 (58%), Positives = 310/417 (74%), Gaps = 5/417 (1%) Frame = -1 Query: 1418 SPSRIQKLIASQSDPLVAKEIFDLASNASPGFSHSYATFQSLILKLGRSRHFXXXXXXXX 1239 SP+R+QKLIASQSDPL+AKEIFD AS P F HS ++ LILKLGR R+F Sbjct: 50 SPTRVQKLIASQSDPLLAKEIFDYASQ-QPNFRHSRSSHLILILKLGRGRYFNLIDDVLA 108 Query: 1238 XXXSDRRYTVTPSLFTHIIRIYGDANLPDQALKTFYTILEFNIKPRTKQLNVILEILVSQ 1059 S Y +T +FT++I++Y +A LP++ L TFY +LEFN P+ K LN IL++LVS Sbjct: 109 KHRSSG-YPLTGEIFTYLIKVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSH 167 Query: 1058 RNYVRPAFDLFKSAHRYDVSPNVESYNILMRAFCLNGDLSIAYNLFNQMPKRDIVPDVES 879 R Y++ AF+LFKS+ + V PN SYN+LM+AFCLN DLSIAY LF +M +RD+VPDV+S Sbjct: 168 RGYLQKAFELFKSSRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDS 227 Query: 878 YRILMQGLCRKSQVNRAVDLLEDMLNKGFVPDSLTYTTLLNSLCRKKKLREAYKLLCRMK 699 Y+IL+QG CRK QVN A++LL+DMLNKGFVPD L+YTTLLNSLCRK +LREAYKLLCRMK Sbjct: 228 YKILIQGFCRKGQVNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMK 287 Query: 698 VKGCNPDIVHYNTVILGFCRENRAHDACKVLEDMPSNGCLPNLVSYRTLVGGLCSQGLYD 519 +KGCNPD+VHYNT+ILGFCRE+RA DA KVL+DM SNGC PN VSYRTL+GGLC QG++D Sbjct: 288 LKGCNPDLVHYNTMILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFD 347 Query: 518 EAKTYLNLMTSKGFSPHVSIWLVLISGLCNVGKIEEACSVLEGMLKSGEAPPLNTWMNIV 339 E K YL M SKGFSPH S+ L+ G C+ GK+EEAC V+E ++K+GE +TW ++ Sbjct: 348 EGKKYLEEMISKGFSPHFSVSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVI 407 Query: 338 TRIC-EVETER----LEEVLKVEIEPHTRIVEAGVDLQEYLVKKARANAKFKSGREK 183 IC E E+E+ LE+ +K EI TRIV+ G+ L YL K + K K+ RE+ Sbjct: 408 PLICNEDESEKIKLFLEDAVKEEITGDTRIVDVGIGLGSYLSSKLQ--MKRKNARER 462