BLASTX nr result
ID: Atractylodes22_contig00032658
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00032658
         (573 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_003635109.1| PREDICTED: pentatricopeptide repeat-containi...    142  5e-32
ref|XP_002332091.1| predicted protein [Populus trichocarpa] gi|2...    141  9e-32
ref|XP_002872129.1| predicted protein [Arabidopsis lyrata subsp....    133  2e-29
ref|NP_568460.1| pentatricopeptide repeat-containing protein [Ar...    125  5e-27
dbj|BAC41999.1| unknown protein [Arabidopsis thaliana]                 125  5e-27

>ref|XP_003635109.1| PREDICTED: pentatricopeptide repeat-containing protein At5g24830-like [Vitis vinifera]
          Length = 580

 Score = 142 bits (357), Expect = 5e-32
 Identities = 70/124 (56%), Positives = 89/124 (71%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           LIH  IK G I +AHS+  EMLL G  PD VTYNLL+GA   FG ++ A ++YD+ML+RG
Sbjct: 455 LIHAQIKGGNIVDAHSIKKEMLLNGIYPDVVTYNLLIGAACNFGRIHFALRLYDEMLRRG 514

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
           Y+PDI+TYTELI+G+C+RG V EA+   A   LQ S L+IDH PFQILI+KY + R
Sbjct: 515 YEPDIITYTELIRGFCIRGHVMEAEELLA--KLQRSGLSIDHAPFQILIQKYCRTRVPGR 572

Query: 363 AYSV 374
           AY +
Sbjct: 573 AYDL 576

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 42/135 (31%), Positives = 65/135 (48%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           +I  L   G++  A  L  +M+  G +PD +T+N LV    K G L  A  +  +ML+ G
Sbjct: 137 MIRNLCLEGKLRAALWLRNKMIQKGVIPDVLTHNYLVNGLCKAGDLEKADNLVREMLEIG 196

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
             P+  T+   IKGYCL   V +A   + F  + +S +  + V + ILI    K
Sbjct: 197 PSPNCATFNTFIKGYCLNNNVDKA--LYLFSTMANSGIGPNKVTYNILIHALCKKGLLKD 254

Query: 363 AYSVYQKWLRMDQGQ 407
           A  + +K L  D G+
Sbjct: 255 ARKLLEKILDDDCGK 269

 Score = 60.8 bits (146), Expect = 1e-07
 Identities = 35/124 (28%), Positives = 63/124 (50%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           LIH      ++  A+    EM   G LPD  TYN L+    K G+L  A  ++  M K G
Sbjct: 315 LIHGFCLIQDMNSAYRYFCEMFKRGLLPDIFTYNTLISGFCKIGNLDEACYIHGVMSKMG 374

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
             PD+++Y  +I+G C+ G V  A++F     + ++ +  + + + ++I  + +  D  +
Sbjct: 375 AAPDLISYKMIIQGLCIHGDVIRANQFLV--CMLENLMVPEPLIWNVVIDGHGRHGDLSN 432

Query: 363 AYSV 374
           A S+
Sbjct: 433 ALSI 436

 Score = 59.3 bits (142), Expect = 4e-07
 Identities = 34/105 (32%), Positives = 55/105 (52%)
 Frame = +3

Query: 18  IKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRGYDPDI 197
           +K G++ +A     EML  G   D V YN+L+        + SAY+ + +M KRG  PDI
Sbjct: 285 LKKGDMVQALVHWDEMLQRGTQIDVVAYNVLIHGFCLIQDMNSAYRYFCEMFKRGLLPDI 344

Query: 198 VTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIK 332
            TY  LI G+C  G + EA   +   ++     A D + ++++I+
Sbjct: 345 FTYNTLISGFCKIGNLDEA--CYIHGVMSKMGAAPDLISYKMIIQ 387

>ref|XP_002332091.1| predicted protein [Populus trichocarpa]
 gi|222874911|gb|EEF12042.1| predicted protein [Populus trichocarpa]
          Length = 521

 Score = 141 bits (355), Expect = 9e-32
 Identities = 69/129 (53%), Positives = 92/129 (71%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           LIH  +K G I  A+ L  +MLL G  PD VTYNLL+GA +  GH++ A Q+YD+ML+ G
Sbjct: 374 LIHAQVKIGNILYAYFLKKDMLLKGLFPDVVTYNLLIGAAAHAGHIHYALQLYDEMLRGG 433

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
            +PD++TYTELI+GYC++   KEA+   A   L  S L IDHVPFQILIK+Y KM++ D
Sbjct: 434 CNPDMITYTELIRGYCVKYNTKEAEELLA--KLLKSGLLIDHVPFQILIKQYCKMKEPDR 491

Query: 363 AYSVYQKWL 389
           A+ +Y+KWL
Sbjct: 492 AFQLYRKWL 500

 Score = 63.2 bits (152), Expect = 3e-08
 Identities = 38/122 (31%), Positives = 67/122 (54%)
 Frame = +3

Query: 24  SGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRGYDPDIVT 203
           S E+  A+S   +ML  G LPD  TYN LV +  K G L  A  ++D ML+ G  PD V+
Sbjct: 241 SREMKLAYSYSCQMLKMGLLPDVFTYNTLVSSLCKSGKLDEACYMHDVMLRMGVAPDEVS 300

Query: 204 YTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDSAYSVYQK 383
           Y  +I+G C+ G V +A+ +   +   + ++  + + + ++I  Y +  D  +A+++  +
Sbjct: 301 YKLIIQGLCVCGDVDKANGY--LNCTLEKSMIPEPLVWNLIIDGYGRCGDICNAFAIRDR 358

Query: 384 WL 389
            L
Sbjct: 359 ML 360

 Score = 55.8 bits (133), Expect = 5e-06
 Identities = 35/109 (32%), Positives = 54/109 (49%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           +I  L   G++  A  L ++M+  G +PD +T+N +V    K   L  A  +  +ML +G
Sbjct: 70  IIKDLCLGGKLGPALWLRSKMIQKGFVPDVLTHNYMVNGLCKMIELEKADWLIREMLDKG 129

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILI 329
             P+  TY   IKGYCL  KV +A     F  + +S    + V F  L+
Sbjct: 130 PSPNCATYNTFIKGYCLLDKVDKA--LHLFSSMANSGTKPNRVTFNTLL 176

>ref|XP_002872129.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317966|gb|EFH48388.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 588

 Score = 133 bits (334), Expect = 2e-29
 Identities = 65/129 (50%), Positives = 88/129 (68%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           LIH  +K G + +A  +  EM      PDT TYNLLVGA    GHL  A+Q+YD+MLKRG
Sbjct: 444 LIHGYVKGGRLIDAWWVKNEMRSTKIHPDTTTYNLLVGAACTLGHLRLAFQLYDEMLKRG 503

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
             PDI+TYTEL++G C +G++KEA+   +   +Q S + +DHVPF IL+KKY +++  D
Sbjct: 504 CQPDIITYTELVRGLCWKGRLKEAESLLS--RMQVSGITMDHVPFLILVKKYTRLQRPDE 561

Query: 363 AYSVYQKWL 389
           AY VY+KWL
Sbjct: 562 AYLVYKKWL 570

 Score = 62.8 bits (151), Expect = 4e-08
 Identities = 37/130 (28%), Positives = 62/130 (47%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           L+    K+G + +A  +  EM       D+V YN+++      G++ +AY    DM+KRG
Sbjct: 269 LMDSCFKNGNVVQALEVWKEMSQKNVPTDSVVYNVIIRGLCSSGNMVAAYGFMCDMVKRG 328

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
            +PD+ TY  LI   C  GK   A        +Q+  +A D + ++++I+      D D
Sbjct: 329 VNPDVFTYNTLISALCKAGKFDVACDLHG--TMQNVGVAPDQISYKVIIQGLCIQGDVDR 386

Query: 363 AYSVYQKWLR 392
           A    Q  L+
Sbjct: 387 ANEFLQSMLK 396

 Score = 60.5 bits (145), Expect = 2e-07
 Identities = 40/124 (32%), Positives = 62/124 (50%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           +I  L  SG +  A+  M +M+  G  PD  TYN L+ A  K G    A  ++  M   G
Sbjct: 304 IIRGLCSSGNMVAAYGFMCDMVKRGVNPDVFTYNTLISALCKAGKFDVACDLHGTMQNVG 363

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
             PD ++Y  +I+G C++G V  A+ F    ML+ S L  + + + ++I  Y +  D
Sbjct: 364 VAPDQISYKVIIQGLCIQGDVDRANEFLQ-SMLKRS-LLPEVLLWNVVIDGYGRYGDTSC 421

Query: 363 AYSV 374
           A SV
Sbjct: 422 ALSV 425

>ref|NP_568460.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75154065|sp|Q8L6Y3.1|PP396_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g24830
 gi|22655280|gb|AAM98230.1| putative protein [Arabidopsis thaliana]
 gi|30725398|gb|AAP37721.1| At5g24830 [Arabidopsis thaliana]
 gi|332005984|gb|AED93367.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 593

 Score = 125 bits (314), Expect = 5e-27
 Identities = 61/129 (47%), Positives = 86/129 (66%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           LIH  +K G + +A  +  EM      PDT TYNLL+GA    GHL  A+Q+YD+ML+RG
Sbjct: 446 LIHGYVKGGRLIDAWWVKNEMRSTKIHPDTTTYNLLLGAACTLGHLRLAFQLYDEMLRRG 505

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
             PDI+TYTEL++G C +G++K+A+   +   +Q + + IDHVPF IL KKY +++
Sbjct: 506 CQPDIITYTELVRGLCWKGRLKKAESLLS--RIQATGITIDHVPFLILAKKYTRLQRPGE 563

Query: 363 AYSVYQKWL 389
           AY VY+KWL
Sbjct: 564 AYLVYKKWL 572

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 41/124 (33%), Positives = 63/124 (50%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           +I  L  SG +  A+  M +M+  G  PD  TYN L+ A  K G    A  ++  M   G
Sbjct: 306 IIRGLCSSGNMVAAYGFMCDMVKRGVNPDVFTYNTLISALCKEGKFDEACDLHGTMQNGG 365

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
             PD ++Y  +I+G C+ G V  A+ F    ML+ S+L  + + + ++I  Y +  D  S
Sbjct: 366 VAPDQISYKVIIQGLCIHGDVNRANEFL-LSMLK-SSLLPEVLLWNVVIDGYGRYGDTSS 423

Query: 363 AYSV 374
           A SV
Sbjct: 424 ALSV 427

 Score = 62.8 bits (151), Expect = 4e-08
 Identities = 33/110 (30%), Positives = 57/110 (51%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           L+    K+G + +A  +  EM       D+V YN+++      G++ +AY    DM+KRG
Sbjct: 271 LMDSCFKNGNVVQALEVWKEMSQKNVPADSVVYNVIIRGLCSSGNMVAAYGFMCDMVKRG 330

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIK 332
            +PD+ TY  LI   C  GK  EA        +Q+  +A D + ++++I+
Sbjct: 331 VNPDVFTYNTLISALCKEGKFDEACDLHG--TMQNGGVAPDQISYKVIIQ 378

>dbj|BAC41999.1| unknown protein [Arabidopsis thaliana]
          Length = 593

 Score = 125 bits (314), Expect = 5e-27
 Identities = 61/129 (47%), Positives = 86/129 (66%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           LIH  +K G + +A  +  EM      PDT TYNLL+GA    GHL  A+Q+YD+ML+RG
Sbjct: 446 LIHGYVKGGRLIDAWWVKNEMRSTKIHPDTTTYNLLLGAACTLGHLRLAFQLYDEMLRRG 505

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
             PDI+TYTEL++G C +G++K+A+   +   +Q + + IDHVPF IL KKY +++
Sbjct: 506 CQPDIITYTELVRGLCWKGRLKKAESLLS--RIQATGITIDHVPFLILAKKYTRLQRPGE 563

Query: 363 AYSVYQKWL 389
           AY VY+KWL
Sbjct: 564 AYLVYKKWL 572

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 41/124 (33%), Positives = 63/124 (50%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           +I  L  SG +  A+  M +M+  G  PD  TYN L+ A  K G    A  ++  M   G
Sbjct: 306 IIRGLCSSGNMVAAYGFMCDMVKRGVNPDVFTYNTLISALCKEGKFDEACDLHGTMQNGG 365

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIKKYFKMRDFDS 362
             PD ++Y  +I+G C+ G V  A+ F    ML+ S+L  + + + ++I  Y +  D  S
Sbjct: 366 VAPDQISYKVIIQGLCIHGDVNRANEFL-LSMLK-SSLLPEVLLWNVVIDGYGRYGDTSS 423

Query: 363 AYSV 374
           A SV
Sbjct: 424 ALSV 427

 Score = 62.8 bits (151), Expect = 4e-08
 Identities = 33/110 (30%), Positives = 57/110 (51%)
 Frame = +3

Query: 3   LIHMLIKSGEIAEAHSLMTEMLLAGPLPDTVTYNLLVGAESKFGHLYSAYQVYDDMLKRG 182
           L+    K+G + +A  +  EM       D+V YN+++      G++ +AY    DM+KRG
Sbjct: 271 LMDSCFKNGNVVQALEVWKEMSQKNVPADSVVYNVIIRGLCSSGNMVAAYGFMCDMVKRG 330

Query: 183 YDPDIVTYTELIKGYCLRGKVKEADRFFAFDMLQDSNLAIDHVPFQILIK 332
            +PD+ TY  LI   C  GK  EA        +Q+  +A D + ++++I+
Sbjct: 331 VNPDVFTYNTLISALCKEGKFDEACDLHG--TMQNGGVAPDQISYKVIIQ 378