BLASTX nr result
ID: Atractylodes21_contig00014885
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00014885
         (282 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002271725.2| PREDICTED: pentatricopeptide repeat-containi...   139   2e-31
emb|CAN79811.1| hypothetical protein VITISV_018821 [Vitis vinifera]   139   2e-31
ref|XP_002871739.1| pentatricopeptide repeat-containing protein ...   136   2e-30
ref|NP_197188.1| pentatricopeptide repeat-containing protein [Ar...   135   4e-30
ref|XP_002515835.1| pentatricopeptide repeat-containing protein,...   134   1e-29

>ref|XP_002271725.2| PREDICTED: pentatricopeptide repeat-containing protein
           At5g16860-like [Vitis vinifera]
          Length = 852

 Score = 139 bits (351), Expect = 2e-31
 Identities = 64/93 (68%), Positives = 78/93 (83%)
 Frame = +2

Query: 2   VLSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIF 181
           +LSGCA  GTLL GKE HCHAIK ILN++ NDPGD+ MVIN+LIDMY+KCK+   AR +F
Sbjct: 373 LLSGCALAGTLLHGKETHCHAIKWILNLDENDPGDDLMVINALIDMYSKCKSPKAARAMF 432

Query: 182 DSVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           D + P +R+VVTWTV+IGG +QHGEAN+ALELF
Sbjct: 433 DLIPPKDRSVVTWTVLIGGNAQHGEANEALELF 465

 Score = 65.1 bits (157), Expect = 6e-09
 Identities = 36/93 (38%), Positives = 56/93 (60%)
 Frame = +2

Query: 2   VLSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIF 181
           VL  CA VG   +GK++H +A++  L        ++  V N+++DMYAKC   + A K+F
Sbjct: 237 VLPACASVGAWSRGKQVHGYALRSGLF-------EDVFVGNAVVDMYAKCGMMEEANKVF 289

Query: 182 DSVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           + +    ++VV+W  M+ GYSQ G  +DAL LF
Sbjct: 290 ERM--KVKDVVSWNAMVTGYSQIGRFDDALGLF 320

 Score = 61.6 bits (148), Expect = 6e-08
 Identities = 34/92 (36%), Positives = 50/92 (54%)
 Frame = +2

Query: 5   LSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIFD 184
           L  CA +G L  G++IH + ++      N        V N LIDMY+K    D AR +FD
Sbjct: 486 LMACARLGALRFGRQIHAYVLR------NRFESAMLFVANCLIDMYSKSGDVDAARVVFD 539

Query: 185 SVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           ++    RN V+WT ++ GY  HG   +AL++F
Sbjct: 540 NM--HQRNGVSWTSLMTGYGMHGRGEEALQIF 569

>emb|CAN79811.1| hypothetical protein VITISV_018821 [Vitis vinifera]
          Length = 871

 Score = 139 bits (351), Expect = 2e-31
 Identities = 64/93 (68%), Positives = 78/93 (83%)
 Frame = +2

Query: 2   VLSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIF 181
           +LSGCA  GTLL GKE HCHAIK ILN++ NDPGD+ MVIN+LIDMY+KCK+   AR +F
Sbjct: 392 LLSGCASAGTLLHGKETHCHAIKWILNLDENDPGDDLMVINALIDMYSKCKSPKAARAMF 451

Query: 182 DSVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           D + P +R+VVTWTV+IGG +QHGEAN+ALELF
Sbjct: 452 DLIPPKDRSVVTWTVLIGGNAQHGEANEALELF 484

 Score = 65.1 bits (157), Expect = 6e-09
 Identities = 36/93 (38%), Positives = 56/93 (60%)
 Frame = +2

Query: 2   VLSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIF 181
           VL  CA VG   +GK++H +A++  L        ++  V N+++DMYAKC   + A K+F
Sbjct: 256 VLPACASVGAWSRGKQVHGYALRSGLF-------EDVFVGNAVVDMYAKCGMMEEANKVF 308

Query: 182 DSVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           + +    ++VV+W  M+ GYSQ G  +DAL LF
Sbjct: 309 ERM--KVKDVVSWNAMVTGYSQIGRFDDALGLF 339

 Score = 61.6 bits (148), Expect = 6e-08
 Identities = 34/92 (36%), Positives = 50/92 (54%)
 Frame = +2

Query: 5   LSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIFD 184
           L  CA +G L  G++IH + ++      N        V N LIDMY+K    D AR +FD
Sbjct: 505 LMACARLGALRFGRQIHAYVLR------NRFESAMLFVANCLIDMYSKSGDVDAARVVFD 558

Query: 185 SVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           ++    RN V+WT ++ GY  HG   +AL++F
Sbjct: 559 NM--HQRNGVSWTSLMTGYGMHGRGEEALQIF 588

>ref|XP_002871739.1| pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
 gi|297317576|gb|EFH47998.1| pentatricopeptide repeat-containing protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 850

 Score = 136 bits (342), Expect = 2e-30
 Identities = 64/92 (69%), Positives = 74/92 (80%)
 Frame = +2

Query: 2   VLSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIF 181
           VLSGCA VG L+ GKEIHC+AIK  +++  N  GDE MVIN LIDMYAKCK  D+AR +F
Sbjct: 371 VLSGCASVGALMHGKEIHCYAIKYPMDLRKNGHGDENMVINQLIDMYAKCKKVDIARAMF 430

Query: 182 DSVGPSNRNVVTWTVMIGGYSQHGEANDALEL 277
           DS+ P  R+VVTWTVMIGGYSQHG+AN ALEL
Sbjct: 431 DSLSPKERDVVTWTVMIGGYSQHGDANKALEL 462

 Score = 62.4 bits (150), Expect = 4e-08
 Identities = 38/94 (40%), Positives = 52/94 (55%), Gaps = 2/94 (2%)
 Frame = +2

Query: 5   LSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDL--ARKI 178
           L  CA +  L  GK+IH +A++      N        V N LIDMYAKC  GD+  AR +
Sbjct: 484 LVACASLAALSIGKQIHAYALR------NQQNAVPLFVSNCLIDMYAKC--GDIGDARLV 535

Query: 179 FDSVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           FD++    +N VTWT ++ GY  HG   +AL +F
Sbjct: 536 FDNM--MEKNEVTWTSLMTGYGMHGYGEEALGIF 567

 Score = 57.0 bits (136), Expect = 2e-06
 Identities = 36/93 (38%), Positives = 48/93 (51%)
 Frame = +2

Query: 2   VLSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIF 181
           VL  CA VGT   GK+ H  A+   +  N         V N L+DMYAK    D A  +F
Sbjct: 235 VLPPCASVGTRSLGKQFHGFAVTSEMIQN-------MFVGNCLVDMYAKFGMMDEANTVF 287

Query: 182 DSVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
            ++    ++VV+W  M+ GYSQ G   DA+ LF
Sbjct: 288 SNMPV--KDVVSWNAMVAGYSQIGRFEDAVRLF 318

>ref|NP_197188.1| pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
 gi|75174141|sp|Q9LFL5.1|PP390_ARATH RecName: Full=Pentatricopeptide
           repeat-containing protein At5g16860
 gi|9755687|emb|CAC01699.1| putative protein [Arabidopsis thaliana]
 gi|332004967|gb|AED92350.1| pentatricopeptide repeat-containing protein
           [Arabidopsis thaliana]
          Length = 850

 Score = 135 bits (339), Expect = 4e-30
 Identities = 64/92 (69%), Positives = 73/92 (79%)
 Frame = +2

Query: 2   VLSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIF 181
           VLSGCA VG L+ GKEIHC+AIK  +++  N  GDE MVIN LIDMYAKCK  D AR +F
Sbjct: 371 VLSGCASVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMF 430

Query: 182 DSVGPSNRNVVTWTVMIGGYSQHGEANDALEL 277
           DS+ P  R+VVTWTVMIGGYSQHG+AN ALEL
Sbjct: 431 DSLSPKERDVVTWTVMIGGYSQHGDANKALEL 462

 Score = 67.0 bits (162), Expect = 1e-09
 Identities = 38/93 (40%), Positives = 52/93 (55%)
 Frame = +2

Query: 2   VLSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIF 181
           VL  CA +GT   GK++HC A+   +  N         V N L+DMYAKC   D A  +F
Sbjct: 235 VLPPCASLGTHSLGKQLHCFAVTSEMIQN-------MFVGNCLVDMYAKCGMMDEANTVF 287

Query: 182 DSVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
            ++  S ++VV+W  M+ GYSQ G   DA+ LF
Sbjct: 288 SNM--SVKDVVSWNAMVAGYSQIGRFEDAVRLF 318

 Score = 62.8 bits (151), Expect = 3e-08
 Identities = 36/92 (39%), Positives = 50/92 (54%)
 Frame = +2

Query: 5   LSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIFD 184
           L  CA +  L  GK+IH +A++      N        V N LIDMYAKC +   AR +FD
Sbjct: 484 LVACASLAALRIGKQIHAYALR------NQQNAVPLFVSNCLIDMYAKCGSISDARLVFD 537

Query: 185 SVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           ++    +N VTWT ++ GY  HG   +AL +F
Sbjct: 538 NM--MAKNEVTWTSLMTGYGMHGYGEEALGIF 567

>ref|XP_002515835.1| pentatricopeptide repeat-containing protein, putative
           [Ricinus communis]
 gi|223544990|gb|EEF46504.1| pentatricopeptide repeat-containing protein,
           putative [Ricinus communis]
          Length = 655

 Score = 134 bits (336), Expect = 1e-29
 Identities = 60/93 (64%), Positives = 76/93 (81%)
 Frame = +2

Query: 2   VLSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIF 181
           +LSGCA VG LL GKE HC++IK +LN + +DP DE +V+N++IDMY KCK  ++ R IF
Sbjct: 388 LLSGCASVGALLHGKETHCYSIKCVLNFDRSDPRDELLVVNAIIDMYTKCKDINVGRAIF 447

Query: 182 DSVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           +S+ P +RNVVTWT MIGGY+QHGEANDALELF
Sbjct: 448 NSIPPKDRNVVTWTAMIGGYAQHGEANDALELF 480

 Score = 70.1 bits (170), Expect = 2e-10
 Identities = 40/93 (43%), Positives = 56/93 (60%)
 Frame = +2

Query: 2   VLSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIF 181
           VL  CA +G  L GK++H  AI+  L        ++  V NSL+DMYAKC    +A K+F
Sbjct: 252 VLPACASMGDWLCGKQVHGFAIRYGLF-------EDVFVANSLVDMYAKCGLMCIANKVF 304

Query: 182 DSVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           D +   +++VV+W  M+ GYSQ G+  DAL LF
Sbjct: 305 DRM--QHKDVVSWNAMVTGYSQIGKFEDALGLF 335

 Score = 58.9 bits (141), Expect = 4e-07
 Identities = 32/92 (34%), Positives = 50/92 (54%)
 Frame = +2

Query: 5   LSGCACVGTLLQGKEIHCHAIKQILNINNNDPGDEQMVINSLIDMYAKCKAGDLARKIFD 184
           L  CA +  L  G++IH   ++   +       D   V N LIDMY+K    D AR +FD
Sbjct: 502 LMACARLAALRFGRQIHAFVLRDQYDC------DVLYVANCLIDMYSKSGDMDAARLVFD 555

Query: 185 SVGPSNRNVVTWTVMIGGYSQHGEANDALELF 280
           ++   +RN V+WT ++ GY  HG   +A+++F
Sbjct: 556 NM--KHRNTVSWTSLMTGYGMHGHGEEAIKVF 585