BLASTX nr result
ID: Atractylodes21_contig00030082
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atractylodes21_contig00030082 (964 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI16200.3| unnamed protein product [Vitis vinifera] 346 5e-93 ref|XP_002281474.1| PREDICTED: pentatricopeptide repeat-containi... 346 5e-93 emb|CAN80345.1| hypothetical protein VITISV_003133 [Vitis vinifera] 345 7e-93 ref|XP_002531100.1| pentatricopeptide repeat-containing protein,... 328 1e-87 ref|NP_172253.1| pentatricopeptide repeat-containing protein [Ar... 311 2e-82 >emb|CBI16200.3| unnamed protein product [Vitis vinifera] Length = 1093 Score = 346 bits (887), Expect = 5e-93 Identities = 168/284 (59%), Positives = 213/284 (75%) Frame = +2 Query: 113 KQRPHKPTKALRKPIPFVTDLKDIQNSDEALTLFHDYCQTGGFKHDYPSYSCLIYKLARK 292 ++ PH+ T LRK IPF+ DLK +Q+ D+AL+LF+ Y Q G FKHDYPSYS L+YKLAR Sbjct: 439 REPPHRSTSRLRKRIPFLADLKSVQDPDDALSLFNQYQQMG-FKHDYPSYSALVYKLARS 497 Query: 293 RNFKAVEILLQQLQHYNVRCKEALFIGLIEHYGKSGLPDKAIELFRRMPSFECYRSXXXX 472 RNF+AVE LL LQ+ N+RC+E LFI LI+HYGKS +P+KA+ELF+RMPSF C+R+ Sbjct: 498 RNFEAVETLLDYLQNINIRCRETLFIALIQHYGKSQMPEKAVELFQRMPSFNCHRTLVSF 557 Query: 473 XXXXXXXXXXGRFDDAEEMFKCCSKTGFRPNAVSFNIMIKWWLQRGEWDEARKVFAEMLE 652 RF DA +F +K GFR N++SFNI+IK WL +GEWD+A +VF EM++ Sbjct: 558 NTLLNVLVENDRFLDAIGIFDRSTKMGFRRNSISFNIIIKGWLGKGEWDKAWQVFEEMID 617 Query: 653 REVEPTVVTYNCQIGFWSKKGKFSEAKSLFNDMICKGKKPNAISYALLMEGLCSQEKFKE 832 +EV+PTVVT+N IGF KG A L DMI K +PNA++YALLMEGLCS K+KE Sbjct: 618 KEVKPTVVTFNSLIGFLCGKGDLDGAMGLLEDMIQKRHRPNAVTYALLMEGLCSLGKYKE 677 Query: 833 AKKMMFDMEYQGCKPRLVNYGILMNDLARRGNFDEAKALLLEMK 964 AKKMMFDM+YQGCKPRL+N+G+LM+DL RRG D++K LLLEMK Sbjct: 678 AKKMMFDMDYQGCKPRLLNFGVLMSDLGRRGRIDDSKTLLLEMK 721 Score = 82.4 bits (202), Expect = 1e-13 Identities = 47/154 (30%), Positives = 83/154 (53%) Frame = +2 Query: 503 GRFDDAEEMFKCCSKTGFRPNAVSFNIMIKWWLQRGEWDEARKVFAEMLEREVEPTVVTY 682 G++ +A++M G +P ++F +++ +RG D+++ + EM R +P VVTY Sbjct: 673 GKYKEAKKMMFDMDYQGCKPRLLNFGVLMSDLGRRGRIDDSKTLLLEMKRRRFKPDVVTY 732 Query: 683 NCQIGFWSKKGKFSEAKSLFNDMICKGKKPNAISYALLMEGLCSQEKFKEAKKMMFDMEY 862 N I K+G+ EA + +M G +PNA +Y ++++G C E F+ K++ M Sbjct: 733 NILINHLCKEGRALEAYKVLVEMQVGGCEPNAATYRMMVDGFCQVEDFEGGLKVLSAMLM 792 Query: 863 QGCKPRLVNYGILMNDLARRGNFDEAKALLLEMK 964 G PRL ++ L+ L + G D A +L EM+ Sbjct: 793 CGHCPRLESFCDLVVGLLKNGKIDGACFVLEEME 826 >ref|XP_002281474.1| PREDICTED: pentatricopeptide repeat-containing protein At1g07740, mitochondrial-like [Vitis vinifera] Length = 501 Score = 346 bits (887), Expect = 5e-93 Identities = 168/284 (59%), Positives = 213/284 (75%) Frame = +2 Query: 113 KQRPHKPTKALRKPIPFVTDLKDIQNSDEALTLFHDYCQTGGFKHDYPSYSCLIYKLARK 292 ++ PH+ T LRK IPF+ DLK +Q+ D+AL+LF+ Y Q G FKHDYPSYS L+YKLAR Sbjct: 77 REPPHRSTSRLRKRIPFLADLKSVQDPDDALSLFNQYQQMG-FKHDYPSYSALVYKLARS 135 Query: 293 RNFKAVEILLQQLQHYNVRCKEALFIGLIEHYGKSGLPDKAIELFRRMPSFECYRSXXXX 472 RNF+AVE LL LQ+ N+RC+E LFI LI+HYGKS +P+KA+ELF+RMPSF C+R+ Sbjct: 136 RNFEAVETLLDYLQNINIRCRETLFIALIQHYGKSQMPEKAVELFQRMPSFNCHRTLVSF 195 Query: 473 XXXXXXXXXXGRFDDAEEMFKCCSKTGFRPNAVSFNIMIKWWLQRGEWDEARKVFAEMLE 652 RF DA +F +K GFR N++SFNI+IK WL +GEWD+A +VF EM++ Sbjct: 196 NTLLNVLVENDRFLDAIGIFDRSTKMGFRRNSISFNIIIKGWLGKGEWDKAWQVFEEMID 255 Query: 653 REVEPTVVTYNCQIGFWSKKGKFSEAKSLFNDMICKGKKPNAISYALLMEGLCSQEKFKE 832 +EV+PTVVT+N IGF KG A L DMI K +PNA++YALLMEGLCS K+KE Sbjct: 256 KEVKPTVVTFNSLIGFLCGKGDLDGAMGLLEDMIQKRHRPNAVTYALLMEGLCSLGKYKE 315 Query: 833 AKKMMFDMEYQGCKPRLVNYGILMNDLARRGNFDEAKALLLEMK 964 AKKMMFDM+YQGCKPRL+N+G+LM+DL RRG D++K LLLEMK Sbjct: 316 AKKMMFDMDYQGCKPRLLNFGVLMSDLGRRGRIDDSKTLLLEMK 359 Score = 82.4 bits (202), Expect = 1e-13 Identities = 47/154 (30%), Positives = 83/154 (53%) Frame = +2 Query: 503 GRFDDAEEMFKCCSKTGFRPNAVSFNIMIKWWLQRGEWDEARKVFAEMLEREVEPTVVTY 682 G++ +A++M G +P ++F +++ +RG D+++ + EM R +P VVTY Sbjct: 311 GKYKEAKKMMFDMDYQGCKPRLLNFGVLMSDLGRRGRIDDSKTLLLEMKRRRFKPDVVTY 370 Query: 683 NCQIGFWSKKGKFSEAKSLFNDMICKGKKPNAISYALLMEGLCSQEKFKEAKKMMFDMEY 862 N I K+G+ EA + +M G +PNA +Y ++++G C E F+ K++ M Sbjct: 371 NILINHLCKEGRALEAYKVLVEMQVGGCEPNAATYRMMVDGFCQVEDFEGGLKVLSAMLM 430 Query: 863 QGCKPRLVNYGILMNDLARRGNFDEAKALLLEMK 964 G PRL ++ L+ L + G D A +L EM+ Sbjct: 431 CGHCPRLESFCDLVVGLLKNGKIDGACFVLEEME 464 >emb|CAN80345.1| hypothetical protein VITISV_003133 [Vitis vinifera] Length = 1051 Score = 345 bits (886), Expect = 7e-93 Identities = 169/284 (59%), Positives = 212/284 (74%) Frame = +2 Query: 113 KQRPHKPTKALRKPIPFVTDLKDIQNSDEALTLFHDYCQTGGFKHDYPSYSCLIYKLARK 292 ++ PH+ T LRK IPF+ DLK +Q+ D+AL+LF+ Y Q G FKHDYPSYS L+YKLAR Sbjct: 368 REPPHRSTSRLRKRIPFLADLKSVQDPDDALSLFNQYQQMG-FKHDYPSYSALVYKLARS 426 Query: 293 RNFKAVEILLQQLQHYNVRCKEALFIGLIEHYGKSGLPDKAIELFRRMPSFECYRSXXXX 472 RNF+AVE LL LQ+ N+RC+E LFI LI+HYGKS +P+KAIELF+RMPSF C+R+ Sbjct: 427 RNFEAVETLLDYLQNINIRCRETLFIALIQHYGKSQMPEKAIELFQRMPSFNCHRTIVSF 486 Query: 473 XXXXXXXXXXGRFDDAEEMFKCCSKTGFRPNAVSFNIMIKWWLQRGEWDEARKVFAEMLE 652 RF DA +F +K GFR N++SFNI+IK WL +GEWD+A +VF EM++ Sbjct: 487 NTLLNVLVEIDRFLDAIGIFDRSTKMGFRRNSISFNIIIKGWLGKGEWDKAWQVFEEMID 546 Query: 653 REVEPTVVTYNCQIGFWSKKGKFSEAKSLFNDMICKGKKPNAISYALLMEGLCSQEKFKE 832 +EV+PTVVT+N IGF KG A L DMI K +PNA++YALLMEGLCS K+KE Sbjct: 547 KEVKPTVVTFNSLIGFLCGKGDLDGAMGLLZDMIQKRHRPNAVTYALLMEGLCSLGKYKE 606 Query: 833 AKKMMFDMEYQGCKPRLVNYGILMNDLARRGNFDEAKALLLEMK 964 AKKMMFDM+YQGCKPRL+N+G+LM+DL RRG D+ K LLLEMK Sbjct: 607 AKKMMFDMDYQGCKPRLLNFGVLMSDLGRRGRIDDXKTLLLEMK 650 Score = 82.4 bits (202), Expect = 1e-13 Identities = 66/264 (25%), Positives = 118/264 (44%) Frame = +2 Query: 173 LKDIQNSDEALTLFHDYCQTGGFKHDYPSYSCLIYKLARKRNFKAVEILLQQLQHYNVRC 352 L +I +A+ +F D GF+ + S++ +I K + + +++ V+ Sbjct: 493 LVEIDRFLDAIGIF-DRSTKMGFRRNSISFNIIIKGWLGKGEWDKAWQVFEEMIDKEVKP 551 Query: 353 KEALFIGLIEHYGKSGLPDKAIELFRRMPSFECYRSXXXXXXXXXXXXXXGRFDDAEEMF 532 F LI G D A+ L M + G++ +A++M Sbjct: 552 TVVTFNSLIGFLCGKGDLDGAMGLLZDMIQKRHRPNAVTYALLMEGLCSLGKYKEAKKMM 611 Query: 533 KCCSKTGFRPNAVSFNIMIKWWLQRGEWDEARKVFAEMLEREVEPTVVTYNCQIGFWSKK 712 G +P ++F +++ +RG D+ + + EM R +P VVTYN I K+ Sbjct: 612 FDMDYQGCKPRLLNFGVLMSDLGRRGRIDDXKTLLLEMKRRRFKPDVVTYNILINXLCKE 671 Query: 713 GKFSEAKSLFNDMICKGKKPNAISYALLMEGLCSQEKFKEAKKMMFDMEYQGCKPRLVNY 892 G+ EA + +M G +PNA +Y ++++G C E F+ K++ M G PRL ++ Sbjct: 672 GRAXEAYKVLVEMQVGGCEPNAATYRMMVDGFCQVEDFEGGLKVLSAMLMCGHCPRLESF 731 Query: 893 GILMNDLARRGNFDEAKALLLEMK 964 L+ L + G D A +L EM+ Sbjct: 732 CDLVVGLLKNGKIDGACFVLEEME 755 >ref|XP_002531100.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223529296|gb|EEF31265.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 483 Score = 328 bits (841), Expect = 1e-87 Identities = 160/287 (55%), Positives = 208/287 (72%) Frame = +2 Query: 104 RYHKQRPHKPTKALRKPIPFVTDLKDIQNSDEALTLFHDYCQTGGFKHDYPSYSCLIYKL 283 R Q+ T+ R+ IPFV ++K++++ D+AL+LFHDY Q G F+HDYPSYS L+YKL Sbjct: 32 RKGNQKHQHFTRRQRRDIPFVNNVKEVEDPDKALSLFHDYLQNG-FRHDYPSYSALVYKL 90 Query: 284 ARKRNFKAVEILLQQLQHYNVRCKEALFIGLIEHYGKSGLPDKAIELFRRMPSFECYRSX 463 AR R F+AVE +L LQ +NVRC++ LFI L EHYGK GL KAI LF M F C R+ Sbjct: 91 ARSRRFEAVETVLGYLQDFNVRCRDTLFIALFEHYGKVGLVAKAIRLFNEMTGFNCIRTL 150 Query: 464 XXXXXXXXXXXXXGRFDDAEEMFKCCSKTGFRPNAVSFNIMIKWWLQRGEWDEARKVFAE 643 R DA+++F S+ GFR N+V FNI+IK WL++GEW +A KVF E Sbjct: 151 QSFNALLNVLVDNDRLFDAKQLFDRSSEMGFRLNSVPFNILIKGWLKKGEWYQAGKVFDE 210 Query: 644 MLEREVEPTVVTYNCQIGFWSKKGKFSEAKSLFNDMICKGKKPNAISYALLMEGLCSQEK 823 MLER+VEP+VVTYN IG+ + G+ +AK LF DMI KGK+PNA++YALLMEGLCS + Sbjct: 211 MLERKVEPSVVTYNSLIGYLCRNGELGKAKGLFKDMIKKGKRPNAVTYALLMEGLCSMGE 270 Query: 824 FKEAKKMMFDMEYQGCKPRLVNYGILMNDLARRGNFDEAKALLLEMK 964 +KEAKKM+FDMEY+GCKP+ +N+G+LMNDL ++G +EAK LLLEMK Sbjct: 271 YKEAKKMLFDMEYRGCKPKNLNFGVLMNDLGKKGKIEEAKLLLLEMK 317 Score = 89.4 bits (220), Expect = 1e-15 Identities = 62/235 (26%), Positives = 106/235 (45%) Frame = +2 Query: 257 SYSCLIYKLARKRNFKAVEILLQQLQHYNVRCKEALFIGLIEHYGKSGLPDKAIELFRRM 436 S++ L+ L + L + R F LI+ + K G +A ++F M Sbjct: 152 SFNALLNVLVDNDRLFDAKQLFDRSSEMGFRLNSVPFNILIKGWLKKGEWYQAGKVFDEM 211 Query: 437 PSFECYRSXXXXXXXXXXXXXXGRFDDAEEMFKCCSKTGFRPNAVSFNIMIKWWLQRGEW 616 + S G A+ +FK K G RPNAV++ ++++ GE+ Sbjct: 212 LERKVEPSVVTYNSLIGYLCRNGELGKAKGLFKDMIKKGKRPNAVTYALLMEGLCSMGEY 271 Query: 617 DEARKVFAEMLEREVEPTVVTYNCQIGFWSKKGKFSEAKSLFNDMICKGKKPNAISYALL 796 EA+K+ +M R +P + + + KKGK EAK L +M + +P+ + Y +L Sbjct: 272 KEAKKMLFDMEYRGCKPKNLNFGVLMNDLGKKGKIEEAKLLLLEMKKRRFRPDVVIYNIL 331 Query: 797 MEGLCSQEKFKEAKKMMFDMEYQGCKPRLVNYGILMNDLARRGNFDEAKALLLEM 961 + LC + K EA K +F+M+ GC+ Y +L + + G F+E +L M Sbjct: 332 INHLCKEGKVAEAYKTLFEMQIGGCEANAATYRMLADGFCQVGEFEEGLKVLNAM 386 Score = 77.8 bits (190), Expect = 4e-12 Identities = 58/243 (23%), Positives = 108/243 (44%) Frame = +2 Query: 236 GFKHDYPSYSCLIYKLARKRNFKAVEILLQQLQHYNVRCKEALFIGLIEHYGKSGLPDKA 415 GF+ + ++ LI +K + + ++ V + LI + ++G KA Sbjct: 180 GFRLNSVPFNILIKGWLKKGEWYQAGKVFDEMLERKVEPSVVTYNSLIGYLCRNGELGKA 239 Query: 416 IELFRRMPSFECYRSXXXXXXXXXXXXXXGRFDDAEEMFKCCSKTGFRPNAVSFNIMIKW 595 LF+ M + G + +A++M G +P ++F +++ Sbjct: 240 KGLFKDMIKKGKRPNAVTYALLMEGLCSMGEYKEAKKMLFDMEYRGCKPKNLNFGVLMND 299 Query: 596 WLQRGEWDEARKVFAEMLEREVEPTVVTYNCQIGFWSKKGKFSEAKSLFNDMICKGKKPN 775 ++G+ +EA+ + EM +R P VV YN I K+GK +EA +M G + N Sbjct: 300 LGKKGKIEEAKLLLLEMKKRRFRPDVVIYNILINHLCKEGKVAEAYKTLFEMQIGGCEAN 359 Query: 776 AISYALLMEGLCSQEKFKEAKKMMFDMEYQGCKPRLVNYGILMNDLARRGNFDEAKALLL 955 A +Y +L +G C +F+E K++ M PR+ + + L + G+ D A +L Sbjct: 360 AATYRMLADGFCQVGEFEEGLKVLNAMLVSRHAPRIETFNCFVVGLMKSGSIDGAFFVLE 419 Query: 956 EMK 964 EM+ Sbjct: 420 EME 422 >ref|NP_172253.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75180186|sp|Q9LQQ1.1|PPR20_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g07740, mitochondrial; Flags: Precursor gi|8439893|gb|AAF75079.1|AC007583_15 It contains PPR repeats PF|01535 [Arabidopsis thaliana] gi|14596021|gb|AAK68738.1| Unknown protein [Arabidopsis thaliana] gi|31376389|gb|AAP49521.1| At1g07730 [Arabidopsis thaliana] gi|51970836|dbj|BAD44110.1| hypothetical protein [Arabidopsis thaliana] gi|332190050|gb|AEE28171.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 459 Score = 311 bits (796), Expect = 2e-82 Identities = 162/314 (51%), Positives = 213/314 (67%), Gaps = 13/314 (4%) Frame = +2 Query: 62 SVLRKAYCNLTFLRRYHKQRPHKPTKAL----------RKP---IPFVTDLKDIQNSDEA 202 SVL C + R YH RP KPTK RKP +PF+TDLK+I++ +EA Sbjct: 7 SVLINNQC-IASQRHYHTSRPEKPTKKASSHEPTHKFTRKPWEEVPFLTDLKEIEDPEEA 65 Query: 203 LTLFHDYCQTGGFKHDYPSYSCLIYKLARKRNFKAVEILLQQLQHYNVRCKEALFIGLIE 382 L+LFH Y Q GF+HDYPSYS LIYKLA+ RNF AV+ +L+ +++ NVRC+E+LF+GLI+ Sbjct: 66 LSLFHQY-QEMGFRHDYPSYSSLIYKLAKSRNFDAVDQILRLVRYRNVRCRESLFMGLIQ 124 Query: 383 HYGKSGLPDKAIELFRRMPSFECYRSXXXXXXXXXXXXXXGRFDDAEEMFKCCSKTGFRP 562 HYGK+G DKAI++F ++ SF+C R+ G + A+ F RP Sbjct: 125 HYGKAGSVDKAIDVFHKITSFDCVRTIQSLNTLINVLVDNGELEKAKSFFDGAKDMRLRP 184 Query: 563 NAVSFNIMIKWWLQRGEWDEARKVFAEMLEREVEPTVVTYNCQIGFWSKKGKFSEAKSLF 742 N+VSFNI+IK +L + +W+ A KVF EMLE EV+P+VVTYN IGF + +AKSL Sbjct: 185 NSVSFNILIKGFLDKCDWEAACKVFDEMLEMEVQPSVVTYNSLIGFLCRNDDMGKAKSLL 244 Query: 743 NDMICKGKKPNAISYALLMEGLCSQEKFKEAKKMMFDMEYQGCKPRLVNYGILMNDLARR 922 DMI K +PNA+++ LLM+GLC + ++ EAKK+MFDMEY+GCKP LVNYGILM+DL +R Sbjct: 245 EDMIKKRIRPNAVTFGLLMKGLCCKGEYNEAKKLMFDMEYRGCKPGLVNYGILMSDLGKR 304 Query: 923 GNFDEAKALLLEMK 964 G DEAK LL EMK Sbjct: 305 GRIDEAKLLLGEMK 318 Score = 89.0 bits (219), Expect = 2e-15 Identities = 45/148 (30%), Positives = 84/148 (56%) Frame = +2 Query: 518 AEEMFKCCSKTGFRPNAVSFNIMIKWWLQRGEWDEARKVFAEMLEREVEPTVVTYNCQIG 697 A+ + + K RPNAV+F +++K +GE++EA+K+ +M R +P +V Y + Sbjct: 240 AKSLLEDMIKKRIRPNAVTFGLLMKGLCCKGEYNEAKKLMFDMEYRGCKPGLVNYGILMS 299 Query: 698 FWSKKGKFSEAKSLFNDMICKGKKPNAISYALLMEGLCSQEKFKEAKKMMFDMEYQGCKP 877 K+G+ EAK L +M + KP+ + Y +L+ LC++ + EA +++ +M+ +GCKP Sbjct: 300 DLGKRGRIDEAKLLLGEMKKRRIKPDVVIYNILVNHLCTECRVPEAYRVLTEMQMKGCKP 359 Query: 878 RLVNYGILMNDLARRGNFDEAKALLLEM 961 Y ++++ R +FD +L M Sbjct: 360 NAATYRMMIDGFCRIEDFDSGLNVLNAM 387 Score = 72.8 bits (177), Expect = 1e-10 Identities = 40/146 (27%), Positives = 73/146 (50%) Frame = +2 Query: 503 GRFDDAEEMFKCCSKTGFRPNAVSFNIMIKWWLQRGEWDEARKVFAEMLEREVEPTVVTY 682 G +++A+++ G +P V++ I++ +RG DEA+ + EM +R ++P VV Y Sbjct: 270 GEYNEAKKLMFDMEYRGCKPGLVNYGILMSDLGKRGRIDEAKLLLGEMKKRRIKPDVVIY 329 Query: 683 NCQIGFWSKKGKFSEAKSLFNDMICKGKKPNAISYALLMEGLCSQEKFKEAKKMMFDMEY 862 N + + + EA + +M KG KPNA +Y ++++G C E F ++ M Sbjct: 330 NILVNHLCTECRVPEAYRVLTEMQMKGCKPNAATYRMMIDGFCRIEDFDSGLNVLNAMLA 389 Query: 863 QGCKPRLVNYGILMNDLARRGNFDEA 940 P + ++ L + GN D A Sbjct: 390 SRHCPTPATFVCMVAGLIKGGNLDHA 415