BLASTX nr result
ID: Atractylodes21_contig00042039
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00042039
         (507 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002516159.1| pentatricopeptide repeat-containing protein,...   179   2e-43
ref|XP_002281998.2| PREDICTED: pentatricopeptide repeat-containi...   176   1e-42
emb|CBI20738.3| unnamed protein product [Vitis vinifera]              176   1e-42
ref|NP_195043.1| pentatricopeptide repeat-containing protein [Ar...   169   3e-40
ref|XP_002867196.1| pentatricopeptide repeat-containing protein ...   167   1e-39

>ref|XP_002516159.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223544645|gb|EEF46161.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 1439

 Score = 179 bits (455), Expect = 2e-43
 Identities = 95/159 (59%), Positives = 116/159 (72%)
 Frame = +2

Query: 29   TADRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSHLGNVEEGX 208
            T+DRFL NNL++MYSKCGS+ SARQLFD  P RDLVTWN++L+AYA    S   +V EG
Sbjct: 736  TSDRFLANNLITMYSKCGSVSSARQLFDRTPDRDLVTWNAVLSAYARSDESEYDHVVEGF 795

Query: 209  XXXXXXXXXXXXXXTKLTLAPVLKLCSMSSYVWASEAVHGYAAKLGLESEMLISGALVNT 388
                          +KLTLAP+LKLC +S YV AS+AVHGYA K+GLE ++ +SGALVN
Sbjct: 796  HIFRLLRERFVST-SKLTLAPMLKLCLLSGYVCASQAVHGYAVKIGLELDVFVSGALVNI 854

Query: 389  YIKFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMGV 505
            Y KFG  REAR +FD M E  RDVVLWN+ML+AYV+MG+
Sbjct: 855  YSKFGLVREARGLFDIMQE--RDVVLWNVMLKAYVEMGL 891

 Score = 75.9 bits (185), Expect = 3e-12
 Identities = 48/168 (28%), Positives = 81/168 (48%), Gaps = 6/168 (3%)
 Frame = +2

Query: 17   HGYTTADRF-----LTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGS 181
            HG T    F     + N+L++MYSK G +  A  +F  M   DL++WNS+++ YA
Sbjct: 1011 HGMTLKSGFDSVVSVANSLINMYSKMGFVSLAHTVFTGMNELDLISWNSMISCYAQ---- 1066

Query: 182  HLGNVEEGXXXXXXXXXXXXXXXTKLTLAPVLKLC-SMSSYVWASEAVHGYAAKLGLESE 358
                +++                   TLA VLK C S++  ++ S+ +H Y  K  + +E
Sbjct: 1067 --NGLQKESVNLLVGLLRDGLQPDHFTLASVLKACSSLTEGLFLSKQIHVYVTKTSIIAE 1124

Query: 359  MLISGALVNTYIKFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMG 502
              +S AL++ Y + G   EA  +F+   ++  D+  WN M+  Y+  G
Sbjct: 1125 NFVSTALIDVYSRSGLMAEAEFIFENKNKF--DLAAWNAMMFGYIICG 1170

 Score = 65.5 bits (158), Expect = 4e-09
 Identities = 43/167 (25%), Positives = 75/167 (44%), Gaps = 5/167 (2%)
 Frame = +2

Query: 17   HGYTT-----ADRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGS 181
            H Y T     A+ F++  L+ +YS+ G +  A  +F+     DL  WN+++  Y  C
Sbjct: 1113 HVYVTKTSIIAENFVSTALIDVYSRSGLMAEAEFIFENKNKFDLAAWNAMMFGYIIC--- 1169

Query: 182  HLGNVEEGXXXXXXXXXXXXXXXTKLTLAPVLKLCSMSSYVWASEAVHGYAAKLGLESEM 361
              G+ ++G                + TLA   K C     +   + +H  A K GL S++
Sbjct: 1170 --GDHDKGLKLFAFMHEKGESCD-EYTLATAAKACGSLVRLEQGKQIHALAIKFGLNSDL 1226

Query: 362  LISGALVNTYIKFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMG 502
             +S  +++ YIK G   +  ++FD +     D V W +M+   V+ G
Sbjct: 1227 FLSSGILDMYIKCGNMEDGHLLFDNIPV--PDDVAWTIMISGCVENG 1271

>ref|XP_002281998.2| PREDICTED: pentatricopeptide repeat-containing protein At4g33170 [Vitis vinifera]
          Length = 1580

 Score = 176 bits (447), Expect = 1e-42
 Identities = 90/160 (56%), Positives = 115/160 (71%)
 Frame = +2

Query: 26   TTADRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSHLGNVEEG 205
            +  D FL+NNL++MYSKCGSL SARQ+FD  P RDLVTWN+IL AYA+   S+ GN +EG
Sbjct: 652  SAGDHFLSNNLLTMYSKCGSLSSARQVFDTTPERDLVTWNAILGAYAASVDSNDGNAQEG 711

Query: 206  XXXXXXXXXXXXXXXTKLTLAPVLKLCSMSSYVWASEAVHGYAAKLGLESEMLISGALVN 385
                           T++TLAPVLKLC  S  +WA+E VHGYA K+GLE ++ +SGALVN
Sbjct: 712  LHLFRLLRASLGST-TRMTLAPVLKLCLNSGCLWAAEGVHGYAIKIGLEWDVFVSGALVN 770

Query: 386  TYIKFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMGV 505
             Y K G+ R+AR++FD M E  RDVVLWNMML+ YV++G+
Sbjct: 771  IYSKCGRMRDARLLFDWMRE--RDVVLWNMMLKGYVQLGL 808

 Score = 79.3 bits (194), Expect = 3e-13
 Identities = 50/164 (30%), Positives = 84/164 (51%), Gaps = 1/164 (0%)
 Frame = +2

Query: 5    VVIKHGYTTADRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSH 184
            + +K G  + D  + N+LV+MYSK G  + AR++F+ M H DL++WNS++++ A
Sbjct: 929  IAVKSGLDS-DVSVANSLVNMYSKMGCAYFAREVFNDMKHLDLISWNSMISSCAQ----- 982

Query: 185  LGNVEEGXXXXXXXXXXXXXXXTKLTLAPVLKLC-SMSSYVWASEAVHGYAAKLGLESEM 361
              ++EE                   TLA VL+ C S+   +  S  +H +A K G  ++
Sbjct: 983  -SSLEEESVNLFIDLLHEGLKPDHFTLASVLRACSSLIDGLNISRQIHVHALKTGNIADS 1041

Query: 362  LISGALVNTYIKFGKGREARMMFDEMAEYDRDVVLWNMMLRAYV 493
             ++  L++ Y K GK  EA  +F    + D D+  WN M+  Y+
Sbjct: 1042 FVATTLIDVYSKSGKMEEAEFLFQN--KDDLDLACWNAMMFGYI 1083

 Score = 62.8 bits (151), Expect = 3e-08
 Identities = 40/157 (25%), Positives = 69/157 (43%)
 Frame = +2

Query: 32   ADRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSHLGNVEEGXX 211
            AD F+   L+ +YSK G +  A  LF      DL  WN+++  Y       +GN  +
Sbjct: 1039 ADSFVATTLIDVYSKSGKMEEAEFLFQNKDDLDLACWNAMMFGYI------IGNDGKKAL 1092

Query: 212  XXXXXXXXXXXXXTKLTLAPVLKLCSMSSYVWASEAVHGYAAKLGLESEMLISGALVNTY 391
                          ++TLA   K C     +   + +H +A K G +S++ ++  +++ Y
Sbjct: 1093 ELFSLIHKSGEKSDQITLATAAKACGCLVLLDQGKQIHAHAIKAGFDSDLHVNSGILDMY 1152

Query: 392  IKFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMG 502
            IK G    A ++F+ ++    D V W  M+   V  G
Sbjct: 1153 IKCGDMVNAGIVFNYISA--PDDVAWTSMISGCVDNG 1187

>emb|CBI20738.3| unnamed protein product [Vitis vinifera]
          Length = 865

 Score = 176 bits (447), Expect = 1e-42
 Identities = 90/160 (56%), Positives = 115/160 (71%)
 Frame = +2

Query: 26   TTADRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSHLGNVEEG 205
            +  D FL+NNL++MYSKCGSL SARQ+FD  P RDLVTWN+IL AYA+   S+ GN +EG
Sbjct: 108  SAGDHFLSNNLLTMYSKCGSLSSARQVFDTTPERDLVTWNAILGAYAASVDSNDGNAQEG 167

Query: 206  XXXXXXXXXXXXXXXTKLTLAPVLKLCSMSSYVWASEAVHGYAAKLGLESEMLISGALVN 385
                           T++TLAPVLKLC  S  +WA+E VHGYA K+GLE ++ +SGALVN
Sbjct: 168  LHLFRLLRASLGST-TRMTLAPVLKLCLNSGCLWAAEGVHGYAIKIGLEWDVFVSGALVN 226

Query: 386  TYIKFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMGV 505
             Y K G+ R+AR++FD M E  RDVVLWNMML+ YV++G+
Sbjct: 227  IYSKCGRMRDARLLFDWMRE--RDVVLWNMMLKGYVQLGL 264

 Score = 77.0 bits (188), Expect = 1e-12
 Identities = 52/167 (31%), Positives = 79/167 (47%), Gaps = 12/167 (7%)
 Frame = +2

Query: 17   HGYTTA-----DRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGS 181
            HGY        D F++  LV++YSKCG +  AR LFD M  RD+V WN +L  Y
Sbjct: 206  HGYAIKIGLEWDVFVSGALVNIYSKCGRMRDARLLFDWMRERDVVLWNMMLKGYVQL--- 262

Query: 182  HLGNVEEGXXXXXXXXXXXXXXXTKLTLAPVLKLCSMSSYVWAS-------EAVHGYAAK 340
                +E+                 + ++  +L  C     +WA        + VHG A K
Sbjct: 263  ---GLEKEAFQLFSEFHRSGLRPDEFSVQLILNGC-----LWAGTDDLELGKQVHGIAVK 314

Query: 341  LGLESEMLISGALVNTYIKFGKGREARMMFDEMAEYDRDVVLWNMML 481
             GL+S++ ++ +LVN Y K G    AR +F++M     D++ WN M+
Sbjct: 315  SGLDSDVSVANSLVNMYSKMGCAYFAREVFNDMKHL--DLISWNSMI 359

 Score = 71.6 bits (174), Expect = 6e-11
 Identities = 48/168 (28%), Positives = 82/168 (48%), Gaps = 2/168 (1%)
 Frame = +2

Query: 5    VVIKHGYTTADRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSH 184
            + +K G  + D  + N+LV+MYSK G  + AR++F+ M H DL++WNS++   +SC  S
Sbjct: 311  IAVKSGLDS-DVSVANSLVNMYSKMGCAYFAREVFNDMKHLDLISWNSMI---SSCAQSS 366

Query: 185  LGNVEEGXXXXXXXXXXXXXXXT--KLTLAPVLKLCSMSSYVWASEAVHGYAAKLGLESE 358
            L                         +TLA   K C     +   + +H +A K G +S+
Sbjct: 367  LEEESVNLFIDLLHEGLKPDHFTLASITLATAAKACGCLVLLDQGKQIHAHAIKAGFDSD 426

Query: 359  MLISGALVNTYIKFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMG 502
            + ++  +++ YIK G    A ++F+ ++    D V W  M+   V  G
Sbjct: 427  LHVNSGILDMYIKCGDMVNAGIVFNYISA--PDDVAWTSMISGCVDNG 472

>ref|NP_195043.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75206840|sp|Q9SMZ2.1|PP347_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g33170
 gi|4455331|emb|CAB36791.1| putative protein [Arabidopsis thaliana]
 gi|7270265|emb|CAB80034.1| putative protein [Arabidopsis thaliana]
 gi|332660786|gb|AEE86186.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 990

 Score = 169 bits (427), Expect = 3e-40
 Identities = 83/156 (53%), Positives = 113/156 (72%)
 Frame = +2

Query: 35   DRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSHLGNVEEGXXX 214
            +RFL NNL+SMYSKCGSL  AR++FD MP RDLV+WNSILAAYA      + N+++
Sbjct: 73   ERFLINNLISMYSKCGSLTYARRVFDKMPDRDLVSWNSILAAYAQSSECVVENIQQAFLL 132

Query: 215  XXXXXXXXXXXXTKLTLAPVLKLCSMSSYVWASEAVHGYAAKLGLESEMLISGALVNTYI 394
                        +++TL+P+LKLC  S YVWASE+ HGYA K+GL+ +  ++GALVN Y+
Sbjct: 133  FRILRQDVVYT-SRMTLSPMLKLCLHSGYVWASESFHGYACKIGLDGDEFVAGALVNIYL 191

Query: 395  KFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMG 502
            KFGK +E +++F+EM    RDVVLWN+ML+AY++MG
Sbjct: 192  KFGKVKEGKVLFEEMPY--RDVVLWNLMLKAYLEMG 225

 Score = 66.2 bits (160), Expect = 3e-09
 Identities = 43/152 (28%), Positives = 75/152 (49%), Gaps = 1/152 (0%)
 Frame = +2

Query: 44   LTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSHLGNVEEGXXXXXX 223
            ++N+L++MY K      AR +FD M  RDL++WNS++A  A         +E
Sbjct: 352  VSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQ------NGLEVEAVCLFM 405

Query: 224  XXXXXXXXXTKLTLAPVLKLC-SMSSYVWASEAVHGYAAKLGLESEMLISGALVNTYIKF 400
                      + T+  VLK   S+   +  S+ VH +A K+   S+  +S AL++ Y +
Sbjct: 406  QLLRCGLKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSRN 465

Query: 401  GKGREARMMFDEMAEYDRDVVLWNMMLRAYVK 496
               +EA ++F+    ++ D+V WN M+  Y +
Sbjct: 466  RCMKEAEILFE---RHNFDLVAWNAMMAGYTQ 494

 Score = 62.0 bits (149), Expect = 5e-08
 Identities = 41/157 (26%), Positives = 67/157 (42%)
 Frame = +2

Query: 32   ADRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSHLGNVEEGXX 211
            +D F++  L+  YS+   +  A  LF+   + DLV WN+++A Y   H  H
Sbjct: 450  SDSFVSTALIDAYSRNRCMKEAEILFE-RHNFDLVAWNAMMAGYTQSHDGHK------TL 502

Query: 212  XXXXXXXXXXXXXTKLTLAPVLKLCSMSSYVWASEAVHGYAAKLGLESEMLISGALVNTY 391
                            TLA V K C     +   + VH YA K G + ++ +S  +++ Y
Sbjct: 503  KLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGILDMY 562

Query: 392  IKFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMG 502
            +K G    A+  FD +     D V W  M+   ++ G
Sbjct: 563  VKCGDMSAAQFAFDSIPV--PDDVAWTTMISGCIENG 597

>ref|XP_002867196.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313032|gb|EFH43455.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 997

 Score = 167 bits (422), Expect = 1e-39
 Identities = 85/156 (54%), Positives = 112/156 (71%)
 Frame = +2

Query: 35   DRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSHLGNVEEGXXX 214
            +RFL NNL+SMYSKCGSL  AR++FD MP RDLV+WNSILAAYA      + NV+E
Sbjct: 80   ERFLVNNLISMYSKCGSLTYARRVFDKMPERDLVSWNSILAAYAQSSEGVVENVKEAFLL 139

Query: 215  XXXXXXXXXXXXTKLTLAPVLKLCSMSSYVWASEAVHGYAAKLGLESEMLISGALVNTYI 394
                        +++TL+P+LKLC  S YV ASE+ HGYA K+GL+ +  ++GALVN Y+
Sbjct: 140  FRILRQDVVYT-SRMTLSPMLKLCLHSGYVCASESFHGYACKIGLDGDDFVAGALVNIYL 198

Query: 395  KFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMG 502
            KFGK +E R++F+EM    RDVVLWN+ML+AY++MG
Sbjct: 199  KFGKVKEGRVLFEEMPY--RDVVLWNLMLKAYLEMG 232

 Score = 61.6 bits (148), Expect = 6e-08
 Identities = 41/157 (26%), Positives = 67/157 (42%)
 Frame = +2

Query: 32   ADRFLTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSHLGNVEEGXX 211
            AD F++  L+  YS+   +  A  LF    + DLV WN++++ Y   H  H
Sbjct: 457  ADSFVSTALIDAYSRNRCMKEAEVLFG-RNNFDLVAWNAMMSGYTQSHDGHK------TL 509

Query: 212  XXXXXXXXXXXXXTKLTLAPVLKLCSMSSYVWASEAVHGYAAKLGLESEMLISGALVNTY 391
                            TLA VLK C     +   + VH YA K G + ++ +S  +++ Y
Sbjct: 510  ELFALMHKQGERSDDFTLATVLKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGILDMY 569

Query: 392  IKFGKGREARMMFDEMAEYDRDVVLWNMMLRAYVKMG 502
            +K G    A+  FD +     D V W  ++   ++ G
Sbjct: 570  VKCGDMSAAQFAFDSIPV--PDDVAWTTLISGCIENG 604

 Score = 59.7 bits (143), Expect = 2e-07
 Identities = 40/152 (26%), Positives = 73/152 (48%), Gaps = 1/152 (0%)
 Frame = +2

Query: 44   LTNNLVSMYSKCGSLHSARQLFDVMPHRDLVTWNSILAAYASCHGSHLGNVEEGXXXXXX 223
            ++N+L++MY K   +  AR +F+ M  RDL++WNS++A  A        ++E
Sbjct: 359  VSNSLINMYCKLRKIGLARTVFNNMSERDLISWNSVIAGIAQ------SDLEVEAVCLFM 412

Query: 224  XXXXXXXXXTKLTLAPVLKLC-SMSSYVWASEAVHGYAAKLGLESEMLISGALVNTYIKF 400
                        T+  VLK   S+   +  S+ +H +A K    ++  +S AL++ Y +
Sbjct: 413  QLLRCGLKPDHYTMTSVLKAASSLPEGLSLSKQIHVHAIKTNNVADSFVSTALIDAYSRN 472

Query: 401  GKGREARMMFDEMAEYDRDVVLWNMMLRAYVK 496
               +EA ++F      + D+V WN M+  Y +
Sbjct: 473  RCMKEAEVLF---GRNNFDLVAWNAMMSGYTQ 501