BLASTX nr result
ID: Bupleurum21_contig00036791
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Bupleurum21_contig00036791 (374 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002274884.1| PREDICTED: pentatricopeptide repeat-containi... 175 4e-42 ref|XP_002875201.1| predicted protein [Arabidopsis lyrata subsp.... 167 8e-40 ref|XP_002512123.1| pentatricopeptide repeat-containing protein,... 166 1e-39 ref|XP_003529461.1| PREDICTED: pentatricopeptide repeat-containi... 166 2e-39 ref|NP_178437.1| pentatricopeptide repeat-containing protein [Ar... 162 3e-38 >ref|XP_002274884.1| PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Vitis vinifera] gi|147818711|emb|CAN65040.1| hypothetical protein VITISV_009460 [Vitis vinifera] Length = 700 Score = 175 bits (443), Expect = 4e-42 Identities = 85/124 (68%), Positives = 100/124 (80%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 LS CASV A G ++H ++IK GLLS +VY+GTALL+FYAKCGDAESARVIFD M EK Sbjct: 446 LSACASVGAYRVGSSLHGYAIKAGLLS-GSVYVGTALLNFYAKCGDAESARVIFDEMGEK 504 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGRRYF 15 N TW+A+IGGYG+QGD S SL LF DML EK EPN++ FTTILSACSH+GM+ EG RYF Sbjct: 505 NTITWSAMIGGYGIQGDCSRSLELFGDMLKEKLEPNEVIFTTILSACSHSGMLGEGWRYF 564 Query: 14 SLMC 3 + MC Sbjct: 565 NTMC 568 Score = 78.2 bits (191), Expect = 7e-13 Identities = 41/116 (35%), Positives = 62/116 (53%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 LS CA +L+ G +VH IK+G + AL+ YAKC AR +F+ + +K Sbjct: 346 LSACAQTGSLNMGRSVHCLGIKLG---SEDATFENALVDMYAKCHMIGDARYVFETVFDK 402 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEG 27 + WN++I GY G A +L LF+ M ++ P+ IT ++LSAC+ G G Sbjct: 403 DVIAWNSIISGYTQNGYAYEALELFDQMRSDSVYPDAITLVSVLSACASVGAYRVG 458 Score = 77.4 bits (189), Expect = 1e-12 Identities = 42/117 (35%), Positives = 62/117 (52%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 ++ C + ALH G VH + IK G N ++ T LL Y KCGD A +FD + Sbjct: 245 VTACTKLGALHQGKWVHGYVIKSGFDL--NSFLVTPLLDLYFKCGDIRDAFSVFDELSTI 302 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGR 24 + +W A+I GY +G +L LF D + PN +T +++LSAC+ G + GR Sbjct: 303 DLVSWTAMIVGYAQRGYPREALKLFTDERWKDLLPNTVTTSSVLSACAQTGSLNMGR 359 Score = 65.9 bits (159), Expect = 3e-09 Identities = 34/117 (29%), Positives = 58/117 (49%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 L C+ + G +H +KVG + ++ T L+ YAKC + E +R +FD + ++ Sbjct: 145 LKACSELRETDEGRKLHCQIVKVG---SPDSFVLTGLVDMYAKCREVEDSRRVFDEILDR 201 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGR 24 N W ++I GY L LFN M E N T ++++AC+ G + +G+ Sbjct: 202 NVVCWTSMIVGYVQNDCLKEGLVLFNRMREGLVEGNQYTLGSLVTACTKLGALHQGK 258 Score = 57.0 bits (136), Expect = 2e-06 Identities = 34/119 (28%), Positives = 62/119 (52%), Gaps = 1/119 (0%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 L +C +V++L +HA + GL ++ T L+S Y G E AR++FDR+ Sbjct: 46 LGICKTVSSLR---KIHALLVVHGL--SEDLLCETKLVSLYGSFGHVECARLMFDRIRNP 100 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNE-KSEPNDITFTTILSACSHAGMIEEGRR 21 + ++W +I Y + S + +N L + +E +++ F+ +L ACS +EGR+ Sbjct: 101 DLYSWKVMIRWYFLNDSYSEIVQFYNTRLRKCLNEYDNVVFSIVLKACSELRETDEGRK 159 >ref|XP_002875201.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297814638|ref|XP_002875202.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297321039|gb|EFH51460.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297321040|gb|EFH51461.1| predicted protein [Arabidopsis lyrata subsp. lyrata] Length = 708 Score = 167 bits (423), Expect = 8e-40 Identities = 74/122 (60%), Positives = 100/122 (81%) Frame = -1 Query: 371 SVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEKN 192 S CAS+ +L G ++HA+S+K+G L+ ++V++GTALL FYAKCGDAESAR+IFD +EEKN Sbjct: 463 SACASLGSLAIGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDAESARLIFDTIEEKN 522 Query: 191 QFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGRRYFS 12 TW+A+IGGYG QGD GSL LF +ML ++ +PN+ TFT++LSACSH GM+ EG++YFS Sbjct: 523 TITWSAMIGGYGKQGDTKGSLELFEEMLKKQQKPNESTFTSVLSACSHTGMVNEGKKYFS 582 Query: 11 LM 6 M Sbjct: 583 SM 584 Score = 79.3 bits (194), Expect = 3e-13 Identities = 41/116 (35%), Positives = 60/116 (51%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 LS C V L G ++H SIKVG+ N + AL+ YAKC A+ +F+ EK Sbjct: 362 LSGCGLVGNLELGRSIHGLSIKVGIWDTN---VANALVHMYAKCYQNRDAKYVFEMESEK 418 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEG 27 + WN++I G+ G +L LF+ M E PN +T ++ SAC+ G + G Sbjct: 419 DIVAWNSIISGFSQNGSIHEALFLFHRMNTESVMPNGVTVASLFSACASLGSLAIG 474 Score = 73.9 bits (180), Expect = 1e-11 Identities = 44/121 (36%), Positives = 60/121 (49%), Gaps = 2/121 (1%) Frame = -1 Query: 365 CASVAALHFGYAVHAHSIKVG--LLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEKN 192 C + ALH G H IK G L SC + T+LL Y KCGD +AR +F+ + Sbjct: 264 CTKLRALHQGKWFHGCLIKSGIELSSC----LVTSLLDMYVKCGDISNARRVFNEHSHVD 319 Query: 191 QFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGRRYFS 12 W A+I GY G + +L+LF M +PN +T ++LS C G +E GR Sbjct: 320 LVMWTAMIVGYTHNGSVNEALSLFQKMSGVGIKPNCVTIASVLSGCGLVGNLELGRSIHG 379 Query: 11 L 9 L Sbjct: 380 L 380 Score = 63.9 bits (154), Expect = 1e-08 Identities = 37/117 (31%), Positives = 58/117 (49%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 L C V L G +H +KV S +NV + T LL YAKCG+ +S+ +F+ + + Sbjct: 161 LKACTEVQDLDNGKKIHCQIVKVP--SFDNVVL-TGLLDMYAKCGEIKSSYKVFEDITLR 217 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGR 24 N W ++I GY L LFN M N+ T+ T++ AC+ + +G+ Sbjct: 218 NVVCWTSMIAGYVKNDLYEEGLVLFNRMRENSVLGNEYTYGTLVMACTKLRALHQGK 274 >ref|XP_002512123.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223549303|gb|EEF50792.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 456 Score = 166 bits (421), Expect = 1e-39 Identities = 79/123 (64%), Positives = 99/123 (80%) Frame = -1 Query: 371 SVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEKN 192 S CAS+ AL G ++HA+S+K GLLS +NVY+ TALL+FYAKCGDA SAR IFD M+EKN Sbjct: 313 SACASLGALQVGSSLHAYSVKEGLLS-SNVYVSTALLTFYAKCGDAGSARTIFDGMQEKN 371 Query: 191 QFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGRRYFS 12 TW+A+IGGYG+QGDA GSL++FNDML ++ +PN++ FTTILSACSH GM+ EG F Sbjct: 372 TVTWSAMIGGYGVQGDAGGSLSIFNDMLRQELKPNEVIFTTILSACSHTGMVGEGWNLFI 431 Query: 11 LMC 3 MC Sbjct: 432 SMC 434 Score = 77.8 bits (190), Expect = 9e-13 Identities = 43/122 (35%), Positives = 62/122 (50%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 ++ C + ALH G H ++IK G+ N Y+ TALL Y KCG AR +FD + Sbjct: 111 VTACTKLGALHQGKCFHGYAIKSGVQL--NSYLMTALLDMYVKCGVIRDARSVFDELSSI 168 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGRRYF 15 + +W A+I GY + +L LF D PND+T + L+AC+ G + GR Sbjct: 169 DLVSWTAMIVGYTQSNLSYDALKLFLDKKWAGILPNDVTIVSALAACARMGNLNLGRSIH 228 Query: 14 SL 9 L Sbjct: 229 GL 230 Score = 70.5 bits (171), Expect = 1e-10 Identities = 36/116 (31%), Positives = 59/116 (50%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 L+ CA + L+ G ++H +IK+G + AL+ YAKC A +F+ EK Sbjct: 212 LAACARMGNLNLGRSIHGLAIKLGFAEPT---LMNALVHMYAKCHMNRDASYLFETASEK 268 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEG 27 + +WN++I G G +L LF M E P+ +T ++ SAC+ G ++ G Sbjct: 269 DVVSWNSIISGCSQMGSPYEALDLFQRMRKESVSPDAVTLVSVFSACASLGALQVG 324 Score = 64.7 bits (156), Expect = 8e-09 Identities = 35/117 (29%), Positives = 57/117 (48%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 + C+ + + G +H IK G + ++ T L FYAKCG+ E +R FD ++ Sbjct: 11 IRACSELRDIDEGRKLHCQIIKAGP---PDSFVLTGLTDFYAKCGEIECSRCAFDENLDR 67 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGR 24 N +W ++I GY L LFN M E N T +++AC+ G + +G+ Sbjct: 68 NVVSWTSMIVGYVQNDCPVEGLILFNRMREGLIEGNQFTLGILVTACTKLGALHQGK 124 >ref|XP_003529461.1| PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial-like [Glycine max] Length = 699 Score = 166 bits (420), Expect = 2e-39 Identities = 77/124 (62%), Positives = 98/124 (79%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 LS CAS+ LH G +VH ++K GL+ +++Y+GTALL+FYAKCGDA +AR++FD M EK Sbjct: 446 LSACASLGMLHLGCSVHGLALKDGLV-VSSIYVGTALLNFYAKCGDARAARMVFDSMGEK 504 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGRRYF 15 N TW A+IGGYGMQGD +GSL LF DML E EPN++ FTTIL+ACSH+GM+ EG R F Sbjct: 505 NAVTWGAMIGGYGMQGDGNGSLTLFRDMLEELVEPNEVVFTTILAACSHSGMVGEGSRLF 564 Query: 14 SLMC 3 +LMC Sbjct: 565 NLMC 568 Score = 77.4 bits (189), Expect = 1e-12 Identities = 43/116 (37%), Positives = 64/116 (55%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 LS CA + G +H ++K GL ++ + AL+ YAKCG AR +F+ M EK Sbjct: 346 LSSCAQLGNSVMGKLLHGLAVKCGL---DDHPVRNALVDMYAKCGVVSDARCVFEAMLEK 402 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEG 27 + +WN++I G+ G+A +L LF M E P+ +T ILSAC+ GM+ G Sbjct: 403 DVVSWNSIISGFVQSGEAYEALNLFRRMGLELFSPDAVTVVGILSACASLGMLHLG 458 Score = 71.6 bits (174), Expect = 6e-11 Identities = 40/115 (34%), Positives = 62/115 (53%), Gaps = 4/115 (3%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRME-- 201 +S C + LH G VH IK G+ C N Y+ T+LL+ Y KCG+ + A +FD Sbjct: 241 VSACTKLNWLHQGKWVHGFVIKNGI--CVNSYLTTSLLNMYVKCGNIQDACKVFDESSSS 298 Query: 200 --EKNQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAG 42 +++ +W A+I GY +G +L LF D PN +T +++LS+C+ G Sbjct: 299 SYDRDLVSWTAMIVGYSQRGYPHLALELFKDKKWSGILPNSVTVSSLLSSCAQLG 353 >ref|NP_178437.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75216181|sp|Q9ZQ74.1|PP146_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g03380, mitochondrial; Flags: Precursor gi|4335760|gb|AAD17437.1| unknown protein [Arabidopsis thaliana] gi|330250600|gb|AEC05694.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 689 Score = 162 bits (409), Expect = 3e-38 Identities = 72/122 (59%), Positives = 98/122 (80%) Frame = -1 Query: 371 SVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEKN 192 S CAS+ +L G ++HA+S+K+G L+ ++V++GTALL FYAKCGD +SAR+IFD +EEKN Sbjct: 451 SACASLGSLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIEEKN 510 Query: 191 QFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGRRYFS 12 TW+A+IGGYG QGD GSL LF +ML ++ +PN+ TFT+ILSAC H GM+ EG++YFS Sbjct: 511 TITWSAMIGGYGKQGDTIGSLELFEEMLKKQQKPNESTFTSILSACGHTGMVNEGKKYFS 570 Query: 11 LM 6 M Sbjct: 571 SM 572 Score = 80.1 bits (196), Expect = 2e-13 Identities = 41/116 (35%), Positives = 61/116 (52%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 LS C + L G +VH SIKVG+ N + AL+ YAKC A+ +F+ EK Sbjct: 350 LSGCGLIENLELGRSVHGLSIKVGIWDTN---VANALVHMYAKCYQNRDAKYVFEMESEK 406 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEG 27 + WN++I G+ G +L LF+ M +E PN +T ++ SAC+ G + G Sbjct: 407 DIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLGSLAVG 462 Score = 72.4 bits (176), Expect = 4e-11 Identities = 42/124 (33%), Positives = 62/124 (50%), Gaps = 2/124 (1%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVG--LLSCNNVYIGTALLSFYAKCGDAESARVIFDRME 201 + C ++ALH G H +K G L SC + T+LL Y KCGD +AR +F+ Sbjct: 249 IMACTKLSALHQGKWFHGCLVKSGIELSSC----LVTSLLDMYVKCGDISNARRVFNEHS 304 Query: 200 EKNQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGRR 21 + W A+I GY G + +L+LF M + +PN +T ++LS C +E GR Sbjct: 305 HVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGVEIKPNCVTIASVLSGCGLIENLELGRS 364 Query: 20 YFSL 9 L Sbjct: 365 VHGL 368 Score = 66.2 bits (160), Expect = 3e-09 Identities = 37/117 (31%), Positives = 58/117 (49%) Frame = -1 Query: 374 LSVCASVAALHFGYAVHAHSIKVGLLSCNNVYIGTALLSFYAKCGDAESARVIFDRMEEK 195 L C + L G +H +KV S +NV + T LL YAKCG+ +SA +F+ + + Sbjct: 149 LKACTELQDLDNGKKIHCQLVKVP--SFDNVVL-TGLLDMYAKCGEIKSAHKVFNDITLR 205 Query: 194 NQFTWNAVIGGYGMQGDASGSLALFNDMLNEKSEPNDITFTTILSACSHAGMIEEGR 24 N W ++I GY L LFN M N+ T+ T++ AC+ + +G+ Sbjct: 206 NVVCWTSMIAGYVKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGK 262