BLASTX nr result
ID: Bupleurum21_contig00031931
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00031931
         (634 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value

emb|CBI16054.3| unnamed protein product [Vitis vinifera]              220   1e-55
ref|XP_002280428.1| PREDICTED: pentatricopeptide repeat-containi...   220   1e-55
ref|XP_002520500.1| pentatricopeptide repeat-containing protein,...   207   2e-51
ref|XP_002313416.1| predicted protein [Populus trichocarpa] gi|2...   201   1e-49
ref|XP_003552616.1| PREDICTED: pentatricopeptide repeat-containi...   199   5e-49

>emb|CBI16054.3| unnamed protein product [Vitis vinifera]
          Length = 476

 Score = 220 bits (561), Expect = 1e-55
 Identities = 122/198 (61%), Positives = 143/198 (72%), Gaps = 13/198 (6%)
 Frame = +2

Query: 59  MLAFQGPQILQK-HPIHSLHYSSSISAQLTNPSPNF-LSIRPS-----------NNELIQ 199
           M AFQ PQ +Q+ H H ++IS P P L++RPS NN LIQ
Sbjct: 1   MWAFQTPQTIQQPHLPKPFHKPTAIS-----PKPQCCLALRPSTTTRSNGDSNNNNPLIQ 55

Query: 200 SLCQKGNLKQALQLLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFL 379
           SLC++GNL QALQ+L +E PTQHTYELLILSC+RQNSLP +H LI DG +QD FL
Sbjct: 56  SLCKQGNLNQALQVLSQEPNPTQHTYELLILSCTRQNSLPQGIDLHRHLIHDGSDQDPFL 115

Query: 380 ATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGV 559
           ATKLI+MYSELDSI+ ARKVFD+ R+RT++VWNA+FRALTL G G EVL LYRRMN +GV
Sbjct: 116 ATKLINMYSELDSIDNARKVFDKTRKRTIYVWNALFRALTLAGYGREVLDLYRRMNRIGV 175

Query: 560 LSDRFTYTYVLKACVVGE 613
           SDRFTYTYVLKACV E
Sbjct: 176 PSDRFTYTYVLKACVASE 193

 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 39/149 (26%), Positives = 71/149 (47%), Gaps = 10/149 (6%)
 Frame = +2

Query: 185 NELIQSLCQKGNLKQALQLLYKESQ----PTQHTYELLILSCSRQNS----LPGAGIVHC 340
           N L ++L G ++ L L + ++ + TY ++ +C + L +H
Sbjct: 148 NALFRALTLAGYGREVLDLYRRMNRIGVPSDRFTYTYVLKACVASEAFVSLLLNGREIHG 207

Query: 341 RLIRDGFEQDTFLATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEE 520
           ++R GFE + T L+DMY+ + A +VFD++ + V W+A+ + G E
Sbjct: 208 HILRHGFEGHVHIMTTLLDMYARFGCVLNASRVFDQMPVKNVVSWSAMIACYSKNGKPLE 267

Query: 521 VLSLYRRMNMVG--VLSDRFTYTYVLKAC 601
           L L+R+M + +L + T VL+AC
Sbjct: 268 ALELFRKMMLENQDLLPNSVTMVSVLQAC 296

>ref|XP_002280428.1| PREDICTED: pentatricopeptide repeat-containing protein
 At3g46790, chloroplastic [Vitis vinifera]
          Length = 658

 Score = 220 bits (561), Expect = 1e-55
 Identities = 122/198 (61%), Positives = 143/198 (72%), Gaps = 13/198 (6%)
 Frame = +2

Query: 59  MLAFQGPQILQK-HPIHSLHYSSSISAQLTNPSPNF-LSIRPS-----------NNELIQ 199
           M AFQ PQ +Q+ H H ++IS P P L++RPS NN LIQ
Sbjct: 1   MWAFQTPQTIQQPHLPKPFHKPTAIS-----PKPQCCLALRPSTTTRSNGDSNNNNPLIQ 55

Query: 200 SLCQKGNLKQALQLLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFL 379
           SLC++GNL QALQ+L +E PTQHTYELLILSC+RQNSLP +H LI DG +QD FL
Sbjct: 56  SLCKQGNLNQALQVLSQEPNPTQHTYELLILSCTRQNSLPQGIDLHRHLIHDGSDQDPFL 115

Query: 380 ATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGV 559
           ATKLI+MYSELDSI+ ARKVFD+ R+RT++VWNA+FRALTL G G EVL LYRRMN +GV
Sbjct: 116 ATKLINMYSELDSIDNARKVFDKTRKRTIYVWNALFRALTLAGYGREVLDLYRRMNRIGV 175

Query: 560 LSDRFTYTYVLKACVVGE 613
           SDRFTYTYVLKACV E
Sbjct: 176 PSDRFTYTYVLKACVASE 193

 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 39/149 (26%), Positives = 71/149 (47%), Gaps = 10/149 (6%)
 Frame = +2

Query: 185 NELIQSLCQKGNLKQALQLLYKESQ----PTQHTYELLILSCSRQNS----LPGAGIVHC 340
           N L ++L G ++ L L + ++ + TY ++ +C + L +H
Sbjct: 148 NALFRALTLAGYGREVLDLYRRMNRIGVPSDRFTYTYVLKACVASEAFVSLLLNGREIHG 207

Query: 341 RLIRDGFEQDTFLATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEE 520
           ++R GFE + T L+DMY+ + A +VFD++ + V W+A+ + G E
Sbjct: 208 HILRHGFEGHVHIMTTLLDMYARFGCVLNASRVFDQMPVKNVVSWSAMIACYSKNGKPLE 267

Query: 521 VLSLYRRMNMVG--VLSDRFTYTYVLKAC 601
           L L+R+M + +L + T VL+AC
Sbjct: 268 ALELFRKMMLENQDLLPNSVTMVSVLQAC 296

 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 33/143 (23%), Positives = 71/143 (49%), Gaps = 6/143 (4%)
 Frame = +2

Query: 191 LIQSLCQKGNLKQALQLLYK---ESQ---PTQHTYELLILSCSRQNSLPGAGIVHCRLIR 352
           +I + G +AL+L K E+Q P T ++ +C+ +L ++H ++R
Sbjct: 255 MIACYSKNGKPLEALELFRKMMLENQDLLPNSVTMVSVLQACAALAALEQGKLMHGYILR 314

Query: 353 DGFEQDTFLATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSL 532
           G + + + L+ +Y+ ++E +VF+R+ +R V WN++ + + G G + + +
Sbjct: 315 RGLDSILPVVSALVTVYARCGNLELGHRVFERMEKRDVVSWNSLISSYGIHGFGRKAIQI 374

Query: 533 YRRMNMVGVLSDRFTYTYVLKAC 601
           ++ M G+ ++ VL AC
Sbjct: 375 FKEMIDQGLSPSPISFVSVLGAC 397

>ref|XP_002520500.1| pentatricopeptide repeat-containing protein, putative
 [Ricinus communis]
 gi|223540342|gb|EEF41913.1| pentatricopeptide repeat-containing protein,
 putative [Ricinus communis]
          Length = 414

 Score = 207 bits (526), Expect = 2e-51
 Identities = 110/192 (57%), Positives = 134/192 (69%), Gaps = 5/192 (2%)
 Frame = +2

Query: 59  MLAFQGPQILQKHPIHSLHYSSSISAQLTNPSPNFLSIRPS-----NNELIQSLCQKGNL 223
           M AF PQ + P HS +S P F+++ P+ +N+LIQSLC++GNL
Sbjct: 1   MWAFHSPQATTQPPSHSTFFSRPTP----KPPICFVNLNPTIPTANSNKLIQSLCKQGNL 56

Query: 224 KQALQLLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFLATKLIDMY 403
           KQAL LL E P QHTYELL+LSC+ QNS A VH L+ +GF+QD FLATKLI+MY
Sbjct: 57  KQALNLLCNEPDPAQHTYELLLLSCTHQNSFLDAQFVHQHLLDNGFDQDPFLATKLINMY 116

Query: 404 SELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGVLSDRFTYT 583
           S SI+ ARKVFD+ R RT++V+NA+FRALTL GNGEEVL LYRRMN +G+ SDRFTYT
Sbjct: 117 SSFGSIDNARKVFDKTRSRTLYVYNALFRALTLVGNGEEVLRLYRRMNSIGMPSDRFTYT 176

Query: 584 YVLKACVVGEEL 619
           YVLKACV L
Sbjct: 177 YVLKACVASNSL 188

 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 30/127 (23%), Positives = 65/127 (51%)
 Frame = +2

Query: 221 LKQALQLLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFLATKLIDM 400
           L + + L K+ P T ++ +C+ +L ++H ++R G + + + L+ M
Sbjct: 264 LFREMMLETKDMCPNSVTMVSVLQACAALAALEQGKLLHGYILRRGLDSILPVISSLVTM 323

Query: 401 YSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGVLSDRFTY 580
           Y+ ++ A+ VFD++ +R V WN++ + + G G++ + +++ M GV ++
Sbjct: 324 YARCGKLQLAQHVFDQMDKRDVVSWNSLISSYGVHGFGKKAIQIFKDMTRNGVFPSPISF 383

Query: 581 TYVLKAC 601
           VL AC
Sbjct: 384 VSVLGAC 390

 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 39/149 (26%), Positives = 68/149 (45%), Gaps = 10/149 (6%)
 Frame = +2

Query: 185 NELIQSLCQKGNLKQALQLLYKESQ----PTQHTYELLILSCSRQNSLPG----AGIVHC 340
           N L ++L GN ++ L+L + + + TY ++ +C NSL +H
Sbjct: 141 NALFRALTLVGNGEEVLRLYRRMNSIGMPSDRFTYTYVLKACVASNSLLSLLKKGKEIHA 200

Query: 341 RLIRDGFEQDTFLATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEE 520
           +++R G+E + T L+DMY+ + A VF + + V W+A+ G E
Sbjct: 201 QILRRGYEAHVHIMTTLVDMYARFGYVSYASCVFSEMSVKNVVSWSAMIACYAKNGRPFE 260

Query: 521 VLSLYRRMNM--VGVLSDRFTYTYVLKAC 601
           L L+R M + + + T VL+AC
Sbjct: 261 ALELFREMMLETKDMCPNSVTMVSVLQAC 289

>ref|XP_002313416.1| predicted protein [Populus trichocarpa]
 gi|222849824|gb|EEE87371.1| predicted protein [Populus trichocarpa]
          Length = 650

 Score = 201 bits (510), Expect = 1e-49
 Identities = 105/185 (56%), Positives = 133/185 (71%)
 Frame = +2

Query: 59  MLAFQGPQILQKHPIHSLHYSSSISAQLTNPSPNFLSIRPSNNELIQSLCQKGNLKQALQ 238
           M AFQ P+ + S+ + + + N + NN+LIQSLC++GNL QAL+
Sbjct: 1   MWAFQSPKTTLLPSNATFLPRPSLKPPICSITLNPTASTADNNKLIQSLCKQGNLTQALE 60

Query: 239 LLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFLATKLIDMYSELDS 418
           LL E P QHTYELLILSC+ QNSL A VH L+ +GF+QD FLATKLI+MYS DS
Sbjct: 61  LLSLEPNPAQHTYELLILSCTHQNSLLDAQRVHRHLLENGFDQDPFLATKLINMYSFFDS 120

Query: 419 IECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGVLSDRFTYTYVLKA 598
           I+ ARKVFD+ R RT++V+NA+FRAL+L G+GEEVL++YRRMN +G+ SDRFTYTYVLKA
Sbjct: 121 IDNARKVFDKTRNRTIYVYNALFRALSLAGHGEEVLNMYRRMNSIGIPSDRFTYTYVLKA 180

Query: 599 CVVGE 613
           CV E
Sbjct: 181 CVASE 185

 Score = 59.7 bits (143), Expect = 4e-07
 Identities = 35/143 (24%), Positives = 71/143 (49%), Gaps = 6/143 (4%)
 Frame = +2

Query: 191 LIQSLCQKGNLKQALQL---LYKESQ---PTQHTYELLILSCSRQNSLPGAGIVHCRLIR 352
           +I + G +AL+L L E+Q P T ++ +C+ +L ++H ++R
Sbjct: 247 MIACYAKNGKAFEALELFRELMLETQDLCPNSVTMVSVLQACAALAALEQGRLIHGYILR 306

Query: 353 DGFEQDTFLATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSL 532
           G + + + L+ MY+ +E ++VFD++ +R V WN++ + + G G++ + +
Sbjct: 307 KGLDSILPVISALVTMYARCGKLELGQRVFDQMDKRDVVSWNSLISSYGVHGFGKKAIGI 366

Query: 533 YRRMNMVGVLSDRFTYTYVLKAC 601
           + M GV ++ VL AC
Sbjct: 367 FEEMTYNGVEPSPISFVSVLGAC 389

>ref|XP_003552616.1| PREDICTED: pentatricopeptide repeat-containing protein
 At3g46790, chloroplastic-like [Glycine max]
          Length = 658

 Score = 199 bits (505), Expect = 5e-49
 Identities = 108/193 (55%), Positives = 139/193 (72%), Gaps = 8/193 (4%)
 Frame = +2

Query: 59  MLAFQGPQILQKHPIHS-LHYSSSISAQLT------NPSPNFLS-IRPSNNELIQSLCQK 214
           M Q PQI++ P S L Y+S +S+++ NPS N ++ I+ +NN+LIQSLC+
Sbjct: 1   MWVLQIPQIVRHAPSQSHLCYNSHVSSRVPVSFVSLNPSANLMNDIKGNNNQLIQSLCKG 60

Query: 215 GNLKQALQLLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFLATKLI 394
           GNLKQA+ LL E PTQ T+E LI SC++QNSL VH RL+ GF+QD FLATKLI
Sbjct: 61  GNLKQAIHLLCCEPNPTQRTFEHLICSCAQQNSLSDGLDVHRRLVSSGFDQDPFLATKLI 120

Query: 395 DMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGVLSDRF 574
           +MY EL SI+ ARKVFD R RT++VWNA+FRAL + G G+E+L LY +MN +G+ SDRF
Sbjct: 121 NMYYELGSIDRARKVFDETRERTIYVWNALFRALAMVGCGKELLDLYVQMNWIGIPSDRF 180

Query: 575 TYTYVLKACVVGE 613
           TYT+VLKACVV E
Sbjct: 181 TYTFVLKACVVSE 193

 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 40/170 (23%), Positives = 77/170 (45%), Gaps = 5/170 (2%)
 Frame = +2

Query: 107 SLHYSSSISAQLTNPSPNFLSIRP-----SNNELIQSLCQKGNLKQALQLLYKESQPTQH 271
           S+ Y++S+ + P+ NF+S + NE+ + L Q + L +S P
Sbjct: 233 SVSYANSVFCAM--PTKNFVSWSAMIACFAKNEMPMKALE---LFQLMMLEAHDSVPNSV 287

Query: 272 TYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFLATKLIDMYSELDSIECARKVFDRI 451
           T ++ +C+ +L ++H ++R G + + LI MY I ++VFD +
Sbjct: 288 TMVNVLQACAGLAALEQGKLIHGYILRRGLDSILPVLNALITMYGRCGEILMGQRVFDNM 347

Query: 452 RRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGVLSDRFTYTYVLKAC 601
           + R V WN++ + G G++ + ++ M G ++ VL AC
Sbjct: 348 KNRDVVSWNSLISIYGMHGFGKKAIQIFENMIHQGSSPSYISFITVLGAC 397