BLASTX nr result
ID: Bupleurum21_contig00015106
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Bupleurum21_contig00015106 (2173 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containi... 514 e-143 ref|XP_002305605.1| predicted protein [Populus trichocarpa] gi|2... 473 e-130 ref|XP_002518527.1| pentatricopeptide repeat-containing protein,... 465 e-128 sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-c... 355 2e-95 ref|NP_193849.2| pentatricopeptide repeat-containing protein [Ar... 294 8e-77 >ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like [Vitis vinifera] Length = 569 Score = 514 bits (1323), Expect = e-143 Identities = 251/490 (51%), Positives = 347/490 (70%), Gaps = 1/490 (0%) Frame = +2 Query: 2 WSRKNLGFQPDLRVECKLVQTLIRFGLSQPARPILDSLVETHPLPQIVDALVVSCKGTDC 181 W R NLGFQPDL ++++ I+ GL QPA+ ILDSL+ET + +VD+++ +C+G D Sbjct: 83 WVRTNLGFQPDLAAHSQIIRISIQSGLFQPAKGILDSLIETQKVSVLVDSVIQACRGKDS 142 Query: 182 HSVVFSSVVECYTKKGLYLQALQVFRKVRDFGCVVSDHSCNGLLNVLQESNEIRLSWCFY 361 S V V+ECY+ KGL+++AL+VFR++ G V S SCN LL+ LQ NEI+L+WC Sbjct: 143 ESPVLGFVLECYSSKGLFIEALEVFRRITIHGYVPSVRSCNALLDSLQRENEIKLAWCVC 202 Query: 362 ASMLRNGVLASQYTWNVIARILSKDGKFEKIGQVIDMG-VHNSLIYNLIIEFYSTSGNFK 538 +++RNGVL IA IL K+GK E++ +++DM V N+LIY L+I+ Y GNF Sbjct: 203 GALIRNGVLPDYVR---IALILCKNGKLERVVRLLDMSIVCNALIYKLVIDCYCERGNFS 259 Query: 539 GAVDRLNEMVDKKLEPGFVTYSSILDGACQHGNVEIIELMMESMEKNEHIPILPSLEYNS 718 A LNEM ++K +PGF Y+SILDGAC++ N E+I+++M SM + +P L EY+S Sbjct: 260 AAFHYLNEMCNRKFDPGFCAYNSILDGACKYENDEVIQIVMGSMVEKGLLPKLLLSEYDS 319 Query: 719 IIQRLCEVGKTYAADMFFKRASAEKVELDEATYDCILRAFCNKGRAKDAIAIHEMMLESG 898 IIQ++C +GKT+AA MFFKRA EK+ELD ATY C+LRA GR K+AI ++ ++LESG Sbjct: 320 IIQKICNLGKTHAAQMFFKRARNEKIELDNATYGCMLRALAKDGRVKEAIGVYLVILESG 379 Query: 899 TVAKDSCYKLFLNVLCNEYPSEKISKLLVDLIGRGFFPCLSELSKFIVSQCKNKKWREAE 1078 KD CY F+NVLC E PS+++SKL+ ++IG+GF PC S+LSKFI S CKN +W EA+ Sbjct: 380 VTVKDGCYHAFVNVLCEEDPSQEVSKLMGEIIGKGFSPCGSKLSKFITSLCKNGRWTEAD 439 Query: 1079 DLIDVLLEKGFVPDSFSASCLVKHFCSSRRIDSAVALHNKLEHMRGTLDTNAYNXXXXXX 1258 DL++V +EKG +PDSF S LV+H+C SR+IDS++ALH K++ ++G+LD YN Sbjct: 440 DLLNVTIEKGLLPDSFCCSALVEHYCRSRQIDSSIALHEKIKKVKGSLDVATYNVLLNGL 499 Query: 1259 XXXXXXXXALKIFDYMRSHSLLNGESFSAMISGLCQENDLRKAMRLHDEMLKMGLKPDLK 1438 A+ +FD MRS +LL+ SF+ M+SGLC+E +LRKAM+ HDEMLKMGLKPD Sbjct: 500 FMEKRIEDAVSVFDCMRSQNLLSSTSFTIMVSGLCRERELRKAMKFHDEMLKMGLKPDRA 559 Query: 1439 NYKRLIASFK 1468 YKRLI+ FK Sbjct: 560 TYKRLISGFK 569 >ref|XP_002305605.1| predicted protein [Populus trichocarpa] gi|222848569|gb|EEE86116.1| predicted protein [Populus trichocarpa] Length = 564 Score = 473 bits (1216), Expect = e-130 Identities = 226/488 (46%), Positives = 335/488 (68%) Frame = +2 Query: 2 WSRKNLGFQPDLRVECKLVQTLIRFGLSQPARPILDSLVETHPLPQIVDALVVSCKGTDC 181 W + NL +PDL+ +C ++ + GL+ P RPI+DSLV+TH + + +A+V SC+G Sbjct: 75 WVQTNLKLKPDLKSQCHIINICVNSGLTLPVRPIMDSLVKTHHVSVLGEAMVDSCRGKSL 134 Query: 182 HSVVFSSVVECYTKKGLYLQALQVFRKVRDFGCVVSDHSCNGLLNVLQESNEIRLSWCFY 361 S FS V+ECY+ KGL++++L++FRK+R G + S +CN +L+VLQ NEI+L+WCFY Sbjct: 135 KSDAFSFVLECYSHKGLFMESLEMFRKMRGNGFIASGTACNSVLDVLQRENEIKLAWCFY 194 Query: 362 ASMLRNGVLASQYTWNVIARILSKDGKFEKIGQVIDMGVHNSLIYNLIIEFYSTSGNFKG 541 +M+++GVL + TW++IA+IL KDG FE+I + +DMGV+NS++YN +I+ S G+F+ Sbjct: 195 CAMIKDGVLPDKLTWSLIAQILCKDGNFERIVKFLDMGVYNSVLYNGVIDCCSKRGDFEA 254 Query: 542 AVDRLNEMVDKKLEPGFVTYSSILDGACQHGNVEIIELMMESMEKNEHIPILPSLEYNSI 721 A +RLN+M ++KL+PGF TYS+ILDGAC+HGN E+IE +M+ M + +P P + +S+ Sbjct: 255 AFERLNQMCERKLDPGFSTYSAILDGACKHGNEEVIERVMDIMAEKGLLPKCPLSQCDSV 314 Query: 722 IQRLCEVGKTYAADMFFKRASAEKVELDEATYDCILRAFCNKGRAKDAIAIHEMMLESGT 901 IQ+ ++ K A MFF+RA EK+ L +ATY C+L+A + R K+AI ++ ++ E G Sbjct: 315 IQKFSDLCKMNVATMFFRRACDEKIGLQDATYGCMLKALSKEARVKEAIGLYSLISEKGI 374 Query: 902 VAKDSCYKLFLNVLCNEYPSEKISKLLVDLIGRGFFPCLSELSKFIVSQCKNKKWREAED 1081 KDS Y FL++L E E+ ++L D++ RGF P LSKFI+ + ++WRE ED Sbjct: 375 RVKDSTYHAFLDLLSEEDQYEEGYEILGDMMRRGFRPGTVGLSKFILLLSRKRRWREVED 434 Query: 1082 LIDVLLEKGFVPDSFSASCLVKHFCSSRRIDSAVALHNKLEHMRGTLDTNAYNXXXXXXX 1261 L+D++LEKG +PDS LV+H+CS R+ID AVALHNK+E ++ +LD YN Sbjct: 435 LLDLVLEKGLLPDSLCCCSLVEHYCSRRQIDKAVALHNKMEKLQASLDVATYNILLDGLV 494 Query: 1262 XXXXXXXALKIFDYMRSHSLLNGESFSAMISGLCQENDLRKAMRLHDEMLKMGLKPDLKN 1441 +++FDYM+ L+N ESF+ I GLC+ ++RKAM+LHDEML MGLKPD Sbjct: 495 KNGRIEEVVRVFDYMKGLKLVNSESFTITIRGLCRAKEMRKAMKLHDEMLDMGLKPDKAA 554 Query: 1442 YKRLIASF 1465 YKRLI F Sbjct: 555 YKRLILEF 562 >ref|XP_002518527.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223542372|gb|EEF43914.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 599 Score = 465 bits (1197), Expect = e-128 Identities = 221/488 (45%), Positives = 336/488 (68%) Frame = +2 Query: 2 WSRKNLGFQPDLRVECKLVQTLIRFGLSQPARPILDSLVETHPLPQIVDALVVSCKGTDC 181 W++ NL F PDL+ +C ++Q + L + A+ ILDSL++T+P ++ +V +C+G Sbjct: 104 WAKTNLNFNPDLKSQCHVIQLSLGSDLPRAAKKILDSLIKTYPSNLFLETMVQACRGKSS 163 Query: 182 HSVVFSSVVECYTKKGLYLQALQVFRKVRDFGCVVSDHSCNGLLNVLQESNEIRLSWCFY 361 + V+E Y+ KG +L+ L+V++K+R GC S H+CN LL+ LQ +EIRL+WCFY Sbjct: 164 LLCTLNFVLEFYSHKGSFLEGLEVYKKMRVIGCTPSVHACNVLLDALQRESEIRLAWCFY 223 Query: 362 ASMLRNGVLASQYTWNVIARILSKDGKFEKIGQVIDMGVHNSLIYNLIIEFYSTSGNFKG 541 +M+R GVL ++TW+++A IL KDG FE+I +++DMG+ NS++YN ++++YS +G+FK Sbjct: 224 CAMIRVGVLPDKFTWSLVAHILCKDGNFERIVKLLDMGICNSVMYNAVVDYYSKNGDFKA 283 Query: 542 AVDRLNEMVDKKLEPGFVTYSSILDGACQHGNVEIIELMMESMEKNEHIPILPSLEYNSI 721 A RLNEM D+K+EPGF TYSSILDGAC+ N+++IE ++ M + + PS +Y+SI Sbjct: 284 AFCRLNEMYDRKVEPGFSTYSSILDGACKCRNLQVIERVVAIMVGKQLLSKCPSSDYDSI 343 Query: 722 IQRLCEVGKTYAADMFFKRASAEKVELDEATYDCILRAFCNKGRAKDAIAIHEMMLESGT 901 IQ+LC++GK AA +FFKRA E++ L +ATY +LRAF +G ++AI +++++LE G Sbjct: 344 IQKLCDLGKVSAATLFFKRACDERIGLQDATYGRMLRAFSIEGILEEAIGLYQVILERGL 403 Query: 902 VAKDSCYKLFLNVLCNEYPSEKISKLLVDLIGRGFFPCLSELSKFIVSQCKNKKWREAED 1081 KD+ F+++L + + +++ D++ RGF PC S LSK+I CK ++W+EAE+ Sbjct: 404 TIKDNASDAFVDLLSEKDQYAEGYEIVRDIMRRGFSPCTSSLSKYITLLCKKRRWKEAEE 463 Query: 1082 LIDVLLEKGFVPDSFSASCLVKHFCSSRRIDSAVALHNKLEHMRGTLDTNAYNXXXXXXX 1261 L+ ++LEKG +PD+ S LVKH+CSS++ D A+ALHN LE ++ +LD AYN Sbjct: 464 LLYMVLEKGLLPDTLSFCSLVKHYCSSKQTDKALALHNTLEKLQASLDITAYNLLLGGLV 523 Query: 1262 XXXXXXXALKIFDYMRSHSLLNGESFSAMISGLCQENDLRKAMRLHDEMLKMGLKPDLKN 1441 ++K+FDYM+ L N SF+ +I GLC+ +LRKAM+LHDEML MGLKPD Sbjct: 524 KEGRVEESIKVFDYMKGLKLANSASFTVIIRGLCRAKELRKAMKLHDEMLNMGLKPDKPT 583 Query: 1442 YKRLIASF 1465 YKRLI F Sbjct: 584 YKRLILEF 591 >sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21170 Length = 585 Score = 355 bits (912), Expect = 2e-95 Identities = 184/494 (37%), Positives = 306/494 (61%), Gaps = 5/494 (1%) Frame = +2 Query: 2 WSRKNLGFQPDLRVECKLVQTLIRFGLSQPARPILDSLVETHPLPQIVDALVVSCKGTDC 181 +++ +L F+PDL+ C++++ GL + A +L LVET+ + +V + +G Sbjct: 92 FAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRWFEGEVS 151 Query: 182 HSVVFSSVVECYTKKGLYLQALQVFRKVRDFGCVVSDHSCNGLLNVLQESNEIRLSWCFY 361 SV S V+E Y KG + L+VF +R S + N LL L + N+ R++ C Y Sbjct: 152 LSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRVALCLY 211 Query: 362 ASMLRNGVLASQYTWNVIARILSKDGKFEKIGQVIDMGVHNSLIYNLIIEFYSTSGNFKG 541 ++M+RNG+++ + TW++IA+IL + G+ + + ++++ GV + IY ++E YS +G F Sbjct: 212 SAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGEFDA 271 Query: 542 AVDRLNEMVDKKLEPGFVTYSSILDGACQHGNVEIIELMMESMEKNEHIPILPSLEYNSI 721 ++EM DKKLE F +Y +LD AC+ G+ E I+ ++ M + + + + S + I Sbjct: 272 VFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVNDKI 331 Query: 722 IQRLCEVGKTYAADMFFKRA-SAEKVELDEATYDCILRAFCNKGRAKDAIAIHEMMLESG 898 I+RLC++GKT+A++M F++A + E V L ++TY C+L+A K R K+A+ ++ M+ G Sbjct: 332 IERLCDMGKTFASEMLFRKACNGETVRLWDSTYGCMLKALSRKKRTKEAVDVYRMICRKG 391 Query: 899 -TVAKDSCYKLFLNVLC-NEYPSEKISKLLVDLIGRGFFPCLSELSKFIVSQCKNKKWRE 1072 TV +SCY F N LC ++ SE+ +LLVD+I RGF PC +LS+ + S C+ ++W+ Sbjct: 392 ITVLDESCYIEFANALCRDDNSSEEEEELLVDVIKRGFVPCTHKLSEVLASMCRKRRWKS 451 Query: 1073 AEDLIDVLLEKGFVPDSFSASCLVKHFCSSRRIDSAVALHNKLEHMRGTLDTNAYN--XX 1246 AE L+D ++E DSF+ L++ +C S +++ A+ LH K++ M+G+LD NAYN Sbjct: 452 AEKLLDSVMEMEVYFDSFACGLLMERYCRSGKLEKALVLHEKIKKMKGSLDVNAYNAVLD 511 Query: 1247 XXXXXXXXXXXXALKIFDYMRSHSLLNGESFSAMISGLCQENDLRKAMRLHDEMLKMGLK 1426 A+ +F+YM+ + +N +SF+ MI GLC+ +++KAMR HDEML++GLK Sbjct: 512 RLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHDEMLRLGLK 571 Query: 1427 PDLKNYKRLIASFK 1468 PDL YKRLI FK Sbjct: 572 PDLVTYKRLILGFK 585 >ref|NP_193849.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332659015|gb|AEE84415.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 551 Score = 294 bits (752), Expect = 8e-77 Identities = 171/494 (34%), Positives = 281/494 (56%), Gaps = 5/494 (1%) Frame = +2 Query: 2 WSRKNLGFQPDLRVECKLVQTLIRFGLSQPARPILDSLVETHPLPQIVDALVVSCKGTDC 181 +++ +L F+PDL+ C++++ GL + A +L LVET+ + +V + +G Sbjct: 92 FAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRWFEGEVS 151 Query: 182 HSVVFSSVVECYTKKGLYLQALQVFRKVRDFGCVVSDHSCNGLLNVLQESNEIRLSWCFY 361 SV S V+E Y KG + L+VF +R S + N LL L + N+ R++ C Y Sbjct: 152 LSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRVALCLY 211 Query: 362 ASMLRNGVLASQYTWNVIARILSKDGKFEKIGQVIDMGVHNSLIYNLIIEFYSTSGNFKG 541 ++M+RNG+++ + TW++IA+IL + G+ + + ++++ GV + IY ++E YS +G F Sbjct: 212 SAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGEFDA 271 Query: 542 AVDRLNEMVDKKLEPGFVTYSSILDGACQHGNVEIIELMMESMEKNEHIPILPSLEYNSI 721 ++EM DKKLE F +Y +LD AC+ G+ E I+ ++ M + + + + S + I Sbjct: 272 VFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVNDKI 331 Query: 722 IQRLCEVGKTYAADMFFKRA-SAEKVELDEATYDCILRAFCNKGRAKDAIAIHEMMLESG 898 I+RLC++GKT+A++M F++A + E V L ++TY C+L+A K R K+A+ ++ M+ G Sbjct: 332 IERLCDMGKTFASEMLFRKACNGETVRLWDSTYGCMLKALSRKKRTKEAVDVYRMICRKG 391 Query: 899 -TVAKDSCYKLFLNVLC-NEYPSEKISKLLVDLIGRGFFPCLSELSKFIVSQCKNKKWRE 1072 TV +SCY F N LC ++ SE+ +LLVD+I RG + S I + KWR Sbjct: 392 ITVLDESCYIEFANALCRDDNSSEEEEELLVDVIKRGKEDGNPQRSFLI----RLWKWR- 446 Query: 1073 AEDLIDVLLEKGFVPDSFSASCLVKHFCSSRRIDSAVALHNKLEHMRGTLDTNAYN--XX 1246 S +++ A+ LH K++ M+G+LD NAYN Sbjct: 447 -----------------------------SGKLEKALVLHEKIKKMKGSLDVNAYNAVLD 477 Query: 1247 XXXXXXXXXXXXALKIFDYMRSHSLLNGESFSAMISGLCQENDLRKAMRLHDEMLKMGLK 1426 A+ +F+YM+ + +N +SF+ MI GLC+ +++KAMR HDEML++GLK Sbjct: 478 RLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHDEMLRLGLK 537 Query: 1427 PDLKNYKRLIASFK 1468 PDL YKRLI FK Sbjct: 538 PDLVTYKRLILGFK 551