BLASTX nr result
ID: Bupleurum21_contig00023624
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Bupleurum21_contig00023624 (2115 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI18867.3| unnamed protein product [Vitis vinifera] 509 e-141 gb|ADI58371.1| pentatricopeptide repeat-containing protein [Caps... 489 e-135 ref|NP_001185409.1| pentatricopeptide repeat-containing protein ... 486 e-135 ref|XP_002889073.1| pentatricopeptide repeat-containing protein ... 483 e-134 gb|AAF16672.1|AC012394_21 hypothetical protein; 84465-87513 [Ara... 477 e-132 >emb|CBI18867.3| unnamed protein product [Vitis vinifera] Length = 804 Score = 509 bits (1310), Expect = e-141 Identities = 270/465 (58%), Positives = 342/465 (73%) Frame = +3 Query: 3 NCVLAEHIIQQMQNLGVEPSNKTYDGFMRAVVKDKGSQAGIELLKLMQNKNILPCDTTFA 182 NC LAE +I QMQNLG+EPS TYDG ++A+V D+G G+E+LK MQ +N+ P D+T A Sbjct: 343 NCGLAEQLILQMQNLGLEPSCHTYDGLIKAIVSDRGFSDGMEVLKTMQLRNLKPYDSTLA 402 Query: 183 TLSITCSLELKLDLAESYLDQMSCCSNAYPYNYCLEACKILDQPERAVRILAKMRHMKLQ 362 LSI S L+LDLAES LDQ+S S +P+N L AC LDQPERAV IL+KM+ +KLQ Sbjct: 403 ALSIGSSKALQLDLAESLLDQISRISYVHPFNAFLAACDTLDQPERAVPILSKMKQLKLQ 462 Query: 363 PDTRTYELMFSLFGYVNMPYESGNMLSRVDAAKRISAIEADMLKNGIQHSSLSILNLLRA 542 P+ TYEL+FSLFG VN PYE GNMLS+VD A+RI AIE DM+ NGIQHS LS+ NLL+A Sbjct: 463 PNVGTYELLFSLFGNVNAPYEEGNMLSQVDVARRIKAIEMDMMNNGIQHSHLSMKNLLKA 522 Query: 543 LGAERMLKELMLYLHVAESQFSRYQKEAITMIYNAVLHSLVDAEKTQLAAETFKTMMKIG 722 LGAE M++ELM YLHVAE+QF R T IYN VLHSLV+A+++ +A E FK M+ Sbjct: 523 LGAEGMIRELMQYLHVAENQFFRTNTYLGTPIYNTVLHSLVEAKESHIAIEIFKNMISRS 582 Query: 723 CQPDLATYTTMIDCCSIVGGFKYACSLVSLMIRNGFCPETVTYTVLTKILLENERLKNKD 902 D ATY MIDCCS + +K AC+LVS+M+R+GF P T+TYT L KILLE E D Sbjct: 583 LPRDAATYNIMIDCCSTIKCYKSACALVSMMMRDGFLPWTLTYTALIKILLERE-----D 637 Query: 903 FVGALHLLDMACSEDIQLDKLLFNTILREALYKKKLVVIELVIERMHEKKIPPDPTTCSY 1082 F AL+LLD A E+I D LL+NTIL++A K ++ +IELV E+MH++KI PDP+TC Y Sbjct: 638 FDEALNLLDQARLEEIPSDVLLYNTILQKACLKGRIDLIELVAEQMHQEKIQPDPSTCYY 697 Query: 1083 VFYAYFLRKRHSTALEALQVLSMRMLLEDNSSLEELRSAYEQDYILADDPEAESRIMELF 1262 VF +Y STALE+LQVLSMRM+ EDNS+LEE R+ E D+I ++D +AES+I++ F Sbjct: 698 VFSSYVDGGFFSTALESLQVLSMRMISEDNSTLEEKRTELE-DFIHSEDKDAESQILQFF 756 Query: 1263 KDHKDNLAFGLLNLRWCAMVGHEISWSPDQSLWAKRLANRFGTGK 1397 K +NLA LLNLRWCA+ G ISW+P++SLWA+RL N G+ K Sbjct: 757 KGSDENLAIALLNLRWCAISGSPISWTPNESLWARRLFNSHGSRK 801 >gb|ADI58371.1| pentatricopeptide repeat-containing protein [Capsicum annuum] Length = 805 Score = 489 bits (1259), Expect = e-135 Identities = 251/463 (54%), Positives = 337/463 (72%) Frame = +3 Query: 3 NCVLAEHIIQQMQNLGVEPSNKTYDGFMRAVVKDKGSQAGIELLKLMQNKNILPCDTTFA 182 NC LAE +I QMQ LG++PS TYDGF+RA+ +G G+E+LK+M+ +NI P D+T A Sbjct: 344 NCDLAEQLILQMQTLGLQPSGSTYDGFIRAIATTRGFSEGVEVLKVMREENIKPRDSTLA 403 Query: 183 TLSITCSLELKLDLAESYLDQMSCCSNAYPYNYCLEACKILDQPERAVRILAKMRHMKLQ 362 L+I CS EL+LDLAES+LD++ + +P N LEAC +LD+PERAV+I AKM+ + LQ Sbjct: 404 VLAIICSRELELDLAESFLDEIYEIRSPHPCNAFLEACDVLDRPERAVQIFAKMKKLNLQ 463 Query: 363 PDTRTYELMFSLFGYVNMPYESGNMLSRVDAAKRISAIEADMLKNGIQHSSLSILNLLRA 542 P+ RTYEL+FSLFG VN PYE GNMLS+VD AKRI+AIE DM+ NG+QHS LS+ N L+A Sbjct: 464 PNIRTYELLFSLFGNVNAPYEEGNMLSQVDVAKRINAIEMDMMINGLQHSHLSLKNELKA 523 Query: 543 LGAERMLKELMLYLHVAESQFSRYQKEAITMIYNAVLHSLVDAEKTQLAAETFKTMMKIG 722 LG E M+KEL+ YLH AE++FSRY IT +YN VLHSLV+A+++Q+A + FK+M+ G Sbjct: 524 LGTEGMIKELIQYLHAAENRFSRYDTYMITPVYNTVLHSLVEAKESQMATKMFKSMVSSG 583 Query: 723 CQPDLATYTTMIDCCSIVGGFKYACSLVSLMIRNGFCPETVTYTVLTKILLENERLKNKD 902 PD ATY MIDCCSI+G F+ A +L+S+M RNGF PE VT T L KILL +E D Sbjct: 584 VPPDAATYNIMIDCCSIIGCFRSALALISMMFRNGFNPEAVTLTGLLKILLRSE-----D 638 Query: 903 FVGALHLLDMACSEDIQLDKLLFNTILREALYKKKLVVIELVIERMHEKKIPPDPTTCSY 1082 F G L LL+ SE IQLD LL++T+L+ A K ++ VIEL++E+MH + + PDP+TCS+ Sbjct: 639 FDGTLKLLNQGISEGIQLDVLLYDTVLQVASEKGRIDVIELIVEQMHLQGVLPDPSTCSH 698 Query: 1083 VFYAYFLRKRHSTALEALQVLSMRMLLEDNSSLEELRSAYEQDYILADDPEAESRIMELF 1262 VF AY ++TA+EALQVLS+RM+ +E ++ E + IL +D E ESRI++ F Sbjct: 699 VFAAYVDHGFYNTAMEALQVLSVRMIAGGFKDNDEKQTELE-NLILGEDSEDESRILKPF 757 Query: 1263 KDHKDNLAFGLLNLRWCAMVGHEISWSPDQSLWAKRLANRFGT 1391 KD K+ L LL LRWCA++G+ +SWSP S WA+RL++ + Sbjct: 758 KDSKEYLTVALLQLRWCAILGYPVSWSPSDSHWARRLSSNLAS 800 >ref|NP_001185409.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332197699|gb|AEE35820.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 801 Score = 486 bits (1252), Expect = e-135 Identities = 247/463 (53%), Positives = 335/463 (72%), Gaps = 1/463 (0%) Frame = +3 Query: 3 NCVLAEHIIQQMQNLGVEPSNKTYDGFMRAVVKDKGSQAGIELLKLMQNKNILPCDTTFA 182 N LAE ++ QMQNLG+ PS+ TYDGF+RAV +G + G+ LLK+MQ +N+ P D+T A Sbjct: 344 NSELAEQLMLQMQNLGLLPSSHTYDGFIRAVAFPEGYEYGMTLLKVMQQQNLKPYDSTLA 403 Query: 183 TLSITCSLELKLDLAESYLDQMSCCSNAYPYNYCLEACKILDQPERAVRILAKMRHMKLQ 362 T++ CS L++DLAE LDQ+S CS +YP+N L A LDQPERAVR+LA+M+ +KL+ Sbjct: 404 TVAAYCSKALQVDLAEHLLDQISECSYSYPFNNLLAAYDSLDQPERAVRVLARMKELKLR 463 Query: 363 PDTRTYELMFSLFGYVNMPYESGNMLSRVDAAKRISAIEADMLKNGIQHSSLSILNLLRA 542 PD RTYEL+FSLFG VN PYE GNMLS+VD KRI+AIE DM++NG QHS +S LN+LRA Sbjct: 464 PDMRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDMMRNGFQHSPISRLNVLRA 523 Query: 543 LGAERMLKELMLYLHVAESQFSRYQKEAITMIYNAVLHSLVDAEKTQLAAETFKTMMKIG 722 LGAE M+ E++ +L AE+ + T YN VLHSL++A +T + FK M G Sbjct: 524 LGAEGMVNEMIRHLQKAENLSAHSNMYLGTPTYNIVLHSLLEANETDMVINIFKRMKSCG 583 Query: 723 CQPDLATYTTMIDCCSIVGGFKYACSLVSLMIRNGFCPETVTYTVLTKILLENERLKNKD 902 C D+ATY MIDCCS++ +K AC+LVS+MIR+GF P+ VT+T L KIL L + + Sbjct: 584 CPADVATYNIMIDCCSLIHSYKSACALVSMMIRDGFSPKAVTFTALMKIL-----LNDAN 638 Query: 903 FVGALHLLDMACSEDIQLDKLLFNTILREALYKKKLVVIELVIERMHEKKIPPDPTTCSY 1082 F AL+LLD A E+I LD L +NTILR+A K + VIE ++E+MH +K+ PDPTTC Y Sbjct: 639 FEEALNLLDQAALEEIHLDVLSYNTILRKAFEKGMIDVIEYIVEQMHREKVNPDPTTCHY 698 Query: 1083 VFYAYFLRKRHSTALEALQVLSMRML-LEDNSSLEELRSAYEQDYILADDPEAESRIMEL 1259 VF Y + H+TA+EAL VLS+RML ED SL++ + E+++++++DPEAE++I+EL Sbjct: 699 VFSCYVEKGYHATAIEALNVLSLRMLNEEDKESLQDKKIELEENFVMSEDPEAETKIIEL 758 Query: 1260 FKDHKDNLAFGLLNLRWCAMVGHEISWSPDQSLWAKRLANRFG 1388 F+ +++LA LLNLRWCAM+G I WS DQS WA+ L+N++G Sbjct: 759 FRKSEEHLAAALLNLRWCAMLGGRIIWSEDQSPWARALSNKYG 801 >ref|XP_002889073.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297334914|gb|EFH65332.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 800 Score = 483 bits (1243), Expect = e-134 Identities = 248/463 (53%), Positives = 332/463 (71%), Gaps = 1/463 (0%) Frame = +3 Query: 3 NCVLAEHIIQQMQNLGVEPSNKTYDGFMRAVVKDKGSQAGIELLKLMQNKNILPCDTTFA 182 N LAE ++ QMQN+G+ PS+ TYDGF+RAV G + G+ LLK+MQ +N+ P D+T A Sbjct: 343 NSELAEQLMLQMQNIGLLPSSHTYDGFIRAVAFPGGYEYGMTLLKVMQQQNLKPYDSTLA 402 Query: 183 TLSITCSLELKLDLAESYLDQMSCCSNAYPYNYCLEACKILDQPERAVRILAKMRHMKLQ 362 T+S CS ++DLAE LDQ+S CS AYP+N L A LDQPERAVR+LA+M+ +KL+ Sbjct: 403 TVSAYCSKAFQVDLAEHLLDQISECSYAYPFNNLLAAYDSLDQPERAVRVLARMKQLKLR 462 Query: 363 PDTRTYELMFSLFGYVNMPYESGNMLSRVDAAKRISAIEADMLKNGIQHSSLSILNLLRA 542 PD RTYEL+FSLFG VN PYE GNMLS+VD KRI+AIE DM++NG QHS +S N+LRA Sbjct: 463 PDMRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDMVRNGFQHSPISRRNVLRA 522 Query: 543 LGAERMLKELMLYLHVAESQFSRYQKEAITMIYNAVLHSLVDAEKTQLAAETFKTMMKIG 722 LGAE M+ E++ +L AE+ T YN VLHSL++A +T + FK M G Sbjct: 523 LGAEGMVNEMIRHLQKAENLNVHSNMYLGTPTYNIVLHSLLEANETDMVINIFKRMKSCG 582 Query: 723 CQPDLATYTTMIDCCSIVGGFKYACSLVSLMIRNGFCPETVTYTVLTKILLENERLKNKD 902 C D+ATY MIDCCSI+ +K AC+LVS+MIR+GF P+ VT+T L KIL L + + Sbjct: 583 CPADVATYNIMIDCCSIIHSYKSACALVSMMIRDGFSPKAVTFTALMKIL-----LNDGN 637 Query: 903 FVGALHLLDMACSEDIQLDKLLFNTILREALYKKKLVVIELVIERMHEKKIPPDPTTCSY 1082 F AL+LLD A E+I LD L +NTILR+A K + VIE ++E+MH +K+ PDPTTC Y Sbjct: 638 FEEALNLLDQAALEEIHLDVLSYNTILRKAFEKGMIDVIEYIVEQMHREKVNPDPTTCHY 697 Query: 1083 VFYAYFLRKRHSTALEALQVLSMRML-LEDNSSLEELRSAYEQDYILADDPEAESRIMEL 1259 VF Y + H+TA+EAL VLS+RML ED SL+E + E+++++++DPEAE++I+EL Sbjct: 698 VFTCYVEKGYHATAIEALNVLSLRMLNEEDKESLQEKKIELEENFVMSEDPEAETKIIEL 757 Query: 1260 FKDHKDNLAFGLLNLRWCAMVGHEISWSPDQSLWAKRLANRFG 1388 F++ +++LA LLNLRWCAM+G I WS DQS WA+ L+N++G Sbjct: 758 FRNSEEHLAAALLNLRWCAMLGARIIWSEDQSPWARGLSNKYG 800 >gb|AAF16672.1|AC012394_21 hypothetical protein; 84465-87513 [Arabidopsis thaliana] Length = 558 Score = 477 bits (1227), Expect = e-132 Identities = 247/477 (51%), Positives = 335/477 (70%), Gaps = 15/477 (3%) Frame = +3 Query: 3 NCVLAEHIIQQMQNLGVEPSNKTYDGFMRAVVKDKGSQAGIELLKLMQNKNILPCDTTFA 182 N LAE ++ QMQNLG+ PS+ TYDGF+RAV +G + G+ LLK+MQ +N+ P D+T A Sbjct: 87 NSELAEQLMLQMQNLGLLPSSHTYDGFIRAVAFPEGYEYGMTLLKVMQQQNLKPYDSTLA 146 Query: 183 TLSITCSLELKLDLAESYLDQMSCCSNAYPYNYCLEACKILDQPERAVRILAKMRHMKLQ 362 T++ CS L++DLAE LDQ+S CS +YP+N L A LDQPERAVR+LA+M+ +KL+ Sbjct: 147 TVAAYCSKALQVDLAEHLLDQISECSYSYPFNNLLAAYDSLDQPERAVRVLARMKELKLR 206 Query: 363 PDTRTYELMFSLFGYVNMPYESGNMLSRVDAAKRISAIEADMLKNGIQHSSLSILNL--- 533 PD RTYEL+FSLFG VN PYE GNMLS+VD KRI+AIE DM++NG QHS +S LN+ Sbjct: 207 PDMRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDMMRNGFQHSPISRLNVNNG 266 Query: 534 -----------LRALGAERMLKELMLYLHVAESQFSRYQKEAITMIYNAVLHSLVDAEKT 680 LRALGAE M+ E++ +L AE+ + T YN VLHSL++A +T Sbjct: 267 LIVYTFLFLSQLRALGAEGMVNEMIRHLQKAENLSAHSNMYLGTPTYNIVLHSLLEANET 326 Query: 681 QLAAETFKTMMKIGCQPDLATYTTMIDCCSIVGGFKYACSLVSLMIRNGFCPETVTYTVL 860 + FK M GC D+ATY MIDCCS++ +K AC+LVS+MIR+GF P+ VT+T L Sbjct: 327 DMVINIFKRMKSCGCPADVATYNIMIDCCSLIHSYKSACALVSMMIRDGFSPKAVTFTAL 386 Query: 861 TKILLENERLKNKDFVGALHLLDMACSEDIQLDKLLFNTILREALYKKKLVVIELVIERM 1040 KIL L + +F AL+LLD A E+I LD L +NTILR+A K + VIE ++E+M Sbjct: 387 MKIL-----LNDANFEEALNLLDQAALEEIHLDVLSYNTILRKAFEKGMIDVIEYIVEQM 441 Query: 1041 HEKKIPPDPTTCSYVFYAYFLRKRHSTALEALQVLSMRML-LEDNSSLEELRSAYEQDYI 1217 H +K+ PDPTTC YVF Y + H+TA+EAL VLS+RML ED SL++ + E++++ Sbjct: 442 HREKVNPDPTTCHYVFSCYVEKGYHATAIEALNVLSLRMLNEEDKESLQDKKIELEENFV 501 Query: 1218 LADDPEAESRIMELFKDHKDNLAFGLLNLRWCAMVGHEISWSPDQSLWAKRLANRFG 1388 +++DPEAE++I+ELF+ +++LA LLNLRWCAM+G I WS DQS WA+ L+N++G Sbjct: 502 MSEDPEAETKIIELFRKSEEHLAAALLNLRWCAMLGGRIIWSEDQSPWARALSNKYG 558