BLASTX nr result

ID: Bupleurum21_contig00023624 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00023624
         (2115 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI18867.3| unnamed protein product [Vitis vinifera]              509   e-141
gb|ADI58371.1| pentatricopeptide repeat-containing protein [Caps...   489   e-135
ref|NP_001185409.1| pentatricopeptide repeat-containing protein ...   486   e-135
ref|XP_002889073.1| pentatricopeptide repeat-containing protein ...   483   e-134
gb|AAF16672.1|AC012394_21 hypothetical protein; 84465-87513 [Ara...   477   e-132

>emb|CBI18867.3| unnamed protein product [Vitis vinifera]
          Length = 804

 Score =  509 bits (1310), Expect = e-141
 Identities = 270/465 (58%), Positives = 342/465 (73%)
 Frame = +3

Query: 3    NCVLAEHIIQQMQNLGVEPSNKTYDGFMRAVVKDKGSQAGIELLKLMQNKNILPCDTTFA 182
            NC LAE +I QMQNLG+EPS  TYDG ++A+V D+G   G+E+LK MQ +N+ P D+T A
Sbjct: 343  NCGLAEQLILQMQNLGLEPSCHTYDGLIKAIVSDRGFSDGMEVLKTMQLRNLKPYDSTLA 402

Query: 183  TLSITCSLELKLDLAESYLDQMSCCSNAYPYNYCLEACKILDQPERAVRILAKMRHMKLQ 362
             LSI  S  L+LDLAES LDQ+S  S  +P+N  L AC  LDQPERAV IL+KM+ +KLQ
Sbjct: 403  ALSIGSSKALQLDLAESLLDQISRISYVHPFNAFLAACDTLDQPERAVPILSKMKQLKLQ 462

Query: 363  PDTRTYELMFSLFGYVNMPYESGNMLSRVDAAKRISAIEADMLKNGIQHSSLSILNLLRA 542
            P+  TYEL+FSLFG VN PYE GNMLS+VD A+RI AIE DM+ NGIQHS LS+ NLL+A
Sbjct: 463  PNVGTYELLFSLFGNVNAPYEEGNMLSQVDVARRIKAIEMDMMNNGIQHSHLSMKNLLKA 522

Query: 543  LGAERMLKELMLYLHVAESQFSRYQKEAITMIYNAVLHSLVDAEKTQLAAETFKTMMKIG 722
            LGAE M++ELM YLHVAE+QF R      T IYN VLHSLV+A+++ +A E FK M+   
Sbjct: 523  LGAEGMIRELMQYLHVAENQFFRTNTYLGTPIYNTVLHSLVEAKESHIAIEIFKNMISRS 582

Query: 723  CQPDLATYTTMIDCCSIVGGFKYACSLVSLMIRNGFCPETVTYTVLTKILLENERLKNKD 902
               D ATY  MIDCCS +  +K AC+LVS+M+R+GF P T+TYT L KILLE E     D
Sbjct: 583  LPRDAATYNIMIDCCSTIKCYKSACALVSMMMRDGFLPWTLTYTALIKILLERE-----D 637

Query: 903  FVGALHLLDMACSEDIQLDKLLFNTILREALYKKKLVVIELVIERMHEKKIPPDPTTCSY 1082
            F  AL+LLD A  E+I  D LL+NTIL++A  K ++ +IELV E+MH++KI PDP+TC Y
Sbjct: 638  FDEALNLLDQARLEEIPSDVLLYNTILQKACLKGRIDLIELVAEQMHQEKIQPDPSTCYY 697

Query: 1083 VFYAYFLRKRHSTALEALQVLSMRMLLEDNSSLEELRSAYEQDYILADDPEAESRIMELF 1262
            VF +Y      STALE+LQVLSMRM+ EDNS+LEE R+  E D+I ++D +AES+I++ F
Sbjct: 698  VFSSYVDGGFFSTALESLQVLSMRMISEDNSTLEEKRTELE-DFIHSEDKDAESQILQFF 756

Query: 1263 KDHKDNLAFGLLNLRWCAMVGHEISWSPDQSLWAKRLANRFGTGK 1397
            K   +NLA  LLNLRWCA+ G  ISW+P++SLWA+RL N  G+ K
Sbjct: 757  KGSDENLAIALLNLRWCAISGSPISWTPNESLWARRLFNSHGSRK 801


>gb|ADI58371.1| pentatricopeptide repeat-containing protein [Capsicum annuum]
          Length = 805

 Score =  489 bits (1259), Expect = e-135
 Identities = 251/463 (54%), Positives = 337/463 (72%)
 Frame = +3

Query: 3    NCVLAEHIIQQMQNLGVEPSNKTYDGFMRAVVKDKGSQAGIELLKLMQNKNILPCDTTFA 182
            NC LAE +I QMQ LG++PS  TYDGF+RA+   +G   G+E+LK+M+ +NI P D+T A
Sbjct: 344  NCDLAEQLILQMQTLGLQPSGSTYDGFIRAIATTRGFSEGVEVLKVMREENIKPRDSTLA 403

Query: 183  TLSITCSLELKLDLAESYLDQMSCCSNAYPYNYCLEACKILDQPERAVRILAKMRHMKLQ 362
             L+I CS EL+LDLAES+LD++    + +P N  LEAC +LD+PERAV+I AKM+ + LQ
Sbjct: 404  VLAIICSRELELDLAESFLDEIYEIRSPHPCNAFLEACDVLDRPERAVQIFAKMKKLNLQ 463

Query: 363  PDTRTYELMFSLFGYVNMPYESGNMLSRVDAAKRISAIEADMLKNGIQHSSLSILNLLRA 542
            P+ RTYEL+FSLFG VN PYE GNMLS+VD AKRI+AIE DM+ NG+QHS LS+ N L+A
Sbjct: 464  PNIRTYELLFSLFGNVNAPYEEGNMLSQVDVAKRINAIEMDMMINGLQHSHLSLKNELKA 523

Query: 543  LGAERMLKELMLYLHVAESQFSRYQKEAITMIYNAVLHSLVDAEKTQLAAETFKTMMKIG 722
            LG E M+KEL+ YLH AE++FSRY    IT +YN VLHSLV+A+++Q+A + FK+M+  G
Sbjct: 524  LGTEGMIKELIQYLHAAENRFSRYDTYMITPVYNTVLHSLVEAKESQMATKMFKSMVSSG 583

Query: 723  CQPDLATYTTMIDCCSIVGGFKYACSLVSLMIRNGFCPETVTYTVLTKILLENERLKNKD 902
              PD ATY  MIDCCSI+G F+ A +L+S+M RNGF PE VT T L KILL +E     D
Sbjct: 584  VPPDAATYNIMIDCCSIIGCFRSALALISMMFRNGFNPEAVTLTGLLKILLRSE-----D 638

Query: 903  FVGALHLLDMACSEDIQLDKLLFNTILREALYKKKLVVIELVIERMHEKKIPPDPTTCSY 1082
            F G L LL+   SE IQLD LL++T+L+ A  K ++ VIEL++E+MH + + PDP+TCS+
Sbjct: 639  FDGTLKLLNQGISEGIQLDVLLYDTVLQVASEKGRIDVIELIVEQMHLQGVLPDPSTCSH 698

Query: 1083 VFYAYFLRKRHSTALEALQVLSMRMLLEDNSSLEELRSAYEQDYILADDPEAESRIMELF 1262
            VF AY     ++TA+EALQVLS+RM+       +E ++  E + IL +D E ESRI++ F
Sbjct: 699  VFAAYVDHGFYNTAMEALQVLSVRMIAGGFKDNDEKQTELE-NLILGEDSEDESRILKPF 757

Query: 1263 KDHKDNLAFGLLNLRWCAMVGHEISWSPDQSLWAKRLANRFGT 1391
            KD K+ L   LL LRWCA++G+ +SWSP  S WA+RL++   +
Sbjct: 758  KDSKEYLTVALLQLRWCAILGYPVSWSPSDSHWARRLSSNLAS 800


>ref|NP_001185409.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332197699|gb|AEE35820.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 801

 Score =  486 bits (1252), Expect = e-135
 Identities = 247/463 (53%), Positives = 335/463 (72%), Gaps = 1/463 (0%)
 Frame = +3

Query: 3    NCVLAEHIIQQMQNLGVEPSNKTYDGFMRAVVKDKGSQAGIELLKLMQNKNILPCDTTFA 182
            N  LAE ++ QMQNLG+ PS+ TYDGF+RAV   +G + G+ LLK+MQ +N+ P D+T A
Sbjct: 344  NSELAEQLMLQMQNLGLLPSSHTYDGFIRAVAFPEGYEYGMTLLKVMQQQNLKPYDSTLA 403

Query: 183  TLSITCSLELKLDLAESYLDQMSCCSNAYPYNYCLEACKILDQPERAVRILAKMRHMKLQ 362
            T++  CS  L++DLAE  LDQ+S CS +YP+N  L A   LDQPERAVR+LA+M+ +KL+
Sbjct: 404  TVAAYCSKALQVDLAEHLLDQISECSYSYPFNNLLAAYDSLDQPERAVRVLARMKELKLR 463

Query: 363  PDTRTYELMFSLFGYVNMPYESGNMLSRVDAAKRISAIEADMLKNGIQHSSLSILNLLRA 542
            PD RTYEL+FSLFG VN PYE GNMLS+VD  KRI+AIE DM++NG QHS +S LN+LRA
Sbjct: 464  PDMRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDMMRNGFQHSPISRLNVLRA 523

Query: 543  LGAERMLKELMLYLHVAESQFSRYQKEAITMIYNAVLHSLVDAEKTQLAAETFKTMMKIG 722
            LGAE M+ E++ +L  AE+  +       T  YN VLHSL++A +T +    FK M   G
Sbjct: 524  LGAEGMVNEMIRHLQKAENLSAHSNMYLGTPTYNIVLHSLLEANETDMVINIFKRMKSCG 583

Query: 723  CQPDLATYTTMIDCCSIVGGFKYACSLVSLMIRNGFCPETVTYTVLTKILLENERLKNKD 902
            C  D+ATY  MIDCCS++  +K AC+LVS+MIR+GF P+ VT+T L KIL     L + +
Sbjct: 584  CPADVATYNIMIDCCSLIHSYKSACALVSMMIRDGFSPKAVTFTALMKIL-----LNDAN 638

Query: 903  FVGALHLLDMACSEDIQLDKLLFNTILREALYKKKLVVIELVIERMHEKKIPPDPTTCSY 1082
            F  AL+LLD A  E+I LD L +NTILR+A  K  + VIE ++E+MH +K+ PDPTTC Y
Sbjct: 639  FEEALNLLDQAALEEIHLDVLSYNTILRKAFEKGMIDVIEYIVEQMHREKVNPDPTTCHY 698

Query: 1083 VFYAYFLRKRHSTALEALQVLSMRML-LEDNSSLEELRSAYEQDYILADDPEAESRIMEL 1259
            VF  Y  +  H+TA+EAL VLS+RML  ED  SL++ +   E+++++++DPEAE++I+EL
Sbjct: 699  VFSCYVEKGYHATAIEALNVLSLRMLNEEDKESLQDKKIELEENFVMSEDPEAETKIIEL 758

Query: 1260 FKDHKDNLAFGLLNLRWCAMVGHEISWSPDQSLWAKRLANRFG 1388
            F+  +++LA  LLNLRWCAM+G  I WS DQS WA+ L+N++G
Sbjct: 759  FRKSEEHLAAALLNLRWCAMLGGRIIWSEDQSPWARALSNKYG 801


>ref|XP_002889073.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297334914|gb|EFH65332.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 800

 Score =  483 bits (1243), Expect = e-134
 Identities = 248/463 (53%), Positives = 332/463 (71%), Gaps = 1/463 (0%)
 Frame = +3

Query: 3    NCVLAEHIIQQMQNLGVEPSNKTYDGFMRAVVKDKGSQAGIELLKLMQNKNILPCDTTFA 182
            N  LAE ++ QMQN+G+ PS+ TYDGF+RAV    G + G+ LLK+MQ +N+ P D+T A
Sbjct: 343  NSELAEQLMLQMQNIGLLPSSHTYDGFIRAVAFPGGYEYGMTLLKVMQQQNLKPYDSTLA 402

Query: 183  TLSITCSLELKLDLAESYLDQMSCCSNAYPYNYCLEACKILDQPERAVRILAKMRHMKLQ 362
            T+S  CS   ++DLAE  LDQ+S CS AYP+N  L A   LDQPERAVR+LA+M+ +KL+
Sbjct: 403  TVSAYCSKAFQVDLAEHLLDQISECSYAYPFNNLLAAYDSLDQPERAVRVLARMKQLKLR 462

Query: 363  PDTRTYELMFSLFGYVNMPYESGNMLSRVDAAKRISAIEADMLKNGIQHSSLSILNLLRA 542
            PD RTYEL+FSLFG VN PYE GNMLS+VD  KRI+AIE DM++NG QHS +S  N+LRA
Sbjct: 463  PDMRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDMVRNGFQHSPISRRNVLRA 522

Query: 543  LGAERMLKELMLYLHVAESQFSRYQKEAITMIYNAVLHSLVDAEKTQLAAETFKTMMKIG 722
            LGAE M+ E++ +L  AE+          T  YN VLHSL++A +T +    FK M   G
Sbjct: 523  LGAEGMVNEMIRHLQKAENLNVHSNMYLGTPTYNIVLHSLLEANETDMVINIFKRMKSCG 582

Query: 723  CQPDLATYTTMIDCCSIVGGFKYACSLVSLMIRNGFCPETVTYTVLTKILLENERLKNKD 902
            C  D+ATY  MIDCCSI+  +K AC+LVS+MIR+GF P+ VT+T L KIL     L + +
Sbjct: 583  CPADVATYNIMIDCCSIIHSYKSACALVSMMIRDGFSPKAVTFTALMKIL-----LNDGN 637

Query: 903  FVGALHLLDMACSEDIQLDKLLFNTILREALYKKKLVVIELVIERMHEKKIPPDPTTCSY 1082
            F  AL+LLD A  E+I LD L +NTILR+A  K  + VIE ++E+MH +K+ PDPTTC Y
Sbjct: 638  FEEALNLLDQAALEEIHLDVLSYNTILRKAFEKGMIDVIEYIVEQMHREKVNPDPTTCHY 697

Query: 1083 VFYAYFLRKRHSTALEALQVLSMRML-LEDNSSLEELRSAYEQDYILADDPEAESRIMEL 1259
            VF  Y  +  H+TA+EAL VLS+RML  ED  SL+E +   E+++++++DPEAE++I+EL
Sbjct: 698  VFTCYVEKGYHATAIEALNVLSLRMLNEEDKESLQEKKIELEENFVMSEDPEAETKIIEL 757

Query: 1260 FKDHKDNLAFGLLNLRWCAMVGHEISWSPDQSLWAKRLANRFG 1388
            F++ +++LA  LLNLRWCAM+G  I WS DQS WA+ L+N++G
Sbjct: 758  FRNSEEHLAAALLNLRWCAMLGARIIWSEDQSPWARGLSNKYG 800


>gb|AAF16672.1|AC012394_21 hypothetical protein; 84465-87513 [Arabidopsis thaliana]
          Length = 558

 Score =  477 bits (1227), Expect = e-132
 Identities = 247/477 (51%), Positives = 335/477 (70%), Gaps = 15/477 (3%)
 Frame = +3

Query: 3    NCVLAEHIIQQMQNLGVEPSNKTYDGFMRAVVKDKGSQAGIELLKLMQNKNILPCDTTFA 182
            N  LAE ++ QMQNLG+ PS+ TYDGF+RAV   +G + G+ LLK+MQ +N+ P D+T A
Sbjct: 87   NSELAEQLMLQMQNLGLLPSSHTYDGFIRAVAFPEGYEYGMTLLKVMQQQNLKPYDSTLA 146

Query: 183  TLSITCSLELKLDLAESYLDQMSCCSNAYPYNYCLEACKILDQPERAVRILAKMRHMKLQ 362
            T++  CS  L++DLAE  LDQ+S CS +YP+N  L A   LDQPERAVR+LA+M+ +KL+
Sbjct: 147  TVAAYCSKALQVDLAEHLLDQISECSYSYPFNNLLAAYDSLDQPERAVRVLARMKELKLR 206

Query: 363  PDTRTYELMFSLFGYVNMPYESGNMLSRVDAAKRISAIEADMLKNGIQHSSLSILNL--- 533
            PD RTYEL+FSLFG VN PYE GNMLS+VD  KRI+AIE DM++NG QHS +S LN+   
Sbjct: 207  PDMRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDMMRNGFQHSPISRLNVNNG 266

Query: 534  -----------LRALGAERMLKELMLYLHVAESQFSRYQKEAITMIYNAVLHSLVDAEKT 680
                       LRALGAE M+ E++ +L  AE+  +       T  YN VLHSL++A +T
Sbjct: 267  LIVYTFLFLSQLRALGAEGMVNEMIRHLQKAENLSAHSNMYLGTPTYNIVLHSLLEANET 326

Query: 681  QLAAETFKTMMKIGCQPDLATYTTMIDCCSIVGGFKYACSLVSLMIRNGFCPETVTYTVL 860
             +    FK M   GC  D+ATY  MIDCCS++  +K AC+LVS+MIR+GF P+ VT+T L
Sbjct: 327  DMVINIFKRMKSCGCPADVATYNIMIDCCSLIHSYKSACALVSMMIRDGFSPKAVTFTAL 386

Query: 861  TKILLENERLKNKDFVGALHLLDMACSEDIQLDKLLFNTILREALYKKKLVVIELVIERM 1040
             KIL     L + +F  AL+LLD A  E+I LD L +NTILR+A  K  + VIE ++E+M
Sbjct: 387  MKIL-----LNDANFEEALNLLDQAALEEIHLDVLSYNTILRKAFEKGMIDVIEYIVEQM 441

Query: 1041 HEKKIPPDPTTCSYVFYAYFLRKRHSTALEALQVLSMRML-LEDNSSLEELRSAYEQDYI 1217
            H +K+ PDPTTC YVF  Y  +  H+TA+EAL VLS+RML  ED  SL++ +   E++++
Sbjct: 442  HREKVNPDPTTCHYVFSCYVEKGYHATAIEALNVLSLRMLNEEDKESLQDKKIELEENFV 501

Query: 1218 LADDPEAESRIMELFKDHKDNLAFGLLNLRWCAMVGHEISWSPDQSLWAKRLANRFG 1388
            +++DPEAE++I+ELF+  +++LA  LLNLRWCAM+G  I WS DQS WA+ L+N++G
Sbjct: 502  MSEDPEAETKIIELFRKSEEHLAAALLNLRWCAMLGGRIIWSEDQSPWARALSNKYG 558


Top