BLASTX nr result

ID: Bupleurum21_contig00015106 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00015106
         (2173 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containi...   514   e-143
ref|XP_002305605.1| predicted protein [Populus trichocarpa] gi|2...   473   e-130
ref|XP_002518527.1| pentatricopeptide repeat-containing protein,...   465   e-128
sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-c...   355   2e-95
ref|NP_193849.2| pentatricopeptide repeat-containing protein [Ar...   294   8e-77

>ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like
            [Vitis vinifera]
          Length = 569

 Score =  514 bits (1323), Expect = e-143
 Identities = 251/490 (51%), Positives = 347/490 (70%), Gaps = 1/490 (0%)
 Frame = +2

Query: 2    WSRKNLGFQPDLRVECKLVQTLIRFGLSQPARPILDSLVETHPLPQIVDALVVSCKGTDC 181
            W R NLGFQPDL    ++++  I+ GL QPA+ ILDSL+ET  +  +VD+++ +C+G D 
Sbjct: 83   WVRTNLGFQPDLAAHSQIIRISIQSGLFQPAKGILDSLIETQKVSVLVDSVIQACRGKDS 142

Query: 182  HSVVFSSVVECYTKKGLYLQALQVFRKVRDFGCVVSDHSCNGLLNVLQESNEIRLSWCFY 361
             S V   V+ECY+ KGL+++AL+VFR++   G V S  SCN LL+ LQ  NEI+L+WC  
Sbjct: 143  ESPVLGFVLECYSSKGLFIEALEVFRRITIHGYVPSVRSCNALLDSLQRENEIKLAWCVC 202

Query: 362  ASMLRNGVLASQYTWNVIARILSKDGKFEKIGQVIDMG-VHNSLIYNLIIEFYSTSGNFK 538
             +++RNGVL        IA IL K+GK E++ +++DM  V N+LIY L+I+ Y   GNF 
Sbjct: 203  GALIRNGVLPDYVR---IALILCKNGKLERVVRLLDMSIVCNALIYKLVIDCYCERGNFS 259

Query: 539  GAVDRLNEMVDKKLEPGFVTYSSILDGACQHGNVEIIELMMESMEKNEHIPILPSLEYNS 718
             A   LNEM ++K +PGF  Y+SILDGAC++ N E+I+++M SM +   +P L   EY+S
Sbjct: 260  AAFHYLNEMCNRKFDPGFCAYNSILDGACKYENDEVIQIVMGSMVEKGLLPKLLLSEYDS 319

Query: 719  IIQRLCEVGKTYAADMFFKRASAEKVELDEATYDCILRAFCNKGRAKDAIAIHEMMLESG 898
            IIQ++C +GKT+AA MFFKRA  EK+ELD ATY C+LRA    GR K+AI ++ ++LESG
Sbjct: 320  IIQKICNLGKTHAAQMFFKRARNEKIELDNATYGCMLRALAKDGRVKEAIGVYLVILESG 379

Query: 899  TVAKDSCYKLFLNVLCNEYPSEKISKLLVDLIGRGFFPCLSELSKFIVSQCKNKKWREAE 1078
               KD CY  F+NVLC E PS+++SKL+ ++IG+GF PC S+LSKFI S CKN +W EA+
Sbjct: 380  VTVKDGCYHAFVNVLCEEDPSQEVSKLMGEIIGKGFSPCGSKLSKFITSLCKNGRWTEAD 439

Query: 1079 DLIDVLLEKGFVPDSFSASCLVKHFCSSRRIDSAVALHNKLEHMRGTLDTNAYNXXXXXX 1258
            DL++V +EKG +PDSF  S LV+H+C SR+IDS++ALH K++ ++G+LD   YN      
Sbjct: 440  DLLNVTIEKGLLPDSFCCSALVEHYCRSRQIDSSIALHEKIKKVKGSLDVATYNVLLNGL 499

Query: 1259 XXXXXXXXALKIFDYMRSHSLLNGESFSAMISGLCQENDLRKAMRLHDEMLKMGLKPDLK 1438
                    A+ +FD MRS +LL+  SF+ M+SGLC+E +LRKAM+ HDEMLKMGLKPD  
Sbjct: 500  FMEKRIEDAVSVFDCMRSQNLLSSTSFTIMVSGLCRERELRKAMKFHDEMLKMGLKPDRA 559

Query: 1439 NYKRLIASFK 1468
             YKRLI+ FK
Sbjct: 560  TYKRLISGFK 569


>ref|XP_002305605.1| predicted protein [Populus trichocarpa] gi|222848569|gb|EEE86116.1|
            predicted protein [Populus trichocarpa]
          Length = 564

 Score =  473 bits (1216), Expect = e-130
 Identities = 226/488 (46%), Positives = 335/488 (68%)
 Frame = +2

Query: 2    WSRKNLGFQPDLRVECKLVQTLIRFGLSQPARPILDSLVETHPLPQIVDALVVSCKGTDC 181
            W + NL  +PDL+ +C ++   +  GL+ P RPI+DSLV+TH +  + +A+V SC+G   
Sbjct: 75   WVQTNLKLKPDLKSQCHIINICVNSGLTLPVRPIMDSLVKTHHVSVLGEAMVDSCRGKSL 134

Query: 182  HSVVFSSVVECYTKKGLYLQALQVFRKVRDFGCVVSDHSCNGLLNVLQESNEIRLSWCFY 361
             S  FS V+ECY+ KGL++++L++FRK+R  G + S  +CN +L+VLQ  NEI+L+WCFY
Sbjct: 135  KSDAFSFVLECYSHKGLFMESLEMFRKMRGNGFIASGTACNSVLDVLQRENEIKLAWCFY 194

Query: 362  ASMLRNGVLASQYTWNVIARILSKDGKFEKIGQVIDMGVHNSLIYNLIIEFYSTSGNFKG 541
             +M+++GVL  + TW++IA+IL KDG FE+I + +DMGV+NS++YN +I+  S  G+F+ 
Sbjct: 195  CAMIKDGVLPDKLTWSLIAQILCKDGNFERIVKFLDMGVYNSVLYNGVIDCCSKRGDFEA 254

Query: 542  AVDRLNEMVDKKLEPGFVTYSSILDGACQHGNVEIIELMMESMEKNEHIPILPSLEYNSI 721
            A +RLN+M ++KL+PGF TYS+ILDGAC+HGN E+IE +M+ M +   +P  P  + +S+
Sbjct: 255  AFERLNQMCERKLDPGFSTYSAILDGACKHGNEEVIERVMDIMAEKGLLPKCPLSQCDSV 314

Query: 722  IQRLCEVGKTYAADMFFKRASAEKVELDEATYDCILRAFCNKGRAKDAIAIHEMMLESGT 901
            IQ+  ++ K   A MFF+RA  EK+ L +ATY C+L+A   + R K+AI ++ ++ E G 
Sbjct: 315  IQKFSDLCKMNVATMFFRRACDEKIGLQDATYGCMLKALSKEARVKEAIGLYSLISEKGI 374

Query: 902  VAKDSCYKLFLNVLCNEYPSEKISKLLVDLIGRGFFPCLSELSKFIVSQCKNKKWREAED 1081
              KDS Y  FL++L  E   E+  ++L D++ RGF P    LSKFI+   + ++WRE ED
Sbjct: 375  RVKDSTYHAFLDLLSEEDQYEEGYEILGDMMRRGFRPGTVGLSKFILLLSRKRRWREVED 434

Query: 1082 LIDVLLEKGFVPDSFSASCLVKHFCSSRRIDSAVALHNKLEHMRGTLDTNAYNXXXXXXX 1261
            L+D++LEKG +PDS     LV+H+CS R+ID AVALHNK+E ++ +LD   YN       
Sbjct: 435  LLDLVLEKGLLPDSLCCCSLVEHYCSRRQIDKAVALHNKMEKLQASLDVATYNILLDGLV 494

Query: 1262 XXXXXXXALKIFDYMRSHSLLNGESFSAMISGLCQENDLRKAMRLHDEMLKMGLKPDLKN 1441
                    +++FDYM+   L+N ESF+  I GLC+  ++RKAM+LHDEML MGLKPD   
Sbjct: 495  KNGRIEEVVRVFDYMKGLKLVNSESFTITIRGLCRAKEMRKAMKLHDEMLDMGLKPDKAA 554

Query: 1442 YKRLIASF 1465
            YKRLI  F
Sbjct: 555  YKRLILEF 562


>ref|XP_002518527.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223542372|gb|EEF43914.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 599

 Score =  465 bits (1197), Expect = e-128
 Identities = 221/488 (45%), Positives = 336/488 (68%)
 Frame = +2

Query: 2    WSRKNLGFQPDLRVECKLVQTLIRFGLSQPARPILDSLVETHPLPQIVDALVVSCKGTDC 181
            W++ NL F PDL+ +C ++Q  +   L + A+ ILDSL++T+P    ++ +V +C+G   
Sbjct: 104  WAKTNLNFNPDLKSQCHVIQLSLGSDLPRAAKKILDSLIKTYPSNLFLETMVQACRGKSS 163

Query: 182  HSVVFSSVVECYTKKGLYLQALQVFRKVRDFGCVVSDHSCNGLLNVLQESNEIRLSWCFY 361
                 + V+E Y+ KG +L+ L+V++K+R  GC  S H+CN LL+ LQ  +EIRL+WCFY
Sbjct: 164  LLCTLNFVLEFYSHKGSFLEGLEVYKKMRVIGCTPSVHACNVLLDALQRESEIRLAWCFY 223

Query: 362  ASMLRNGVLASQYTWNVIARILSKDGKFEKIGQVIDMGVHNSLIYNLIIEFYSTSGNFKG 541
             +M+R GVL  ++TW+++A IL KDG FE+I +++DMG+ NS++YN ++++YS +G+FK 
Sbjct: 224  CAMIRVGVLPDKFTWSLVAHILCKDGNFERIVKLLDMGICNSVMYNAVVDYYSKNGDFKA 283

Query: 542  AVDRLNEMVDKKLEPGFVTYSSILDGACQHGNVEIIELMMESMEKNEHIPILPSLEYNSI 721
            A  RLNEM D+K+EPGF TYSSILDGAC+  N+++IE ++  M   + +   PS +Y+SI
Sbjct: 284  AFCRLNEMYDRKVEPGFSTYSSILDGACKCRNLQVIERVVAIMVGKQLLSKCPSSDYDSI 343

Query: 722  IQRLCEVGKTYAADMFFKRASAEKVELDEATYDCILRAFCNKGRAKDAIAIHEMMLESGT 901
            IQ+LC++GK  AA +FFKRA  E++ L +ATY  +LRAF  +G  ++AI +++++LE G 
Sbjct: 344  IQKLCDLGKVSAATLFFKRACDERIGLQDATYGRMLRAFSIEGILEEAIGLYQVILERGL 403

Query: 902  VAKDSCYKLFLNVLCNEYPSEKISKLLVDLIGRGFFPCLSELSKFIVSQCKNKKWREAED 1081
              KD+    F+++L  +    +  +++ D++ RGF PC S LSK+I   CK ++W+EAE+
Sbjct: 404  TIKDNASDAFVDLLSEKDQYAEGYEIVRDIMRRGFSPCTSSLSKYITLLCKKRRWKEAEE 463

Query: 1082 LIDVLLEKGFVPDSFSASCLVKHFCSSRRIDSAVALHNKLEHMRGTLDTNAYNXXXXXXX 1261
            L+ ++LEKG +PD+ S   LVKH+CSS++ D A+ALHN LE ++ +LD  AYN       
Sbjct: 464  LLYMVLEKGLLPDTLSFCSLVKHYCSSKQTDKALALHNTLEKLQASLDITAYNLLLGGLV 523

Query: 1262 XXXXXXXALKIFDYMRSHSLLNGESFSAMISGLCQENDLRKAMRLHDEMLKMGLKPDLKN 1441
                   ++K+FDYM+   L N  SF+ +I GLC+  +LRKAM+LHDEML MGLKPD   
Sbjct: 524  KEGRVEESIKVFDYMKGLKLANSASFTVIIRGLCRAKELRKAMKLHDEMLNMGLKPDKPT 583

Query: 1442 YKRLIASF 1465
            YKRLI  F
Sbjct: 584  YKRLILEF 591


>sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21170
          Length = 585

 Score =  355 bits (912), Expect = 2e-95
 Identities = 184/494 (37%), Positives = 306/494 (61%), Gaps = 5/494 (1%)
 Frame = +2

Query: 2    WSRKNLGFQPDLRVECKLVQTLIRFGLSQPARPILDSLVETHPLPQIVDALVVSCKGTDC 181
            +++ +L F+PDL+  C++++     GL + A  +L  LVET+ +  +V  +    +G   
Sbjct: 92   FAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRWFEGEVS 151

Query: 182  HSVVFSSVVECYTKKGLYLQALQVFRKVRDFGCVVSDHSCNGLLNVLQESNEIRLSWCFY 361
             SV  S V+E Y  KG +   L+VF  +R      S  + N LL  L + N+ R++ C Y
Sbjct: 152  LSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRVALCLY 211

Query: 362  ASMLRNGVLASQYTWNVIARILSKDGKFEKIGQVIDMGVHNSLIYNLIIEFYSTSGNFKG 541
            ++M+RNG+++ + TW++IA+IL + G+ + + ++++ GV +  IY  ++E YS +G F  
Sbjct: 212  SAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGEFDA 271

Query: 542  AVDRLNEMVDKKLEPGFVTYSSILDGACQHGNVEIIELMMESMEKNEHIPILPSLEYNSI 721
                ++EM DKKLE  F +Y  +LD AC+ G+ E I+ ++  M + + + +  S   + I
Sbjct: 272  VFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVNDKI 331

Query: 722  IQRLCEVGKTYAADMFFKRA-SAEKVELDEATYDCILRAFCNKGRAKDAIAIHEMMLESG 898
            I+RLC++GKT+A++M F++A + E V L ++TY C+L+A   K R K+A+ ++ M+   G
Sbjct: 332  IERLCDMGKTFASEMLFRKACNGETVRLWDSTYGCMLKALSRKKRTKEAVDVYRMICRKG 391

Query: 899  -TVAKDSCYKLFLNVLC-NEYPSEKISKLLVDLIGRGFFPCLSELSKFIVSQCKNKKWRE 1072
             TV  +SCY  F N LC ++  SE+  +LLVD+I RGF PC  +LS+ + S C+ ++W+ 
Sbjct: 392  ITVLDESCYIEFANALCRDDNSSEEEEELLVDVIKRGFVPCTHKLSEVLASMCRKRRWKS 451

Query: 1073 AEDLIDVLLEKGFVPDSFSASCLVKHFCSSRRIDSAVALHNKLEHMRGTLDTNAYN--XX 1246
            AE L+D ++E     DSF+   L++ +C S +++ A+ LH K++ M+G+LD NAYN    
Sbjct: 452  AEKLLDSVMEMEVYFDSFACGLLMERYCRSGKLEKALVLHEKIKKMKGSLDVNAYNAVLD 511

Query: 1247 XXXXXXXXXXXXALKIFDYMRSHSLLNGESFSAMISGLCQENDLRKAMRLHDEMLKMGLK 1426
                        A+ +F+YM+  + +N +SF+ MI GLC+  +++KAMR HDEML++GLK
Sbjct: 512  RLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHDEMLRLGLK 571

Query: 1427 PDLKNYKRLIASFK 1468
            PDL  YKRLI  FK
Sbjct: 572  PDLVTYKRLILGFK 585


>ref|NP_193849.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332659015|gb|AEE84415.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 551

 Score =  294 bits (752), Expect = 8e-77
 Identities = 171/494 (34%), Positives = 281/494 (56%), Gaps = 5/494 (1%)
 Frame = +2

Query: 2    WSRKNLGFQPDLRVECKLVQTLIRFGLSQPARPILDSLVETHPLPQIVDALVVSCKGTDC 181
            +++ +L F+PDL+  C++++     GL + A  +L  LVET+ +  +V  +    +G   
Sbjct: 92   FAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRWFEGEVS 151

Query: 182  HSVVFSSVVECYTKKGLYLQALQVFRKVRDFGCVVSDHSCNGLLNVLQESNEIRLSWCFY 361
             SV  S V+E Y  KG +   L+VF  +R      S  + N LL  L + N+ R++ C Y
Sbjct: 152  LSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRVALCLY 211

Query: 362  ASMLRNGVLASQYTWNVIARILSKDGKFEKIGQVIDMGVHNSLIYNLIIEFYSTSGNFKG 541
            ++M+RNG+++ + TW++IA+IL + G+ + + ++++ GV +  IY  ++E YS +G F  
Sbjct: 212  SAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGEFDA 271

Query: 542  AVDRLNEMVDKKLEPGFVTYSSILDGACQHGNVEIIELMMESMEKNEHIPILPSLEYNSI 721
                ++EM DKKLE  F +Y  +LD AC+ G+ E I+ ++  M + + + +  S   + I
Sbjct: 272  VFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVNDKI 331

Query: 722  IQRLCEVGKTYAADMFFKRA-SAEKVELDEATYDCILRAFCNKGRAKDAIAIHEMMLESG 898
            I+RLC++GKT+A++M F++A + E V L ++TY C+L+A   K R K+A+ ++ M+   G
Sbjct: 332  IERLCDMGKTFASEMLFRKACNGETVRLWDSTYGCMLKALSRKKRTKEAVDVYRMICRKG 391

Query: 899  -TVAKDSCYKLFLNVLC-NEYPSEKISKLLVDLIGRGFFPCLSELSKFIVSQCKNKKWRE 1072
             TV  +SCY  F N LC ++  SE+  +LLVD+I RG      + S  I    +  KWR 
Sbjct: 392  ITVLDESCYIEFANALCRDDNSSEEEEELLVDVIKRGKEDGNPQRSFLI----RLWKWR- 446

Query: 1073 AEDLIDVLLEKGFVPDSFSASCLVKHFCSSRRIDSAVALHNKLEHMRGTLDTNAYN--XX 1246
                                         S +++ A+ LH K++ M+G+LD NAYN    
Sbjct: 447  -----------------------------SGKLEKALVLHEKIKKMKGSLDVNAYNAVLD 477

Query: 1247 XXXXXXXXXXXXALKIFDYMRSHSLLNGESFSAMISGLCQENDLRKAMRLHDEMLKMGLK 1426
                        A+ +F+YM+  + +N +SF+ MI GLC+  +++KAMR HDEML++GLK
Sbjct: 478  RLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHDEMLRLGLK 537

Query: 1427 PDLKNYKRLIASFK 1468
            PDL  YKRLI  FK
Sbjct: 538  PDLVTYKRLILGFK 551


Top