BLASTX nr result

ID: Bupleurum21_contig00022730

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00022730
         (484 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN70930.1| hypothetical protein VITISV_000387 [Vitis vinifera]   203   1e-50
ref|XP_002280644.1| PREDICTED: pentatricopeptide repeat-containi...   201   7e-50
ref|XP_003530188.1| PREDICTED: pentatricopeptide repeat-containi...   192   3e-47
ref|NP_188050.1| pentatricopeptide repeat-containing protein [Ar...   152   4e-35
dbj|BAB01039.1| unnamed protein product [Arabidopsis thaliana]        152   4e-35

>emb|CAN70930.1| hypothetical protein VITISV_000387 [Vitis vinifera]
          Length = 773

 Score =  203 bits (516), Expect = 1e-50
 Identities = 98/164 (59%), Positives = 129/164 (78%), Gaps = 3/164 (1%)
 Frame = -2

Query: 483 SKKSLQHGQRLYLQLLLSRNKAHQNLLQNPTIKSKLITLYTTCGEIEEARRVFDDGKPG- 307
           S+K+L+HGQRLYLQLLL R++ + NLL NPT+K KLITL++ CG ++EARRVF+DG    
Sbjct: 79  SRKALEHGQRLYLQLLLYRDRCNHNLLNNPTLKGKLITLFSVCGRVDEARRVFEDGGEDV 138

Query: 306 --GESVWVAMAIGYSRKGCFRETLYLYCDMLCQCVWPSNFAFSTALKACSELVDLKMGRS 133
              ESVWVAM IGYSR G  +E L LY +M+CQ     NFAFS ALKACS+L DL+ GR+
Sbjct: 139 DLPESVWVAMGIGYSRNGYPKEALLLYYEMVCQFGQLGNFAFSMALKACSDLGDLRTGRA 198

Query: 132 IHAQIVKSVDDPDQVVYNSLLRLYTECGCFDDVLQLFEEMPERD 1
           +HAQ++K+ +DPDQVV N+LLRLY+E GCF++ L++F+ MP R+
Sbjct: 199 VHAQVLKATEDPDQVVNNALLRLYSEDGCFEEALRMFDGMPHRN 242



 Score = 67.0 bits (162), Expect = 1e-09
 Identities = 49/158 (31%), Positives = 77/158 (48%), Gaps = 1/158 (0%)
 Frame = -2

Query: 471 LQHGQRLYLQLLLSRNKAHQNLLQNPTIKSKLITLYTTCGEIEEARRVFDDGKPGGESV- 295
           L+ G+ ++ Q+L +     Q       + + L+ LY+  G  EEA R+FD G P    V 
Sbjct: 193 LRTGRAVHAQVLKATEDPDQ------VVNNALLRLYSEDGCFEEALRMFD-GMPHRNLVS 245

Query: 294 WVAMAIGYSRKGCFRETLYLYCDMLCQCVWPSNFAFSTALKACSELVDLKMGRSIHAQIV 115
           W ++  G  +K    E +  +  M  + +  S    +T L  C+ +  L  G+ IHA IV
Sbjct: 246 WNSLIAGLVKKEGVFEAIEAFRIMQGKGMGFSWVTLTTILPVCARVTALGSGKEIHAVIV 305

Query: 114 KSVDDPDQVVYNSLLRLYTECGCFDDVLQLFEEMPERD 1
           KS   PD  V NSL+ +Y +CG  D   ++F  M  +D
Sbjct: 306 KSTAKPDAPVLNSLVDMYAKCGAMDYCRRVFNGMQGKD 343


>ref|XP_002280644.1| PREDICTED: pentatricopeptide repeat-containing protein At3g14330
           [Vitis vinifera] gi|296088358|emb|CBI36803.3| unnamed
           protein product [Vitis vinifera]
          Length = 652

 Score =  201 bits (510), Expect = 7e-50
 Identities = 98/164 (59%), Positives = 128/164 (78%), Gaps = 3/164 (1%)
 Frame = -2

Query: 483 SKKSLQHGQRLYLQLLLSRNKAHQNLLQNPTIKSKLITLYTTCGEIEEARRVFDDGKPG- 307
           S+K+L+HGQRLYLQLLL R++ + NLL NPT+K KLITL++ C  ++EARRVF+DG    
Sbjct: 80  SRKALEHGQRLYLQLLLYRDRCNHNLLNNPTLKGKLITLFSVCRRVDEARRVFEDGGEDV 139

Query: 306 --GESVWVAMAIGYSRKGCFRETLYLYCDMLCQCVWPSNFAFSTALKACSELVDLKMGRS 133
              ESVWVAM IGYSR G  +E L LY +M+CQ     NFAFS ALKACS+L DL+ GR+
Sbjct: 140 DLPESVWVAMGIGYSRNGYPKEALLLYYEMVCQFGQLGNFAFSMALKACSDLGDLQTGRA 199

Query: 132 IHAQIVKSVDDPDQVVYNSLLRLYTECGCFDDVLQLFEEMPERD 1
           +HAQ++K+ +DPDQVV N+LLRLY+E GCFD+ L++F+ MP R+
Sbjct: 200 VHAQVLKATEDPDQVVNNALLRLYSEDGCFDEALRVFDGMPHRN 243



 Score = 68.9 bits (167), Expect = 4e-10
 Identities = 50/158 (31%), Positives = 77/158 (48%), Gaps = 1/158 (0%)
 Frame = -2

Query: 471 LQHGQRLYLQLLLSRNKAHQNLLQNPTIKSKLITLYTTCGEIEEARRVFDDGKPGGESV- 295
           LQ G+ ++ Q+L +     Q       + + L+ LY+  G  +EA RVFD G P    V 
Sbjct: 194 LQTGRAVHAQVLKATEDPDQ------VVNNALLRLYSEDGCFDEALRVFD-GMPHRNVVS 246

Query: 294 WVAMAIGYSRKGCFRETLYLYCDMLCQCVWPSNFAFSTALKACSELVDLKMGRSIHAQIV 115
           W ++  G  +K    E +  +  M  + +  S    +T L  C+ +  L  G+ IHA IV
Sbjct: 247 WNSLIAGLVKKDGVFEAIEAFRIMQGKGMGFSWVTLTTILPVCARVTALGSGKEIHAVIV 306

Query: 114 KSVDDPDQVVYNSLLRLYTECGCFDDVLQLFEEMPERD 1
           KS   PD  V NSL+ +Y +CG  D   ++F  M  +D
Sbjct: 307 KSTAKPDAPVLNSLVDMYAKCGAMDYCRRVFNGMQGKD 344


>ref|XP_003530188.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g14330-like [Glycine max]
          Length = 650

 Score =  192 bits (487), Expect = 3e-47
 Identities = 101/164 (61%), Positives = 126/164 (76%), Gaps = 3/164 (1%)
 Frame = -2

Query: 483 SKKSLQHGQRLYLQLLLSRNKAHQNLLQNPTIKSKLITLYTTCGEIEEARRVF--DDGKP 310
           S++SL+HG++L+L LL S+N+    +L+NPT+K+KLITLY+ CG + EARRVF  DD KP
Sbjct: 82  SRRSLEHGRKLHLHLLRSQNR----VLENPTLKTKLITLYSVCGRVNEARRVFQIDDEKP 137

Query: 309 GGESVWVAMAIGYSRKGCFRETLYLYCDMLCQCVWPSNFAFSTALKACSELVDLKMGRSI 130
             E VWVAMAIGYSR G   E L LY DML  CV P NFAFS ALKACS+L +  +GR+I
Sbjct: 138 PEEPVWVAMAIGYSRNGFSHEALLLYRDMLSCCVKPGNFAFSMALKACSDLDNALVGRAI 197

Query: 129 HAQIVK-SVDDPDQVVYNSLLRLYTECGCFDDVLQLFEEMPERD 1
           HAQIVK  V + DQVV N+LL LY E GCFD+VL++FEEMP+R+
Sbjct: 198 HAQIVKHDVGEADQVVNNALLGLYVEIGCFDEVLKVFEEMPQRN 241



 Score = 62.4 bits (150), Expect = 4e-08
 Identities = 36/130 (27%), Positives = 65/130 (50%)
 Frame = -2

Query: 390 IKSKLITLYTTCGEIEEARRVFDDGKPGGESVWVAMAIGYSRKGCFRETLYLYCDMLCQC 211
           + + L+ LY   G  +E  +VF++        W  +  G++ +G   ETL  +  M  + 
Sbjct: 213 VNNALLGLYVEIGCFDEVLKVFEEMPQRNVVSWNTLIAGFAGQGRVFETLSAFRVMQREG 272

Query: 210 VWPSNFAFSTALKACSELVDLKMGRSIHAQIVKSVDDPDQVVYNSLLRLYTECGCFDDVL 31
           +  S    +T L  C+++  L  G+ IH QI+KS  + D  + NSL+ +Y +CG      
Sbjct: 273 MGFSWITLTTMLPVCAQVTALHSGKEIHGQILKSRKNADVPLLNSLMDMYAKCGEIGYCE 332

Query: 30  QLFEEMPERD 1
           ++F+ M  +D
Sbjct: 333 KVFDRMHSKD 342



 Score = 62.4 bits (150), Expect = 4e-08
 Identities = 43/158 (27%), Positives = 70/158 (44%), Gaps = 1/158 (0%)
 Frame = -2

Query: 474 SLQHGQRLYLQLLLSRNKAHQNLLQNPTIKSKLITLYTTCGEIEEARRVFDDGKPGGESV 295
           +L  G+ ++ Q+L SR  A   LL +      L+ +Y  CGEI    +VFD       + 
Sbjct: 292 ALHSGKEIHGQILKSRKNADVPLLNS------LMDMYAKCGEIGYCEKVFDRMHSKDLTS 345

Query: 294 WVAMAIGYSRKGCFRETLYLYCDMLCQCVWPSNFAFSTALKACSELVDLKMGRSIHAQIV 115
           W  M  G+S  G   E L L+ +M+   + P+   F   L  CS       G+ + + ++
Sbjct: 346 WNTMLAGFSINGQIHEALCLFDEMIRYGIEPNGITFVALLSGCSHSGLTSEGKRLFSNVM 405

Query: 114 KSVD-DPDQVVYNSLLRLYTECGCFDDVLQLFEEMPER 4
           +     P    Y  L+ +    G FD+ L + E +P R
Sbjct: 406 QDFGVQPSLEHYACLVDILGRSGKFDEALSVAENIPMR 443


>ref|NP_188050.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|218546762|sp|Q9LUL5.2|PP229_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At3g14330 gi|332641981|gb|AEE75502.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 710

 Score =  152 bits (383), Expect = 4e-35
 Identities = 81/164 (49%), Positives = 106/164 (64%), Gaps = 3/164 (1%)
 Frame = -2

Query: 483 SKKSLQHGQRLYLQLLLSRNKAHQNLLQNPTIKSKLITLYTTCGEIEEARRVFDDGKPGG 304
           S KSL HG ++   +L + +  H     NP + SKLITL++ C  ++ AR++FDD     
Sbjct: 143 SAKSLHHGIKICSLILNNPSLRH-----NPKLLSKLITLFSVCRRLDLARKIFDDVTDSS 197

Query: 303 ---ESVWVAMAIGYSRKGCFRETLYLYCDMLCQCVWPSNFAFSTALKACSELVDLKMGRS 133
              E VW AMAIGYSR G  R+ L +Y DMLC  + P NF+ S ALKAC +L DL++GR 
Sbjct: 198 LLTEKVWAAMAIGYSRNGSPRDALIVYVDMLCSFIEPGNFSISVALKACVDLKDLRVGRG 257

Query: 132 IHAQIVKSVDDPDQVVYNSLLRLYTECGCFDDVLQLFEEMPERD 1
           IHAQIVK  +  DQVVYN LL+LY E G FDD  ++F+ M ER+
Sbjct: 258 IHAQIVKRKEKVDQVVYNVLLKLYMESGLFDDARKVFDGMSERN 301



 Score = 73.2 bits (178), Expect = 2e-11
 Identities = 47/159 (29%), Positives = 80/159 (50%)
 Frame = -2

Query: 477 KSLQHGQRLYLQLLLSRNKAHQNLLQNPTIKSKLITLYTTCGEIEEARRVFDDGKPGGES 298
           K L+ G+ ++ Q++  + K  Q       + + L+ LY   G  ++AR+VFD        
Sbjct: 250 KDLRVGRGIHAQIVKRKEKVDQ------VVYNVLLKLYMESGLFDDARKVFDGMSERNVV 303

Query: 297 VWVAMAIGYSRKGCFRETLYLYCDMLCQCVWPSNFAFSTALKACSELVDLKMGRSIHAQI 118
            W ++    S+K    E   L+  M  + +  S    +T L ACS +  L  G+ IHAQI
Sbjct: 304 TWNSLISVLSKKVRVHEMFNLFRKMQEEMIGFSWATLTTILPACSRVAALLTGKEIHAQI 363

Query: 117 VKSVDDPDQVVYNSLLRLYTECGCFDDVLQLFEEMPERD 1
           +KS + PD  + NSL+ +Y +CG  +   ++F+ M  +D
Sbjct: 364 LKSKEKPDVPLLNSLMDMYGKCGEVEYSRRVFDVMLTKD 402


>dbj|BAB01039.1| unnamed protein product [Arabidopsis thaliana]
          Length = 717

 Score =  152 bits (383), Expect = 4e-35
 Identities = 81/164 (49%), Positives = 106/164 (64%), Gaps = 3/164 (1%)
 Frame = -2

Query: 483 SKKSLQHGQRLYLQLLLSRNKAHQNLLQNPTIKSKLITLYTTCGEIEEARRVFDDGKPGG 304
           S KSL HG ++   +L + +  H     NP + SKLITL++ C  ++ AR++FDD     
Sbjct: 150 SAKSLHHGIKICSLILNNPSLRH-----NPKLLSKLITLFSVCRRLDLARKIFDDVTDSS 204

Query: 303 ---ESVWVAMAIGYSRKGCFRETLYLYCDMLCQCVWPSNFAFSTALKACSELVDLKMGRS 133
              E VW AMAIGYSR G  R+ L +Y DMLC  + P NF+ S ALKAC +L DL++GR 
Sbjct: 205 LLTEKVWAAMAIGYSRNGSPRDALIVYVDMLCSFIEPGNFSISVALKACVDLKDLRVGRG 264

Query: 132 IHAQIVKSVDDPDQVVYNSLLRLYTECGCFDDVLQLFEEMPERD 1
           IHAQIVK  +  DQVVYN LL+LY E G FDD  ++F+ M ER+
Sbjct: 265 IHAQIVKRKEKVDQVVYNVLLKLYMESGLFDDARKVFDGMSERN 308



 Score = 73.2 bits (178), Expect = 2e-11
 Identities = 47/159 (29%), Positives = 80/159 (50%)
 Frame = -2

Query: 477 KSLQHGQRLYLQLLLSRNKAHQNLLQNPTIKSKLITLYTTCGEIEEARRVFDDGKPGGES 298
           K L+ G+ ++ Q++  + K  Q       + + L+ LY   G  ++AR+VFD        
Sbjct: 257 KDLRVGRGIHAQIVKRKEKVDQ------VVYNVLLKLYMESGLFDDARKVFDGMSERNVV 310

Query: 297 VWVAMAIGYSRKGCFRETLYLYCDMLCQCVWPSNFAFSTALKACSELVDLKMGRSIHAQI 118
            W ++    S+K    E   L+  M  + +  S    +T L ACS +  L  G+ IHAQI
Sbjct: 311 TWNSLISVLSKKVRVHEMFNLFRKMQEEMIGFSWATLTTILPACSRVAALLTGKEIHAQI 370

Query: 117 VKSVDDPDQVVYNSLLRLYTECGCFDDVLQLFEEMPERD 1
           +KS + PD  + NSL+ +Y +CG  +   ++F+ M  +D
Sbjct: 371 LKSKEKPDVPLLNSLMDMYGKCGEVEYSRRVFDVMLTKD 409

