BLASTX nr result

ID: Bupleurum21_contig00031931 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00031931
         (634 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI16054.3| unnamed protein product [Vitis vinifera]              220   1e-55
ref|XP_002280428.1| PREDICTED: pentatricopeptide repeat-containi...   220   1e-55
ref|XP_002520500.1| pentatricopeptide repeat-containing protein,...   207   2e-51
ref|XP_002313416.1| predicted protein [Populus trichocarpa] gi|2...   201   1e-49
ref|XP_003552616.1| PREDICTED: pentatricopeptide repeat-containi...   199   5e-49

>emb|CBI16054.3| unnamed protein product [Vitis vinifera]
          Length = 476

 Score =  220 bits (561), Expect = 1e-55
 Identities = 122/198 (61%), Positives = 143/198 (72%), Gaps = 13/198 (6%)
 Frame = +2

Query: 59  MLAFQGPQILQK-HPIHSLHYSSSISAQLTNPSPNF-LSIRPS-----------NNELIQ 199
           M AFQ PQ +Q+ H     H  ++IS     P P   L++RPS           NN LIQ
Sbjct: 1   MWAFQTPQTIQQPHLPKPFHKPTAIS-----PKPQCCLALRPSTTTRSNGDSNNNNPLIQ 55

Query: 200 SLCQKGNLKQALQLLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFL 379
           SLC++GNL QALQ+L +E  PTQHTYELLILSC+RQNSLP    +H  LI DG +QD FL
Sbjct: 56  SLCKQGNLNQALQVLSQEPNPTQHTYELLILSCTRQNSLPQGIDLHRHLIHDGSDQDPFL 115

Query: 380 ATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGV 559
           ATKLI+MYSELDSI+ ARKVFD+ R+RT++VWNA+FRALTL G G EVL LYRRMN +GV
Sbjct: 116 ATKLINMYSELDSIDNARKVFDKTRKRTIYVWNALFRALTLAGYGREVLDLYRRMNRIGV 175

Query: 560 LSDRFTYTYVLKACVVGE 613
            SDRFTYTYVLKACV  E
Sbjct: 176 PSDRFTYTYVLKACVASE 193



 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 39/149 (26%), Positives = 71/149 (47%), Gaps = 10/149 (6%)
 Frame = +2

Query: 185 NELIQSLCQKGNLKQALQLLYKESQ----PTQHTYELLILSCSRQNS----LPGAGIVHC 340
           N L ++L   G  ++ L L  + ++      + TY  ++ +C    +    L     +H 
Sbjct: 148 NALFRALTLAGYGREVLDLYRRMNRIGVPSDRFTYTYVLKACVASEAFVSLLLNGREIHG 207

Query: 341 RLIRDGFEQDTFLATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEE 520
            ++R GFE    + T L+DMY+    +  A +VFD++  + V  W+A+    +  G   E
Sbjct: 208 HILRHGFEGHVHIMTTLLDMYARFGCVLNASRVFDQMPVKNVVSWSAMIACYSKNGKPLE 267

Query: 521 VLSLYRRMNMVG--VLSDRFTYTYVLKAC 601
            L L+R+M +    +L +  T   VL+AC
Sbjct: 268 ALELFRKMMLENQDLLPNSVTMVSVLQAC 296


>ref|XP_002280428.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46790,
           chloroplastic [Vitis vinifera]
          Length = 658

 Score =  220 bits (561), Expect = 1e-55
 Identities = 122/198 (61%), Positives = 143/198 (72%), Gaps = 13/198 (6%)
 Frame = +2

Query: 59  MLAFQGPQILQK-HPIHSLHYSSSISAQLTNPSPNF-LSIRPS-----------NNELIQ 199
           M AFQ PQ +Q+ H     H  ++IS     P P   L++RPS           NN LIQ
Sbjct: 1   MWAFQTPQTIQQPHLPKPFHKPTAIS-----PKPQCCLALRPSTTTRSNGDSNNNNPLIQ 55

Query: 200 SLCQKGNLKQALQLLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFL 379
           SLC++GNL QALQ+L +E  PTQHTYELLILSC+RQNSLP    +H  LI DG +QD FL
Sbjct: 56  SLCKQGNLNQALQVLSQEPNPTQHTYELLILSCTRQNSLPQGIDLHRHLIHDGSDQDPFL 115

Query: 380 ATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGV 559
           ATKLI+MYSELDSI+ ARKVFD+ R+RT++VWNA+FRALTL G G EVL LYRRMN +GV
Sbjct: 116 ATKLINMYSELDSIDNARKVFDKTRKRTIYVWNALFRALTLAGYGREVLDLYRRMNRIGV 175

Query: 560 LSDRFTYTYVLKACVVGE 613
            SDRFTYTYVLKACV  E
Sbjct: 176 PSDRFTYTYVLKACVASE 193



 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 39/149 (26%), Positives = 71/149 (47%), Gaps = 10/149 (6%)
 Frame = +2

Query: 185 NELIQSLCQKGNLKQALQLLYKESQ----PTQHTYELLILSCSRQNS----LPGAGIVHC 340
           N L ++L   G  ++ L L  + ++      + TY  ++ +C    +    L     +H 
Sbjct: 148 NALFRALTLAGYGREVLDLYRRMNRIGVPSDRFTYTYVLKACVASEAFVSLLLNGREIHG 207

Query: 341 RLIRDGFEQDTFLATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEE 520
            ++R GFE    + T L+DMY+    +  A +VFD++  + V  W+A+    +  G   E
Sbjct: 208 HILRHGFEGHVHIMTTLLDMYARFGCVLNASRVFDQMPVKNVVSWSAMIACYSKNGKPLE 267

Query: 521 VLSLYRRMNMVG--VLSDRFTYTYVLKAC 601
            L L+R+M +    +L +  T   VL+AC
Sbjct: 268 ALELFRKMMLENQDLLPNSVTMVSVLQAC 296



 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 33/143 (23%), Positives = 71/143 (49%), Gaps = 6/143 (4%)
 Frame = +2

Query: 191 LIQSLCQKGNLKQALQLLYK---ESQ---PTQHTYELLILSCSRQNSLPGAGIVHCRLIR 352
           +I    + G   +AL+L  K   E+Q   P   T   ++ +C+   +L    ++H  ++R
Sbjct: 255 MIACYSKNGKPLEALELFRKMMLENQDLLPNSVTMVSVLQACAALAALEQGKLMHGYILR 314

Query: 353 DGFEQDTFLATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSL 532
            G +    + + L+ +Y+   ++E   +VF+R+ +R V  WN++  +  + G G + + +
Sbjct: 315 RGLDSILPVVSALVTVYARCGNLELGHRVFERMEKRDVVSWNSLISSYGIHGFGRKAIQI 374

Query: 533 YRRMNMVGVLSDRFTYTYVLKAC 601
           ++ M   G+     ++  VL AC
Sbjct: 375 FKEMIDQGLSPSPISFVSVLGAC 397


>ref|XP_002520500.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223540342|gb|EEF41913.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 414

 Score =  207 bits (526), Expect = 2e-51
 Identities = 110/192 (57%), Positives = 134/192 (69%), Gaps = 5/192 (2%)
 Frame = +2

Query: 59  MLAFQGPQILQKHPIHSLHYSSSISAQLTNPSPNFLSIRPS-----NNELIQSLCQKGNL 223
           M AF  PQ   + P HS  +S         P   F+++ P+     +N+LIQSLC++GNL
Sbjct: 1   MWAFHSPQATTQPPSHSTFFSRPTP----KPPICFVNLNPTIPTANSNKLIQSLCKQGNL 56

Query: 224 KQALQLLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFLATKLIDMY 403
           KQAL LL  E  P QHTYELL+LSC+ QNS   A  VH  L+ +GF+QD FLATKLI+MY
Sbjct: 57  KQALNLLCNEPDPAQHTYELLLLSCTHQNSFLDAQFVHQHLLDNGFDQDPFLATKLINMY 116

Query: 404 SELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGVLSDRFTYT 583
           S   SI+ ARKVFD+ R RT++V+NA+FRALTL GNGEEVL LYRRMN +G+ SDRFTYT
Sbjct: 117 SSFGSIDNARKVFDKTRSRTLYVYNALFRALTLVGNGEEVLRLYRRMNSIGMPSDRFTYT 176

Query: 584 YVLKACVVGEEL 619
           YVLKACV    L
Sbjct: 177 YVLKACVASNSL 188



 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 30/127 (23%), Positives = 65/127 (51%)
 Frame = +2

Query: 221 LKQALQLLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFLATKLIDM 400
           L + + L  K+  P   T   ++ +C+   +L    ++H  ++R G +    + + L+ M
Sbjct: 264 LFREMMLETKDMCPNSVTMVSVLQACAALAALEQGKLLHGYILRRGLDSILPVISSLVTM 323

Query: 401 YSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGVLSDRFTY 580
           Y+    ++ A+ VFD++ +R V  WN++  +  + G G++ + +++ M   GV     ++
Sbjct: 324 YARCGKLQLAQHVFDQMDKRDVVSWNSLISSYGVHGFGKKAIQIFKDMTRNGVFPSPISF 383

Query: 581 TYVLKAC 601
             VL AC
Sbjct: 384 VSVLGAC 390



 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 39/149 (26%), Positives = 68/149 (45%), Gaps = 10/149 (6%)
 Frame = +2

Query: 185 NELIQSLCQKGNLKQALQLLYKESQ----PTQHTYELLILSCSRQNSLPG----AGIVHC 340
           N L ++L   GN ++ L+L  + +       + TY  ++ +C   NSL         +H 
Sbjct: 141 NALFRALTLVGNGEEVLRLYRRMNSIGMPSDRFTYTYVLKACVASNSLLSLLKKGKEIHA 200

Query: 341 RLIRDGFEQDTFLATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEE 520
           +++R G+E    + T L+DMY+    +  A  VF  +  + V  W+A+       G   E
Sbjct: 201 QILRRGYEAHVHIMTTLVDMYARFGYVSYASCVFSEMSVKNVVSWSAMIACYAKNGRPFE 260

Query: 521 VLSLYRRMNM--VGVLSDRFTYTYVLKAC 601
            L L+R M +    +  +  T   VL+AC
Sbjct: 261 ALELFREMMLETKDMCPNSVTMVSVLQAC 289


>ref|XP_002313416.1| predicted protein [Populus trichocarpa] gi|222849824|gb|EEE87371.1|
           predicted protein [Populus trichocarpa]
          Length = 650

 Score =  201 bits (510), Expect = 1e-49
 Identities = 105/185 (56%), Positives = 133/185 (71%)
 Frame = +2

Query: 59  MLAFQGPQILQKHPIHSLHYSSSISAQLTNPSPNFLSIRPSNNELIQSLCQKGNLKQALQ 238
           M AFQ P+        +     S+   + + + N  +    NN+LIQSLC++GNL QAL+
Sbjct: 1   MWAFQSPKTTLLPSNATFLPRPSLKPPICSITLNPTASTADNNKLIQSLCKQGNLTQALE 60

Query: 239 LLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFLATKLIDMYSELDS 418
           LL  E  P QHTYELLILSC+ QNSL  A  VH  L+ +GF+QD FLATKLI+MYS  DS
Sbjct: 61  LLSLEPNPAQHTYELLILSCTHQNSLLDAQRVHRHLLENGFDQDPFLATKLINMYSFFDS 120

Query: 419 IECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGVLSDRFTYTYVLKA 598
           I+ ARKVFD+ R RT++V+NA+FRAL+L G+GEEVL++YRRMN +G+ SDRFTYTYVLKA
Sbjct: 121 IDNARKVFDKTRNRTIYVYNALFRALSLAGHGEEVLNMYRRMNSIGIPSDRFTYTYVLKA 180

Query: 599 CVVGE 613
           CV  E
Sbjct: 181 CVASE 185



 Score = 59.7 bits (143), Expect = 4e-07
 Identities = 35/143 (24%), Positives = 71/143 (49%), Gaps = 6/143 (4%)
 Frame = +2

Query: 191 LIQSLCQKGNLKQALQL---LYKESQ---PTQHTYELLILSCSRQNSLPGAGIVHCRLIR 352
           +I    + G   +AL+L   L  E+Q   P   T   ++ +C+   +L    ++H  ++R
Sbjct: 247 MIACYAKNGKAFEALELFRELMLETQDLCPNSVTMVSVLQACAALAALEQGRLIHGYILR 306

Query: 353 DGFEQDTFLATKLIDMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSL 532
            G +    + + L+ MY+    +E  ++VFD++ +R V  WN++  +  + G G++ + +
Sbjct: 307 KGLDSILPVISALVTMYARCGKLELGQRVFDQMDKRDVVSWNSLISSYGVHGFGKKAIGI 366

Query: 533 YRRMNMVGVLSDRFTYTYVLKAC 601
           +  M   GV     ++  VL AC
Sbjct: 367 FEEMTYNGVEPSPISFVSVLGAC 389


>ref|XP_003552616.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46790,
           chloroplastic-like [Glycine max]
          Length = 658

 Score =  199 bits (505), Expect = 5e-49
 Identities = 108/193 (55%), Positives = 139/193 (72%), Gaps = 8/193 (4%)
 Frame = +2

Query: 59  MLAFQGPQILQKHPIHS-LHYSSSISAQLT------NPSPNFLS-IRPSNNELIQSLCQK 214
           M   Q PQI++  P  S L Y+S +S+++       NPS N ++ I+ +NN+LIQSLC+ 
Sbjct: 1   MWVLQIPQIVRHAPSQSHLCYNSHVSSRVPVSFVSLNPSANLMNDIKGNNNQLIQSLCKG 60

Query: 215 GNLKQALQLLYKESQPTQHTYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFLATKLI 394
           GNLKQA+ LL  E  PTQ T+E LI SC++QNSL     VH RL+  GF+QD FLATKLI
Sbjct: 61  GNLKQAIHLLCCEPNPTQRTFEHLICSCAQQNSLSDGLDVHRRLVSSGFDQDPFLATKLI 120

Query: 395 DMYSELDSIECARKVFDRIRRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGVLSDRF 574
           +MY EL SI+ ARKVFD  R RT++VWNA+FRAL + G G+E+L LY +MN +G+ SDRF
Sbjct: 121 NMYYELGSIDRARKVFDETRERTIYVWNALFRALAMVGCGKELLDLYVQMNWIGIPSDRF 180

Query: 575 TYTYVLKACVVGE 613
           TYT+VLKACVV E
Sbjct: 181 TYTFVLKACVVSE 193



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 40/170 (23%), Positives = 77/170 (45%), Gaps = 5/170 (2%)
 Frame = +2

Query: 107 SLHYSSSISAQLTNPSPNFLSIRP-----SNNELIQSLCQKGNLKQALQLLYKESQPTQH 271
           S+ Y++S+   +  P+ NF+S        + NE+     +   L Q + L   +S P   
Sbjct: 233 SVSYANSVFCAM--PTKNFVSWSAMIACFAKNEMPMKALE---LFQLMMLEAHDSVPNSV 287

Query: 272 TYELLILSCSRQNSLPGAGIVHCRLIRDGFEQDTFLATKLIDMYSELDSIECARKVFDRI 451
           T   ++ +C+   +L    ++H  ++R G +    +   LI MY     I   ++VFD +
Sbjct: 288 TMVNVLQACAGLAALEQGKLIHGYILRRGLDSILPVLNALITMYGRCGEILMGQRVFDNM 347

Query: 452 RRRTVFVWNAIFRALTLKGNGEEVLSLYRRMNMVGVLSDRFTYTYVLKAC 601
           + R V  WN++     + G G++ + ++  M   G      ++  VL AC
Sbjct: 348 KNRDVVSWNSLISIYGMHGFGKKAIQIFENMIHQGSSPSYISFITVLGAC 397


Top