BLASTX nr result

ID: Bupleurum21_contig00020593 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00020593
         (1775 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   123   7e-50
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   108   1e-36
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       107   2e-34
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   113   9e-33
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           113   1e-32

>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1|
           predicted protein [Populus trichocarpa]
          Length = 517

 Score =  123 bits (308), Expect(3) = 7e-50
 Identities = 61/166 (36%), Positives = 88/166 (53%), Gaps = 1/166 (0%)
 Frame = +2

Query: 476 SAKFLSFAGRLQLIRSVLYGILGYWSSHIFLPMCILKKVQSLMSKFLWNGSYLGRAHHKV 655
           + + LS+AGR+QLI SVL+ I  YW+S   LP  ++K V+ +M  FLW+GS +     KV
Sbjct: 88  TCRTLSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGAKV 147

Query: 656 SWSTCCTXXXXXXXXXKNIFDLNTSAILMQIWRSIQPNPSSLWLKWLNLYVFRSK-FWTM 832
           +W   C          K+I + N  A+L  IW     +  S+W  W+   + R + FWT+
Sbjct: 148 AWDQVCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFWTI 207

Query: 833 SIPGNCSWIVRNILKARPLAMQHVQYEVRSGSSIFLWHDPWVNRTP 970
             P NCSW    ILK R LA   ++Y +  G +  LW D W   +P
Sbjct: 208 KTPQNCSWAWGKILKLRSLAWPKMKYIIGDGMTTSLWFDNWHPHSP 253



 Score = 80.9 bits (198), Expect(3) = 7e-50
 Identities = 41/136 (30%), Positives = 69/136 (50%), Gaps = 6/136 (4%)
 Frame = +3

Query: 1026 AKVDSIIVNGHWHISSNHVLATNLRHKV----NQTRLHSSDRISW--DGNRRVRMSVVWN 1187
            AKV+ +I N  W   +   +  +   +     +  ++   D + W    N R  + V W 
Sbjct: 272  AKVNVLIQNSEWKTPTTQAIGWHPIIEAIPSNSNPKMGQKDELVWLDSPNHRFSVKVAWE 331

Query: 1188 SIRRTDHAPSWVSTCWNSLSIPKCSFMLWLLFHGRLSTRDRLIRFGINVPSVCLLCNTAD 1367
             +RR      W    W   ++P+ SF+LW+    +L+T+D+L RFGI+ P+ C LC   +
Sbjct: 332  QLRRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIHGPNRCSLCLRNN 391

Query: 1368 ESIDHLFFECPFSRHI 1415
            E  +HLFFEC +++ I
Sbjct: 392  EDHNHLFFECSYTKAI 407



 Score = 42.7 bits (99), Expect(3) = 7e-50
 Identities = 21/63 (33%), Positives = 33/63 (52%)
 Frame = +2

Query: 1448 WKSVIHGGTSIIQQAFYLFIACVSYHIWQERNKRVYGGTYVPAFQLVADIKRMFREKLFL 1627
            W +V   G S +  +  L  A   YH+WQERN R++ G       ++  I+ + R+KL L
Sbjct: 430  WATVSWHGKSFVNFSCKLSFAATVYHVWQERNARIFAGMSRTPNLVLNQIECIIRDKLDL 489

Query: 1628 SKN 1636
             +N
Sbjct: 490  MRN 492


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  108 bits (270), Expect(3) = 1e-36
 Identities = 58/171 (33%), Positives = 84/171 (49%), Gaps = 1/171 (0%)
 Frame = +2

Query: 476  SAKFLSFAGRLQLIRSVLYGILGYWSSHIFLPMCILKKVQSLMSKFLWNGSYLGRAHHKV 655
            + K LSFAGRLQLI SV+Y  + +W S   LP C LK ++ + ++FLW      R   KV
Sbjct: 793  ATKTLSFAGRLQLISSVIYSTVNFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKV 852

Query: 656  SWSTCCTXXXXXXXXXKNIFDLNTSAILMQIWRSIQPNPSSLWLKWLNLYVFRS-KFWTM 832
            SW   C          +N +  N +  L  IW  +     SLW+ W +    R   FW  
Sbjct: 853  SWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIW-MLFARRDSLWVAWNHANRLRHVNFWNA 911

Query: 833  SIPGNCSWIVRNILKARPLAMQHVQYEVRSGSSIFLWHDPWVNRTPPVMSV 985
                + SWI + IL  RPLA + ++  V +G  +  W+D W N  P + ++
Sbjct: 912  EAASHHSWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAI 962



 Score = 62.8 bits (151), Expect(3) = 1e-36
 Identities = 42/147 (28%), Positives = 64/147 (43%), Gaps = 11/147 (7%)
 Frame = +3

Query: 990  PNIISIVESSVLAKVDSIIVNGHWHISS---NHVLATNLRHKVNQTRLHSSDR----ISW 1148
            P +  I ES+V+ +  S   +  W + S    +    NLR  +  +   S DR     +W
Sbjct: 967  PQLTGIHESAVVTEASS---STGWILPSARTRNASLANLRSTLLNSPAPSGDRGEDTYTW 1023

Query: 1149 --DGNRRVRMS--VVWNSIRRTDHAPSWVSTCWNSLSIPKCSFMLWLLFHGRLSTRDRLI 1316
              +G+     S  + W  +R+ D    W +  W    IPK +F  W+    RL  R R  
Sbjct: 1024 YIEGSSSTSFSSKLTWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTT 1083

Query: 1317 RFGINVPSVCLLCNTADESIDHLFFEC 1397
             +  N PS+C +C    E+ DHLF  C
Sbjct: 1084 HWSTNRPSLCCVCQRETETRDHLFIHC 1110



 Score = 30.8 bits (68), Expect(3) = 1e-36
 Identities = 21/78 (26%), Positives = 35/78 (44%), Gaps = 8/78 (10%)
 Frame = +2

Query: 1442 QDWKSVIHGGTSIIQQAFY-----LFIACVSYHIWQERNKRVYGGTYVPAFQLVADIKRM 1606
            ++WK +I    S  Q +F      L +    +HIW+ERN R++         +   I R 
Sbjct: 1131 REWKDIIEWMLSN-QGSFSGTLKKLAVQTAIFHIWKERNSRLHSAMSASHTAIFKQIDRS 1189

Query: 1607 FREKL---FLSKNFKSLV 1651
             R+ +      +NFK L+
Sbjct: 1190 IRDSILARITRRNFKDLL 1207


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  107 bits (266), Expect(2) = 2e-34
 Identities = 56/164 (34%), Positives = 86/164 (52%), Gaps = 1/164 (0%)
 Frame = +2

Query: 482  KFLSFAGRLQLIRSVLYGILGYWSSHIFLPMCILKKVQSLMSKFLWNGSYLGRAHHKVSW 661
            K LSFAGR+QLI SV++G + +W S   LP   +K+++SL S+FLW+G+       KVSW
Sbjct: 796  KCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSW 855

Query: 662  STCCTXXXXXXXXXKNIFDLNTSAILMQIWRSIQPNPSSLWLKWLNL-YVFRSKFWTMSI 838
            +  C          + + + N +  +  IWR       SLW  W +L ++ R  FW +  
Sbjct: 856  AALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAK-DSLWADWQHLHHLSRGSFWAVEG 914

Query: 839  PGNCSWIVRNILKARPLAMQHVQYEVRSGSSIFLWHDPWVNRTP 970
              + SW  + +L  RPLA Q +  +V +G     W+D W +  P
Sbjct: 915  GQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGP 958



 Score = 67.0 bits (162), Expect(2) = 2e-34
 Identities = 43/142 (30%), Positives = 59/142 (41%), Gaps = 10/142 (7%)
 Frame = +3

Query: 1020 VLAKVDSIIVNGHWHISSNHVLATNLRHK------VNQTRLHSSDRISWDGN----RRVR 1169
            +LAKV S      W +  +        H       V  T     DR  W  N    +   
Sbjct: 975  LLAKVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQGFS 1034

Query: 1170 MSVVWNSIRRTDHAPSWVSTCWNSLSIPKCSFMLWLLFHGRLSTRDRLIRFGINVPSVCL 1349
             +  W +IR      SW S+ W   ++PK +F +W+    RL TR RL  +G      C+
Sbjct: 1035 AAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACV 1094

Query: 1350 LCNTADESIDHLFFECPFSRHI 1415
            LC+ A ES DHL   C FS  +
Sbjct: 1095 LCSFASESRDHLLLICEFSAQV 1116


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  113 bits (283), Expect(2) = 9e-33
 Identities = 58/165 (35%), Positives = 90/165 (54%), Gaps = 1/165 (0%)
 Frame = +2

Query: 479  AKFLSFAGRLQLIRSVLYGILGYWSSHIFLPMCILKKVQSLMSKFLWNGSYLGRAHHKVS 658
            +K LSFAGR QLI SV++G++ +W S   LP   +KK++SL SKFLW GS  GR   KVS
Sbjct: 655  SKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVS 714

Query: 659  WSTCCTXXXXXXXXXKNIFDLNTSAILMQIWRSIQPNPSSLWLKWLNLY-VFRSKFWTMS 835
            W  CC          ++  + N + +L  IW  +    +SLW +W   + +  + FW ++
Sbjct: 715  WVDCCLPKSEGGLGFRSFGEWNKTLLLRLIW-VLFDRDTSLWAQWQRHHRLGHASFWQVN 773

Query: 836  IPGNCSWIVRNILKARPLAMQHVQYEVRSGSSIFLWHDPWVNRTP 970
                  W  + +L  RPLA + ++ +V +G ++  W D W +  P
Sbjct: 774  ALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGP 818



 Score = 55.1 bits (131), Expect(2) = 9e-33
 Identities = 38/139 (27%), Positives = 58/139 (41%), Gaps = 9/139 (6%)
 Frame = +3

Query: 1026 AKVDSIIVNGHWHISSNHVLATN--LRHKVN---QTRLHSSDRISWDGN----RRVRMSV 1178
            AKV   I    W +  +  L  +  L H  +    + L  SD  SW  +    +    + 
Sbjct: 837  AKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAK 896

Query: 1179 VWNSIRRTDHAPSWVSTCWNSLSIPKCSFMLWLLFHGRLSTRDRLIRFGINVPSVCLLCN 1358
             W  +R       W  + W   ++PK +F  W     RL TR RL+ +G+   + C LC+
Sbjct: 897  TWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCS 956

Query: 1359 TADESIDHLFFECPFSRHI 1415
               E+ DHL   C FS  +
Sbjct: 957  FDTETRDHLLLLCDFSSQV 975


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  113 bits (283), Expect(2) = 1e-32
 Identities = 58/165 (35%), Positives = 90/165 (54%), Gaps = 1/165 (0%)
 Frame = +2

Query: 479  AKFLSFAGRLQLIRSVLYGILGYWSSHIFLPMCILKKVQSLMSKFLWNGSYLGRAHHKVS 658
            +K LSFAGR QLI SV++G++ +W S   LP   +KK++SL SKFLW GS  GR   KVS
Sbjct: 655  SKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVS 714

Query: 659  WSTCCTXXXXXXXXXKNIFDLNTSAILMQIWRSIQPNPSSLWLKWLNLY-VFRSKFWTMS 835
            W  CC          ++  + N + +L  IW  +    +SLW +W   + +  + FW ++
Sbjct: 715  WVDCCLPKSEGGLGFRSFGEWNKTLLLRLIW-VLFDRDTSLWAQWQRHHRLGHASFWQVN 773

Query: 836  IPGNCSWIVRNILKARPLAMQHVQYEVRSGSSIFLWHDPWVNRTP 970
                  W  + +L  RPLA + ++ +V +G ++  W D W +  P
Sbjct: 774  ALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGP 818



 Score = 54.7 bits (130), Expect(2) = 1e-32
 Identities = 38/139 (27%), Positives = 58/139 (41%), Gaps = 9/139 (6%)
 Frame = +3

Query: 1026 AKVDSIIVNGHWHISSNHVLATN--LRHKVN---QTRLHSSDRISWDGN----RRVRMSV 1178
            AKV   I    W +  +  L  +  L H  +    + L  SD  SW  +    +    + 
Sbjct: 837  AKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAK 896

Query: 1179 VWNSIRRTDHAPSWVSTCWNSLSIPKCSFMLWLLFHGRLSTRDRLIRFGINVPSVCLLCN 1358
             W  +R       W  + W   ++PK +F  W     RL TR RL+ +G+   + C LC+
Sbjct: 897  TWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCS 956

Query: 1359 TADESIDHLFFECPFSRHI 1415
               E+ DHL   C FS  +
Sbjct: 957  FDTETRDHLLLLCDFSSQV 975

