BLASTX nr result

ID: Bupleurum21_contig00037847 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00037847
         (771 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAD99219.1| polypeptide with a gag-like domain [Petunia x hy...   169   8e-40
gb|ABD32757.1| Integrase, catalytic region [Medicago truncatula]      160   3e-37
ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817...   157   2e-36
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis...   157   2e-36
emb|CAB78071.1| putative protein [Arabidopsis thaliana]               154   2e-35

>dbj|BAD99219.1| polypeptide with a gag-like domain [Petunia x hybrida]
          Length = 463

 Score =  169 bits (427), Expect = 8e-40
 Identities = 78/229 (34%), Positives = 136/229 (59%), Gaps = 1/229 (0%)
 Frame = -3

Query: 685 YYIHLSDASSTQLMSVKFNGSGFTNWKRSMILTLSAKNKLGFVNGVISITDPNTEEYRYW 506
           YY+  SDA    L++  F+GS + NWKR ++++LSAKNKLGF+ G     D     +  W
Sbjct: 37  YYLASSDAPGMNLINTSFDGSSYGNWKRGVLISLSAKNKLGFITGAYKKPDKEDLLFEQW 96

Query: 505 ERCNNLVISWILFNLEESIAKSVLFLPTTKDIWDDLEDRYGYASMAQVFSLEQQLSELKQ 326
            RC+++V++W+L +L + IA+SVL+  T +++W +LE RYG     ++F L+++L+ + Q
Sbjct: 97  RRCSDMVLAWLLNSLSKEIAESVLYSQTAQELWQELEQRYGQIDGTKMFQLQRELNNVSQ 156

Query: 325 GSDSISEFFTKVKTVWDSVNDASPVPYCTCNKCTCNLTXXXXXXXXXXXXXXXXXXFSEE 146
           G++ ++ +F K+K +WD +   +    C+C +C C                      +E 
Sbjct: 157 GTNDVAAYFNKLKRIWDQMKVLNTFMVCSC-ECNCEAKGHNAKMQEDQQLIQFLMGLNEV 215

Query: 145 FASVRGNILMIQPLPTISQAYKLFAQEERHREISRLAS-QTENLAFYAA 2
           ++ +RGNILM++PLP+ +QAY + + EE  R I+   +  T++ AF A+
Sbjct: 216 YSGIRGNILMMKPLPSTAQAYSIISHEETQRGIAAGNNVSTDSAAFNAS 264


>gb|ABD32757.1| Integrase, catalytic region [Medicago truncatula]
          Length = 1157

 Score =  160 bits (405), Expect = 3e-37
 Identities = 82/222 (36%), Positives = 127/222 (57%), Gaps = 4/222 (1%)
 Frame = -3

Query: 715 LQNNQDPS---SIYYIHLSDASSTQLMSVKFNGSGFTNWKRSMILTLSAKNKLGFVNGVI 545
           ++ N  PS   S YYIH SD  S+ +++ K NGS +  W RSM   L AKNKL F++G I
Sbjct: 2   VRGNSAPSNSDSPYYIHPSDGPSSLIITPKLNGSNYLAWHRSMQRALGAKNKLVFLDGSI 61

Query: 544 SITDPNTEEYRYWERCNNLVISWILFNLEESIAKSVLFLPTTKDIWDDLEDRYGYASMAQ 365
           S+ D +    + WERCN+L+ SWI+ ++ ESIA++++F  T    WDDL++ +      +
Sbjct: 62  SVPDIDDLNRQAWERCNHLIHSWIVNSVTESIAQTIVFHDTALSAWDDLKECFSKVDRVR 121

Query: 364 VFSLEQQLSELKQGSDSISEFFTKVKTVWDSVNDASPVPYCTC-NKCTCNLTXXXXXXXX 188
           V SL   ++ LKQG+ S+ ++F ++ T+WD +N   P+P CTC + C C           
Sbjct: 122 VLSLRSTINNLKQGTKSVLDYFIELCTLWDELNSHRPIPNCTCIHPCRCESIRLAKYYRT 181

Query: 187 XXXXXXXXXXFSEEFASVRGNILMIQPLPTISQAYKLFAQEE 62
                      ++ F+ V+  IL++ PLP I++ Y L  QEE
Sbjct: 182 EDQILQFLTGLNDTFSVVKTQILLMDPLPPINKVYSLVVQEE 223


>ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817175 [Glycine max]
          Length = 2045

 Score =  157 bits (398), Expect = 2e-36
 Identities = 75/227 (33%), Positives = 133/227 (58%), Gaps = 1/227 (0%)
 Frame = -3

Query: 736  KSFYIMTLQNNQDPSSIYYIHLSDASSTQLMSVKFNGSGFTNWKRSMILTLSAKNKLGFV 557
            KS    T  NN +  S  Y+H S+  +T L+S   + + + +W RSM+  LSAKNK+ F+
Sbjct: 345  KSTMNETSINNME--SYLYLHPSENPATALVSPVLDSTNYHSWSRSMVTALSAKNKVEFI 402

Query: 556  NGVISITDPNTEEYRYWERCNNLVISWILFNLEESIAKSVLFLPTTKDIWDDLEDRYGYA 377
            +G           +  W RCNN+V+SWI+ ++  SI +S+L++   ++IW DL+ RY   
Sbjct: 403  DGSAPEPLKTDRMHGAWCRCNNMVVSWIVHSVATSIRQSILWMDKAEEIWRDLKSRYSQG 462

Query: 376  SMAQVFSLEQQLSELKQGSDSISEFFTKVKTVWDSVNDASPVPYCTCN-KCTCNLTXXXX 200
             + ++  L+Q+ S +KQG+ +++E+FT ++ +WD + +  P P C+CN +C+CN      
Sbjct: 463  DLLRISDLQQEASTMKQGTLTVTEYFTCLRVIWDEIENFRPDPICSCNIRCSCNAFTIIA 522

Query: 199  XXXXXXXXXXXXXXFSEEFASVRGNILMIQPLPTISQAYKLFAQEER 59
                           +E++A++R ++L++ P+PTIS+ +   AQ+ER
Sbjct: 523  QRKLEDRAMQFLRGLNEQYANIRSHVLLMDPIPTISKIFSYVAQQER 569


>emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
           gi|7268152|emb|CAB78488.1| retrovirus-related like
           polyprotein [Arabidopsis thaliana]
          Length = 1489

 Score =  157 bits (398), Expect = 2e-36
 Identities = 71/226 (31%), Positives = 136/226 (60%), Gaps = 1/226 (0%)
 Frame = -3

Query: 685 YYIHLSDASSTQLMSVKFN-GSGFTNWKRSMILTLSAKNKLGFVNGVISITDPNTEEYRY 509
           YY+H +D +   L+S +    S F +W+RS+++ L+ +NKLGF+NG I+    +  ++  
Sbjct: 34  YYLHSADHAGLILVSDRLTTASDFHSWRRSILMALNVRNKLGFINGTITKPPEDHRDFGA 93

Query: 508 WERCNNLVISWILFNLEESIAKSVLFLPTTKDIWDDLEDRYGYASMAQVFSLEQQLSELK 329
           W RCN++V +W++ ++++ I +S+L++ T + IW++L  R+      ++F +EQ+LS+++
Sbjct: 94  WSRCNDIVSTWLMNSVDKKIGQSLLYIATVQGIWNNLLSRFKQDDAPRIFDIEQKLSKIE 153

Query: 328 QGSDSISEFFTKVKTVWDSVNDASPVPYCTCNKCTCNLTXXXXXXXXXXXXXXXXXXFSE 149
           QGS  IS ++T + T+W+   +   +P CTC +C C+                     +E
Sbjct: 154 QGSMDISTYYTALLTLWEEHRNYVELPVCTCGRCECDAAVKWEHLQQRSRVTKFLKELNE 213

Query: 148 EFASVRGNILMIQPLPTISQAYKLFAQEERHREISRLASQTENLAF 11
            F   R +ILM++P+PTI +A+ +  Q+ER R +  L ++ +++AF
Sbjct: 214 GFDQTRRHILMLKPIPTIKEAFNMVTQDERQRNVKPL-TRVDSVAF 258


>emb|CAB78071.1| putative protein [Arabidopsis thaliana]
          Length = 290

 Score =  154 bits (389), Expect = 2e-35
 Identities = 72/229 (31%), Positives = 134/229 (58%), Gaps = 1/229 (0%)
 Frame = -3

Query: 685 YYIHLSDASSTQLMSVK-FNGSGFTNWKRSMILTLSAKNKLGFVNGVISITDPNTEEYRY 509
           Y++H SD +   L+S +  +GS F +W+RS+ + L+ +NKLGF++G I        +   
Sbjct: 15  YFLHSSDQAGLILVSDRPSSGSEFHSWRRSVRMALNVRNKLGFIDGTIPQPPSTHRDAGS 74

Query: 508 WERCNNLVISWILFNLEESIAKSVLFLPTTKDIWDDLEDRYGYASMAQVFSLEQQLSELK 329
           W RCN++V +W++ ++ + I +S+LF+ T + IW +L  R+      +VF +EQ+L  L+
Sbjct: 75  WSRCNDMVATWLMNSVSKKIGQSLLFMSTAESIWKNLLSRFKQDDAPRVFEIEQRLGSLQ 134

Query: 328 QGSDSISEFFTKVKTVWDSVNDASPVPYCTCNKCTCNLTXXXXXXXXXXXXXXXXXXFSE 149
           QGS  +S ++T++ T+W+   +   +P CTC +C CN +                   +E
Sbjct: 135 QGSMDVSTYYTELVTLWEEYKNYIELPLCTCGRCECNASALWEKMQQRSRVTKFLMGLNE 194

Query: 148 EFASVRGNILMIQPLPTISQAYKLFAQEERHREISRLASQTENLAFYAA 2
            + + + +ILM++P+PTI   + + AQ+ER + I  +A + +N+AF ++
Sbjct: 195 AYEATQRHILMLKPIPTIEDVFNMVAQDERQKSIKPVA-KMDNVAFQSS 242
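
The statistics lines in the alignments above can be checked by hand or with a few lines of script. The following is a minimal sketch, not part of the BLAST output: the helper names `parse_stats` and `aligned_codons` are hypothetical, and the input string is copied from the first hit (dbj|BAD99219.1). It recomputes the percentages from the reported counts and confirms that the first query row (685 down to 506, Frame = -3) spans 60 codons, matching the 60 residues shown on that row.

```python
import re

# Statistics line copied verbatim from the first alignment (dbj|BAD99219.1).
STATS = " Identities = 78/229 (34%), Positives = 136/229 (59%), Gaps = 1/229 (0%)"


def parse_stats(line):
    """Return {name: (count, total, percent)} for one BLAST statistics line."""
    out = {}
    for name, count, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        count, total = int(count), int(total)
        out[name] = (count, total, 100.0 * count / total)
    return out


def aligned_codons(q_start, q_end):
    """Codons covered by a translated query row (inclusive nucleotide coordinates).

    Works for either strand: on a negative frame the start is larger than the end.
    """
    return (abs(q_start - q_end) + 1) // 3


parsed = parse_stats(STATS)
for name, (count, total, pct) in parsed.items():
    print(f"{name}: {count}/{total} = {pct:.1f}%")

# First query row of the first alignment: "Query: 685 ... 506".
print("codons in first row:", aligned_codons(685, 506))
```

The same two helpers apply unchanged to the other four hits; only the copied statistics line and the row coordinates differ.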

