BLASTX nr result
ID: Bupleurum21_contig00019016
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Bupleurum21_contig00019016 (1576 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2... 214 7e-53 gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] 196 1e-47 gb|AAC33226.1| putative non-LTR retroelement reverse transcripta... 194 7e-47 dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana] 189 1e-45 gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc... 187 5e-45 >ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1| predicted protein [Populus trichocarpa] Length = 517 Score = 214 bits (544), Expect = 7e-53 Identities = 125/397 (31%), Positives = 192/397 (48%), Gaps = 15/397 (3%) Frame = +3 Query: 3 RLQLIKSVLLGIQGFWCMYLFLPNGVLQKIQSILSKFLWGGPSADTVHYKVAWNTCCLPL 182 R+QLI SVL IQ +W LP V++ ++ I+ FLW G T KVAW+ CLP Sbjct: 97 RVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGAKVAWDQVCLPK 156 Query: 183 SEGGLGIRNMFIWNRASILFNLWRLLRPSE-SPWIRWFHSYVLKSKSIWEASVPANSSWA 359 EGGLGI+++ WN+ ++L ++W L S+ S W W S +L+ ++ W P N SWA Sbjct: 157 KEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFWTIKTPQNCSWA 216 Query: 360 VRKIMNSRSQALQFLQYTVGKDSQFSLWFDPW-PRPSLVDRFGRXXXXXXXXXXXARVST 536 KI+ RS A ++Y +G SLWFD W P L D +G A+V+ Sbjct: 217 WGKILKLRSLAWPKMKYIIGDGMTTSLWFDNWHPHSPLADSYGERFIYDSGMAKNAKVNV 276 Query: 537 LQRSDQWAPFHSNHSLVIELRHLLQSI------TIASEDRITWDGITN--VKITNIWDSI 692 L ++ +W + + I ++++I + +D + W N + W+ + Sbjct: 277 LIQNSEW---KTPTTQAIGWHPIIEAIPSNSNPKMGQKDELVWLDSPNHRFSVKVAWEQL 333 Query: 693 RPHASTSAWAIALWHSWAIPRCTFTAWLALQDRLLTRDRMSRFGFNNDLCCVLCNSDAES 872 R H W +W A+PR +F W+A+Q +L T+D++ RFG + C LC + E Sbjct: 334 RRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIHGPNRCSLCLRNNED 393 Query: 873 AKHLFAECSFSS----QVLAACPF-QLLGNWNEYCNGNFVQAHFTNVEKQFGILFFAVAV 1037 HLF ECS++ V C ++ W+E+ V H + L FA V Sbjct: 394 HNHLFFECSYTKAIWWDVCDRCDIPRMTKGWDEWIRWATVSWHGKSFVNFSCKLSFAATV 453 Query: 1038 HSIWKERNVRTHPSSAVAPKNVARLIFDIKSTIRAKL 1148 + +W+ERN R + P V + I+ IR KL Sbjct: 454 YHVWQERNARIFAGMSRTPNLV---LNQIECIIRDKL 487 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 196 bits (498), Expect = 1e-47 Identities = 117/387 (30%), Positives = 182/387 (47%), Gaps = 16/387 (4%) Frame = +3 Query: 3 RLQLIKSVLLGIQGFWCMYLFLPNGVLQKIQSILSKFLWGGPSADTVHYKVAWNTCCLPL 182 R+QLI SV+ G FW LP G +++I+S+ S+FLW G KV+W CLP Sbjct: 803 RIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPK 862 Query: 183 SEGGLGIRNMFIWNRASILFNLWRLLRPSESPWIRWFHSYVLKSKSIWEASVPANSSWAV 362 SEGGLG+R + WN+ + +WRL +S W W H + L S W + SW Sbjct: 863 SEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTW 922 Query: 363 RKIMNSRSQALQFLQYTVGKDSQFSLWFDPWPRPSLVDR-FGRXXXXXXXXXXXARVSTL 539 +++++ R A QFL VG + W+D W + R G A+V++ Sbjct: 923 KRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLAKVASA 982 Query: 540 QRSDQWAPFHSNHSLVIELRHLLQSITIASE-----DRITWDG----ITNVKITNIWDSI 692 D W S + + L ++ + S DR W W++I Sbjct: 983 FSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQGFSAAKTWEAI 1042 Query: 693 RPHASTSAWAIALWHSWAIPRCTFTAWLALQDRLLTRDRMSRFGFNNDLCCVLCNSDAES 872 RP A+ +WA ++W A+P+ F W++ +RLLTR R++ +G CVLC+ +ES Sbjct: 1043 RPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACVLCSFASES 1102 Query: 873 AKHLFAECSFSSQV-----LAACPFQ-LLGNWNEYCNGNFVQAHFTNVEKQFGILFFAVA 1034 HL C FS+QV CP Q L +W+E ++V+ + V Sbjct: 1103 RDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELL--SWVRQSSPEAPPLLRKIVSQVV 1160 Query: 1035 VHSIWKERNVRTHPSSAVAPKNVARLI 1115 V+++W++RN H S +AP + +L+ Sbjct: 1161 VYNLWRQRNNLLHNSLRLAPAVIFKLV 1187 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 194 bits (492), Expect = 7e-47 Identities = 122/406 (30%), Positives = 195/406 (48%), Gaps = 19/406 (4%) Frame = +3 Query: 3 RLQLIKSVLLGIQGFWCMYLFLPNGVLQKIQSILSKFLWGGPSADTVHYKVAWNTCCLPL 182 RL L+ SV++ I FW LP G +++I+ + S FLW GP + K+AW++ C P Sbjct: 1107 RLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPK 1166 Query: 183 SEGGLGIRNMFIWNRASILFNLWRLLRPSESPWIRWFHSYVLKSKSIWEASVPAN-SSWA 359 EGGLGI+++ N+ S L +WRLL S W+ W +++++ + W A+ ++ SW Sbjct: 1167 KEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWIWTFIIRKGTFWSANERSSLGSWM 1226 Query: 360 VRKIMNSRSQALQFLQYTVGKDSQFSLWFDPWPR-PSLVDRFGRXXXXXXXXXXXARVST 536 +K++ R A + V S S W+D W L+D G + T Sbjct: 1227 WKKLLKYRELAKSMHKVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVIDLGIPLETNLET 1286 Query: 537 LQRSDQWAPFHSNHSLVI------ELRHLLQSITIASEDRITWDGITN-------VKITN 677 + R+ Q H H I E++ L Q A D W + N K+T Sbjct: 1287 VLRTHQ----HRQHRAAIYNRINAEIQRLQQQEREAGPDISLWRSLKNDFNKRFITKVT- 1341 Query: 678 IWDSIRPHASTSAWAIALWHSWAIPRCTFTAWLALQDRLLTRDRMSRFGFNNDLCCVLCN 857 W+++R H W +W ++ P+ +F WL +Q+RL T DR+ + + C LCN Sbjct: 1342 -WNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCN 1400 Query: 858 SDAESAKHLFAECSFSSQVLAACPFQLLG-NWNEYCNGNFVQAHFTNVEKQFGILF---F 1025 + E+ HLF C ++S V A +LL N++ N F +N+ + LF F Sbjct: 1401 NAEETRDHLFFSCQYTSYVWEALTQRLLSTNYSRDWNRLFTLLCTSNLPRDHLFLFRYVF 1460 Query: 1026 AVAVHSIWKERNVRTHPSSAVAPKNVARLIFDIKSTIRAKLATRGD 1163 +++ IW+ERN R H +P N RLI I T+R ++++ D Sbjct: 1461 QASIYHIWRERNARRH-GEISSPTN--RLIKLIDKTVRNRISSIRD 1503 >dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana] Length = 478 Score = 189 bits (481), Expect = 1e-45 Identities = 123/411 (29%), Positives = 193/411 (46%), Gaps = 27/411 (6%) Frame = +3 Query: 3 RLQLIKSVLLGIQGFWCMYLFLPNGVLQKIQSILSKFLWGGPSADTVHYKVAWNTCCLPL 182 RLQLI SV+ + FW LP+ +++I SI S FLW GP +T KVAW+ C P Sbjct: 66 RLQLISSVIHSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPK 125 Query: 183 SEGGLGIRNMFIWNRASILFNLWRLLRPSESPWIRWFHSYVLKSKSIWEASVPAN-SSWA 359 EGGLGIR++ N+ S+L +WR+L S S W++W Y+L+ S W S SW Sbjct: 126 DEGGLGIRSLKEANKVSLLKLIWRML-SSTSLWVQWLRLYLLRKGSFWSISGNTTLGSWM 184 Query: 360 VRKIMNSRSQALQFLQYTVGKDSQFSLWFDPWPR-PSLVDRFGRXXXXXXXXXXXARVST 536 +KI+ R+ A F+++ + S S WFD W + L+D G A V+ Sbjct: 185 WKKILKHRALASGFVKHDIHNGSNTSFWFDNWSKIGRLIDVTGHRGCIDMGITLHASVAE 244 Query: 537 LQRSDQWAPFHSNHSLVIELRHLLQSI----TIASEDRITWDGITNV-----KITNIWDS 689 + + P H ++ + ++ + + ED + W G ++ W + Sbjct: 245 AVVNHR--PRRHRHDTLLRIEDVIAEVRHQGLTSGEDTVRWKGNGDIFKPCFNTKETWAA 302 Query: 690 IRPHASTSAWAIALWHSWAIPRCTFTAWLALQDRLLTRDRMSRFGFNNDLCCVLCNSDAE 869 R W +W S A P+ + AW+A+++RL T DRM + D CVLC+ E Sbjct: 303 TREPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCVLCHHLVE 362 Query: 870 SAKHLFAECSFSSQVLAACPFQLLGNWNEYCNGNFVQAHFTN---------VEKQFG--I 1016 + HLF C +S++V + +LL HFTN K G + Sbjct: 363 TRDHLFFTCPYSAEVWSTLTRKLLSQ------------HFTNRWEAILKLLTNKSLGHEV 410 Query: 1017 LF-----FAVAVHSIWKERNVRTHPSSAVAPKNVARLIFDIKSTIRAKLAT 1154 F F + +HS+WKERN R H P+ A+++ + +R ++++ Sbjct: 411 PFLTRYTFQLTLHSLWKERNGRRH---GEVPQAAAQMVRFLDKQVRNRISS 458 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 187 bits (476), Expect = 5e-45 Identities = 124/414 (29%), Positives = 192/414 (46%), Gaps = 20/414 (4%) Frame = +3 Query: 3 RLQLIKSVLLGIQGFWCMYLFLPNGVLQKIQSILSKFLWGGPSADTVHYKVAWNTCCLPL 182 RLQLI SV+ FW LP L+ I+ + ++FLWG KV+W CLP Sbjct: 802 RLQLISSVIYSTVNFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPK 861 Query: 183 SEGGLGIRNMFIWNRASILFNLWRLLRPSESPWIRWFHSYVLKSKSIWEASVPANSSWAV 362 +EGGLG+RN + WN+ L +W L +S W+ W H+ L+ + W A ++ SW Sbjct: 862 AEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSWIW 921 Query: 363 RKIMNSRSQALQFLQYTVGKDSQFSLWFDPWPR-PSLVDRFGRXXXXXXXXXXXARVSTL 539 + I+ R A +FL+ VG S W+D W L++ G A V+ Sbjct: 922 KAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAVVTEA 981 Query: 540 QRSDQW--APFHSNHSLVIELRHLLQSITIAS----EDRITW--DGITNVKITN--IWDS 689 S W + ++ + LR L + S ED TW +G ++ ++ W+ Sbjct: 982 SSSTGWILPSARTRNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSSTSFSSKLTWEC 1041 Query: 690 IRPHASTSAWAIALWHSWAIPRCTFTAWLALQDRLLTRDRMSRFGFNNDLCCVLCNSDAE 869 +R +T WA A+W+ IP+ F W+A +RL R R + + N C +C + E Sbjct: 1042 LRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRPSLCCVCQRETE 1101 Query: 870 SAKHLFAECSFSS----QVLAAC-PFQLLGNWN---EYCNGNFVQAHFTNVEKQFGILFF 1025 + HLF C+ S QVLA Q+ W E+ N Q F+ K+ + Sbjct: 1102 TRDHLFIHCTLGSLIWQQVLARFGRSQMFREWKDIIEWMLSN--QGSFSGTLKKLAV--- 1156 Query: 1026 AVAVHSIWKERNVRTHPSSAVAPKNVARLI-FDIKSTIRAKLATRGDFKKAASR 1184 A+ IWKERN R H + + + + + I I+ +I A++ TR +FK S+ Sbjct: 1157 QTAIFHIWKERNSRLHSAMSASHTAIFKQIDRSIRDSILARI-TRRNFKDLLSQ 1209