BLASTX nr result
ID: Bupleurum21_contig00018659
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00018659
         (1829 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           300   7e-79
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   300   7e-79
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               296   9e-78
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   295   3e-77
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       291   5e-76

>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score = 300 bits (769), Expect = 7e-79
 Identities = 181/593 (30%), Positives = 286/593 (48%), Gaps = 16/593 (2%)
 Frame = -3

Query: 1827 ISWIRHCLSSTMISVKINGSLEGYFKAKVGLRQGDPLSPYLFVIAMEAFSAIINKATDLH 1648
            I+WI C+++ ++ +NG+ G+F++ GLRQGDPLSPYLFV+AME FS ++ D
Sbjct: 480  INWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSG 539

Query: 1647 QFRFHKDATDPKVSHLFFADDVMLFCRGDADSINVLLNATDQFAKYSGLRPNPSKSSCYF 1468
            +H A D +SHL FADDVM+F G + S++ + D FA +SGL+ N KS +
Sbjct: 540  YIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQ 599

Query: 1467 ANVPLCIVQRVLKRC-RFSWGSLPVKFLGLPLLSSSPTDRDCEPLITRICNRIQSWTARF 1291
            A + L +R+ F G+ P+++LGLPL+ D PL+ ++ R++SW ++
Sbjct: 600  AGLDLS--ERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKA 657

Query: 1290 LSFAGRIQLIKSILYSIQEFWAMYLFLPVKVLKTLQSIFARFLWSGKRDGKCNYKVAWQE 1111
            LSFAGR QLI S+++ + FW LP +K ++S+ ++FLW+G DG+ + KV+W +
Sbjct: 658  LSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVD 717

Query: 1110 CTKPICCGGLGIKDLRLWNKAAVQYQLWRVVSPRSSSLWTSWFRGCFLKNKAFWTMKKPS 931
            C P GGLG + WNK + +W V+ R +SLW W R L + +FW +
Sbjct: 718  CCLPKSEGGLGFRSFGEWNKTLLLRLIW-VLFDRDTSLWAQWQRHHRLGHASFWQVNALQ 776

Query: 930 SSCSWCISKILNARYEAMSHVNYKIGKGDNTLLWHDPWLNGRPVINVLXXXXXXXXXXXSH 751
           + W +LN R A + K+G G W D W + P+I L
Sbjct: 777 QTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFS 836

Query: 750 ALVSSIIKDGSWSVGPSNHALAIE--FRHLLS---GVRLHSNDTVLW--EDKPSSQVSIS 592
           A V+ I W + P + +L + HL S L +D+ W +D S +
Sbjct: 837 AKVADAIDGSGWRL-PLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAA 895

Query: 591 FIYNSGRPLHTCVPWSGFIWFPGAVPRFSFCCWXXXXXXXXXXXXXLSYGLLDEAICGLC 412
           + RP W+ +WF GAVP+ +F W +S+GL+ A C LC
Sbjct: 896 KTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLC 955

Query: 411 CQYDEDERHLFFSCSFASRI-----ISECXXXXXXXXXXXXXHCDTLSAGGFEFQYIRLY 247
           E HL C F+S++ + C S ++
Sbjct: 956 CSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSLLRKVV 1015

Query: 246  VTTAIYYIWQQRNCRLWNPSQALTVDATIQLIKKTVRQIVFG---CNRFRKLL 97
            +Y +W+QRN L + S ++ +L+ + +R ++ R+R+LL
Sbjct: 1016 AQLVVYNLWRQRNLVL-HSSLRVSCSVVFRLVDRELRNVILSRRHKRRWRELL 1067

>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana]
          Length = 1072

 Score = 300 bits (769), Expect = 7e-79
 Identities = 181/593 (30%), Positives = 286/593 (48%), Gaps = 16/593 (2%)
 Frame = -3

Query: 1827 ISWIRHCLSSTMISVKINGSLEGYFKAKVGLRQGDPLSPYLFVIAMEAFSAIINKATDLH 1648
            I+WI C+++ ++ +NG+ G+F++ GLRQGDPLSPYLFV+AME FS ++ D
Sbjct: 480  INWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSG 539

Query: 1647 QFRFHKDATDPKVSHLFFADDVMLFCRGDADSINVLLNATDQFAKYSGLRPNPSKSSCYF 1468
            +H A D +SHL FADDVM+F G + S++ + D FA +SGL+ N KS +
Sbjct: 540  YIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQ 599

Query: 1467 ANVPLCIVQRVLKRC-RFSWGSLPVKFLGLPLLSSSPTDRDCEPLITRICNRIQSWTARF 1291
            A + L +R+ F G+ P+++LGLPL+ D PL+ ++ R++SW ++
Sbjct: 600  AGLDLS--ERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKA 657

Query: 1290 LSFAGRIQLIKSILYSIQEFWAMYLFLPVKVLKTLQSIFARFLWSGKRDGKCNYKVAWQE 1111
            LSFAGR QLI S+++ + FW LP +K ++S+ ++FLW+G DG+ + KV+W +
Sbjct: 658  LSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVD 717

Query: 1110 CTKPICCGGLGIKDLRLWNKAAVQYQLWRVVSPRSSSLWTSWFRGCFLKNKAFWTMKKPS 931
            C P GGLG + WNK + +W V+ R +SLW W R L + +FW +
Sbjct: 718  CCLPKSEGGLGFRSFGEWNKTLLLRLIW-VLFDRDTSLWAQWQRHHRLGHASFWQVNALQ 776

Query: 930 SSCSWCISKILNARYEAMSHVNYKIGKGDNTLLWHDPWLNGRPVINVLXXXXXXXXXXXSH 751
           + W +LN R A + K+G G W D W + P+I L
Sbjct: 777 QTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFS 836

Query: 750 ALVSSIIKDGSWSVGPSNHALAIE--FRHLLS---GVRLHSNDTVLW--EDKPSSQVSIS 592
           A V+ I W + P + +L + HL S L +D+ W +D S +
Sbjct: 837 AKVADAIDGSGWRL-PLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAA 895

Query: 591 FIYNSGRPLHTCVPWSGFIWFPGAVPRFSFCCWXXXXXXXXXXXXXLSYGLLDEAICGLC 412
           + RP W+ +WF GAVP+ +F W +S+GL+ A C LC
Sbjct: 896 KTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLC 955

Query: 411 CQYDEDERHLFFSCSFASRI-----ISECXXXXXXXXXXXXXHCDTLSAGGFEFQYIRLY 247
           E HL C F+S++ + C S ++
Sbjct: 956 CSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSLLRKVV 1015

Query: 246  VTTAIYYIWQQRNCRLWNPSQALTVDATIQLIKKTVRQIVFG---CNRFRKLL 97
            +Y +W+QRN L + S ++ +L+ + +R ++ R+R+LL
Sbjct: 1016 AQLVVYNLWRQRNLVL-HSSLRVSCSVVFRLVDRELRNVILSRRHKRRWRELL 1067

>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score = 296 bits (759), Expect = 9e-78
 Identities = 192/586 (32%), Positives = 288/586 (49%), Gaps = 20/586 (3%)
 Frame = -3

Query: 1827 ISWIRHCLSSTMISVKINGSLEGYFKAKVGLRQGDPLSPYLFVIAMEAFSAIINKATDLH 1648
            I WI C+++ SV++NG L GYF++K GLRQG LSPYLFVI M+ S +++KA +
Sbjct: 273  IHWINLCITTASFSVQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVR 332

Query: 1647 QFRFHKDATDPKVSHLFFADDVMLFCRGDADSINVLLNATDQFAKYSGLRPNPSKSSCYF 1468
            +F FH ++HL FADD+M+ G SI +L D+F K SGLR + KS+ Y
Sbjct: 333  KFGFHPKCQRLGLTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYM 392

Query: 1467 ANVPLCIVQRVLKRCRFSWGSLPVKFLGLPLLSSSPTDRDCEPLITRICNRIQSWTARFL 1288
            A V I Q + + F G LPV++LGLPL++ T D PL+ +I RI +WT RF
Sbjct: 393  AGVSPIIKQEIAAKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFF 452

Query: 1287 SFAGRIQLIKSILYSIQEFWAMYLFLPVKVLKTLQSIFARFLWSGKRDGKCNYKVAWQEC 1108
            SFAGR LIKS+L+SI FW LP + ++ + + + FLWSG K++W
Sbjct: 453  SFAGRFNLIKSVLWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIV 512

Query: 1107 TKPICCGGLGIKDLRLWNKAAVQYQLWRVVSPRSSSLWTSWFRGCFLKNKAFWTMKKPSS 928
            KP GGLG+++L+ N + +WR++S S+SLWT W ++ K+ W++K+ +S
Sbjct: 513  CKPKAEGGLGLRNLKEANDVSCLKLVWRIIS-NSNSLWTKWVAEYLIRKKSIWSLKQSTS 571

Query: 927 C-SWCISKILNARYEAMSHVNYKIGKGDNTLLWHDPWLNGRPVINVLXXXXXXXXXXXSH 751
           SW KIL R A S ++G G++ W+D W +I+ +
Sbjct: 572 MGSWIWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPRE 631

Query: 750 ALVSSIIKDGSW---SVGPSNHALAIEFRHLLSGVRLH---SNDTVLWEDKP---SSQVS 598
           A V+ +W S +L E +++ R+H + DTVLW K S
Sbjct: 632 ASVAD-----AWTRRSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFS 686

Query: 597 ISFIYNSGRPLHTCVPWSGFIWFPGAVPRFSFCCWXXXXXXXXXXXXXLSYGLLDEAI-- 424
           ++ + + V W +WF A P+++ C W L +
Sbjct: 687 TRDTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGN 746

Query: 423 CGLCCQYDEDERHLFFSCSFASRIISECXXXXXXXXXXXXXHCDTLSAGGFEFQ-----Y 259
           C LC + HLFFSCS+AS + + L+ FQ +
Sbjct: 747 CVLCTNNSKTLEHLFFSCSYASTVWA-ALAKGIWKTRYSTRWSHLLTHISTHFQDRVEGF 805

Query: 258 IRLYVTTA-IYYIWQQRNCRLWN--PSQALTVDATIQLIKKTVRQI 130
           + Y+ A IY++W++RN R + P+ TV I K+T QI
Sbjct: 806 LTRYIFQATIYHVWRERNGRRHDAAPNTPATVIGWID--KQTRNQI 849

>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana]
          Length = 1164

 Score = 295 bits (755), Expect = 3e-77
 Identities = 165/495 (33%), Positives = 249/495 (50%), Gaps = 7/495 (1%)
 Frame = -3

Query: 1821 WIRHCLSSTMISVKINGSLEGYFKAKVGLRQGDPLSPYLFVIAMEAFSAIINKATDLHQF 1642
            WI CLS+ SV +NG G+F + GLRQGDP+SPYLFV+AME FS ++
Sbjct: 519  WILECLSTASFSVILNGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYI 578

Query: 1641 RFHKDATDPKVSHLFFADDVMLFCRGDADSINVLLNATDQFAKYSGLRPNPSKSSCYFAN 1462
            +H + ++SHL FADDVM+F G + S++ ++ + + FA +SGL N +K+ Y A
Sbjct: 579  AYHPKTSQLEISHLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAG 638

Query: 1461 VPLCIVQRVLKRCRFSWGSLPVKFLGLPLLSSSPTDRDCEPLITRICNRIQSWTARFLSF 1282
            + + F GSLPV++LGLPL+S T + PLI +I R SW R LSF
Sbjct: 639  LSQSESDSMASY-GFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSF 697

Query: 1281 AGRIQLIKSILYSIQEFWAMYLFLPVKVLKTLQSIFARFLWSGKRDGKCNYKVAWQECTK 1102
            AGR+QL+ S++ I FW LP+ +K ++S+ +RFLWS + D K KVAW +
Sbjct: 698  AGRVQLLASVISGIVNFWISSFILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCL 757

Query: 1101 PICCGGLGIKDLRLWNKAAVQYQLWRVVSPRSSSLWTSWFRGCFL-KNKAFWTMKKPSSC 925
            P GG+G++ + N+ +W + S S SLW +W + L K+ +FW +
Sbjct: 758  PKAEGGIGLRRFAVSNRTLYLRMIWLLFS-NSGSLWVAWHKQHSLGKSTSFWNQPEKPHD 816

Query: 924 SWCISKILNARYEAMSHVNYKIGKGDNTLLWHDPWLNGRPVINVLXXXXXXXXXXXSHAL 745
           SW +L R A + +G G + W D W P+I L +A
Sbjct: 817 SWNWKCLLRLRVVAERFIRCNVGNGRDASFWFDNWTPFGPLIKFLGNEGPRDLRVHLNAK 876

Query: 744 VSSIIKDGSWSVGPSNHALAIEFRHLLSGVRLHSN----DTVLW--EDKPSSQVSISFIY 583
           +S + WS+ A+ L+ + + S+ D+ W ++K S + +
Sbjct: 877 ISDVCTSEGWSIADPRSDQALSLHTHLTNISMPSDAQDLDSYDWVVDNKVCQGFSAAATW 936

Query: 582 NSGRPLHTCVPWSGFIWFPGAVPRFSFCCWXXXXXXXXXXXXXLSYGLLDEAICGLCCQY 403
           ++ RP VPW+ +WF GA P+ +F W S+G+ + CGLC +
Sbjct: 937 SALRPSSAPVPWARAVWFKGATPKHAFHLWTAHLDRLPTKVRLASWGMQIDTTCGLCSLH 996

Query: 402 DEDERHLFFSCSFAS 358
           E HLF SC FA+
Sbjct: 997 PETRDHLFLSCDFAN 1011

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 291 bits (744), Expect = 5e-76
 Identities = 186/591 (31%), Positives = 279/591 (47%), Gaps = 12/591 (2%)
 Frame = -3

Query: 1827 ISWIRHCLSSTMISVKINGSLEGYFKAKVGLRQGDPLSPYLFVIAMEAFSAIINKATDLH 1648
            I+WI C+S+ +V ING G+FK+ GLRQGDPLSPYLFV+AMEAFS +++ +
Sbjct: 620  INWISQCISTPTFTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESG 679

Query: 1647 QFRFHKDATDPKVSHLFFADDVMLFCRGDADSINVLLNATDQFAKYSGLRPNPSKSSCYF 1468
            +H A++ +SHL FADDVM+F G + S++ + D FA +SGL+ N KS Y
Sbjct: 680  LIHYHPKASNLSISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYL 739

Query: 1467 ANVPLCIVQRVLKRCRFSWGSLPVKFLGLPLLSSSPTDRDCEPLITRICNRIQSWTARFL 1288
            A + + F G+LP+++LGLPL++ + EPL+ +I R +SW + L
Sbjct: 740  AGLNQ-LESNANAAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCL 798

Query: 1287 SFAGRIQLIKSILYSIQEFWAMYLFLPVKVLKTLQSIFARFLWSGKRDGKCNYKVAWQEC 1108
            SFAGRIQLI S+++ FW LP +K ++S+ +RFLWSG + KV+W
Sbjct: 799  SFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAAL 858

Query: 1107 TKPICCGGLGIKDLRLWNKAAVQYQLWRVVSPRSSSLWTSWFRGCFLKNKAFWTMKKPSS 928
            P GGLG++ L WNK +WR+ + SLW W L +FW ++ S
Sbjct: 859  CLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAK-DSLWADWQHLHHLSRGSFWAVEGGQS 917

Query: 927 CSWCISKILNARYEAMSHVNYKIGKGDNTLLWHDPWLNGRPVINVLXXXXXXXXXXXSHA 748
           SW ++L+ R A + K+G G W+D W + P+ ++ A
Sbjct: 918 DSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLA 977

Query: 747  LVSSIIKDGSWSVGPSNHALAIEFRHLLSGVRLHSN-----DTVLWEDKP--SSQVSISF 589
            V+S + W + S A A L V + S D W S +
Sbjct: 978  KVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQGFSAAK 1037

Query: 588  IYNSGRPLHTCVPWSGFIWFPGAVPRFSFCCWXXXXXXXXXXXXXLSYGLLDEAICGLCC 409
            + + RP T W+ IWF GAVP+++F W S+G + C LC
Sbjct: 1038 TWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACVLCS 1097

Query: 408  QYDEDERHLFFSCSFASRIISECXXXXXXXXXXXXXHCDTLS----AGGFEFQYIRLYVT 241
            E HL C F++++ + LS + +R V+
Sbjct: 1098 FASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELLSWVRQSSPEAPPLLRKIVS 1157

Query: 240  -TAIYYIWQQRNCRLWNPSQALTVDATIQLIKKTVRQIVFGCNRFRKLLKK 91
            +Y +W+QRN L N S L +L+ + +R I+ R RK +K
Sbjct: 1158 QVVVYNLWRQRNNLLHN-SLRLAPAVIFKLVDREIRNII-SSRRLRKRWRK 1206
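
The per-hit statistics in a report like this one (Length, Score, Expect, Identities, Positives, Gaps, Frame) can be pulled out with a short script. The following is a minimal sketch, not part of BLAST: HIT_RE and parse_hits are hypothetical names, and the pattern assumes the legacy text layout shown in this record. It tolerates arbitrary whitespace, so it should also work on a report whose line breaks have been lost. The sample string is the summary of the first hit, copied from the report.

```python
import re

# Hypothetical helper, not part of BLAST: matches one hit header plus its
# first Score/Expect/Identities/Positives/Gaps/Frame summary.
HIT_RE = re.compile(
    r">(?P<id>\S+)\s+(?P<desc>.*?)\s+Length\s*=\s*(?P<length>\d+)\s+"
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s+"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<aln>\d+)\s+\(\d+%\),\s+"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s+Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?\s+"
    r"Frame\s*=\s*(?P<frame>[+-]?\d)",
    re.DOTALL,
)


def parse_hits(report):
    """Return one dict of summary statistics per hit found in the report text."""
    hits = []
    for m in HIT_RE.finditer(report):
        ev = m.group("evalue")
        # Legacy BLAST may print very small E-values as "e-100" (no mantissa).
        evalue = float("1" + ev if ev.startswith("e") else ev)
        aln = int(m.group("aln"))
        hits.append({
            "id": m.group("id"),
            "description": " ".join(m.group("desc").split()),
            "length": int(m.group("length")),
            "bits": float(m.group("bits")),
            "evalue": evalue,
            # Recomputed from the counts, so not rounded like the printed value.
            "identity_pct": 100.0 * int(m.group("ident")) / aln,
            "positive_pct": 100.0 * int(m.group("pos")) / aln,
            "gaps": int(m.group("gaps") or 0),
            "frame": int(m.group("frame")),
        })
    return hits


sample = (
    ">dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] "
    "Length = 1072 Score = 300 bits (769), Expect = 7e-79 "
    "Identities = 181/593 (30%), Positives = 286/593 (48%), "
    "Gaps = 16/593 (2%) Frame = -3"
)
hits = parse_hits(sample)
print(hits[0]["id"], hits[0]["evalue"], round(hits[0]["identity_pct"], 1))
```

Only the first summary block of each hit is captured; a hit with several HSPs would need a second pass over the text between hit headers.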