BLASTX nr result
ID: Bupleurum21_contig00002617
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00002617 (1447 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

Sequences producing significant alignments: Score (bits) E Value
ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2... 263 1e-67
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal... 239 1e-60
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] 234 4e-59
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta... 234 6e-59
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana] 233 1e-58

>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1| predicted protein [Populus trichocarpa]
Length = 517

Score = 263 bits (671), Expect = 1e-67
Identities = 156/473 (32%), Positives = 242/473 (51%), Gaps = 17/473 (3%)
Frame = +2

Query: 2 LAPNKLKSSIYFCNVDLDTQLQAINLSGFQPGSLPFTYLGLPLITSRLNTQQCMPLVMRL 181
L PN KS I+ V + Q I++ GF+ G LP YLG+PL++SRL C LV R+
Sbjct: 21 LYPNPNKSDIFLSGVLNAEREQIIHILGFREGELPMKYLGVPLLSSRLKAIYCKGLVDRI 80

Query: 182 CQRVNSWTNRFLSLAGRLQLLKSILFGIQGYWAAHIFLPQGVLAKIQSILSRFLWGGNSN 361
+V WT R LS AGR+QL+ S+LF IQ YWA+ LP V+ ++ I+ FLW G+
Sbjct: 81 TSKVRHWTCRTLSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDM 140

Query: 362 RKPHYKVAWVDCCLPAEEGGLGLRDLESWNTAAVLYQLWRLIKSSD-SLWIAWFKNCILR 538
R KVAW CLP +EGGLG++ ++ WN A+L +W L SD S+W W ++ +LR
Sbjct: 141 RTTGAKVAWDQVCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLR 200

Query: 539 NKALWTVKCSYSHSWCVRKILNIRPMALRYIKYEVGEGSNFLLWHDPW-AGEPLITQMSD 715
+ WT+K + SW KIL +R +A +KY +G+G LW D W PL +
Sbjct: 201 GRNFWTIKTPQNCSWAWGKILKLRSLAWPKMKYIIGDGMTTSLWFDNWHPHSPLADSYGE 260

Query: 716 HIISVMESTSLAKVSSIMNNTSWSSGGSN----HPLAIELRHMISTVQIRRHDRVSWDGY 883
I AKV+ ++ N+ W + + HP+ IE S ++ + D + W
Sbjct: 261 RFIYDSGMAKNAKVNVLIQNSEWKTPTTQAIGWHPI-IEAIPSNSNPKMGQKDELVWLDS 319

Query: 884 CN--VKLKHIWNTIRQVGTLPAWYPVVWHSWMIRKCSLHMWLAFKNRLLTRERMSRFGMG 1057
N +K W +R+ + W+ +VW + + S +W+A + +L T++++ RFG+
Sbjct: 320 PNHRFSVKVAWEQLRRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIH 379

Query: 1058 TSLFCTLCDTQQVETVAHIFTTCPYAV----EIMSASSFP-LNGCWARYALGDIACVALS 1222
C+LC + E H+F C Y ++ P + W + I +S
Sbjct: 380 GPNRCSLC-LRNNEDHNHLFFECSYTKAIWWDVCDRCDIPRMTKGWDEW----IRWATVS 434

Query: 1223 QDEKRMAS----LYLAVAMHLIWNERNLRIHSATSRPAAVLIMEIKRIVRDKL 1369
K + L A ++ +W ERN RI + SR +++ +I+ I+RDKL
Sbjct: 435 WHGKSFVNFSCKLSFAATVYHVWQERNARIFAGMSRTPNLVLNQIECIIRDKL 487

>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
Length = 629

Score = 239 bits (611), Expect = 1e-60
Identities = 149/475 (31%), Positives = 224/475 (47%), Gaps = 22/475 (4%)
Frame = +2

Query: 11 NKLKSSIYFCNVDLDTQLQAINLSGFQPGSLPFTYLGLPLITSRLNTQQCMPLVMRLCQR 190
N K+++Y V + I+ F G LP YLGLPL+T RL + PL ++ R
Sbjct: 142 NMEKTTLYTAGVSDHNRYMMISRYPFGLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNR 201

Query: 191 VNSWTNRFLSLAGRLQLLKSILFGIQGYWAAHIFLPQGVLAKIQSILSRFLWGGNSNRKP 370
+ +WT+R+LS AGRL L+ S+L+ +W + LP L +I SI S FLW G +
Sbjct: 202 IGTWTSRYLSFAGRLNLISSVLWSTMNFWMSAFRLPSACLKEINSICSAFLWSGPELHRR 261

Query: 371 HYKVAWVDCCLPAEEGGLGLRDLESWNTAAVLYQLWRLIKSSDSLWIAWFKNCILRNKAL 550
KV+W D C P +EGGLGLR L N +VL +WR+ + DSLW+ W K +L+ ++
Sbjct: 262 KAKVSWDDICKPKQEGGLGLRSLTEANVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESF 321

Query: 551 WTVKCSYS-HSWCVRKILNIRPMALRYIKYEVGEGSNFLLWHDPWAGEPLITQMSDHIIS 727
W++ + S SW +K+L R A + + EV G+ W D W+G H++
Sbjct: 322 WSLTPNSSLGSWMWKKMLKYRETAKPFSRVEVNNGARTSFWFDNWSG-------MGHLMD 374

Query: 728 VMESTSLAKVSSIMNNT---SWSSGGSN-------HPLAIELRHMISTVQIRRHDRVSWD 877
V + N T +WS+ + + L T + R D W
Sbjct: 375 VTGQRGQIDLGISRNKTVAEAWSNRRRRKHRTEQLNDIEAALNQKYQTRNLLREDATLWR 434

Query: 878 GYCNV-----KLKHIWNTIRQVGTLPAWYPVVWHSWMIRKCSLHMWLAFKNRLLTRERMS 1042
G +V K WN +R+ AWY VW S K WLA +NRL T RM
Sbjct: 435 GKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQ 494

Query: 1043 RFGMGTSLFCTLCDTQQVETVAHIFTTCPYAVEIMSASSFPLNGCWARYA------LGDI 1204
+ G+ + CT C T +ET H+F +C YA I +A N R++ + I
Sbjct: 495 LWNNGSDVKCTFCST-SIETRDHLFFSCSYASAIWTA--IAKNVLQHRFSTDWQTIVNYI 551

Query: 1205 ACVALSQDEKRMASLYLAVAMHLIWNERNLRIHSATSRPAAVLIMEIKRIVRDKL 1369
+ + ++ + +H +W ERN R H R +A LI + + +R++L
Sbjct: 552 SETQTDRIRSFLSRYIFQLTVHTVWKERNDRRHGEEPRTSANLISWMDKQIRNQL 606

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
Length = 1213

Score = 234 bits (597), Expect = 4e-59
Identities = 144/472 (30%), Positives = 228/472 (48%), Gaps = 18/472 (3%)
Frame = +2

Query: 2 LAPNKLKSSIYFCNVDLDTQLQAINLSGFQPGSLPFTYLGLPLITSRLNTQQCMPLVMRL 181
L NK KS +Y ++ + A GF G+LP YLGLPL+ +L + PL+ ++
Sbjct: 728 LKVNKDKSHLYLAGLN-QLESNANAAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKI 786

Query: 182 CQRVNSWTNRFLSLAGRLQLLKSILFGIQGYWAAHIFLPQGVLAKIQSILSRFLWGGNSN 361
R SW N+ LS AGR+QL+ S++FG +W + LP+G + +I+S+ SRFLW GN
Sbjct: 787 TARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIE 846

Query: 362 RKPHYKVAWVDCCLPAEEGGLGLRDLESWNTAAVLYQLWRLIKSSDSLWIAWFKNCILRN 541
+ KV+W CLP EGGLGLR L WN + +WRL + DSLW W L
Sbjct: 847 QAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSR 906

Query: 542 KALWTVKCSYSHSWCVRKILNIRPMALRYIKYEVGEGSNFLLWHDPWAG-EPLITQMSDH 718
+ W V+ S SW +++L++RP+A +++ +VG G W+D W PL + D
Sbjct: 907 GSFWAVEGGQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDI 966

Query: 719 IISVMESTSLAKVSSIMNNTSWSSGGSNHPLAIELRHMISTVQI-----RRHDRVSW--D 877
S + LAKV+S + W S A + + TV + DR W +
Sbjct: 967 GPSSLRVPLLAKVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVN 1026

Query: 878 GYC--NVKLKHIWNTIRQVGTLPAWYPVVWHSWMIRKCSLHMWLAFKNRLLTRERMSRFG 1051
G+ W IR T+ +W +W + K + +MW++ NRLLTR+R++ +G
Sbjct: 1027 GFLCQGFSAAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWG 1086

Query: 1052 MGTSLFCTLCDTQQVETVAHIFTTCPYAVEIMS------ASSFPLNGCWARYALGDIACV 1213
S C LC E+ H+ C ++ ++ L W+ ++ V
Sbjct: 1087 HIQSDACVLCSFAS-ESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSEL----LSWV 1141

Query: 1214 ALSQDE--KRMASLYLAVAMHLIWNERNLRIHSATSRPAAVLIMEIKRIVRD 1363
S E + + V ++ +W +RN +H++ AV+ + R +R+
Sbjct: 1142 RQSSPEAPPLLRKIVSQVVVYNLWRQRNNLLHNSLRLAPAVIFKLVDREIRN 1193

>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1352

Score = 234 bits (596), Expect = 6e-59
Identities = 149/459 (32%), Positives = 224/459 (48%), Gaps = 21/459 (4%)
Frame = +2

Query: 20 KSSIYFCNVDLDTQLQAINLSGFQPGSLPFTYLGLPLITSRLNTQQCMPLVMRLCQRVNS 199
KS+I+ + + + + F+ G+LP YLGLPL+T R+ +PLV ++ R+ S
Sbjct: 887 KSTIFMAGISPNAKTSILQQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITS 946

Query: 200 WTNRFLSLAGRLQLLKSILFGIQGYWAAHIFLPQGVLAKIQSILSRFLWGGNSNRKPHYK 379
WTNRFLS AGRLQL+KS+L I +W + LP+ L +I+ + S FLW G K
Sbjct: 947 WTNRFLSFAGRLQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAK 1006

Query: 380 VAWVDCCLPAEEGGLGLRDLESWNTAAVLYQLWRLIKSSDSLWIAWFKNCILRNKALWTV 559
+AW + C EEGGLGL+ L+ N ++L +WR++ + DSLW+ W ++R + W+V
Sbjct: 1007 IAWSEVCKLKEEGGLGLKPLKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSV 1066

Query: 560 KCSYS-HSWCVRKILNIRPMALRYIKYEVGEGSNFLLWHDPWAGEPLITQ-MSDHIISVM 733
K + SW RKIL R A + + EV G+ WHD W + Q M +
Sbjct: 1067 KENTGLGSWLWRKILKQRDKARLFHRMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDL 1126

Query: 734 ESTSLAKVSSIMNNTSWSSGGSNHPLAIELRHMISTVQIRRHDRVSWDG----------- 880
+ A V+ +MN + A L + S +++ R DR S DG
Sbjct: 1127 GIPNNATVAEVMN-----THRRKRHRADFLNQIKSQIELARQDR-STDGDRSLWKQKEDT 1180

Query: 881 -YCNVKLKHIWNTIRQVGTLPAWYPVVWHSWMIRKCSLHMWLAFKNRLLTRERMSRFGMG 1057
+ W IR + WY VW S K S WLAF NRL T +++ ++ G
Sbjct: 1181 FKSSFSSSKTWQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSG 1240

Query: 1058 TSLFCTLCDTQQVETVAHIFTTCPYAVEI-MSASSFPLNGCWARYALG-DIACVALSQDE 1231
C C +++ET H+F +CPY+ + S + LNG R L ++ L
Sbjct: 1241 ARYDCVFCG-EELETRDHLFFSCPYSSHVWFSLTKGLLNG---RNILNWNLITPHLLDSS 1296

Query: 1232 KRMASLY-----LAVAMHLIWNERNLRIHSATSRPAAVL 1333
+ ++ ++H +W ERN R H T+ PAA L
Sbjct: 1297 RPYLHVFTLRYAFQASIHSLWRERNCRRHGETAIPAAKL 1335

>dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
Length = 478

Score = 233 bits (593), Expect = 1e-58
Identities = 142/444 (31%), Positives = 218/444 (49%), Gaps = 16/444 (3%)
Frame = +2

Query: 86 FQPGSLPFTYLGLPLITSRLNTQQCMPLVMRLCQRVNSWTNRFLSLAGRLQLLKSILFGI 265
F G+LP YLGLPL+T ++ T PLV ++ R+ WT R LS AGRLQL+ S++ +
Sbjct: 18 FASGALPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSL 77

Query: 266 QGYWAAHIFLPQGVLAKIQSILSRFLWGGNSNRKPHYKVAWVDCCLPAEEGGLGLRDLES 445
+W + LP + +I SI S FLW G KVAW D C P +EGGLG+R L+
Sbjct: 78 TNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKE 137

Query: 446 WNTAAVLYQLWRLIKSSDSLWIAWFKNCILRNKALWTVKCSYS-HSWCVRKILNIRPMAL 622
N ++L +WR++ SS SLW+ W + +LR + W++ + + SW +KIL R +A
Sbjct: 138 ANKVSLLKLIWRML-SSTSLWVQWLRLYLLRKGSFWSISGNTTLGSWMWKKILKHRALAS 196

Query: 623 RYIKYEVGEGSNFLLWHDPWAGEPLITQMSDHIISV-MESTSLAKVSSIMNNTSWSSGGS 799
++K+++ GSN W D W+ + ++ H + M T A V+ + N
Sbjct: 197 GFVKHDIHNGSNTSFWFDNWSKIGRLIDVTGHRGCIDMGITLHASVAEAVVN--HRPRRH 254

Query: 800 NHPLAIELRHMISTVQ----IRRHDRVSWDGYCNV-----KLKHIWNTIRQVGTLPAWYP 952
H + + +I+ V+ D V W G ++ K W R+ WY
Sbjct: 255 RHDTLLRIEDVIAEVRHQGLTSGEDTVRWKGNGDIFKPCFNTKETWAATREPKLKVNWYK 314

Query: 953 VVWHSWMIRKCSLHMWLAFKNRLLTRERMSRFGMGTSLFCTLCDTQQVETVAHIFTTCPY 1132
VW S K S+ W+A KNRL T +RM + G C LC VET H+F TCPY
Sbjct: 315 GVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCVLCH-HLVETRDHLFFTCPY 373

Query: 1133 AVEIMSASSFPL-----NGCWARYALGDIACVALSQDEKRMASLYLAVAMHLIWNERNLR 1297
+ E+ S + L W L + +L + + + +H +W ERN R
Sbjct: 374 SAEVWSTLTRKLLSQHFTNRW-EAILKLLTNKSLGHEVPFLTRYTFQLTLHSLWKERNGR 432

Query: 1298 IHSATSRPAAVLIMEIKRIVRDKL 1369
H + AA ++ + + VR+++
Sbjct: 433 RHGEVPQAAAQMVRFLDKQVRNRI 456