BLASTX nr result
ID: Bupleurum21_contig00032420
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00032420
         (1121 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       226   6e-57
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   222   2e-55
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           205   1e-50
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   205   1e-50
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...   202   9e-50

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 226 bits (577), Expect = 6e-57
 Identities = 143/386 (37%), Positives = 212/386 (54%), Gaps = 13/386 (3%)
 Frame = +2

Query: 2    PWCVMGDFNAFISLDETASGSS-RWTTSMIEFKDCLFSLGITDLNYIGCPFTWWDKSRSQ 178
            PW V+GDFN ++ E ++ S +M +F+DCL + ++DL Y G FTWW+KS +
Sbjct: 138  PWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAELSDLRYKGNTFTWWNKSHTT 197

Query: 179  PLVRKLDRVLVNTSWINVFPNSFANFLPRGLSDHSPATVCLGQAQVKLNKPFQVFHHMLT 358
            P+ +K+DR+LVN SW +FP+S F SDH V L + +K +PF+ F+++L
Sbjct: 198  PVAKKIDRILVNDSWNALFPSSLGIFGSLDFSDHVSCGVVLEETSIKAKRPFKFFNYLLK 257

Query: 359  PDFLNVVKEAWDT-PISGDSWFILTSKLKLVKSGLK---RLN-SLVGNVQSAVHIARID 523
            + DFLN+V++ W T + G S F ++ KLK +K +K RLN S + H I
Sbjct: 258  NLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRLNYSELEKRTKEAHDFLIG 317

Query: 524  LHNFQAALPNIPSQAQLDEEA-RLLGIFSSALDIEEQFLRQKSKAHWLKSGDGNNKYFFN 700
            + A P P A + EA R I ++A EE F RQKS+ W GDGN KYF
Sbjct: 318  CQDRTLADPT-PINASFELEAERKWHILTAA---EESFFRQKSRISWFAEGDGNTKYFHR 373

Query: 701  YCRGRWNNNKIVGLLDSTGSIVTNHAELASIAVDHFKDVIGEERPVDPFPDD------LI 862
            R ++N I L D G +V + + + +F ++G+E VDP+ + L+
Sbjct: 374  MADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLLGDE--VDPYLMEQNDMNLLL 431

Query: 863  LPKLLDSQKSGLIANVTPAEILRALKSMAKNKSPGPDGFSPEFYLTTWDIVGADVIAGIS 1042
            + +Q L + + +I AL S+ +NKS GPDGF+ EF++ +W IVGA+V I
Sbjct: 432  SYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAEFFIDSWSIVGAEVTDAIK 491

Query: 1043 SFFTSLHLLRIINATAISLIPKENSP 1120
            FF+S LL+ NAT I LIPK +P
Sbjct: 492  KEFFSSGCLLKQWNATTIVLIPKIVNP 517

>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score = 222 bits (565), Expect = 2e-55
 Identities = 130/379 (34%), Positives = 202/379 (53%), Gaps = 9/379 (2%)
 Frame = +2

Query: 2    PWCVMGDFNAFISLDETASGSSRWTTSMIEFKDCLFSLGITDLNYIGCPFTWWDKSRSQP 181
            PW ++GDFN + + ++G SR T M EF++CL + I+DL + G +TWW+ + P
Sbjct: 137  PWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDLPFRGNHYTWWNNQENNP 196

Query: 182  LVRKLDRVLVNTSWINVFPNSFANFLPRGLSDHSPATVCLGQAQVKLNKPFQVFHHMLTH 361
            + +K+DR+LVN SW+ P S+ +F SDH P+ V + NKPF++ + ++ H
Sbjct: 197  IAKKIDRILVNDSWLIASPLSYGSFCAMEFSDHCPSCVNISNQSGGRNKPFKLSNFLMHH 256

Query: 362  PDFLNVVKEAWD-TPISGDSWFILTSKLKLVKSGLKRLN-SLVGNVQSAVHIARIDLHNF 535
            P+F+ ++ WD G + F L+ K K +K ++ N ++ V A +L
Sbjct: 257  PEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTFNREHYSGLEKRVVQAAQNLKTC 316

Query: 536  QAALPNIPSQ--AQLDEEARLLGIFSSALDIEEQFLRQKSKAHWLKSGDGNNKYFFNYCR 709
            Q L PS A L++EA ++ EE+FL QKS+ WLK GD N +F
Sbjct: 317  QNNLLAAPSSYLAGLEKEAH--RSWAELALAEERFLCQKSRVLWLKCGDSNTTFFHRMMT 374

Query: 710  GRWNNNKIVGLLDSTGSIVTNHAELASIAVDHFKDVIGEERPVDPFP-----DDLILPKL 874
            R N+I LLD TG + N EL + VD FK++ G + + L K
Sbjct: 375  ARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFGSSSHLISAEGISQINSLTRFKC 434

Query: 875  LDSQKSGLIANVTPAEILRALKSMAKNKSPGPDGFSPEFYLTTWDIVGADVIAGISSFFT 1054
            ++ + L A V+ A+I ++ NKSPGPDG++ EF+ TW IVG +IA + FF
Sbjct: 435  DENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSEFFKKTWSIVGPSLIAAVQEFFR 494

Query: 1055 SLHLLRIINATAISLIPKE 1111
            S LL N+TA++++PK+
Sbjct: 495  SGRLLGQWNSTAVTMVPKK 513

>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score = 205 bits (522), Expect = 1e-50
 Identities = 132/379 (34%), Positives = 198/379 (52%), Gaps = 10/379 (2%)
 Frame = +2

Query: 11   VMGDFNAFISLDETASGSS-RWTTSMIEFKDCLFSLGITDLNYIGCPFTWWDKSRSQPLV 187
            ++GDFN + E ++ S M +F CL + ++DL + G FTWW+KS +P+
Sbjct: 1    MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60

Query: 188  RKLDRVLVNTSWINVFPNSFANFLPRGLSDHSPATVCLGQAQVKLNKPFQVFHHMLTHPD 367
            +KLDR+L N SW N++P+S F SDH V L + +PF+ F+ +L + D
Sbjct: 61   KKLDRILANDSWCNLYPSSHGLFGNLDFSDHVSCGVVLEANGISAKRPFKFFNFLLKNED 120

Query: 368  FLNVVKEAW-DTPISGDSWFILTSKLKLVKSGLK---RLN-SLVGNVQSAVHIARIDLHN 532
            FLNVV + W T + G S + ++ KLK +K +K RLN S + H I N
Sbjct: 121  FLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLITCQN 180

Query: 533  FQAALPNIPSQAQLDEEARLLGIFSSALDIEEQFLRQKSKAHWLKSGDGNNKYFFNYCRG 712
            A P++ S A L+ EA+ + S EE F Q+S+ W GD N YF
Sbjct: 181  LTLANPSV-SNAALELEAQRKWVLLSC--AEESFFHQRSRVSWFAEGDSNTHYFHRMVDS 237

Query: 713  RWNNNKIVGLLDSTGSIVTNHAELASIAVDHFKDVIGE-ERPVDPFPDD---LILPKLLD 880
            R + N I L+DS G ++ + + V +++ ++G E P +D L+ +
Sbjct: 238  RKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRCSQ 297

Query: 881  SQKSGLIANVTPAEILRALKSMAKNKSPGPDGFSPEFYLTTWDIVGADVIAGISSFFTSL 1060
            Q S L + T EI A KS+ +NK+ GPDG+S EF+ TW I+G +V+A I FF S
Sbjct: 298  DQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSG 357

Query: 1061 HLLRIINATAISLIPKENS 1117
            LL+ NAT + LIPK ++
Sbjct: 358  QLLKQWNATTLVLIPKTSN 376

>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana]
          Length = 1072

 Score = 205 bits (522), Expect = 1e-50
 Identities = 132/379 (34%), Positives = 198/379 (52%), Gaps = 10/379 (2%)
 Frame = +2

Query: 11   VMGDFNAFISLDETASGSS-RWTTSMIEFKDCLFSLGITDLNYIGCPFTWWDKSRSQPLV 187
            ++GDFN + E ++ S M +F CL + ++DL + G FTWW+KS +P+
Sbjct: 1    MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60

Query: 188  RKLDRVLVNTSWINVFPNSFANFLPRGLSDHSPATVCLGQAQVKLNKPFQVFHHMLTHPD 367
            +KLDR+L N SW N++P+S F SDH V L + +PF+ F+ +L + D
Sbjct: 61   KKLDRILANDSWCNLYPSSHGLFGNLDFSDHVSCGVVLEANGISAKRPFKFFNFLLKNED 120

Query: 368  FLNVVKEAW-DTPISGDSWFILTSKLKLVKSGLK---RLN-SLVGNVQSAVHIARIDLHN 532
            FLNVV + W T + G S + ++ KLK +K +K RLN S + H I N
Sbjct: 121  FLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLITCQN 180

Query: 533  FQAALPNIPSQAQLDEEARLLGIFSSALDIEEQFLRQKSKAHWLKSGDGNNKYFFNYCRG 712
            A P++ S A L+ EA+ + S EE F Q+S+ W GD N YF
Sbjct: 181  LTLANPSV-SNAALELEAQRKWVLLSC--AEESFFHQRSRVSWFAEGDSNTHYFHRMVDS 237

Query: 713  RWNNNKIVGLLDSTGSIVTNHAELASIAVDHFKDVIGE-ERPVDPFPDD---LILPKLLD 880
            R + N I L+DS G ++ + + V +++ ++G E P +D L+ +
Sbjct: 238  RKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRCSQ 297

Query: 881  SQKSGLIANVTPAEILRALKSMAKNKSPGPDGFSPEFYLTTWDIVGADVIAGISSFFTSL 1060
            Q S L + T EI A KS+ +NK+ GPDG+S EF+ TW I+G +V+A I FF S
Sbjct: 298  DQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFDSG 357

Query: 1061 HLLRIINATAISLIPKENS 1117
            LL+ NAT + LIPK ++
Sbjct: 358  QLLKQWNATTLVLIPKTSN 376

>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis thaliana]
          Length = 1253

 Score = 202 bits (515), Expect = 9e-50
 Identities = 128/380 (33%), Positives = 198/380 (52%), Gaps = 11/380 (2%)
 Frame = +2

Query: 2    PWCVMGDFNAFISLDETASGSS-RWTTSMIEFKDCLFSLGITDLNYIGCPFTWWDKSRSQ 178
            PW ++GDFN + E + +S M F+DCLF + DL + G FTWW+KS ++
Sbjct: 87   PWIMLGDFNQVLCPAEHSQATSLNVNRRMKVFRDCLFEAELCDLVFKGNTFTWWNKSATR 146

Query: 179  PLVRKLDRVLVNTSWINVFPNSFANFLPRGLSDHSPATVCLGQAQVKLNKPFQVFHHMLT 358
            P+ +KLDR+LVN SW + FP+++A F SDH+ V + + +PF+ ++ +L
Sbjct: 147  PVAKKLDRILVNESWCSRFPSAYAVFGEPDFSDHASCGVIINPLMHREKRPFRFYNFLLQ 206

Query: 359  HPDFLNVVKEAW-DTPISGDSWFILTSKLKLVKS-----GLKRLNSLVGNVQSAVHIARI 520
            +PDF+++V E W + G S F ++ KLK +K+ ++ ++L V+ A H +
Sbjct: 207  NPDFISLVGELWYSINVVGSSMFKMSKKLKALKNPIRTFSMENFSNLEKRVKEA-HNLVL 265

Query: 521  DLHNFQAALPNIPSQAQLDEEARLLGIFSSALDIEEQFLRQKSKAHWLKSGDGNNKYFFN 700
            N + P IP+ A E R I A EE F Q+S+ W+ GD N YF
Sbjct: 266  YRQNKTLSDPTIPNAALEMEAQRKWLILVKA---EESFFCQRSRVTWMGEGDSNTSYFHR 322

Query: 701  YCRGRWNNNKIVGLLDSTGSIVTNHAELASIAVDHFKDVIGEE--RPVDPFPD-DLILP- 868
            R N I ++D G + + +++F +++G E P+ D DL+LP
Sbjct: 323  MADSRKAVNTIHIIIDDNGVKIDTQLGIKEHCIEYFSNLLGGEVGPPMLIQEDFDLLLPF 382

Query: 869  KLLDSQKSGLIANVTPAEILRALKSMAKNKSPGPDGFSPEFYLTTWDIVGADVIAGISSF 1048
            + QK L + + +I A S NK+ GPDGF EF+ TW ++G +V +S F
Sbjct: 383  RCSHDQKKELAMSFSRQDIKSAFFSFPSNKTSGPDGFPVEFFKETWSVIGTEVTDAVSEF 442

Query: 1049 FTSLHLLRIINATAISLIPK 1108
            FTS LL+ NAT + LIPK
Sbjct: 443  FTSSVLLKQWNATTLVLIPK 462