BLASTX nr result
ID: Cephaelis21_contig00021886
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00021886 (965 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|ABI34321.1| RNase H family protein [Solanum demissum] 85 2e-14 ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis ... 82 3e-13 gb|AAD20714.1| putative non-LTR retroelement reverse transcripta... 82 3e-13 gb|AAD24831.1| putative non-LTR retroelement reverse transcripta... 80 6e-13 emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulga... 77 6e-12 >gb|ABI34321.1| RNase H family protein [Solanum demissum] Length = 945 Score = 85.1 bits (209), Expect = 2e-14 Identities = 77/326 (23%), Positives = 140/326 (42%), Gaps = 7/326 (2%) Frame = -1 Query: 965 FSLSSAYLLLRQHGQIMPDITPFWDTRIPRKISIFGWKLFNGFLPFPEVLMSFGFHLPSK 786 F+ SAY+ + W + P K+S W+L LPF + + F ++ S Sbjct: 548 FTTKSAYVDCSNTREKNDMRNKIWHGKFPFKMSFLTWRLVQNKLPFYDTVGKFVDNIDSN 607 Query: 785 CPCCPAL--DSLDHSLLQCPTAQYLWKYFSDMFELQFDSSAGIFELLHS--NLSMHNSTP 618 C CC + ++++H L A YLWK F + +S+ I LL + N+ HNS Sbjct: 608 CVCCKNMKTETINHVFLNSDVASYLWKKFGGTLGIDTRASSTI-NLLKTWWNVQTHNSI- 665 Query: 617 EYSYLHQIIPLLITWTLWRLRNSILFNGAKAVHSQLIGHVIFLLHSIFVKQPLP---ISQ 447 ++ + +P+LI W +W+ R + + K + + + + ++ + ++ P I Sbjct: 666 -HNVIIHTLPILIFWEIWKRRCACKYGDQKKMWYRTMENHVWWNLKMSLRMTFPSFEIGN 724 Query: 446 KTTTLHNCPILPTVIRKRKILRVSWLLPPFSKLAINVDXXXXXXXXXXXXGFILRDWRGQ 267 L N K KI V W P + + IN D G+I+RD + Sbjct: 725 SWRDLLNKVESLRPYPKWKI--VHWNTPNINCVKINTD--GSFSSGNAGLGWIVRDHTRR 780 Query: 266 ILYARSEYFGSGTSFQAEVQCLLRGLQYCTEMNLTSVRIESDSKTLVDMVNSRASWPWRQ 87 ++ A S ++ AE G+ +C + + +E DSK +VDMV + + + Sbjct: 781 MIMAFSIPSSCSSNNLAEALAARFGILWCLQQGFHNCYLELDSKLVVDMVRNGQATNLKI 840 Query: 86 *HQITQICNLLQSSSSALSHIFRETN 9 + I ++ + ++H +RE N Sbjct: 841 KGVVEDIIQVVAKMNCEVNHCYREAN 866 >ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis thaliana] gi|6682231|gb|AAF23283.1|AC016661_8 putative non-LTR reverse transcriptase [Arabidopsis thaliana] gi|332641254|gb|AEE74775.1| RNase H domain-containing protein [Arabidopsis thaliana] Length = 484 Score = 81.6 bits (200), Expect = 3e-13 Identities = 69/275 (25%), Positives = 118/275 (42%), Gaps = 3/275 (1%) Frame = -1 Query: 929 HGQIMPDI-TPFWDTRIPRKISIFGWKLFNGFLPFPEVLMSFGFHLPSKCPCCPAL-DSL 756 HG I D+ T W+ I K+ F W+ + L E L + G + CP C +S+ Sbjct: 153 HGSI--DLKTRIWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESI 210 Query: 755 DHSLLQCPTAQYLWKYFSDMFELQFDSSAGIFELLHSNLSMHNSTPEYSYLHQIIPLLIT 576 +H+L CP A W+ SD ++ + FE SN+ S H+++P+ + Sbjct: 211 NHALFTCPFATMAWR-LSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLI 269 Query: 575 WTLWRLRNSILFNGAKAVHSQLIGHVIFLLHSIFVKQPLPISQKTTTLHNCPILPTVIRK 396 W +W+ RN+++FN + S+ + + K T H PT R+ Sbjct: 270 WRIWKARNNVVFNKFRESPSKTV---------LSAKAETHDWLNATQSHKKTPSPT--RQ 318 Query: 395 RKILRVSWLLPPFSKLAINVDXXXXXXXXXXXXGFILRDWRG-QILYARSEYFGSGTSFQ 219 ++ W PP + + N D G+I+R+ G I + + + + Sbjct: 319 IAENKIEWRNPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLE 378 Query: 218 AEVQCLLRGLQYCTEMNLTSVRIESDSKTLVDMVN 114 AE + LL LQ T V +E D +TL++++N Sbjct: 379 AETKALLAALQQTWIRGYTQVFMEGDCQTLINLIN 413 >gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1750 Score = 81.6 bits (200), Expect = 3e-13 Identities = 69/275 (25%), Positives = 118/275 (42%), Gaps = 3/275 (1%) Frame = -1 Query: 929 HGQIMPDI-TPFWDTRIPRKISIFGWKLFNGFLPFPEVLMSFGFHLPSKCPCCPAL-DSL 756 HG I D+ T W+ I K+ F W+ + L E L + G + CP C +S+ Sbjct: 1419 HGSI--DLKTRIWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESI 1476 Query: 755 DHSLLQCPTAQYLWKYFSDMFELQFDSSAGIFELLHSNLSMHNSTPEYSYLHQIIPLLIT 576 +H+L CP A W+ SD ++ + FE SN+ S H+++P+ + Sbjct: 1477 NHALFTCPFATMAWR-LSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLI 1535 Query: 575 WTLWRLRNSILFNGAKAVHSQLIGHVIFLLHSIFVKQPLPISQKTTTLHNCPILPTVIRK 396 W +W+ RN+++FN + S+ + + K T H PT R+ Sbjct: 1536 WRIWKARNNVVFNKFRESPSKTV---------LSAKAETHDWLNATQSHKKTPSPT--RQ 1584 Query: 395 RKILRVSWLLPPFSKLAINVDXXXXXXXXXXXXGFILRDWRG-QILYARSEYFGSGTSFQ 219 ++ W PP + + N D G+I+R+ G I + + + + Sbjct: 1585 IAENKIEWRNPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLE 1644 Query: 218 AEVQCLLRGLQYCTEMNLTSVRIESDSKTLVDMVN 114 AE + LL LQ T V +E D +TL++++N Sbjct: 1645 AETKALLAALQQTWIRGYTQVFMEGDCQTLINLIN 1679 >gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1524 Score = 80.5 bits (197), Expect = 6e-13 Identities = 69/275 (25%), Positives = 118/275 (42%), Gaps = 3/275 (1%) Frame = -1 Query: 929 HGQIMPDI-TPFWDTRIPRKISIFGWKLFNGFLPFPEVLMSFGFHLPSKCPCCPAL-DSL 756 HG I D+ T W+ I K+ F W+ + L E L + G + CP C +S+ Sbjct: 1193 HGSI--DLKTRIWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPICPRCHRENESI 1250 Query: 755 DHSLLQCPTAQYLWKYFSDMFELQFDSSAGIFELLHSNLSMHNSTPEYSYLHQIIPLLIT 576 +H+L CP A W + SD ++ + FE SN+ S H+++P+ + Sbjct: 1251 NHALFTCPFATMAW-WLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLI 1309 Query: 575 WTLWRLRNSILFNGAKAVHSQLIGHVIFLLHSIFVKQPLPISQKTTTLHNCPILPTVIRK 396 W +W+ RN+++FN + S+ + + K T H PT R+ Sbjct: 1310 WRIWKARNNVVFNKFRESPSKTV---------LSAKAETHDWLNATQSHKKTPSPT--RQ 1358 Query: 395 RKILRVSWLLPPFSKLAINVDXXXXXXXXXXXXGFILRDWRG-QILYARSEYFGSGTSFQ 219 ++ W PP + + N D G+I+R+ G I + + + + Sbjct: 1359 IAENKIEWRNPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLE 1418 Query: 218 AEVQCLLRGLQYCTEMNLTSVRIESDSKTLVDMVN 114 AE + LL LQ T V +E D +TL++++N Sbjct: 1419 AETKALLAALQQTWIRGYTQVFMEGDCQTLINLIN 1453 >emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1389 Score = 77.0 bits (188), Expect = 6e-12 Identities = 78/307 (25%), Positives = 122/307 (39%), Gaps = 17/307 (5%) Frame = -1 Query: 896 WDTRIPRKISIFGWKLFNGFLPFPEVLMSFGFHLPSKCPCCPA-LDSLDHSLLQCPTAQY 720 W P KI F WK N L L +P C C +++ H QCP Sbjct: 1053 WGIHAPPKIKNFLWKACNDGLATTSRLERSHIFVPQNCCFCDCPSETICHLCFQCPFTLD 1112 Query: 719 LWKYFSDMFE----------LQFDSSAGIFELLHSNLSMHNSTPEYSYLHQIIPLLITWT 570 ++ + D F+ LQ S + E H NL++ YL ++ ++ W Sbjct: 1113 IYSHLEDKFQWPAYPSWFSTLQLSSFRSVLEACHINLTLE-------YLTKLS--IVWWH 1163 Query: 569 LWRLRNSILFNGAKAVHSQLIGHVIFLLHSI---FVKQPLPISQKTTTLHNCPILPTVIR 399 +W RN ++FN SQ F++HS + K L I T L LP Sbjct: 1164 VWYFRNKLIFNNESTSFSQ----ASFIIHSFMGKWEKANLEIPSFNTPLPKDCKLPVRSG 1219 Query: 398 KRKILRVSWLLPPFSKLAINVDXXXXXXXXXXXXGFILRDWRGQILYARSEYFGSGTS-F 222 K I W P L +N D GF++R+ G++L AR++ G S Sbjct: 1220 KNLI----WSPPNEDVLKVNFD-GSKLDNGQAAYGFVIRNSNGEVLMARAKALGVYPSIL 1274 Query: 221 QAEVQCLLRGLQYCTEMNLTSVRI--ESDSKTLVDMVNSRASWPWRQ*HQITQICNLLQS 48 AE LL G++ + S +I E D+ +++ ++ A+ PW I N++ Sbjct: 1275 MAEAMGLLEGIKGAISLQNWSRKIIFEGDNIAVINAMSPSATGPW-------TIANIILD 1327 Query: 47 SSSALSH 27 + + L H Sbjct: 1328 AGALLGH 1334