BLASTX nr result
ID: Coptis21_contig00012050
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis21_contig00012050 (1084 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAD24831.1| putative non-LTR retroelement reverse transcripta... 100 6e-19 emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga... 99 2e-18 ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis ... 97 9e-18 gb|EEE69144.1| hypothetical protein OsJ_28268 [Oryza sativa Japo... 96 2e-17 gb|AAD20714.1| putative non-LTR retroelement reverse transcripta... 96 2e-17 >gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1524 Score = 100 bits (249), Expect = 6e-19 Identities = 104/389 (26%), Positives = 161/389 (41%), Gaps = 35/389 (8%) Frame = -3 Query: 1067 WNHNRINSLFTAEIATEIYKIHLDHNCVSEPWIWTTERNGKFTVKSTYNWYMQLLP--NS 894 W+ ++I+ I++I+L + + IW G++TV+S Y W + P N Sbjct: 1128 WDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGY-WLLTHDPSTNI 1186 Query: 893 PSSTP------IPANIWN--IIRKLKSPLRIQLFIWQLLHSILPTKLFLTHRHMATNAIC 738 P+ P + IWN I+ KLK F+W+ L L T LT R M + IC Sbjct: 1187 PAINPPHGSIDLKTRIWNLPIMPKLKH------FLWRALSQALATTERLTTRGMRIDPIC 1240 Query: 737 PRCTQAPESIDHALLTCPSLATTWFTSPLSL-RTQSHTAFLDTFLKISMETQPKDDILYN 561 PRC + ESI+HAL TCP W+ S SL R Q +S + + + N Sbjct: 1241 PRCHRENESINHALFTCPFATMAWWLSDSSLIRNQ----------LMSNDFEENISNILN 1290 Query: 560 LTHFANLSDF-----------IWLDRNQLIFTPNHKPLSSIKLVAQSTT----------S 444 +SDF IW RN ++F + S L A++ T Sbjct: 1291 FVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHK 1350 Query: 443 FLPSKPKPLCKYYIPKIFLPELFL-ITIDGGY--ESLSKTGGIGLTICKWSCDILFAGSK 273 PS + + + I P ++ D G+ + L TGG + + I + K Sbjct: 1351 KTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGG-WIIRNHYGTPISWGSMK 1409 Query: 272 HCIASGPEEMEFQALLWGLEKALDLDIKMVVFVSDCKSMVDAVNGQIACSSWELEDLRSK 93 S P E E +ALL L++ V DC+++++ +NG SS L + Sbjct: 1410 LAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFHSS--LANHLED 1467 Query: 92 IQFIRQSFGFCRFVYCSRSETQHAHLLAE 6 I F F +F + R + AH+LA+ Sbjct: 1468 ISFWANKFASIQFGFIRRKGNKLAHVLAK 1496 >emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1369 Score = 98.6 bits (244), Expect = 2e-18 Identities = 94/372 (25%), Positives = 163/372 (43%), Gaps = 15/372 (4%) Frame = -3 Query: 1076 NNSWNHNRINSLFTAEIATEIYKIHLDHNCVSEPWIWTTERNGKFTVKSTYNWYMQLLPN 897 N+ WN +N+LF +T I +I + + W+W +NG+FTV+S Y Y +LL + Sbjct: 978 NDRWNVELLNTLFQPWESTAIQRIPVALQKKPDQWMWMMSKNGQFTVRSAY--YHELLED 1035 Query: 896 ---SPSSTPIP-ANIWNIIRKLKSPLRIQLFIWQLLHSILPTKLFLTHRHMATNAICPRC 729 PS++ P +W I K K P +++LF W+ +H+ L + R M + CPRC Sbjct: 1036 RKTGPSTSRGPNLKLWQKIWKAKIPPKVKLFSWKAIHNGLAVYTNMRKRGMNIDGACPRC 1095 Query: 728 TQAPESIDHALLTCPSLATTWFTSPLSLRTQSHTAFLDTFLKISMETQPKDDILYNLTHF 549 + E+ +H + C + W+ SPL + T + A S+ KD + L F Sbjct: 1096 GEKEETTEHLIWGCDESSRAWYISPLRIHTGNIEAGSFRIWVESLLDTHKDTEWWAL--F 1153 Query: 548 ANLSDFIWLDRNQLIFTPNHKPLSSIKLVAQSTTSFLPSKPKPLCKYYIPKIFL------ 387 + IWL RN+ +F K L+ ++V ++ + + + C + P L Sbjct: 1154 WMICWNIWLGRNKWVF--EKKKLAFQEVVERAVRGVMEFEEE--CAHTSPVETLNTHENG 1209 Query: 386 ---PELFLITIDGGYESLSKTG-GIGLTICKWSCDILFAGSKHCIA-SGPEEMEFQALLW 222 P + ++ ++ G G+G + D+L A A P E +L + Sbjct: 1210 WSVPPVGMVKLNVDAAVFKHVGIGMGGVVRDAEGDVLLATCCGGWAMEDPAMAEACSLRY 1269 Query: 221 GLEKALDLDIKMVVFVSDCKSMVDAVNGQIACSSWELEDLRSKIQFIRQSFGFCRFVYCS 42 GL+ A + + +V DCK + + G+ A + I ++ F + Sbjct: 1270 GLKVAYEAGFRNLVVEMDCKKLFLQLRGK-ASDVTPFGRVVDDILYLASKCSNVVFEHVK 1328 Query: 41 RSETQHAHLLAE 6 R + AHLLA+ Sbjct: 1329 RHCNKVAHLLAQ 1340 >ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis thaliana] gi|6682231|gb|AAF23283.1|AC016661_8 putative non-LTR reverse transcriptase [Arabidopsis thaliana] gi|332641254|gb|AEE74775.1| RNase H domain-containing protein [Arabidopsis thaliana] Length = 484 Score = 96.7 bits (239), Expect = 9e-18 Identities = 103/389 (26%), Positives = 159/389 (40%), Gaps = 35/389 (8%) Frame = -3 Query: 1067 WNHNRINSLFTAEIATEIYKIHLDHNCVSEPWIWTTERNGKFTVKSTYNWYMQLLP--NS 894 W+ ++I+ I++I+L + + IW G++TV+S Y W + P N Sbjct: 88 WDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGY-WLLTHDPSTNI 146 Query: 893 PSSTP------IPANIWN--IIRKLKSPLRIQLFIWQLLHSILPTKLFLTHRHMATNAIC 738 P+ P + IWN I+ KLK F+W+ L L T LT R M + C Sbjct: 147 PAINPPHGSIDLKTRIWNLPIMPKLKH------FLWRALSQALATTERLTTRGMRIDPSC 200 Query: 737 PRCTQAPESIDHALLTCPSLATTWFTSPLSL-RTQSHTAFLDTFLKISMETQPKDDILYN 561 PRC + ESI+HAL TCP W S SL R Q +S + + + N Sbjct: 201 PRCHRENESINHALFTCPFATMAWRLSDSSLIRNQ----------LMSNDFEENISNILN 250 Query: 560 LTHFANLSDF-----------IWLDRNQLIFTPNHKPLSSIKLVAQSTT----------S 444 +SDF IW RN ++F + S L A++ T Sbjct: 251 FVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHK 310 Query: 443 FLPSKPKPLCKYYIPKIFLPELFL-ITIDGGY--ESLSKTGGIGLTICKWSCDILFAGSK 273 PS + + + I P ++ D G+ + L TGG + + I + K Sbjct: 311 KTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGG-WIIRNHYGTPISWGSMK 369 Query: 272 HCIASGPEEMEFQALLWGLEKALDLDIKMVVFVSDCKSMVDAVNGQIACSSWELEDLRSK 93 S P E E +ALL L++ V DC+++++ +NG SS L + Sbjct: 370 LAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFHSS--LANHLED 427 Query: 92 IQFIRQSFGFCRFVYCSRSETQHAHLLAE 6 I F F +F + R + AH+LA+ Sbjct: 428 ISFWANKFASIQFGFIRRKGNKLAHVLAK 456 >gb|EEE69144.1| hypothetical protein OsJ_28268 [Oryza sativa Japonica Group] Length = 1256 Score = 95.9 bits (237), Expect = 2e-17 Identities = 94/393 (23%), Positives = 159/393 (40%), Gaps = 37/393 (9%) Frame = -3 Query: 1076 NNSWNHNRINSLFTAEIATEIYKIHLDHNCVSEPWIWTTERNGKFTVKSTYNWYM---QL 906 + SW++++I+ F A I IHL + W +R G+F+V+S Y+ + Q+ Sbjct: 843 SGSWDNDKISHHFLPMDAEAILNIHLSSRLEEDFIAWHPDRLGRFSVRSAYHLAVALAQV 902 Query: 905 LPNSPSSTPIPANIWNIIRKLKSPLRIQLFIWQLLHSILPTKLFLTHRHMATNAICPRCT 726 S SS + N + K +P ++++F W+ + + L T R + + IC C Sbjct: 903 NDGSSSSGNGSSRACNALWKCNAPQKVKIFAWRAITNSLTTLENKKKRKLEVSDICTICG 962 Query: 725 QAPESIDHALLTCPSLATTW--FTSPLSLRTQS--HTAFLDTFLKISMETQPKDDILYNL 558 E + HAL CP W + + L S H L + ++QP+D ++ + Sbjct: 963 VESEYVVHALFHCPHARQLWEAMSDDMQLNPLSNIHNGDSKVILDLLEQSQPEDQVMLLM 1022 Query: 557 THFANLSDFIWLDRNQLIFTPNHKPLSSIKLVAQSTTSFL-------------PSKPKPL 417 + IW RN+++ + KP I + + S++ P K K + Sbjct: 1023 VLWR-----IWHTRNEIV---HGKPAPGILVSKRFIESYVLSLAEIKQHPQANPEKGKHV 1074 Query: 416 CKYYIPKIF----------------LPELFLITIDGGYESLSKTGGIGLTICKWSCDILF 285 + K LP + +DG ++ GGIG + + +++F Sbjct: 1075 VDVVLKKSHSIKRSREPAPDKWSKPLPGSMKLNVDGSFQESEGKGGIGAVLRNCTGEVIF 1134 Query: 284 AGSKHC-IASGPEEMEFQALLWGLEKALDLDIKMVVFVSDCKSMVDAVNGQIACSSWELE 108 A H S EME A GL AL + +V +DC +MV S EL Sbjct: 1135 AACGHVDHCSSALEMELLASRDGLALALQWTLLPIVIETDCLAMVHLFRDATGAKS-ELA 1193 Query: 107 DLRSKIQFIRQSFGFCRFVYCSRSETQHAHLLA 9 L ++I + F C RS+ +H LA Sbjct: 1194 FLITEIDSLLVGNRDISFNKCLRSQNLISHCLA 1226 >gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1750 Score = 95.5 bits (236), Expect = 2e-17 Identities = 102/389 (26%), Positives = 159/389 (40%), Gaps = 35/389 (8%) Frame = -3 Query: 1067 WNHNRINSLFTAEIATEIYKIHLDHNCVSEPWIWTTERNGKFTVKSTYNWYMQLLP--NS 894 W+ ++I+ I++I+L + + IW G++TV+S Y W + P N Sbjct: 1354 WDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGY-WLLTHDPSTNI 1412 Query: 893 PSSTP------IPANIWN--IIRKLKSPLRIQLFIWQLLHSILPTKLFLTHRHMATNAIC 738 P+ P + IWN I+ KLK F+W+ L L T LT R M + C Sbjct: 1413 PAINPPHGSIDLKTRIWNLPIMPKLKH------FLWRALSQALATTERLTTRGMRIDPSC 1466 Query: 737 PRCTQAPESIDHALLTCPSLATTWFTSPLSL-RTQSHTAFLDTFLKISMETQPKDDILYN 561 PRC + ESI+HAL TCP W S SL R Q +S + + + N Sbjct: 1467 PRCHRENESINHALFTCPFATMAWRLSDSSLIRNQ----------LMSNDFEENISNILN 1516 Query: 560 LTHFANLSDF-----------IWLDRNQLIFTPNHKPLSSIKLVAQSTT----------S 444 +SDF IW RN ++F + S L A++ T Sbjct: 1517 FVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHK 1576 Query: 443 FLPSKPKPLCKYYIPKIFLPELFL-ITIDGGY--ESLSKTGGIGLTICKWSCDILFAGSK 273 PS + + + I P ++ D G+ + L TGG + + I + K Sbjct: 1577 KTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGG-WIIRNHYGTPISWGSMK 1635 Query: 272 HCIASGPEEMEFQALLWGLEKALDLDIKMVVFVSDCKSMVDAVNGQIACSSWELEDLRSK 93 S P E E +ALL L++ V DC+++++ +NG SS L + Sbjct: 1636 LAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFHSS--LANHLED 1693 Query: 92 IQFIRQSFGFCRFVYCSRSETQHAHLLAE 6 I F F +F + + + AH+LA+ Sbjct: 1694 ISFWANKFASIQFGFIRKKGNKLAHVLAK 1722