BLASTX nr result
ID: Cephaelis21_contig00001934
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00001934
         (1755 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|ADK92871.1| retrotransposon protein [Hypericum perforatum]          54   7e-12
ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thalia...    53   1e-09
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...    48   6e-09
gb|EEC71228.1| hypothetical protein OsI_03168 [Oryza sativa Indi...    45   8e-09
dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]              56   1e-08

>gb|ADK92871.1| retrotransposon protein [Hypericum perforatum]
          Length = 593

 Score = 53.9 bits (128), Expect(2) = 7e-12
 Identities = 47/193 (24%), Positives = 87/193 (45%), Gaps = 1/193 (0%)
 Frame = +2

Query: 1091 LWHIWKARNLWIF-*HQRLKVHVVVQKAITEWQEFEAVTRSQKTPQAGRLVVTNTQRART 1267
            LW IW+ RN +F + + + ++K WQ+F + A R T+ A T
Sbjct: 384  LWIIWRVRNSIVFRTGEEIVICKELEKGFRFWQDFMDTEGNPTVRGAPR---TSKWNAPT 440

Query: 1268 AVEPGVISICVASESHFAGKDSGAGYTFQDDQGNLLLANVVFCSNVQHLYVVELDAIRSA 1447
            A G I V + A + G +DD G ++A N+ H ++E A+ +
Sbjct: 441  A---GFYKINVDAGLR-AERGGQVGIVVRDDTGAFVMATTRSFPNLVHPTLLEGQAVYTG 496

Query: 1448 LTTAKQRNIDRVDIGLDDKIVVAWLQEKTPTTTDGTTILEDILLLKSQFLCCKFIFIQRK 1627
            L A ++RV++ D VV L + +D + I++D +L S F + ++R+
Sbjct: 497  LEFANALGLERVELESDCLPVVMQLSKGYTDRSDLSNIIDDCKMLLSNFQQVRIAHVRRE 556

Query: 1628 WNECSYRLAQFVL 1666
            N+ ++ +A+ +
Sbjct: 557  ANQAAHEMAKMTI 569

 Score = 44.3 bits (103), Expect(2) = 7e-12
 Identities = 20/43 (46%), Positives = 24/43 (55%)
 Frame = +3

Query: 885 LATTVALKRRGVQVGDTCKRCGSAK*TIEHLFFHCPESLFIWK 1013
           L+ L RRG+QV + C C T HLFF CP +L IWK
Sbjct: 305 LSVRTNLTRRGIQVDEVCPCCAGPSETAAHLFFCCPYTLDIWK 347

>ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thaliana]
 gi|4972055|emb|CAB43923.1| putative protein [Arabidopsis thaliana]
 gi|7269807|emb|CAB79667.1| putative protein [Arabidopsis thaliana]
 gi|67633766|gb|AAY78807.1| putative reverse transcriptase/RNA-dependent DNA polymerase [Arabidopsis thaliana]
 gi|332660185|gb|AEE85585.1| Ribonuclease H-like protein [Arabidopsis thaliana]
          Length = 575

 Score = 53.1 bits (126), Expect(2) = 1e-09
 Identities = 46/206 (22%), Positives = 94/206 (45%)
 Frame = +2

Query: 1061 QKGDELTGYFLWHIWKARNLWIF*HQRLKVHVVVQKAITEWQEFEAVTRSQKTPQAGRLV 1240
            +K +L + LW +WK RN +F + V+++A + +E+ T ++ ++
Sbjct: 354  EKASQLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVN 413

Query: 1241 VTNTQRARTAVEPGVISICVASESHFAGKDSGAGYTFQDDQGNLLLANVVFCSNVQHLYV 1420
            ++ R R V A+ + + G G+ ++++G + ++ +
Sbjct: 414  RSSCGRWRPPPHQWVKCNTDATWNR-DNERCGIGWVLRNEKGEVKWMGARALPKLKSVLE 472

Query: 1421 VELDAIRSALTTAKQRNIDRVDIGLDDKIVVAWLQEKTPTTTDGTTILEDILLLKSQFLC 1600
            EL+A+R A+ + + + V D ++++ L + TI +D+ L SQF
Sbjct: 473  AELEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNNDEIWPSLKPTI-QDLQRLLSQFTE 531

Query: 1601 CKFIFIQRKWNECSYRLAQFVLSRKN 1678
            KF+FI R+ N + R+A+ LS N
Sbjct: 532  VKFVFIPREGNTLAERVARESLSFLN 557

 Score = 37.7 bits (86), Expect(2) = 1e-09
 Identities = 27/114 (23%), Positives = 45/114 (39%), Gaps = 2/114 (1%)
 Frame = +3

Query: 690 WTAKNNGIFIVKSAYNLSQTLKEGRMAKAETSKAREDG--RRMWRXXXXXXXXXXXXXXX 863
           W ++G + VKS Y + + R + E S+ + +++W+
Sbjct: 215 WDYTSSGDYTVKSGYWVLTQIINKRSSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWK-- 272

Query: 864 XXCIHGWLATTVALKRRGVQVGDTCKRCGSAK*TIEHLFFHCPESLFIWKLAPI 1025
           C+ L AL R + C RC S K T+ HL F C + W ++ I
Sbjct: 273 --CLSNSLPVAGALAYRHLSKESACIRCPSCKETVNHLLFKCTFARLTWAISSI 324

>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
          Length = 1374

 Score = 48.1 bits (113), Expect(2) = 6e-09
 Identities = 50/204 (24%), Positives = 86/204 (42%), Gaps = 3/204 (1%)
 Frame = +2

Query: 1076 LTGYFLWHIWKARNLWIF*HQRLKVHVVVQKAITEWQEFEAVTRSQKTPQAGRLVVTNTQ 1255
            L + LW +WK RN +F + V+ KA + + R + PQ V ++T+
Sbjct: 1156 LIPWILWRLWKNRNDLVFKGREFTAPQVILKATEDMDAWN--NRKEPQPQ----VTSSTR 1209

Query: 1256 RARTAVEPGVISICVASESHFAGKD---SGAGYTFQDDQGNLLLANVVFCSNVQHLYVVE 1426
            +P + KD G G+ ++ G LL + + Q + E
Sbjct: 1210 DRCVKWQPPSHGWVKCNTDGAWSKDLGNCGVGWVLRNHTGRLLWLGLRALPSQQSVLETE 1269

Query: 1427 LDAIRSALTTAKQRNIDRVDIGLDDKIVVAWLQEKTPTTTDGTTILEDILLLKSQFLCCK 1606
            ++A+R A+ + + N RV D + +V+ +Q + + I +DI L F K
Sbjct: 1270 VEALRWAVLSLSRFNYRRVIFESDSQYLVSLIQNEMDIPSLAPRI-QDIRNLLRHFEEVK 1328

Query: 1607 FIFIQRKWNECSYRLAQFVLSRKN 1678
            F F +R+ N + R A+ LS N
Sbjct: 1329 FQFTRREGNNVADRTARESLSLMN 1352

 Score = 40.0 bits (92), Expect(2) = 6e-09
 Identities = 28/121 (23%), Positives = 48/121 (39%), Gaps = 2/121 (1%)
 Frame = +3

Query: 669  KTRTVLVWTAKNNGIFIVKSAYNLSQTLKEGRMAKAETSKAREDG--RRMWRXXXXXXXX 842
            +TR W +G + VKS Y + + R E + D +++W+
Sbjct: 1005 ETRDRFTWEYSRSGHYSVKSGYWVMTEIINQRNNPQEVLQPSLDPIFQQIWKLDVPPKIH 1064

Query: 843  XXXXXXXXXCIHGWLATTVALKRRGVQVGDTCKRCGSAK*TIEHLFFHCPESLFIWKLAP 1022
            C++ L+ L R + +C RC S T+ HL F CP + W ++P
Sbjct: 1065 HFLWR----CVNNCLSVASNLAYRHLAREKSCVRCPSHGETVNHLLFKCPFARLTWAISP 1120

Query: 1023 I 1025
            +
Sbjct: 1121 L 1121

>gb|EEC71228.1| hypothetical protein OsI_03168 [Oryza sativa Indica Group]
          Length = 995

 Score = 45.1 bits (105), Expect(2) = 8e-09
 Identities = 30/111 (27%), Positives = 46/111 (41%), Gaps = 4/111 (3%)
 Frame = +3

Query: 690 WTAKNNGIFIVKSAYNLSQT----LKEGRMAKAETSKAREDGRRMWRXXXXXXXXXXXXX 857
           W G+F V+SAYNL ++ + + + + S A D + W+
Sbjct: 673 WPHDKRGLFTVRSAYNLVRSNLFVVAQSSNGRGQHSGANVDS-QFWKALWTINAPGKMLI 731

Query: 858 XXXXCIHGWLATTVALKRRGVQVGDTCKRCGSAK*TIEHLFFHCPESLFIW 1010
           +H L T L+RR V + C CG IEH+F CP + +W
Sbjct: 732 HLWRSVHDCLPTGFQLRRRHVPATEGCIFCGHDD-RIEHVFLVCPFAATVW 781

 Score = 42.7 bits (99), Expect(2) = 8e-09
 Identities = 48/195 (24%), Positives = 73/195 (37%), Gaps = 4/195 (2%)
 Frame = +2

Query: 1040 IATDSHIQKGDELTGYFLWHIWKARNLW----IF*HQRLKVHVVVQKAITEWQEFEAVTR 1207
            + SHIQK + L HIW+ARN + H R +H +V V
Sbjct: 808  LTRSSHIQK--TVLAVTLRHIWEARNFSRNNPVITHPRQVIHKIVSYVDM------IVQH 859

Query: 1208 SQKTPQAGRLVVTNTQRARTAVEPGVISICVASESHFAGKDSGAGYTFQDDQGNLLLANV 1387
            K A + T PG++ I + A +G + +D +LA
Sbjct: 860  CPKDRNASGCDLPLPVTKWTPPPPGMVLINSDAALFQASNQTGLAFVIRDHSATCMLAAN 919

Query: 1388 VFCSNVQHLYVVELDAIRSALTTAKQRNIDRVDIGLDDKIVVAWLQEKTPTTTDGTTILE 1567
            + + + E IR AL AK V + D V+ +Q + I+
Sbjct: 920  KRITGLLSPELAEALVIRFALEHAKAEGFQNVLMASDCLSVIKRIQSGARDLSVVGVIVR 979

Query: 1568 DILLLKSQFLCCKFI 1612
            DI L+++FL C FI
Sbjct: 980  DIKKLETEFLECSFI 994

>dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score = 56.2 bits (134), Expect(2) = 1e-08
 Identities = 50/191 (26%), Positives = 84/191 (43%), Gaps = 1/191 (0%)
 Frame = +2

Query: 1085 YFLWHIWKARNLWIF*HQRLKVHVVVQKAITEWQEFEAVTRSQKTPQ-AGRLVVTNTQRA 1261
            Y LW++WKARN +F + ++ ++ E E + + Q A + V + A
Sbjct: 1145 YILWNLWKARNRLVFDNNITAPSDILNRSFMESSEARCLLAKRTGLQTAFQTWVVWSPPA 1204

Query: 1262 RTAVEPGVISICVASESHFAGKDSGAGYTFQDDQGNLLLANVVFCSNVQHLYVVELDAIR 1441
            + C S SH A AG +++ G L +A + + ++ EL +R
Sbjct: 1205 AGFTKLNSDGAC-KSHSHLAS----AGGLLRNENG-LWVAGYICNIGTANSFLAELWGLR 1258

Query: 1442 SALTTAKQRNIDRVDIGLDDKIVVAWLQEKTPTTTDGTTILEDILLLKSQFLCCKFIFIQ 1621
            L AK R ++ D + VV L++ P T D + +++D LL F K I
Sbjct: 1259 EGLLLAKNRGFTKLIAETDSEAVVQVLRKDGPVTPDASILVKDCKLLLDHFQEIKVTHIL 1318

Query: 1622 RKWNECSYRLA 1654
            R+ N+C+ LA
Sbjct: 1319 REGNQCADFLA 1329

 Score = 31.2 bits (69), Expect(2) = 1e-08
 Identities = 16/45 (35%), Positives = 21/45 (46%)
 Frame = +3

Query: 885  LATTVALKRRGVQVGDTCKRCGSAK*TIEHLFFHCPESLFIWKLA 1019
            L V KRRG+ +C CG T++HLF C + W A
Sbjct: 1061 LMVNVERKRRGLADAASCPVCGEEDETLDHLFRRCLLAEACWDSA 1105