BLASTX nr result
ID: Cephaelis21_contig00044243
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00044243 (1042 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga... 195 1e-47 emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga... 177 4e-42 gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptas... 177 4e-42 gb|AAD37019.2| putative non-LTR retrolelement reverse transcript... 171 3e-40 dbj|BAE79385.1| unnamed protein product [Ipomoea batatas] 170 6e-40 >emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1378 Score = 195 bits (496), Expect = 1e-47 Identities = 120/349 (34%), Positives = 179/349 (51%), Gaps = 10/349 (2%) Frame = +3 Query: 9 RVEARGYAGGIWVFWK-DNIDVEILLNHNEFVHMKVSLNLSKDSFLFTAVYGIPQSKNRS 185 RVEA G+ GGIW+FWK + + V +H++ + +++ + D +LF+A+Y P S R Sbjct: 58 RVEAEGFRGGIWLFWKSEEVTVTPYGSHSQHLTVEIR-RIGDDPWLFSAIYASPDSTLRK 116 Query: 186 ELWKCLRLIAEVEILPWILAGDFNAILESDEKSGG-SDKNRRGCKHFRQCMKEINAQDLG 362 ELW+ L I PW+LAGDFN E++G S + +R CK F ++ DLG Sbjct: 117 ELWRELEQIKNQYTGPWLLAGDFNETSSLCERNGSESSEMQRRCKDFANWIENNALIDLG 176 Query: 363 FKGAKFTWSRGNL-----HERIDRAIGNLEFLELFPNVQVHHLMRVKSDHRPLQIHTEAG 527 F G TWSRG R+DR + N E+ F V +L + +SDH P+ I T +G Sbjct: 177 FTGPAHTWSRGLSPTTFKSARLDRGLANSEWKLKFTEGVVRNLPKSQSDHCPILIST-SG 235 Query: 528 ICSMKQ--RPFRFLSSWLMHQDFGEMVATNWKDSQTLTETVEKFSVAANTWNRDIFGHIL 701 + + +PFRF ++WL HQ F E V NW + ++ F+ N WN++ F +I Sbjct: 236 FAPVPRIIKPFRFQAAWLNHQVFCEFVRKNWNADAPIVPFLKSFADKLNKWNKEEFYNIF 295 Query: 702 QMKCRIMARIKGIQEALEKKPNRFXXXXXXXXXXXXXXXXDQEEALWKQKSCCDWIALGD 881 + K + ARI G+Q L D EE LW QKS + I GD Sbjct: 296 RKKSELWARISGVQALLSTGRQNHLIKLEAKLRREMDIVLDDEETLWFQKSRMEAICDGD 355 Query: 882 RNTKFFHSKAKAKRRTNFI*SLLIDD-VWVDDQEKLKAETINYFKKLYT 1025 RNT++FH +R N I L +D W+ + ++KA + Y+K L++ Sbjct: 356 RNTRYFHLSTVIRRSRNRIDMLQNNDGEWISNPMEVKAMVLGYWKHLFS 404 >emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1363 Score = 177 bits (449), Expect = 4e-42 Identities = 114/347 (32%), Positives = 168/347 (48%), Gaps = 9/347 (2%) Frame = +3 Query: 12 VEARGYAGGIWVFWKDNIDVEILLNH--NEFVHMKVSLNLSKDSFLFTAVYGIPQSKNRS 185 + A G GGIW+ WK +I + ++ N F H L L T ++ R+ Sbjct: 59 IPAFGKRGGIWLMWKADIALVHYADYQPNHF-HALFKLRSDIPEVLLTGMHAPSVVSERN 117 Query: 186 ELWKCLRLIAEVEILPWILAGDFNAILESDEKSGGSDKNRRGCKHFRQCMKEINAQDLGF 365 + W L + PW++AGD N +L +EK GG + K + + DLGF Sbjct: 118 KYWVDLTEDSPPRGTPWLVAGDMNEVLHGNEKMGGRQVGKEQGKQCKDWIAANALLDLGF 177 Query: 366 KGAKFTWSRGN-----LHERIDRAIGNLEFLELFPNVQVHHLMRVKSDHRPLQIHTEAGI 530 +G KFTW+ G + ER+DRA+ N E+L+LFP+ +V HL R SDH PL I Sbjct: 178 QGPKFTWTNGRTGGSLIKERLDRALVNSEWLDLFPDTKVIHLPRTFSDHCPLLILFNENP 237 Query: 531 CSMKQRPFRFLSSWLMHQDFGEMVATNW-KDSQTLTETVEKFSVAANTWNRDIFGHILQM 707 S + PFR W H DF ++ W + + F + +W++ +FG I Q Sbjct: 238 RS-ESFPFRCKEVWAYHPDFTNVIEETWGSHHNSYVAARDLFLSSVKSWSKYVFGSIFQK 296 Query: 708 KCRIMARIKGIQEALEKKPNRFXXXXXXXXXXXXXXXXDQEEALWKQKSCCDWIALGDRN 887 K RI+AR+ GIQ++L P+ F QE W QK+ D LGD N Sbjct: 297 KKRILARLGGIQKSLSIHPSVFLSKLEIDLLVELNELSKQERVFWAQKAGIDRAKLGDMN 356 Query: 888 TKFFHSKAKAKRRTNFI*SLLIDD-VWVDDQEKLKAETINYFKKLYT 1025 TK+FH+ AK + I L D+ WV + E LK +++F+K++T Sbjct: 357 TKYFHTLAKIRTCKRKISCLKNDNHDWVSNNEDLKKMMMSHFEKIFT 403 >gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H [Medicago truncatula] Length = 1296 Score = 177 bits (449), Expect = 4e-42 Identities = 112/356 (31%), Positives = 174/356 (48%), Gaps = 11/356 (3%) Frame = +3 Query: 6 HRVEARGYAGGIWVFWKDNIDV-EILLNHNEFVHMKVSLNLSKDSFLFTA--VYGIPQSK 176 H +EA G++GG+W+ ++ +L+ N++ ++ + + + + T +Y P Sbjct: 56 HIIEANGHSGGVWLLKHSTTNITSTVLDFNQY---SITFIIGRGAAITTCTCIYASPNYS 112 Query: 177 NRSELWKCLRLIAEVEILPWILAGDFNAILESDEKSGGSDKNRRGCKHFRQCMKEINAQD 356 R LW L I + PW+L GDFN E+ GG+ + R F M N D Sbjct: 113 MRPNLWNYLVNINDTITGPWMLIGDFNETHLPSEQRGGTFHHNRAAT-FSNFMNNCNLLD 171 Query: 357 LGFKGAKFTWSRGN-----LHERIDRAIGNLEFLELFPNVQVHHLMRVKSDHRPLQIHTE 521 L G +FTW + N L +++DR + N+++ FP V L R+ SDH PL + Sbjct: 172 LTTTGGRFTWHKNNNGIRILSKKLDRGMANVDWRLSFPEAFVEVLCRLHSDHNPLLLRFG 231 Query: 522 AGICSMKQRPFRFLSSWLMHQDFGEMVATNWKDSQTLTETVEKFSVAANT--WNRDIFGH 695 + RPFRF ++W+ H D+G +V +W + T T V N+ +N D+FG+ Sbjct: 232 GLPLTRGPRPFRFEAAWIDHYDYGNVVKRSW-STHTHNPTASLIKVMENSIIFNHDVFGN 290 Query: 696 ILQMKCRIMARIKGIQEALEKKPNRFXXXXXXXXXXXXXXXXDQEEALWKQKSCCDWIAL 875 I Q K R+ R+KG+Q LE+ + QEE LW QKS W+ L Sbjct: 291 IFQRKSRVEWRLKGVQSYLERVDSYRHTLLEKELQDEYNHILFQEEMLWYQKSREQWVKL 350 Query: 876 GDRNTKFFHSKAKAKRRTNFI*SL-LIDDVWVDDQEKLKAETINYFKKLYT*SGHP 1040 GD+NT FFH++ +R+ N I L L + + D L+ E + YFKK + S P Sbjct: 351 GDKNTAFFHAQTVIRRKWNKIHKLQLPNGISTSDSNILQEEALKYFKKFFCGSQIP 406 >gb|AAD37019.2| putative non-LTR retrolelement reverse transcriptase [Arabidopsis thaliana] Length = 855 Score = 171 bits (433), Expect = 3e-40 Identities = 110/346 (31%), Positives = 157/346 (45%), Gaps = 8/346 (2%) Frame = +3 Query: 12 VEARGYAGGIWVFWKDNI-DVEILLNHNEFVHMKVSLNLSKDSFLFTAVYGIPQSKNRSE 188 V+ARG +GGIW+ WK + DV I+ + +F+H KV L+ L AVY P RS Sbjct: 508 VDARGQSGGIWLLWKSEVGDVSIVESAEQFIHAKVGNGLAAIHLL--AVYAAPSVSRRSG 565 Query: 189 LWKCLRLIAEVEILPWILAGDFNAILESDEKSGGSDKNRRGCKHFRQCMKEINAQDLGFK 368 LW L I + P I+ D+GFK Sbjct: 566 LWSLLSRIVQSVDEPIIVG------------------------------------DMGFK 589 Query: 369 GAKFTWSRGNLH-----ERIDRAIGNLEFLELFPNVQVHHLMRVKSDHRPLQIHTEAGIC 533 G KFTW RG + +R+DR + + + V HL SDH P+ I E + Sbjct: 590 GNKFTWKRGRVESTFVAKRLDRVLCRPQTRLKWQEASVTHLPFFASDHAPIYIQLEPEVR 649 Query: 534 SMK-QRPFRFLSSWLMHQDFGEMVATNWKDSQTLTETVEKFSVAANTWNRDIFGHILQMK 710 S +RPFRF ++WL H F +++ +W + WNR++FG + + K Sbjct: 650 SNPLRRPFRFEAAWLTHSGFKDLLQASWNTEGETPVALAALKSKLKKWNREVFGDVNRRK 709 Query: 711 CRIMARIKGIQEALEKKPNRFXXXXXXXXXXXXXXXXDQEEALWKQKSCCDWIALGDRNT 890 +M IK +QE LE +QEE LW QKS W+ LGDRNT Sbjct: 710 ESLMNEIKVVQELLEINQTDNLLSKEEELIKEFDVVLEQEEVLWFQKSREKWVELGDRNT 769 Query: 891 KFFHSKAKAKRRTNFI*SLLIDD-VWVDDQEKLKAETINYFKKLYT 1025 K+FH+ +RR N I L DD WV Q++L+ ++Y+ +LY+ Sbjct: 770 KYFHTMTVVRRRRNRIEMLKADDGSWVSQQQELEKMAVDYYSRLYS 815 >dbj|BAE79385.1| unnamed protein product [Ipomoea batatas] Length = 1366 Score = 170 bits (430), Expect = 6e-40 Identities = 107/348 (30%), Positives = 163/348 (46%), Gaps = 11/348 (3%) Frame = +3 Query: 12 VEARGYAGGIWVFWKDNIDVEILLNHNEFVHMKVSLNLSKDSFLFTAVYGIPQSKNRSEL 191 V G+AGG+ + WK +++ ++ ++++ +H S L + T Y P + + Sbjct: 60 VNPLGFAGGLLLLWKPALNLSVISHNSQAIHTLASHRLG--NCFITFAYIRPNTFAKCRF 117 Query: 192 WKCLRLIAEVEILPWILAGDFNAILESDEKSGGSDKNRRGCKHFRQCMKEINAQDLGFKG 371 W+ + +A PW++ GD N I SDE+ G S N ++F + D G G Sbjct: 118 WEYCKQLANSIQSPWMVVGDLNDIATSDEQWGSSSLNYTSLQNFVDAYSDCGLLDPGSSG 177 Query: 372 AKFTWSR--GNL---HERIDRAIGNLEFLELFPNVQVHHLMRVKSDHRPLQIHTEAGICS 536 FTW R GN R+DR + N+ FP +V L R+ SDH P+ EAG Sbjct: 178 PNFTWCRFIGNRVVQRRRLDRVLWNVSAQLTFPEAKVSVLPRLCSDHNPILFLDEAGNPP 237 Query: 537 MKQ-RPFRFLSSWLMHQDFGEMVATNWKDS-----QTLTETVEKFSVAANTWNRDIFGHI 698 ++ RP RF ++WL +D+ + WK++ L + + + + WNR++FG+I Sbjct: 238 VRSLRPVRFEAAWLTSEDYKHI----WKEATEREGSNLEDIIATVTQKSLLWNRNVFGNI 293 Query: 699 LQMKCRIMARIKGIQEALEKKPNRFXXXXXXXXXXXXXXXXDQEEALWKQKSCCDWIALG 878 K +I RI GIQ A + QEE LW QK+ DWI G Sbjct: 294 FNRKRKIENRILGIQRAWNYNTSVRLQDLEKRLLSELNEVLVQEETLWFQKARTDWIRNG 353 Query: 879 DRNTKFFHSKAKAKRRTNFI*SLLIDDVWVDDQEKLKAETINYFKKLY 1022 DRNT F+H A KR N + L + W DD + L IN+F L+ Sbjct: 354 DRNTTFYHRSALIKRNRNRVRFLKLQGAWTDDADLLTEHIINFFSTLF 401