BLASTX nr result
ID: Cephaelis21_contig00022408
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00022408
         (1241 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAD32950.1| putative non-LTR retroelement reverse transcripta...   102   2e-19
ref|XP_002446679.1| hypothetical protein SORBIDRAFT_06g020406 [S...    82   3e-13
ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thalia...    82   3e-13
gb|EEE59920.1| hypothetical protein OsJ_12548 [Oryza sativa Japo...    82   4e-13
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...    81   5e-13

>gb|AAD32950.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
          Length = 773

 Score = 102 bits (254), Expect = 2e-19
 Identities = 71/228 (31%), Positives = 109/228 (47%), Gaps = 4/228 (1%)
 Frame = +2

Query: 17  WNSDLIYKTFCKPDAEAILILPTSALGSQDKLHKIHSKIGAYTVKSTSTWLAKLRHQHPH 196
           WN DL+ K + D I + S G+ D + I++ G Y+VKS L KL Q
Sbjct: 425 WNEDLLCKLIHQNDIPHIRAIRPSITGANDAITWIYTHDGNYSVKSGYHLLRKLSQQQHA 484

Query: 197 QLESSSRLELTKKIWKRTWEMQIKGKIKHFRLRTYHLLLPTGHQLQLSEIHLDSTCKICR 376
           L S + + + ++ W+ KIKHF R+ H LPT L+ + D TC+ C
Sbjct: 485 SLPSPNEVS-AQTVFTNIWKQNAPPKIKHFWWRSAHNALPTAGNLKRRRLITDDTCQRCG 543

Query: 377 EAPEAIEHLLFHCTKV*RIWELSLVS-WPGLQKFTDHFQGWWEQICSIRMLSINQDRIEF 553
           EA E + HLLF C IWE + + PG ++ F E I + S +D +
Sbjct: 544 EASEDVNHLLFQCRVSKEIWEQAHIKLCPGDSLMSNSFNQNLESIQKLNQ-SARKD-VSL 601

Query: 554 TAYLL*SIWKTRNDFTFNAVMVSVAKIVERA---R*EWQEFLSIHEQK 688
           ++ IWK RND FN S+ +++A + +W+E L+ +EQ+
Sbjct: 602 FPFIGWRIWKMRNDLIFNNKRWSIPDSIQKALIDQQQWKESLNCNEQQ 649

>ref|XP_002446679.1| hypothetical protein SORBIDRAFT_06g020406 [Sorghum bicolor]
 gi|241937862|gb|EES11007.1| hypothetical protein SORBIDRAFT_06g020406 [Sorghum bicolor]
          Length = 395

 Score = 82.0 bits (201), Expect = 3e-13
 Identities = 94/402 (23%), Positives = 159/402 (39%), Gaps = 18/402 (4%)
 Frame = +2

Query: 23   SDLIYKTFCKPDAEAILILPTSALGSQDKLHKIHSKIGAYTVKSTSTWLAKLRHQHPHQL 202
            S+L+ + F D+EAIL +P S +D H K G +TV+S L +L+
Sbjct: 1    SELVKRVFYPIDSEAILQMPLSMRKQKDCWAWHHEKNGLFTVRSAYRMLIELKKSREDYF 60

Query: 203  E---SSSRLELTKKIWKRTWEMQIKGKIKHFRLRTYHLLLPTGHQLQLSEIHLDSTCKIC 373
            E + S ++K WK+ W M++ KIK F R +PT L+ + S CKIC
Sbjct: 61   EGRANCSDFATSQKEWKKLWSMKLPSKIKVFCWRLALNSIPTASVLKSRNLASTSHCKIC 120

Query: 374  REAPEAIEHLLFHCTKV*RIWEL------SLVSWPGLQKFTDHFQGWWEQICSIRMLSIN 535
            + EH L CT +W L +L+S + W + +
Sbjct: 121  GAVDDTWEHSLLFCTMSKCVWALLDEDITNLISHLRISN-----PKHWITFMCCNIPQAD 175

Query: 536  QDRIEFTAYLL*SIWKTRNDFTFNAVMVSVAKIVERAR*EWQEFLSIH--EQKTSTRLVG 709
            R+ T + +IW+ R V S I+ + +E I E K +
Sbjct: 176  GIRVLVTCW---AIWQARRKAIHEGVFQSPFSIMVTINRQIEELQMIRGMELKGGNQNQS 232

Query: 710  RNQLPL*QAPHHNDWMVGYETMRISAIFKKNIACGSYGLLTEDRHGITQQASAIFYEREV 889
            + + L +AP G + + A + + G+ G++ + G SA+
Sbjct: 233  KQKTRLWKAPDQ-----GKCKINVDAAVNRVGSKGAVGVVCRNDRGEFIAPSAMIIPNIT 287

Query: 890  SALTLSLRAIRDTQLRTYKLRWNKAILLSDEMDIVRHLQDTSPIFTDIFLLTNL------ 1051
            TL A + K I+ SD ++IVR++ + P+ T + +L ++
Sbjct: 288  EPETLEGMACLEALALAEDCGIRKIIVASDCLNIVRNISE-MPLCTYVMILKDIQERAKS 346

Query: 1052 FQDCKFVLISKDANKGCCRLAQFALQ-SSSSETWNTSFPTWL 1174
            F +F ++ N+ RL ++A W S P +L
Sbjct: 347  FDYVRFAHEGRECNREADRLVKYACSLEDGRHVWLGSPPVFL 388

>ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thaliana]
 gi|4972055|emb|CAB43923.1| putative protein [Arabidopsis thaliana]
 gi|7269807|emb|CAB79667.1| putative protein [Arabidopsis thaliana]
 gi|67633766|gb|AAY78807.1| putative reverse transcriptase/RNA-dependent DNA polymerase [Arabidopsis thaliana]
 gi|332660185|gb|AEE85585.1| Ribonuclease H-like protein [Arabidopsis thaliana]
          Length = 575

 Score = 82.0 bits (201), Expect = 3e-13
 Identities = 89/394 (22%), Positives = 154/394 (39%), Gaps = 19/394 (4%)
 Frame = +2

Query: 2    DNGTIWNSDLIYKTFCKPDAEAILILPTSALGSQDKLHKIHSKIGAYTVKSTSTWLAKL- 178
            ++G W D+I F + + + I L D ++ G YTVKS L ++
Sbjct: 177  ESGREWRKDVIEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSGYWVLTQII 236

Query: 179  -RHQHPHQLESSSRLELTKKIWKRTWEMQIKGKIKHFRLRTYHLLLPTGHQLQLSEIHLD 355
            + P ++ S + +KIWK Q KI+HF + LP L + +
Sbjct: 237  NKRSSPQEVSEPSLNPIYQKIWKS----QTSPKIQHFLWKCLSNSLPVAGALAYRHLSKE 292

Query: 356  STCKICREAPEAIEHLLFHCTKV*RIWELSLVSWPGLQKFTDHFQGWWEQICSIRMLSIN 535
            S C C E + HLLF CT W +S + P G W + + +
Sbjct: 293  SACIRCPSCKETVNHLLFKCTFARLTWAISSIPIP--------LGGEWADSIYVNLYWVF 344

Query: 536  ---------QDRIEFTAYLL*SIWKTRNDFTFNAVMVSVAKIVERAR*EWQEF-LSIHEQ 685
            + + +LL +WK RN+ F + +++ RA + +E+ + +
Sbjct: 345  NLGNGNPQWEKASQLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAE 404

Query: 686  KTSTR-LVGRNQLPL*QAPHHNDWMVGYETMRISAIFKKNIACGSYGLLTEDRHGITQQA 862
            T+ V R+ + P H W+ + + + N CG G + + G +
Sbjct: 405  SCGTKPQVNRSSCGRWRPPPH-QWV---KCNTDATWNRDNERCG-IGWVLRNEKGEVKWM 459

Query: 863  SAIFYEREVSALTLSLRAIRDTQLRTYKLRWNKAILLSDEMDIVRHLQD------TSPIF 1024
            A + S L L A+R L + ++N I SD ++ L + P
Sbjct: 460  GARALPKLKSVLEAELEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNNDEIWPSLKPTI 519

Query: 1025 TDIFLLTNLFQDCKFVLISKDANKGCCRLAQFAL 1126
            D+ L + F + KFV I ++ N R+A+ +L
Sbjct: 520  QDLQRLLSQFTEVKFVFIPREGNTLAERVARESL 553

>gb|EEE59920.1| hypothetical protein OsJ_12548 [Oryza sativa Japonica Group]
          Length = 1076

 Score = 81.6 bits (200), Expect = 4e-13
 Identities = 98/409 (23%), Positives = 152/409 (37%), Gaps = 26/409 (6%)
 Frame = +2

Query: 17   WNSDLIYKTFCKPDAEAILI--LPTSALGSQDKLHKIHSKIGAYTVKSTSTWLAKLRHQH 190
            WN L+ + DA +L LP + H H K G +TVKS L +
Sbjct: 664  WNETLVRHVLKEEDANEVLKIRLPNHQMDDFPAWH--HEKSGLFTVKSAYKLAWNLSGKG 721

Query: 191  PHQLESSSRLELTKKIWKRTWEMQIKGKIKHFRLRTYHLLLPTGHQLQLSEIHLDSTCKI 370
            Q SS+ +KIW R W +++ K+K F + LPT + +I ++ TC +
Sbjct: 722  VVQSSSSTATSGERKIWSRVWNAKVQAKVKIFIWKLAQDKLPTWENKRRRKIEMNGTCPV 781

Query: 371  CREAPEAIEHLLFHCTKV*RIWELSLVSW--PGLQKFTDHFQGWWEQICSIRMLSINQDR 544
            C E H CTK + E W PG KF W I + +N+++
Sbjct: 782  CGTKGENSYHATVECTKARALREALRAVWHLPGEDKFLWTGPDW----LLILLDGVNEEQ 837

Query: 545  IEFTAYLL*SIWKTRNDFTFNAVMVSVAKIVERAR*EWQEFLSIHEQ-----KTSTRLVG 709
            Y+L W RND S+A V ++E L + Q K +
Sbjct: 838  RTHIMYMLWRAWYLRNDLIHGDGRCSIAGSVSFLT-SYEEVLLPNRQMPDDIKGKKPMYS 896

Query: 710  RNQLPL*QAPHHND-WMV---GYETMRISAIFKKNIACGSYGLLTEDRHGITQQASAIFY 877
            Q A + W+ G + + A F+ S G++ D G+ A+
Sbjct: 897  EGQKEKHMAEKQSSGWIAPPDGAAKINVDAGFRMETGEASAGIVIRDCRGLILLAACKTL 956

Query: 878  ----EREVSALTLSLRAIRDTQLRTYKLRW--NKAILLSDEMDIVRHLQ---DTSPIFTD 1030
            E + SL IR L+W IL +D ++V L+ + ++
Sbjct: 957  HPCSSAEQAEALASLEGIR------CALQWIHMPVILETDNAEVVARLKTKHSSRSVWEG 1010

Query: 1031 IFLLTNL----FQDCKFVLISKDANKGCCRLAQFALQSSSSETWNTSFP 1165
            + + Q + I +D+NK LAQ AL S + W P
Sbjct: 1011 VIMEAKAAMQGLQAVEVAHIKRDSNKVAHTLAQMALSSGNCLEWRLCAP 1059

>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
          Length = 1374

 Score = 81.3 bits (199), Expect = 5e-13
 Identities = 85/400 (21%), Positives = 156/400 (39%), Gaps = 10/400 (2%)
 Frame = +2

Query: 5    NGTIWNSDLIYKTFCKPDAEAILILPTSALGSQDKLHKIHSKIGAYTVKSTSTWLAKLRH 184
            +G WN +L+ F E IL L ++D+ +S+ G Y+VKS + ++ +
Sbjct: 975  DGRDWNWNLVSLLFPDNTQENILALRPGGKETRDRFTWEYSRSGHYSVKSGYWVMTEIIN 1034

Query: 185  Q--HPHQLESSSRLELTKKIWKRTWEMQIKGKIKHFRLRTYHLLLPTGHQLQLSEIHLDS 358
            Q +P ++ S + ++IWK + + KI HF R + L L + +
Sbjct: 1035 QRNNPQEVLQPSLDPIFQQIWK----LDVPPKIHHFLWRCVNNCLSVASNLAYRHLAREK 1090

Query: 359  TCKICREAPEAIEHLLFHCTKV*RIWELS-LVSWPGLQKFTDHFQGWWEQICSIRMLSIN 535
            +C C E + HLLF C W +S L + PG + F+ + +
Sbjct: 1091 SCVRCPSHGETVNHLLFKCPFARLTWAISPLPAPPGGEWAESLFRNMHHVLSVHKSQPEE 1150

Query: 536  QDRIEFTAYLL*SIWKTRNDFTFNAVMVSVAKIVERAR*EWQEFLSIHEQKTSTRLVGRN 715
            D ++L +WK RND F + +++ +A + + + E + R+
Sbjct: 1151 SDHHALIPWILWRLWKNRNDLVFKGREFTAPQVILKATEDMDAWNNRKEPQPQVTSSTRD 1210

Query: 716  QLPL*QAPHHNDWMVGYETMRISAIFKKNIACGSYGLLTEDRHGITQQASAIFYEREVSA 895
            + Q P H G+ + K++ G + + G + S
Sbjct: 1211 RCVKWQPPSH-----GWVKCNTDGAWSKDLGNCGVGWVLRNHTGRLLWLGLRALPSQQSV 1265

Query: 896  LTLSLRAIRDTQLRTYKLRWNKAILLSDEMDIVRHLQD------TSPIFTDIFLLTNLFQ 1057
            L + A+R L + + + I SD +V +Q+ +P DI L F+
Sbjct: 1266 LETEVEALRWAVLSLSRFNYRRVIFESDSQYLVSLIQNEMDIPSLAPRIQDIRNLLRHFE 1325

Query: 1058 DCKFVLISKDANKGCCRLAQFALQSSSSETWNTSF-PTWL 1174
            + KF ++ N R A+ +L + + S P W+
Sbjct: 1326 EVKFQFTRregNNVADRTARESLSLMNYDPKMYSITPDWI 1365
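
The per-hit statistics in a report like the one above (bit score, E-value, identities) can be pulled out with a short script. The following is a minimal sketch, not part of BLAST itself: the helper name `parse_hits` and the regular expression are assumptions made for illustration, and they target only the legacy plain-text layout shown here. The pattern is whitespace-tolerant, so it should work whether or not the report's line breaks are preserved.

```python
import re

# Hypothetical pattern for the statistics lines of legacy BLAST text output,
# e.g. "Score = 102 bits (254), Expect = 2e-19 Identities = 71/228 (31%)".
STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect(?:\(\d+\))?\s*=\s*(?P<evalue>[^\s,]+)"
    r"[\s\S]*?Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_hits(report):
    """Return one dict per alignment section (sections start with '>')."""
    hits = []
    # Everything before the first '>' is the header and hit table; skip it.
    for section in re.split(r"(?:^|\s)>(?=\S)", report)[1:]:
        match = STATS.search(section)
        if match is None:
            continue
        evalue = match.group("evalue")
        if evalue.startswith("e"):
            # Older BLAST versions may print "e-100" instead of "1e-100".
            evalue = "1" + evalue
        ident = int(match.group("ident"))
        length = int(match.group("length"))
        hits.append({
            "accession": section.split(None, 1)[0],  # first token after '>'
            "bits": float(match.group("bits")),
            "evalue": float(evalue),
            "identities": ident,
            "align_length": length,
            "pct_identity": 100.0 * ident / length,
        })
    return hits


if __name__ == "__main__":
    sample = (
        ">gb|AAD32950.1| putative non-LTR retroelement reverse transcriptase "
        "[Arabidopsis thaliana] Length = 773 Score = 102 bits (254), "
        "Expect = 2e-19 Identities = 71/228 (31%), Positives = 109/228 (47%)"
    )
    for hit in parse_hits(sample):
        print(hit["accession"], hit["bits"], hit["evalue"])
```

Only the first Score/Expect/Identities group of each section is read, so a hit with several HSPs would be summarized by its first one.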