BLASTX nr result
ID: Cephaelis21_contig00036921
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00036921
         (2062 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   305   2e-80
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   303   2e-79
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   301   3e-79
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   292   2e-76
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       286   1e-74

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score = 305 bits (782), Expect = 2e-80
 Identities = 181/584 (30%), Positives = 304/584 (52%), Gaps = 11/584 (1%)
 Frame = +3

Query: 342  AWNMRGINSPLKKNYLVDFVKKNSVDLMGVLECKLSDTRLDHLLKTKFVGWMQCNNFGVH 521
            +WN+RG+N P K + +F+ + + + +LE ++ + + + W NN+
Sbjct: 5    SWNVRGMNDPFKIKEIKNFLYSHKIVVCALLETRVREQNASKVQGKLGKDWKWLNNYSHS 64

Query: 522  EAGRILVLWNPMTVDIDVVDTLSQLIHLRVTCKVSSQTFLVSFV--YGLHTIVNRRSMWD 695
            RI + W P V++ + T QL+ C + Q+ + V YGLHTI +R+S+W
Sbjct: 65   ARERIWIGWRPAWVNVTLTHTQEQLM----VCDIQDQSHKLKMVAVYGLHTIADRKSLWS 120

Query: 696  NLMHYDLGKHEPWIVLGDFNSVLRYNERKNGEPVTQYQIKDFVDCCMLLGLTDCNSSGFF 875
            L+ + + +P I++GDFN+V N+R G VT + +DF + L + S+ +
Sbjct: 121  GLLQC-VQQQDPMIIIGDFNAVCHSNDRLYGTLVTDAETEDFQQFLLQSNLIESRSTWSY 179

Query: 876  YTSTNNT-----VWSKLDRVMVNDIWVQNYMRVSTVFLSPGCSDHCPSVTTLFRAPVGGR 1040
            Y+ +N++ V S++D+ VN +W+ Y VS +L PG SDH P + L G
Sbjct: 180  YSWSNSSIGRDRVLSRIDKAYVNLVWLGMYAEVSVQYLPPGISDHSPLLFNLMTGRPQGG 239

Query: 1041 RSFMFYDMWTDHDDF*DIVRQSWQGHLYGTEQYMLCRKLKRLKIPLKTLNNEHFSHISTR 1220
            + F F ++ + +F + V ++W + + LK +K LK + + + +
Sbjct: 240  KPFKFMNVMAEQGEFLETVEKAWNSVNGRFKLQAIWLNLKAVKRELKQMKTQKIGLAHEK 299

Query: 1221 ACIAREELEALQLRA---HDSLGDTDIHAQLGELRRTAWRLSEAERKFYYQKAKCRYIIS 1391
            R +L+ LQ + H+ + TD + + +LR W S E QK++ ++
Sbjct: 300  VKNLRHQLQDLQSQDDFDHNDIMQTDAKSIMNDLRH--W--SHIEDSILQQKSRITWLQQ 355

Query: 1392 ADRNTKLFHAVVKRNARRNFIASVMRGDASVTCSSEEVAQEFVQYYTDLLGTDSVTT-SV 1568
            D N+KLF VK N I + D V ++EV +E +++Y LLGT + T V
Sbjct: 356  GDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRASTLMGV 415

Query: 1569 DPEVFLMCPKVSADAWPMLTQPVTDDEIRQHIFDIGTDRAPGPNGYTSGFFRHSWEIVGG 1748
            D +SA A L + V EI + + IG D+APG +G+ + FF+ SW +
Sbjct: 416  DLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSIKQ 475

Query: 1749 DVCAAIKEFFSSGRLLKQVNHTVIALLPKSSHASSVGDYRPILCCNIIYKAISKILASRM 1928
            ++ A I+EFF++ R+ + +N V+ LLPK HA+ V ++RPI CC +IYK ISK+L +RM
Sbjct: 476  EIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLTNRM 535

Query: 1929 AKVLPQIIHESQSAFVEGRSMVENIHMTQELISRYGRKRVSPRC 2060
            ++ ++++E+QS F+ GR + +NI + ELI Y RK +SPRC
Sbjct: 536  KGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHMSPRC 579

>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein
          [Arabidopsis thaliana]
          Length = 1223

 Score = 303 bits (775), Expect = 2e-79
 Identities = 185/593 (31%), Positives = 307/593 (51%), Gaps = 21/593 (3%)
 Frame = +3

Query: 345  WNMRGINSPLKKNYLVDFVKKNSVDLMGVLECKLSDTRLDHLLKTKFVGWMQCNNFGVHE 524
            WN+RG+N K + + ++++N+ ++E ++ ++++ L+ F W N+ +
Sbjct: 6    WNVRGLNKSSKHSVIKKWIEENNFQFGCLVETRVKESKVSQLVGKLFKDWSILTNYEHNR 65

Query: 525  AGRILVLWNPMTVDIDVVDTLSQLIHLRVTCKVSSQTFLVSFVYGLHTIVNRRSMWDNLM 704
            GRI VLW V + + QL+ V + F SFVY + + R+ +W L
Sbjct: 66   RGRIWVLWRK-NVRLSPIYKSCQLLTCSVKLEDRQDEFFCSFVYASNYVEERKVLWSELK 124

Query: 705  -HYD--LGKHEPWIVLGDFNSVLRYNERKNG--EPVTQYQIKDFVDCCMLLGLTDCNSSG 869
            HYD + +H+PW +LGDFN L E P+ ++DF LTD + G
Sbjct: 125  DHYDSPIIRHKPWTLLGDFNETLDIAEHSQSFVHPMVTPGMRDFQQVINYCSLTDMAAQG 184

Query: 870  FFYTSTNNT----VWSKLDRVMVNDIWVQNYMRVSTVFLSPGCSDHCP---SVTTLFRAP 1028
            +T N + KLDRV++ND W Q + + +VF + GCSDH S+ +
Sbjct: 185  PLFTWCNKREHGLIMKKLDRVLINDCWNQTFSQSYSVFEAGGCSDHLRCRISLNSEAGNK 244

Query: 1029 VGGRRSFMFYDMWTDHDDF*DIVRQSWQGH----LYGTEQYMLCRKLKRLKIPLKTLNNE 1196
            V G + F F + TD +DF +V W+ L + + + LK LK ++++ +
Sbjct: 245  VQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILSTSTLFRFSKNLKGLKPKIRSMARD 304

Query: 1197 HFSHISTRACIAREELEALQLRAHDSLGDTDIHAQLGE-LRRTAW-RLSEAERKFYYQKA 1370
            ++S +A E + L + H +L + A E + W R++ E K+ QK+
Sbjct: 305  RLGNLSKKA---NEAYKILCAKQHVNLTNPSSMAMEEENAAYSRWDRVAILEEKYLKQKS 361

Query: 1371 KCRYIISADRNTKLFHAVVKRNARRNFIASVMRGDASVTCSSEEV---AQEFVQYYTDLL 1541
            K + D+NTK FH N I ++ D V +E+ A+ F + + L+
Sbjct: 362  KLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAERFFREFLQLI 421

Query: 1542 GTDSVTTSVDPEVFLMCPKVSADAWPMLTQPVTDDEIRQHIFDIGTDRAPGPNGYTSGFF 1721
            D ++ L+ + S L +PVT +EIR+ +F + +D++PGP+GYTS FF
Sbjct: 422  PNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFRMPSDKSPGPDGYTSEFF 481

Query: 1722 RHSWEIVGGDVCAAIKEFFSSGRLLKQVNHTVIALLPKSSHASSVGDYRPILCCNIIYKA 1901
            + +WEI+G + A++ FF+ G L K +N T++AL+PK + A + DYRPI CCN++YK
Sbjct: 482  KATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREMKDYRPISCCNVLYKV 541

Query: 1902 ISKILASRMAKVLPQIIHESQSAFVEGRSMVENIHMTQELISRYGRKRVSPRC 2060
            ISKI+A+R+ VLP+ I +QSAFV+ R ++EN+ + EL+ Y + +S RC
Sbjct: 542  ISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKDYHKDTISTRC 594

>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score = 301 bits (772), Expect = 3e-79
 Identities = 174/583 (29%), Positives = 293/583 (50%), Gaps = 6/583 (1%)
 Frame = +3

Query: 330  MKISAWNMRGINSPLKKNYLVDFVKKNSVDLMGVLECKLSDTRLDHLLKTKFVGWMQCNN 509
            MKI+ WN+RG+N P+K + F+ + L + E ++ + K W NN
Sbjct: 1    MKITTWNVRGLNDPIKVKEVKHFLHSQKISLCSLFETRVRQQNSGKIQKKFGNRWSWINN 60

Query: 510  FGVHEAGRILVLWNPMTVDIDVVDTLSQLIHLRVTCKVSSQTFLVSFVYGLHTIVNRRSM 689
            + GRI V W V+I+V+ Q+I + V F ++ VYGLHTI +R+ +
Sbjct: 61   YACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHTIADRKVL 120

Query: 690  WDNLMHYDLGKHEPWIVLGDFNSVLRYNERKNGEPVTQYQIKDFVDCCMLLGLTDCNSSG 869
            W+ L ++ HEP I++GD+N+V +R NG V++ + D + L + ++G
Sbjct: 121  WEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQLLEAPTTG 180

Query: 870  FFYTSTNNTVW-----SKLDRVMVNDIWVQNYMRVSTVFLSPGCSDHCPSVTTLFRAPVG 1034
            FY+ N ++ S++D+ VN W+ Y V + G SDH P + L
Sbjct: 181  LFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAGISDHSPLIFNLATQHDE 240

Query: 1035 GRRSFMFYDMWTDHDDF*DIVRQSWQGHLYGTEQYMLCRKLKRLKIPLKTLNNEHFSHIS 1214
            G R F F + D + F ++V+++W + + + +L+ +K LK+ +++ FS
Sbjct: 241  GGRPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWVRLQAVKRALKSFHSKKFSKAH 300

Query: 1215 TRACIAREELEALQLRAHDSLGDTDIHAQLGELRRTAWRLSEAERKFYYQKAKCRYIISA 1394
            + R +L A+Q S +++ + +L + S + QK++ +++
Sbjct: 301  CQVEELRRKLAAVQALPEVSQV-SELQEEEKDLIAQLRKWSTIDESILKQKSRIQWLSLG 359

Query: 1395 DRNTKLFHAVVKRNARRNFIASVMRGDASVTCSSEEVAQEFVQYYTDLLGTDSVTT-SVD 1571
            D N+K F +K RN I + + E+ E +Y LLGT S ++D
Sbjct: 360  DSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSSSQLEAID 419

Query: 1572 PEVFLMCPKVSADAWPMLTQPVTDDEIRQHIFDIGTDRAPGPNGYTSGFFRHSWEIVGGD 1751
            V + K+SA + L QP+T EI Q + DI +APG +G+ S FF+ SW ++ +
Sbjct: 420  LHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKSWLVIKQE 479

Query: 1752 VCAAIKEFFSSGRLLKQVNHTVIALLPKSSHASSVGDYRPILCCNIIYKAISKILASRMA 1931
            + I +FF +G + K +N T + L+PK A DYRPI CC+ +YK ISKIL R+
Sbjct: 480  IYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISKILTKRLQ 539

Query: 1932 KVLPQIIHESQSAFVEGRSMVENIHMTQELISRYGRKRVSPRC 2060
            V+ +++ +Q+ F+ R + +NI + ELI Y R+ VSPRC
Sbjct: 540  AVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHVSPRC 582

>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase
          [Brassica napus]
          Length = 1214

 Score = 292 bits (748), Expect = 2e-76
 Identities = 179/584 (30%), Positives = 293/584 (50%), Gaps = 12/584 (2%)
 Frame = +3

Query: 342  AWNMRGINSPLKKNYLVDFVKKNSVDLMGVLECKLSDTRLDHLLKTKFVGWMQCNNFGVH 521
            +WN+RG N+ +++ + K + +LE ++ + R L + F GW N+
Sbjct: 6    SWNVRGFNNSVRRRNFRKWFKLSKALFGSILETRVKEHRARRSLLSSFPGWKSVCNYEFA 65

Query: 522  EAGRILVLWNPMTVDIDVVDTLSQLIHLRVTCKVSSQTFLVSFVYGLHTIVNRRSMWDNL 701
            GRI V+W+P V++ V+ Q I V S F+V+FVY ++ RR +W L
Sbjct: 66   ALGRIWVVWDP-AVEVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRRLWSEL 124

Query: 702  MHYDLGK---HEPWIVLGDFNSVLRYNERKNGEPVTQYQIKDFVDCCMLLGLTDCNSSGF 872
            + +PWI+LGDFN L + G +++F +C + ++D G
Sbjct: 125  ELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDLPFRGN 184

Query: 873  FYT----STNNTVWSKLDRVMVNDIWVQNYMRVSTVFLSPGCSDHCPSVTTLFRAPVGGR 1040
            YT NN + K+DR++VND W+ F + SDHCPS + G
Sbjct: 185  HYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAMEFSDHCPSCVNISNQSGGRN 244

Query: 1041 RSFMFYDMWTDHDDF*DIVRQSWQGHLY-GTEQYMLCRKLKRLKIPLKTLNNEHFSHIST 1217
            + F + H +F + +R +W Y G+ + L +K K LK ++T N EH+S +
Sbjct: 245  KPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTFNREHYSGLEK 304

Query: 1218 RACIAREELEALQLRAHDSLGDTDIHAQLGELRRTAW-RLSEAERKFYYQKAKCRYIISA 1394
            R A + L+ Q + A L + +W L+ AE +F QK++ ++
Sbjct: 305  RVVQAAQNLKTCQNNL--LAAPSSYLAGLEKEAHRSWAELALAEERFLCQKSRVLWLKCG 362

Query: 1395 DRNTKLFHAVVKRNARRNFIASVMRGDASVTCSSEEVAQEFVQYYTDLLGTDSVTTS--- 1565
            D NT FH ++ N I ++ +++E+ V ++ +L G+ S S
Sbjct: 363  DSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFGSSSHLISAEG 422

Query: 1566 VDPEVFLMCPKVSADAWPMLTQPVTDDEIRQHIFDIGTDRAPGPNGYTSGFFRHSWEIVG 1745
            + L K + +L V++ +I+ F + ++++PGP+GYTS FF+ +W IVG
Sbjct: 423  ISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSEFFKKTWSIVG 482

Query: 1746 GDVCAAIKEFFSSGRLLKQVNHTVIALLPKSSHASSVGDYRPILCCNIIYKAISKILASR 1925
            + AA++EFF SGRLL Q N T + ++PK +A + ++RPI CCN IYK ISK+LA R
Sbjct: 483  PSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPISCCNAIYKVISKLLARR 542

Query: 1926 MAKVLPQIIHESQSAFVEGRSMVENIHMTQELISRYGRKRVSPR 2057
            + +LP I SQSAFV+GR + EN+ + EL+ +G+ +S R
Sbjct: 543  LENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQANISSR 586

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 286 bits (732), Expect = 1e-74
 Identities = 185/591 (31%), Positives = 297/591 (50%), Gaps = 20/591 (3%)
 Frame = +3

Query: 345  WNMRGINSPLKKNYLVDFVKKNSVDLMGVLECKLSDTRLDHLLKTKFVGWMQCNNFGVHE 524
            WN+RG N+ ++ +VK N GV+E + + + GW N+ +
Sbjct: 8    WNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKDRKFINALLPGWSFVENYAFSD 67

Query: 525  AGRILVLWNPMTVDIDVVDTLSQLIHLRVTCKVSSQTFLVSFVYGLHTIVNRRSMWDNLM 704
            G+I V+W+P +V + VV Q+I V S +VS VY + + +R+ +W ++
Sbjct: 68   LGKIWVMWDP-SVQVVVVAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKELWIEIV 126

Query: 705  HYDLGK---HEPWIVLGDFNSVLRYNERKNGEPVT---QYQIKDFVDCCMLLGLTDCNSS 866
            + + PW+VLGDFN VL E N PV+ ++DF DC + L+D
Sbjct: 127  NMVVSGIIGDRPWLVLGDFNQVLNPQEHSN--PVSLNVDINMRDFRDCLLAAELSDLRYK 184

Query: 867  GFFYTSTNNT----VWSKLDRVMVNDIWVQNYMRVSTVFLSPGCSDHCPSVTTLFRAPVG 1034
            G +T N + V K+DR++VND W + +F S SDH L +
Sbjct: 185  GNTFTWWNKSHTTPVAKKIDRILVNDSWNALFPSSLGIFGSLDFSDHVSCGVVLEETSIK 244

Query: 1035 GRRSFMFYDMWTDHDDF*DIVRQSWQG-HLYGTEQYMLCRKLKRLKIPLKTLNNEHFSHI 1211
            +R F F++ + DF ++VR +W ++ G+ + + +KLK LK P+K + ++S +
Sbjct: 245  AKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRLNYSEL 304

Query: 1212 STRACIAREELEALQLRAHDSLGD-TDIHAQLGELRRTAWR-LSEAERKFYYQKAKCRYI 1385
            R A + L Q R +L D T I+A W L+ AE F+ QK++ +
Sbjct: 305  EKRTKEAHDFLIGCQDR---TLADPTPINASFELEAERKWHILTAAEESFFRQKSRISWF 361

Query: 1386 ISADRNTKLFHAVVKRNARRNFIASVMRGDASVTCSSEEVAQEFVQYYTDLLGTDSVTTS 1565
            D NTK FH + N I+++ G+ + S E + Y+ LLG +
Sbjct: 362  AEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLLGDE----- 416

Query: 1566 VDPEVF-------LMCPKVSADAWPMLTQPVTDDEIRQHIFDIGTDRAPGPNGYTSGFFR 1724
            VDP + L+ + S L ++++IR +F + +++ GP+G+T+ FF
Sbjct: 417  VDPYLMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAEFFI 476

Query: 1725 HSWEIVGGDVCAAIKEFFSSGRLLKQVNHTVIALLPKSSHASSVGDYRPILCCNIIYKAI 1904
            SW IVG +V AIKEFFSSG LLKQ N T I L+PK + + D+RPI C N +YK I
Sbjct: 477  DSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPISCLNTLYKVI 536

Query: 1905 SKILASRMAKVLPQIIHESQSAFVEGRSMVENIHMTQELISRYGRKRVSPR 2057
            +++L R+ ++L +I +QSAF+ GRS+ EN+ + +L+ Y +SPR
Sbjct: 537  ARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNISPR 587