BLASTX nr result
ID: Cephaelis21_contig00031647
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00031647
         (1804 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   269   e-103
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           256   3e-99
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   256   3e-99
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   262   9e-98
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   246   4e-96

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score = 269 bits (688), Expect(3) = e-103
 Identities = 141/332 (42%), Positives = 198/332 (59%), Gaps = 5/332 (1%)
 Frame = +2

Query: 752  AQVAFIEGRHMFENVYLTQELIKQYKHKRISPGCFIKVDLKKEYDSVS*DFLIRVLNGLG 931
            AQ FI GRH+ +N+ L ELI+ Y K +SP C +KVD++K YDSV FL +L G
Sbjct: 546  AQSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIRKAYDSVEWSFLETLLYEFG 605

Query: 932  FPLTYVGWMVECVTTTSFSIWINGEIHGIF*GKKGLRQGDLIFPYLFVLCMEYLSRLLNR 1111
            FP +VGW++ECV+T S+S+ +NG F +KGLRQGD + P+LF LCMEYLSR L
Sbjct: 606  FPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFALCMEYLSRCLEE 665

Query: 1112 NTKALQFFHHFRCQALKMSHLTYADDLMFFSRGDVNSVKILWDSLMEFGEVSGLQANSFK 1291
            + F H +C+ L ++HL +ADDL+ F R D +S+ + + +F SGL A+ K
Sbjct: 666  LKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKFSHASGLAASHEK 725

Query: 1292 SQVFFGGVQLETCEEILDLTQIPLGELPVRYLGIPLAAEGLRILHYVPLLEKISSQIGVW 1471
            S ++F GV ET E+ D + LGELP RYLG+PL ++ L PL+E I+++ W
Sbjct: 726  SNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKPLVEMITNRAQTW 785

Query: 1472 TSSSLLYAGRLELI*SILQGIHCFWLSTLPVPCGVIDRVIALCRRFLWDG-----GKPLV 1636
            + L YAGRL+LI SIL + +W P+ VI V +CR+FLW G K V
Sbjct: 786  WMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFLWTGKTEETKKAPV 845

Query: 1637 AWKDLTVSKEKGGLGIRDVKA*NNTLLAKVLW 1732
            AW + K +GG + ++K N + K+LW
Sbjct: 846  AWATIQRPKSRGGWNVINMKYWNRAAMLKLLW 877

 Score = 130 bits (327), Expect(3) = e-103
 Identities = 80/243 (32%), Positives = 129/243 (53%), Gaps = 3/243 (1%)
 Frame = +3

Query: 12   RVKRVKEALEEAKPTLQNSPGDSDLQQKVIE-LRRDTRFLCETERSFLYQKAKCCYFLNS 188
            +VK ++ L++ + Q+ +D+ Q + + D R E S L QK++ +
Sbjct: 299  KVKNLRHQLQDLQS--QDDFDHNDIMQTDAKSIMNDLRHWSHIEDSILQQKSRITWLQQG 356

Query: 189  DRNSKLFHFVVKRNAKKNFISAVIKEDGDPTTSMNEVVQEFLDYYHNLLGSIHSVTRLNT 368
            D NSKLF VK N I + EDG +EV +E L++Y LLG+ S T +
Sbjct: 357  DTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRAS-TLMGV 415

Query: 369  DVLT--SGPLIQPDDCDLLCRMVDDTEIKSVVFAMGNYKSPGPDCYSADFFKKAWSVVGN 542
            D+ T G + + L R V TEI + +GN K+PG D ++A FFKK+W +
Sbjct: 416  DLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSIKQ 475

Query: 543  DVCAAVKEFFLTRKLLKKLNHTIIALVLKKSHTNSITDYRAISCRNVVYKIISKILANRL 722
            ++ A ++EFF ++ + +N ++ L+ K H + ++R I+C V+YKIISK+L NR+
Sbjct: 476  EIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLTNRM 535

Query: 723  TGI 731
            GI
Sbjct: 536  KGI 538

 Score = 25.0 bits (53), Expect(3) = e-103
 Identities = 8/27 (29%), Positives = 14/27 (51%)
 Frame = +3

Query: 1710 LFWLKFYGIILFWVKWVHHTYLQKDSI 1790
            L W + WV+W+H Y+++ I
Sbjct: 875  LLWAIEFKRDKLWVRWIHSYYIKRQDI 901

>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score = 256 bits (655), Expect(2) = 3e-99
 Identities = 139/333 (41%), Positives = 196/333 (58%), Gaps = 6/333 (1%)
 Frame = +2

Query: 752  AQVAFIEGRHMFENVYLTQELIKQYKHKRISPGCFIKVDLKKEYDSVS*DFLIRVLNGLG 931
            +Q AF+ GR + ENV L E++ Y ISP +KVDLKK +DSV +F+ L L
Sbjct: 415  SQSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALA 474

Query: 932  FPLTYVGWMVECVTTTSFSIWINGEIHGIF*GKKGLRQGDLIFPYLFVLCMEYLSRLLNR 1111
            P Y+ W+ +C+TT SF+I +NG G F KGLRQGD + PYLFVL ME S+LL
Sbjct: 475  AIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYS 534

Query: 1112 NTKALQFFHHFRCQALKMSHLTYADDLMFFSRGDVNSVKILWDSLMEFGEVSGLQANSFK 1291
            + +H + L +SHL +ADD+M F G +S+ + ++L +F + SGL+ N K
Sbjct: 535  RYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDK 594

Query: 1292 SQVFFGGVQLETCEEILDLTQ-IPLGELPVRYLGIPLAAEGLRILHYVPLLEKISSQIGV 1468
            SQ+F G+ L E I P G P+RYLG+PL LRI Y PLLEK+S+++
Sbjct: 595  SQLFQAGLDLS--ERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRS 652

Query: 1469 WTSSSLLYAGRLELI*SILQGIHCFWLSTLPVPCGVIDRVIALCRRFLWDGG-----KPL 1633
            W S +L +AGR +LI S++ G+ FW+ST +P G I ++ +LC +FLW G
Sbjct: 653  WVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSK 712

Query: 1634 VAWKDLTVSKEKGGLGIRDVKA*NNTLLAKVLW 1732
            V+W D + K +GGLG R N TLL +++W
Sbjct: 713  VSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIW 745

 Score = 134 bits (336), Expect(2) = 3e-99
 Identities = 77/232 (33%), Positives = 131/232 (56%), Gaps = 3/232 (1%)
 Frame = +3

Query: 45   AKPTLQNSPGDSDLQQKVIELRRDTRFLCETERSFLYQKAKCCYFLNSDRNSKLFHFVVK 224
            A P++ N+ + + Q+K + L E SF +Q+++ +F D N+ FH +V
Sbjct: 184  ANPSVSNAALELEAQRKWV-------LLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVD 236

Query: 225  RNAKKNFISAVIKEDGDPTTSMNEVVQEFLDYYHNLLGSIHS---VTRLNTDVLTSGPLI 395
            N I++++ +G S ++ + YY LLGSI S + + + ++L +
Sbjct: 237  SRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRCS 296

Query: 396  QPDDCDLLCRMVDDTEIKSVVFAMGNYKSPGPDCYSADFFKKAWSVVGNDVCAAVKEFFL 575
            Q D C L + D EIK+ ++ K+ GPD YS +FF+ WS++G +V AA+ EFF
Sbjct: 297  Q-DQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355

Query: 576  TRKLLKKLNHTIIALVLKKSHTNSITDYRAISCRNVVYKIISKILANRLTGI 731
            + +LLK+ N T + L+ K S+ +I+++R ISC N +YK+ISK+L +RL G+
Sbjct: 356  SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGL 407

>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana]
          Length = 1072

 Score = 256 bits (655), Expect(2) = 3e-99
 Identities = 139/333 (41%), Positives = 196/333 (58%), Gaps = 6/333 (1%)
 Frame = +2

Query: 752  AQVAFIEGRHMFENVYLTQELIKQYKHKRISPGCFIKVDLKKEYDSVS*DFLIRVLNGLG 931
            +Q AF+ GR + ENV L E++ Y ISP +KVDLKK +DSV +F+ L L
Sbjct: 415  SQSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALA 474

Query: 932  FPLTYVGWMVECVTTTSFSIWINGEIHGIF*GKKGLRQGDLIFPYLFVLCMEYLSRLLNR 1111
            P Y+ W+ +C+TT SF+I +NG G F KGLRQGD + PYLFVL ME S+LL
Sbjct: 475  AIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYS 534

Query: 1112 NTKALQFFHHFRCQALKMSHLTYADDLMFFSRGDVNSVKILWDSLMEFGEVSGLQANSFK 1291
            + +H + L +SHL +ADD+M F G +S+ + ++L +F + SGL+ N K
Sbjct: 535  RYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDK 594

Query: 1292 SQVFFGGVQLETCEEILDLTQ-IPLGELPVRYLGIPLAAEGLRILHYVPLLEKISSQIGV 1468
            SQ+F G+ L E I P G P+RYLG+PL LRI Y PLLEK+S+++
Sbjct: 595  SQLFQAGLDLS--ERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRS 652

Query: 1469 WTSSSLLYAGRLELI*SILQGIHCFWLSTLPVPCGVIDRVIALCRRFLWDGG-----KPL 1633
            W S +L +AGR +LI S++ G+ FW+ST +P G I ++ +LC +FLW G
Sbjct: 653  WVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSK 712

Query: 1634 VAWKDLTVSKEKGGLGIRDVKA*NNTLLAKVLW 1732
            V+W D + K +GGLG R N TLL +++W
Sbjct: 713  VSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIW 745

 Score = 134 bits (336), Expect(2) = 3e-99
 Identities = 77/232 (33%), Positives = 131/232 (56%), Gaps = 3/232 (1%)
 Frame = +3

Query: 45   AKPTLQNSPGDSDLQQKVIELRRDTRFLCETERSFLYQKAKCCYFLNSDRNSKLFHFVVK 224
            A P++ N+ + + Q+K + L E SF +Q+++ +F D N+ FH +V
Sbjct: 184  ANPSVSNAALELEAQRKWV-------LLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVD 236

Query: 225  RNAKKNFISAVIKEDGDPTTSMNEVVQEFLDYYHNLLGSIHS---VTRLNTDVLTSGPLI 395
            N I++++ +G S ++ + YY LLGSI S + + + ++L +
Sbjct: 237  SRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRCS 296

Query: 396  QPDDCDLLCRMVDDTEIKSVVFAMGNYKSPGPDCYSADFFKKAWSVVGNDVCAAVKEFFL 575
            Q D C L + D EIK+ ++ K+ GPD YS +FF+ WS++G +V AA+ EFF
Sbjct: 297  Q-DQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355

Query: 576  TRKLLKKLNHTIIALVLKKSHTNSITDYRAISCRNVVYKIISKILANRLTGI 731
            + +LLK+ N T + L+ K S+ +I+++R ISC N +YK+ISK+L +RL G+
Sbjct: 356  SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGL 407

>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1223

 Score = 262 bits (669), Expect(2) = 9e-98
 Identities = 140/350 (40%), Positives = 205/350 (58%), Gaps = 10/350 (2%)
 Frame = +2

Query: 755  QVAFIEGRHMFENVYLTQELIKQYKHKRISPGCFIKVDLKKEYDSVS*DFLIRVLNGLGF 934
            Q AF++ R + EN+ L EL+K Y IS C IK+D+ K +DSV FLI V LGF
Sbjct: 562  QSAFVKDRLLIENLLLATELVKDYHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGF 621

Query: 935  PLTYVGWMVECVTTTSFSIWINGEIHGIF*GKKGLRQGDLIFPYLFVLCMEYLSRLLNRN 1114
            P ++ W+ C+TT SFS+ +NGE+ G F +GLRQG + PYLFV+CM+ LS++L++
Sbjct: 622  PREFIHWINICITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKA 681

Query: 1115 TKALQFFHHFRCQALKMSHLTYADDLMFFSRGDVNSVKILWDSLMEFGEVSGLQANSFKS 1294
            A F +H +C+ + ++HL++ADDLM S G + S++ + EF + SGL+ + KS
Sbjct: 682  AAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKS 741

Query: 1295 QVFFGGVQLETCEEILDLTQIPLGELPVRYLGIPLAAEGLRILHYVPLLEKISSQIGVWT 1474
            V+ G+ E+ D G+LPVRYLG+PL + L +PLLE++ +IG WT
Sbjct: 742  TVYLAGLSATARNEVADRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWT 801

Query: 1475 SSSLLYAGRLELI*SILQGIHCFWLSTLPVPCGVIDRVIALCRRFLWDG-----GKPLVA 1639
            S L YAGRL LI S+L I FWL+ +P I + +C FLW G K ++
Sbjct: 802  SRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKIS 861

Query: 1640 WKDLTVSKEKGGLGIRDVKA*NNTLLAKVLW-----NNTLLGKVGSPHLL 1774
            W + K++GGLG+R +K N+ K++W +N+L K HLL
Sbjct: 862  WHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLL 911

 Score = 123 bits (309), Expect(2) = 9e-98
 Identities = 73/199 (36%), Positives = 105/199 (52%), Gaps = 4/199 (2%)
 Frame = +3

Query: 138  ERSFLYQKAKCCYFLNSDRNSKLFHFVVKRNAKKNFISAVIKEDGDPTTSMNEVVQE--- 308
            E +L QK+K + D+N+K FH N I ++ DG T +E+ E
Sbjct: 353  EEKYLKQKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAER 412

Query: 309  -FLDYYHNLLGSIHSVTRLNTDVLTSGPLIQPDDCDLLCRMVDDTEIKSVVFAMGNYKSP 485
            F ++ + VT L D L+ R V EI+ V+F M + KSP
Sbjct: 413  FFREFLQLIPNDFEGVTITELQQLLPVRCSDADQQSLI-RPVTAEEIRKVLFRMPSDKSP 471

Query: 486  GPDCYSADFFKKAWSVVGNDVCAAVKEFFLTRKLLKKLNHTIIALVLKKSHTNSITDYRA 665
            GPD Y+++FFK W ++G++ AV+ FF L K +N TI+AL+ KK+ + DYR
Sbjct: 472  GPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREMKDYRP 531

Query: 666  ISCRNVVYKIISKILANRL 722
            ISC NV+YK+ISKI+ANRL
Sbjct: 532  ISCCNVLYKVISKIIANRL 550

>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
          Length = 1529

 Score = 246 bits (629), Expect(2) = 4e-96
 Identities = 125/331 (37%), Positives = 189/331 (57%), Gaps = 5/331 (1%)
 Frame = +2

Query: 755  QVAFIEGRHMFENVYLTQELIKQYKHKRISPGCFIKVDLKKEYDSVS*DFLIRVLNGLGF 934
            Q AF++ R + ENV L EL+K Y + ++P C +K+D+ K +DSV FL+ L L F
Sbjct: 859  QSAFVKERLLMENVLLATELVKDYHKESVTPRCAMKIDISKAFDSVQWQFLLNTLEALNF 918

Query: 935  PLTYVGWMVECVTTTSFSIWINGEIHGIF*GKKGLRQGDLIFPYLFVLCMEYLSRLLNRN 1114
            P T+ W+ C++T +FS+ +NGE+ G F +GLRQG + PYLFV+CM LS +++
Sbjct: 919  PETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEA 978

Query: 1115 TKALQFFHHFRCQALKMSHLTYADDLMFFSRGDVNSVKILWDSLMEFGEVSGLQANSFKS 1294
            +H +C+ + ++HL +ADDLM F G S++ + + EF SGLQ + KS
Sbjct: 979  AVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKS 1038

Query: 1295 QVFFGGVQLETCEEILDLTQIPLGELPVRYLGIPLAAEGLRILHYVPLLEKISSQIGVWT 1474
            ++ GV + L G+LPVRYLG+PL + + Y PL+E + ++I WT
Sbjct: 1039 TIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWT 1098

Query: 1475 SSSLLYAGRLELI*SILQGIHCFWLSTLPVPCGVIDRVIALCRRFLWDG-----GKPLVA 1639
            + SL YAGRL L+ S++ I FW+S +P G I + LC FLW G K +A
Sbjct: 1099 ARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIA 1158

Query: 1640 WKDLTVSKEKGGLGIRDVKA*NNTLLAKVLW 1732
            W + K++GGLGI+ + N K++W
Sbjct: 1159 WSSICQPKKEGGLGIKSLAEANKVSCLKLIW 1189

 Score = 133 bits (335), Expect(2) = 4e-96
 Identities = 88/242 (36%), Positives = 128/242 (52%), Gaps = 7/242 (2%)
 Frame = +3

Query: 18   KRVKEA---LEEAKPTLQNSPGDSDLQQKVIELRRDTRFLCETERSFLYQKAKCCYFLNS 188
            KR +EA L E + T +P + ++ ++ D L E E FL QK+K +
Sbjct: 608  KRTREAHILLCEKQATTLANPSQETIAEE-LKAYTDWTHLSELEEGFLKQKSKLHWMNVG 666

Query: 189  DRNSKLFHFVVKRNAKKNFISAVIKEDGDPTTSMNEVVQEFLDYYHNLL----GSIHSVT 356
            D N+ FH + +N I + + + + E+ E +++ L G H ++
Sbjct: 667  DGNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAERFFNEFLNRQSGDFHGIS 726

Query: 357  RLNTDVLTSGPLIQPDDCDLLCRMVDDTEIKSVVFAMGNYKSPGPDCYSADFFKKAWSVV 536
            + L S D ++L R V EI+ V+FAM N KSPGPD Y+++FFK WS+
Sbjct: 727  VEDLRNLMSYRCSVTDQ-NILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWSLT 785

Query: 537  GNDVCAAVKEFFLTRKLLKKLNHTIIALVLKKSHTNSITDYRAISCRNVVYKIISKILAN 716
            G D AA++ FF+ L K LN TI+AL+ KK + DYR ISC NV+YK+ISKILAN
Sbjct: 786  GPDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCNVLYKVISKILAN 845

Query: 717  RL 722
            RL
Sbjct: 846  RL 847