BLASTX nr result

ID: Cephaelis21_contig00031647

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00031647
         (1804 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   269   e-103
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           256   3e-99
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   256   3e-99
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   262   9e-98
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   246   4e-96

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  269 bits (688), Expect(3) = e-103
 Identities = 141/332 (42%), Positives = 198/332 (59%), Gaps = 5/332 (1%)
 Frame = +2

Query: 752  AQVAFIEGRHMFENVYLTQELIKQYKHKRISPGCFIKVDLKKEYDSVS*DFLIRVLNGLG 931
            AQ  FI GRH+ +N+ L  ELI+ Y  K +SP C +KVD++K YDSV   FL  +L   G
Sbjct: 546  AQSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIRKAYDSVEWSFLETLLYEFG 605

Query: 932  FPLTYVGWMVECVTTTSFSIWINGEIHGIF*GKKGLRQGDLIFPYLFVLCMEYLSRLLNR 1111
            FP  +VGW++ECV+T S+S+ +NG     F  +KGLRQGD + P+LF LCMEYLSR L  
Sbjct: 606  FPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFALCMEYLSRCLEE 665

Query: 1112 NTKALQFFHHFRCQALKMSHLTYADDLMFFSRGDVNSVKILWDSLMEFGEVSGLQANSFK 1291
               +  F  H +C+ L ++HL +ADDL+ F R D +S+  +  +  +F   SGL A+  K
Sbjct: 666  LKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKFSHASGLAASHEK 725

Query: 1292 SQVFFGGVQLETCEEILDLTQIPLGELPVRYLGIPLAAEGLRILHYVPLLEKISSQIGVW 1471
            S ++F GV  ET  E+ D   + LGELP RYLG+PL ++ L      PL+E I+++   W
Sbjct: 726  SNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKPLVEMITNRAQTW 785

Query: 1472 TSSSLLYAGRLELI*SILQGIHCFWLSTLPVPCGVIDRVIALCRRFLWDG-----GKPLV 1636
             +  L YAGRL+LI SIL  +  +W    P+   VI  V  +CR+FLW G      K  V
Sbjct: 786  MAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFLWTGKTEETKKAPV 845

Query: 1637 AWKDLTVSKEKGGLGIRDVKA*NNTLLAKVLW 1732
            AW  +   K +GG  + ++K  N   + K+LW
Sbjct: 846  AWATIQRPKSRGGWNVINMKYWNRAAMLKLLW 877



 Score =  130 bits (327), Expect(3) = e-103
 Identities = 80/243 (32%), Positives = 129/243 (53%), Gaps = 3/243 (1%)
 Frame = +3

Query: 12   RVKRVKEALEEAKPTLQNSPGDSDLQQKVIE-LRRDTRFLCETERSFLYQKAKCCYFLNS 188
            +VK ++  L++ +   Q+    +D+ Q   + +  D R     E S L QK++  +    
Sbjct: 299  KVKNLRHQLQDLQS--QDDFDHNDIMQTDAKSIMNDLRHWSHIEDSILQQKSRITWLQQG 356

Query: 189  DRNSKLFHFVVKRNAKKNFISAVIKEDGDPTTSMNEVVQEFLDYYHNLLGSIHSVTRLNT 368
            D NSKLF   VK     N I  +  EDG      +EV +E L++Y  LLG+  S T +  
Sbjct: 357  DTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRAS-TLMGV 415

Query: 369  DVLT--SGPLIQPDDCDLLCRMVDDTEIKSVVFAMGNYKSPGPDCYSADFFKKAWSVVGN 542
            D+ T   G  +     + L R V  TEI   +  +GN K+PG D ++A FFKK+W  +  
Sbjct: 416  DLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSIKQ 475

Query: 543  DVCAAVKEFFLTRKLLKKLNHTIIALVLKKSHTNSITDYRAISCRNVVYKIISKILANRL 722
            ++ A ++EFF   ++ + +N  ++ L+ K  H   + ++R I+C  V+YKIISK+L NR+
Sbjct: 476  EIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLTNRM 535

Query: 723  TGI 731
             GI
Sbjct: 536  KGI 538



 Score = 25.0 bits (53), Expect(3) = e-103
 Identities = 8/27 (29%), Positives = 14/27 (51%)
 Frame = +3

Query: 1710 LFWLKFYGIILFWVKWVHHTYLQKDSI 1790
            L W   +     WV+W+H  Y+++  I
Sbjct: 875  LLWAIEFKRDKLWVRWIHSYYIKRQDI 901


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  256 bits (655), Expect(2) = 3e-99
 Identities = 139/333 (41%), Positives = 196/333 (58%), Gaps = 6/333 (1%)
 Frame = +2

Query: 752  AQVAFIEGRHMFENVYLTQELIKQYKHKRISPGCFIKVDLKKEYDSVS*DFLIRVLNGLG 931
            +Q AF+ GR + ENV L  E++  Y    ISP   +KVDLKK +DSV  +F+   L  L 
Sbjct: 415  SQSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALA 474

Query: 932  FPLTYVGWMVECVTTTSFSIWINGEIHGIF*GKKGLRQGDLIFPYLFVLCMEYLSRLLNR 1111
             P  Y+ W+ +C+TT SF+I +NG   G F   KGLRQGD + PYLFVL ME  S+LL  
Sbjct: 475  IPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYS 534

Query: 1112 NTKALQFFHHFRCQALKMSHLTYADDLMFFSRGDVNSVKILWDSLMEFGEVSGLQANSFK 1291
               +    +H +   L +SHL +ADD+M F  G  +S+  + ++L +F + SGL+ N  K
Sbjct: 535  RYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDK 594

Query: 1292 SQVFFGGVQLETCEEILDLTQ-IPLGELPVRYLGIPLAAEGLRILHYVPLLEKISSQIGV 1468
            SQ+F  G+ L   E I       P G  P+RYLG+PL    LRI  Y PLLEK+S+++  
Sbjct: 595  SQLFQAGLDLS--ERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRS 652

Query: 1469 WTSSSLLYAGRLELI*SILQGIHCFWLSTLPVPCGVIDRVIALCRRFLWDGG-----KPL 1633
            W S +L +AGR +LI S++ G+  FW+ST  +P G I ++ +LC +FLW G         
Sbjct: 653  WVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSK 712

Query: 1634 VAWKDLTVSKEKGGLGIRDVKA*NNTLLAKVLW 1732
            V+W D  + K +GGLG R     N TLL +++W
Sbjct: 713  VSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIW 745



 Score =  134 bits (336), Expect(2) = 3e-99
 Identities = 77/232 (33%), Positives = 131/232 (56%), Gaps = 3/232 (1%)
 Frame = +3

Query: 45  AKPTLQNSPGDSDLQQKVIELRRDTRFLCETERSFLYQKAKCCYFLNSDRNSKLFHFVVK 224
           A P++ N+  + + Q+K +        L   E SF +Q+++  +F   D N+  FH +V 
Sbjct: 184 ANPSVSNAALELEAQRKWV-------LLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVD 236

Query: 225 RNAKKNFISAVIKEDGDPTTSMNEVVQEFLDYYHNLLGSIHS---VTRLNTDVLTSGPLI 395
                N I++++  +G    S   ++   + YY  LLGSI S   + + + ++L +    
Sbjct: 237 SRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRCS 296

Query: 396 QPDDCDLLCRMVDDTEIKSVVFAMGNYKSPGPDCYSADFFKKAWSVVGNDVCAAVKEFFL 575
           Q D C  L +   D EIK+   ++   K+ GPD YS +FF+  WS++G +V AA+ EFF 
Sbjct: 297 Q-DQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355

Query: 576 TRKLLKKLNHTIIALVLKKSHTNSITDYRAISCRNVVYKIISKILANRLTGI 731
           + +LLK+ N T + L+ K S+  +I+++R ISC N +YK+ISK+L +RL G+
Sbjct: 356 SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGL 407


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  256 bits (655), Expect(2) = 3e-99
 Identities = 139/333 (41%), Positives = 196/333 (58%), Gaps = 6/333 (1%)
 Frame = +2

Query: 752  AQVAFIEGRHMFENVYLTQELIKQYKHKRISPGCFIKVDLKKEYDSVS*DFLIRVLNGLG 931
            +Q AF+ GR + ENV L  E++  Y    ISP   +KVDLKK +DSV  +F+   L  L 
Sbjct: 415  SQSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALA 474

Query: 932  FPLTYVGWMVECVTTTSFSIWINGEIHGIF*GKKGLRQGDLIFPYLFVLCMEYLSRLLNR 1111
             P  Y+ W+ +C+TT SF+I +NG   G F   KGLRQGD + PYLFVL ME  S+LL  
Sbjct: 475  IPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYS 534

Query: 1112 NTKALQFFHHFRCQALKMSHLTYADDLMFFSRGDVNSVKILWDSLMEFGEVSGLQANSFK 1291
               +    +H +   L +SHL +ADD+M F  G  +S+  + ++L +F + SGL+ N  K
Sbjct: 535  RYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDK 594

Query: 1292 SQVFFGGVQLETCEEILDLTQ-IPLGELPVRYLGIPLAAEGLRILHYVPLLEKISSQIGV 1468
            SQ+F  G+ L   E I       P G  P+RYLG+PL    LRI  Y PLLEK+S+++  
Sbjct: 595  SQLFQAGLDLS--ERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRS 652

Query: 1469 WTSSSLLYAGRLELI*SILQGIHCFWLSTLPVPCGVIDRVIALCRRFLWDGG-----KPL 1633
            W S +L +AGR +LI S++ G+  FW+ST  +P G I ++ +LC +FLW G         
Sbjct: 653  WVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSK 712

Query: 1634 VAWKDLTVSKEKGGLGIRDVKA*NNTLLAKVLW 1732
            V+W D  + K +GGLG R     N TLL +++W
Sbjct: 713  VSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIW 745



 Score =  134 bits (336), Expect(2) = 3e-99
 Identities = 77/232 (33%), Positives = 131/232 (56%), Gaps = 3/232 (1%)
 Frame = +3

Query: 45  AKPTLQNSPGDSDLQQKVIELRRDTRFLCETERSFLYQKAKCCYFLNSDRNSKLFHFVVK 224
           A P++ N+  + + Q+K +        L   E SF +Q+++  +F   D N+  FH +V 
Sbjct: 184 ANPSVSNAALELEAQRKWV-------LLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVD 236

Query: 225 RNAKKNFISAVIKEDGDPTTSMNEVVQEFLDYYHNLLGSIHS---VTRLNTDVLTSGPLI 395
                N I++++  +G    S   ++   + YY  LLGSI S   + + + ++L +    
Sbjct: 237 SRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLTYRCS 296

Query: 396 QPDDCDLLCRMVDDTEIKSVVFAMGNYKSPGPDCYSADFFKKAWSVVGNDVCAAVKEFFL 575
           Q D C  L +   D EIK+   ++   K+ GPD YS +FF+  WS++G +V AA+ EFF 
Sbjct: 297 Q-DQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355

Query: 576 TRKLLKKLNHTIIALVLKKSHTNSITDYRAISCRNVVYKIISKILANRLTGI 731
           + +LLK+ N T + L+ K S+  +I+++R ISC N +YK+ISK+L +RL G+
Sbjct: 356 SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGL 407


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  262 bits (669), Expect(2) = 9e-98
 Identities = 140/350 (40%), Positives = 205/350 (58%), Gaps = 10/350 (2%)
 Frame = +2

Query: 755  QVAFIEGRHMFENVYLTQELIKQYKHKRISPGCFIKVDLKKEYDSVS*DFLIRVLNGLGF 934
            Q AF++ R + EN+ L  EL+K Y    IS  C IK+D+ K +DSV   FLI V   LGF
Sbjct: 562  QSAFVKDRLLIENLLLATELVKDYHKDTISTRCAIKIDISKAFDSVQWPFLINVFTILGF 621

Query: 935  PLTYVGWMVECVTTTSFSIWINGEIHGIF*GKKGLRQGDLIFPYLFVLCMEYLSRLLNRN 1114
            P  ++ W+  C+TT SFS+ +NGE+ G F   +GLRQG  + PYLFV+CM+ LS++L++ 
Sbjct: 622  PREFIHWINICITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKA 681

Query: 1115 TKALQFFHHFRCQALKMSHLTYADDLMFFSRGDVNSVKILWDSLMEFGEVSGLQANSFKS 1294
              A  F +H +C+ + ++HL++ADDLM  S G + S++ +     EF + SGL+ +  KS
Sbjct: 682  AAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKS 741

Query: 1295 QVFFGGVQLETCEEILDLTQIPLGELPVRYLGIPLAAEGLRILHYVPLLEKISSQIGVWT 1474
             V+  G+      E+ D      G+LPVRYLG+PL  + L     +PLLE++  +IG WT
Sbjct: 742  TVYLAGLSATARNEVADRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWT 801

Query: 1475 SSSLLYAGRLELI*SILQGIHCFWLSTLPVPCGVIDRVIALCRRFLWDG-----GKPLVA 1639
            S  L YAGRL LI S+L  I  FWL+   +P   I  +  +C  FLW G      K  ++
Sbjct: 802  SRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKIS 861

Query: 1640 WKDLTVSKEKGGLGIRDVKA*NNTLLAKVLW-----NNTLLGKVGSPHLL 1774
            W  +   K++GGLG+R +K  N+    K++W     +N+L  K    HLL
Sbjct: 862  WHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLL 911



 Score =  123 bits (309), Expect(2) = 9e-98
 Identities = 73/199 (36%), Positives = 105/199 (52%), Gaps = 4/199 (2%)
 Frame = +3

Query: 138 ERSFLYQKAKCCYFLNSDRNSKLFHFVVKRNAKKNFISAVIKEDGDPTTSMNEVVQE--- 308
           E  +L QK+K  +    D+N+K FH         N I  ++  DG   T  +E+  E   
Sbjct: 353 EEKYLKQKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAER 412

Query: 309 -FLDYYHNLLGSIHSVTRLNTDVLTSGPLIQPDDCDLLCRMVDDTEIKSVVFAMGNYKSP 485
            F ++   +      VT      L        D   L+ R V   EI+ V+F M + KSP
Sbjct: 413 FFREFLQLIPNDFEGVTITELQQLLPVRCSDADQQSLI-RPVTAEEIRKVLFRMPSDKSP 471

Query: 486 GPDCYSADFFKKAWSVVGNDVCAAVKEFFLTRKLLKKLNHTIIALVLKKSHTNSITDYRA 665
           GPD Y+++FFK  W ++G++   AV+ FF    L K +N TI+AL+ KK+    + DYR 
Sbjct: 472 GPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREMKDYRP 531

Query: 666 ISCRNVVYKIISKILANRL 722
           ISC NV+YK+ISKI+ANRL
Sbjct: 532 ISCCNVLYKVISKIIANRL 550


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  246 bits (629), Expect(2) = 4e-96
 Identities = 125/331 (37%), Positives = 189/331 (57%), Gaps = 5/331 (1%)
 Frame = +2

Query: 755  QVAFIEGRHMFENVYLTQELIKQYKHKRISPGCFIKVDLKKEYDSVS*DFLIRVLNGLGF 934
            Q AF++ R + ENV L  EL+K Y  + ++P C +K+D+ K +DSV   FL+  L  L F
Sbjct: 859  QSAFVKERLLMENVLLATELVKDYHKESVTPRCAMKIDISKAFDSVQWQFLLNTLEALNF 918

Query: 935  PLTYVGWMVECVTTTSFSIWINGEIHGIF*GKKGLRQGDLIFPYLFVLCMEYLSRLLNRN 1114
            P T+  W+  C++T +FS+ +NGE+ G F   +GLRQG  + PYLFV+CM  LS +++  
Sbjct: 919  PETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEA 978

Query: 1115 TKALQFFHHFRCQALKMSHLTYADDLMFFSRGDVNSVKILWDSLMEFGEVSGLQANSFKS 1294
                   +H +C+ + ++HL +ADDLM F  G   S++ + +   EF   SGLQ +  KS
Sbjct: 979  AVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKS 1038

Query: 1295 QVFFGGVQLETCEEILDLTQIPLGELPVRYLGIPLAAEGLRILHYVPLLEKISSQIGVWT 1474
             ++  GV      + L       G+LPVRYLG+PL  + +    Y PL+E + ++I  WT
Sbjct: 1039 TIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWT 1098

Query: 1475 SSSLLYAGRLELI*SILQGIHCFWLSTLPVPCGVIDRVIALCRRFLWDG-----GKPLVA 1639
            + SL YAGRL L+ S++  I  FW+S   +P G I  +  LC  FLW G      K  +A
Sbjct: 1099 ARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIA 1158

Query: 1640 WKDLTVSKEKGGLGIRDVKA*NNTLLAKVLW 1732
            W  +   K++GGLGI+ +   N     K++W
Sbjct: 1159 WSSICQPKKEGGLGIKSLAEANKVSCLKLIW 1189



 Score =  133 bits (335), Expect(2) = 4e-96
 Identities = 88/242 (36%), Positives = 128/242 (52%), Gaps = 7/242 (2%)
 Frame = +3

Query: 18   KRVKEA---LEEAKPTLQNSPGDSDLQQKVIELRRDTRFLCETERSFLYQKAKCCYFLNS 188
            KR +EA   L E + T   +P    + ++ ++   D   L E E  FL QK+K  +    
Sbjct: 608  KRTREAHILLCEKQATTLANPSQETIAEE-LKAYTDWTHLSELEEGFLKQKSKLHWMNVG 666

Query: 189  DRNSKLFHFVVKRNAKKNFISAVIKEDGDPTTSMNEVVQEFLDYYHNLL----GSIHSVT 356
            D N+  FH   +    +N I  +   + +   +  E+  E   +++  L    G  H ++
Sbjct: 667  DGNNSYFHKAAQVRKMRNSIREIRGPNAETLQTSEEIKGEAERFFNEFLNRQSGDFHGIS 726

Query: 357  RLNTDVLTSGPLIQPDDCDLLCRMVDDTEIKSVVFAMGNYKSPGPDCYSADFFKKAWSVV 536
              +   L S      D  ++L R V   EI+ V+FAM N KSPGPD Y+++FFK  WS+ 
Sbjct: 727  VEDLRNLMSYRCSVTDQ-NILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWSLT 785

Query: 537  GNDVCAAVKEFFLTRKLLKKLNHTIIALVLKKSHTNSITDYRAISCRNVVYKIISKILAN 716
            G D  AA++ FF+   L K LN TI+AL+ KK     + DYR ISC NV+YK+ISKILAN
Sbjct: 786  GPDFIAAIQSFFVKGFLPKGLNATILALIPKKDEAIEMKDYRPISCCNVLYKVISKILAN 845

Query: 717  RL 722
            RL
Sbjct: 846  RL 847