BLASTX nr result

ID: Cephaelis21_contig00016100 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00016100
         (835 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002303671.1| predicted protein [Populus trichocarpa] gi|2...   100   5e-19
gb|AAM13850.1| unknown protein [Arabidopsis thaliana]                  94   5e-17
ref|XP_002887879.1| RWP-RK domain-containing protein [Arabidopsi...    93   6e-17
ref|XP_003565547.1| PREDICTED: protein NLP3-like [Brachypodium d...    92   2e-16
gb|EAY73195.1| hypothetical protein OsI_01067 [Oryza sativa Indi...    92   2e-16

>ref|XP_002303671.1| predicted protein [Populus trichocarpa] gi|222841103|gb|EEE78650.1|
            predicted protein [Populus trichocarpa]
          Length = 953

 Score =  100 bits (248), Expect = 5e-19
 Identities = 92/305 (30%), Positives = 140/305 (45%), Gaps = 28/305 (9%)
 Frame = +1

Query: 4    GDKRYLITSDQPFHLKEPVHGLCSNRKHCMDCLIPIDDDMSTEEEGLLGPPRRVFSKW*S 183
            G +  L TS QPF L    +GL   R   +  +  +D +   E    LG P RVF +   
Sbjct: 169  GGQHVLTTSGQPFVLDPHSNGLHQYRMVSLMYMFSVDGESDRE----LGLPGRVFRQ--- 221

Query: 184  RSEPWL**LHLY*ISSAR*GCGLCYWLCFKALPMPSSITALKPTQPESSIVLKLV----- 348
            +S  W   +  Y   S++    L + L +       ++   +P+      VL+L+     
Sbjct: 222  KSPEWTPNVQYY---SSKEYSRLDHALRYNVRGT-LALPVFEPSGQSCVGVLELIMNSQK 277

Query: 349  VSYKPILWGVCKRL----------FEKPEVTL---GLPTLVPEIEDLLREVCTTHRLPLA 489
            ++Y P +  VCK L           + P + +   G    + EI ++L  VC TH+LPLA
Sbjct: 278  INYAPEVDKVCKALEAVNLKSSEILDPPSIQICNEGRQNALSEILEILTMVCETHKLPLA 337

Query: 490  QTWVSSEYKNEAIWYKFGASYAVNDDFG--------SDDEEGSCIVDVR--DFLQACQSY 639
            QTWV   +++  + Y  G   +     G        S  +    +VD R   F +AC  +
Sbjct: 338  QTWVPCIHRS-VLTYGGGLKKSCTSFDGNCNGQVCMSTTDVAFYVVDARMWGFREACLEH 396

Query: 640  RIKKGRGVVGKALSSGGACFCKDISQLSISEYSLVRRAQQAGLTGCFAISLTHRIWWADY 819
             ++KG+GV G+A  S  +CFC DI+Q   +EY LV  A+  GLT CFAI L    +  D 
Sbjct: 397  HLQKGQGVAGRAFLSQNSCFCPDITQFCKTEYPLVHYARMFGLTSCFAIFL-RSSYTGDD 455

Query: 820  VYIVE 834
             YI+E
Sbjct: 456  DYILE 460


>gb|AAM13850.1| unknown protein [Arabidopsis thaliana]
          Length = 841

 Score = 93.6 bits (231), Expect = 5e-17
 Identities = 88/307 (28%), Positives = 137/307 (44%), Gaps = 35/307 (11%)
 Frame = +1

Query: 19   LITSDQPFHLKEPVHGLCSNRKHCMDCLIPIDDDMSTEEEGLLGPPRRVFSKW*SRSEPW 198
            L TS QPF L    +GL   R   +  +  +D     E +G LG P RVF K   +   W
Sbjct: 140  LTTSGQPFVLGPNSNGLNQYRMVSLTYMFSLDG----ERDGELGLPGRVFRK---KLPEW 192

Query: 199  L**LHLY*ISS-AR*GCGLCYWLCFKALPMPSSITALKPTQPESSIVLKLV-----VSYK 360
               +  Y     +R G  L Y      +    ++   +P++     V++L+     ++Y 
Sbjct: 193  TPNVQYYSSKEFSRLGHALHY-----NVQGTLALPVFEPSRQLCVGVVELIMTSPKINYA 247

Query: 361  PILWGVCKRL------------FEKPEV-TLGLPTLVPEIEDLLREVCTTHRLPLAQTWV 501
            P +  VCK L             E  ++   G    + EI ++L  VC T++LPLAQTWV
Sbjct: 248  PEVEKVCKALEAVNLKTSEILNHETTQICNEGRQNALAEILEILTVVCETYKLPLAQTWV 307

Query: 502  SSEYKNEAIWYKFGASYAVNDDFGSDDEEGSC--------------IVD--VRDFLQACQ 633
               Y++      FG  +  +        +GSC              +VD  V  F  AC 
Sbjct: 308  PCRYRSVLA---FGGGFKKS----CSSFDGSCMGKVCMSTSDLAVYVVDAHVWGFRDACA 360

Query: 634  SYRIKKGRGVVGKALSSGGACFCKDISQLSISEYSLVRRAQQAGLTGCFAISLTHRIWWA 813
             + ++KG+GV G+A  SG  CFC+D+++   ++Y LV  A+   LT CFA+ L    +  
Sbjct: 361  EHHLQKGQGVAGRAFQSGNLCFCRDVTRFCKTDYPLVHYARMFKLTSCFAVCL-KSTYTG 419

Query: 814  DYVYIVE 834
            D  Y++E
Sbjct: 420  DDEYVLE 426


>ref|XP_002887879.1| RWP-RK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
            gi|297333720|gb|EFH64138.1| RWP-RK domain-containing
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 842

 Score = 93.2 bits (230), Expect = 6e-17
 Identities = 88/307 (28%), Positives = 137/307 (44%), Gaps = 35/307 (11%)
 Frame = +1

Query: 19   LITSDQPFHLKEPVHGLCSNRKHCMDCLIPIDDDMSTEEEGLLGPPRRVFSKW*SRSEPW 198
            L TS QPF L    +GL   R   +  +  +D     E +G LG P RVF K   R   W
Sbjct: 141  LTTSGQPFVLGPNSNGLNQYRMVSLTYMFSLDG----ERDGELGLPGRVFRK---RLPEW 193

Query: 199  L**LHLY*ISS-AR*GCGLCYWLCFKALPMPSSITALKPTQPESSIVLKLV-----VSYK 360
               +  Y     +R G  L Y      +    ++   +P++     V++L+     ++Y 
Sbjct: 194  TPNVQYYSSKEFSRLGHALHY-----NVQGTLALPVFEPSRQLCVGVVELIMTSPKINYA 248

Query: 361  PILWGVCKRL----FEKPEV---------TLGLPTLVPEIEDLLREVCTTHRLPLAQTWV 501
            P +  VCK L     +  E+           G    + EI ++L  VC T++LPLAQTWV
Sbjct: 249  PEVEKVCKALEAVNLKTSEILNNETTQICNEGRQNALAEILEILTVVCETYKLPLAQTWV 308

Query: 502  SSEYKNEAIWYKFGASYAVNDDFGSDDEEGSC--------------IVD--VRDFLQACQ 633
               +++      FG  +  +        +GSC              +VD  V  F  AC 
Sbjct: 309  PCRHRSVLA---FGGGFQKS----CSSFDGSCMGKVCMSTSDLAVYVVDAHVWGFRDACS 361

Query: 634  SYRIKKGRGVVGKALSSGGACFCKDISQLSISEYSLVRRAQQAGLTGCFAISLTHRIWWA 813
             + ++KG+GV G+A  SG  CFC+D+++   ++Y LV  A+   LT CFA+ L    +  
Sbjct: 362  EHHLQKGQGVAGRAFQSGNLCFCRDVTRFCKTDYPLVHYARMFKLTSCFAVCL-KSTYTG 420

Query: 814  DYVYIVE 834
            D  Y++E
Sbjct: 421  DDEYVLE 427


>ref|XP_003565547.1| PREDICTED: protein NLP3-like [Brachypodium distachyon]
          Length = 942

 Score = 91.7 bits (226), Expect = 2e-16
 Identities = 94/307 (30%), Positives = 135/307 (43%), Gaps = 30/307 (9%)
 Frame = +1

Query: 4    GDKRYLITSDQPFHLKEPVHGLCSNRKHCMDCLIPIDDDMSTEEEGLLGPPRRVFSKW*S 183
            GD+  L TS QPF L     GL   R   M  +  ID D + E    LG P RV+ +   
Sbjct: 167  GDRYVLTTSGQPFVLDHQSIGLLQYRAVSMMYMFSIDGDNAGE----LGLPGRVYKQ--- 219

Query: 184  RSEPWL**LHLY*ISS-AR*GCGLCYWLCFK-ALPMPSSITALKPTQPESSIVLKLVVSY 357
            +   W   +  Y  +   R    + Y +    ALP+        P+      V++L+++ 
Sbjct: 220  KVPEWTPNVQYYSSTEYPRLNHAISYNVHGTVALPV------FDPSVQSCIAVVELIMTS 273

Query: 358  KPILWG-----VCKRL----------FEKPEVTL---GLPTLVPEIEDLLREVCTTHRLP 483
            K I +      VCK L           + P V +   G  + + EI ++L  VC  H+LP
Sbjct: 274  KKINYADEVDKVCKALEAVNLKSTEILDHPNVQICNEGRQSALVEILEILTVVCEEHKLP 333

Query: 484  LAQTWVSSEYKN--------EAIWYKFGASYAVNDDFGSDDEEGSCIVDVR--DFLQACQ 633
            LAQTWV  +Y++        +     F  S  + +   S  +    ++D     F  AC 
Sbjct: 334  LAQTWVPCKYRSVLAHGGGVKKSCLSFDGS-CMGEVCMSTSDVAFHVIDAHMWGFRDACI 392

Query: 634  SYRIKKGRGVVGKALSSGGACFCKDISQLSISEYSLVRRAQQAGLTGCFAISLTHRIWWA 813
             + ++KG+GV GKA      CF KDISQ    EY LV  A+  GL GCFA+ L       
Sbjct: 393  EHHLQKGQGVSGKAFIYHRPCFSKDISQFCKVEYPLVHYARMFGLAGCFAVCLQSPYTGD 452

Query: 814  DYVYIVE 834
            DY YI+E
Sbjct: 453  DY-YILE 458


>gb|EAY73195.1| hypothetical protein OsI_01067 [Oryza sativa Indica Group]
          Length = 866

 Score = 91.7 bits (226), Expect = 2e-16
 Identities = 93/307 (30%), Positives = 136/307 (44%), Gaps = 30/307 (9%)
 Frame = +1

Query: 4   GDKRYLITSDQPFHLKEPVHGLCSNRKHCMDCLIPIDDDMSTEEEGLLGPPRRVFSKW*S 183
           GD+  L TS QPF L +   GL   R   M  +  +D     E  G LG P RV+ +   
Sbjct: 90  GDRYVLTTSGQPFVLDQQSIGLLQYRAVSMMYMFSVDG----ENAGELGLPGRVYKQ--- 142

Query: 184 RSEPWL**LHLY*ISS-AR*GCGLCYWLCFK-ALPMPSSITALKPTQPESSIVLKLVVSY 357
           +   W   +  Y  +   R    + Y +    ALP+        P+      V++L+++ 
Sbjct: 143 KVPEWTPNVQYYSSTEYPRLNHAISYNVHGTVALPV------FDPSVQNCIAVVELIMTS 196

Query: 358 KPILWG-----VCKRL----------FEKPEVTL---GLPTLVPEIEDLLREVCTTHRLP 483
           K I +      VCK L           + P V +   G  + + EI ++L  VC  H+LP
Sbjct: 197 KKINYAGEVDKVCKALEAVNLKSTEILDHPNVQICNEGRQSALVEILEILTVVCEEHKLP 256

Query: 484 LAQTWVSSEYKN--------EAIWYKFGASYAVNDDFGSDDEEGSCIVDVR--DFLQACQ 633
           LAQTWV  +Y++        +     F  S  + +   S  +    ++D     F  AC 
Sbjct: 257 LAQTWVPCKYRSVLAHGGGVKKSCLSFDGS-CMGEVCMSTSDVAFHVIDAHMWGFRDACV 315

Query: 634 SYRIKKGRGVVGKALSSGGACFCKDISQLSISEYSLVRRAQQAGLTGCFAISLTHRIWWA 813
            + ++KG+GV GKA      CF KDISQ    EY LV  A+  GL GCFAI L   ++  
Sbjct: 316 EHHLQKGQGVSGKAFIYRRPCFSKDISQFCKLEYPLVHYARMFGLAGCFAICL-QSMYTG 374

Query: 814 DYVYIVE 834
           D  YI+E
Sbjct: 375 DDDYILE 381

