BLASTX nr result

ID: Cephaelis21_contig00014852 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00014852
         (1706 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAA78386.1| protein 1 [Petunia x hybrida]                         535   e-149
ref|XP_002283843.1| PREDICTED: uncharacterized protein LOC100245...   529   e-147
ref|XP_002323871.1| predicted protein [Populus trichocarpa] gi|2...   513   e-143
ref|XP_003543812.1| PREDICTED: uncharacterized protein LOC100787...   473   e-131
ref|XP_003554566.1| PREDICTED: uncharacterized protein LOC100101...   472   e-130

>emb|CAA78386.1| protein 1 [Petunia x hybrida]
          Length = 421

 Score =  535 bits (1379), Expect = e-149
 Identities = 292/437 (66%), Positives = 325/437 (74%), Gaps = 32/437 (7%)
 Frame = +2

Query: 275  MGRSPCCDKVGLKKGPWTPEEDQKLLAYIEGHGHGSWRALPAKAGLQRCGKSCRLRWTNY 454
            MGRSPCCDKVGLKKGPWTPEEDQKLLAYIE HGHGSWRALPAKAGLQRCGKSCRLRWTNY
Sbjct: 1    MGRSPCCDKVGLKKGPWTPEEDQKLLAYIEEHGHGSWRALPAKAGLQRCGKSCRLRWTNY 60

Query: 455  LRPDIKRGKFSIQEEQSIIQLHALLGNRWSAIATHLAKRTDNEIKNYWNTHLKKRLAKMG 634
            LRPDIKRGKF++QEEQ+IIQLHALLGNRWSAIATHL KRTDNEIKNYWNTHLKKRL KMG
Sbjct: 61   LRPDIKRGKFTLQEEQTIIQLHALLGNRWSAIATHLPKRTDNEIKNYWNTHLKKRLVKMG 120

Query: 635  IDPVTHKPKSDAMLSSDGQSKHAANLSHMAQWESARLEAEARLVRQSKLRSNS------- 793
            IDPVTHKPK+DA+LS DGQSK+AANLSHMAQWESARLEAEARLVRQSKLRSNS       
Sbjct: 121  IDPVTHKPKNDALLSHDGQSKNAANLSHMAQWESARLEAEARLVRQSKLRSNSFQNPLAS 180

Query: 794  --LFASPEFSTPSSPLNKPSLAP---PMPPRCLDILKAWNGVWAK--------------S 916
              LF SP   TPSSPL+KP + P   P  PRCLD+LKAWNGVW K              S
Sbjct: 181  HELFTSP---TPSSPLHKPIVTPTKAPGSPRCLDVLKAWNGVWTKPMNDVLHADGSTSAS 237

Query: 917  NEPGASGLGGDLESPTSTVSYSENTAPQMSSSNRMGDSSTAFYEFVGNNSSGSFDGGIMK 1096
                 + LG DLESPTST+SY EN   Q  S+  + ++ST+ +EFVG NSSGS +GGIM 
Sbjct: 238  ATVSVNALGLDLESPTSTLSYFENA--QHISTGMIQENSTSLFEFVG-NSSGSSEGGIMN 294

Query: 1097 EEGEDDWKGLGRKS--NLP---DGI-ENSVTFISALQDMTMPPDSGAAWNGDSLRAYDQD 1258
            EE E+DWKG G  S  +LP   DGI ENS++  S LQD+TMP D+   W  +SLR+  +D
Sbjct: 295  EESEEDWKGFGNSSTGHLPEYKDGINENSMSLTSTLQDLTMPMDT--TWTAESLRSNAED 352

Query: 1259 EDHARSGGNFVERFTDLLLSNASPTDRSISEGCGESDSGAVTTSAAGPGTEYLEDNKNYW 1438
              H   G NFVE FTDLLLS +   D  +S    +SD+G     +    +E   DNKNYW
Sbjct: 353  ISH---GNNFVETFTDLLLSTSG--DGGLSGNGTDSDNGG---GSGNDPSETCGDNKNYW 404

Query: 1439 NSILNLVNSSPPDSPMF 1489
            NSI NLVNSSP DS MF
Sbjct: 405  NSIFNLVNSSPSDSAMF 421


>ref|XP_002283843.1| PREDICTED: uncharacterized protein LOC100245564 [Vitis vinifera]
          Length = 393

 Score =  529 bits (1362), Expect = e-147
 Identities = 280/423 (66%), Positives = 320/423 (75%), Gaps = 18/423 (4%)
 Frame = +2

Query: 275  MGRSPCCDKVGLKKGPWTPEEDQKLLAYIEGHGHGSWRALPAKAGLQRCGKSCRLRWTNY 454
            MGRSPCCDKVGLKKGPWTPEEDQKLLAYIE HGHGSWRALP+KAGLQRCGKSCRLRWTNY
Sbjct: 1    MGRSPCCDKVGLKKGPWTPEEDQKLLAYIEEHGHGSWRALPSKAGLQRCGKSCRLRWTNY 60

Query: 455  LRPDIKRGKFSIQEEQSIIQLHALLGNRWSAIATHLAKRTDNEIKNYWNTHLKKRLAKMG 634
            LRPDIKRGKFS+QEEQ+IIQLHALLGNRWSAIATHL KRTDNEIKNYWNTHLKKRLAKMG
Sbjct: 61   LRPDIKRGKFSLQEEQTIIQLHALLGNRWSAIATHLPKRTDNEIKNYWNTHLKKRLAKMG 120

Query: 635  IDPVTHKPKSDAMLSSDGQSKHAANLSHMAQWESARLEAEARLVRQSKLRSNSLF---AS 805
            IDPVTHKPKSDA+LSSDGQSK+AANLSHMAQWESARLEAEARLVR+SKLRSNS      +
Sbjct: 121  IDPVTHKPKSDALLSSDGQSKNAANLSHMAQWESARLEAEARLVRESKLRSNSFNHHPGT 180

Query: 806  PEFSTPSSPLNKPSLAPPMPPRCLDILKAWNGVWAKSNEPGASGLG-------------- 943
            P  ST ++   K +     PP  LD+LKAW+GVW KS E G S  G              
Sbjct: 181  PSASTSAAVGGKTAAVSSSPPLYLDVLKAWHGVWPKSTEGGGSSGGGGGGGGGGGGVAVA 240

Query: 944  GDLESPTSTVSYSENTAPQMSSSNRMGDSSTAFYEFVGNNSSGSFDGGIMKEEG-EDDWK 1120
            GDLESPTST+SYSEN A           +STA  +FVGN  SGS +GGI+K+EG + +WK
Sbjct: 241  GDLESPTSTLSYSENAA---------AVASTAVIDFVGN--SGSCEGGIIKDEGDQQEWK 289

Query: 1121 GLGRKSNLPDGIENSVTFISALQDMTMPPDSGAAWNGDSLRAYDQDEDHARSGGNFVERF 1300
            G+G  + LP+      +F SAL DM +P DSG AW  +SL+          +GG+F+E F
Sbjct: 290  GMGSSTQLPE------SFTSALHDMAVPMDSG-AWTPESLKTV--------NGGHFIEGF 334

Query: 1301 TDLLLSNASPTDRSISEGCGESDSGAVTTSAAGPGTEYLEDNKNYWNSILNLVNSSPPDS 1480
            T+LLLSN++    S ++G G+SD+G      +G G++Y EDNKNYWNSILNLVNSSP DS
Sbjct: 335  TELLLSNSTNRTLSDTDGGGDSDNG----GCSGSGSDYYEDNKNYWNSILNLVNSSPYDS 390

Query: 1481 PMF 1489
            PMF
Sbjct: 391  PMF 393


>ref|XP_002323871.1| predicted protein [Populus trichocarpa] gi|222866873|gb|EEF04004.1|
            predicted protein [Populus trichocarpa]
          Length = 401

 Score =  513 bits (1322), Expect = e-143
 Identities = 279/417 (66%), Positives = 318/417 (76%), Gaps = 12/417 (2%)
 Frame = +2

Query: 275  MGRSPCCDKVGLKKGPWTPEEDQKLLAYIEGHGHGSWRALPAKAGLQRCGKSCRLRWTNY 454
            MGRSPCCDKVGLKKGPWTPEEDQKLLAYIE HGHGSWRALPAKAGLQRCGKSCRLRWTNY
Sbjct: 1    MGRSPCCDKVGLKKGPWTPEEDQKLLAYIEEHGHGSWRALPAKAGLQRCGKSCRLRWTNY 60

Query: 455  LRPDIKRGKFSIQEEQSIIQLHALLGNRWSAIATHLAKRTDNEIKNYWNTHLKKRLAKMG 634
            LRPDIKRGKFS+QEEQ+IIQLHALLGNRWSAIATHL KRTDNEIKNYWNTHLKKRLAKMG
Sbjct: 61   LRPDIKRGKFSLQEEQTIIQLHALLGNRWSAIATHLPKRTDNEIKNYWNTHLKKRLAKMG 120

Query: 635  IDPVTHKPKSDAMLSSDGQSKHAANLSHMAQWESARLEAEARLVRQSKLRSNSLFASPEF 814
            IDPVTHK K+DA+LS DGQSK+AANLSHMAQWESARLEAEARLVR+SKLRS S+      
Sbjct: 121  IDPVTHKSKNDALLSIDGQSKNAANLSHMAQWESARLEAEARLVRESKLRSQSIQHQLSS 180

Query: 815  STP-------SSPLNKPSLAPPMPPRCLDILKAWNGVWAKSNEPGASGL----GGDLESP 961
            +TP       SSP    S     PPR LD LKAWN  W+KS+E    GL    G  LESP
Sbjct: 181  TTPGYFPGSGSSP-GSTSSTLAQPPRSLDALKAWNDGWSKSSEGNGGGLNMGIGDVLESP 239

Query: 962  TSTVSYSENTAPQMSSSNRMGDSSTAFYEFVGNNSSGSFDGGIMKEEGEDDWKGLGRKSN 1141
            TST+++SEN AP + +S  +G++S +  EFVG  +SGS + GI+KEEGE DWK L   S+
Sbjct: 240  TSTLTFSEN-APPVMNSGAVGENSISMIEFVG--TSGSTETGIIKEEGEHDWKSLSNSSH 296

Query: 1142 LPDGIENSVTFISALQDMTMPPDSGAAWNGDSLRAYDQDEDHARSGGNFVER-FTDLLLS 1318
            LPD   NSV+  S L DMT+  +  A WN DSLRA   + D+   G N +E  FT LLLS
Sbjct: 297  LPD---NSVSLTSTLHDMTISME--APWNPDSLRA---NCDNVHVGKNVMEEGFTHLLLS 348

Query: 1319 NASPTDRSISEGCGESDSGAVTTSAAGPGTEYLEDNKNYWNSILNLVNSSPPDSPMF 1489
            +++  +RS+S+   +SD      S +G G+ Y EDNKNYWNSILNLVNSSP +SPMF
Sbjct: 349  DSA--ERSLSDDGKDSDHSG--GSGSGSGSNYYEDNKNYWNSILNLVNSSPSNSPMF 401


>ref|XP_003543812.1| PREDICTED: uncharacterized protein LOC100787446 [Glycine max]
          Length = 422

 Score =  473 bits (1217), Expect = e-131
 Identities = 267/446 (59%), Positives = 300/446 (67%), Gaps = 41/446 (9%)
 Frame = +2

Query: 275  MGRSPCCDKVGLKKGPWTPEEDQKLLAYIEGHGHGSWRALPAKAGLQRCGKSCRLRWTNY 454
            MGRSPCCDKVGLKKGPWTPEEDQKLLAYIE HGHGSWRALPAKAGLQRCGKSCRLRWTNY
Sbjct: 1    MGRSPCCDKVGLKKGPWTPEEDQKLLAYIEEHGHGSWRALPAKAGLQRCGKSCRLRWTNY 60

Query: 455  LRPDIKRGKFSIQEEQSIIQLHALLGNRWSAIATHLAKRTDNEIKNYWNTHLKKRLAKMG 634
            LRPDIKRGKFS+QEEQ+IIQLHALLGNRWSAIATHL KRTDNEIKNYWNTHLKKRL KMG
Sbjct: 61   LRPDIKRGKFSLQEEQTIIQLHALLGNRWSAIATHLPKRTDNEIKNYWNTHLKKRLTKMG 120

Query: 635  IDPVTHKPKSDAMLSSDGQSKHAANLSHMAQWESARLEAEARLVRQSKLRSNSL------ 796
            IDPVTHKPK+DA+LSSDGQSK AANLSHMAQWESARLEAEARLVR+SK+RS+SL      
Sbjct: 121  IDPVTHKPKNDALLSSDGQSKTAANLSHMAQWESARLEAEARLVRESKIRSHSLQQQFGS 180

Query: 797  -----FASPEFSTPSSPL-----NKPSLAPPMPP----------RCLDILKAWN-GVWAK 913
                  +S   ST +S L     NKP   PP PP            LD+LKAWN G W K
Sbjct: 181  SSSTFASSSSASTSASALNNNSNNKPEAPPPPPPPPPPSPPPSRSSLDVLKAWNSGGWLK 240

Query: 914  SNEPGAS-----GLGGDLESPTSTVSYSENTAPQM------SSSNRMGDSSTAFYEFV-- 1054
            SNE   +     G+ GDLESPTST+S+SEN  P M      +++N   DS+    EFV  
Sbjct: 241  SNEGSGAIASNVGVSGDLESPTSTLSFSENAPPIMNGIRGENNNNNKNDSAMPMIEFVRI 300

Query: 1055 -GNNSSGSFDGGIMKEEGEDDWKGLGRKSNLPDGIENSVTFISALQDMTMPPDSGAAWNG 1231
             GN+SS      ++KEEGE +WKG    S          TF S+L + TM  +    W  
Sbjct: 301  SGNSSS------LVKEEGEQEWKGTYDSS--------ITTFSSSLHEFTM--NMEGTWAS 344

Query: 1232 DSLRAYDQDEDHARSGGNFVERFTDLLLSNASPTDRSISEGCGESDSGAVTTSAAGPGTE 1411
            +SLR     +D     G     FT+LLL   S      SEG G+S++        G   +
Sbjct: 345  ESLRTSGSHDDDIVEEG-----FTNLLLKTNSEDPNLSSEGGGQSNN---DDGGDGSNND 396

Query: 1412 YLEDNKNYWNSILNLVNSSPPDSPMF 1489
            + EDN NYWNSILNLVNSSP   PMF
Sbjct: 397  FYEDNNNYWNSILNLVNSSPSHPPMF 422


>ref|XP_003554566.1| PREDICTED: uncharacterized protein LOC100101835 [Glycine max]
          Length = 410

 Score =  472 bits (1214), Expect = e-130
 Identities = 264/434 (60%), Positives = 298/434 (68%), Gaps = 29/434 (6%)
 Frame = +2

Query: 275  MGRSPCCDKVGLKKGPWTPEEDQKLLAYIEGHGHGSWRALPAKAGLQRCGKSCRLRWTNY 454
            MGRSPCCDKVGLKKGPWTPEEDQKLLAYIE HGHGSWRALPAKAGLQRCGKSCRLRWTNY
Sbjct: 1    MGRSPCCDKVGLKKGPWTPEEDQKLLAYIEEHGHGSWRALPAKAGLQRCGKSCRLRWTNY 60

Query: 455  LRPDIKRGKFSIQEEQSIIQLHALLGNRWSAIATHLAKRTDNEIKNYWNTHLKKRLAKMG 634
            LRPDIKRGKFS+QEEQ+IIQLHALLGNRWSAIATHL KRTDNEIKNYWNTH+KKRL KMG
Sbjct: 61   LRPDIKRGKFSLQEEQTIIQLHALLGNRWSAIATHLPKRTDNEIKNYWNTHIKKRLTKMG 120

Query: 635  IDPVTHKPKSDAMLSSDGQSKHAANLSHMAQWESARLEAEARLVRQSKLRSNSL------ 796
            IDPVTHKPK+DA+LSSDGQSK AANLSHMAQWESARLEAEARLVR+SK+RS+SL      
Sbjct: 121  IDPVTHKPKNDALLSSDGQSKTAANLSHMAQWESARLEAEARLVRESKIRSHSLQHQLGS 180

Query: 797  -----FASPEFSTPSSPL---NKPSLAPPMPP---RCLDILKAWN-GVWAKSNEPGAS-- 934
                  +S   ST +S L   NKP    P PP     LD+LKAWN G W +SNE      
Sbjct: 181  SSSTFASSSSASTSASALNNNNKPEAQRPPPPPSRSSLDVLKAWNSGGWLESNEGNGGIV 240

Query: 935  ---GLGGDLESPTSTVSYSENTAPQMS----SSNRMGDSSTAFYEFVGNNSSGSFDGGIM 1093
               G+ GDLESPTST+S+SEN  P M+     +N   DS+    EFVGN+ + S    ++
Sbjct: 241  SNVGVSGDLESPTSTLSFSENAPPIMNGIGGENNNNNDSAMPMIEFVGNSGNSS---SLV 297

Query: 1094 KEEGEDDWKGLGRKSNLPDGIENSVTFISALQDMTMPPDSGAAWNGDSLRAYDQDEDHAR 1273
            KEE E +WK     S        + TF S L + TM  +    W  +SLR          
Sbjct: 298  KEEAEQEWKSTYDSS-------ITTTFSSGLHEFTM--NMEGTWASESLRT--------- 339

Query: 1274 SGGNFV--ERFTDLLLSNASPTDRSISEGCGESDSGAVTTSAAGPGTEYLEDNKNYWNSI 1447
            SG + +  E FT+LLL   S      SE  GES +G       G  +++ EDN NYWNSI
Sbjct: 340  SGSHDIVEEGFTNLLLKTNSDDPSLSSEDGGESKNG---DGGGGTNSDFYEDNNNYWNSI 396

Query: 1448 LNLVNSSPPDSPMF 1489
            LNLVNSSP  SPMF
Sbjct: 397  LNLVNSSPSHSPMF 410

