BLASTX nr result

ID: Ephedra26_contig00014690 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00014690
         (1143 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004252719.1| PREDICTED: uncharacterized protein LOC101249...    82   4e-13
ref|XP_006366423.1| PREDICTED: heterogeneous nuclear ribonucleop...    77   1e-11
gb|EOY15466.1| Hydroxyproline-rich glycoprotein family protein, ...    72   5e-10
gb|EOY15467.1| Hydroxyproline-rich glycoprotein family protein, ...    70   2e-09
gb|EXC52457.1| hypothetical protein L484_000896 [Morus notabilis]      67   2e-08
ref|XP_004148098.1| PREDICTED: uncharacterized protein LOC101221...    67   2e-08
ref|XP_006338048.1| PREDICTED: uncharacterized protein LOC102603...    58   6e-06
ref|XP_002301125.1| predicted protein [Populus trichocarpa]            58   8e-06

>ref|XP_004252719.1| PREDICTED: uncharacterized protein LOC101249715 [Solanum
            lycopersicum]
          Length = 348

 Score = 82.0 bits (201), Expect = 4e-13
 Identities = 92/337 (27%), Positives = 127/337 (37%), Gaps = 27/337 (8%)
 Frame = +3

Query: 159  MDETLKRKERLQALKNQAETPTNTTEDSNQESLMNPLDD---SNPAPSSQFSPSFDYYTD 329
            MDE+ KRKERL A++ +A    +  E      L NPL D    N    +   P FDYYTD
Sbjct: 1    MDESEKRKERLNAMRMEASQSGDYNEAVGYGGLTNPLTDVPSGNVESYAMPRPRFDYYTD 60

Query: 330  PLAAFS-NKKKSRQENSSPRPTK---STQQNPQ----TKSGFSNLHNNITGFXXXXXXXX 485
            P+AAFS NK+ + Q + SP+ ++   +   NPQ    T  G  ++     G         
Sbjct: 61   PMAAFSANKRSNNQPHVSPQVSQQCYTRATNPQSPICTPRGNYSVDQRSQGVHHTFNPLG 120

Query: 486  XWHQSTKHG----------SFSPNPPQFGGRPSSPVPRFFNSPD---GPRSQPFYGNSMQ 626
               Q++  G          + S + P+    P+S +   F SP    G R    YG    
Sbjct: 121  NPGQNSPFGIPQRGSPSAWNNSFDTPKNYLPPNSSMGGNFASPGIQRGGRPGFHYGQGSG 180

Query: 627  NGQQFYGNNNMMGQG-RGSPRSRPQFGFPNLGHGSPASSPPIHRFCQNQQEVARGSPFHG 803
                 YG +   G G RG+P       + + GH    S    HR    Q    RGSP+ G
Sbjct: 181  QPGSGYGGSPYQGSGYRGNP-------YQDSGHRGSPSQGSGHRGSPYQHSGNRGSPYQG 233

Query: 804  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYAPRGRGGRSFVSS 983
                                                             + RG  S   S
Sbjct: 234  SGQG-------------------------------------------RSQWRGNSSSPFS 250

Query: 984  PRSNNRGRNGEN--VSAREQPELFVKKSMVDDPWKDL 1088
             R   RG  G +   S   +P+L+  KSMV+DPWK+L
Sbjct: 251  FRGGRRGGRGSHGGTSGESRPDLYYSKSMVEDPWKEL 287


>ref|XP_006366423.1| PREDICTED: heterogeneous nuclear ribonucleoproteins A2/B1-like
            [Solanum tuberosum]
          Length = 348

 Score = 77.4 bits (189), Expect = 1e-11
 Identities = 88/333 (26%), Positives = 122/333 (36%), Gaps = 23/333 (6%)
 Frame = +3

Query: 159  MDETLKRKERLQALKNQAETPTNTTEDSNQESLMNPLDD---SNPAPSSQFSPSFDYYTD 329
            MDE+ KRKERL+A++ +A    +  E      L NPL D    N    +   P FDYYTD
Sbjct: 1    MDESEKRKERLKAMRMEASQSGDYNEAVGHGGLTNPLTDVPSGNVESYAMPRPRFDYYTD 60

Query: 330  PLAAFS-NKKKSRQENSSPRPTKSTQQNPQTKSGFSNLHNNITGFXXXXXXXXXWHQSTK 506
            P+AAFS NK+ + Q + SP+ ++     P+  +  S +                    T 
Sbjct: 61   PMAAFSANKRSNNQPHVSPQISQQCYTPPRATNPQSPI-------------------CTP 101

Query: 507  HGSFSPNPPQFG------GRPSSPVPRFFNSPDGPRSQP-FYGNSMQNGQQFYGNNNMMG 665
             G++S +    G      G P    P  F +P   R  P  + NS      +   N+ MG
Sbjct: 102  RGNYSVDQRSQGVHYNPLGNPGQNSP--FGTPQ--RGSPSAWNNSFGTPNNYLPPNSSMG 157

Query: 666  QGRGSP----RSRPQF------GFPNLGHGSPASSPPIHRFCQNQQEVARGSPFHGXXXX 815
                SP      RP F      G P  G+G        +R    Q    RGSP+      
Sbjct: 158  GNFASPGIHQGGRPGFHYGQGSGQPGSGYGGSPYQGSGYRGNPYQDSGHRGSPYQHSGNR 217

Query: 816  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYAPRGRGGRSFVSSPRSN 995
                                                         + RG  S   S R  
Sbjct: 218  GSPYQRPGNRGIPYQGSGQG-----------------------RSQWRGNSSSPISFRGG 254

Query: 996  NRGRNGEN--VSAREQPELFVKKSMVDDPWKDL 1088
             RG  G +   S   +P+L+  KSMV+DPWK+L
Sbjct: 255  RRGGRGSHGGTSGESRPDLYYSKSMVEDPWKEL 287


>gb|EOY15466.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao]
          Length = 368

 Score = 71.6 bits (174), Expect = 5e-10
 Identities = 98/366 (26%), Positives = 134/366 (36%), Gaps = 42/366 (11%)
 Frame = +3

Query: 159  MDETLKRKERLQALK---NQAETPTNTTEDSNQESLMNPLDDSNPAPSSQ----FSPSFD 317
            MDE+ KRKERL+A++    Q+E P N    S    L NPL +++   + Q     +P FD
Sbjct: 1    MDESEKRKERLKAMRLEAAQSEVPNNVATPSVPGHLSNPLSETSSTAAVQEDFCSTPRFD 60

Query: 318  YYTDPLAAFS-NKKKSRQENSSPRPTKSTQQNPQTKSGF-----SNLHNNITGF------ 461
            YYTDP+AAFS NKK+ + +N S +        P T SG+     S  H     +      
Sbjct: 61   YYTDPMAAFSANKKRGKADNQSTQ----NYFTPPTTSGWPVARVSPSHPGPRNYDMNPPV 116

Query: 462  ----XXXXXXXXXWHQSTKHGSFSPNPPQFGGRPSSPVPRFFNSPDGPRSQPFYGNS-MQ 626
                         +HQ   H +F+ +         SP+ R   SP    S   +GNS   
Sbjct: 117  RHMQSQYSLDQRMYHQQGPHSNFAAH--------RSPITR---SP----SHMHHGNSDAW 161

Query: 627  NGQQFYGN----------NNMMGQGRGSPRSRPQFGFPNLGHGSPASSPPIHRFCQNQQE 776
            NG Q +GN            M G     P + P+F  P+  + S  S+ P   F      
Sbjct: 162  NGSQAFGNYYSSASDGSPGGMFGTPLMHPGTTPRFWNPS--NASRYSNSPTPGFSPADIP 219

Query: 777  VARGSPFHGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYAPRG 956
              RG P                                                  + RG
Sbjct: 220  YGRGRP------------------------------QQFGNYPLPSPGHGGSLGLSSGRG 249

Query: 957  R----GGRSFVSSPRSNNRGRNGENVSAREQ----PELFVKKSMVDDPWKDLLASVSHGR 1112
            R    GG       RS  RG      S+       PE F  +SM++DPW+ L   +   R
Sbjct: 250  RGRGYGGSITHGIGRSGGRGLGFHGHSSASNRMMGPESFYDESMLEDPWQHLKPVLWRRR 309

Query: 1113 EAGSAS 1130
            EAG  S
Sbjct: 310  EAGMDS 315


>gb|EOY15467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao]
          Length = 345

 Score = 69.7 bits (169), Expect = 2e-09
 Identities = 93/350 (26%), Positives = 127/350 (36%), Gaps = 26/350 (7%)
 Frame = +3

Query: 159  MDETLKRKERLQALK---NQAETPTNTTEDSNQESLMNPLDDSNPAPSSQ----FSPSFD 317
            MDE+ KRKERL+A++    Q+E P N    S    L NPL +++   + Q     +P FD
Sbjct: 1    MDESEKRKERLKAMRLEAAQSEVPNNVATPSVPGHLSNPLSETSSTAAVQEDFCSTPRFD 60

Query: 318  YYTDPLAAFSNKKKSRQENSSPRPTKSTQQNPQTKSGFSNLHNNITGFXXXXXXXXXWHQ 497
            YYTDP+AA S    +R   S P P ++   NP  +   S                  +HQ
Sbjct: 61   YYTDPMAATSGWPVARVSPSHPGP-RNYDMNPPVRHMQSQ----------YSLDQRMYHQ 109

Query: 498  STKHGSFSPNPPQFGGRPSSPVPRFFNSPDGPRSQPFYGNS-MQNGQQFYGN-------- 650
               H +F+ +         SP+ R   SP    S   +GNS   NG Q +GN        
Sbjct: 110  QGPHSNFAAH--------RSPITR---SP----SHMHHGNSDAWNGSQAFGNYYSSASDG 154

Query: 651  --NNMMGQGRGSPRSRPQFGFPNLGHGSPASSPPIHRFCQNQQEVARGSPFHGXXXXXXX 824
                M G     P + P+F  P+  + S  S+ P   F        RG P          
Sbjct: 155  SPGGMFGTPLMHPGTTPRFWNPS--NASRYSNSPTPGFSPADIPYGRGRP---------- 202

Query: 825  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYAPRGR----GGRSFVSSPRS 992
                                                    + RGR    GG       RS
Sbjct: 203  --------------------QQFGNYPLPSPGHGGSLGLSSGRGRGRGYGGSITHGIGRS 242

Query: 993  NNRGRNGENVSAREQ----PELFVKKSMVDDPWKDLLASVSHGREAGSAS 1130
              RG      S+       PE F  +SM++DPW+ L   +   REAG  S
Sbjct: 243  GGRGLGFHGHSSASNRMMGPESFYDESMLEDPWQHLKPVLWRRREAGMDS 292


>gb|EXC52457.1| hypothetical protein L484_000896 [Morus notabilis]
          Length = 346

 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 62/207 (29%), Positives = 91/207 (43%), Gaps = 18/207 (8%)
 Frame = +3

Query: 159 MDETLKRKERLQALKNQAETPTNTTEDSNQES----LMNPLDDSN----PAPSSQFSPSF 314
           M+E+ KR+ERL+A++++A   +  ++++   +    L NPL +++    P   S  +  F
Sbjct: 1   MEESEKRRERLRAMRHEAAAQSVNSDNNEAPAMPCYLSNPLVETSAAAPPPEQSHGTSRF 60

Query: 315 DYYTDPLAAFSNKKKSRQENSSPRPTKSTQQNPQTKSGFSNLHNNITGFXXXXXXXXXWH 494
           D+YTDP+AAFS  K+    N++  P  S    P   SG   L +               H
Sbjct: 61  DFYTDPMAAFSANKR---RNNTSDPISSHHVTPPANSGSPMLRSPSPFSGPRYAGMSPAH 117

Query: 495 QSTKHGSFSPNP----PQ-FGGRPSSP-----VPRFFNSPDGPRSQPFYGNSMQNGQQFY 644
           Q     ++SPNP    PQ FG  P S      + R FN   G    P  G     G   +
Sbjct: 118 QF--QSNYSPNPRMYQPQGFGHDPISQSGELGMSRPFNMHQG-NMDPSIGPGSAAGYYNF 174

Query: 645 GNNNMMGQGRGSPRSRPQFGFPNLGHG 725
            +N   G    SPR  P   F N G G
Sbjct: 175 PSNQPRGSRFPSPRIGPTGSFFNAGQG 201


>ref|XP_004148098.1| PREDICTED: uncharacterized protein LOC101221481 [Cucumis sativus]
           gi|449528802|ref|XP_004171392.1| PREDICTED:
           uncharacterized protein LOC101231125 [Cucumis sativus]
          Length = 307

 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 70/226 (30%), Positives = 90/226 (39%), Gaps = 32/226 (14%)
 Frame = +3

Query: 159 MDETLKRKERLQALKNQAETP--TNTTEDSNQESLMNPLDDSNPAPSSQFSPS----FDY 320
           M+E+ KR+ERL+A++ +A      N  E S    L NPL +S+     Q +P     FDY
Sbjct: 1   MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMVGQLAPCTAPRFDY 60

Query: 321 YTDPLAAFSNKKKSRQENSSP---------RPTKSTQQNPQTKSGFSNLHNNITGFXXXX 473
           YT+P+AAFS  KK  +  + P           T ST   P T  G S    +        
Sbjct: 61  YTNPMAAFSTSKKKGKIENQPVSDNFVPYHHNTSSTTYFPPTFPGDSEAGGH---GRPGM 117

Query: 474 XXXXXWHQSTKHGSFSPNPPQFGGRPSSPVPRFFNSP---DGPRSQPFYGNSMQNGQQFY 644
                 +Q   H    P  P     P+ P PR  NSP    GPR  P Y N  QN   + 
Sbjct: 118 PRPYAVNQGDLHMWRGPRGPFVNQFPTQP-PREMNSPSHVSGPRGNP-YTNPTQNRANYR 175

Query: 645 ------GNNNMMGQGRGS--------PRSRPQFGFPNLGHGSPASS 740
                 G       GRGS        P  R  +G     HG  +SS
Sbjct: 176 SSSPNPGFRGSFSPGRGSYGHHGNMTPSPRFGYGRATGSHGRHSSS 221


>ref|XP_006338048.1| PREDICTED: uncharacterized protein LOC102603652 [Solanum tuberosum]
          Length = 286

 Score = 58.2 bits (139), Expect = 6e-06
 Identities = 37/93 (39%), Positives = 51/93 (54%), Gaps = 16/93 (17%)
 Frame = +3

Query: 159 MDETLKRKERLQALKNQAETPTNTTEDSNQ------ESLMNPLDDSNPAPSSQFSPS--F 314
           M+E+ KRKERL+A++ +A    +  E+ N         L NPL ++  A S +  P   F
Sbjct: 1   MEESEKRKERLKAIRKEAAEAGDNNEEQNSIGGPLDHGLTNPLIETPSAASGKDEPRPRF 60

Query: 315 DYYTDPLAAFSNKKK--------SRQENSSPRP 389
           DYYTDP+AAFS   K        S+  N+SPRP
Sbjct: 61  DYYTDPMAAFSANNKMNNLSPQVSQPCNTSPRP 93


>ref|XP_002301125.1| predicted protein [Populus trichocarpa]
          Length = 347

 Score = 57.8 bits (138), Expect = 8e-06
 Identities = 64/230 (27%), Positives = 93/230 (40%), Gaps = 20/230 (8%)
 Frame = +3

Query: 159 MDETLKRKERLQALKNQAETPTNTTEDSNQES-----LMNPLDDSNPAPSSQF-----SP 308
           M++  KR ERL+A++  A     T  D+ + S     L NPL + NPA          +P
Sbjct: 1   MEDAEKRSERLKAMRAVASAQAETCNDNVETSAVPGLLANPLLE-NPATQPALEELSATP 59

Query: 309 SFDYYTDPLAAFSNKKK----SRQENSSPRPTKSTQQNPQTKSGFSNLHN-NITGFXXXX 473
            FD+YTDP AAFS+ +K    + Q     RP  +    PQ  S      N  +T      
Sbjct: 60  RFDFYTDPSAAFSSDRKRTATANQVARGFRPPNNISSMPQFSSPHPGQRNPEVTPSSAYQ 119

Query: 474 XXXXXWHQSTKHGSFSPNPPQFGGRPSSPVPRFFNSPDGPRSQPFYGN----SMQNGQQF 641
                   +    ++SPN   + G+       F+ +P    ++PF  N     M NG   
Sbjct: 120 MQNNYSPANQMQSNYSPNQRMYPGQGPYHNAAFYRTPSN-FARPFTMNQGTPEMWNGPGG 178

Query: 642 YGNNNMMGQGRGSPRSRPQFGFPNLGHGSPASSP-PIHRFCQNQQEVARG 788
             +N+     RG  R  P     N G G   SSP P+  +  +     RG
Sbjct: 179 PASNHSSTPYRGISRPYP-IHQGNPGFGPVGSSPSPVSGYGGSPASSGRG 227

